    Repositories list

    • BrushEdit

      Public
      The official implementation of paper "BrushEdit: All-In-One Image Inpainting and Editing"
      Python
      Other
      1840750 Updated Dec 26, 2024
    • DiTCtrl

      Public
      Official code of "DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation"
      Python
      Other
      08200 Updated Dec 25, 2024
    • FreeSplatter: Pose-free Gaussian Splatting for Sparse-view 3D Reconstruction
      JavaScript
      Other
      49600 Updated Dec 23, 2024
    • ColorFlow

      Public
      The official implementation of paper "ColorFlow: Retrieval-Augmented Image Sequence Colorization"
      Python
      Other
      2227160 Updated Dec 23, 2024
    • DI-PCG

      Public
      Code release of our paper "DI-PCG: Diffusion-based Efficient Inverse Procedural Content Generation for High-quality 3D Asset Creation".
      Python
      Other
      26910 Updated Dec 20, 2024
    • BrushNet

      Public
      [ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
      Python
      Other
      1281.5k460 Updated Dec 17, 2024
    • Divot

      Public
      Diffusion Powers Video Tokenizer for Comprehension and Generation
      Python
      Other
      13701 Updated Dec 10, 2024
    • Boosting Generative Novel View Synthesis with Sparse and Unposed Images
      Python
      Other
      56210 Updated Dec 9, 2024
    • Moto

      Public
      Latent Motion Token as the Bridging Language for Robot Manipulation
      Python
      Other
      05810 Updated Dec 8, 2024
    • SEED-Voken: A Series of Powerful Visual Tokenizers
      Python
      Apache License 2.0
      3179540 Updated Dec 4, 2024
    • FluxKits

      Public
      Python
      Apache License 2.0
      26220 Updated Nov 27, 2024
    • InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
      Python
      Apache License 2.0
      3823.5k1053 Updated Nov 11, 2024
    • PhotoMaker [CVPR 2024]
      Jupyter Notebook
      Other
      7719.7k1434 Updated Oct 31, 2024
    • SEED-Story: Multimodal Long Story Generation with Large Language Model
      Python
      Other
      5877040 Updated Oct 11, 2024
    • Official Code for MotionCtrl [SIGGRAPH 2024]
      Python
      Apache License 2.0
      731.4k280 Updated Sep 20, 2024
    • ST-LLM

      Public
      [ECCV 2024🔥] Official implementation of the paper "ST-LLM: Large Language Models Are Effective Temporal Learners"
      Python
      Apache License 2.0
      413390 Updated Sep 10, 2024
    • mllm-npu

      Public
      mllm-npu: training multimodal large language models on Ascend NPUs
      Python
      Apache License 2.0
      28730 Updated Aug 29, 2024
    • MasaCtrl

      Public
      [ICCV 2023] Consistent Image Synthesis and Editing
      Python
      Apache License 2.0
      29750212 Updated Aug 19, 2024
    • Plot2Code

      Public
      Python
      31700 Updated Aug 17, 2024
    • GFPGAN

      Public
      GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
      Python
      Other
      6k36k35224 Updated Jul 26, 2024
    • CustomNet

      Public
      Python
      Apache License 2.0
      1026761 Updated Jul 22, 2024
    • ViT-Lens

      Public
      [CVPR 2024] ViT-Lens: Towards Omni-modal Representations
      Python
      Other
      1016730 Updated Jul 2, 2024
    • T2I-Adapter
      Python
      2123.5k856 Updated Jun 21, 2024
    • SmartEdit

      Public
      Official code of SmartEdit [CVPR-2024 Highlight]
      Python
      8270170 Updated Jun 21, 2024
    • LLaMA-Pro

      Public
      [ACL 2024] Progressive LLaMA with Block Expansion.
      Python
      Apache License 2.0
      36488220 Updated May 20, 2024
    • NeurIPS 2023, Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models
      Python
      Other
      2040791 Updated May 14, 2024
    • BTS

      Public
      BTS: A Bi-lingual Benchmark for Text Segmentation in the Wild
      Other
      02740 Updated Apr 16, 2024
    • UMT

      Public
      UMT is a unified and flexible framework which can handle different input modality combinations, and output video moment retrieval and/or highlight detection results.
      Python
      Other
      1919310 Updated Apr 15, 2024
    • BEBR

      Public
      Official code for "Binary embedding based retrieval at Tencent"
      Python
      Apache License 2.0
      14220 Updated Mar 7, 2024
    • DeSRA

      Public
      Official code for DeSRA (ICML 2023)
      Python
      012950 Updated Feb 2, 2024