    Repositories list

    • FreeSplatter: Pose-free Gaussian Splatting for Sparse-view 3D Reconstruction
      13800 Updated Dec 13, 2024
    • Divot

      Public
      Diffusion Powers Video Tokenizer for Comprehension and Generation
      Python
      Other
      13101 Updated Dec 10, 2024
    • Boosting Generative Novel View Synthesis with Sparse and Unposed Images
      Python
      Other
      14410 Updated Dec 9, 2024
    • Moto

      Public
      Latent Motion Token as the Bridging Language for Robot Manipulation
      Python
      Other
      04700 Updated Dec 8, 2024
    • SEED-Voken: A Series of Powerful Visual Tokenizers
      Python
      Apache License 2.0
      3077240 Updated Dec 4, 2024
    • FluxKits

      Public
      Python
      Apache License 2.0
      25620 Updated Nov 27, 2024
    • InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
      Python
      Apache License 2.0
      3753.5k1052 Updated Nov 11, 2024
    • PhotoMaker [CVPR 2024]
      Jupyter Notebook
      Other
      7699.6k1434 Updated Oct 31, 2024
    • SEED-Story: Multimodal Long Story Generation with Large Language Model
      Python
      Other
      5876340 Updated Oct 11, 2024
    • Official Code for MotionCtrl [SIGGRAPH 2024]
      Python
      Apache License 2.0
      731.3k280 Updated Sep 20, 2024
    • ST-LLM

      Public
      [ECCV 2024🔥] Official implementation of the paper "ST-LLM: Large Language Models Are Effective Temporal Learners"
      Python
      Apache License 2.0
      412790 Updated Sep 10, 2024
    • mllm-npu

      Public
      mllm-npu: training multimodal large language models on Ascend NPUs
      Python
      Apache License 2.0
      28530 Updated Aug 29, 2024
    • MasaCtrl

      Public
      [ICCV 2023] Consistent Image Synthesis and Editing
      Python
      Apache License 2.0
      28747212 Updated Aug 19, 2024
    • Plot2Code

      Public
      Python
      31600 Updated Aug 17, 2024
    • GFPGAN

      Public
      GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
      Python
      Other
      6k36k35124 Updated Jul 26, 2024
    • CustomNet

      Public
      Python
      Apache License 2.0
      1026761 Updated Jul 22, 2024
    • BrushNet

      Public
      [ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
      Python
      Other
      1241.5k450 Updated Jul 17, 2024
    • ViT-Lens

      Public
      [CVPR 2024] ViT-Lens: Towards Omni-modal Representations
      Python
      Other
      1016530 Updated Jul 2, 2024
    • T2I-Adapter
      Python
      2113.5k856 Updated Jun 21, 2024
    • SmartEdit

      Public
      Official code of SmartEdit [CVPR-2024 Highlight]
      Python
      8266160 Updated Jun 21, 2024
    • LLaMA-Pro

      Public
      [ACL 2024] Progressive LLaMA with Block Expansion.
      Python
      Apache License 2.0
      36485220 Updated May 20, 2024
    • NeurIPS 2023, Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models
      Python
      Other
      2040791 Updated May 14, 2024
    • BTS

      Public
      BTS: A Bi-lingual Benchmark for Text Segmentation in the Wild
      Other
      02740 Updated Apr 16, 2024
    • UMT

      Public
      UMT is a unified and flexible framework which can handle different input modality combinations, and output video moment retrieval and/or highlight detection results.
      Python
      Other
      1919210 Updated Apr 15, 2024
    • BEBR

      Public
      Official code for "Binary embedding based retrieval at Tencent"
      Python
      Apache License 2.0
      14220 Updated Mar 7, 2024
    • DeSRA

      Public
      Official codes for DeSRA (ICML 2023)
      Python
      012850 Updated Feb 2, 2024
    • ViSFT

      Public
      Python
      Apache License 2.0
      23310 Updated Jan 20, 2024
    • MM-RealSR

      Public
      Codes for "Metric Learning based Interactive Modulation for Real-World Super-Resolution"
      Python
      BSD 3-Clause "New" or "Revised" License
      12158100 Updated Jan 16, 2024
    • HOSNeRF

      Public
      HOSNeRF: Dynamic Human-Object-Scene Neural Radiance Fields from a Single Video
      Python
      Apache License 2.0
      76631 Updated Dec 12, 2023
    • VTLayout

      Public
      0310 Updated Oct 23, 2023