    Repositories list

    • LLaVA

      Public
      Visual Instruction Tuning: Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.
      Python
      Apache License 2.0
      2.2k101 Updated Jul 17, 2024
    • Open-Sora

      Public
      Open-Sora: Democratizing Efficient Video Production for All
      Python
      Apache License 2.0
      2.2k400 Updated Apr 8, 2024
    • Cog demo for GPT-2 finetuned on World of Warcraft quests
      Python
      0000 Updated Apr 8, 2024
    • Generates a texture for 3D models created by the TripoSR image-to-3D model
      Python
      MIT License
      13000 Updated Mar 19, 2024
    • texify

      Public
      Math OCR model that outputs LaTeX and markdown
      Python
      GNU General Public License v3.0
      67000 Updated Mar 8, 2024
    • Cog wrapper for collabora/WhisperSpeech
      Python
      4000 Updated Mar 5, 2024
    • [ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
      Python
      MIT License
      87000 Updated Mar 5, 2024
    • PhotoMaker model by Tencent as a Cog image
      Jupyter Notebook
      Other
      7674501 Updated Feb 23, 2024
    • Cog image for DreamCraft3D, an image-to-3D-mesh model
      Python
      Apache License 2.0
      0200 Updated Feb 21, 2024
    • Official implementation of "LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching"
      Python
      MIT License
      34000 Updated Dec 22, 2023
    • WavJourney: Compositional Audio Creation with LLMs
      Python
      Other
      46100 Updated Dec 8, 2023
    • Cog wrapper for SDXL img blend using compel
      Python
      3000 Updated Dec 7, 2023
    • Pipeline for fast video editing
      Python
      Apache License 2.0
      1900 Updated Dec 4, 2023
    • EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
      Python
      Apache License 2.0
      632600 Updated Dec 1, 2023
    • Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.
      Python
      Apache License 2.0
      123800 Updated Nov 6, 2023
    • LaTeX-OCR

      Public
      pix2tex: Using a ViT to convert images of equations into LaTeX code.
      Python
      MIT License
      1k500 Updated Nov 6, 2023
    • Cog pipeline for XMem and ProPainter
      Python
      MIT License
      1300 Updated Oct 25, 2023
    • XMem

      Public
      [ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model
      Python
      GNU General Public License v3.0
      193200 Updated Oct 19, 2023
    • Python
      1200 Updated Oct 17, 2023
    • [ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
      Python
      Other
      667100 Updated Oct 11, 2023