Skip to content
@SHI-Labs

SHI Labs

Computer Vision, Machine Learning, and AI Systems & Applications

Pinned Loading

  1. Neighborhood-Attention-Transformer Neighborhood-Attention-Transformer Public

    Neighborhood Attention Transformer, arxiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arxiv 2022

    Python 1.1k 86

  2. Versatile-Diffusion Versatile-Diffusion Public

    Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023

    Python 1.3k 83

  3. OneFormer OneFormer Public

    OneFormer: One Transformer to Rule Universal Image Segmentation, arxiv 2022 / CVPR 2023

    Jupyter Notebook 1.5k 135

  4. Prompt-Free-Diffusion Prompt-Free-Diffusion Public

    Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arxiv 2023 / CVPR 2024

    Python 735 36

  5. Smooth-Diffusion Smooth-Diffusion Public

    Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models arXiv 2023 / CVPR 2024

    Python 323 8

  6. VCoder VCoder Public

    VCoder: Versatile Vision Encoders for Multimodal Large Language Models, arXiv 2023 / CVPR 2024

    Python 266 15

Repositories

Showing 10 of 59 repositories
  • OLA-VLM Public

    OLA-VLM: Elevating Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024

    SHI-Labs/OLA-VLM’s past year of commit activity
    Python 14 1 0 1 Updated Dec 13, 2024
  • NATTEN Public

    Neighborhood Attention Extension. Bringing attention to a neighborhood near you!

    SHI-Labs/NATTEN’s past year of commit activity
    Cuda 383 31 11 3 Updated Dec 2, 2024
  • Compact-Transformers Public

    Escaping the Big Data Paradigm with Compact Transformers, 2021 (Train your Vision Transformers in 30 mins on CIFAR-10 with a single GPU!)

    SHI-Labs/Compact-Transformers’s past year of commit activity
    Python 505 Apache-2.0 80 10 2 Updated Nov 5, 2024
  • Diffusion-Driven-Test-Time-Adaptation-via-Synthetic-Domain-Alignment Public

    Everything to the Synthetic: Diffusion-driven Test-time Adaptation via Synthetic-Domain Alignment

    SHI-Labs/Diffusion-Driven-Test-Time-Adaptation-via-Synthetic-Domain-Alignment’s past year of commit activity
    Python 20 2 0 0 Updated Oct 29, 2024
  • OneFormer Public

    OneFormer: One Transformer to Rule Universal Image Segmentation, arxiv 2022 / CVPR 2023

    SHI-Labs/OneFormer’s past year of commit activity
    Jupyter Notebook 1,526 MIT 135 34 4 Updated Oct 3, 2024
  • Smooth-Diffusion Public

    Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models arXiv 2023 / CVPR 2024

    SHI-Labs/Smooth-Diffusion’s past year of commit activity
    Python 323 MIT 8 10 0 Updated Sep 24, 2024
  • FineStyle Public

    FineStyle: Fine-grained Controllable Style Personalization for Text-to-image Models

    SHI-Labs/FineStyle’s past year of commit activity
    2 MIT 0 0 0 Updated Sep 4, 2024
  • Agriculture-Vision Public

    [CVPR 2020 & 2021 & 2022 & 2023] Agriculture-Vision Dataset, Prize Challenge and Workshop: A joint effort with many great collaborators to bring Agriculture and Computer Vision/AI communities together to benefit humanity!

    SHI-Labs/Agriculture-Vision’s past year of commit activity
    206 33 2 1 Updated Jul 27, 2024
  • CuMo Public

    CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts

    SHI-Labs/CuMo’s past year of commit activity
    Python 137 Apache-2.0 10 0 0 Updated Jun 8, 2024
  • StyleNAT Public

    New flexible and efficient image generation framework that sets new SOTA on FFHQ-256 with FID 2.05, 2022

    SHI-Labs/StyleNAT’s past year of commit activity
    Python 100 MIT 12 0 0 Updated Jun 4, 2024