open-vocabulary

(CVPR 2023) PLA: Language-Driven Open-Vocabulary 3D Scene Understanding & (CVPR 2024) RegionPLC: Regional Point-Language Contrastive Learning for Open-World 3D Scene Understanding

deep-learning open-world 3d-scene-understanding open-vocabulary cvpr2023

Updated Jun 28, 2024
Python

hovsg / HOV-SG

[RSS2024] Official implementation of "Hierarchical Open-Vocabulary 3D Scene Graphs for Language-Grounded Robot Navigation"

natural-language-understanding robot-navigation 3d-scene-graph robot-planning open-vocabulary

Updated Aug 29, 2024
Python

clin1223 / VLDet

[ICLR 2023] PyTorch implementation of VLDet (https://arxiv.org/abs/2211.14843)

pytorch object-detection multi-modal vision-and-language open-vocabulary iclr2023

Updated Mar 22, 2024
Python

wusize / ovdet

[CVPR2023] Code Release of Aligning Bag of Regions for Open-Vocabulary Object Detection

object-detection open-vocabulary cvpr2023

Updated Oct 25, 2023
Python

wusize / CLIPSelf

[ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction

detection open-vocabulary vision-language-model

Updated Feb 5, 2024
Python

FoundationVision / GenerateU

[CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection

open-world object-detection multimodality open-vocabulary mllm open-vocabulary-detection

Updated Mar 25, 2024
Python

sunanhe / MKT

Official implementation of "Open-Vocabulary Multi-Label Classification via Multi-Modal Knowledge Transfer".

pytorch transfer-learning multi-label-classification open-vocabulary

Updated Nov 7, 2024
Python

Surrey-UP-Lab / RegionSpot

Recognize Any Regions

open-world object-detection zero-shot instance-segmentation auto-labeling vision-language-pretraining open-vocabulary vision-language-model multimodal-representation-learning vision-foundation-model vision-language-foundation-model

Updated Sep 27, 2024
Python

Jiahao000 / MosaicFusion

[IJCV 2024] MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation

pytorch object-detection instance-segmentation diffusion-models long-tailed open-vocabulary

Updated Oct 8, 2024
Python

CVMI-Lab / CoDet

(NeurIPS2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection

object-detection open-vocabulary open-vocabulary-detection

Updated Apr 26, 2024
Python

zhang-tao-whu / DVIS_Plus

video-segmentation video-instance-segmentation video-semantic-segmentation open-vocabulary

Updated Jul 4, 2024
Python

ArrowLuo / SegCLIP

PyTorch implementation of ICML 2023 paper "SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation"

transfer-learning semantic-segmentation contrastive-learning zero-shot-semantic-segmentation vision-language-pretraining open-vocabulary open-vocabulary-semantic-segmentation

Updated Jun 28, 2023
Python

VinAIResearch / Open3DIS

Open3DIS: Open-vocabulary 3D Instance Segmentation with 2D Mask Guidance (CVPR 2024)

3d-point-clouds 3d-instance-segmentation 3d-scene-understanding open-vocabulary cvpr2024

Updated Nov 12, 2024
Python

Our OpenYOLO3D model achieves state-of-the-art performance in Open Vocabulary 3D Instance Segmentation on the ScanNet200 and Replica datasets, with up to a ∼16x speedup compared to the best existing method in the literature.

point-clouds instance-segmentation 3d-computer-vision open-vocabulary