vision-language-dataset

Here are 4 public repositories matching this topic...

Q-Future / Q-Bench

①[ICLR2024 Spotlight] (GPT-4V/Gemini-Pro/Qwen-VL-Plus+16 OS MLLMs) A benchmark for multi-modality LLMs (MLLMs) on low-level vision and visual quality assessment.

quality-assessment iclr image-quality-assessment low-level-vision gpt-4 large-language-models vision-language-dataset visual-large-language-models

Updated Aug 12, 2024
Jupyter Notebook

oakink / OakInk2

Star

🌴[CVPR 2024] OakInk2: A Dataset of Bimanual Hands-Object Manipulation in Complex Task Completion

computer-vision deep-learning pytorch dataset object-manipulation motion-generation digital-human dexterous-manipulation vision-language-dataset long-horizon-robotic-manipulation bimanual-manipulation

Updated Dec 6, 2024
Python

SHTUPLUS / GITM-MR

Star

The official implementation for the ICCV 2023 paper "Grounded Image Text Matching with Mismatched Relation Reasoning".

vision-and-language vision-and-language-pre-training vision-language-dataset vision-language-model vision-language-learning

Updated Dec 8, 2023
Python

unitaryai / VTC-dataset

Star

dataset video-understanding video-text-retrieval vision-language-pretraining vision-language-dataset

Updated May 1, 2024
Python

Improve this page

Add a description, image, and links to the vision-language-dataset topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the vision-language-dataset topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

vision-language-dataset

Here are 4 public repositories matching this topic...

Q-Future / Q-Bench

oakink / OakInk2

SHTUPLUS / GITM-MR

unitaryai / VTC-dataset

Improve this page

Add this topic to your repo