Grounding Language Models for Compositional and Spatial Reasoning
nlp computer-vision deep-learning dataset image-captioning image-retrieval spatial-reasoning multimodal vision-and-language grounding vsr caption-retrieval winoground visual-spatial-reasoning
-
Updated
Oct 26, 2022 - Jupyter Notebook