zjr2000

Follow

😢

Focusing

Jinrui Zhang zjr2000

😢

Focusing

Follow

Master student at SUSTech, ShenZhen, China. My research focuses on Computer Vision, specifically exploring the intersection of vision and language learning.

63 followers · 42 following

Southern University of Science and Technology
Shen Zhen

Achievements

Achievements

Highlights

Pro

zjr2000/README.md

Hi there 👋

This is Jinrui,

🏫 I'm a graduate student from SUSTech with a bachelor's degree in Computer Science.

🔭 I’m currently working on toward M.S. degree with SUSTech.

🌱 I’m currently focus on vision-language research.

Pinned Loading

ttengwang/Caption-Anything ttengwang/Caption-Anything Public

Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/sp…

Python 1.7k 103
REVERIE REVERIE Public

[ECCV2024] Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models

Python 15
GVL GVL Public

Official implementation for paper Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed Videos

Python 27 6
LLMVA-GEBC LLMVA-GEBC Public

Winner solution to Generic Event Boundary Captioning task in LOVEU Challenge (CVPR 2023 workshop)

Python 29 2
Context-GEBC Context-GEBC Public

Second-place solution to Generic Event Boundary Captioning task in LOVEU Challenge (CVPR 2022 workshop)

Python 4 1
Awesome-Multimodal-Chatbot Awesome-Multimodal-Chatbot Public

Awesome Multimodal Assistant is a curated list of multimodal chatbots/conversational assistants that utilize various modes of interaction, such as text, speech, images, and videos, to provide a sea…

71 6