This is Jinrui,
🏫 I'm a graduate student from SUSTech with a bachelor's degree in Computer Science.
🔭 I’m currently working on toward M.S. degree with SUSTech.
🌱 I’m currently focus on vision-language research.
This is Jinrui,
🏫 I'm a graduate student from SUSTech with a bachelor's degree in Computer Science.
🔭 I’m currently working on toward M.S. degree with SUSTech.
🌱 I’m currently focus on vision-language research.
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/sp…
Winner solution to Generic Event Boundary Captioning task in LOVEU Challenge (CVPR 2023 workshop)
Second-place solution to Generic Event Boundary Captioning task in LOVEU Challenge (CVPR 2022 workshop)
Awesome Multimodal Assistant is a curated list of multimodal chatbots/conversational assistants that utilize various modes of interaction, such as text, speech, images, and videos, to provide a sea…