My research interests lie in deep learning and computer vision, focusing on multimodal machine learning. 🌟Looking for research interns.
-
The University of Hong Kong
- Hong Kong
- ttengwang.com
Highlights
- Pro
Pinned Loading
-
TencentARC/FLM
TencentARC/FLM PublicAccelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)
-
Caption-Anything
Caption-Anything PublicCaption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/sp…
-
dense-video-captioning-pytorch
dense-video-captioning-pytorch PublicSecond-place solution to dense video captioning task in ActivityNet Challenge (CVPR 2020 workshop)
-
Awesome_Prompting_Papers_in_Computer_Vision
Awesome_Prompting_Papers_in_Computer_Vision PublicA curated list of prompt-based paper in computer vision and vision-language learning.
-
Awesome_Long_Form_Video_Understanding
Awesome_Long_Form_Video_Understanding PublicAwesome papers & datasets specifically focused on long-term videos.
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.