- MemVLT: Xiaokun Feng, Xuchen Li, Shiyu Hu, Dailing Zhang, Meiqi Wu, Jing Zhang, Xiaotang Chen, Kaiqi Huang
"MemVLT: Visual-Language Tracking with Adaptive Memory-based Prompts" NeurIPS 2024
-
OneTracker: Lingyi Hong, Shilin Yan, Renrui Zhang, Wanyun Li, Xinyu Zhou, Pinxue Guo, Kaixun Jiang, Yiting Cheng, Jinglun Li, Zhaoyu Chen, Wenqiang Zhang
"OneTracker: Unifying Visual Object Tracking with Foundation Models and Efficient Tuning" CVPR 2024
[paper] -
QueryNLT: Yanyan Shao, Shuting He, Qi Ye, Yuchao Feng, Wenhan Luo, Jiming Chen
"Context-Aware Integration of Language and Visual References for Natural Language Tracking" CVPR 2024
[paper]
[code]
- Elysium: Han Wang, Yanjie Wang, Yongjie Ye, Yuxiang Nie, Can Huang
"Elysium: Exploring Object-level Perception in Videos via MLLM" ECCV 2024
[paper]
[code]
- UVLTrack: Yinchao Ma, Yuyang Tang, Wenfei Yang, Tianzhu Zhang, Jinpeng Zhang, Mengxue Kang
"Unifying Visual and Vision-Language Tracking via Contrastive Learning" AAAI 2024
[paper]
[code]
- ATTrack: Jiawei Ge, Jiuxin Cao, Xuelin Zhu, Xinyu Zhang, Chang Liu, Kun Wang, Bo Liu
"Consistencies are All You Need for Semi-supervised Vision-Language Tracking" ACM MM 2024
[paper]
- DMTrack: Guangtong Zhang, Bineng Zhong, Qihua Liang, Zhiyi Mo, Shuxiang Song
"Diffusion Mask-Driven Visual-language Tracking" IJCAI 2024
[paper]
- OSDT: Guangtong Zhang, Bineng Zhong, Qihua Liang, Zhiyi Mo, Ning Li, Shuxiang Song
"One-Stream Stepwise Decreasing for Vision-Language Tracking" TCSVT 2024
[paper]
- TTCTrack: Zhongjie Mao, Yucheng Wang, Xi Chen, Jia Yan
"Textual Tokens Classification for Multi-Modal Alignment in Vision-Language Tracking" ICASSP 2024
[paper]
- DTLLM-VLT: Xuchen Li, Xiaokun Feng, Shiyu Hu, Meiqi Wu, Dailing Zhang, Jing Zhang, Kaiqi Huang
"DTLLM-VLT: Diverse Text Generation for Visual Language Tracking Based on LLM" CVPRW 2024
[paper]
-
SATracker: Jiawei Ge, Xiangmei Chen, Jiuxin Cao, Xuelin Zhu, Weijia Liu, Bo Liu
"Beyond Visual Cues: Synchronously Exploring Target-Centric Semantics for Vision-Language Tracking" ArXiv 2024
[paper] -
VLT-MI: Xuchen Li, Shiyu Hu, Xiaokun Feng, Dailing Zhang, Meiqi Wu, Jing Zhang, Kaiqi Huang
"Visual Language Tracking with Multi-modal Interaction: A Robust Benchmark" ArXiv 2024
[paper]
- JointNLT: Li Zhou, Zikun Zhou, Kaige Mao, Zhenyu He
"Joint Visual Grounding and Tracking with Natural Language Specification" CVPR 2023
[paper]
[code]
- DecoupleTNL: Ding Ma, Xiangqian Wu
"Tracking by Natural Language Specification with Long Short-term Context Decoupling" ICCV 2023
[paper]
- MGIT: Shiyu Hu, Dailin Zhang, Meiqi Wu, Xiaokun Feng, Xuchen Li, Xin Zhao, Kaiqi Huang
"A Multi-modal Global Instance Tracking Benchmark (MGIT): Better Locating Target in Complex Spatio-temporal and causal Relationship" NeurIPS 2023
[paper]
[platform]
- All in One: Chunhui Zhang, Xin Sun, Li Liu, Yiqian Yang, Qiong Liu, Xi Zhou, Yanfeng Wang
"All in One: Exploring Unified Vision-Language Tracking with Multi-Modal Alignment" ACM MM 2023
[paper]
[code]
- OVLM: Huanlong Zhang, Jingchao Wang, Jianwei Zhang, Tianzhu Zhang, Bineng Zhong
"One-stream Vision-Language Memory Network for Object Tracking" TMM 2023
[paper]
-
MMTrack: Yaozong Zheng, Bineng Zhong, Qihua Liang, Guorong Li, Rongrong Ji, Xianxian Li
"Towards Unified Token Learning for Vision-Language Tracking" TCSVT 2023
[paper]
[code] -
TransNLT: Rong Wang, Zongheng Tang, Qianli Zhou, Xiaoqian Liu, Tianrui Hui, Quange Tan, Si Liu
"Unified Transformer With Isomorphic Branches for Natural Language Tracking" TCSVT 2023
[paper]
- TransVLT: Haojie Zhao, Xiao Wang, Dong Wang, Huchuan Lu, Xiang Ruan
"Transformer vision-language tracking via proxy token guided cross-modal fusion" PRL 2023
[paper]
- VLT_OST: Mingzhe Guo, Zhipeng Zhang, Liping Jing , Haibin Ling, Heng Fan
"Divert More Attention to Vision-Language Object Tracking" ArXiv 2023
[paper]
[code]
- VLT_TT: Mingzhe Guo, Zhipeng Zhang, Heng Fan, Liping Jing
"Divert More Attention to Vision-Language Tracking" NeurIPS 2022
[paper]
[code]
- CTRTNL: Yihao L, Jun Yu, Zhongpeng Cai, Yuwen Pan
"Cross-Modal Target Retrieval for Tracking by Natural Language" CVPRW 2022
[paper]
-
SNLT: Qi Feng, Vitaly Ablavsky, Qinxun Bai, Stan Sclaroff
"Siamese Natural Language Tracker: Tracking by Natural Language Descriptions With Siamese Trackers" CVPR 2021
[paper]
[code] -
TNL2K: Xiao Wang, Xiujun Shu, Zhipeng Zhang, Bo Jiang, Yaowei Wang, Yonghong Tian, Feng Wu
"Towards More Flexible and Accurate Object Tracking with Natural Language: Algorithms and Benchmark" CVPR 2021
[paper]
[platform]
- CapsuleTNL: Ding Ma, Xiangqian Wu
"Capsule-based Object Tracking with Natural Language Specification" ACM MM 2021
[paper]
- GTI: Zhengyuan Yang, Tushar Kumar, Tianlang Chen, Jingsong Su, Jiebo Luo
"Grounding-Tracking-Integration" TCSVT 2021
[paper]
- RTTNLD: Qi Feng, Vitaly Ablavsky, Qinxun Bai, Guorong Li, Stan Sclaroff
"Real-time Visual Object Tracking with Natural Language Description" WACV 2020
[paper]
- NLRPN: Qi Feng, Vitaly Ablavsky, Qinxun Bai, Stan Sclaroff
"Robust Visual Object Tracking with Natural Language Region Proposal Network" ArXiv 2019
[paper]
- DAT: Xiao Wang, Chenglong Li, Rui Yang, Tianzhu Zhang, Jin Tang, Bin Luo
"Describe and Attend to Track: Learning Natural Language guided Structural Representation and Visual Attention for Object Tracking" ArXiv 2018
[paper]