Chen, Shaoxiang
15 publications
CVPRW
2025
UniToken: Harmonizing Multimodal Understanding and Generation Through Unified Visual Encoding
NeurIPS
2024
ChatTracker: Enhancing Visual Tracking Performance via Chatting with Multimodal Large Language Model
CVPR
2021
Towards Bridging Event Captioner and Sentence Localizer for Weakly Supervised Dense Event Captioning