Tong, Zhan

11 publications

CVPR 2025 Contextual AD Narration with Interleaved Multimodal Sequence Hanlin Wang, Zhan Tong, Kecheng Zheng, Yujun Shen, Limin Wang
CVPR 2024 Bootstrapping SparseFormers from Vision Foundation Models Ziteng Gao, Zhan Tong, Kevin Qinghong Lin, Joya Chen, Mike Zheng Shou
ICLR 2024 SparseFormer: Sparse Visual Recognition via Limited Latent Tokens Ziteng Gao, Zhan Tong, Limin Wang, Mike Zheng Shou
ICCV 2023 Efficient Video Action Detection with Token Dropout and Context Refinement Lei Chen, Zhan Tong, Yibing Song, Gangshan Wu, Limin Wang
ICLR 2023 Soft Neighbors Are Positive Supporters in Contrastive Visual Representation Learning Chongjian Ge, Jiangliu Wang, Zhan Tong, Shoufa Chen, Yibing Song, Ping Luo
CVPR 2023 VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking Limin Wang, Bingkun Huang, Zhiyu Zhao, Zhan Tong, Yinan He, Yi Wang, Yali Wang, Yu Qiao
NeurIPS 2022 AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition Shoufa Chen, Chongjian Ge, Zhan Tong, Jiangliu Wang, Yibing Song, Jue Wang, Ping Luo
ICLR 2022 EViT: Expediting Vision Transformers via Token Reorganizations Youwei Liang, Chongjian Ge, Zhan Tong, Yibing Song, Jue Wang, Pengtao Xie
NeurIPS 2022 VideoMAE: Masked Autoencoders Are Data-Efficient Learners for Self-Supervised Video Pre-Training Zhan Tong, Yibing Song, Jue Wang, Limin Wang
ICCV 2021 MGSampler: An Explainable Sampling Strategy for Video Action Recognition Yuan Zhi, Zhan Tong, Limin Wang, Gangshan Wu
CVPR 2021 TDN: Temporal Difference Networks for Efficient Action Recognition Limin Wang, Zhan Tong, Bin Ji, Gangshan Wu