ML Anthology
Tong, Zhan
11 publications
CVPR
2025
Contextual AD Narration with Interleaved Multimodal Sequence
Hanlin Wang, Zhan Tong, Kecheng Zheng, Yujun Shen, Limin Wang
CVPR
2024
Bootstrapping SparseFormers from Vision Foundation Models
Ziteng Gao, Zhan Tong, Kevin Qinghong Lin, Joya Chen, Mike Zheng Shou
ICLR
2024
SparseFormer: Sparse Visual Recognition via Limited Latent Tokens
Ziteng Gao, Zhan Tong, Limin Wang, Mike Zheng Shou
ICCV
2023
Efficient Video Action Detection with Token Dropout and Context Refinement
Lei Chen, Zhan Tong, Yibing Song, Gangshan Wu, Limin Wang
ICLR
2023
Soft Neighbors Are Positive Supporters in Contrastive Visual Representation Learning
Chongjian Ge, Jiangliu Wang, Zhan Tong, Shoufa Chen, Yibing Song, Ping Luo
CVPR
2023
VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
Limin Wang, Bingkun Huang, Zhiyu Zhao, Zhan Tong, Yinan He, Yi Wang, Yali Wang, Yu Qiao
NeurIPS
2022
AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition
Shoufa Chen, Chongjian Ge, Zhan Tong, Jiangliu Wang, Yibing Song, Jue Wang, Ping Luo
ICLR
2022
EViT: Expediting Vision Transformers via Token Reorganizations
Youwei Liang, Chongjian Ge, Zhan Tong, Yibing Song, Jue Wang, Pengtao Xie
NeurIPS
2022
VideoMAE: Masked Autoencoders Are Data-Efficient Learners for Self-Supervised Video Pre-Training
Zhan Tong, Yibing Song, Jue Wang, Limin Wang
ICCV
2021
MGSampler: An Explainable Sampling Strategy for Video Action Recognition
Yuan Zhi, Zhan Tong, Limin Wang, Gangshan Wu
CVPR
2021
TDN: Temporal Difference Networks for Efficient Action Recognition
Limin Wang, Zhan Tong, Bin Ji, Gangshan Wu