Fan, Haoqi

23 publications

CVPR 2025 LLaVA-Critic: Learning to Evaluate Multimodal Models Tianyi Xiong, Xiyao Wang, Dong Guo, Qinghao Ye, Haoqi Fan, Quanquan Gu, Heng Huang, Chunyuan Li
ICLR 2025 Painting with Words: Elevating Detailed Image Captioning with Benchmark and Alignment Learning Qinghao Ye, Xianhan Zeng, Fu Li, Chunyuan Li, Haoqi Fan
NeurIPS 2024 Classification Done Right for Vision-Language Pre-Training Zilong Huang, Qinghao Ye, Bingyi Kang, Jiashi Feng, Haoqi Fan
CVPR 2023 Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention at Vision Transformer Inference Haoran You, Yunyang Xiong, Xiaoliang Dai, Bichen Wu, Peizhao Zhang, Haoqi Fan, Peter Vajda, Yingyan Lin
ICCV 2023 Diffusion Models as Masked Autoencoders Chen Wei, Karttikeya Mangalam, Po-Yao Huang, Yanghao Li, Haoqi Fan, Hu Xu, Huiyu Wang, Cihang Xie, Alan Yuille, Christoph Feichtenhofer
ICML 2023 Hiera: A Hierarchical Vision Transformer Without the Bells-and-Whistles Chaitanya Ryali, Yuan-Ting Hu, Daniel Bolya, Chen Wei, Haoqi Fan, Po-Yao Huang, Vaibhav Aggarwal, Arkabandhu Chowdhury, Omid Poursaeed, Judy Hoffman, Jitendra Malik, Yanghao Li, Christoph Feichtenhofer
NeurIPS 2023 MAViL: Masked Audio-Video Learners Po-Yao Huang, Vasu Sharma, Hu Xu, Chaitanya Ryali, Haoqi Fan, Yanghao Li, Shang-Wen Li, Gargi Ghosh, Jitendra Malik, Christoph Feichtenhofer
CVPR 2023 Scaling Language-Image Pre-Training via Masking Yanghao Li, Haoqi Fan, Ronghang Hu, Christoph Feichtenhofer, Kaiming He
ICCV 2023 The Effectiveness of MAE Pre-Pretraining for Billion-Scale Pretraining Mannat Singh, Quentin Duval, Kalyan Vasudev Alwala, Haoqi Fan, Vaibhav Aggarwal, Aaron Adcock, Armand Joulin, Piotr Dollar, Christoph Feichtenhofer, Ross Girshick, Rohit Girdhar, Ishan Misra
CVPR 2022 MViTv2: Improved Multiscale Vision Transformers for Classification and Detection Yanghao Li, Chao-Yuan Wu, Haoqi Fan, Karttikeya Mangalam, Bo Xiong, Jitendra Malik, Christoph Feichtenhofer
NeurIPS 2022 Masked Autoencoders as Spatiotemporal Learners Christoph Feichtenhofer, Haoqi Fan, Yanghao Li, Kaiming He
CVPR 2022 Masked Feature Prediction for Self-Supervised Visual Pre-Training Chen Wei, Haoqi Fan, Saining Xie, Chao-Yuan Wu, Alan Yuille, Christoph Feichtenhofer
CVPR 2022 MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition Chao-Yuan Wu, Yanghao Li, Karttikeya Mangalam, Haoqi Fan, Bo Xiong, Jitendra Malik, Christoph Feichtenhofer
CVPR 2022 On the Importance of Asymmetry for Siamese Representation Learning Xiao Wang, Haoqi Fan, Yuandong Tian, Daisuke Kihara, Xinlei Chen
CVPR 2022 Reversible Vision Transformers Karttikeya Mangalam, Haoqi Fan, Yanghao Li, Chao-Yuan Wu, Bo Xiong, Christoph Feichtenhofer, Jitendra Malik
CVPR 2022 Unified Transformer Tracker for Object Tracking Fan Ma, Mike Zheng Shou, Linchao Zhu, Haoqi Fan, Yilei Xu, Yi Yang, Zhicheng Yan
CVPR 2021 A Large-Scale Study on Unsupervised Spatiotemporal Representation Learning Christoph Feichtenhofer, Haoqi Fan, Bo Xiong, Ross Girshick, Kaiming He
CVPR 2021 Beyond Short Clips: End-to-End Video-Level Learning with Collaborative Memories Xitong Yang, Haoqi Fan, Lorenzo Torresani, Larry S. Davis, Heng Wang
ICCV 2021 HiT: Hierarchical Transformer with Momentum Contrast for Video-Text Retrieval Song Liu, Haoqi Fan, Shengsheng Qian, Yiru Chen, Wenkui Ding, Zhongyuan Wang
ICCV 2021 Multiscale Vision Transformers Haoqi Fan, Bo Xiong, Karttikeya Mangalam, Yanghao Li, Zhicheng Yan, Jitendra Malik, Christoph Feichtenhofer
ICCV 2021 Multiview Pseudo-Labeling for Semi-Supervised Learning from Video Bo Xiong, Haoqi Fan, Kristen Grauman, Christoph Feichtenhofer
AAAI 2018 Efficient K-Shot Learning with Regularized Deep Networks Donghyun Yoo, Haoqi Fan, Vishnu Naresh Boddeti, Kris M. Kitani
CVPR 2016 Going Deeper into First-Person Activity Recognition Minghuang Ma, Haoqi Fan, Kris M. Kitani