Qian, Rui

28 publications

NeurIPS 2025. CAR-Flow: Condition-Aware Reparameterization Aligns Source and Target for Better Flow Matching. Chen Chen, Pengsheng Guo, Liangchen Song, Jiasen Lu, Rui Qian, Tsu-Jui Fu, Xinze Wang, Wei Liu, Yinfei Yang, Alex Schwing
CVPR 2025. Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction. Rui Qian, Shuangrui Ding, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Dahua Lin, Jiaqi Wang
CVPR 2025. OVO-Bench: How Far Is Your Video-LLMs from Real-World Online Video Understanding? Junbo Niu, Yifei Li, Ziyang Miao, Chunjiang Ge, Yuanhang Zhou, Qihao He, Xiaoyi Dong, Haodong Duan, Shuangrui Ding, Rui Qian, Pan Zhang, Yuhang Zang, Yuhang Cao, Conghui He, Jiaqi Wang
CVPR 2025. Reasoning to Attend: Try to Understand How <SEG> Token Works. Rui Qian, Xin Yin, Dejing Dou
ICCV 2025. SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree. Shuangrui Ding, Rui Qian, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Yuwei Guo, Dahua Lin, Jiaqi Wang
ECCV 2024. Betrayed by Attention: A Simple yet Effective Approach for Self-Supervised Video Object Segmentation. Shuangrui Ding, Rui Qian, Haohang Xu, Dahua Lin, Hongkai Xiong
TMLR 2024. Greedy Growing Enables High-Resolution Pixel-Based Diffusion Models. Cristina Nader Vasconcelos, Abdullah Rashwan, Austin Waters, Trevor Walker, Keyang Xu, Jimmy Yan, Rui Qian, Yeqing Li, Shixin Luo, Yasumasa Onoe, Zarana Parekh, Ivana Kajic, Mandy Guo, Wenlei Zhou, Sarah Rosston, Roopal Garg, Hongliang Fei, Jordi Pont-Tuset, Su Wang, Henna Nandwani, Andrew Bunner, Kevin Swersky, David J. Fleet, Oliver Wang, Jason Michael Baldridge
ECCV 2024. Rethinking Image-to-Video Adaptation: An Object-Centric Perspective. Rui Qian, Shuangrui Ding, Dahua Lin
NeurIPS 2024. Streaming Long Video Understanding with Large Language Models. Rui Qian, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Shuangrui Ding, Dahua Lin, Jiaqi Wang
ICML 2024. VideoPrism: A Foundational Visual Encoder for Video Understanding. Long Zhao, Nitesh Bharadwaj Gundavarapu, Liangzhe Yuan, Hao Zhou, Shen Yan, Jennifer J. Sun, Luke Friedman, Rui Qian, Tobias Weyand, Yue Zhao, Rachel Hornung, Florian Schroff, Ming-Hsuan Yang, David A Ross, Huisheng Wang, Hartwig Adam, Mikhail Sirotenko, Ting Liu, Boqing Gong
ICCV 2023. Prune Spatio-Temporal Tokens by Semantic-Aware Temporal Accumulation. Shuangrui Ding, Peisen Zhao, Xiaopeng Zhang, Rui Qian, Hongkai Xiong, Qi Tian
ICCV 2023. Semantics Meets Temporal Correspondence: Self-Supervised Object-Centric Learning in Videos. Rui Qian, Shuangrui Ding, Xian Liu, Dahua Lin
CVPR 2023. Taming Diffusion Models for Audio-Driven Co-Speech Gesture Generation. Lingting Zhu, Xian Liu, Xuanyu Liu, Rui Qian, Ziwei Liu, Lequan Yu
CVPR 2022. Contextualized Spatio-Temporal Contrastive Learning with Self-Supervision. Liangzhe Yuan, Rui Qian, Yin Cui, Boqing Gong, Florian Schroff, Ming-Hsuan Yang, Hartwig Adam, Ting Liu
ECCV 2022. Exploring Fine-Grained Audiovisual Categorization with the SSW60 Dataset. Grant Van Horn, Rui Qian, Kimberly Wilber, Hartwig Adam, Oisin Mac Aodha, Serge Belongie
CVPR 2022. Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation. Xian Liu, Qianyi Wu, Hang Zhou, Yinghao Xu, Rui Qian, Xinyi Lin, Xiaowei Zhou, Wayne Wu, Bo Dai, Bolei Zhou
CVPR 2022. Motion-Aware Contrastive Video Representation Learning via Foreground-Background Merging. Shuangrui Ding, Maomao Li, Tianyu Yang, Rui Qian, Haohang Xu, Qingyi Chen, Jue Wang, Hongkai Xiong
ECCV 2022. Static and Dynamic Concepts for Self-Supervised Video Representation Learning. Rui Qian, Shuangrui Ding, Xian Liu, Dahua Lin
AAAI 2022. TA2N: Two-Stage Action Alignment Network for Few-Shot Action Recognition. Shuyuan Li, Huabin Liu, Rui Qian, Yuxi Li, John See, Mengjuan Fei, Xiaoyuan Yu, Weiyao Lin
AAAI 2022. Visual Sound Localization in the Wild by Cross-Modal Interference Erasing. Xian Liu, Rui Qian, Hang Zhou, Di Hu, Weiyao Lin, Ziwei Liu, Bolei Zhou, Xiaowei Zhou
ICCV 2021. Enhancing Self-Supervised Video Representation Learning via Multi-Level Feature Optimization. Rui Qian, Yuxi Li, Huabin Liu, John See, Shuangrui Ding, Xian Liu, Dian Li, Weiyao Lin
CVPR 2021. Simple Copy-Paste Is a Strong Data Augmentation Method for Instance Segmentation. Golnaz Ghiasi, Yin Cui, Aravind Srinivas, Rui Qian, Tsung-Yi Lin, Ekin D. Cubuk, Quoc V. Le, Barret Zoph
CVPR 2021. Spatiotemporal Contrastive Video Representation Learning. Rui Qian, Tianjian Meng, Boqing Gong, Ming-Hsuan Yang, Huisheng Wang, Serge Belongie, Yin Cui
NeurIPS 2021. VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text. Hassan Akbari, Liangzhe Yuan, Rui Qian, Wei-Hong Chuang, Shih-Fu Chang, Yin Cui, Boqing Gong
NeurIPS 2020. Discriminative Sounding Objects Localization via Self-Supervised Audiovisual Matching. Di Hu, Rui Qian, Minyue Jiang, Xiao Tan, Shilei Wen, Errui Ding, Weiyao Lin, Dejing Dou
AAAI 2020. Finding Action Tubes with a Sparse-to-Dense Framework. Yuxi Li, Weiyao Lin, Tao Wang, John See, Rui Qian, Ning Xu, Limin Wang, Shugong Xu
ECCV 2020. Multiple Sound Sources Localization from Coarse to Fine. Rui Qian, Di Hu, Heinrich Dinkel, Mengyue Wu, Ning Xu, Weiyao Lin
AAAI 2019. Weakly Supervised Scene Parsing with Point-Based Distance Metric Learning. Rui Qian, Yunchao Wei, Honghui Shi, Jiachen Li, Jiaying Liu, Thomas S. Huang