Zhong, Yiwu

14 publications

ICCV 2025 AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and Pruning Yiwu Zhong, Zhuoming Liu, Yin Li, Liwei Wang
ICCV 2025 Fine-Grained Spatiotemporal Grounding on Egocentric Videos Shuo Liang, Yiwu Zhong, Zi-Yuan Hu, Yeyao Tao, Liwei Wang
CVPR 2025 PAVE: Patching and Adapting Video Large Language Models Zhuoming Liu, Yiquan Li, Khoi Duc Nguyen, Yiwu Zhong, Yin Li
AAAI 2025 Revisiting Tampered Scene Text Detection in the Era of Generative AI Chenfan Qu, Yiwu Zhong, Fengjun Guo, Lianwen Jin
NeurIPSW 2024 TemporalBench: Benchmarking Fine-Grained Temporal Understanding for Multimodal Video Models Mu Cai, Reuben Tan, Jianrui Zhang, Bocheng Zou, Kai Zhang, Yao Feng, Fangrui Zhu, Jing Gu, Yiwu Zhong, Yuzhang Shang, Yao Dou, Jaden Park, Jianfeng Gao, Yong Jae Lee, Jianwei Yang
CVPR 2024 Towards Learning a Generalist Model for Embodied Navigation Duo Zheng, Shijia Huang, Lin Zhao, Yiwu Zhong, Liwei Wang
CVPR 2024 Towards Modern Image Manipulation Localization: A Large-Scale Dataset and Novel Methods Chenfan Qu, Yiwu Zhong, Chongyu Liu, Guitao Xu, Dezhi Peng, Fengjun Guo, Lianwen Jin
ICCV 2023 Learning Concise and Descriptive Attributes for Visual Recognition An Yan, Yu Wang, Yiwu Zhong, Chengyu Dong, Zexue He, Yujie Lu, William Yang Wang, Jingbo Shang, Julian McAuley
CVPR 2023 Learning Procedure-Aware Video Representation from Instructional Videos and Their Narrations Yiwu Zhong, Licheng Yu, Yang Bai, Shangwen Li, Xueting Yan, Yin Li
CVPR 2022 Grounded Language-Image Pre-Training Liunian Harold Li, Pengchuan Zhang, Haotian Zhang, Jianwei Yang, Chunyuan Li, Yiwu Zhong, Lijuan Wang, Lu Yuan, Lei Zhang, Jenq-Neng Hwang, Kai-Wei Chang, Jianfeng Gao
CVPR 2022 RegionCLIP: Region-Based Language-Image Pretraining Yiwu Zhong, Jianwei Yang, Pengchuan Zhang, Chunyuan Li, Noel Codella, Liunian Harold Li, Luowei Zhou, Xiyang Dai, Lu Yuan, Yin Li, Jianfeng Gao
ICCV 2021 A Simple Baseline for Weakly-Supervised Scene Graph Generation Jing Shi, Yiwu Zhong, Ning Xu, Yin Li, Chenliang Xu
ICCV 2021 Learning to Generate Scene Graph from Natural Language Supervision Yiwu Zhong, Jing Shi, Jianwei Yang, Chenliang Xu, Yin Li
ECCV 2020 Comprehensive Image Captioning via Scene Graph Decomposition Yiwu Zhong, Liwei Wang, Jianshu Chen, Dong Yu, Yin Li