Xie, Chen-Wei

12 publications

ICLR 2025 ACE: All-Round Creator and Editor Following Instructions via Diffusion Transformer Zhen Han, Zeyinzi Jiang, Yulin Pan, Jingfeng Zhang, Chaojie Mao, Chen-Wei Xie, Yu Liu, Jingren Zhou
ICLR 2025 Aligned Better, Listen Better for Audio-Visual Large Language Models Yuxin Guo, Shuailei Ma, Shijie Ma, Xiaoyi Bao, Chen-Wei Xie, Kecheng Zheng, Tingyu Weng, Siyang Sun, Yun Zheng, Wei Zou
CVPR 2025 BACON: Improving Clarity of Image Captions via Bag-of-Concept Graphs Zhantao Yang, Ruili Feng, Keyu Yan, Huangji Wang, Zhicai Wang, Shangwen Zhu, Han Zhang, Jie Xiao, Pingyu Wu, Kai Zhu, Jixuan Chen, Chen-Wei Xie, Yue Yang, Hongyang Zhang, Yu Liu, Fan Cheng
NeurIPS 2025 CAPability: A Comprehensive Visual Caption Benchmark for Evaluating Both Correctness and Thoroughness Zhihang Liu, Chen-Wei Xie, Bin Wen, Feiwu Yu, Jixuan Chen, Pandeng Li, Boqiang Zhang, Nianzu Yang, Yinglu Li, Zuan Gao, Yun Zheng, Hongtao Xie
CVPR 2025 Hybrid-Level Instruction Injection for Video Token Compression in Multi-Modal Large Language Models Zhihang Liu, Chen-Wei Xie, Pandeng Li, Liming Zhao, Longxiang Tang, Yun Zheng, Chuanbin Liu, Hongtao Xie
CVPR 2025 Learning Visual Generative Priors Without Text Shuailei Ma, Kecheng Zheng, Ying Wei, Wei Wu, Fan Lu, Yifei Zhang, Chen-Wei Xie, Biao Gong, Jiapeng Zhu, Yujun Shen
NeurIPS 2025 UFO: A Unified Approach to Fine-Grained Visual Perception via Open-Ended Language Interface Hao Tang, Chen-Wei Xie, Haiyang Wang, Xiaoyi Bao, Tingyu Weng, Pandeng Li, Yun Zheng, Liwei Wang
ECCV 2024 FuseTeacher: Modality-Fused Encoders Are Strong Vision Supervisors Chen-Wei Xie, Siyang Sun, Liming Zhao, Pandeng Li, Shuailei Ma, Yun Zheng
NeurIPS 2023 MomentDiff: Generative Video Moment Retrieval from Random to Real Pandeng Li, Chen-Wei Xie, Hongtao Xie, Liming Zhao, Lei Zhang, Yun Zheng, Deli Zhao, Yongdong Zhang
ICCV 2023 Progressive Spatio-Temporal Prototype Matching for Text-Video Retrieval Pandeng Li, Chen-Wei Xie, Liming Zhao, Hongtao Xie, Jiannan Ge, Yun Zheng, Deli Zhao, Yongdong Zhang
CVPR 2023 RA-CLIP: Retrieval Augmented Contrastive Language-Image Pre-Training Chen-Wei Xie, Siyang Sun, Xiong Xiong, Yun Zheng, Deli Zhao, Jingren Zhou
IJCAI 2017 Deep Descriptor Transforming for Image Co-Localization Xiu-Shen Wei, Chen-Lin Zhang, Yao Li, Chen-Wei Xie, Jianxin Wu, Chunhua Shen, Zhi-Hua Zhou