Wei, Xinyu

9 publications

NeurIPS 2025 Delving into RL for Image Generation with CoT: A Study on DPO vs. GRPO Chengzhuo Tong, Ziyu Guo, Renrui Zhang, Wenyu Shan, Xinyu Wei, Zhenghao Xing, Hongsheng Li, Pheng-Ann Heng
ICLR 2025 Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want Weifeng Lin, Xinyu Wei, Ruichuan An, Peng Gao, Bocheng Zou, Yulin Luo, Siyuan Huang, Shanghang Zhang, Hongsheng Li
AAAI 2025 Event2Tracking: Reconstructing Multi-Agent Soccer Trajectories Using Long-Term Multimodal Context Harry Hughes, Michael Horton, Xinyu Wei, Harshala Gammulle, Clinton Fookes, Sridha Sridharan, Patrick Lucey
ICLR 2025 MAVIS: Mathematical Visual Instruction Tuning with an Automatic Data Engine Renrui Zhang, Xinyu Wei, Dongzhi Jiang, Ziyu Guo, Yichi Zhang, Chengzhuo Tong, Jiaming Liu, Aojun Zhou, Shanghang Zhang, Peng Gao, Hongsheng Li
NeurIPS 2025 Perceive Anything: Recognize, Explain, Caption, and Segment Anything in Images and Videos Weifeng Lin, Xinyu Wei, Ruichuan An, Tianhe Ren, Tingwei Chen, Renrui Zhang, Ziyu Guo, Wentao Zhang, Lei Zhang, Hongsheng Li
ICLR 2025 PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions Weifeng Lin, Xinyu Wei, Renrui Zhang, Le Zhuo, Shitian Zhao, Siyuan Huang, Junlin Xie, Peng Gao, Hongsheng Li
CVPR 2024 Cloud-Device Collaborative Learning for Multimodal Large Language Models Guanqun Wang, Jiaming Liu, Chenxuan Li, Yuan Zhang, Junpeng Ma, Xinyu Wei, Kevin Zhang, Maurice Chong, Renrui Zhang, Yijiang Liu, Shanghang Zhang
CVPRW 2024 IDAdapter: Learning Mixed Features for Tuning-Free Personalization of Text-to-Image Models Siying Cui, Jia Guo, Xiang An, Jiankang Deng, Yongle Zhao, Xinyu Wei, Ziyong Feng
ICCVW 2015 Predicting Ball Ownership in Basketball from a Monocular View Using Only Player Trajectories Xinyu Wei, Long Sha, Patrick Lucey, Peter Carr, Sridha Sridharan, Iain A. Matthews