Mi, Yapeng

1 publications

NeurIPS 2025 Iterative Tool Usage Exploration for Multimodal Agents via Step-Wise Preference Tuning Pengxiang Li, Zhi Gao, Bofei Zhang, Yapeng Mi, Xiaojian Ma, Chenrui Shi, Tao Yuan, Yuwei Wu, Yunde Jia, Song-Chun Zhu, Qing Li