Mi, Yapeng
1 publications
NeurIPS
2025
Iterative Tool Usage Exploration for Multimodal Agents via Step-Wise Preference Tuning
Pengxiang Li, Zhi Gao, Bofei Zhang, Yapeng Mi, Xiaojian Ma, Chenrui Shi, Tao Yuan, Yuwei Wu, Yunde Jia, Song-Chun Zhu, Qing Li