Fan, YuTao

3 publications

TMLR 2026. The Landscape of Agentic Reinforcement Learning for LLMs: A Survey. Guibin Zhang, Hejia Geng, Xiaohang Yu, Zhenfei Yin, Zaibin Zhang, Zelin Tan, Heng Zhou, Zhong-Zhi Li, Xiangyuan Xue, Yijiang Li, Yifan Zhou, Yang Chen, Chen Zhang, Yutao Fan, Zihu Wang, Songtao Huang, Francisco Piedrahita Velez, Yue Liao, Hongru Wang, Mengyue Yang, Heng Ji, Jun Wang, Shuicheng Yan, Philip Torr, Lei Bai
NeurIPS 2025. BMMR: A Large-Scale Bilingual Multimodal Multi-Discipline Reasoning Dataset. Zhiheng Xi, Guanyu Li, YuTao Fan, Honglin Guo, Yufang Liu, Xiaoran Fan, Jiaqi Liu, Dingjinchao, Wangmeng Zuo, Zhenfei Yin, Lei Bai, Tao Ji, Tao Gui, Qi Zhang, Xuanjing Huang
ICLR 2025. Visual-O1: Understanding Ambiguous Instructions via Multi-Modal Multi-Turn Chain-of-Thoughts Reasoning. Minheng Ni, YuTao Fan, Lei Zhang, Wangmeng Zuo