Yang, Fangkai

12 publications

ICLR 2026 DoVer: Intervention-Driven Auto Debugging for LLM Multi-Agent Systems Ming Ma, Jue Zhang, Fangkai Yang, Yu Kang, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang
ICLR 2026 Pretrain Value, Not Reward: Decoupled Value Policy Optimization Chenghua Huang, Lu Wang, Fangkai Yang, Pu Zhao, Qingwei Lin, Dongmei Zhang, Saravan Rajmohan
ICLR 2026 RePrompt: Reasoning-Augmented Reprompting for Text-to-Image Generation via Reinforcement Learning Mingrui Wu, Lu Wang, Pu Zhao, Fangkai Yang, Jianjin Zhang, Jianfeng Liu, Yuefeng Zhan, Weihao Han, Hao Sun, Jiayi Ji, Xiaoshuai Sun, Qingwei Lin, Weiwei Deng, Dongmei Zhang, Feng Sun, Rongrong Ji
TMLR 2026 VEM: Environment-Free Exploration for Training GUI Agent with Value Environment Model Mengzhuo Chen, Jiani Zheng, Lu Wang, Fangkai Yang, Chaoyun Zhang, Lingrui Mei, Wenjie Yin, Qingwei Lin, Dongmei Zhang, Saravan Rajmohan
TMLR 2025 Large Action Models: From Inception to Implementation Lu Wang, Fangkai Yang, Chaoyun Zhang, Junting Lu, Jiaxu Qian, Shilin He, Pu Zhao, Bo Qiao, He Huang, Si Qin, Qisheng Su, Jiayi Ye, Yudi Zhang, Jian-Guang Lou, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang
ICLR 2025 Self-Evolved Reward Learning for LLMs Chenghua Huang, Zhizhen Fan, Lu Wang, Fangkai Yang, Pu Zhao, Zeqi Lin, Qingwei Lin, Dongmei Zhang, Saravan Rajmohan, Qi Zhang
IJCAI 2023 Measuring Acoustics with Collaborative Multiple Agents Yinfeng Yu, Changan Chen, Lele Cao, Fangkai Yang, Fuchun Sun
AAAI 2019 Logic-Based Sequential Decision-Making Daoming Lyu, Fangkai Yang, Bo Liu, Daesub Yoon
AAAI 2019 SDRL: Interpretable and Data-Efficient Deep Reinforcement Learning Leveraging Symbolic Planning Daoming Lyu, Fangkai Yang, Bo Liu, Steven Gustafson
IJCAI 2018 PEORL: Integrating Symbolic Planning and Hierarchical Reinforcement Learning for Robust Decision-Making Fangkai Yang, Daoming Lyu, Bo Liu, Steven Gustafson
IJCAI 2016 Planning with Task-Oriented Knowledge Acquisition for a Service Robot Kai Chen, Fangkai Yang, Xiaoping Chen
IJCAI 2013 Action Language BC: Preliminary Report Joohyung Lee, Vladimir Lifschitz, Fangkai Yang