Feng, Yihao

29 publications

ICLR 2025 Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents Kexun Zhang, Weiran Yao, Zuxin Liu, Yihao Feng, Zhiwei Liu, R N Rithesh, Tian Lan, Lei Li, Renze Lou, Jiacheng Xu, Bo Pang, Yingbo Zhou, Shelby Heinecke, Silvio Savarese, Huan Wang, Caiming Xiong
ICLR 2025 Longhorn: State Space Models Are Amortized Online Learners Bo Liu, Rui Wang, Lemeng Wu, Yihao Feng, Peter Stone, Qiang Liu
ICCV 2025 Structured Policy Optimization: Enhance Large Vision-Language Model via Self-Referenced Dialogue Guohao Sun, Can Qin, Yihao Feng, Zeyuan Chen, Ran Xu, Sohail Dianat, Majid Rabbani, Raghuveer Rao, Zhiqiang Tao
AAAI 2025 Text2Data: Low-Resource Data Generation with Textual Control Shiyu Wang, Yihao Feng, Tian Lan, Ning Yu, Yu Bai, Ran Xu, Huan Wang, Caiming Xiong, Silvio Savarese
NeurIPS 2024 APIGen: Automated PIpeline for Generating Verifiable and Diverse Function-Calling Datasets Zuxin Liu, Thai Hoang, Jianguo Zhang, Ming Zhu, Tian Lan, Shirley Kokane, Juntao Tan, Weiran Yao, Zhiwei Liu, Yihao Feng, Rithesh Murthy, Liangwei Yang, Silvio Savarese, Juan Carlos Niebles, Huan Wang, Shelby Heinecke, Caiming Xiong
ICLRW 2024 Bolaa: Benchmarking and Orchestrating LLM Autonomous Agents Zhiwei Liu, Weiran Yao, Jianguo Zhang, Le Xue, Shelby Heinecke, R N Rithesh, Yihao Feng, Zeyuan Chen, Juan Carlos Niebles, Devansh Arpit, Ran Xu, Phil L Mui, Huan Wang, Caiming Xiong, Silvio Savarese
CVPR 2024 HIVE: Harnessing Human Feedback for Instructional Visual Editing Shu Zhang, Xinyi Yang, Yihao Feng, Can Qin, Chia-Chih Chen, Ning Yu, Zeyuan Chen, Huan Wang, Silvio Savarese, Stefano Ermon, Caiming Xiong, Ran Xu
ICLRW 2024 REX: Rapid Exploration and eXploitation for AI Agents R N Rithesh, Shelby Heinecke, Juan Carlos Niebles, Zhiwei Liu, Le Xue, Weiran Yao, Yihao Feng, Zeyuan Chen, Akash Gokul, Devansh Arpit, Ran Xu, Phil L Mui, Huan Wang, Caiming Xiong, Silvio Savarese
ICLR 2024 Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization Weiran Yao, Shelby Heinecke, Juan Carlos Niebles, Zhiwei Liu, Yihao Feng, Le Xue, R N Rithesh, Zeyuan Chen, Jianguo Zhang, Devansh Arpit, Ran Xu, Phil L Mui, Huan Wang, Caiming Xiong, Silvio Savarese
ICLRW 2024 Text2Data: Low-Resource Data Generation with Textual Control Shiyu Wang, Yihao Feng, Tian Lan, Ning Yu, Yu Bai, Ran Xu, Huan Wang, Caiming Xiong, Silvio Savarese
ICLRW 2024 The Agent Ohana: Designing Unified Data and Training Pipeline for Effective Agent Learning Jianguo Zhang, Tian Lan, R N Rithesh, Zhiwei Liu, Weiran Yao, Juntao Tan, Thai Quoc Hoang, Liangwei Yang, Yihao Feng, Zuxin Liu, Ming Zhu, Tulika Manoj Awalgaonkar, Juan Carlos Niebles, Silvio Savarese, Shelby Heinecke, Huan Wang, Caiming Xiong
ECCVW 2024 xGen-VideoSyn-1: High-Fidelity Text-to-Video Synthesis with Compressed Representations Can Qin, Congying Xia, Krithika Ramakrishnan, Michael S. Ryoo, Lifu Tu, Yihao Feng, Manli Shu, Honglu Zhou, Anas Awadalla, Jun Wang, Senthil Purushwalkam, Le Xue, Yingbo Zhou, Huan Wang, Silvio Savarese, Juan Carlos Niebles, Zeyuan Chen, Ran Xu, Caiming Xiong
NeurIPS 2023 FAMO: Fast Adaptive Multitask Optimization Bo Liu, Yihao Feng, Peter Stone, Qiang Liu
ICLR 2023 Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-Oriented Dialogue Systems Yihao Feng, Shentao Yang, Shujian Zhang, Jianguo Zhang, Caiming Xiong, Mingyuan Zhou, Huan Wang
NeurIPS 2023 LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning Bo Liu, Yifeng Zhu, Chongkai Gao, Yihao Feng, Qiang Liu, Yuke Zhu, Peter Stone
AAAI 2023 Metric Residual Network for Sample Efficient Goal-Conditioned Reinforcement Learning Bo Liu, Yihao Feng, Qiang Liu, Peter Stone
NeurIPS 2023 Preference-Grounded Token-Level Guidance for Language Model Fine-Tuning Shentao Yang, Shujian Zhang, Congying Xia, Yihao Feng, Caiming Xiong, Mingyuan Zhou
NeurIPS 2023 UniControl: A Unified Diffusion Model for Controllable Visual Generation in the Wild Can Qin, Shu Zhang, Ning Yu, Yihao Feng, Xinyi Yang, Yingbo Zhou, Huan Wang, Juan Carlos Niebles, Caiming Xiong, Silvio Savarese, Stefano Ermon, Yun Fu, Ran Xu
NeurIPS 2022 A Unified Framework for Alternating Offline Model Training and Policy Learning Shentao Yang, Shujian Zhang, Yihao Feng, Mingyuan Zhou
NeurIPSW 2022 Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-Oriented Dialogue Systems Yihao Feng, Shentao Yang, Shujian Zhang, Jianguo Zhang, Caiming Xiong, Mingyuan Zhou, Huan Wang
NeurIPSW 2022 Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-Oriented Dialogue Systems Yihao Feng, Shentao Yang, Shujian Zhang, Jianguo Zhang, Caiming Xiong, Mingyuan Zhou, Huan Wang
ICML 2022 Regularizing a Model-Based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning Shentao Yang, Yihao Feng, Shujian Zhang, Mingyuan Zhou
ICLR 2021 Non-Asymptotic Confidence Intervals of Off-Policy Evaluation: Primal and Dual Bounds Yihao Feng, Ziyang Tang, Na Zhang, Qiang Liu
ICML 2020 Accountable Off-Policy Evaluation with Kernel Bellman Statistics Yihao Feng, Tongzheng Ren, Ziyang Tang, Qiang Liu
ICLR 2020 Doubly Robust Bias Reduction in Infinite Horizon Off-Policy Estimation Ziyang Tang, Yihao Feng, Lihong Li, Dengyong Zhou, Qiang Liu
NeurIPS 2020 Off-Policy Interval Estimation with Lipschitz Value Iteration Ziyang Tang, Yihao Feng, Na Zhang, Jian Peng, Qiang Liu
NeurIPS 2019 A Kernel Loss for Solving the Bellman Equation Yihao Feng, Lihong Li, Qiang Liu
ICLR 2018 Action-Dependent Control Variates for Policy Optimization via Stein Identity Hao Liu, Yihao Feng, Yi Mao, Dengyong Zhou, Jian Peng, Qiang Liu
UAI 2017 Learning to Draw Samples with Amortized Stein Variational Gradient Descent Yihao Feng, Dilin Wang, Qiang Liu