Mao, Hangyu

17 publications

TMLR 2025 QPO: Query-Dependent Prompt Optimization via Multi-Loop Offline Reinforcement Learning Yilun Kong, Hangyu Mao, Qi Zhao, Bin Zhang, Jingqing Ruan, Li Shen, Yongzhe Chang, Xueqian Wang, Rui Zhao, Dacheng Tao
ICLRW 2025 Query-Dependent Prompt Optimization via Multi-Loop Offline Reinforcement Learning Yilun Kong, Hangyu Mao, Qi Zhao, Bin Zhang, Jingqing Ruan, Li Shen, Yongzhe Chang, Xueqian Wang, Rui Zhao, Dacheng Tao
ICML 2025 Reidentify: Context-Aware Identity Generation for Contextual Multi-Agent Reinforcement Learning Zhiwei Xu, Kun Hu, Xin Xin, Weiliang Meng, Yiwei Shi, Hangyu Mao, Bin Zhang, Dapeng Li, Jiangjin Yin
AAAI 2025 SkillTree: Explainable Skill-Based Deep Reinforcement Learning for Long-Horizon Control Tasks Yongyan Wen, Siyuan Li, Rongchang Zuo, Lei Yuan, Hangyu Mao, Peng Liu
ICLRW 2024 Controlling Large Language Model-Based Agents for Large-Scale Decision-Making: An Actor-Critic Approach Bin Zhang, Hangyu Mao, Jingqing Ruan, Ying Wen, Yang Li, Shao Zhang, Zhiwei Xu, Dapeng Li, Ziyue Li, Rui Zhao, Guoliang Fan, Lijuan Li
IJCAI 2024 PTDE: Personalized Training with Distilled Execution for Multi-Agent Reinforcement Learning Yiqun Chen, Hangyu Mao, Jiaxin Mao, Shiguang Wu, Tianle Zhang, Bin Zhang, Wei Yang, Hongxing Chang
ICML 2024 Sequential Asynchronous Action Coordination in Multi-Agent Systems: A Stackelberg Decision Transformer Approach Bin Zhang, Hangyu Mao, Lijuan Li, Zhiwei Xu, Dapeng Li, Rui Zhao, Guoliang Fan
ICLRW 2024 TPTU-V2: Boosting Task Planning and Tool Usage of Large Language Model-Based Agents in Real-World Systems Yilun Kong, Jingqing Ruan, YiHong Chen, Bin Zhang, Tianpeng Bao, Shi Shiwei, Du Guo Qing, Xiaoru Hu, Hangyu Mao, Ziyue Li, Xingyu Zeng, Rui Zhao, Xueqian Wang
IJCAI 2024 X-Light: Cross-City Traffic Signal Control Using Transformer on Transformer as Meta Multi-Agent Reinforcement Learner Haoyuan Jiang, Ziyue Li, Hua Wei, Xuantang Xiong, Jingqing Ruan, Jiaming Lu, Hangyu Mao, Rui Zhao
ICLR 2023 Boosting Multiagent Reinforcement Learning via Permutation Invariant and Permutation Equivariant Networks Jianye Hao, Xiaotian Hao, Hangyu Mao, Weixun Wang, Yaodong Yang, Dong Li, Yan Zheng, Zhen Wang
NeurIPSW 2023 TPTU: Task Planning and Tool Usage of Large Language Model-Based AI Agents Jingqing Ruan, YiHong Chen, Bin Zhang, Zhiwei Xu, Tianpeng Bao, Du Guo Qing, Shi Shiwei, Hangyu Mao, Ziyue Li, Xingyu Zeng, Rui Zhao
IJCAI 2022 Fast and Fine-Grained Autoscaler for Streaming Jobs with Reinforcement Learning Mingzhe Xing, Hangyu Mao, Zhen Xiao
NeurIPS 2022 Multiagent Q-Learning with Sub-Team Coordination Wenhan Huang, Kai Li, Kun Shao, Tianze Zhou, Matthew Taylor, Jun Luo, Dongge Wang, Hangyu Mao, Jianye Hao, Jun Wang, Xiaotie Deng
AAAI 2022 What About Inputting Policy in Value Function: Policy Representation and Policy-Extended Value Function Approximator Hongyao Tang, Zhaopeng Meng, Jianye Hao, Chen Chen, Daniel Graves, Dong Li, Changmin Yu, Hangyu Mao, Wulong Liu, Yaodong Yang, Wenyuan Tao, Li Wang
NeurIPS 2021 An Efficient Transfer Learning Framework for Multiagent Reinforcement Learning Tianpei Yang, Weixun Wang, Hongyao Tang, Jianye Hao, Zhaopeng Meng, Hangyu Mao, Dong Li, Wulong Liu, Yingfeng Chen, Yujing Hu, Changjie Fan, Chengwei Zhang
AAAI 2020 Learning Agent Communication Under Limited Bandwidth by Message Pruning Hangyu Mao, Zhengchao Zhang, Zhen Xiao, Zhibo Gong, Yan Ni
AAAI 2020 Neighborhood Cognition Consistent Multi-Agent Reinforcement Learning Hangyu Mao, Wulong Liu, Jianye Hao, Jun Luo, Dong Li, Zhengchao Zhang, Jun Wang, Zhen Xiao