Wu, Qingyun

25 publications

TMLR 2026 A Survey of Self-Evolving Agents: What, When, How, and Where to Evolve on the Path to Artificial Super Intelligence Huan-ang Gao, Jiayi Geng, Wenyue Hua, Mengkang Hu, Xinzhe Juan, Hongzhang Liu, Shilong Liu, Jiahao Qiu, Xuan Qi, Qihan Ren, Yiran Wu, Hongru Wang, Han Xiao, Yuhang Zhou, Shaokun Zhang, Jiayi Zhang, Jinyu Xiang, Yixiong Fang, Qiwen Zhao, Dongrui Liu, Cheng Qian, Zhenhailong Wang, Minda Hu, Huazheng Wang, Qingyun Wu, Heng Ji, Mengdi Wang
NeurIPS 2025 Absolute Zero: Reinforced Self-Play Reasoning with Zero Data Andrew Zhao, Yiran Wu, Yang Yue, Tong Wu, Quentin Xu, Yang Yue, Matthieu Lin, Shenzhi Wang, Qingyun Wu, Zilong Zheng, Gao Huang
ICML 2025 BEST-Route: Adaptive LLM Routing with Test-Time Optimal Compute Dujian Ding, Ankur Mallick, Shaokun Zhang, Chi Wang, Daniel Madrigal, Mirian Del Carmen Hipolito Garcia, Menglin Xia, Laks V. S. Lakshmanan, Qingyun Wu, Victor Rühle
ICLRW 2025 EcoAct: Economic Agent Determines When to Register What Action Shaokun Zhang, Jieyu Zhang, Dujian Ding, Jiale Liu, Mirian Del Carmen Hipolito Garcia, Ankur Mallick, Daniel Madrigal, Menglin Xia, Victor Rühle, Qingyun Wu, Chi Wang
TMLR 2025 Fair Online Influence Maximization Xiangqi Wang, Shaokun Zhang, Jose Efraim Aguilar Escamilla, Qingyun Wu, Xiangliang Zhang, Jian Kang, Huazheng Wang
TMLR 2025 Hard Work Does Not Always Pay Off: On the Robustness of NAS to Data Poisoning Zachary Coalson, Huazheng Wang, Qingyun Wu, Sanghyun Hong
ICML 2025 Which Agent Causes Task Failures and When? on Automated Failure Attribution of LLM Multi-Agent Systems Shaokun Zhang, Ming Yin, Jieyu Zhang, Jiale Liu, Zhiguang Han, Jingyang Zhang, Beibin Li, Chi Wang, Huazheng Wang, Yiran Chen, Qingyun Wu
ICML 2024 Adversarial Attacks on Combinatorial Multi-Armed Bandits Rishab Balasubramanian, Jiawei Li, Prasad Tadepalli, Huazheng Wang, Qingyun Wu, Haoyu Zhao
NeurIPSW 2024 AutoDefense: Multi-Agent LLM Defense Against Jailbreak Attacks Yifan Zeng, Yiran Wu, Xiao Zhang, Huazheng Wang, Qingyun Wu
ICLRW 2024 AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation Qingyun Wu, Gagan Bansal, Jieyu Zhang, Yiran Wu, Beibin Li, Erkang Zhu, Li Jiang, Xiaoyun Zhang, Shaokun Zhang, Jiale Liu, Ahmed Hassan Awadallah, Ryen W White, Doug Burger, Chi Wang
NeurIPSW 2024 Embodied LLM Agents Learn to Cooperate in Organized Teams Xudong Guo, Kaixuan Huang, Jiale Liu, Wenhui Fan, Natalia Vélez, Qingyun Wu, Huazheng Wang, Thomas L. Griffiths, Mengdi Wang
ICLR 2024 IDEAL: Influence-Driven Selective Annotations Empower In-Context Learners in Large Language Models Shaokun Zhang, Xiaobo Xia, Zhaoqing Wang, Ling-Hao Chen, Jiale Liu, Qingyun Wu, Tongliang Liu
ICLRW 2024 MathChat: Converse to Tackle Challenging Math Problems with LLM Agents Yiran Wu, Feiran Jia, Shaokun Zhang, Hangyu Li, Erkang Zhu, Yue Wang, Yin Tat Lee, Richard Peng, Qingyun Wu, Chi Wang
ICML 2024 Offline Training of Language Model Agents with Functions as Learnable Weights Shaokun Zhang, Jieyu Zhang, Jiale Liu, Linxin Song, Chi Wang, Ranjay Krishna, Qingyun Wu
ICML 2024 Refined Coreset Selection: Towards Minimal Coreset Size Under Model Performance Constraints Xiaobo Xia, Jiale Liu, Shaokun Zhang, Qingyun Wu, Hongxin Wei, Tongliang Liu
NeurIPSW 2024 StateFlow: Enhancing LLM Task-Solving Through State-Driven Workflows Yiran Wu, Tianwei Yue, Shaokun Zhang, Chi Wang, Qingyun Wu
NeurIPS 2023 Multi-Fidelity Multi-Armed Bandits Revisited Xuchuang Wang, Qingyun Wu, Wei Chen, John C.S. Lui
ICLR 2023 Targeted Hyperparameter Optimization with Lexicographic Preferences over Multiple Objectives Shaokun Zhang, Feiran Jia, Chi Wang, Qingyun Wu
NeurIPS 2023 Unified Off-Policy Learning to Rank: A Reinforcement Learning Perspective Zeyu Zhang, Yi Su, Hui Yuan, Yiran Wu, Rishab Balasubramanian, Qingyun Wu, Huazheng Wang, Mengdi Wang
AISTATS 2021 Unifying Clustered and Non-Stationary Bandits Chuanhao Li, Qingyun Wu, Hongning Wang
ICML 2021 ChaCha for Online AutoML Qingyun Wu, Chi Wang, John Langford, Paul Mineiro, Marco Rossi
ICLR 2021 Economic Hyperparameter Optimization with Blended Search Strategy Chi Wang, Qingyun Wu, Silu Huang, Amin Saied
AAAI 2021 Frugal Optimization for Cost-Related Hyperparameters Qingyun Wu, Chi Wang, Silu Huang
NeurIPS 2018 Bandit Learning with Implicit Feedback Yi Qi, Qingyun Wu, Hongning Wang, Jie Tang, Maosong Sun
AAAI 2017 Factorization Bandits for Interactive Recommendation Huazheng Wang, Qingyun Wu, Hongning Wang