Du, Yali

53 publications

NeurIPS 2025 Abstract Counterfactuals for Language Model Agents Edoardo Pona, Milad Kazemi, Yali Du, David Watson, Nicola Paoletti
MLJ 2025 Capturing the Context-Aware Code Change via Dynamic Control Flow Graph for Commit Message Generation Yali Du, Ying Li, Yi-Fan Ma, Ming Li
NeurIPS 2025 Causality Meets Locality: Provably Generalizable and Scalable Policy Learning for Networked Systems Hao Liang, Shuqing Shi, Yudi Zhang, Biwei Huang, Yali Du
NeurIPS 2025 Evaluating Generalization Capabilities of LLM-Based Agents in Mixed-Motive Scenarios Using Concordia Chandler Smith, Marwa Abdulhai, Manfred Diaz, Marko Tesic, Rakshit Trivedi, Sasha Vezhnevets, Lewis Hammond, Jesse Clifton, Minsuk Chang, Edgar A. Duéñez-Guzmán, John P Agapiou, Jayd Matyas, Danny Karmon, Beining Zhang, Jim Dilkes, Akash Kundu, Jord Nguyen, Emanuel Tewolde, Jebish Purbey, Ram Mohan Rao Kadiyala, Siddhant Gupta, Aliaksei Korshuk, Buyantuev Alexander, Ilya Makarov, Gang Zhao, Rolando Fernandez, Zhihan Wang, Caroline Wang, Jiaxun Cui, Lingyun Xiao, Di Yang Shi, Yoonchang Sung, Arrasy Rahman, Peter Stone, Yipeng Kang, Hyeonggeun Yun, Ananya Ananya, Taehun Cha, Zhiqiang Wu, Elizaveta Tennant, Olivia Macmillan-Scott, Marta Emili García Segura, Diana Riazi, Fuyang Cui, Sriram Ganapathi Subramanian, Toryn Q. Klassen, Nico Schiavone, Mogtaba Alim, Sheila A. McIlraith, Manuel Sebastian Rios Beltran, Oswaldo Peña, Carlos Saith Rodriguez Rojas, Manuela Chacon-Chamorro, Ruben Manrique, Luis Felipe Giraldo, Nicanor Quijano, Yiding Wang, Yuxuan Chen, Fangwei Zhong, Mengmeng Wang, Wenming Tu, Zhaowei Zhang, Ziang Chen, Zixia Jia, Xue Feng, Zilong Zheng, Chichen Lin, Weijian Fan, Chenao Liu, Sneheel Sarangi, Ziyan Wang, Shuqing Shi, Yali Du, Avinaash Anand Kulandaivel, Yang Liu, Wu Ruiyang, Chetan Talele, 陆孙嘉, Gema Parreño Piqueras, Shamika Dhuri, Bain McHale, Tim Baarslag, Dylan Hadfield-Menell, Natasha Jaques, Jose Hernandez-Orallo, Joel Z Leibo
ICML 2025 GRU: Mitigating the Trade-Off Between Unlearning and Retention for LLMs Yue Wang, Qizhou Wang, Feng Liu, Wei Huang, Yali Du, Xiaojiang Du, Bo Han
ICML 2025 M$^3$HF: Multi-Agent Reinforcement Learning from Multi-Phase Human Feedback of Mixed Quality Ziyan Wang, Zhicheng Zhang, Fei Fang, Yali Du
TMLR 2025 MACCA: Offline Multi-Agent Reinforcement Learning with Causal Credit Assignment Ziyan Wang, Yali Du, Yudi Zhang, Meng Fang, Biwei Huang
ICLR 2025 On the Optimization Landscape of Low Rank Adaptation Methods for Large Language Models Xu-Hui Liu, Yali Du, Jun Wang, Yang Yu
IJCAI 2025 Quantifying the Self-Interest Level of Markov Social Dilemmas Richard Willis, Yali Du, Joel Z. Leibo, Michael Luck
AAAI 2025 RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors Fengshuo Bai, Runze Liu, Yali Du, Ying Wen, Yaodong Yang
ICLR 2025 RuAG: Learned-Rule-Augmented Generation for Large Language Models Yudi Zhang, Pei Xiao, Lu Wang, Chaoyun Zhang, Meng Fang, Yali Du, Yevgeniy Puzyrev, Randolph Yao, Si Qin, Qingwei Lin, Mykola Pechenizkiy, Dongmei Zhang, Saravan Rajmohan, Qi Zhang
NeurIPS 2025 Self-Verifying Reflection Helps Transformers with CoT Reasoning Zhongwei Yu, Wannian Xia, Xue Yan, Bo Xu, Haifeng Zhang, Yali Du, Jun Wang
NeurIPS 2025 Social World Model-Augmented Mechanism Design Policy Learning Xiaoyuan Zhang, Yizhe Huang, Chengdong Ma, Zhixun Chen, Long Ma, Yali Du, Song-Chun Zhu, Yaodong Yang, Xue Feng
NeurIPSW 2024 A Causality-Inspired Spatial-Temporal Return Decomposition Approach for Multi-Agent Reinforcement Learning Yudi Zhang, Yali Du, Biwei Huang, Meng Fang, Mykola Pechenizkiy
NeurIPS 2024 Aligning Individual and Collective Objectives in Multi-Agent Cooperation Yang Li, Wenhao Zhang, Jianhong Wang, Shao Zhang, Yali Du, Ying Wen, Wei Pan
IJCAI 2024 Dual Contrastive Graph-Level Clustering with Multiple Cluster Perspectives Alignment Jinyu Cai, Yunhe Zhang, Jicong Fan, Yali Du, Wenzhong Guo
AAAI 2024 Human-Guided Moral Decision Making in Text-Based Games Zijing Shi, Meng Fang, Ling Chen, Yali Du, Jun Wang
ICMLW 2024 Learning Stable Allocations of Strictly Convex Stochastic Cooperative Games Nam Phuong Tran, The-Anh Ta, Shuqing Shi, Debmalya Mandal, Yali Du, Long Tran-Thanh
NeurIPS 2024 Learning the Expected Core of Strictly Convex Stochastic Cooperative Games Nam Phuong Tran, The Anh Ta, Shuqing Shi, Debmalya Mandal, Yali Du, Long Tran-Thanh
NeurIPS 2024 Learning to Discuss Strategically: A Case Study on One Night Ultimate Werewolf Xuanfa Jin, Ziyan Wang, Yali Du, Meng Fang, Haifeng Zhang, Jun Wang
NeurIPSW 2024 MACCA: Offline Multi-Agent Reinforcement Learning with Causal Credit Assignment Ziyan Wang, Yali Du, Yudi Zhang, Meng Fang, Biwei Huang
IJCAI 2024 Off-Agent Trust Region Policy Optimization Ruiqing Chen, Xiaoyuan Zhang, Yali Du, Yifan Zhong, Zheng Tian, Fanglei Sun, Yaodong Yang
ICML 2024 PEARL: Zero-Shot Cross-Task Preference Alignment and Robust Reward Learning for Robotic Manipulation Runze Liu, Yali Du, Fengshuo Bai, Jiafei Lyu, Xiu Li
NeurIPS 2024 Policy Learning from Tutorial Books via Understanding, Rehearsing and Introspecting Xiong-Hui Chen, Ziyan Wang, Yali Du, Shengyi Jiang, Meng Fang, Yang Yu, Jun Wang
AAAI 2024 STAS: Spatial-Temporal Return Decomposition for Solving Sparse Rewards Problems in Multi-Agent Reinforcement Learning Sirui Chen, Zhaowei Zhang, Yaodong Yang, Yali Du
NeurIPS 2024 Self-Guiding Exploration for Combinatorial Problems Zangir Iklassov, Yali Du, Farkhad Akimov, Martin Takáč
AAAI 2024 TAPE: Leveraging Agent Topology for Cooperative Multi-Agent Policy Gradient Xingzhou Lou, Junge Zhang, Timothy J. Norman, Kaiqi Huang, Yali Du
JAIR 2024 Tackling Cooperative Incompatibility for Zero-Shot Human-AI Coordination Yang Li, Shao Zhang, Jichen Sun, Wenhao Zhang, Yali Du, Ying Wen, Xinbing Wang, Wei Pan
ICMLW 2024 When Do Language Models Need to Be Large? Zhixun Chen, Yali Du, David Henry Mguni
NeurIPS 2023 An Efficient End-to-End Training Approach for Zero-Shot Human-AI Coordination Xue Yan, Jiaxian Guo, Xingzhou Lou, Jun Wang, Haifeng Zhang, Yali Du
IJCAI 2023 Capturing the Long-Distance Dependency in the Control Flow Graph via Structural-Guided Attention for Bug Localization Yi-Fan Ma, Yali Du, Ming Li
NeurIPS 2023 ChessGPT: Bridging Policy Learning and Language Modeling Xidong Feng, Yicheng Luo, Ziyan Wang, Hongrui Tang, Mengyue Yang, Kun Shao, David Mguni, Yali Du, Jun Wang
AAAI 2023 Cooperative Multi-Agent Learning in a Complex World: Challenges and Solutions Yali Du
ICML 2023 Cooperative Open-Ended Learning Framework for Zero-Shot Coordination Yang Li, Shao Zhang, Jichen Sun, Yali Du, Ying Wen, Xinbing Wang, Wei Pan
NeurIPS 2023 Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach Yudi Zhang, Yali Du, Biwei Huang, Ziyan Wang, Jun Wang, Meng Fang, Mykola Pechenizkiy
NeurIPS 2023 Invariant Learning via Probability of Sufficient and Necessary Causes Mengyue Yang, Zhen Fang, Yonggang Zhang, Yali Du, Furui Liu, Jean-Francois Ton, Jianhong Wang, Jun Wang
NeurIPS 2023 Reduced Policy Optimization for Continuous Control with Hard Constraints Shutong Ding, Jingya Wang, Yali Du, Ye Shi
ICLR 2023 Stay Moral and Explore: Learn to Behave Morally in Text-Based Games Zijing Shi, Meng Fang, Yunqiu Xu, Ling Chen, Yali Du
NeurIPSW 2023 Zero-Shot Cross-Task Preference Alignment for Offline RL via Optimal Transport Runze Liu, Yali Du, Fengshuo Bai, Jiafei Lyu, Xiu Li
NeurIPSW 2022 Contextual Transformer for Offline Meta Reinforcement Learning Runji Lin, Ye Li, Xidong Feng, Zhaowei Zhang, Xian Hong Wu Fung, Haifeng Zhang, Jun Wang, Yali Du, Yaodong Yang
AAAI 2022 Learning to Identify Top Elo Ratings: A Dueling Bandits Approach Xue Yan, Yali Du, Binxin Ru, Jun Wang, Haifeng Zhang, Xu Chen
NeurIPS 2022 Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-Based Reinforcement Learning Runze Liu, Fengshuo Bai, Yali Du, Yaodong Yang
ICLR 2022 Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL Rui Yang, Yiming Lu, Wenzhe Li, Hao Sun, Meng Fang, Yali Du, Xiu Li, Lei Han, Chongjie Zhang
ICML 2021 Estimating $α$-Rank from a Few Entries with Low Rank Matrix Completion Yali Du, Xue Yan, Xu Chen, Jun Wang, Haifeng Zhang
ICML 2021 Learning in Nonzero-Sum Stochastic Games with Potentials David H Mguni, Yutong Wu, Yali Du, Yaodong Yang, Ziyi Wang, Minne Li, Ying Wen, Joel Jennings, Jun Wang
NeurIPSW 2021 MHER: Model-Based Hindsight Experience Replay Rui Yang, Meng Fang, Lei Han, Yali Du, Feng Luo, Xiu Li
IJCAI 2021 Ordering-Based Causal Discovery with Reinforcement Learning Xiaoqiang Wang, Yali Du, Shengyu Zhu, Liangjun Ke, Zhitang Chen, Jianye Hao, Jun Wang
NeurIPS 2020 Deep Reinforcement Learning with Stacked Hierarchical Attention for Text-Based Games Yunqiu Xu, Meng Fang, Ling Chen, Yali Du, Joey Tianyi Zhou, Chengqi Zhang
NeurIPS 2019 Curriculum-Guided Hindsight Experience Replay Meng Fang, Tianyi Zhou, Yali Du, Lei Han, Zhengyou Zhang
ICML 2019 Grid-Wise Control for Multi-Agent Reinforcement Learning in Video Game AI Lei Han, Peng Sun, Yali Du, Jiechao Xiong, Qing Wang, Xinghai Sun, Han Liu, Tong Zhang
NeurIPS 2019 LIIR: Learning Individual Intrinsic Reward in Multi-Agent Reinforcement Learning Yali Du, Lei Han, Meng Fang, Ji Liu, Tianhong Dai, Dacheng Tao
IJCAI 2017 Collaborative Rating Allocation Yali Du, Chang Xu, Dacheng Tao
IJCAI 2017 Privileged Matrix Factorization for Collaborative Filtering Yali Du, Chang Xu, Dacheng Tao