Achieving Collective Welfare in Multi-Agent Reinforcement Learning via Suggestion Sharing

Abstract

In human society, the conflict between self-interest and collective well-being often obstructs efforts to achieve shared welfare. Related concepts like the Tragedy of the Commons and Social Dilemmas frequently manifest in our daily lives. As artificial agents increasingly serve as autonomous proxies for humans, we propose a novel multi-agent reinforcement learning (MARL) method to address this issue - learning policies to maximise collective returns even when individual agents’ interests conflict with the collective one. Unlike traditional cooperative MARL solutions that involve sharing rewards, values, and policies or designing intrinsic rewards to encourage agents to learn collectively optimal policies, we propose a novel MARL approach where agents exchange action suggestions. Our method reveals less private information compared to sharing rewards, values, or policies, while enabling effective cooperation without the need to design intrinsic rewards. Our algorithm is supported by our theoretical analysis that establishes a bound on the discrepancy between collective and individual objectives, demonstrating how sharing suggestions can align agents’ behaviours with the collective objective. Experimental results demonstrate that our algorithm performs competitively with baselines that rely on value or policy sharing or intrinsic rewards.

Cite

Text

Jin et al. "Achieving Collective Welfare in Multi-Agent Reinforcement Learning via Suggestion Sharing." Machine Learning, 2025. doi:10.1007/S10994-025-06823-Z

Markdown

[Jin et al. "Achieving Collective Welfare in Multi-Agent Reinforcement Learning via Suggestion Sharing." Machine Learning, 2025.](https://mlanthology.org/mlj/2025/jin2025mlj-achieving/) doi:10.1007/S10994-025-06823-Z

BibTeX

@article{jin2025mlj-achieving,
  title     = {{Achieving Collective Welfare in Multi-Agent Reinforcement Learning via Suggestion Sharing}},
  author    = {Jin, Yue and Wei, Shuangqing and Montana, Giovanni},
  journal   = {Machine Learning},
  year      = {2025},
  pages     = {190},
  doi       = {10.1007/S10994-025-06823-Z},
  volume    = {114},
  url       = {https://mlanthology.org/mlj/2025/jin2025mlj-achieving/}
}