SAEIR: Sequentially Accumulated Entropy Intrinsic Reward for Cooperative Multi-Agent Reinforcement Learning with Sparse Reward

Abstract

We consider the problem of fair allocation of m indivisible items to a group of n agents with subsidies (money). We address scenarios where agents have general additive cost/utility functions. Our work primarily focuses on the special case of three agents. Assuming that the maximum cost/utility of an item to an agent can be compensated by one dollar, we demonstrate that a total subsidy of 1/6 dollars is sufficient to ensure the existence of Maximin Share (MMS) allocations for both goods and chores. Additionally, we provide examples to establish the lower bounds of the required subsidies.

Cite

Text

He et al. "SAEIR: Sequentially Accumulated Entropy Intrinsic Reward for Cooperative Multi-Agent Reinforcement Learning with Sparse Reward." International Joint Conference on Artificial Intelligence, 2024. doi:10.24963/ijcai.2024/454

Markdown

[He et al. "SAEIR: Sequentially Accumulated Entropy Intrinsic Reward for Cooperative Multi-Agent Reinforcement Learning with Sparse Reward." International Joint Conference on Artificial Intelligence, 2024.](https://mlanthology.org/ijcai/2024/he2024ijcai-saeir/) doi:10.24963/ijcai.2024/454

BibTeX

@inproceedings{he2024ijcai-saeir,
  title     = {{SAEIR: Sequentially Accumulated Entropy Intrinsic Reward for Cooperative Multi-Agent Reinforcement Learning with Sparse Reward}},
  author    = {He, Xin and Ge, Hongwei and Hou, Yaqing and Yu, Jincheng},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {4107-4115},
  doi       = {10.24963/ijcai.2024/454},
  url       = {https://mlanthology.org/ijcai/2024/he2024ijcai-saeir/}
}