SAEIR: Sequentially Accumulated Entropy Intrinsic Reward for Cooperative Multi-Agent Reinforcement Learning with Sparse Reward

He, Xin; Ge, Hongwei; Hou, Yaqing; Yu, Jincheng

doi:10.24963/ijcai.2024/454

SAEIR: Sequentially Accumulated Entropy Intrinsic Reward for Cooperative Multi-Agent Reinforcement Learning with Sparse Reward

Xin He, Hongwei Ge, Yaqing Hou, Jincheng Yu

IJCAI 2024 pp. 4107-4115

doi:10.24963/ijcai.2024/454 /ijcai/2024/he2024ijcai-saeir/

Abstract

We consider the problem of fair allocation of m indivisible items to a group of n agents with subsidies (money). We address scenarios where agents have general additive cost/utility functions. Our work primarily focuses on the special case of three agents. Assuming that the maximum cost/utility of an item to an agent can be compensated by one dollar, we demonstrate that a total subsidy of 1/6 dollars is sufficient to ensure the existence of Maximin Share (MMS) allocations for both goods and chores. Additionally, we provide examples to establish the lower bounds of the required subsidies.

PDF IJCAI Semantic Scholar

Cite

Text

He et al. "SAEIR: Sequentially Accumulated Entropy Intrinsic Reward for Cooperative Multi-Agent Reinforcement Learning with Sparse Reward." International Joint Conference on Artificial Intelligence, 2024. doi:10.24963/ijcai.2024/454

Markdown

[He et al. "SAEIR: Sequentially Accumulated Entropy Intrinsic Reward for Cooperative Multi-Agent Reinforcement Learning with Sparse Reward." International Joint Conference on Artificial Intelligence, 2024.](https://mlanthology.org/ijcai/2024/he2024ijcai-saeir/) doi:10.24963/ijcai.2024/454

BibTeX

@inproceedings{he2024ijcai-saeir,
  title     = {{SAEIR: Sequentially Accumulated Entropy Intrinsic Reward for Cooperative Multi-Agent Reinforcement Learning with Sparse Reward}},
  author    = {He, Xin and Ge, Hongwei and Hou, Yaqing and Yu, Jincheng},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {4107-4115},
  doi       = {10.24963/ijcai.2024/454},
  url       = {https://mlanthology.org/ijcai/2024/he2024ijcai-saeir/}
}