Learning with Generated Teammates to Achieve Type-Free Ad-Hoc Teamwork

Abstract

In ad-hoc teamwork, an agent is required to cooperate with unknown teammates without prior coordination. To swiftly adapt to an unknown teammate, most works adopt a type-based approach, which pre-trains the agent with a set of pre-prepared teammate types, then associates the unknown teammate with a particular type. Typically, these types are collected manually. This hampers previous works by both the availability and diversity of types they manage to obtain. To eliminate these limitations, this work addresses to achieve ad-hoc teamwork in a type-free approach. Specifically, we propose the model of Entropy-regularized Deep Recurrent Q-Network (EDRQN) to generate teammates automatically, meanwhile utilize them to pre-train our agent. These teammates are obtained from scratch and are designed to perform the task with various behaviors, therefore their availability and diversity are both ensured. We evaluate our model on several benchmark domains of ad-hoc teamwork. The result shows that even if our model has no access to any pre-prepared teammate types, it still achieves significant performance.

Cite

Text

Xing et al. "Learning with Generated Teammates to Achieve Type-Free Ad-Hoc Teamwork." International Joint Conference on Artificial Intelligence, 2021. doi:10.24963/IJCAI.2021/66

Markdown

[Xing et al. "Learning with Generated Teammates to Achieve Type-Free Ad-Hoc Teamwork." International Joint Conference on Artificial Intelligence, 2021.](https://mlanthology.org/ijcai/2021/xing2021ijcai-learning/) doi:10.24963/IJCAI.2021/66

BibTeX

@inproceedings{xing2021ijcai-learning,
  title     = {{Learning with Generated Teammates to Achieve Type-Free Ad-Hoc Teamwork}},
  author    = {Xing, Dong and Liu, Qianhui and Zheng, Qian and Pan, Gang},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2021},
  pages     = {472-478},
  doi       = {10.24963/IJCAI.2021/66},
  url       = {https://mlanthology.org/ijcai/2021/xing2021ijcai-learning/}
}