Advancing DRL Agents in Commercial Fighting Games: Training, Integration, and Agent-Human Alignment

Abstract

Deep Reinforcement Learning (DRL) agents have demonstrated impressive success in a wide range of game genres. However, existing research primarily focuses on optimizing DRL competence rather than addressing the challenge of prolonged player interaction. In this paper, we propose a practical DRL agent system for fighting games named Shūkai, which has been successfully deployed to Naruto Mobile, a popular fighting game with over 100 million registered users. Shūkai quantifies the state to enhance generalizability and introduces Heterogeneous League Training (HELT) to balance competence, generalizability, and training efficiency. Furthermore, Shūkai implements specific rewards to align the agent’s behavior with human expectations. Shūkai’s ability to generalize is demonstrated by its consistent competence across all characters, even though it was trained on only 13% of them. Additionally, HELT exhibits a remarkable 22% improvement in sample efficiency. Shūkai serves as a valuable training partner for players in Naruto Mobile, enabling them to improve their skills.

Cite

Text

Zhang et al. "Advancing DRL Agents in Commercial Fighting Games: Training, Integration, and Agent-Human Alignment." International Conference on Machine Learning, 2024.

Markdown

[Zhang et al. "Advancing DRL Agents in Commercial Fighting Games: Training, Integration, and Agent-Human Alignment." International Conference on Machine Learning, 2024.](https://mlanthology.org/icml/2024/zhang2024icml-advancing/)

BibTeX

@inproceedings{zhang2024icml-advancing,
  title     = {{Advancing DRL Agents in Commercial Fighting Games: Training, Integration, and Agent-Human Alignment}},
  author    = {Zhang, Chen and He, Qiang and Zhou, Yuan and Liu, Elvis S. and Wang, Hong and Zhao, Jian and Wang, Yang},
  booktitle = {International Conference on Machine Learning},
  year      = {2024},
  pages     = {59003--59023},
  volume    = {235},
  url       = {https://mlanthology.org/icml/2024/zhang2024icml-advancing/}
}