Simultaneous Adversarial Multi-Robot Learning

Abstract

Multi-robot learning combines all of the challenges of robot learning with all of the challenges of multiagent learning. There has been a great deal of recent research on multiagent reinforcement learning in stochastic games, which are the intuitive extension of MDPs to multiple agents. This recent work, although general, has only been applied to small games with at most hundreds of states. Robot tasks, on the other hand, have continuous, and often complex, state and action spaces. Robot learning tasks demand approximation and generalization techniques, which have received extensive attention only in single-agent learning. In this paper we introduce GraWoLF, a general-purpose, scalable, multiagent learning algorithm. It combines gradient-based policy learning techniques with the WoLF ("Win or Learn Fast") variable learning rate.
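As a rough illustration of the WoLF ("Win or Learn Fast") idea named in the abstract, the sketch below picks a small step size when the learner appears to be winning and a larger one otherwise. The win test (comparing the current policy's expected value with that of a running average policy), the step-size values, and all function names are assumptions made for illustration. The abstract does not specify them, and this is not the GraWoLF algorithm itself.

```python
def expected_value(policy, action_values):
    """Expected payoff of a mixed policy given per-action value estimates."""
    return sum(p * v for p, v in zip(policy, action_values))


def wolf_step_size(policy, avg_policy, action_values,
                   delta_win=0.01, delta_lose=0.04):
    """Return a small step when 'winning' and a larger one when 'losing'.

    Assumption: 'winning' means the current policy scores higher than the
    running average policy under the current value estimates. The default
    step sizes are placeholders, not values taken from the paper.
    """
    winning = (expected_value(policy, action_values)
               > expected_value(avg_policy, action_values))
    return delta_win if winning else delta_lose


# Hypothetical two-action example: action 0 is currently estimated as better.
values = [1.0, 0.0]
step_when_winning = wolf_step_size([0.8, 0.2], [0.5, 0.5], values)
step_when_losing = wolf_step_size([0.2, 0.8], [0.5, 0.5], values)
```

In this sketch the learner moves cautiously while it is ahead and adapts quickly while it is behind, which is the intuition behind the "Win or Learn Fast" name. A gradient-based policy learner would multiply its update by the returned step size.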

Cite

Text

Bowling and Veloso. "Simultaneous Adversarial Multi-Robot Learning." International Joint Conference on Artificial Intelligence, 2003.

Markdown

[Bowling and Veloso. "Simultaneous Adversarial Multi-Robot Learning." International Joint Conference on Artificial Intelligence, 2003.](https://mlanthology.org/ijcai/2003/bowling2003ijcai-simultaneous/)

BibTeX

@inproceedings{bowling2003ijcai-simultaneous,
  title     = {{Simultaneous Adversarial Multi-Robot Learning}},
  author    = {Bowling, Michael H. and Veloso, Manuela M.},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2003},
  pages     = {699--704},
  url       = {https://mlanthology.org/ijcai/2003/bowling2003ijcai-simultaneous/}
}