State Space Construction for Behavior Acquisition in Multi Agent Environments with Vision and Action

Abstract

This paper proposes a method which estimates the relationships between learner's behaviors and other agents' ones in the environment through interactions (observation and action) using the method of system identification. In order to identify the model of each agent, Akaike's Information Criterion is applied to the results of Canonical Variate Analysis for the relationship between the observed data in terms of action and future observation. Next, reinforcement learning based on the estimated state vectors is performed to obtain the optimal behavior. The proposed method is applied to a soccer playing situation, where a rolling ball and other moving agents are well modeled and the learner's behaviors are successfully acquired by the method. Computer simulations and real experiments are shown and a discussion is given.

Cite

Text

Uchibe et al. "State Space Construction for Behavior Acquisition in Multi Agent Environments with Vision and Action." IEEE/CVF International Conference on Computer Vision, 1998. doi:10.1109/ICCV.1998.710819

Markdown

[Uchibe et al. "State Space Construction for Behavior Acquisition in Multi Agent Environments with Vision and Action." IEEE/CVF International Conference on Computer Vision, 1998.](https://mlanthology.org/iccv/1998/uchibe1998iccv-state/) doi:10.1109/ICCV.1998.710819

BibTeX

@inproceedings{uchibe1998iccv-state,
  title     = {{State Space Construction for Behavior Acquisition in Multi Agent Environments with Vision and Action}},
  author    = {Uchibe, Eiji and Asada, Minoru and Hosoda, Koh},
  booktitle = {IEEE/CVF International Conference on Computer Vision},
  year      = {1998},
  pages     = {870-875},
  doi       = {10.1109/ICCV.1998.710819},
  url       = {https://mlanthology.org/iccv/1998/uchibe1998iccv-state/}
}