Closed-Loop Learning of Visual Control Policies

Abstract

In this paper we present a general, flexible framework for learning mappings from images to actions by interacting with the environment. The basic idea is to introduce a feature-based image classifier in front of a reinforcement learning algorithm. The classifier partitions the visual space according to the presence or absence of few highly informative local descriptors that are incrementally selected in a sequence of attempts to remove perceptual aliasing. We also address the problem of fighting overfitting in such a greedy algorithm. Finally, we show how high-level visual features can be generated when the power of local descriptors is insufficient for completely disambiguating the aliased states. This is done by building a hierarchy of composite features that consist of recursive spatial combinations of visual features. We demonstrate the efficacy of our algorithms by solving three visual navigation tasks and a visual version of the classical "Car on the Hill" control problem.

Cite

Text

Jodogne and Piater. "Closed-Loop Learning of Visual Control Policies." Journal of Artificial Intelligence Research, 2007. doi:10.1613/JAIR.2110

Markdown

[Jodogne and Piater. "Closed-Loop Learning of Visual Control Policies." Journal of Artificial Intelligence Research, 2007.](https://mlanthology.org/jair/2007/jodogne2007jair-closedloop/) doi:10.1613/JAIR.2110

BibTeX

@article{jodogne2007jair-closedloop,
  title     = {{Closed-Loop Learning of Visual Control Policies}},
  author    = {Jodogne, Sébastien and Piater, Justus H.},
  journal   = {Journal of Artificial Intelligence Research},
  year      = {2007},
  pages     = {349-391},
  doi       = {10.1613/JAIR.2110},
  volume    = {28},
  url       = {https://mlanthology.org/jair/2007/jodogne2007jair-closedloop/}
}