Closed-Loop Learning of Visual Control Policies
Abstract
In this paper we present a general, flexible framework for learning mappings from images to actions by interacting with the environment. The basic idea is to introduce a feature-based image classifier in front of a reinforcement learning algorithm. The classifier partitions the visual space according to the presence or absence of few highly informative local descriptors that are incrementally selected in a sequence of attempts to remove perceptual aliasing. We also address the problem of fighting overfitting in such a greedy algorithm. Finally, we show how high-level visual features can be generated when the power of local descriptors is insufficient for completely disambiguating the aliased states. This is done by building a hierarchy of composite features that consist of recursive spatial combinations of visual features. We demonstrate the efficacy of our algorithms by solving three visual navigation tasks and a visual version of the classical "Car on the Hill" control problem.
Cite
Text
Jodogne and Piater. "Closed-Loop Learning of Visual Control Policies." Journal of Artificial Intelligence Research, 2007. doi:10.1613/JAIR.2110Markdown
[Jodogne and Piater. "Closed-Loop Learning of Visual Control Policies." Journal of Artificial Intelligence Research, 2007.](https://mlanthology.org/jair/2007/jodogne2007jair-closedloop/) doi:10.1613/JAIR.2110BibTeX
@article{jodogne2007jair-closedloop,
title = {{Closed-Loop Learning of Visual Control Policies}},
author = {Jodogne, Sébastien and Piater, Justus H.},
journal = {Journal of Artificial Intelligence Research},
year = {2007},
pages = {349-391},
doi = {10.1613/JAIR.2110},
volume = {28},
url = {https://mlanthology.org/jair/2007/jodogne2007jair-closedloop/}
}