Variable Resolution Dynamic Programming
Abstract
An effective method to create an autonomous reactive controller is to learn a model of the environment and then use dynamic programming to derive a policy to maximize long term reward. Neither learning environmental models nor dynamic programming require parametric assumptions about the world, and so learning can proceed with no danger of becoming “stuck― by a mismatch between the parametric assumptions and reality. The paper discusses how such an approach can be realized in real valued multivariate state spaces in which straightforward discretization falls prey to the curse of dimensionality.
Cite
Text
Moore. "Variable Resolution Dynamic Programming." International Conference on Machine Learning, 1991. doi:10.1016/B978-1-55860-200-7.50069-6Markdown
[Moore. "Variable Resolution Dynamic Programming." International Conference on Machine Learning, 1991.](https://mlanthology.org/icml/1991/moore1991icml-variable/) doi:10.1016/B978-1-55860-200-7.50069-6BibTeX
@inproceedings{moore1991icml-variable,
title = {{Variable Resolution Dynamic Programming}},
author = {Moore, Andrew W.},
booktitle = {International Conference on Machine Learning},
year = {1991},
pages = {333-337},
doi = {10.1016/B978-1-55860-200-7.50069-6},
url = {https://mlanthology.org/icml/1991/moore1991icml-variable/}
}