Generalized Prioritized Sweeping

Abstract

Prioritized sweeping is a model-based reinforcement learning method that attempts to focus an agent's limited computational resources to achieve a good estimate of the value of environment states. To choose ef(cid:173) fectively where to spend a costly planning step, classic prioritized sweep(cid:173) ing uses a simple heuristic to focus computation on the states that are likely to have the largest errors. In this paper, we introduce generalized prioritized sweeping, a principled method for generating such estimates in a representation-specific manner. This allows us to extend prioritized sweeping beyond an explicit, state-based representation to deal with com(cid:173) pact representations that are necessary for dealing with large state spaces. We apply this method for generalized model approximators (such as Bayesian networks), and describe preliminary experiments that compare our approach with classical prioritized sweeping.

Cite

Text

Andre et al. "Generalized Prioritized Sweeping." Neural Information Processing Systems, 1997.

Markdown

[Andre et al. "Generalized Prioritized Sweeping." Neural Information Processing Systems, 1997.](https://mlanthology.org/neurips/1997/andre1997neurips-generalized/)

BibTeX

@inproceedings{andre1997neurips-generalized,
  title     = {{Generalized Prioritized Sweeping}},
  author    = {Andre, David and Friedman, Nir and Parr, Ronald},
  booktitle = {Neural Information Processing Systems},
  year      = {1997},
  pages     = {1001-1007},
  url       = {https://mlanthology.org/neurips/1997/andre1997neurips-generalized/}
}