Considering Unseen States as Impossible in Factored Reinforcement Learning

Kozlova, Olga; Sigaud, Olivier; Wuillemin, Pierre-Henri; Meyer, Christophe

doi:10.1007/978-3-642-04180-8_64

Considering Unseen States as Impossible in Factored Reinforcement Learning

Olga Kozlova, Olivier Sigaud, Pierre-Henri Wuillemin, Christophe Meyer

ECML-PKDD 2009 pp. 721-735

doi:10.1007/978-3-642-04180-8_64 /ecmlpkdd/2009/kozlova2009ecmlpkdd-considering/

Abstract

The Factored Markov Decision Process ( fmdp ) framework is a standard representation for sequential decision problems under uncertainty where the state is represented as a collection of random variables. Factored Reinforcement Learning ( frl ) is an Model-based Reinforcement Learning approach to fmdps where the transition and reward functions of the problem are learned. In this paper, we show how to model in a theoretically well-founded way the problems where some combinations of state variable values may not occur, giving rise to impossible states. Furthermore, we propose a new heuristics that considers as impossible the states that have not been seen so far. We derive an algorithm whose improvement in performance with respect to the standard approach is illustrated through benchmark experiments.

PDF ECML-PKDD Semantic Scholar

Cite

Text

Kozlova et al. "Considering Unseen States as Impossible in Factored Reinforcement Learning." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2009. doi:10.1007/978-3-642-04180-8_64

Markdown

[Kozlova et al. "Considering Unseen States as Impossible in Factored Reinforcement Learning." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2009.](https://mlanthology.org/ecmlpkdd/2009/kozlova2009ecmlpkdd-considering/) doi:10.1007/978-3-642-04180-8_64

BibTeX

@inproceedings{kozlova2009ecmlpkdd-considering,
  title     = {{Considering Unseen States as Impossible in Factored Reinforcement Learning}},
  author    = {Kozlova, Olga and Sigaud, Olivier and Wuillemin, Pierre-Henri and Meyer, Christophe},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2009},
  pages     = {721-735},
  doi       = {10.1007/978-3-642-04180-8_64},
  url       = {https://mlanthology.org/ecmlpkdd/2009/kozlova2009ecmlpkdd-considering/}
}