Predictive Representations of State

Abstract

We show that states of a dynamical system can be usefully repre(cid:173) sented by multi-step, action-conditional predictions of future ob(cid:173) servations. State representations that are grounded in data in this way may be easier to learn, generalize better, and be less depen(cid:173) dent on accurate prior models than, for example, POMDP state representations. Building on prior work by Jaeger and by Rivest and Schapire, in this paper we compare and contrast a linear spe(cid:173) cialization of the predictive approach with the state representa(cid:173) tions used in POMDPs and in k-order Markov models. Ours is the first specific formulation of the predictive idea that includes both stochasticity and actions (controls). We show that any system has a linear predictive state representation with number of predictions no greater than the number of states in its minimal POMDP model.

Cite

Text

Littman and Sutton. "Predictive Representations of State." Neural Information Processing Systems, 2001.

Markdown

[Littman and Sutton. "Predictive Representations of State." Neural Information Processing Systems, 2001.](https://mlanthology.org/neurips/2001/littman2001neurips-predictive/)

BibTeX

@inproceedings{littman2001neurips-predictive,
  title     = {{Predictive Representations of State}},
  author    = {Littman, Michael L. and Sutton, Richard S.},
  booktitle = {Neural Information Processing Systems},
  year      = {2001},
  pages     = {1555-1561},
  url       = {https://mlanthology.org/neurips/2001/littman2001neurips-predictive/}
}