Exponential Family Predictive Representations of State

Abstract

In order to represent state in controlled, partially observable, stochastic dynamical systems, some sort of sufficient statistic for history is necessary. Predictive representations of state (PSRs) capture state as statistics of the future. We introduce a new model of such systems called the “Exponential family PSR,” which defines as state the time-varying parameters of an exponential family distribution which models n sequential observations in the future. This choice of state representation explicitly connects PSRs to state-of-the-art probabilistic modeling, which allows us to take advantage of current efforts in high-dimensional density estimation, and in particular, graphical models and maximum entropy models. We present a parameter learning algorithm based on maximum likelihood, and we show how a variety of current approximate inference methods apply. We evaluate the quality of our model with reinforcement learning by directly evaluating the control performance of the model.
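
As a rough illustration of the state definition in the abstract, the generic exponential family form can be written over a window of n future observations. The notation here is an assumption made for this sketch and is not taken from the abstract: h_t is the history up to time t, F^n is the random vector of the next n observations, \phi is a feature function, s_t is the time-varying parameter vector that serves as state, and Z is the log-partition function (written as a sum, which assumes discrete observations).

```latex
% Sketch only: symbols h_t, F^n, \phi, s_t, Z are illustrative assumptions.
\[
  p\left(F^n = f^n \mid h_t\right)
    = \exp\left\{ s_t^{\top} \phi(f^n) - Z(s_t) \right\},
  \qquad
  Z(s_t) = \log \sum_{f^n} \exp\left\{ s_t^{\top} \phi(f^n) \right\}
\]
```

Under this reading, tracking state amounts to updating s_t as new actions and observations arrive, while \phi stays fixed.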

Cite

Text

Wingate and Baveja. "Exponential Family Predictive Representations of State." Neural Information Processing Systems, 2007.

Markdown

[Wingate and Baveja. "Exponential Family Predictive Representations of State." Neural Information Processing Systems, 2007.](https://mlanthology.org/neurips/2007/wingate2007neurips-exponential/)

BibTeX

@inproceedings{wingate2007neurips-exponential,
  title     = {{Exponential Family Predictive Representations of State}},
  author    = {Wingate, David and Baveja, Satinder S.},
  booktitle = {Neural Information Processing Systems},
  year      = {2007},
  pages     = {1617-1624},
  url       = {https://mlanthology.org/neurips/2007/wingate2007neurips-exponential/}
}