Reinforcement Learning with Soft State Aggregation
Abstract
It is widely accepted that the use of more compact representations than lookup tables is crucial to scaling reinforcement learning (RL) algorithms to real-world problems. Unfortunately, almost all of the theory of reinforcement learning assumes lookup table representations. In this paper we address the pressing issue of combining function approximation and RL, and present 1) a function approximator based on a simple extension to state aggregation (a commonly used form of compact representation), namely soft state aggregation, 2) a theory of convergence for RL with arbitrary, but fixed, soft state aggregation, 3) a novel intuitive understanding of the effect of state aggregation on online RL, and 4) a new heuristic adaptive state aggregation algorithm that finds improved compact representations by exploiting the non-discrete nature of soft state aggregation. Preliminary empirical results are also presented.
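In soft state aggregation, each state s belongs to each cluster x with a probability P(x|s) rather than to exactly one cluster, and state-action values are composed from compact cluster values. As a rough illustration only (not material from the paper page), the sketch below shows one way Q-learning could be run on top of a fixed soft aggregation; the environment interface (reset/step), the membership matrix, and all parameter values are assumptions made for this example.

```python
import numpy as np

def soft_aggregation_q_learning(env, membership, n_clusters, n_actions,
                                episodes=500, alpha=0.1, gamma=0.95, epsilon=0.1):
    """Q-learning over cluster values q(x, a) with a fixed soft state aggregation.

    membership[s] is an assumed probability vector P(x | s) over clusters; the
    state-action value is reconstructed as Q(s, a) = sum_x P(x | s) * q(x, a).
    """
    q = np.zeros((n_clusters, n_actions))   # compact cluster-level values
    rng = np.random.default_rng(0)

    def Q(s):
        # Aggregate Q(s, .) as a membership-weighted sum of cluster values.
        return membership[s] @ q

    for _ in range(episodes):
        s, done = env.reset(), False         # assumed env interface
        while not done:
            # Epsilon-greedy action with respect to the aggregated Q-values.
            a = rng.integers(n_actions) if rng.random() < epsilon else int(np.argmax(Q(s)))
            s_next, r, done = env.step(a)    # assumed to return (state, reward, done)
            target = r + (0.0 if done else gamma * np.max(Q(s_next)))
            # Sample one cluster according to P(x | s) and update its value.
            x = rng.choice(n_clusters, p=membership[s])
            q[x, a] += alpha * (target - q[x, a])
            s = s_next
    return q
```

Sampling a single cluster per update keeps the rule close to tabular Q-learning; an alternative would be to spread each update across all clusters in proportion to P(x|s). Either way, the aggregation here is fixed, matching the setting of the convergence theory described in the abstract rather than the adaptive variant.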
Cite

Text
Singh et al. "Reinforcement Learning with Soft State Aggregation." Neural Information Processing Systems, 1994.

Markdown
[Singh et al. "Reinforcement Learning with Soft State Aggregation." Neural Information Processing Systems, 1994.](https://mlanthology.org/neurips/1994/singh1994neurips-reinforcement/)

BibTeX
@inproceedings{singh1994neurips-reinforcement,
  title     = {{Reinforcement Learning with Soft State Aggregation}},
  author    = {Singh, Satinder P. and Jaakkola, Tommi and Jordan, Michael I.},
  booktitle = {Neural Information Processing Systems},
  year      = {1994},
  pages     = {361--368},
  url       = {https://mlanthology.org/neurips/1994/singh1994neurips-reinforcement/}
}