Reinforcement Algorithms Using Functional Approximation for Generalization and Their Application to Cart Centering and Fractal Compression

Claussen, Clifford; Gutta, Srinivas; Wechsler, Harry

Reinforcement Algorithms Using Functional Approximation for Generalization and Their Application to Cart Centering and Fractal Compression

Clifford Claussen, Srinivas Gutta, Harry Wechsler

IJCAI 1999 pp. 1362-1369

/ijcai/1999/claussen1999ijcai-reinforcement/

Abstract

We address the conflict between identification and control or alternatively, the conflict be-tween exploration and exploitation, within the framework of reinforcement learning. Q-learning has recently become a popular off-policy reinforcement learning method. The conflict between exploration and exploitation slows down Q-learning algorithms; their per-formance does not scale up and degrades rap-idly as the number of states and actions in-creases. One reason for this slowness is that exploration lacks the ability to extrapolate and interpolate from learning and to a large extent has to &quot;reinvent the wheel&quot;. Moreover, not all reinforcement problems one encounters are f i-nite state and action systems. Our approach to solving continuous state and action problems is to approximate the continuous state and ac-tion spaces with finite sets of states and ac-tions and then to apply a finite state and action learning method. This approach provides the means for solving continuous state and action problems but does not yet address the per-formance problem associated with scaling up states and actions. We address the scaling problem using functional approximation methods. Towards that end, this paper intro-duces two new reinforcement algorithms, QLVQ and Quad-Q-learning, respectively, and shows their successful application for cart centering and fractal compression. 1

PDF Semantic Scholar

Cite

Text

Claussen et al. "Reinforcement Algorithms Using Functional Approximation for Generalization and Their Application to Cart Centering and Fractal Compression." International Joint Conference on Artificial Intelligence, 1999.

Markdown

[Claussen et al. "Reinforcement Algorithms Using Functional Approximation for Generalization and Their Application to Cart Centering and Fractal Compression." International Joint Conference on Artificial Intelligence, 1999.](https://mlanthology.org/ijcai/1999/claussen1999ijcai-reinforcement/)

BibTeX

@inproceedings{claussen1999ijcai-reinforcement,
  title     = {{Reinforcement Algorithms Using Functional Approximation for Generalization and Their Application to Cart Centering and Fractal Compression}},
  author    = {Claussen, Clifford and Gutta, Srinivas and Wechsler, Harry},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {1999},
  pages     = {1362-1369},
  url       = {https://mlanthology.org/ijcai/1999/claussen1999ijcai-reinforcement/}
}