An Analysis of Laplacian Methods for Value Function Approximation in MDPs

Petrik, Marek

An Analysis of Laplacian Methods for Value Function Approximation in MDPs

IJCAI 2007 pp. 2574-2579

/ijcai/2007/petrik2007ijcai-analysis/

Abstract

Recently, a method based on Laplacian eigenfunctions was proposed to automatically construct a basis for value function approximation in MDPs. We show that its success may be explained by drawing a connection between the spectrum of the Laplacian and the value function of the MDP. This explanation helps us to identify more precisely the conditions that this method requires to achieve good performance. Based on this, we propose a modification of the Laplacian method for which we derive an analytical bound on the approximation error. Further, we show that the method is related the augmented Krylov methods, commonly used to solve sparse linear systems. Finally, we empirically demonstrate that in basis construction the augmented Krylov methods may significantly outperform the Laplacian methods in terms of both speed and quality. URL: http://cs.umass.edu/~petrik/pub/Petrik2007b.pdf

PDF Semantic Scholar

Cite

Text

Petrik. "An Analysis of Laplacian Methods for Value Function Approximation in MDPs." International Joint Conference on Artificial Intelligence, 2007.

Markdown

[Petrik. "An Analysis of Laplacian Methods for Value Function Approximation in MDPs." International Joint Conference on Artificial Intelligence, 2007.](https://mlanthology.org/ijcai/2007/petrik2007ijcai-analysis/)

BibTeX

@inproceedings{petrik2007ijcai-analysis,
  title     = {{An Analysis of Laplacian Methods for Value Function Approximation in MDPs}},
  author    = {Petrik, Marek},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2007},
  pages     = {2574-2579},
  url       = {https://mlanthology.org/ijcai/2007/petrik2007ijcai-analysis/}
}