The Multiquadric Kernel for Moment-Matching Distributional Reinforcement Learning

Abstract

Distributional reinforcement learning has gained significant attention in recent years due to its ability to handle uncertainty and variability in the returns an agent will receive for each action it takes. A key challenge in distributional reinforcement learning is finding a measure of the difference between two distributions that is well-suited for use with the distributional Bellman operator, a function that takes in a value distribution and produces a modified distribution based on the agent's current state and action. In this paper, we address this challenge by introducing the multiquadric kernel to moment-matching distributional reinforcement learning. We show that this kernel is both theoretically sound and empirically effective. Our contribution is mainly of a theoretical nature, presenting the first formally sound kernel for moment-matching distributional reinforcement learning with good practical performance. We also provide insights into why the RBF kernel has been shown to provide good practical results despite its theoretical problems. Finally, we evaluate the performance of our kernel on a number of standard benchmarks, obtaining results comparable to the state-of-the-art.

Cite

Text

Killingberg and Langseth. "The Multiquadric Kernel for Moment-Matching Distributional Reinforcement Learning." Transactions on Machine Learning Research, 2023.

Markdown

[Killingberg and Langseth. "The Multiquadric Kernel for Moment-Matching Distributional Reinforcement Learning." Transactions on Machine Learning Research, 2023.](https://mlanthology.org/tmlr/2023/killingberg2023tmlr-multiquadric/)

BibTeX

@article{killingberg2023tmlr-multiquadric,
  title     = {{The Multiquadric Kernel for Moment-Matching Distributional Reinforcement Learning}},
  author    = {Killingberg, Ludvig and Langseth, Helge},
  journal   = {Transactions on Machine Learning Research},
  year      = {2023},
  url       = {https://mlanthology.org/tmlr/2023/killingberg2023tmlr-multiquadric/}
}