Exploring Large Action Sets with Hyperspherical Embeddings Using Von Mises-Fisher Sampling

Abstract

This paper introduces von Mises-Fisher exploration (vMF-exp), a scalable method for exploring large action sets in reinforcement learning problems where hyperspherical embedding vectors represent these actions. vMF-exp involves initially sampling a state embedding representation using a von Mises-Fisher distribution, then exploring this representation’s nearest neighbors, which scales to virtually unlimited numbers of candidate actions. We show that, under theoretical assumptions, vMF-exp asymptotically maintains the same probability of exploring each action as Boltzmann Exploration (B-exp), a popular alternative that, nonetheless, suffers from scalability issues as it requires computing softmax values for each action. Consequently, vMF-exp serves as a scalable alternative to B-exp for exploring large action sets with hyperspherical embeddings. Experiments on simulated data, real-world public data, and the successful large-scale deployment of vMF-exp on the recommender system of a global music streaming service empirically validate the key properties of the proposed method.

Cite

Text

Bendada et al. "Exploring Large Action Sets with Hyperspherical Embeddings Using Von Mises-Fisher Sampling." Proceedings of the 42nd International Conference on Machine Learning, 2025.

Markdown

[Bendada et al. "Exploring Large Action Sets with Hyperspherical Embeddings Using Von Mises-Fisher Sampling." Proceedings of the 42nd International Conference on Machine Learning, 2025.](https://mlanthology.org/icml/2025/bendada2025icml-exploring/)

BibTeX

@inproceedings{bendada2025icml-exploring,
  title     = {{Exploring Large Action Sets with Hyperspherical Embeddings Using Von Mises-Fisher Sampling}},
  author    = {Bendada, Walid and Salha-Galvan, Guillaume and Hennequin, Romain and Bontempelli, Théo and Bouabça, Thomas and Cazenave, Tristan},
  booktitle = {Proceedings of the 42nd International Conference on Machine Learning},
  year      = {2025},
  pages     = {3677-3711},
  volume    = {267},
  url       = {https://mlanthology.org/icml/2025/bendada2025icml-exploring/}
}