vMF-Exp: Von Mises-Fisher Exploration of Large Action Sets with Hyperspherical Embeddings

Abstract

This workshop paper is under review for presentation at an international conference. We introduce von Mises-Fisher exploration (vMF-exp), a scalable method for exploring large action sets in reinforcement learning problems where hyperspherical embedding vectors represent actions. vMF-exp involves initially sampling a state embedding representation using a von Mises-Fisher hyperspherical distribution, then exploring this representation's nearest neighbors, which scales to unlimited numbers of candidate actions. We show that, under theoretical assumptions, vMF-exp asymptotically maintains the same probability of exploring each action as Boltzmann Exploration (B-exp), a popular alternative that, nonetheless, suffers from scalability issues as it requires computing softmax values for each action. Consequently, vMF-exp serves as a scalable alternative to B-exp for exploring large action sets with hyperspherical embeddings. We further validate the empirical relevance of vMF-exp by discussing its successful deployment at scale on a music streaming service to recommend playlists to millions of users.

Cite

Text

Bendada et al. "vMF-Exp: Von Mises-Fisher Exploration of Large Action Sets with Hyperspherical Embeddings." ICML 2024 Workshops: ARLET, 2024.

Markdown

[Bendada et al. "vMF-Exp: Von Mises-Fisher Exploration of Large Action Sets with Hyperspherical Embeddings." ICML 2024 Workshops: ARLET, 2024.](https://mlanthology.org/icmlw/2024/bendada2024icmlw-vmfexp/)

BibTeX

@inproceedings{bendada2024icmlw-vmfexp,
  title     = {{vMF-Exp: Von Mises-Fisher Exploration of Large Action Sets with Hyperspherical Embeddings}},
  author    = {Bendada, Walid and Salha-Galvan, Guillaume and Hennequin, Romain and Bontempelli, Théo and Bouabça, Thomas and Cazenave, Tristan},
  booktitle = {ICML 2024 Workshops: ARLET},
  year      = {2024},
  url       = {https://mlanthology.org/icmlw/2024/bendada2024icmlw-vmfexp/}
}