Personalized Federated X-Armed Bandit

Abstract

In this work, we study the personalized federated $\mathcal{X}$-armed bandit problem, where the heterogeneous local objectives of the clients are optimized simultaneously in the federated learning paradigm. We propose the \texttt{PF-PNE} algorithm with a unique double elimination strategy, which safely eliminates the non-optimal regions while encouraging federated collaboration through biased but effective evaluations of the local objectives. The proposed \texttt{PF-PNE} algorithm is able to optimize local objectives with arbitrary levels of heterogeneity, and its limited communications protects the confidentiality of the client-wise reward data. Our theoretical analysis shows the benefit of the proposed algorithm over single-client algorithms. Experimentally, \texttt{PF-PNE} outperforms multiple baselines on both synthetic and real life datasets.

Cite

Text

Li et al. "Personalized Federated X-Armed Bandit." Artificial Intelligence and Statistics, 2024.

Markdown

[Li et al. "Personalized Federated X-Armed Bandit." Artificial Intelligence and Statistics, 2024.](https://mlanthology.org/aistats/2024/li2024aistats-personalized/)

BibTeX

@inproceedings{li2024aistats-personalized,
  title     = {{Personalized Federated X-Armed Bandit}},
  author    = {Li, Wenjie and Song, Qifan and Honorio, Jean},
  booktitle = {Artificial Intelligence and Statistics},
  year      = {2024},
  pages     = {37-45},
  volume    = {238},
  url       = {https://mlanthology.org/aistats/2024/li2024aistats-personalized/}
}