Personalized Federated X-Armed Bandit
Abstract
In this work, we study the personalized federated $\mathcal{X}$-armed bandit problem, where the heterogeneous local objectives of the clients are optimized simultaneously in the federated learning paradigm. We propose the \texttt{PF-PNE} algorithm with a unique double elimination strategy, which safely eliminates the non-optimal regions while encouraging federated collaboration through biased but effective evaluations of the local objectives. The proposed \texttt{PF-PNE} algorithm is able to optimize local objectives with arbitrary levels of heterogeneity, and its limited communications protects the confidentiality of the client-wise reward data. Our theoretical analysis shows the benefit of the proposed algorithm over single-client algorithms. Experimentally, \texttt{PF-PNE} outperforms multiple baselines on both synthetic and real life datasets.
Cite
Text
Li et al. "Personalized Federated X-Armed Bandit." Artificial Intelligence and Statistics, 2024.Markdown
[Li et al. "Personalized Federated X-Armed Bandit." Artificial Intelligence and Statistics, 2024.](https://mlanthology.org/aistats/2024/li2024aistats-personalized/)BibTeX
@inproceedings{li2024aistats-personalized,
title = {{Personalized Federated X-Armed Bandit}},
author = {Li, Wenjie and Song, Qifan and Honorio, Jean},
booktitle = {Artificial Intelligence and Statistics},
year = {2024},
pages = {37-45},
volume = {238},
url = {https://mlanthology.org/aistats/2024/li2024aistats-personalized/}
}