Labbi, Safwan

3 publications

ICLR 2026 Beyond SoftMax and Entropy: Convergence Rates of Policy Gradients with $\boldsymbol{f}$-SoftArgmax Parameterization $\&$ Coupled Regularization Safwan Labbi, Daniil Tiapkin, Paul Mangold, Eric Moulines
AISTATS 2025 Federated UCBVI: Communication-Efficient Federated Regret Minimization with Heterogeneous Agents Safwan Labbi, Daniil Tiapkin, Lorenzo Mancini, Paul Mangold, Eric Moulines
NeurIPS 2024 SCAFFLSA: Taming Heterogeneity in Federated Linear Stochastic Approximation and TD Learning Paul Mangold, Sergey Samsonov, Safwan Labbi, Ilya Levin, Reda Alami, Alexey Naumov, Eric Moulines