Gaillard, Pierre
29 publications
ICLR
2025
Finally Rank-Breaking Conquers MNL Bandits: Optimal and Efficient Algorithms for MNL Assortment
AISTATS
2024
Efficient Model-Based Concave Utility Reinforcement Learning Through Greedy Mirror Descent
NeurIPS
2024
Towards Efficient and Optimal Covariance-Adaptive Algorithms for Combinatorial Semi-Bandits
AISTATS
2023
One Arrow, Two Kills: A Unified Framework for Achieving Optimal Regret Guarantees in Sleeping Bandits
ICML
2022
Versatile Dueling Bandits: Best-of-Both World Analyses for Learning from Relative Preferences
NeurIPS
2021
Continuized Accelerations of Deterministic and Stochastic Gradient Descents, and of Gossip Algorithms
NeurIPS
2021
Online Sign Identification: Minimization of the Number of Errors in Thresholding Bandits