Gaillard, Pierre

29 publications

TMLR 2025 Counterfactual Learning of Stochastic Policies with Continuous Actions Houssam Zenati, Alberto Bietti, Matthieu Martin, Eustache Diemert, Pierre Gaillard, Julien Mairal
ICLR 2025 Finally Rank-Breaking Conquers MNL Bandits: Optimal and Efficient Algorithms for MNL Assortment Aadirupa Saha, Pierre Gaillard
ALT 2025 Logarithmic Regret for Unconstrained Submodular Maximization Stochastic Bandit Julien Zhou, Pierre Gaillard, Thibaud Rahier, Julyan Arbel
NeurIPS 2025 Minimax Adaptive Online Nonparametric Regression over Besov Spaces Paul Liautaud, Pierre Gaillard, Olivier Wintenberger
ALT 2025 Minimax-Optimal and Locally-Adaptive Online Nonparametric Regression Paul Liautaud, Pierre Gaillard, Olivier Wintenberger
ICML 2025 Online Episodic Convex Reinforcement Learning Bianca Marin Moreno, Khaled Eldowa, Pierre Gaillard, Margaux Brégère, Nadia Oudjane
AISTATS 2024 Efficient Model-Based Concave Utility Reinforcement Learning Through Greedy Mirror Descent Bianca M. Moreno, Margaux Bregere, Pierre Gaillard, Nadia Oudjane
NeurIPS 2024 MetaCURL: Non-Stationary Concave Utility Reinforcement Learning Bianca Marin Moreno, Margaux Brégère, Pierre Gaillard, Nadia Oudjane
NeurIPS 2024 Towards Efficient and Optimal Covariance-Adaptive Algorithms for Combinatorial Semi-Bandits Julien Zhou, Pierre Gaillard, Thibaud Rahier, Houssam Zenati, Julyan Arbel
AISTATS 2023 One Arrow, Two Kills: A Unified Framework for Achieving Optimal Regret Guarantees in Sleeping Bandits Pierre Gaillard, Aadirupa Saha, Soham Dan
ICML 2023 Sequential Counterfactual Risk Minimization Houssam Zenati, Eustache Diemert, Matthieu Martin, Julien Mairal, Pierre Gaillard
AISTATS 2022 Efficient Kernelized UCB for Contextual Bandits Houssam Zenati, Alberto Bietti, Eustache Diemert, Julien Mairal, Matthieu Martin, Pierre Gaillard
ICML 2022 Versatile Dueling Bandits: Best-of-Both World Analyses for Learning from Relative Preferences Aadirupa Saha, Pierre Gaillard
NeurIPS 2021 Continuized Accelerations of Deterministic and Stochastic Gradient Descents, and of Gossip Algorithms Mathieu Even, Raphaël Berthier, Francis R. Bach, Nicolas Flammarion, Hadrien Hendrikx, Pierre Gaillard, Laurent Massoulié, Adrien Taylor
NeurIPS 2021 Dueling Bandits with Adversarial Sleeping Aadirupa Saha, Pierre Gaillard
NeurIPS 2021 Mixability Made Efficient: Fast Online Multiclass Logistic Regression Rémi Jézéquel, Pierre Gaillard, Alessandro Rudi
NeurIPS 2021 Online Sign Identification: Minimization of the Number of Errors in Thresholding Bandits Reda Ouhamma, Rémy Degenne, Pierre Gaillard, Vianney Perchet
COLT 2020 Efficient Improper Learning for Online Logistic Regression Rémi Jézéquel, Pierre Gaillard, Alessandro Rudi
ICML 2020 Improved Sleeping Bandits with Stochastic Action Sets and Adversarial Rewards Aadirupa Saha, Pierre Gaillard, Michal Valko
NeurIPS 2020 Tight Nonparametric Convergence Rates for Stochastic Gradient Descent Under the Noiseless Linear Model Raphaël Berthier, Francis R. Bach, Pierre Gaillard
NeurIPS 2019 Efficient Online Learning with Kernels for Adversarial Large Scale Problems Rémi Jézéquel, Pierre Gaillard, Alessandro Rudi
ALT 2019 Uniform Regret Bounds over $\mathbb{R}^d$ for the Sequential Linear Regression Problem with the Square Loss Pierre Gaillard, Sébastien Gerchinovitz, Malo Huard, Gilles Stoltz
NeurIPS 2018 Efficient Online Algorithms for Fast-Rate Regret Bounds Under Sparsity Pierre Gaillard, Olivier Wintenberger
COLT 2017 Algorithmic Chaining and the Role of Partial Feedback in Online Nonparametric Learning Nicolò Cesa-Bianchi, Pierre Gaillard, Claudio Gentile, Sébastien Gerchinovitz
AISTATS 2017 Sparse Accelerated Exponential Weights Pierre Gaillard, Olivier Wintenberger
COLT 2015 A Chaining Algorithm for Online Nonparametric Regression Pierre Gaillard, Sébastien Gerchinovitz
COLT 2014 A Second-Order Bound with Excess Losses Pierre Gaillard, Gilles Stoltz, Tim van Erven
MLJ 2013 Forecasting Electricity Consumption by Aggregating Specialized Experts - A Review of the Sequential Aggregation of Specialized Experts, with an Application to Slovakian and French Country-Wide One-Day-Ahead (half-)hourly Predictions Marie Devaine, Pierre Gaillard, Yannig Goude, Gilles Stoltz
NeurIPS 2012 Mirror Descent Meets Fixed Share (and Feels No Regret) Nicolò Cesa-bianchi, Pierre Gaillard, Gabor Lugosi, Gilles Stoltz