Gaillard, Pierre

29 publications

TMLR 2025 Counterfactual Learning of Stochastic Policies with Continuous Actions Houssam Zenati, Alberto Bietti, Matthieu Martin, Eustache Diemert, Pierre Gaillard, Julien Mairal

ICLR 2025 Finally Rank-Breaking Conquers MNL Bandits: Optimal and Efficient Algorithms for MNL Assortment Aadirupa Saha, Pierre Gaillard

ALT 2025 Logarithmic Regret for Unconstrained Submodular Maximization Stochastic Bandit Julien Zhou, Pierre Gaillard, Thibaud Rahier, Julyan Arbel

NeurIPS 2025 Minimax Adaptive Online Nonparametric Regression over Besov Spaces Paul Liautaud, Pierre Gaillard, Olivier Wintenberger

ALT 2025 Minimax-Optimal and Locally-Adaptive Online Nonparametric Regression Paul Liautaud, Pierre Gaillard, Olivier Wintenberger

ICML 2025 Online Episodic Convex Reinforcement Learning Bianca Marin Moreno, Khaled Eldowa, Pierre Gaillard, Margaux Brégère, Nadia Oudjane

AISTATS 2024 Efficient Model-Based Concave Utility Reinforcement Learning Through Greedy Mirror Descent Bianca M. Moreno, Margaux Bregere, Pierre Gaillard, Nadia Oudjane

NeurIPS 2024 MetaCURL: Non-Stationary Concave Utility Reinforcement Learning Bianca Marin Moreno, Margaux Brégère, Pierre Gaillard, Nadia Oudjane

NeurIPS 2024 Towards Efficient and Optimal Covariance-Adaptive Algorithms for Combinatorial Semi-Bandits Julien Zhou, Pierre Gaillard, Thibaud Rahier, Houssam Zenati, Julyan Arbel

AISTATS 2023 One Arrow, Two Kills: A Unified Framework for Achieving Optimal Regret Guarantees in Sleeping Bandits Pierre Gaillard, Aadirupa Saha, Soham Dan

ICML 2023 Sequential Counterfactual Risk Minimization Houssam Zenati, Eustache Diemert, Matthieu Martin, Julien Mairal, Pierre Gaillard

AISTATS 2022 Efficient Kernelized UCB for Contextual Bandits Houssam Zenati, Alberto Bietti, Eustache Diemert, Julien Mairal, Matthieu Martin, Pierre Gaillard

ICML 2022 Versatile Dueling Bandits: Best-of-Both World Analyses for Learning from Relative Preferences Aadirupa Saha, Pierre Gaillard

NeurIPS 2021 Continuized Accelerations of Deterministic and Stochastic Gradient Descents, and of Gossip Algorithms Mathieu Even, Raphaël Berthier, Francis R. Bach, Nicolas Flammarion, Hadrien Hendrikx, Pierre Gaillard, Laurent Massoulié, Adrien Taylor

NeurIPS 2021 Dueling Bandits with Adversarial Sleeping Aadirupa Saha, Pierre Gaillard

NeurIPS 2021 Mixability Made Efficient: Fast Online Multiclass Logistic Regression Rémi Jézéquel, Pierre Gaillard, Alessandro Rudi

NeurIPS 2021 Online Sign Identification: Minimization of the Number of Errors in Thresholding Bandits Reda Ouhamma, Rémy Degenne, Pierre Gaillard, Vianney Perchet

COLT 2020 Efficient Improper Learning for Online Logistic Regression Rémi Jézéquel, Pierre Gaillard, Alessandro Rudi

ICML 2020 Improved Sleeping Bandits with Stochastic Action Sets and Adversarial Rewards Aadirupa Saha, Pierre Gaillard, Michal Valko

NeurIPS 2020 Tight Nonparametric Convergence Rates for Stochastic Gradient Descent Under the Noiseless Linear Model Raphaël Berthier, Francis R. Bach, Pierre Gaillard

NeurIPS 2019 Efficient Online Learning with Kernels for Adversarial Large Scale Problems Rémi Jézéquel, Pierre Gaillard, Alessandro Rudi

ALT 2019 Uniform Regret Bounds over $\mathbb{R}^d$ for the Sequential Linear Regression Problem with the Square Loss Pierre Gaillard, Sébastien Gerchinovitz, Malo Huard, Gilles Stoltz

NeurIPS 2018 Efficient Online Algorithms for Fast-Rate Regret Bounds Under Sparsity Pierre Gaillard, Olivier Wintenberger

COLT 2017 Algorithmic Chaining and the Role of Partial Feedback in Online Nonparametric Learning Nicolò Cesa-Bianchi, Pierre Gaillard, Claudio Gentile, Sébastien Gerchinovitz

AISTATS 2017 Sparse Accelerated Exponential Weights Pierre Gaillard, Olivier Wintenberger

COLT 2015 A Chaining Algorithm for Online Nonparametric Regression Pierre Gaillard, Sébastien Gerchinovitz

COLT 2014 A Second-Order Bound with Excess Losses Pierre Gaillard, Gilles Stoltz, Tim van Erven

MLJ 2013 Forecasting Electricity Consumption by Aggregating Specialized Experts - A Review of the Sequential Aggregation of Specialized Experts, with an Application to Slovakian and French Country-Wide One-Day-Ahead (half-)hourly Predictions Marie Devaine, Pierre Gaillard, Yannig Goude, Gilles Stoltz

NeurIPS 2012 Mirror Descent Meets Fixed Share (and Feels No Regret) Nicolò Cesa-bianchi, Pierre Gaillard, Gabor Lugosi, Gilles Stoltz