ML Anthology
Authors
Search
About
Stoltz, Gilles
30 publications
AISTATS
2025
Narrowing the Gap Between Adversarial and Stochastic MDPs via Policy Optimization
Daniil Tiapkin
,
Evgenii Chzhen
,
Gilles Stoltz
TMLR
2025
Policy Optimization via Adv2: Adversarial Learning on Advantage Functions
Matthieu Jonckheere
,
Chiara Mignacco
,
Gilles Stoltz
TMLR
2024
Diversity-Preserving $k$--Armed Bandits, Revisited
Hedi Hadiji
,
Sébastien Gerchinovitz
,
Jean-Michel Loubes
,
Gilles Stoltz
JMLR
2023
Adaptation to the Range in K-Armed Bandits
Hédi Hadiji
,
Gilles Stoltz
ALT
2023
On Best-Arm Identification with a Fixed Budget in Non-Parametric Multi-Armed Bandits
Antoine Barrier
,
Aurélien Garivier
,
Gilles Stoltz
NeurIPS
2023
Small Total-Cost Constraints in Contextual Bandits with Knapsacks, with Application to Fairness
Evgenii Chzhen
,
Christophe Giraud
,
Zhen Li
,
Gilles Stoltz
NeurIPS
2022
Contextual Bandits with Knapsacks for a Conversion Model
Zhen Li
,
Gilles Stoltz
JMLR
2022
KL-UCB-Switch: Optimal Regret Bounds for Stochastic Bandits from Both a Distribution-Dependent and a Distribution-Free Viewpoints
Aurélien Garivier
,
Hédi Hadiji
,
Pierre Ménard
,
Gilles Stoltz
NeurIPS
2021
A Unified Approach to Fair Online Learning via Blackwell Approachability
Evgenii Chzhen
,
Christophe Giraud
,
Gilles Stoltz
ALT
2019
Uniform Regret Bounds over $\mathbb{R}^d$ for the Sequential Linear Regression Problem with the Square Loss
Pierre Gaillard
,
Sébastien Gerchinovitz
,
Malo Huard
,
Gilles Stoltz
COLT
2014
A Second-Order Bound with Excess Losses
Pierre Gaillard
,
Gilles Stoltz
,
Tim van Erven
COLT
2014
Approachability in Unknown Games: Online Learning Meets Multi-Objective Optimization
Shie Mannor
,
Vianney Perchet
,
Gilles Stoltz
JMLR
2014
Set-Valued Approachability and Online Learning with Partial Monitoring
Shie Mannor
,
Vianney Perchet
,
Gilles Stoltz
MLJ
2013
Forecasting Electricity Consumption by Aggregating Specialized Experts - A Review of the Sequential Aggregation of Specialized Experts, with an Application to Slovakian and French Country-Wide One-Day-Ahead (half-)hourly Predictions
Marie Devaine
,
Pierre Gaillard
,
Yannig Goude
,
Gilles Stoltz
ALT
2012
Algorithmic Learning Theory - 23rd International Conference, ALT 2012, Lyon, France, October 29-31, 2012. Proceedings
Nader H. Bshouty
,
Gilles Stoltz
,
Nicolas Vayatis
,
Thomas Zeugmann
ALT
2012
Editors' Introduction
Nader H. Bshouty
,
Gilles Stoltz
,
Nicolas Vayatis
,
Thomas Zeugmann
NeurIPS
2012
Mirror Descent Meets Fixed Share (and Feels No Regret)
Nicolò Cesa-bianchi
,
Pierre Gaillard
,
Gabor Lugosi
,
Gilles Stoltz
COLT
2011
A Finite-Time Analysis of Multi-Armed Bandits Problems with Kullback-Leibler Divergences
Odalric-Ambrym Maillard
,
Rémi Munos
,
Gilles Stoltz
ALT
2011
Lipschitz Bandits Without the Lipschitz Constant
Sébastien Bubeck
,
Gilles Stoltz
,
Jia Yuan Yu
COLT
2011
Robust Approachability and Regret Minimization in Games with Partial Monitoring
Shie Mannor
,
Vianney Perchet
,
Gilles Stoltz
JMLR
2011
X-Armed Bandits
Sébastien Bubeck
,
Rémi Munos
,
Gilles Stoltz
,
Csaba Szepesvári
COLT
2009
Online Multi-Task Learning with Hard Constraints
Gábor Lugosi
,
Omiros Papaspiliopoulos
,
Gilles Stoltz
ALT
2009
Pure Exploration in Multi-Armed Bandits Problems
Sébastien Bubeck
,
Rémi Munos
,
Gilles Stoltz
NeurIPS
2008
Online Optimization in X-Armed Bandits
Sébastien Bubeck
,
Gilles Stoltz
,
Csaba Szepesvári
,
Rémi Munos
MLJ
2007
Improved Second-Order Bounds for Prediction with Expert Advice
Nicolò Cesa-Bianchi
,
Yishay Mansour
,
Gilles Stoltz
COLT
2007
Strategies for Prediction Under Imperfect Monitoring
Gábor Lugosi
,
Shie Mannor
,
Gilles Stoltz
COLT
2005
Improved Second-Order Bounds for Prediction with Expert Advice
Nicolò Cesa-Bianchi
,
Yishay Mansour
,
Gilles Stoltz
MLJ
2005
Internal Regret in On-Line Portfolio Selection
Gilles Stoltz
,
Gábor Lugosi
COLT
2004
Minimizing Regret with Label Efficient Prediction
Nicolò Cesa-Bianchi
,
Gábor Lugosi
,
Gilles Stoltz
COLT
2003
Internal Regret in On-Line Portfolio Selection
Gilles Stoltz
,
Gábor Lugosi