Stoltz, Gilles

30 publications

AISTATS 2025 Narrowing the Gap Between Adversarial and Stochastic MDPs via Policy Optimization Daniil Tiapkin, Evgenii Chzhen, Gilles Stoltz
TMLR 2025 Policy Optimization via Adv2: Adversarial Learning on Advantage Functions Matthieu Jonckheere, Chiara Mignacco, Gilles Stoltz
TMLR 2024 Diversity-Preserving $k$--Armed Bandits, Revisited Hedi Hadiji, Sébastien Gerchinovitz, Jean-Michel Loubes, Gilles Stoltz
JMLR 2023 Adaptation to the Range in K-Armed Bandits Hédi Hadiji, Gilles Stoltz
ALT 2023 On Best-Arm Identification with a Fixed Budget in Non-Parametric Multi-Armed Bandits Antoine Barrier, Aurélien Garivier, Gilles Stoltz
NeurIPS 2023 Small Total-Cost Constraints in Contextual Bandits with Knapsacks, with Application to Fairness Evgenii Chzhen, Christophe Giraud, Zhen Li, Gilles Stoltz
NeurIPS 2022 Contextual Bandits with Knapsacks for a Conversion Model Zhen Li, Gilles Stoltz
JMLR 2022 KL-UCB-Switch: Optimal Regret Bounds for Stochastic Bandits from Both a Distribution-Dependent and a Distribution-Free Viewpoints Aurélien Garivier, Hédi Hadiji, Pierre Ménard, Gilles Stoltz
NeurIPS 2021 A Unified Approach to Fair Online Learning via Blackwell Approachability Evgenii Chzhen, Christophe Giraud, Gilles Stoltz
ALT 2019 Uniform Regret Bounds over $\mathbb{R}^d$ for the Sequential Linear Regression Problem with the Square Loss Pierre Gaillard, Sébastien Gerchinovitz, Malo Huard, Gilles Stoltz
COLT 2014 A Second-Order Bound with Excess Losses Pierre Gaillard, Gilles Stoltz, Tim van Erven
COLT 2014 Approachability in Unknown Games: Online Learning Meets Multi-Objective Optimization Shie Mannor, Vianney Perchet, Gilles Stoltz
JMLR 2014 Set-Valued Approachability and Online Learning with Partial Monitoring Shie Mannor, Vianney Perchet, Gilles Stoltz
MLJ 2013 Forecasting Electricity Consumption by Aggregating Specialized Experts - A Review of the Sequential Aggregation of Specialized Experts, with an Application to Slovakian and French Country-Wide One-Day-Ahead (half-)hourly Predictions Marie Devaine, Pierre Gaillard, Yannig Goude, Gilles Stoltz
ALT 2012 Algorithmic Learning Theory - 23rd International Conference, ALT 2012, Lyon, France, October 29-31, 2012. Proceedings Nader H. Bshouty, Gilles Stoltz, Nicolas Vayatis, Thomas Zeugmann
ALT 2012 Editors' Introduction Nader H. Bshouty, Gilles Stoltz, Nicolas Vayatis, Thomas Zeugmann
NeurIPS 2012 Mirror Descent Meets Fixed Share (and Feels No Regret) Nicolò Cesa-bianchi, Pierre Gaillard, Gabor Lugosi, Gilles Stoltz
COLT 2011 A Finite-Time Analysis of Multi-Armed Bandits Problems with Kullback-Leibler Divergences Odalric-Ambrym Maillard, Rémi Munos, Gilles Stoltz
ALT 2011 Lipschitz Bandits Without the Lipschitz Constant Sébastien Bubeck, Gilles Stoltz, Jia Yuan Yu
COLT 2011 Robust Approachability and Regret Minimization in Games with Partial Monitoring Shie Mannor, Vianney Perchet, Gilles Stoltz
JMLR 2011 X-Armed Bandits Sébastien Bubeck, Rémi Munos, Gilles Stoltz, Csaba Szepesvári
COLT 2009 Online Multi-Task Learning with Hard Constraints Gábor Lugosi, Omiros Papaspiliopoulos, Gilles Stoltz
ALT 2009 Pure Exploration in Multi-Armed Bandits Problems Sébastien Bubeck, Rémi Munos, Gilles Stoltz
NeurIPS 2008 Online Optimization in X-Armed Bandits Sébastien Bubeck, Gilles Stoltz, Csaba Szepesvári, Rémi Munos
MLJ 2007 Improved Second-Order Bounds for Prediction with Expert Advice Nicolò Cesa-Bianchi, Yishay Mansour, Gilles Stoltz
COLT 2007 Strategies for Prediction Under Imperfect Monitoring Gábor Lugosi, Shie Mannor, Gilles Stoltz
COLT 2005 Improved Second-Order Bounds for Prediction with Expert Advice Nicolò Cesa-Bianchi, Yishay Mansour, Gilles Stoltz
MLJ 2005 Internal Regret in On-Line Portfolio Selection Gilles Stoltz, Gábor Lugosi
COLT 2004 Minimizing Regret with Label Efficient Prediction Nicolò Cesa-Bianchi, Gábor Lugosi, Gilles Stoltz
COLT 2003 Internal Regret in On-Line Portfolio Selection Gilles Stoltz, Gábor Lugosi