Stoltz, Gilles

28 publications

AISTATS 2025 Narrowing the Gap Between Adversarial and Stochastic MDPs via Policy Optimization Daniil Tiapkin, Evgenii Chzhen, Gilles Stoltz

TMLR 2025 Policy Optimization via Adv2: Adversarial Learning on Advantage Functions Matthieu Jonckheere, Chiara Mignacco, Gilles Stoltz

TMLR 2024 Diversity-Preserving $k$-Armed Bandits, Revisited Hédi Hadiji, Sébastien Gerchinovitz, Jean-Michel Loubes, Gilles Stoltz

JMLR 2023 Adaptation to the Range in K-Armed Bandits Hédi Hadiji, Gilles Stoltz

ALT 2023 On Best-Arm Identification with a Fixed Budget in Non-Parametric Multi-Armed Bandits Antoine Barrier, Aurélien Garivier, Gilles Stoltz

NeurIPS 2023 Small Total-Cost Constraints in Contextual Bandits with Knapsacks, with Application to Fairness Evgenii Chzhen, Christophe Giraud, Zhen Li, Gilles Stoltz

NeurIPS 2022 Contextual Bandits with Knapsacks for a Conversion Model Zhen Li, Gilles Stoltz

JMLR 2022 KL-UCB-Switch: Optimal Regret Bounds for Stochastic Bandits from Both a Distribution-Dependent and a Distribution-Free Viewpoints Aurélien Garivier, Hédi Hadiji, Pierre Ménard, Gilles Stoltz

NeurIPS 2021 A Unified Approach to Fair Online Learning via Blackwell Approachability Evgenii Chzhen, Christophe Giraud, Gilles Stoltz

ALT 2019 Uniform Regret Bounds over $\mathbb{R}^d$ for the Sequential Linear Regression Problem with the Square Loss Pierre Gaillard, Sébastien Gerchinovitz, Malo Huard, Gilles Stoltz

COLT 2014 A Second-Order Bound with Excess Losses Pierre Gaillard, Gilles Stoltz, Tim van Erven

COLT 2014 Approachability in Unknown Games: Online Learning Meets Multi-Objective Optimization Shie Mannor, Vianney Perchet, Gilles Stoltz

JMLR 2014 Set-Valued Approachability and Online Learning with Partial Monitoring Shie Mannor, Vianney Perchet, Gilles Stoltz

MLJ 2013 Forecasting Electricity Consumption by Aggregating Specialized Experts - A Review of the Sequential Aggregation of Specialized Experts, with an Application to Slovakian and French Country-Wide One-Day-Ahead (half-)hourly Predictions Marie Devaine, Pierre Gaillard, Yannig Goude, Gilles Stoltz

NeurIPS 2012 Mirror Descent Meets Fixed Share (and Feels No Regret) Nicolò Cesa-Bianchi, Pierre Gaillard, Gábor Lugosi, Gilles Stoltz

COLT 2011 A Finite-Time Analysis of Multi-Armed Bandits Problems with Kullback-Leibler Divergences Odalric-Ambrym Maillard, Rémi Munos, Gilles Stoltz

ALT 2011 Lipschitz Bandits Without the Lipschitz Constant Sébastien Bubeck, Gilles Stoltz, Jia Yuan Yu

COLT 2011 Robust Approachability and Regret Minimization in Games with Partial Monitoring Shie Mannor, Vianney Perchet, Gilles Stoltz

JMLR 2011 X-Armed Bandits Sébastien Bubeck, Rémi Munos, Gilles Stoltz, Csaba Szepesvári

COLT 2009 Online Multi-Task Learning with Hard Constraints Gábor Lugosi, Omiros Papaspiliopoulos, Gilles Stoltz

ALT 2009 Pure Exploration in Multi-Armed Bandits Problems Sébastien Bubeck, Rémi Munos, Gilles Stoltz

NeurIPS 2008 Online Optimization in X-Armed Bandits Sébastien Bubeck, Gilles Stoltz, Csaba Szepesvári, Rémi Munos

MLJ 2007 Improved Second-Order Bounds for Prediction with Expert Advice Nicolò Cesa-Bianchi, Yishay Mansour, Gilles Stoltz

COLT 2007 Strategies for Prediction Under Imperfect Monitoring Gábor Lugosi, Shie Mannor, Gilles Stoltz

COLT 2005 Improved Second-Order Bounds for Prediction with Expert Advice Nicolò Cesa-Bianchi, Yishay Mansour, Gilles Stoltz

MLJ 2005 Internal Regret in On-Line Portfolio Selection Gilles Stoltz, Gábor Lugosi

COLT 2004 Minimizing Regret with Label Efficient Prediction Nicolò Cesa-Bianchi, Gábor Lugosi, Gilles Stoltz

COLT 2003 Internal Regret in On-Line Portfolio Selection Gilles Stoltz, Gábor Lugosi