Auer, Peter

45 publications

NeurIPS 2025 Improved Best-of-Both-Worlds Regret for Bandits with Delayed Feedback Ofir Schlisselberg, Tal Lancewicki, Peter Auer, Yishay Mansour
IJCAI 2023 Autonomous Exploration for Navigating in MDPs Using Blackbox RL Algorithms Pratik Gajane, Peter Auer, Ronald Ortner
COLT 2019 Achieving Optimal Dynamic Regret for Non-Stationary Bandits Without Prior Information Peter Auer, Yifang Chen, Pratik Gajane, Chung-Wei Lee, Haipeng Luo, Ronald Ortner, Chen-Yu Wei
COLT 2019 Adaptively Tracking the Best Bandit Arm with an Unknown Number of Distribution Changes Peter Auer, Pratik Gajane, Ronald Ortner
UAI 2019 Variational Regret Bounds for Reinforcement Learning Ronald Ortner, Pratik Gajane, Peter Auer
COLT 2016 An Algorithm with Nearly Optimal Pseudo-Regret for Both Stochastic and Adversarial Bandits Peter Auer, Chao-Kai Chiang
AISTATS 2016 Pareto Front Identification from Stochastic Bandit Feedback Peter Auer, Chao-Kai Chiang, Ronald Ortner, Madalina M. Drugan
ALT 2014 Algorithmic Learning Theory - 25th International Conference, ALT 2014, Bled, Slovenia, October 8-10, 2014. Proceedings Peter Auer, Alexander Clark, Thomas Zeugmann, Sandra Zilles
ALT 2014 Editors' Introduction Peter Auer, Alexander Clark, Thomas Zeugmann, Sandra Zilles
COLT 2012 Autonomous Exploration for Navigating in MDPs Shiau Hong Lim, Peter Auer
ICML 2012 PAC Subset Selection in Stochastic Multi-Armed Bandits Shivaram Kalyanakrishnan, Ambuj Tewari, Peter Auer, Peter Stone
UAI 2012 PAC-Bayesian Inequalities for Martingales Yevgeny Seldin, François Laviolette, Nicolò Cesa-Bianchi, John Shawe-Taylor, Peter Auer
ALT 2012 Regret Bounds for Restless Markov Bandits Ronald Ortner, Daniil Ryabko, Peter Auer, Rémi Munos
ALT 2011 Models for Autonomously Motivated Exploration in Reinforcement Learning - (Extended Abstract) Peter Auer, Shiau Hong Lim, Chris Watkins
UAI 2011 Noisy Search with Comparative Feedback Shiau Hong Lim, Peter Auer
NeurIPS 2011 PAC-Bayesian Analysis of Contextual Bandits Yevgeny Seldin, Peter Auer, John S. Shawe-taylor, Ronald Ortner, François Laviolette
ALT 2011 Upper-Confidence-Bound Algorithms for Active Learning in Multi-Armed Bandits Alexandra Carpentier, Alessandro Lazaric, Mohammad Ghavamzadeh, Rémi Munos, Peter Auer
ECML-PKDD 2010 Exploration-Exploitation of Eye Movement Enriched Multiple Feature Spaces for Content-Based Image Retrieval Zakria Hussain, Alex Po Leung, Kitsuchart Pasupa, David R. Hardoon, Peter Auer, John Shawe-Taylor
JMLR 2010 Near-Optimal Regret Bounds for Reinforcement Learning Thomas Jaksch, Ronald Ortner, Peter Auer
ICML 2009 Workshop Summary: On-Line Learning with Limited Feedback Jean-Yves Audibert, Peter Auer, Alessandro Lazaric, Rémi Munos, Daniil Ryabko, Csaba Szepesvári
NeurIPS 2008 Near-Optimal Regret Bounds for Reinforcement Learning Peter Auer, Thomas Jaksch, Ronald Ortner
MLJ 2007 A New PAC Bound for Intersection-Closed Concept Classes Peter Auer, Ronald Ortner
COLT 2007 Improved Rates for the Stochastic Continuum-Armed Bandit Problem Peter Auer, Ronald Ortner, Csaba Szepesvári
ALT 2006 Hannan Consistency in On-Line Learning in Case of Unbounded Losses Under Partial Monitoring Chamy Allenberg, Peter Auer, László Györfi, György Ottucsák
NeurIPS 2006 Logarithmic Online Regret Bounds for Undiscounted Reinforcement Learning Peter Auer, Ronald Ortner
COLT 2005 Learning Theory, 18th Annual Conference on Learning Theory, COLT 2005, Bertinoro, Italy, June 27-30, 2005, Proceedings Peter Auer, Ron Meir
ECML-PKDD 2004 A Boosting Approach to Multiple Instance Learning Peter Auer, Ronald Ortner
COLT 2004 A New PAC Bound for Intersection-Closed Concept Classes Peter Auer, Ronald Ortner
ECCV 2004 Weak Hypotheses and Boosting for Generic Object Detection and Recognition Andreas Opelt, Michael Fussenegger, Axel Pinz, Peter Auer
MLJ 2002 Finite-Time Analysis of the Multiarmed Bandit Problem Peter Auer, Nicolò Cesa-Bianchi, Paul Fischer
JMLR 2002 Using Confidence Bounds for Exploitation-Exploration Trade-Offs Peter Auer
COLT 2000 Adaptive and Self-Confident On-Line Learning Algorithms Peter Auer, Claudio Gentile
COLT 2000 An Improved On-Line Algorithm for Learning Linear Evaluation Functions Peter Auer
MLJ 1999 Structural Results About On-Line Learning Models with and Without Queries Peter Auer, Philip M. Long
MLJ 1998 Tracking the Best Disjunction Peter Auer, Manfred K. Warmuth
ICML 1997 On Learning from Multi-Instance Examples: Empirical Evaluation of a Theoretical Approach Peter Auer
COLT 1996 Learning of Depth Two Neural Networks with Constant Fan-in at the Hidden Nodes (Extended Abstract) Peter Auer, Stephen Kwek, Wolfgang Maass, Manfred K. Warmuth
NeurIPS 1995 Exponentially Many Local Minima for Single Neurons Peter Auer, Mark Herbster, Manfred K. Warmuth
ALT 1995 Learning Nested Differences in the Presence of Malicious Noise Peter Auer
MLJ 1995 On the Complexity of Function Learning Peter Auer, Philip M. Long, Wolfgang Maass, Gerhard J. Woeginger
ICML 1995 Theory and Applications of Agnostic PAC-Learning with Small Decision Trees Peter Auer, Robert C. Holte, Wolfgang Maass
NeCo 1994 Degree of Approximation Results for Feedforward Networks Approximating Unknown Mappings and Their Derivatives Kurt Hornik, Maxwell B. Stinchcombe, Halbert White, Peter Auer
ALT 1994 On-Line Learning with Malicious Noise and the Closure Algorithm Peter Auer, Nicolò Cesa-Bianchi
COLT 1993 On the Complexity of Function Learning Peter Auer, Philip M. Long, Wolfgang Maass, Gerhard J. Woeginger
COLT 1993 On-Line Learning of Rectangles in Noisy Environments Peter Auer