Szepesvari, Csaba
218 publications
NeurIPS
2024
Almost Free: Self-Concordance in Natural Exponential Families and an Application to Bandits
NeurIPS
2024
Confident Natural Policy Gradient for Local Planning in $q_\pi$-Realizable Constrained MDPs
NeurIPS
2024
Small Steps No More: Global Convergence of Stochastic Gradient Bandits for Arbitrary Learning Rates
NeurIPS
2024
To Believe or Not to Believe Your LLM: Iterative Prompting for Estimating Epistemic Uncertainty
AISTATS
2023
Efficient Planning in Combinatorial Action Spaces with Applications to Cooperative Multi-Agent Reinforcement Learning
NeurIPS
2023
Online RL in Linearly $q^\pi$-Realizable MDPs Is as Easy as in Linear MDPs if You Learn What to Ignore
NeurIPS
2023
Optimistic Natural Policy Gradient: A Simple Efficient Policy Optimization Framework for Online RL
UAI
2022
A Free Lunch from the Noise: Provable and Practical Exploration for Representation Learning
NeurIPS
2022
Bandit Theory and Thompson Sampling-Guided Directed Evolution for Sequence Optimization
NeurIPS
2022
Confident Approximate Policy Iteration for Efficient Local Planning in $q^\pi$-Realizable MDPs
AISTATS
2021
Confident Off-Policy Evaluation and Selection Through Self-Normalized Importance Weighting
COLT
2021
Nearly Minimax Optimal Reinforcement Learning for Linear Mixture Markov Decision Processes
COLT
2021
On Query-Efficient Planning in MDPs Under Linear Realizability of the Optimal State-Value Function
NeurIPS
2020
ImpatientCapsAndRuns: Approximately Optimal Algorithm Configuration from an Infinite Pool
NeurIPS
2019
Think Out of the "Box": Generically-Constrained Asynchronous Composite Optimization and Hedging
AISTATS
2018
Linear Stochastic Approximation: How Far Does Constant Step-Size and Iterate Averaging Go?
AAAI
2016
Delay-Tolerant Online Convex Optimization: Unified Analysis and Adaptive-Gradient Algorithms
NeurIPS
2016
Following the Leader and Fast Rates in Linear Prediction: Curved Constraint Sets and Other Regularities
AISTATS
2015
Exploiting Symmetries to Construct Efficient MCMC Algorithms with an Application to SLAM
ICML
2015
On Identifying Good Options Under Combinatorially Structured Feedback in Finite Noisy Environments
AISTATS
2014
A Finite-Sample Generalization Bound for Semiparametric Regression: Partially Linear Models
COLT
2014
Proceedings of the 27th Conference on Learning Theory, COLT 2014, Barcelona, Spain, June 13-15, 2014
NeurIPS
2013
Online Learning in Markov Decision Processes with Adversarially Chosen Transition Probability Distributions
ICML
2012
Statistical Linear Estimation with Penalized Estimators: An Application to Reinforcement Learning
NeurIPS
2010
Estimation of Rényi Entropy and Mutual Information Based on Generalized Nearest-Neighbor Graphs
ICML
2009
Fast Gradient-Descent Methods for Temporal-Difference Learning with Linear Function Approximation
NeurIPS
2008
A Convergent $O(n)$ Temporal-Difference Algorithm for Off-Policy Learning with Linear Function Approximation