Korda, Nathaniel

7 publications

MLJ 2021 Concentration Bounds for Temporal Difference Learning with Linear Function Approximation: The Case of Batch Data and Uniform Sampling L. A. Prashanth, Nathaniel Korda, Rémi Munos
AAAI 2015 Fast Gradient Descent for Drifting Least Squares Regression, with Application to Bandits Nathaniel Korda, Prashanth L. A., Rémi Munos
ICML 2015 On TD(0) with Function Approximation: Concentration Bounds and a Centered Variant with Exponential Convergence Nathaniel Korda, Prashanth La
ECML-PKDD 2014 Fast LSTD Using Stochastic Approximation: Finite Time Analysis and Application to Traffic Control L. A. Prashanth, Nathaniel Korda, Rémi Munos
UAI 2013 Finite-Time Analysis of Kernelised Contextual Bandits Michal Valko, Nathaniel Korda, Rémi Munos, Ilias N. Flaounas, Nello Cristianini
NeurIPS 2013 Thompson Sampling for 1-Dimensional Exponential Family Bandits Nathaniel Korda, Emilie Kaufmann, Remi Munos
ALT 2012 Thompson Sampling: An Asymptotically Optimal Finite-Time Analysis Emilie Kaufmann, Nathaniel Korda, Rémi Munos