Tsitsiklis, John N.

20 publications

COLT 2018 Private Sequential Learning John N. Tsitsiklis, Kuang Xu, Zhi Xu
ICML 2011 Mean-Variance Optimization in Markov Decision Processes Shie Mannor, John N. Tsitsiklis
JMLR 2009 Online Learning with Sample Path Constraints Shie Mannor, John N. Tsitsiklis, Jia Yuan Yu
COLT 2006 Online Learning with Constraints Shie Mannor, John N. Tsitsiklis
ICML 2004 Bias and Variance in Value Function Estimation Shie Mannor, Duncan Simester, Peng Sun, John N. Tsitsiklis
JMLR 2004 The Sample Complexity of Exploration in the Multi-Armed Bandit Problem (Special Topic on Learning Theory) Shie Mannor, John N. Tsitsiklis
COLT 2003 Lower Bounds on the Sample Complexity of Exploration in the Multi-Armed Bandit Problem Shie Mannor, John N. Tsitsiklis
MLJ 2002 On Average Versus Discounted Reward Temporal-Difference Learning John N. Tsitsiklis, Benjamin Van Roy
JMLR 2002 On the Convergence of Optimistic Policy Iteration John N. Tsitsiklis
NeurIPS 1999 Actor-Critic Algorithms Vijay R. Konda, John N. Tsitsiklis
MLJ 1999 Estimation of Time-Varying Parameters in Statistical Models: An Optimization Approach Dimitris Bertsimas, David Gamarnik, John N. Tsitsiklis
COLT 1997 Estimation of Time-Varying Parameters in Statistical Models: An Optimization Approach Dimitris Bertsimas, David Gamarnik, John N. Tsitsiklis
NeurIPS 1997 Reinforcement Learning for Call Admission Control and Routing in Integrated Service Networks Peter Marbach, Oliver Mihatsch, Miriam Schulte, John N. Tsitsiklis
NeurIPS 1996 Analysis of Temporal-Difference Learning with Function Approximation John N. Tsitsiklis, Benjamin Van Roy
NeurIPS 1996 Approximate Solutions to Optimal Stopping Problems John N. Tsitsiklis, Benjamin Van Roy
MLJ 1996 Feature-Based Methods for Large Scale Dynamic Programming John N. Tsitsiklis, Benjamin Van Roy
NeurIPS 1995 Stable Linear Approximations to Dynamic Programming for Stochastic Control Problems with Local Transitions Benjamin Van Roy, John N. Tsitsiklis
MLJ 1994 Asynchronous Stochastic Approximation and Q-Learning John N. Tsitsiklis
MLJ 1993 Active Learning Using Arbitrary Binary Valued Queries Sanjeev R. Kulkarni, Sanjoy K. Mitter, John N. Tsitsiklis
COLT 1992 PAC Learning with Generalized Samples and an Application to Stochastic Geometry Sanjeev R. Kulkarni, John N. Tsitsiklis, Sanjoy K. Mitter, Ofer Zeitouni