ML Anthology
Authors
Search
About
Tsitsiklis, John N.
20 publications
COLT
2018
Private Sequential Learning
John N. Tsitsiklis
,
Kuang Xu
,
Zhi Xu
ICML
2011
Mean-Variance Optimization in Markov Decision Processes
Shie Mannor
,
John N. Tsitsiklis
JMLR
2009
Online Learning with Sample Path Constraints
Shie Mannor
,
John N. Tsitsiklis
,
Jia Yuan Yu
COLT
2006
Online Learning with Constraints
Shie Mannor
,
John N. Tsitsiklis
ICML
2004
Bias and Variance in Value Function Estimation
Shie Mannor
,
Duncan Simester
,
Peng Sun
,
John N. Tsitsiklis
JMLR
2004
The Sample Complexity of Exploration in the Multi-Armed Bandit Problem (Special Topic on Learning Theory)
Shie Mannor
,
John N. Tsitsiklis
COLT
2003
Lower Bounds on the Sample Complexity of Exploration in the Multi-Armed Bandit Problem
Shie Mannor
,
John N. Tsitsiklis
MLJ
2002
On Average Versus Discounted Reward Temporal-Difference Learning
John N. Tsitsiklis
,
Benjamin Van Roy
JMLR
2002
On the Convergence of Optimistic Policy Iteration
John N. Tsitsiklis
NeurIPS
1999
Actor-Critic Algorithms
Vijay R. Konda
,
John N. Tsitsiklis
MLJ
1999
Estimation of Time-Varying Parameters in Statistical Models: An Optimization Approach
Dimitris Bertsimas
,
David Gamarnik
,
John N. Tsitsiklis
COLT
1997
Estimation of Time-Varying Parameters in Statistical Models: An Optimization Approach
Dimitris Bertsimas
,
David Gamarnik
,
John N. Tsitsiklis
NeurIPS
1997
Reinforcement Learning for Call Admission Control and Routing in Integrated Service Networks
Peter Marbach
,
Oliver Mihatsch
,
Miriam Schulte
,
John N. Tsitsiklis
NeurIPS
1996
Analysis of Temporal-Diffference Learning with Function Approximation
John N. Tsitsiklis
,
Benjamin Van Roy
NeurIPS
1996
Approximate Solutions to Optimal Stopping Problems
John N. Tsitsiklis
,
Benjamin Van Roy
MLJ
1996
Feature-Based Methods for Large Scale Dynamic Programming
John N. Tsitsiklis
,
Benjamin Van Roy
NeurIPS
1995
Stable LInear Approximations to Dynamic Programming for Stochastic Control Problems with Local Transitions
Benjamin Van Roy
,
John N. Tsitsiklis
MLJ
1994
Asynchronous Stochastic Approximation and Q-Learning
John N. Tsitsiklis
MLJ
1993
Active Learning Using Arbitrary Binary Valued Queries
Sanjeev R. Kulkarni
,
Sanjoy K. Mitter
,
John N. Tsitsiklis
COLT
1992
PAC Learning with Generalized Samples and an Application to Stochastic Geometry
Sanjeev R. Kulkarni
,
John N. Tsitsiklis
,
Sanjoy K. Mitter
,
Ofer Zeitouni