ML Anthology
Authors
Search
About
Duff, Michael O.
6 publications
ICML
2003
Design for an Optimal Probe
Michael O. Duff
ICML
2003
Diffusion Approximation for Bayesian Markov Chains
Michael O. Duff
AISTATS
2001
Monte-Carlo Algorithms for the Improvement of Finite-State Stochastic Controllers: Application to Bayes-Adaptive Markov Decision Processes
Michael O. Duff
NeurIPS
1996
Local Bandit Approximation for Optimal Learning Problems
Michael O. Duff
,
Andrew G. Barto
ICML
1995
Q-Learning for Bandit Problems
Michael O. Duff
NeurIPS
1994
Reinforcement Learning Methods for Continuous-Time Markov Decision Problems
Steven J. Bradtke
,
Michael O. Duff