Learning Curve Bounds for a Markov Decision Process with Undiscounted Rewards
Abstract
Article Free Access Share on Learning curve bounds for a Markov decision process with undiscounted rewards Authors: Lawrence K. Saul Center for Biological and Computational Learning, Massachusetts Institute of Technology, 79 Amherst Street, E10-243, Cambridge, MA Center for Biological and Computational Learning, Massachusetts Institute of Technology, 79 Amherst Street, E10-243, Cambridge, MAView Profile , Satinder P. Singh Harlequin Inc., One Cambridge Center, Cambridge, MA and Center for Biological and Computational Learning, Massachusetts Institute of Technology, 79 Amherst Street, E10-243, Cambridge, MA Harlequin Inc., One Cambridge Center, Cambridge, MA and Center for Biological and Computational Learning, Massachusetts Institute of Technology, 79 Amherst Street, E10-243, Cambridge, MAView Profile Authors Info & Claims COLT '96: Proceedings of the ninth annual conference on Computational learning theoryJanuary 1996 Pages 147–156https://doi.org/10.1145/238061.238084Online:01 January 1996Publication History 3citation244DownloadsMetricsTotal Citations3Total Downloads244Last 12 Months7Last 6 weeks2 Get Citation AlertsNew Citation Alert added!This alert has been successfully added and will be sent to:You will be notified whenever a record that you have chosen has been cited. To manage your alert preferences, click on the button below. Manage my AlertsNew Citation Alert!Please log in to your account Save to BinderSave to BinderCreate a New BinderNameCancelCreateExport CitationPublisher SiteeReaderPDF
Cite
Text
Saul and Singh. "Learning Curve Bounds for a Markov Decision Process with Undiscounted Rewards." Annual Conference on Computational Learning Theory, 1996. doi:10.1145/238061.238084Markdown
[Saul and Singh. "Learning Curve Bounds for a Markov Decision Process with Undiscounted Rewards." Annual Conference on Computational Learning Theory, 1996.](https://mlanthology.org/colt/1996/saul1996colt-learning/) doi:10.1145/238061.238084BibTeX
@inproceedings{saul1996colt-learning,
title = {{Learning Curve Bounds for a Markov Decision Process with Undiscounted Rewards}},
author = {Saul, Lawrence K. and Singh, Satinder P.},
booktitle = {Annual Conference on Computational Learning Theory},
year = {1996},
pages = {147-156},
doi = {10.1145/238061.238084},
url = {https://mlanthology.org/colt/1996/saul1996colt-learning/}
}