Learning Curve Bounds for a Markov Decision Process with Undiscounted Rewards

Abstract

Article Free Access Share on Learning curve bounds for a Markov decision process with undiscounted rewards Authors: Lawrence K. Saul Center for Biological and Computational Learning, Massachusetts Institute of Technology, 79 Amherst Street, E10-243, Cambridge, MA Center for Biological and Computational Learning, Massachusetts Institute of Technology, 79 Amherst Street, E10-243, Cambridge, MAView Profile , Satinder P. Singh Harlequin Inc., One Cambridge Center, Cambridge, MA and Center for Biological and Computational Learning, Massachusetts Institute of Technology, 79 Amherst Street, E10-243, Cambridge, MA Harlequin Inc., One Cambridge Center, Cambridge, MA and Center for Biological and Computational Learning, Massachusetts Institute of Technology, 79 Amherst Street, E10-243, Cambridge, MAView Profile Authors Info & Claims COLT '96: Proceedings of the ninth annual conference on Computational learning theoryJanuary 1996 Pages 147–156https://doi.org/10.1145/238061.238084Online:01 January 1996Publication History 3citation244DownloadsMetricsTotal Citations3Total Downloads244Last 12 Months7Last 6 weeks2 Get Citation AlertsNew Citation Alert added!This alert has been successfully added and will be sent to:You will be notified whenever a record that you have chosen has been cited. To manage your alert preferences, click on the button below. Manage my AlertsNew Citation Alert!Please log in to your account Save to BinderSave to BinderCreate a New BinderNameCancelCreateExport CitationPublisher SiteeReaderPDF

Cite

Text

Saul and Singh. "Learning Curve Bounds for a Markov Decision Process with Undiscounted Rewards." Annual Conference on Computational Learning Theory, 1996. doi:10.1145/238061.238084

Markdown

[Saul and Singh. "Learning Curve Bounds for a Markov Decision Process with Undiscounted Rewards." Annual Conference on Computational Learning Theory, 1996.](https://mlanthology.org/colt/1996/saul1996colt-learning/) doi:10.1145/238061.238084

BibTeX

@inproceedings{saul1996colt-learning,
  title     = {{Learning Curve Bounds for a Markov Decision Process with Undiscounted Rewards}},
  author    = {Saul, Lawrence K. and Singh, Satinder P.},
  booktitle = {Annual Conference on Computational Learning Theory},
  year      = {1996},
  pages     = {147-156},
  doi       = {10.1145/238061.238084},
  url       = {https://mlanthology.org/colt/1996/saul1996colt-learning/}
}