Robust Planning with (L)RTDP
Abstract
Stochastic Shortest Path problems (SSPs), a sub-class of Markov Decision Problems (MDPs), can be efficiently dealt with using Real-Time Dynamic Programming (RTDP). Yet, MDP models are often uncertain (obtained through statistics or guessing). The usual approach is robust planning: searching for the best policy under the worst model. This paper shows how RTDP can be made robust in the common case where transition probabilities are known to lie in a given interval. 1
Cite
Text
Buffet and Aberdeen. "Robust Planning with (L)RTDP." International Joint Conference on Artificial Intelligence, 2005.Markdown
[Buffet and Aberdeen. "Robust Planning with (L)RTDP." International Joint Conference on Artificial Intelligence, 2005.](https://mlanthology.org/ijcai/2005/buffet2005ijcai-robust/)BibTeX
@inproceedings{buffet2005ijcai-robust,
title = {{Robust Planning with (L)RTDP}},
author = {Buffet, Olivier and Aberdeen, Douglas},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2005},
pages = {1214-1219},
url = {https://mlanthology.org/ijcai/2005/buffet2005ijcai-robust/}
}