Control Strategies for a Stochastic Planner

Abstract

We present new algorithms for local planning over Markov decision processes. The base-level algorithm possesses several interesting features for control of computation, based on selecting computations accord-ing to their expected benefit to decision quality. The algorithms are shown to expand the agent’s knowledge where the world warrants it, with appropriate respon-siveness to time pressure and randomness. We then develop an introspective algorithm, using an internal representation of what computational work has already been done. This strategy extends the agent’s knowl-edge base where warranted by the agent’s world model and the agent’s knowledge of the work already put into various parts of this model. It also enables the agent to act so as to take advantage of the computational savings inherent in staying in known parts of the state space. The control flexibility provided by this strategy, by in-corporating natural problem-solving methods, directs computational effort towards where it’s needed better than previous approaches, providing greater hopes for scalability to large domains.

Cite

Text

Tash and Russell. "Control Strategies for a Stochastic Planner." AAAI Conference on Artificial Intelligence, 1994.

Markdown

[Tash and Russell. "Control Strategies for a Stochastic Planner." AAAI Conference on Artificial Intelligence, 1994.](https://mlanthology.org/aaai/1994/tash1994aaai-control/)

BibTeX

@inproceedings{tash1994aaai-control,
  title     = {{Control Strategies for a Stochastic Planner}},
  author    = {Tash, Jonathan and Russell, Stuart},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {1994},
  pages     = {1079-1085},
  url       = {https://mlanthology.org/aaai/1994/tash1994aaai-control/}
}