Towards a Second Generation Random Walk Planner: An Experimental Exploration

Abstract

Random walks have become a popular component of recent planning systems. The increased exploration is a valuable addition to more exploitative search methods such as Greedy Best First Search (GBFS). A number of successful planners which incorporate random walks have been built. The work presented here aims to exploit the experience gained from building those systems. It begins a systematic study of the design space and alternative choices for building such a system, and develops a new random walk planner from scratch, with careful experiments along the way. Four major insights are: 1. a high state evaluation frequency is usually superior to the endpoint-only evaluation used in earlier systems, 2. adjusting the restarting parameter according to the progress speed in the search space performs better than any fixed setting, 3. biasing the action selection towards preferred operators of only the current state is better than Monte Carlo Helpful Actions, which depend on the number of times an action has been a preferred operator in previous walks, and 4. even simple forms of random walk planning can compete with GBFS.

Cite

Text

Nakhost and Müller. "Towards a Second Generation Random Walk Planner: An Experimental Exploration." International Joint Conference on Artificial Intelligence, 2013.

Markdown

[Nakhost and Müller. "Towards a Second Generation Random Walk Planner: An Experimental Exploration." International Joint Conference on Artificial Intelligence, 2013.](https://mlanthology.org/ijcai/2013/nakhost2013ijcai-second/)

BibTeX

@inproceedings{nakhost2013ijcai-second,
  title     = {{Towards a Second Generation Random Walk Planner: An Experimental Exploration}},
  author    = {Nakhost, Hootan and Müller, Martin},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2013},
  pages     = {2336-2342},
  url       = {https://mlanthology.org/ijcai/2013/nakhost2013ijcai-second/}
}