Guiding Search in Continuous State-Action Spaces by Learning an Action Sampler from Off-Target Search Experience

Abstract

In robotics, it is essential to be able to plan efficiently in high-dimensional continuous state-action spaces for long horizons. For such complex planning problems, unguided uniform sampling of actions until a path to a goal is found is hopelessly inefficient, and gradient-based approaches often fall short when the optimization manifold of a given problem is not smooth. In this paper, we present an approach that guides search in continuous spaces for generic planners by learning an action sampler from past search experience. We use a Generative Adversarial Network (GAN) to represent an action sampler, and address an important issue: search experience consists of a relatively large number of actions that are not on a solution path and a relatively small number of actions that actually are on a solution path. We introduce a new technique, based on an importance-ratio estimation method, for using samples from a non-target distribution to make GAN learning more data-efficient. We provide theoretical guarantees and empirical evaluation in three challenging continuous robot planning problems to illustrate the effectiveness of our algorithm.

Cite

Text

Kim et al. "Guiding Search in Continuous State-Action Spaces by Learning an Action Sampler from Off-Target Search Experience." AAAI Conference on Artificial Intelligence, 2018. doi:10.1609/AAAI.V32I1.12106

Markdown

[Kim et al. "Guiding Search in Continuous State-Action Spaces by Learning an Action Sampler from Off-Target Search Experience." AAAI Conference on Artificial Intelligence, 2018.](https://mlanthology.org/aaai/2018/kim2018aaai-guiding/) doi:10.1609/AAAI.V32I1.12106

BibTeX

@inproceedings{kim2018aaai-guiding,
  title     = {{Guiding Search in Continuous State-Action Spaces by Learning an Action Sampler from Off-Target Search Experience}},
  author    = {Kim, Beomjoon and Kaelbling, Leslie Pack and Lozano-Pérez, Tomás},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2018},
  pages     = {6509-6516},
  doi       = {10.1609/AAAI.V32I1.12106},
  url       = {https://mlanthology.org/aaai/2018/kim2018aaai-guiding/}
}