Conditioned Score-Based Models for Learning Collision-Free Trajectory Generation

Abstract

Planning a motion in a cluttered environment is a recurring task autonomous agents need to solve. This paper presents a first attempt to learn generative models for collision-free trajectory generation based on conditioned score-based models. Given multiple navigation tasks, environment maps and collision-free trajectories pre-computed with a sample-based planner, using a signed distance function loss we learn a vision encoder of the map and use its embedding to learn a conditioned score-based model for trajectory generation. A novelty of our method is to integrate in a temporal U-net architecture conditioning variables such as the latent representation of the environment and task features, using a cross-attention mechanism. We validate our approach in a simulated 2D planar navigation toy task, where a robot needs to plan a path that avoids obstacles in a scene.

Cite

Text

Carvalho et al. "Conditioned Score-Based Models for Learning Collision-Free Trajectory Generation." NeurIPS 2022 Workshops: SBM, 2022.

Markdown

[Carvalho et al. "Conditioned Score-Based Models for Learning Collision-Free Trajectory Generation." NeurIPS 2022 Workshops: SBM, 2022.](https://mlanthology.org/neuripsw/2022/carvalho2022neuripsw-conditioned/)

BibTeX

@inproceedings{carvalho2022neuripsw-conditioned,
  title     = {{Conditioned Score-Based Models for Learning Collision-Free Trajectory Generation}},
  author    = {Carvalho, Joao and Baierl, Mark and Urain, Julen and Peters, Jan},
  booktitle = {NeurIPS 2022 Workshops: SBM},
  year      = {2022},
  url       = {https://mlanthology.org/neuripsw/2022/carvalho2022neuripsw-conditioned/}
}