Conditioned Score-Based Models for Learning Collision-Free Trajectory Generation
Abstract
Planning a motion in a cluttered environment is a recurring task autonomous agents need to solve. This paper presents a first attempt to learn generative models for collision-free trajectory generation based on conditioned score-based models. Given multiple navigation tasks, environment maps and collision-free trajectories pre-computed with a sample-based planner, using a signed distance function loss we learn a vision encoder of the map and use its embedding to learn a conditioned score-based model for trajectory generation. A novelty of our method is to integrate in a temporal U-net architecture conditioning variables such as the latent representation of the environment and task features, using a cross-attention mechanism. We validate our approach in a simulated 2D planar navigation toy task, where a robot needs to plan a path that avoids obstacles in a scene.
Cite
Text
Carvalho et al. "Conditioned Score-Based Models for Learning Collision-Free Trajectory Generation." NeurIPS 2022 Workshops: SBM, 2022.Markdown
[Carvalho et al. "Conditioned Score-Based Models for Learning Collision-Free Trajectory Generation." NeurIPS 2022 Workshops: SBM, 2022.](https://mlanthology.org/neuripsw/2022/carvalho2022neuripsw-conditioned/)BibTeX
@inproceedings{carvalho2022neuripsw-conditioned,
title = {{Conditioned Score-Based Models for Learning Collision-Free Trajectory Generation}},
author = {Carvalho, Joao and Baierl, Mark and Urain, Julen and Peters, Jan},
booktitle = {NeurIPS 2022 Workshops: SBM},
year = {2022},
url = {https://mlanthology.org/neuripsw/2022/carvalho2022neuripsw-conditioned/}
}