DiSProD: Differentiable Symbolic Propagation of Distributions for Planning

Abstract

The paper introduces DiSProD, an online planner developed for environments with probabilistic transitions in continuous state and action spaces. DiSProD builds a symbolic graph that captures the distribution of future trajectories, conditioned on a given policy, using independence assumptions and approximate propagation of distributions. The symbolic graph provides a differentiable representation of the policy's value, enabling efficient gradient-based optimization for long-horizon search. The propagation of approximate distributions can be seen as an aggregation of many trajectories, making it well-suited for dealing with sparse rewards and stochastic environments. An extensive experimental evaluation compares DiSProD to state-of-the-art planners in discrete-time planning and real-time control of robotic systems. The proposed method improves over existing planners in handling stochastic environments, sensitivity to search depth, sparsity of rewards, and large action spaces. Additional real-world experiments demonstrate that DiSProD can control ground vehicles and surface vessels to successfully navigate around obstacles.

Cite

Text

Chatterjee et al. "DiSProD: Differentiable Symbolic Propagation of Distributions for Planning." International Joint Conference on Artificial Intelligence, 2023. doi:10.24963/IJCAI.2023/591

Markdown

[Chatterjee et al. "DiSProD: Differentiable Symbolic Propagation of Distributions for Planning." International Joint Conference on Artificial Intelligence, 2023.](https://mlanthology.org/ijcai/2023/chatterjee2023ijcai-disprod/) doi:10.24963/IJCAI.2023/591

BibTeX

@inproceedings{chatterjee2023ijcai-disprod,
  title     = {{DiSProD: Differentiable Symbolic Propagation of Distributions for Planning}},
  author    = {Chatterjee, Palash and Chapagain, Ashutosh and Chen, Weizhe and Khardon, Roni},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2023},
  pages     = {5324-5332},
  doi       = {10.24963/IJCAI.2023/591},
  url       = {https://mlanthology.org/ijcai/2023/chatterjee2023ijcai-disprod/}
}