PcLast: Discovering Plannable Continuous Latent States
Abstract
Goal-conditioned planning benefits from learned low-dimensional representations of rich observations. While compact latent representations typically learned from variational autoencoders or inverse dynamics enable goal-conditioned decision making, they ignore state reachability, hampering their performance. In this paper, we learn a representation that associates reachable states together for effective planning and goal-conditioned policy learning. We first learn a latent representation with multi-step inverse dynamics (to remove distracting information), and then transform this representation to associate reachable states together in $\ell_2$ space. Our proposals are rigorously tested in various simulation testbeds. Numerical results in reward-based settings show significant improvements in sampling efficiency. Further, in reward-free settings this approach yields layered state abstractions that enable computationally efficient hierarchical planning for reaching ad hoc goals with zero additional samples.
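As a quick illustration of the two-stage recipe the abstract describes, here is a minimal PyTorch sketch: stage 1 trains an encoder with a multi-step inverse-dynamics objective, and stage 2 learns a transform that pulls reachable latents together in $\ell_2$ space. This is an assumption-laden sketch, not the authors' implementation; the network sizes, the discrete action space, and the contrastive margin loss are all illustrative stand-ins for the paper's exact objectives.

# Hedged sketch of the abstract's two-stage recipe (NOT the authors' code).
# All dimensions and hyperparameters below are illustrative assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F

OBS_DIM, LATENT_DIM, NUM_ACTIONS = 64, 16, 4  # hypothetical sizes

class Encoder(nn.Module):
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(OBS_DIM, 128), nn.ReLU(),
                                 nn.Linear(128, LATENT_DIM))
    def forward(self, x):
        return self.net(x)

# Stage 1: multi-step inverse dynamics. Predict the action a_t from
# (phi(x_t), phi(x_{t+k})); this keeps control-relevant features and
# discards distracting information that does not help identify the action.
encoder = Encoder()
inv_head = nn.Sequential(nn.Linear(2 * LATENT_DIM, 128), nn.ReLU(),
                         nn.Linear(128, NUM_ACTIONS))

def inverse_dynamics_loss(x_t, x_tk, a_t):
    z_t, z_tk = encoder(x_t), encoder(x_tk)
    logits = inv_head(torch.cat([z_t, z_tk], dim=-1))
    return F.cross_entropy(logits, a_t)

# Stage 2: transform the latents so that l2 distance reflects reachability.
# Reachable pairs (a few steps apart in a trajectory) are pulled together;
# random pairs are pushed at least `margin` apart. This margin-based
# contrastive form is a generic stand-in for the paper's exact loss.
transform = nn.Sequential(nn.Linear(LATENT_DIM, 128), nn.ReLU(),
                          nn.Linear(128, LATENT_DIM))

def reachability_loss(z_t, z_near, z_rand, margin=1.0):
    p_t, p_near, p_rand = transform(z_t), transform(z_near), transform(z_rand)
    pull = (p_t - p_near).pow(2).sum(-1)                        # reachable pair
    push = F.relu(margin - (p_t - p_rand).norm(dim=-1)).pow(2)  # random pair
    return (pull + push).mean()

# Example: one gradient step on dummy data (shape check only).
x_t = torch.randn(32, OBS_DIM)
x_tk = torch.randn(32, OBS_DIM)
a_t = torch.randint(0, NUM_ACTIONS, (32,))
inverse_dynamics_loss(x_t, x_tk, a_t).backward()

Planning and goal-conditioned policy learning then operate in the transformed space, where $\ell_2$ distance between latents approximates reachability between the underlying states.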
Cite
Text
Koul et al. "PcLast: Discovering Plannable Continuous Latent States." International Conference on Machine Learning, 2024.
Markdown
[Koul et al. "PcLast: Discovering Plannable Continuous Latent States." International Conference on Machine Learning, 2024.](https://mlanthology.org/icml/2024/koul2024icml-pclast/)
BibTeX
@inproceedings{koul2024icml-pclast,
title = {{PcLast: Discovering Plannable Continuous Latent States}},
author = {Koul, Anurag and Sujit, Shivakanth and Chen, Shaoru and Evans, Ben and Wu, Lili and Xu, Byron and Chari, Rajan and Islam, Riashat and Seraj, Raihan and Efroni, Yonathan and Molu, Lekan P. and Dudík, Miroslav and Langford, John and Lamb, Alex},
booktitle = {International Conference on Machine Learning},
year = {2024},
pages = {25475--25493},
volume = {235},
url = {https://mlanthology.org/icml/2024/koul2024icml-pclast/}
}