PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Navigators

Abstract

We present PoliFormer (Policy Transformer), an RGB-only indoor navigation agent trained end-to-end with reinforcement learning at scale that generalizes to the real-world without adaptation despite being trained purely in simulation. PoliFormer uses a foundational vision transformer encoder with a causal transformer decoder enabling long-term memory and reasoning. It is trained for hundreds of millions of interactions across diverse environments, leveraging parallelized, multi-machine rollouts for efficient training with high throughput. PoliFormer is a masterful navigator, producing state-of-the-art results across two distinct embodiments, the LoCoBot and Stretch RE-1 robots, and four navigation benchmarks. It breaks through the plateaus of previous work, achieving an unprecedented 85.5% success rate in object goal navigation on the CHORES-S benchmark, a 28.5% absolute improvement. PoliFormer can also be trivially extended to a variety of downstream applications such as object tracking, multi-object navigation, and open-vocabulary navigation with no finetuning.

Cite

Text

Zeng et al. "PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Navigators." Proceedings of The 8th Conference on Robot Learning, 2024.

Markdown

[Zeng et al. "PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Navigators." Proceedings of The 8th Conference on Robot Learning, 2024.](https://mlanthology.org/corl/2024/zeng2024corl-poliformer/)

BibTeX

@inproceedings{zeng2024corl-poliformer,
  title     = {{PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Navigators}},
  author    = {Zeng, Kuo-Hao and Zhang, Zichen and Ehsani, Kiana and Hendrix, Rose and Salvador, Jordi and Herrasti, Alvaro and Girshick, Ross and Kembhavi, Aniruddha and Weihs, Luca},
  booktitle = {Proceedings of The 8th Conference on Robot Learning},
  year      = {2024},
  pages     = {408-432},
  volume    = {270},
  url       = {https://mlanthology.org/corl/2024/zeng2024corl-poliformer/}
}