Beyond the Nav-Graph: Vision-and-Language Navigation in Continuous Environments – Extended Abstract

Abstract

We develop a language-guided navigation task set in a continuous 3D environment where agents must execute low-level actions to follow natural language navigation directions. By being situated in continuous environments, this setting lifts a number of assumptions implicit in prior work that represents environments as a sparse graph of panoramas with edges corresponding to navigability. Specifically, our setting drops the presumptions of known environment topologies, short-range oracle navigation, and perfect agent localization. To contextualize this new task, we develop models that mirror many of the advances made in prior settings. We find significantly lower performance in the continuous setting -- suggesting that performance in topological settings may be inflated by the strong implicit assumptions.

Cite

Text

Krantz et al. "Beyond the Nav-Graph: Vision-and-Language Navigation in Continuous Environments – Extended Abstract." ICML 2020 Workshops: LaReL, 2020.

Markdown

[Krantz et al. "Beyond the Nav-Graph: Vision-and-Language Navigation in Continuous Environments – Extended Abstract." ICML 2020 Workshops: LaReL, 2020.](https://mlanthology.org/icmlw/2020/krantz2020icmlw-beyond/)

BibTeX

@inproceedings{krantz2020icmlw-beyond,
  title     = {{Beyond the Nav-Graph: Vision-and-Language Navigation in Continuous Environments – Extended Abstract}},
  author    = {Krantz, Jacob and Wijmans, Erik and Majumdar, Arjun and Batra, Dhruv and Lee, Stefan},
  booktitle = {ICML 2020 Workshops: LaReL},
  year      = {2020},
  url       = {https://mlanthology.org/icmlw/2020/krantz2020icmlw-beyond/}
}