GENNAV: Polygon Mask Generation for Generalized Referring Navigable Regions

Abstract

We focus on the task of identifying the location of target regions from a natural language instruction and a front camera image captured by a mobility. This task is challenging because it requires both existence prediction and segmentation mask generation, particularly for stuff-type target regions with ambiguous boundaries. Existing methods often underperform in handling stuff-type target regions, in addition to absent or multiple targets. To overcome these limitations, we propose GENNAV, which predicts target existence and generates segmentation masks for multiple stuff-type target regions. To evaluate GENNAV, we constructed a novel benchmark called GRiN-Drive, which includes three distinct types of samples: no-target, single-target, and multi-target. GENNAV achieved superior performance over baseline methods on standard evaluation metrics. Furthermore, we conducted real-world experiments with four automobiles operated in five geographically distinct urban areas to validate its zero-shot transfer performance. In these experiments, GENNAV outperformed baseline methods and demonstrated its robustness across diverse real-world environments.

Cite

Text

Katsumata et al. "GENNAV: Polygon Mask Generation for Generalized Referring Navigable Regions." Proceedings of The 9th Conference on Robot Learning, 2025.

Markdown

[Katsumata et al. "GENNAV: Polygon Mask Generation for Generalized Referring Navigable Regions." Proceedings of The 9th Conference on Robot Learning, 2025.](https://mlanthology.org/corl/2025/katsumata2025corl-gennav/)

BibTeX

@inproceedings{katsumata2025corl-gennav,
  title     = {{GENNAV: Polygon Mask Generation for Generalized Referring Navigable Regions}},
  author    = {Katsumata, Kei and Iioka, Yui and Hosomi, Naoki and Misu, Teruhisa and Yamada, Kentaro and Sugiura, Komei},
  booktitle = {Proceedings of The 9th Conference on Robot Learning},
  year      = {2025},
  pages     = {5195-5217},
  volume    = {305},
  url       = {https://mlanthology.org/corl/2025/katsumata2025corl-gennav/}
}