Test-Time Adaptation for Online Vision-Language Navigation with Feedback-Based Reinforcement Learning

Abstract

Navigating an unfamiliar environment during deployment poses a critical challenge for a vision-language navigation (VLN) agent. Yet test-time adaptation (TTA) remains relatively underexplored in robotic navigation, which leads us to a fundamental question: what are the key properties of TTA for online VLN? In our view, effective adaptation requires three qualities: 1) flexibility in handling different navigation outcomes, 2) interactivity with the external environment, and 3) a balance between plasticity and stability. To meet these requirements, we introduce FeedTTA, a novel TTA framework for online VLN that uses feedback-based reinforcement learning. Specifically, FeedTTA learns by maximizing binary episodic feedback, a practical setup in which the agent receives a binary scalar after each episode indicating whether the navigation succeeded or failed. Additionally, we propose a gradient regularization technique that leverages the binary structure of FeedTTA to balance plasticity and stability during adaptation. Our extensive experiments on challenging VLN benchmarks demonstrate the superior adaptability of FeedTTA, which even outperforms state-of-the-art offline training methods on the REVERIE benchmark with a single stream of learning.
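To make the binary episodic feedback setup concrete, the following is a minimal sketch of a generic REINFORCE-style update driven by a single 0/1 signal per episode. It is not the FeedTTA implementation: the abstract does not specify the policy, the update rule, or the gradient regularization, so none of those are reproduced here. The toy 1-D corridor, the two-action softmax policy, the running-average baseline, and every name (`run_episode`, `reinforce_update`, `adapt_online`) are assumptions chosen only for illustration.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def run_episode(logits, rng, goal=3, max_steps=6):
    """Roll out one episode in a hypothetical 1-D corridor.

    Returns the actions taken and a binary feedback value:
    1 if the goal position was reached, 0 otherwise.
    """
    probs = softmax(logits)
    position, actions = 0, []
    for _ in range(max_steps):
        action = 0 if rng.random() < probs[0] else 1  # 0 = left, 1 = right
        actions.append(action)
        position += 1 if action == 1 else -1
        if position == goal:
            return actions, 1  # success
    return actions, 0  # failure


def reinforce_update(logits, actions, feedback, baseline, lr=0.1):
    """One REINFORCE step using only the episode-level binary feedback.

    For a softmax policy, d log pi(a) / d logit_k = 1[k == a] - pi(k).
    The gradient is evaluated at the logits used for the rollout.
    """
    probs = softmax(logits)
    advantage = feedback - baseline
    new_logits = list(logits)
    for action in actions:
        for k in range(len(logits)):
            grad = (1.0 if k == action else 0.0) - probs[k]
            new_logits[k] += lr * advantage * grad
    return new_logits


def adapt_online(num_episodes=500, seed=0):
    """Adapt over a single stream of episodes, one update per episode."""
    rng = random.Random(seed)
    logits = [0.0, 0.0]
    baseline = 0.0  # running average of past feedback (an assumption)
    outcomes = []
    for _ in range(num_episodes):
        actions, feedback = run_episode(logits, rng)
        logits = reinforce_update(logits, actions, feedback, baseline)
        baseline = 0.9 * baseline + 0.1 * feedback
        outcomes.append(feedback)
    return logits, outcomes
```

Calling `adapt_online()` returns the final logits together with the list of per-episode outcomes, so the success rate over the stream can be inspected. Because the update is scaled by `feedback - baseline`, both successful and failed episodes produce a learning signal, which is one simple way to handle different navigation outcomes with a single scalar.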

Cite

Text

Kim et al. "Test-Time Adaptation for Online Vision-Language Navigation with Feedback-Based Reinforcement Learning." Proceedings of the 42nd International Conference on Machine Learning, 2025.

Markdown

[Kim et al. "Test-Time Adaptation for Online Vision-Language Navigation with Feedback-Based Reinforcement Learning." Proceedings of the 42nd International Conference on Machine Learning, 2025.](https://mlanthology.org/icml/2025/kim2025icml-testtime/)

BibTeX

@inproceedings{kim2025icml-testtime,
  title     = {{Test-Time Adaptation for Online Vision-Language Navigation with Feedback-Based Reinforcement Learning}},
  author    = {Kim, Sungjune and Oh, Gyeongrok and Ko, Heeju and Ji, Daehyun and Lee, Dongwook and Lee, Byung-Jun and Jang, Sujin and Kim, Sangpil},
  booktitle = {Proceedings of the 42nd International Conference on Machine Learning},
  year      = {2025},
  pages     = {30654-30671},
  volume    = {267},
  url       = {https://mlanthology.org/icml/2025/kim2025icml-testtime/}
}