Test-Time Adaptation for Online Vision-Language Navigation with Feedback-Based Reinforcement Learning
Abstract
Navigating in an unfamiliar environment during deployment poses a critical challenge for a vision-language navigation (VLN) agent. Yet, test-time adaptation (TTA) remains relatively underexplored in robotic navigation, leading us to the fundamental question: what are the key properties of TTA for online VLN? In our view, effective adaptation requires three qualities: 1) flexibility in handling different navigation outcomes, 2) interactivity with external environment, and 3) maintaining a harmony between plasticity and stability. To address this, we introduce FeedTTA, a novel TTA framework for online VLN utilizing feedback-based reinforcement learning. Specifically, FeedTTA learns by maximizing binary episodic feedback, a practical setup in which the agent receives a binary scalar after each episode that indicates the success or failure of the navigation. Additionally, we propose a gradient regularization technique that leverages the binary structure of FeedTTA to achieve a balance between plasticity and stability during adaptation. Our extensive experiments on challenging VLN benchmarks demonstrate the superior adaptability of FeedTTA, even outperforming the state-of-the-art offline training methods in REVERIE benchmark with a single stream of learning.
Cite
Text
Kim et al. "Test-Time Adaptation for Online Vision-Language Navigation with Feedback-Based Reinforcement Learning." Proceedings of the 42nd International Conference on Machine Learning, 2025.Markdown
[Kim et al. "Test-Time Adaptation for Online Vision-Language Navigation with Feedback-Based Reinforcement Learning." Proceedings of the 42nd International Conference on Machine Learning, 2025.](https://mlanthology.org/icml/2025/kim2025icml-testtime/)BibTeX
@inproceedings{kim2025icml-testtime,
title = {{Test-Time Adaptation for Online Vision-Language Navigation with Feedback-Based Reinforcement Learning}},
author = {Kim, Sungjune and Oh, Gyeongrok and Ko, Heeju and Ji, Daehyun and Lee, Dongwook and Lee, Byung-Jun and Jang, Sujin and Kim, Sangpil},
booktitle = {Proceedings of the 42nd International Conference on Machine Learning},
year = {2025},
pages = {30654-30671},
volume = {267},
url = {https://mlanthology.org/icml/2025/kim2025icml-testtime/}
}