Test-Time Adaptation with State-Space Models
Abstract
Distribution shifts between training and test data are all but inevitable over the lifecycle of a deployed model and lead to performance decay. Adapting the model can hopefully mitigate this drop in performance. Yet, adaptation is challenging since it must be unsupervised: we usually do not have access to any labeled data at test time. In this paper, we propose a probabilistic state-space model that can adapt a deployed model subjected to distribution drift. Our model learns the dynamics induced by distribution shifts on the last set of hidden features. Without requiring labels, we infer time-evolving class prototypes that serve as a dynamic classification head. Moreover, our approach is lightweight, modifying only the model's last linear layer. In experiments on real-world distribution shifts and synthetic corruptions, we demonstrate that our approach performs competitively with methods that require back-propagation and access to the model backbone. Our model especially excels in the case of small test batches - the most difficult setting.
Cite
Text
Schirmer et al. "Test-Time Adaptation with State-Space Models." ICML 2024 Workshops: SPIGM, 2024.Markdown
[Schirmer et al. "Test-Time Adaptation with State-Space Models." ICML 2024 Workshops: SPIGM, 2024.](https://mlanthology.org/icmlw/2024/schirmer2024icmlw-testtime/)BibTeX
@inproceedings{schirmer2024icmlw-testtime,
title = {{Test-Time Adaptation with State-Space Models}},
author = {Schirmer, Mona and Zhang, Dan and Nalisnick, Eric},
booktitle = {ICML 2024 Workshops: SPIGM},
year = {2024},
url = {https://mlanthology.org/icmlw/2024/schirmer2024icmlw-testtime/}
}