Test-Time Adaptation with State-Space Models

Mona Schirmer, Dan Zhang, Eric Nalisnick

ICMLW 2024

/icmlw/2024/schirmer2024icmlw-testtime/

Abstract

Distribution shifts between training and test data are all but inevitable over the lifecycle of a deployed model and lead to performance decay. Adapting the model can hopefully mitigate this drop in performance. Yet, adaptation is challenging since it must be unsupervised: we usually do not have access to any labeled data at test time. In this paper, we propose a probabilistic state-space model that can adapt a deployed model subjected to distribution drift. Our model learns the dynamics induced by distribution shifts on the last set of hidden features. Without requiring labels, we infer time-evolving class prototypes that serve as a dynamic classification head. Moreover, our approach is lightweight, modifying only the model's last linear layer. In experiments on real-world distribution shifts and synthetic corruptions, we demonstrate that our approach performs competitively with methods that require back-propagation and access to the model backbone. Our model especially excels in the case of small test batches - the most difficult setting.

PDF ICMLW OpenReview Semantic Scholar

Cite

Text

Schirmer et al. "Test-Time Adaptation with State-Space Models." ICML 2024 Workshops: SPIGM, 2024.

Markdown

[Schirmer et al. "Test-Time Adaptation with State-Space Models." ICML 2024 Workshops: SPIGM, 2024.](https://mlanthology.org/icmlw/2024/schirmer2024icmlw-testtime/)

BibTeX

@inproceedings{schirmer2024icmlw-testtime,
  title     = {{Test-Time Adaptation with State-Space Models}},
  author    = {Schirmer, Mona and Zhang, Dan and Nalisnick, Eric},
  booktitle = {ICML 2024 Workshops: SPIGM},
  year      = {2024},
  url       = {https://mlanthology.org/icmlw/2024/schirmer2024icmlw-testtime/}
}