Temporal Test-Time Adaptation with State-Space Models

Abstract

Distribution shifts between training and test data are inevitable over the lifecycle of a deployed model, leading to performance decay. Adapting a model on test samples can help mitigate this drop in performance. However, most test-time adaptation methods have focused on synthetic corruption shifts, leaving a variety of distribution shifts underexplored. In this paper, we focus on distribution shifts that evolve gradually over time, which are common in the wild but challenging for existing methods, as we show. To address this, we propose STAD, a Bayesian filtering method that adapts a deployed model to temporal distribution shifts by learning the time-varying dynamics in the last set of hidden features. Without requiring labels, our model infers time-evolving class prototypes that act as a dynamic classification head. Through experiments on real-world temporal distribution shifts, we show that our method excels in handling small batch sizes and label shift.

Cite

Text

Schirmer et al. "Temporal Test-Time Adaptation with State-Space Models." Transactions on Machine Learning Research, 2025.

Markdown

[Schirmer et al. "Temporal Test-Time Adaptation with State-Space Models." Transactions on Machine Learning Research, 2025.](https://mlanthology.org/tmlr/2025/schirmer2025tmlr-temporal/)

BibTeX

@article{schirmer2025tmlr-temporal,
  title     = {{Temporal Test-Time Adaptation with State-Space Models}},
  author    = {Schirmer, Mona and Zhang, Dan and Nalisnick, Eric},
  journal   = {Transactions on Machine Learning Research},
  year      = {2025},
  url       = {https://mlanthology.org/tmlr/2025/schirmer2025tmlr-temporal/}
}