Robust Fine-Tuning and Adaptation of Zero-Shot Models via Adaptive Weight-Space Ensembling

Abstract

Distribution shifts during test-time are prevalent in most machine learning applications and can often lead to a significant decline in the model’s performance. Foundation models like CLIP demonstrate zero-shot capabilities across various distributions. Additional fine-tuning on a specific dataset increases task performance but often reduces robustness to distribution shifts. Linearly interpolating the weights of the zero-shot and fine-tuned models (WiSE-FT) improves generalization capabilities, while maintaining task performance. Paradigms like online test-time adaptation (TTA) and test-time training (TTT) address distributional shifts by continuously updating the model during test-time. In light of these findings, we propose a novel method—adaptive weight-space ensembling (AdaWiSE) that dynamically balances between task specialization and generalization by adaptively interpolating the zero-shot and fine-tuned model during test-time. AdaWiSE utilizes Bayesian optimization to effectively determine the currently optimal mixing coefficient for interpolation, based on minimizing the entropy of the model’s predictions.

Cite

Text

Döbler et al. "Robust Fine-Tuning and Adaptation of Zero-Shot Models via Adaptive Weight-Space Ensembling." European Conference on Computer Vision Workshops, 2024. doi:10.1007/978-3-031-91672-4_21

Markdown

[Döbler et al. "Robust Fine-Tuning and Adaptation of Zero-Shot Models via Adaptive Weight-Space Ensembling." European Conference on Computer Vision Workshops, 2024.](https://mlanthology.org/eccvw/2024/dobler2024eccvw-robust/) doi:10.1007/978-3-031-91672-4_21

BibTeX

@inproceedings{dobler2024eccvw-robust,
  title     = {{Robust Fine-Tuning and Adaptation of Zero-Shot Models via Adaptive Weight-Space Ensembling}},
  author    = {Döbler, Mario and Feil, Michael and Marsden, Robert A. and Yang, Bin},
  booktitle = {European Conference on Computer Vision Workshops},
  year      = {2024},
  pages     = {344-360},
  doi       = {10.1007/978-3-031-91672-4_21},
  url       = {https://mlanthology.org/eccvw/2024/dobler2024eccvw-robust/}
}