Dual Risk Minimization for Robust Fine-Tuning of Zero-Shot Models
Abstract
Fine-tuning zero-shot foundation models often compromises their robustness to downstream distribution shifts. We propose dual risk minimization (DRM) which combines empirical risk minimization with worst-case risk minimization to better preserve core features conducive to downstream robustness. In particular, we utilize core-feature descriptions generated by LLMs to induce core-based zero-shot predictions which then serve as proxies to estimate the worst-case risk. DRM balances two crucial aspects of robustness: expected and worst-case performance over all possible domains, establishing a new state of the art on various real-world benchmarks. DRM significantly improves the out-of-distribution performance of fine-tuned CLIP ViT-L/14@336 on ImageNet (75.9 to 77.1), WILDS-iWildCam (47.1 to 51.8), and WILDS-FMoW (50.7 to 53.1); opening up new avenues for achieving next-level robustness in fine-tuning zero-shot models.
Cite
Text
Li et al. "Dual Risk Minimization for Robust Fine-Tuning of Zero-Shot Models." ICML 2024 Workshops: FM-Wild, 2024.Markdown
[Li et al. "Dual Risk Minimization for Robust Fine-Tuning of Zero-Shot Models." ICML 2024 Workshops: FM-Wild, 2024.](https://mlanthology.org/icmlw/2024/li2024icmlw-dual/)BibTeX
@inproceedings{li2024icmlw-dual,
title = {{Dual Risk Minimization for Robust Fine-Tuning of Zero-Shot Models}},
author = {Li, Kaican and Xie, Weiyan and Silva, Ricardo and Zhang, Nevin L.},
booktitle = {ICML 2024 Workshops: FM-Wild},
year = {2024},
url = {https://mlanthology.org/icmlw/2024/li2024icmlw-dual/}
}