Dual Risk Minimization: Towards Next-Level Robustness in Fine-Tuning Zero-Shot Models

Kaican Li, Weiyan Xie, Yongxiang Huang, Didan Deng, Lanqing Hong, Zhenguo Li, Ricardo Silva, Nevin L. Zhang

NeurIPS 2024

doi:10.52202/079017-2110 /neurips/2024/li2024neurips-dual/

Abstract

Fine-tuning foundation models often compromises their robustness to distribution shifts. To remedy this, most robust fine-tuning methods aim to preserve the pre-trained features. However, not all pre-trained features are robust and those methods are largely indifferent to which ones to preserve. We propose dual risk minimization (DRM), which combines empirical risk minimization with worst-case risk minimization, to better preserve the core features of downstream tasks. In particular, we utilize core-feature descriptions generated by LLMs to induce core-based zero-shot predictions which then serve as proxies to estimate the worst-case risk. DRM balances two crucial aspects of model robustness: expected performance and worst-case performance, establishing a new state of the art on various real-world benchmarks. DRM significantly improves the out-of-distribution performance of CLIP ViT-L/14@336 on ImageNet (75.9$\to$77.1), WILDS-iWildCam (47.1$\to$51.8), and WILDS-FMoW (50.7$\to$53.1); opening up new avenues for robust fine-tuning. Our code is available at https://github.com/vaynexie/DRM.

PDF NeurIPS OpenReview Semantic Scholar

Cite

Text

Li et al. "Dual Risk Minimization: Towards Next-Level Robustness in Fine-Tuning Zero-Shot Models." Neural Information Processing Systems, 2024. doi:10.52202/079017-2110

Markdown

[Li et al. "Dual Risk Minimization: Towards Next-Level Robustness in Fine-Tuning Zero-Shot Models." Neural Information Processing Systems, 2024.](https://mlanthology.org/neurips/2024/li2024neurips-dual/) doi:10.52202/079017-2110

BibTeX

@inproceedings{li2024neurips-dual,
  title     = {{Dual Risk Minimization: Towards Next-Level Robustness in Fine-Tuning Zero-Shot Models}},
  author    = {Li, Kaican and Xie, Weiyan and Huang, Yongxiang and Deng, Didan and Hong, Lanqing and Li, Zhenguo and Silva, Ricardo and Zhang, Nevin L.},
  booktitle = {Neural Information Processing Systems},
  year      = {2024},
  doi       = {10.52202/079017-2110},
  url       = {https://mlanthology.org/neurips/2024/li2024neurips-dual/}
}