LORE: Lagrangian-Optimized Robust Embeddings for Visual Encoders

Abstract

Visual encoders have become fundamental components in modern computer vision pipelines. However, ensuring robustness against adversarial perturbations remains a critical challenge. Recent efforts have explored both supervised and unsupervised adversarial fine-tuning strategies. We identify two key limitations in these approaches: (i) they often suffer from instability, especially during the early stages of fine-tuning, resulting in suboptimal convergence and degraded performance on clean data, and (ii) they exhibit a suboptimal trade-off between robustness and clean data accuracy, hindering the simultaneous optimization of both objectives. To overcome these challenges, we propose **L**agrangian-**O**ptimized **R**obust **E**mbeddings (LORE), a novel unsupervised adversarial fine-tuning framework. LORE utilizes constrained optimization, which offers a principled approach to balancing competing goals, such as improving robustness while preserving nominal performance. By enforcing embedding-space proximity constraints, LORE effectively maintains clean data performance throughout adversarial fine-tuning. Extensive experiments show that LORE stabilizes training and significantly improves zero-shot adversarial robustness with minimal degradation in clean data accuracy. Furthermore, we demonstrate the effectiveness of the adversarially fine-tuned image encoder in out-of-distribution generalization and enhancing the interpretability of image embeddings. The code is available on [GitHub](https://github.com/Theborna/LORE-Lagrangian-Optimized-Robust-Embeddings).

PDF NeurIPS OpenReview Semantic Scholar

Cite

Text

Khodabandeh et al. "LORE: Lagrangian-Optimized Robust Embeddings for Visual Encoders." Advances in Neural Information Processing Systems, 2025.

Markdown

[Khodabandeh et al. "LORE: Lagrangian-Optimized Robust Embeddings for Visual Encoders." Advances in Neural Information Processing Systems, 2025.](https://mlanthology.org/neurips/2025/khodabandeh2025neurips-lore/)

BibTeX

@inproceedings{khodabandeh2025neurips-lore,
  title     = {{LORE: Lagrangian-Optimized Robust Embeddings for Visual Encoders}},
  author    = {Khodabandeh, Borna and Afzali, Amirabbas and Afsharrad, Amirhossein and Mousavi, Seyed Shahabeddin and Lall, Sanjay and Amini, Sajjad and Moosavi-Dezfooli, Seyed-Mohsen},
  booktitle = {Advances in Neural Information Processing Systems},
  year      = {2025},
  url       = {https://mlanthology.org/neurips/2025/khodabandeh2025neurips-lore/}
}