Lorenza: Enhancing Generalization in Low-Rank Gradient LLM Training via Efficient Zeroth-Order Adaptive SAM

Refael, Yehonathan; Arbel, Iftach; Lindenbaum, Ofir; Tirer, Tom

Lorenza: Enhancing Generalization in Low-Rank Gradient LLM Training via Efficient Zeroth-Order Adaptive SAM

Yehonathan Refael, Iftach Arbel, Ofir Lindenbaum, Tom Tirer

TMLR 2026

/tmlr/2026/refael2026tmlr-lorenza/

Abstract

Modern applications often require fine-tuning large language models (LLMs) within strict memory and computational limits, but existing memory-efficient optimizers tend to compromise robustness and generalization. To tackle this, we introduce Lorenza, a low-memory optimizer based on Sharpness-Aware Minimization (SAM). Lorenza employs a stochastic zeroth-order estimator to approximate ascent directions, reducing the computational complexity of SAM while, as we prove, maintaining its convergence guarantees. Additionally, by applying randomized singular value decomposition, Lorenza performs efficient low-rank gradient updates, achieving memory efficiency similar to traditional methods. Our theoretical analysis and experiments demonstrate that Lorenza improves robustness and generalization, particularly in challenging language tasks. Furthermore, we present Lorenza+, which enhances Lorenza by incorporating the discarded orthogonal gradient component, resulting in additional performance gains without requiring extra memory or computational overhead.

PDF TMLR OpenReview Semantic Scholar

Cite

Text

Refael et al. "Lorenza: Enhancing Generalization in Low-Rank Gradient LLM Training via Efficient Zeroth-Order Adaptive SAM." Transactions on Machine Learning Research, 2026.

Markdown

[Refael et al. "Lorenza: Enhancing Generalization in Low-Rank Gradient LLM Training via Efficient Zeroth-Order Adaptive SAM." Transactions on Machine Learning Research, 2026.](https://mlanthology.org/tmlr/2026/refael2026tmlr-lorenza/)

BibTeX

@article{refael2026tmlr-lorenza,
  title     = {{Lorenza: Enhancing Generalization in Low-Rank Gradient LLM Training via Efficient Zeroth-Order Adaptive SAM}},
  author    = {Refael, Yehonathan and Arbel, Iftach and Lindenbaum, Ofir and Tirer, Tom},
  journal   = {Transactions on Machine Learning Research},
  year      = {2026},
  url       = {https://mlanthology.org/tmlr/2026/refael2026tmlr-lorenza/}
}