Converge Faster, Talk Less: Hessian-Informed Federated Zeroth-Order Optimization
Abstract
Zeroth-order (ZO) optimization enables dimension-free communication in federated learning (FL), making it attractive for fine-tuning of large language models (LLMs) due to significant communication savings. However, existing ZO-FL methods largely overlook curvature information, despite its well-established benefits for convergence acceleration. To address this, we propose **HiSo**, a Hessian-informed ZO federated optimization method that accelerates convergence by leveraging global diagonal Hessian approximations, while strictly preserving scalar-only communication **without transmitting any second-order information**. Theoretically, for non-convex functions, we show that HiSo can achieve an accelerated convergence rate that is independent of the Lipschitz constant $L$ and model dimension $d$ under some Hessian approximation assumptions, offering a plausible explanation for the observed phenomenon of ZO convergence being much faster than its worst-case $O(d)$-bound. Empirically, across diverse LLM fine-tuning benchmarks, HiSo delivers a 1$\sim$5× speedup in communication rounds over existing state-of-the-art ZO-FL baselines. This superior convergence not only cuts communication costs but also provides strong empirical evidence that Hessian information acts as an effective accelerator in federated ZO optimization settings.
Cite
Text
Li et al. "Converge Faster, Talk Less: Hessian-Informed Federated Zeroth-Order Optimization." International Conference on Learning Representations, 2026.Markdown
[Li et al. "Converge Faster, Talk Less: Hessian-Informed Federated Zeroth-Order Optimization." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/li2026iclr-converge/)BibTeX
@inproceedings{li2026iclr-converge,
title = {{Converge Faster, Talk Less: Hessian-Informed Federated Zeroth-Order Optimization}},
author = {Li, Zhe and Ying, Bicheng and Liu, Zidong and Dong, Chaosheng and Yang, Haibo},
booktitle = {International Conference on Learning Representations},
year = {2026},
url = {https://mlanthology.org/iclr/2026/li2026iclr-converge/}
}