Ravan: Multi-Head Low-Rank Adaptation for Federated Fine-Tuning
Abstract
Large Language Models (LLMs) have yet to effectively leverage the vast amounts of edge-device data, and Federated Learning (FL) offers a promising paradigm to collaboratively fine-tune LLMs without transferring private edge data to the cloud. To operate within the computational and communication constraints of edge devices, recent literature on federated fine-tuning of LLMs proposes the use of low-rank adaptation (LoRA) and similar parameter-efficient methods. However, LoRA-based methods suffer from accuracy degradation in FL settings, primarily because of data and computational heterogeneity across clients. We propose Ravan, an adaptive multi-head LoRA method that balances parameter efficiency and model expressivity by reparameterizing the weight updates as the sum of multiple LoRA heads, each of the form $s_i\textbf{B}_i\textbf{H}_i\textbf{A}_i$, in which only the $\textbf{H}_i$ parameters and their lightweight scaling factors $s_i$ are trained. These trainable scaling factors let the optimization focus on the most useful heads, recovering a higher-rank approximation of the full update without increasing the number of communicated parameters, since clients upload $s_i\textbf{H}_i$ directly. Experiments on vision and language benchmarks show that Ravan improves test accuracy by 2–8\% over prior parameter-efficient baselines, making it a robust and scalable solution for federated fine-tuning of LLMs.
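The reparameterization described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names (`init_heads`, `weight_update`, `client_payload`, `server_average`), the Gaussian initialization of the frozen factors, the zero initialization of $\textbf{H}_i$, and the unweighted server average are all assumptions made for illustration. The sketch only shows the structure the abstract states: the update is a sum of heads $s_i\textbf{B}_i\textbf{H}_i\textbf{A}_i$, only $\textbf{H}_i$ and $s_i$ are trainable, and clients upload $s_i\textbf{H}_i$.

```python
import numpy as np


def init_heads(d_out, d_in, rank, num_heads, rng):
    """Build the heads of a multi-head update (hypothetical helper).

    B_i and A_i stay frozen; H_i and s_i are the only trainable parts.
    The initialization scheme here is an assumption, not the paper's.
    """
    heads = []
    for _ in range(num_heads):
        heads.append({
            "B": rng.standard_normal((d_out, rank)) / np.sqrt(rank),  # frozen
            "A": rng.standard_normal((rank, d_in)) / np.sqrt(d_in),   # frozen
            "H": np.zeros((rank, rank)),  # trainable; zeros so the update starts at 0
            "s": 1.0,                     # trainable scalar scaling factor
        })
    return heads


def weight_update(heads):
    """Return the full update: the sum over heads of s_i * B_i @ H_i @ A_i."""
    return sum(h["s"] * h["B"] @ h["H"] @ h["A"] for h in heads)


def client_payload(heads):
    """What a client uploads: one small (rank x rank) matrix s_i * H_i per head."""
    return [h["s"] * h["H"] for h in heads]


def server_average(payloads):
    """Average s_i * H_i per head across clients (unweighted; an assumption)."""
    return [np.mean(np.stack(per_head), axis=0) for per_head in zip(*payloads)]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    d_out, d_in, rank, num_heads = 16, 12, 2, 4
    heads = init_heads(d_out, d_in, rank, num_heads, rng)

    # Stand-in for local training: give each head a nonzero H_i.
    for h in heads:
        h["H"] = rng.standard_normal((rank, rank))

    delta = weight_update(heads)
    trainable = num_heads * (rank * rank + 1)  # H_i entries plus one s_i per head
    print("update shape:", delta.shape)
    print("update rank (at most num_heads * rank):", np.linalg.matrix_rank(delta))
    print("trainable parameters:", trainable)
    print("uploaded parameters:", sum(p.size for p in client_payload(heads)))
```

Each head on its own has rank at most `rank`, while the sum can reach rank `num_heads * rank`, which is the sense in which several small heads can approximate a higher-rank update. Because the scalar $s_i$ is folded into the uploaded matrix, the payload size depends only on the number of heads and their rank.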
Cite
Text
Raje et al. "Ravan: Multi-Head Low-Rank Adaptation for Federated Fine-Tuning." Advances in Neural Information Processing Systems, 2025.
Markdown
[Raje et al. "Ravan: Multi-Head Low-Rank Adaptation for Federated Fine-Tuning." Advances in Neural Information Processing Systems, 2025.](https://mlanthology.org/neurips/2025/raje2025neurips-ravan/)
BibTeX
@inproceedings{raje2025neurips-ravan,
title = {{Ravan: Multi-Head Low-Rank Adaptation for Federated Fine-Tuning}},
author = {Raje, Arian and Askin, Baris and Jhunjhunwala, Divyansh and Joshi, Gauri},
booktitle = {Advances in Neural Information Processing Systems},
year = {2025},
url = {https://mlanthology.org/neurips/2025/raje2025neurips-ravan/}
}