Federated Learning with Heterogeneous LLMs: Integrating Small Student Client Models with a Large Hungry Model
Abstract
In recent years, federated learning (FL) has emerged as a promising technique to enable decentralized training of models without the need for data centralization, addressing privacy concerns and reducing communication over-head. The challenge, however, lies in scaling federated systems to accommodate clients with different computational capabilities. The heterogeneity of clients in terms of data, model structures, and computational resources presents significant challenges. Addressing these challenges can lead to more robust and efficient FL systems, making it possible to leverage diverse data sources and computational environments. Here we propose a system where small language models run on heterogeneous cli-ents while a large, more powerful model at the server aggregates their contributions. This architecture leverages the strengths of both small, task-specific client models and a large server model to enhance generalization and efficiency. This is important because it addresses the growing need for scalable, privacy-preserving systems that can operate in diverse environments with varying resources. Through such a system we intend to contribute to the AI field by improving the efficiency of federated learning systems while enhancing their adaptability to real-world applications.
Cite
Text
Jajoo. "Federated Learning with Heterogeneous LLMs: Integrating Small Student Client Models with a Large Hungry Model." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I28.35332Markdown
[Jajoo. "Federated Learning with Heterogeneous LLMs: Integrating Small Student Client Models with a Large Hungry Model." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/jajoo2025aaai-federated/) doi:10.1609/AAAI.V39I28.35332BibTeX
@inproceedings{jajoo2025aaai-federated,
title = {{Federated Learning with Heterogeneous LLMs: Integrating Small Student Client Models with a Large Hungry Model}},
author = {Jajoo, Gautam},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2025},
pages = {29581-29583},
doi = {10.1609/AAAI.V39I28.35332},
url = {https://mlanthology.org/aaai/2025/jajoo2025aaai-federated/}
}