Leveraging Foundation Models for Multi-Modal Federated Learning with Incomplete Modality

Che, Liwei; Wang, Jiaqi; Liu, Xinyue; Ma, Fenglong

doi:10.1007/978-3-031-70378-2_25

Leveraging Foundation Models for Multi-Modal Federated Learning with Incomplete Modality

Liwei Che, Jiaqi Wang, Xinyue Liu, Fenglong Ma

ECML-PKDD 2024 pp. 401-417

doi:10.1007/978-3-031-70378-2_25 /ecmlpkdd/2024/che2024ecmlpkdd-leveraging/

Abstract

Federated learning (FL) has obtained tremendous progress in providing collaborative training solutions for distributed data silos with privacy guarantees. However, few existing works explore a more realistic scenario where the clients hold multiple data modalities. In this paper, we aim to solve a novel challenge in multi-modal federated learning (MFL) – modality missing – the clients may lose part of the modalities in their local data sets. To tackle the problems, we propose a novel multi-modal federated learning method, Fed erated M ulti-modal contrasti V e training with P re-trained completion (FedMVP), which integrates the large-scale pre-trained models to enhance the federated training. In the proposed FedMVP framework, each client deploys a large-scale pre-trained model with frozen parameters for modality completion and representation knowledge transfer, enabling efficient and robust local training. On the server side, we utilize generated data to uniformly measure the representation similarity among the uploaded client models and construct a graph perspective to aggregate them according to their importance in the system. We demonstrate that the model achieves superior performance over two real-world image-text classification datasets and is robust to the performance degradation caused by missing modality.

PDF ECML-PKDD Semantic Scholar

Cite

Text

Che et al. "Leveraging Foundation Models for Multi-Modal Federated Learning with Incomplete Modality." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2024. doi:10.1007/978-3-031-70378-2_25

Markdown

[Che et al. "Leveraging Foundation Models for Multi-Modal Federated Learning with Incomplete Modality." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2024.](https://mlanthology.org/ecmlpkdd/2024/che2024ecmlpkdd-leveraging/) doi:10.1007/978-3-031-70378-2_25

BibTeX

@inproceedings{che2024ecmlpkdd-leveraging,
  title     = {{Leveraging Foundation Models for Multi-Modal Federated Learning with Incomplete Modality}},
  author    = {Che, Liwei and Wang, Jiaqi and Liu, Xinyue and Ma, Fenglong},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2024},
  pages     = {401-417},
  doi       = {10.1007/978-3-031-70378-2_25},
  url       = {https://mlanthology.org/ecmlpkdd/2024/che2024ecmlpkdd-leveraging/}
}