Global and Local Prompts Cooperation via Optimal Transport for Federated Learning

Abstract

Prompt learning in pretrained visual-language models has shown remarkable flexibility across various downstream tasks. Leveraging its inherent lightweight nature recent research attempted to integrate the powerful pretrained models into federated learning frameworks to simultaneously reduce communication costs and promote local training on insufficient data. Despite these efforts current federated prompt learning methods lack specialized designs to systematically address severe data heterogeneities e.g. data distribution with both label and feature shifts involved. To address this challenge we present Federated Prompts Cooperation via Optimal Transport (FedOTP) which introduces efficient collaborative prompt learning strategies to capture diverse category traits on a per-client basis. Specifically for each client we learn a global prompt to extract consensus knowledge among clients and a local prompt to capture client-specific category characteristics. Unbalanced Optimal Transport is then employed to align local visual features with these prompts striking a balance between global consensus and local personalization. By relaxing one of the equality constraints FedOTP enables prompts to focus solely on core image patch regions. Extensive experiments on datasets with various types of heterogeneities have demonstrated that our FedOTP outperforms the state-of-the-art methods.

Cite

Text

Li et al. "Global and Local Prompts Cooperation via Optimal Transport for Federated Learning." Conference on Computer Vision and Pattern Recognition, 2024. doi:10.1109/CVPR52733.2024.01155

Markdown

[Li et al. "Global and Local Prompts Cooperation via Optimal Transport for Federated Learning." Conference on Computer Vision and Pattern Recognition, 2024.](https://mlanthology.org/cvpr/2024/li2024cvpr-global/) doi:10.1109/CVPR52733.2024.01155

BibTeX

@inproceedings{li2024cvpr-global,
  title     = {{Global and Local Prompts Cooperation via Optimal Transport for Federated Learning}},
  author    = {Li, Hongxia and Huang, Wei and Wang, Jingya and Shi, Ye},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2024},
  pages     = {12151-12161},
  doi       = {10.1109/CVPR52733.2024.01155},
  url       = {https://mlanthology.org/cvpr/2024/li2024cvpr-global/}
}