Federated Class-Heterogeneous Report Labeling with Surgical Aggregation

Abstract

Labeling radiology reports is essential for creating medical imaging datasets and enabling AI-driven clinical decision support. While SBERT-based classifiers offer computationally efficient solutions for this task, a major challenge is the class heterogeneity across datasets, as different groups focus on extracting distinct disease labels. For instance, NIH and CheXpert CXR datasets share only 7 of their 14 and 13 labels, respectively. To address this, we propose to use Surgical Aggregation, a class-heterogeneous federated learning framework that collaboratively trains a global multi-label classifier without requiring alignment of labeling schemes across clients. Surgical Aggregation selectively merges shared class weights while appending new disease-specific nodes, thereby unifying distinct local labeling priorities, to dynamically incorporate all disease labels of interest. We evaluated Surgical Aggregation in multiple simulated settings with varying number of participating nodes as well as different degrees of overlapping labels. Our results demonstrate high performance confirming adaptability in class-heterogeneous environments, thereby offering a scalable and privacy-preserving solution for collaborative medical report labeling. Our code is available at https://github.com/BioIntelligence-Lab/Federated-MedEmbedX

Cite

Text

Shah et al. "Federated Class-Heterogeneous Report Labeling with Surgical Aggregation." Medical Imaging with Deep Learning, 2025.

Markdown

[Shah et al. "Federated Class-Heterogeneous Report Labeling with Surgical Aggregation." Medical Imaging with Deep Learning, 2025.](https://mlanthology.org/midl/2025/shah2025midl-federated/)

BibTeX

@inproceedings{shah2025midl-federated,
  title     = {{Federated Class-Heterogeneous Report Labeling with Surgical Aggregation}},
  author    = {Shah, Nikhil and Kulkarni, Pranav and Doo, Florence and Li, Ang and Jacobs, Michael A. and Parekh, Vishwa Sanjay},
  booktitle = {Medical Imaging with Deep Learning},
  year      = {2025},
  url       = {https://mlanthology.org/midl/2025/shah2025midl-federated/}
}