Scalable Detection of Undiagnosed ILD in Population Screening: A Multi-Cohort Study Using 3D Foundation Models

McConnell, Niccolò; Azimbagirad, Mehran; Cheng, Daryl O.; Yamada, Daisuke; Egashira, Ryoko; Chapman, Robert; McCabe, John; Wang, Shanshan; Lynch, David; Kinney, Greg; Vasudev, Pardeep; Taylor, Paul; Alexander, Daniel C.; Janes, Sam M.; Jacob, Joseph

MIDL 2026 pp. 4579-4599

Abstract

Undiagnosed interstitial lung disease (UILD), an early form of lung fibrosis, is increasingly detected in population-based low-dose computed tomography (LDCT) screening but remains systematically under-reported due to its subtle appearance. We developed and validated a foundation-model-augmented deep learning system for UILD detection across two of the largest thoracic CT cohorts worldwide: SUMMIT, the UK’s largest LDCT screening study ($>$11,000 scans), and COPDGene, a multi-centre US cohort spanning 21 scanners and $>$8,800 scans. We propose ViT-3D-TE, a multi-token 3D Vision Transformer designed to preserve both high-frequency focal texture and diffuse parenchymal change through CLS, MAX, and AVG token fusion. The model was initialised with TANGERINE, an open-source 3D masked autoencoder pretrained on 98,000 full-volume LDCT scans, providing volumetric priors essential for stable optimisation. ViT-3D-TE was trained solely on SUMMIT and evaluated on COPDGene without domain adaptation, and achieved strong performance (AUROC 0.9805, AUPRC 0.7699 internal; AUROC 0.9705, AUPRC 0.6170 external), representing 17$\times$ and 25$\times$ improvements over random baselines at clinically realistic cohort prevalences (4.6% and 2.5%). We further introduce ConvNeXt-2.5-MIL, a slice-based 2.5D alternative that performs competitively without relying on 3D foundation model pretraining. Together, these results provide, to our knowledge, the largest real-world validation to date of deep learning for UILD detection and demonstrate that foundation-model-enhanced 3D Transformers offer a practical and scalable pathway for integrating UILD detection into national LDCT screening workflows.
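The 17$\times$ and 25$\times$ figures quoted in the abstract can be checked with simple arithmetic, assuming the "random baseline" refers to the usual convention that a classifier ranking scans at random has an expected AUPRC roughly equal to the positive-class prevalence. The abstract does not spell out this definition, so the sketch below is an illustration under that assumption rather than the authors' own computation; the helper name `auprc_lift` is hypothetical.

```python
def auprc_lift(auprc: float, prevalence: float) -> float:
    """Ratio of a model's AUPRC to the AUPRC expected from random ranking.

    Assumption: the random baseline equals the cohort prevalence.
    """
    if not 0.0 < prevalence <= 1.0:
        raise ValueError("prevalence must lie in (0, 1]")
    return auprc / prevalence


# Values taken from the abstract: (AUPRC, cohort prevalence).
internal_lift = auprc_lift(0.7699, 0.046)  # SUMMIT, internal evaluation
external_lift = auprc_lift(0.6170, 0.025)  # COPDGene, external evaluation

print(f"internal lift: {internal_lift:.1f}x")
print(f"external lift: {external_lift:.1f}x")
```

Under this assumption the ratios come out close to 17 and 25 after rounding, in line with the improvements reported in the abstract.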

Cite

Text

McConnell et al. "Scalable Detection of Undiagnosed ILD in Population Screening: A Multi-Cohort Study Using 3D Foundation Models." Proceedings of The 9th International Conference on Medical Imaging with Deep Learning, 2026.

Markdown

[McConnell et al. "Scalable Detection of Undiagnosed ILD in Population Screening: A Multi-Cohort Study Using 3D Foundation Models." Proceedings of The 9th International Conference on Medical Imaging with Deep Learning, 2026.](https://mlanthology.org/midl/2026/mcconnell2026midl-scalable/)

BibTeX

@inproceedings{mcconnell2026midl-scalable,
  title     = {{Scalable Detection of Undiagnosed ILD in Population Screening: A Multi-Cohort Study Using 3D Foundation Models}},
  author    = {McConnell, Niccolò and Azimbagirad, Mehran and Cheng, Daryl O. and Yamada, Daisuke and Egashira, Ryoko and Chapman, Robert and McCabe, John and Wang, Shanshan and Lynch, David and Kinney, Greg and Vasudev, Pardeep and Taylor, Paul and Alexander, Daniel C. and Janes, Sam M. and Jacob, Joseph},
  booktitle = {Proceedings of The 9th International Conference on Medical Imaging with Deep Learning},
  year      = {2026},
  pages     = {4579--4599},
  volume    = {315},
  url       = {https://mlanthology.org/midl/2026/mcconnell2026midl-scalable/}
}