Harvard Glaucoma Detection and Progression: A Multimodal Multitask Dataset and Generalization-Reinforced Semi-Supervised Learning

Luo, Yan; Shi, Min; Tian, Yu; Elze, Tobias; Wang, Mengyu

doi:10.1109/ICCV51070.2023.01872

Harvard Glaucoma Detection and Progression: A Multimodal Multitask Dataset and Generalization-Reinforced Semi-Supervised Learning

Yan Luo, Min Shi, Yu Tian, Tobias Elze, Mengyu Wang

ICCV 2023 pp. 20471-20482

doi:10.1109/ICCV51070.2023.01872 /iccv/2023/luo2023iccv-harvard/

Abstract

Glaucoma is the number one cause of irreversible blindness globally. A major challenge for accurate glaucoma detection and progression forecasting is the bottleneck of limited labeled patients with the state-of-the-art (SOTA) 3D retinal imaging data of optical coherence tomography (OCT). To address the data scarcity issue, this paper proposes two solutions. First, we develop a novel generalization-reinforced semi-supervised learning (SSL) model called pseudo supervisor to optimally utilize unlabeled data. Compared with SOTA models, the proposed pseudo supervisor optimizes the policy of predicting pseudo labels with unlabeled samples to improve empirical generalization. Our pseudo supervisor model is evaluated with two clinical tasks consisting of glaucoma detection and progression forecasting. The progression forecasting task is evaluated both unimodally and multimodally. Our pseudo supervisor model demonstrates superior performance than SOTA SSL comparison models. Moreover, our model also achieves the best results on the publicly available LAG fun- dus dataset. Second, we introduce the Harvard Glaucoma Detection and Progression (Harvard-GDP) Dataset, a multimodal multitask dataset that includes data from 1,000 patients with OCT imaging data, as well as labels for glaucoma detection and progression. This is the largest glaucoma detection dataset with 3D OCT imaging data and the first glaucoma progression forecasting dataset that is publicly available. Detailed sex and racial analysis are pro- vided, which can be used by interested researchers for fairness learning studies. Our released dataset is benchmarked with several SOTA supervised CNN and transformer deep learning models. The dataset and code are made publicly available via https://ophai.hms.harvard.edu/ datasets/harvard-gdp1000.

PDF ICCV Semantic Scholar

Cite

Text

Luo et al. "Harvard Glaucoma Detection and Progression: A Multimodal Multitask Dataset and Generalization-Reinforced Semi-Supervised Learning." International Conference on Computer Vision, 2023. doi:10.1109/ICCV51070.2023.01872

Markdown

[Luo et al. "Harvard Glaucoma Detection and Progression: A Multimodal Multitask Dataset and Generalization-Reinforced Semi-Supervised Learning." International Conference on Computer Vision, 2023.](https://mlanthology.org/iccv/2023/luo2023iccv-harvard/) doi:10.1109/ICCV51070.2023.01872

BibTeX

@inproceedings{luo2023iccv-harvard,
  title     = {{Harvard Glaucoma Detection and Progression: A Multimodal Multitask Dataset and Generalization-Reinforced Semi-Supervised Learning}},
  author    = {Luo, Yan and Shi, Min and Tian, Yu and Elze, Tobias and Wang, Mengyu},
  booktitle = {International Conference on Computer Vision},
  year      = {2023},
  pages     = {20471-20482},
  doi       = {10.1109/ICCV51070.2023.01872},
  url       = {https://mlanthology.org/iccv/2023/luo2023iccv-harvard/}
}