DART: Disease-Aware Image-Text Alignment and Self-Correcting Re-Alignment for Trustworthy Radiology Report Generation
Abstract
The automatic generation of radiology reports has emerged as a promising solution to reduce a time-consuming task and accurately capture critical disease-relevant findings in X-ray images. Previous approaches for radiology report generation have shown impressive performance. However, there remains significant potential to improve accuracy by ensuring that retrieved reports contain disease-relevant findings similar to those in the X-ray images and by refining generated reports. In this study, we propose a Disease-aware image-text Alignment and self-correcting Re-alignment for Trustworthy radiology report generation (DART) framework. In the first stage, we generate initial reports based on image-to-text retrieval with disease-matching, embedding both images and texts in a shared embedding space through contrastive learning. This approach ensures the retrieval of reports with similar disease-relevant findings that closely align with the input X-ray images. In the second stage, we further enhance the initial reports by introducing a self-correction module that re-aligns them with the X-ray images. Our proposed framework achieves state-of-the-art results on two widely used benchmarks, surpassing previous approaches in both report generation and clinical efficacy metrics, thereby enhancing the trustworthiness of radiology reports.
Cite
Text
Park et al. "DART: Disease-Aware Image-Text Alignment and Self-Correcting Re-Alignment for Trustworthy Radiology Report Generation." Conference on Computer Vision and Pattern Recognition, 2025. doi:10.1109/CVPR52734.2025.01452Markdown
[Park et al. "DART: Disease-Aware Image-Text Alignment and Self-Correcting Re-Alignment for Trustworthy Radiology Report Generation." Conference on Computer Vision and Pattern Recognition, 2025.](https://mlanthology.org/cvpr/2025/park2025cvpr-dart/) doi:10.1109/CVPR52734.2025.01452BibTeX
@inproceedings{park2025cvpr-dart,
title = {{DART: Disease-Aware Image-Text Alignment and Self-Correcting Re-Alignment for Trustworthy Radiology Report Generation}},
author = {Park, Sang-Jun and Heo, Keun-Soo and Shin, Dong-Hee and Son, Young-Han and Oh, Ji-Hye and Kam, Tae-Eui},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2025},
pages = {15580-15589},
doi = {10.1109/CVPR52734.2025.01452},
url = {https://mlanthology.org/cvpr/2025/park2025cvpr-dart/}
}