Controllable Text-to-Image Synthesis for Multi-Modality MR Images

Abstract

Generative modeling has seen significant advancements in recent years, especially in the realm of text-to-image synthesis. Despite this progress, the medical field has yet to fully leverage the capabilities of large-scale foundational models for synthetic data generation. This paper introduces a framework for text-conditional magnetic resonance (MR) imaging generation, addressing the complexities associated with multi-modality considerations. The framework comprises a pre-trained large language model, a diffusion-based prompt-conditional image generation architecture, and an additional denoising network for input structural binary masks. Experimental results demonstrate that the proposed framework is capable of generating realistic, high-resolution, and high-fidelity multi-modal MR images that align with medical language text prompts. Further, the study interprets the cross-attention maps of the generated results based on text-conditional statements. The contributions of this research lay a robust foundation for future studies in text-conditional medical image generation and hold significant promise for accelerating advancements in medical imaging research.

Cite

Text

Kim et al. "Controllable Text-to-Image Synthesis for Multi-Modality MR Images." Winter Conference on Applications of Computer Vision, 2024.

Markdown

[Kim et al. "Controllable Text-to-Image Synthesis for Multi-Modality MR Images." Winter Conference on Applications of Computer Vision, 2024.](https://mlanthology.org/wacv/2024/kim2024wacv-controllable/)

BibTeX

@inproceedings{kim2024wacv-controllable,
  title     = {{Controllable Text-to-Image Synthesis for Multi-Modality MR Images}},
  author    = {Kim, Kyuri and Na, Yoonho and Ye, Sung-Joon and Lee, Jimin and Ahn, Sung Soo and Park, Ji Eun and Kim, Hwiyoung},
  booktitle = {Winter Conference on Applications of Computer Vision},
  year      = {2024},
  pages     = {7936-7945},
  url       = {https://mlanthology.org/wacv/2024/kim2024wacv-controllable/}
}