A Robust Mutual-Reinforcing Framework for 3D Multi-Modal Medical Image Fusion Based on Visual-Semantic Consistency

Abstract

This work proposes a robust 3D medical image fusion framework that establishes a mutually reinforcing mechanism between visual fusion and lesion segmentation, improving both tasks. Specifically, we explore the consistency between vision and semantics by sharing feature fusion modules. Through coupled optimization of the visual fusion loss and the lesion segmentation loss, visual-related and semantic-related features are pulled into the same domain, so that each task improves the accuracy of the other. Further, we establish robustness guarantees by constructing a two-level refinement constraint in the feature extraction and reconstruction processes. By explicitly accounting for common degradations in medical images, our framework not only provides clear visual fusion results for doctors' observation, but also strengthens lesion segmentation against these degradations. Extensive evaluations in visual fusion and lesion segmentation scenarios demonstrate the advantages of our method in terms of accuracy and robustness. Moreover, the proposed framework is generic: it is compatible with existing lesion segmentation algorithms and improves their performance. The code is publicly available at https://github.com/HaoZhang1018/RMR-Fusion.
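
The coupled optimization mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the actual loss terms, so everything below is an assumption made for illustration: an L1 intensity term against the voxel-wise maximum of the two sources stands in for the visual fusion loss, a soft Dice term stands in for the lesion segmentation loss, and the names `l1_fusion_loss`, `dice_loss`, `coupled_loss`, and `seg_weight` are hypothetical. Volumes are flattened to plain lists of floats to keep the example dependency-free; it is not the authors' implementation.

```python
from typing import Sequence


def l1_fusion_loss(fused: Sequence[float],
                   source_a: Sequence[float],
                   source_b: Sequence[float]) -> float:
    """Mean absolute deviation of the fused volume from the voxel-wise
    maximum of the two source volumes (an assumed stand-in fusion loss)."""
    target = [max(a, b) for a, b in zip(source_a, source_b)]
    return sum(abs(f - t) for f, t in zip(fused, target)) / len(fused)


def dice_loss(pred: Sequence[float],
              mask: Sequence[float],
              eps: float = 1e-6) -> float:
    """Soft Dice loss between predicted lesion probabilities and a binary
    mask (an assumed stand-in segmentation loss)."""
    intersection = sum(p * m for p, m in zip(pred, mask))
    denominator = sum(pred) + sum(mask)
    return 1.0 - (2.0 * intersection + eps) / (denominator + eps)


def coupled_loss(fused: Sequence[float],
                 source_a: Sequence[float],
                 source_b: Sequence[float],
                 pred: Sequence[float],
                 mask: Sequence[float],
                 seg_weight: float = 1.0) -> float:
    """Weighted sum of the two losses. Minimizing this single scalar sends
    gradients from both tasks through whatever modules they share."""
    visual = l1_fusion_loss(fused, source_a, source_b)
    semantic = dice_loss(pred, mask)
    return visual + seg_weight * semantic
```

In a trainable setting, both `fused` and `pred` would be produced from the same shared feature fusion modules, so a single backward pass through `coupled_loss` would update those modules with signals from both tasks. The weight `seg_weight` is a hypothetical balancing hyperparameter.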

Cite

Text

Zhang et al. "A Robust Mutual-Reinforcing Framework for 3D Multi-Modal Medical Image Fusion Based on Visual-Semantic Consistency." AAAI Conference on Artificial Intelligence, 2024. doi:10.1609/AAAI.V38I7.28536

Markdown

[Zhang et al. "A Robust Mutual-Reinforcing Framework for 3D Multi-Modal Medical Image Fusion Based on Visual-Semantic Consistency." AAAI Conference on Artificial Intelligence, 2024.](https://mlanthology.org/aaai/2024/zhang2024aaai-robust-b/) doi:10.1609/AAAI.V38I7.28536

BibTeX

@inproceedings{zhang2024aaai-robust-b,
  title     = {{A Robust Mutual-Reinforcing Framework for 3D Multi-Modal Medical Image Fusion Based on Visual-Semantic Consistency}},
  author    = {Zhang, Hao and Zuo, Xuhui and Zhou, Huabing and Lu, Tao and Ma, Jiayi},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {7087-7095},
  doi       = {10.1609/AAAI.V38I7.28536},
  url       = {https://mlanthology.org/aaai/2024/zhang2024aaai-robust-b/}
}