Cross-Modal Translation and Alignment for Survival Analysis

Abstract

With the rapid advances in high-throughput sequencing technologies, the focus of survival analysis has shifted from examining clinical indicators to incorporating genomic profiles with pathological images. However, existing methods either directly adopt a straightforward fusion of pathological features and genomic profiles for survival prediction, or take genomic profiles as guidance to integrate the features of pathological images. The former would overlook intrinsic cross-modal correlations. The latter would discard pathological information irrelevant to gene expression. To address these issues, we present a Cross-Modal Translation and Alignment (CMTA) framework to explore the intrinsic cross-modal correlations and transfer potential complementary information. Specifically, we construct two parallel encoder-decoder structures for multi-modal data to integrate intra-modal information and generate cross-modal representation. Taking the generated cross-modal representation to enhance and recalibrate intra-modal representation can significantly improve its discrimination for comprehensive survival analysis. To explore the intrinsic cross-modal correlations, we further design a cross-modal attention module as the information bridge between different modalities to perform cross-modal interactions and transfer complementary information. Our extensive experiments on five public TCGA datasets demonstrate that our proposed framework outperforms the state-of-the-art methods. The source code has been released.

Cite

Text

Zhou and Chen. "Cross-Modal Translation and Alignment for Survival Analysis." International Conference on Computer Vision, 2023. doi:10.1109/ICCV51070.2023.01964

Markdown

[Zhou and Chen. "Cross-Modal Translation and Alignment for Survival Analysis." International Conference on Computer Vision, 2023.](https://mlanthology.org/iccv/2023/zhou2023iccv-crossmodal/) doi:10.1109/ICCV51070.2023.01964

BibTeX

@inproceedings{zhou2023iccv-crossmodal,
  title     = {{Cross-Modal Translation and Alignment for Survival Analysis}},
  author    = {Zhou, Fengtao and Chen, Hao},
  booktitle = {International Conference on Computer Vision},
  year      = {2023},
  pages     = {21485-21494},
  doi       = {10.1109/ICCV51070.2023.01964},
  url       = {https://mlanthology.org/iccv/2023/zhou2023iccv-crossmodal/}
}