Multimodal Disease Progression Modeling via Spatiotemporal Disentanglement and Multiscale Alignment

Abstract

Longitudinal multimodal data, including electronic health records (EHR) and sequential chest X-rays (CXRs), is critical for modeling disease progression, yet remains underutilized due to two key challenges: (1) redundancy in consecutive CXR sequences, where static anatomical regions dominate over clinically-meaningful dynamics, and (2) temporal misalignment between sparse, irregular imaging and continuous EHR data. We introduce $\texttt{DiPro}$, a novel framework that addresses these challenges through region-aware disentanglement and multi-timescale alignment. First, we disentangle static (anatomy) and dynamic (pathology progression) features in sequential CXRs, prioritizing disease-relevant changes. Second, we hierarchically align these static and dynamic CXR features with asynchronous EHR data via local (pairwise interval-level) and global (full-sequence) synchronization to model coherent progression pathways. Extensive experiments on the MIMIC dataset demonstrate that $\texttt{DiPro}$ could effectively extract temporal clinical dynamics and achieve state-of-the-art performance on both disease progression identification and general ICU prediction tasks.

Cite

Text

Liu et al. "Multimodal Disease Progression Modeling via Spatiotemporal Disentanglement and Multiscale Alignment." Advances in Neural Information Processing Systems, 2025.

Markdown

[Liu et al. "Multimodal Disease Progression Modeling via Spatiotemporal Disentanglement and Multiscale Alignment." Advances in Neural Information Processing Systems, 2025.](https://mlanthology.org/neurips/2025/liu2025neurips-multimodal/)

BibTeX

@inproceedings{liu2025neurips-multimodal,
  title     = {{Multimodal Disease Progression Modeling via Spatiotemporal Disentanglement and Multiscale Alignment}},
  author    = {Liu, Chen and Yao, Wenfang and Yin, Kejing and Cheung, William K. and Qin, Jing},
  booktitle = {Advances in Neural Information Processing Systems},
  year      = {2025},
  url       = {https://mlanthology.org/neurips/2025/liu2025neurips-multimodal/}
}