Developing Multimodal Healthcare Foundation Model: From Data-Driven to Knowledge-Enhanced

Abstract

Foundation models in general domains have leveraged multimodal knowledge graphs to great effect, yet the healthcare sector lacks such comprehensive structures, presenting a significant gap in current research. Based on previous exploration with pure data-driven approaches, this proposal describes a two-stage project aiming to enhance multimodal healthcare foundation model with domain knowledge. The first stage is to construct a robust multimodal healthcare knowledge graph based on established healthcare taxonomies, such as UMLS, and enriched with data from multimodal clinical databases like MIMIC-CXR. This knowledge graph will incorporate medical images as cross-modal instances linked to healthcare terminologies, enhancing the depth and applicability of the graph. In the second stage, the knowledge graph will serve as a foundational tool in training healthcare foundation models with enhanced capabilities, particularly in reducing hallucination and managing concept ambiguity through the novel use of reinforcement learning techniques like Direct Preference Optimization (DPO). This research is expected to make significant contributions to the domain of healthcare AI by enabling more accurate, reliable, and explainable AI-driven diagnostics and interventions.

Cite

Text

Wang. "Developing Multimodal Healthcare Foundation Model: From Data-Driven to Knowledge-Enhanced." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I28.35230

Markdown

[Wang. "Developing Multimodal Healthcare Foundation Model: From Data-Driven to Knowledge-Enhanced." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/wang2025aaai-developing/) doi:10.1609/AAAI.V39I28.35230

BibTeX

@inproceedings{wang2025aaai-developing,
  title     = {{Developing Multimodal Healthcare Foundation Model: From Data-Driven to Knowledge-Enhanced}},
  author    = {Wang, Xiaochen},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {29305-29306},
  doi       = {10.1609/AAAI.V39I28.35230},
  url       = {https://mlanthology.org/aaai/2025/wang2025aaai-developing/}
}