MMPF: Multi-Modal Perception Framework for Abnormal Medical Condition Detection
Abstract
As the global population ages and the incidence of chronic diseases rises, the demand for early detection of abnormal medical conditions continues to grow. Traditional health monitoring methods often require significant resources and specialized personnel, limiting their widespread adoption. Leveraging advancements in AI technologies, this study proposes a non-invasive method for detecting abnormal medical conditions from image data. A multimodal perception framework is introduced, integrating features from various modalities, including facial expressions and body postures, to enhance detection accuracy. The framework employs a Cascaded Squeeze-Excitation (CSE) module, consisting of Adaptive and Multi-modal Squeeze-Excitation components, to capture complex feature dependencies and improve cross-modal performance. Extensive experiments demonstrate the effectiveness of this approach, showing improved performance over existing methods. In addition, a new dataset encompassing a wide range of medical conditions has been released, providing a valuable resource for future research in this domain.
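The CSE module builds on the squeeze-excitation idea of recalibrating feature channels. As a rough orientation, below is a minimal PyTorch sketch of a standard SE block applied to two modality branches (facial-expression and body-posture features). The paper's Adaptive and Multi-modal Squeeze-Excitation components are not detailed in this abstract, so the class names, fusion scheme, and hyperparameters here are illustrative assumptions rather than the authors' implementation.

# Minimal sketch of a standard squeeze-excitation (SE) block and a naive
# two-modality fusion, for orientation only. The paper's Adaptive and
# Multi-modal SE components (the CSE module) are not specified in the
# abstract, so all names and design choices below are assumptions.
import torch
import torch.nn as nn

class SEBlock(nn.Module):
    """Channel-wise squeeze-excitation: global pooling -> bottleneck MLP -> gating."""
    def __init__(self, channels: int, reduction: int = 16):
        super().__init__()
        self.fc = nn.Sequential(
            nn.Linear(channels, channels // reduction),
            nn.ReLU(inplace=True),
            nn.Linear(channels // reduction, channels),
            nn.Sigmoid(),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, channels, height, width)
        weights = self.fc(x.mean(dim=(2, 3)))           # squeeze, then excitation
        return x * weights.unsqueeze(-1).unsqueeze(-1)  # recalibrate channels

class NaiveMultimodalFusion(nn.Module):
    """Hypothetical fusion of face- and posture-branch features via per-branch SE."""
    def __init__(self, channels: int):
        super().__init__()
        self.face_se = SEBlock(channels)
        self.pose_se = SEBlock(channels)

    def forward(self, face_feat: torch.Tensor, pose_feat: torch.Tensor) -> torch.Tensor:
        # Recalibrate each modality, then concatenate along the channel dimension.
        return torch.cat([self.face_se(face_feat), self.pose_se(pose_feat)], dim=1)

In practice, the paper's cascaded design is described as capturing richer cross-modal dependencies than this per-branch gating; the sketch only illustrates the underlying SE mechanism the framework builds on.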
Cite
Text
Zhong et al. "MMPF: Multi-Modal Perception Framework for Abnormal Medical Condition Detection." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I10.33157
Markdown
[Zhong et al. "MMPF: Multi-Modal Perception Framework for Abnormal Medical Condition Detection." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/zhong2025aaai-mmpf/) doi:10.1609/AAAI.V39I10.33157
BibTeX
@inproceedings{zhong2025aaai-mmpf,
title = {{MMPF: Multi-Modal Perception Framework for Abnormal Medical Condition Detection}},
author = {Zhong, Chuyi and Yang, Dingkang and Zhai, Peng and Zhang, Lihua},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2025},
pages = {10653-10661},
doi = {10.1609/AAAI.V39I10.33157},
url = {https://mlanthology.org/aaai/2025/zhong2025aaai-mmpf/}
}