SpeechHGT: A Multimodal Hypergraph Transformer for Speech-Based Early Alzheimer's Disease Detection
Abstract
Early detection of Alzheimer's disease (AD) through spontaneous speech analysis represents a promising, non-invasive diagnostic approach. Existing methods predominantly rely on fusion-based multimodal deep learning, effectively integrating linguistic and acoustic features. However, these methods inadequately model higher-order interactions between modalities, reducing diagnostic accuracy. To address this, we introduce SpeechHGT, a multimodal hypergraph transformer designed to capture and learn higher-order interactions in spontaneous speech features. SpeechHGT encodes multimodal features as hypergraphs, where nodes represent individual features and hyperedges represent grouped interactions. A novel hypergraph attention mechanism enables robust modeling of both pairwise and higher-order interactions. Experimental evaluations on the DementiaBank datasets reveal that SpeechHGT achieves state-of-the-art performance, surpassing baseline models in accuracy and F1 score. These results highlight the potential of hypergraph-based models to improve AI-driven diagnostic tools for early AD detection.
Cite
Text
Abid et al. "SpeechHGT: A Multimodal Hypergraph Transformer for Speech-Based Early Alzheimer's Disease Detection." International Joint Conference on Artificial Intelligence, 2025. doi:10.24963/IJCAI.2025/1015Markdown
[Abid et al. "SpeechHGT: A Multimodal Hypergraph Transformer for Speech-Based Early Alzheimer's Disease Detection." International Joint Conference on Artificial Intelligence, 2025.](https://mlanthology.org/ijcai/2025/abid2025ijcai-speechhgt/) doi:10.24963/IJCAI.2025/1015BibTeX
@inproceedings{abid2025ijcai-speechhgt,
title = {{SpeechHGT: A Multimodal Hypergraph Transformer for Speech-Based Early Alzheimer's Disease Detection}},
author = {Abid, Shagufta and Zhang, Dongyu and Shehzad, Ahsan and Ren, Jing and Yu, Shuo and Lin, Hongfei and Xia, Feng},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2025},
pages = {9131-9139},
doi = {10.24963/IJCAI.2025/1015},
url = {https://mlanthology.org/ijcai/2025/abid2025ijcai-speechhgt/}
}