Enhancing Cross-Subject fMRI-to-Video Decoding with Global-Local Functional Alignment
Abstract
Advancements in brain imaging enable the decoding of thoughts and intentions from neural activities. However, the fMRI-to-video decoding of brain signals across multiple subjects encounters challenges arising from structural and coding disparities among individual brains, further compounded by the scarcity of paired fMRI-stimulus data. Addressing this issue, this paper introduces the fMRI Global-Local Functional Alignment (GLFA) projection, a novel approach that aligns fMRI frames from diverse subjects into a unified brain space, thereby enhancing cross-subject decoding. Additionally, we present a meticulously curated fMRI-video paired dataset comprising a total of 75k fMRI-stimulus paired samples from 8 individuals. This dataset is approximately 4.5 times larger than the previous benchmark dataset. Building on this, we augment a transformer-based fMRI encoder with a diffusion video generator, delving into the realm of cross-subject fMRI-based video reconstruction. This innovative methodology faithfully captures semantic information from diverse brain signals, resulting in the generation of vivid videos and achieving an impressive average accuracy of 84.7% in cross-subject semantic classification tasks.
Cite
Text
Li et al. "Enhancing Cross-Subject fMRI-to-Video Decoding with Global-Local Functional Alignment." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-73010-8_21Markdown
[Li et al. "Enhancing Cross-Subject fMRI-to-Video Decoding with Global-Local Functional Alignment." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/li2024eccv-enhancing/) doi:10.1007/978-3-031-73010-8_21BibTeX
@inproceedings{li2024eccv-enhancing,
title = {{Enhancing Cross-Subject fMRI-to-Video Decoding with Global-Local Functional Alignment}},
author = {Li, Chong and Qian, Xuelin and Wang, Yun and Huo, Jingyang and Xue, Xiangyang and Fu, Yanwei and Feng, Jianfeng},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
year = {2024},
doi = {10.1007/978-3-031-73010-8_21},
url = {https://mlanthology.org/eccv/2024/li2024eccv-enhancing/}
}