Enhancing Cross-Subject fMRI-to-Video Decoding with Global-Local Functional Alignment

Li, Chong; Qian, Xuelin; Wang, Yun; Huo, Jingyang; Xue, Xiangyang; Fu, Yanwei; Feng, Jianfeng

doi:10.1007/978-3-031-73010-8_21

Enhancing Cross-Subject fMRI-to-Video Decoding with Global-Local Functional Alignment

Chong Li, Xuelin Qian, Yun Wang, Jingyang Huo, Xiangyang Xue, Yanwei Fu, Jianfeng Feng

ECCV 2024

doi:10.1007/978-3-031-73010-8_21 /eccv/2024/li2024eccv-enhancing/

Abstract

Advancements in brain imaging enable the decoding of thoughts and intentions from neural activities. However, the fMRI-to-video decoding of brain signals across multiple subjects encounters challenges arising from structural and coding disparities among individual brains, further compounded by the scarcity of paired fMRI-stimulus data. Addressing this issue, this paper introduces the fMRI Global-Local Functional Alignment (GLFA) projection, a novel approach that aligns fMRI frames from diverse subjects into a unified brain space, thereby enhancing cross-subject decoding. Additionally, we present a meticulously curated fMRI-video paired dataset comprising a total of 75k fMRI-stimulus paired samples from 8 individuals. This dataset is approximately 4.5 times larger than the previous benchmark dataset. Building on this, we augment a transformer-based fMRI encoder with a diffusion video generator, delving into the realm of cross-subject fMRI-based video reconstruction. This innovative methodology faithfully captures semantic information from diverse brain signals, resulting in the generation of vivid videos and achieving an impressive average accuracy of 84.7% in cross-subject semantic classification tasks.

PDF ECCV Semantic Scholar

Cite

Text

Li et al. "Enhancing Cross-Subject fMRI-to-Video Decoding with Global-Local Functional Alignment." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-73010-8_21

Markdown

[Li et al. "Enhancing Cross-Subject fMRI-to-Video Decoding with Global-Local Functional Alignment." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/li2024eccv-enhancing/) doi:10.1007/978-3-031-73010-8_21

BibTeX

@inproceedings{li2024eccv-enhancing,
  title     = {{Enhancing Cross-Subject fMRI-to-Video Decoding with Global-Local Functional Alignment}},
  author    = {Li, Chong and Qian, Xuelin and Wang, Yun and Huo, Jingyang and Xue, Xiangyang and Fu, Yanwei and Feng, Jianfeng},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2024},
  doi       = {10.1007/978-3-031-73010-8_21},
  url       = {https://mlanthology.org/eccv/2024/li2024eccv-enhancing/}
}