A Systematic Comparison of fMRI-to-Video Reconstruction Techniques
Abstract
Recent advances in generative models and large-scale neural datasets have brought forth novel methods to reconstruct stimuli from brain activity. This rapidly evolving family of brain-to-stimuli reconstruction techniques has the opportunity to revolutionize fundamental brain sciences and human-computer interaction applications, yet systemic comparisons of these techniques are lacking. Here, we explore a novel method to reconstruct short videos from functional magnetic resonance imaging (fMRI) brain activity of human subjects that achieves state-of-the-art performance as assessed by a suite of evaluation metrics. We perform preliminary comparisons of reconstruction quality within our pipeline by testing different combinations of semantic encoders and video generation models. Lastly, we compare our pipeline's best reconstruction results with previous work. Together, this work comprehensively assesses state-of-the-art methodologies in the increasingly important discipline of brain-to-video reconstruction.
Cite
Text
Fosco et al. "A Systematic Comparison of fMRI-to-Video Reconstruction Techniques." ICML 2024 Workshops: CVG, 2024.Markdown
[Fosco et al. "A Systematic Comparison of fMRI-to-Video Reconstruction Techniques." ICML 2024 Workshops: CVG, 2024.](https://mlanthology.org/icmlw/2024/fosco2024icmlw-systematic/)BibTeX
@inproceedings{fosco2024icmlw-systematic,
title = {{A Systematic Comparison of fMRI-to-Video Reconstruction Techniques}},
author = {Fosco, Camilo Luciano and Lahner, Ben and Andonian, Alex J and Pan, Bowen and Oliva, Aude},
booktitle = {ICML 2024 Workshops: CVG},
year = {2024},
url = {https://mlanthology.org/icmlw/2024/fosco2024icmlw-systematic/}
}