Controllable Mind Visual Diffusion Model

Abstract

Brain signal visualization has emerged as an active research area, serving as a critical interface between the human visual system and computer vision models. Diffusion-based methods have recently shown promise in analyzing functional magnetic resonance imaging (fMRI) data, including the reconstruction of high-quality images consistent with original visual stimuli. Nonetheless, it remains a critical challenge to effectively harness the semantic and silhouette information extracted from brain signals. In this paper, we propose a novel approach, termed as Controllable Mind Visual Diffusion Model (CMVDM). Specifically, CMVDM first extracts semantic and silhouette information from fMRI data using attribute alignment and assistant networks. Then, a control model is introduced in conjunction with a residual block to fully exploit the extracted information for image synthesis, generating high-quality images that closely resemble the original visual stimuli in both semantic content and silhouette characteristics. Through extensive experimentation, we demonstrate that CMVDM outperforms existing state-of-the-art methods both qualitatively and quantitatively. Our code is available at https://github.com/zengbohan0217/CMVDM.

Cite

Text

Zeng et al. "Controllable Mind Visual Diffusion Model." AAAI Conference on Artificial Intelligence, 2024. doi:10.1609/AAAI.V38I7.28519

Markdown

[Zeng et al. "Controllable Mind Visual Diffusion Model." AAAI Conference on Artificial Intelligence, 2024.](https://mlanthology.org/aaai/2024/zeng2024aaai-controllable/) doi:10.1609/AAAI.V38I7.28519

BibTeX

@inproceedings{zeng2024aaai-controllable,
  title     = {{Controllable Mind Visual Diffusion Model}},
  author    = {Zeng, Bohan and Li, Shanglin and Liu, Xuhui and Gao, Sicheng and Jiang, Xiaolong and Tang, Xu and Hu, Yao and Liu, Jianzhuang and Zhang, Baochang},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {6935-6943},
  doi       = {10.1609/AAAI.V38I7.28519},
  url       = {https://mlanthology.org/aaai/2024/zeng2024aaai-controllable/}
}