Multi-Modal Latent Variables for Cross-Individual Primary Visual Cortex Modeling and Analysis

Abstract

Elucidating the functional mechanisms of the primary visual cortex (V1) remains a fundamental challenge in systems neuroscience. Current computational models face two critical limitations, namely the challenge of cross-modal integration between partial neural recordings and complex visual stimuli, and the inherent variability in neural characteristics across individuals, including differences in neuron populations and firing patterns. To address these challenges, we present a multi-modal identifiable variational autoencoder (miVAE) that employs a two-level disentanglement strategy to map neural activity and visual stimuli into a unified latent space. This framework enables robust identification of cross-modal correlations through refined latent space modeling. We complement this with a novel score-based attribution analysis that traces latent variables back to their origins in the source data space. Evaluation on a large-scale mouse V1 dataset demonstrates that our method achieves state-of-the-art performance in cross-individual latent representation and alignment, without requiring subject-specific fine-tuning, and exhibits improved performance with increasing data size. Significantly, our attribution algorithm successfully identifies distinct neuronal subpopulations characterized by unique temporal patterns and stimulus discrimination properties, while simultaneously revealing stimulus regions that show specific sensitivity to edge features and luminance variations. This scalable framework offers promising applications not only for advancing V1 research but also for broader investigations in neuroscience.

Cite

Text

Zhu et al. "Multi-Modal Latent Variables for Cross-Individual Primary Visual Cortex Modeling and Analysis." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I1.32111

Markdown

[Zhu et al. "Multi-Modal Latent Variables for Cross-Individual Primary Visual Cortex Modeling and Analysis." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/zhu2025aaai-multi/) doi:10.1609/AAAI.V39I1.32111

BibTeX

@inproceedings{zhu2025aaai-multi,
  title     = {{Multi-Modal Latent Variables for Cross-Individual Primary Visual Cortex Modeling and Analysis}},
  author    = {Zhu, Yu and Lei, Bo and Song, Chunfeng and Ouyang, Wanli and Yu, Shan and Huang, Tiejun},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {1228-1236},
  doi       = {10.1609/AAAI.V39I1.32111},
  url       = {https://mlanthology.org/aaai/2025/zhu2025aaai-multi/}
}