Multi-Modal Latent Variables for Cross-Individual Primary Visual Cortex Modeling and Analysis

Authors

  • Yu Zhu Institute of Automation, Chinese Academy of Science; School of Artifcial Intelligence, University of Chinese Academy of Sciences
  • Bo Lei Beijing Academy of Artificial Intelligence
  • Chunfeng Song Shanghai Artificial Intelligence Laboratory
  • Wanli Ouyang Shanghai Artificial Intelligence Laboratory
  • Shan Yu Institute of Automation, Chinese Academy of Science
  • Tiejun Huang Beijing Academy of Artificial Intelligence

DOI:

https://doi.org/10.1609/aaai.v39i1.32111

Abstract

Elucidating the functional mechanisms of the primary visual cortex (V1) remains a fundamental challenge in systems neuroscience. Current computational models face two critical limitations, namely the challenge of cross-modal integration between partial neural recordings and complex visual stimuli, and the inherent variability in neural characteristics across individuals, including differences in neuron populations and firing patterns. To address these challenges, we present a multi-modal identifiable variational autoencoder (miVAE) that employs a two-level disentanglement strategy to map neural activity and visual stimuli into a unified latent space. This framework enables robust identification of cross-modal correlations through refined latent space modeling. We complement this with a novel score-based attribution analysis that traces latent variables back to their origins in the source data space. Evaluation on a large-scale mouse V1 dataset demonstrates that our method achieves state-of-the-art performance in cross-individual latent representation and alignment, without requiring subject-specific fine-tuning, and exhibits improved performance with increasing data size. Significantly, our attribution algorithm successfully identifies distinct neuronal subpopulations characterized by unique temporal patterns and stimulus discrimination properties, while simultaneously revealing stimulus regions that show specific sensitivity to edge features and luminance variations. This scalable framework offers promising applications not only for advancing V1 research but also for broader investigations in neuroscience.

Published

2025-04-11

How to Cite

Zhu, Y., Lei, B., Song, C., Ouyang, W., Yu, S., & Huang, T. (2025). Multi-Modal Latent Variables for Cross-Individual Primary Visual Cortex Modeling and Analysis. Proceedings of the AAAI Conference on Artificial Intelligence, 39(1), 1228–1236. https://doi.org/10.1609/aaai.v39i1.32111

Issue

Section

AAAI Technical Track on Application Domains