Multi-Modal Latent Variables for Cross-Individual Primary Visual Cortex Modeling and Analysis

Yu Zhu; Bo Lei; Chunfeng Song; Wanli Ouyang; Shan Yu; Tiejun Huang

doi:10.1609/aaai.v39i1.32111

Authors

Yu Zhu Institute of Automation, Chinese Academy of Science; School of Artifcial Intelligence, University of Chinese Academy of Sciences
Bo Lei Beijing Academy of Artificial Intelligence
Chunfeng Song Shanghai Artificial Intelligence Laboratory
Wanli Ouyang Shanghai Artificial Intelligence Laboratory
Shan Yu Institute of Automation, Chinese Academy of Science
Tiejun Huang Beijing Academy of Artificial Intelligence

DOI:

https://doi.org/10.1609/aaai.v39i1.32111

Abstract

Elucidating the functional mechanisms of the primary visual cortex (V1) remains a fundamental challenge in systems neuroscience. Current computational models face two critical limitations, namely the challenge of cross-modal integration between partial neural recordings and complex visual stimuli, and the inherent variability in neural characteristics across individuals, including differences in neuron populations and firing patterns. To address these challenges, we present a multi-modal identifiable variational autoencoder (miVAE) that employs a two-level disentanglement strategy to map neural activity and visual stimuli into a unified latent space. This framework enables robust identification of cross-modal correlations through refined latent space modeling. We complement this with a novel score-based attribution analysis that traces latent variables back to their origins in the source data space. Evaluation on a large-scale mouse V1 dataset demonstrates that our method achieves state-of-the-art performance in cross-individual latent representation and alignment, without requiring subject-specific fine-tuning, and exhibits improved performance with increasing data size. Significantly, our attribution algorithm successfully identifies distinct neuronal subpopulations characterized by unique temporal patterns and stimulus discrimination properties, while simultaneously revealing stimulus regions that show specific sensitivity to edge features and luminance variations. This scalable framework offers promising applications not only for advancing V1 research but also for broader investigations in neuroscience.

Multi-Modal Latent Variables for Cross-Individual Primary Visual Cortex Modeling and Analysis

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information