Single-Stage fMRI-to-3D Reconstruction via Viewpoint-Aware Embedding and Hierarchical Guidance

Authors

  • Xun Zhang Delft University of Technology
  • Weihao Xia University of Cambridge
  • Yulong Liu The Hong Kong University of Science and Technology
  • Bo Yang The Hong Kong Polytechnic University
  • Alessandro Bozzon Delft University of Technology
  • Pan Wang Delft University of Technology

DOI:

https://doi.org/10.1609/aaai.v40i21.38864

Abstract

Understanding the neural basis of three-dimensional (3D) perception is a fundamental objective in cognitive neuroscience. Despite advances in decoding 2D visual stimuli from neural data, reconstructing high-fidelity 3D objects with detailed texture and geometry remains largely unexplored. In this work, we introduce NeuroSculptor3D, the first single-stage, end-to-end framework for reconstructing textured 3D shapes directly from brain activity. NeuroSculptor3D integrates a viewpoint-aware brain embedding module that captures fine-grained spatial variations across visual perspectives, and a hierarchical guidance mechanism that aligns brain-derived features with perceptual, semantic, and structural priors. Together, these components facilitate the generation of consistent multi-view embeddings, which are then decoded via TRELLIS to produce high-quality textured 3D reconstructions. Experiments on the fMRI-Shape dataset demonstrate that NeuroSculptor3D outperforms existing baselines across multiple settings, achieving significant improvements in both structural accuracy and semantic consistency. Code will be released to facilitate further research.

Downloads

Published

2026-03-14

How to Cite

Zhang, X., Xia, W., Liu, Y., Yang, B., Bozzon, A., & Wang, P. (2026). Single-Stage fMRI-to-3D Reconstruction via Viewpoint-Aware Embedding and Hierarchical Guidance. Proceedings of the AAAI Conference on Artificial Intelligence, 40(21), 18037–18045. https://doi.org/10.1609/aaai.v40i21.38864

Issue

Section

AAAI Technical Track on Humans and AI