Geometry Meets Light: Leveraging Geometric Priors for Universal Photometric Stereo Under Limited Multi-Illumination Cues

Authors

  • King-Man Tam Institute of Science Tokyo
  • Satoshi Ikehata National Institute of Informatics, Denso IT Laboratory
  • Yuta Asano National Institute of Informatics
  • Zhaoyi An Institute of Science Tokyo
  • Rei Kawakami Institute of Science Tokyo

DOI:

https://doi.org/10.1609/aaai.v40i11.37887

Abstract

Universal Photometric Stereo is a promising approach for recovering surface normals without strict lighting assumptions. However, it struggles when multi-illumination cues are unreliable, such as under biased lighting or in shadows or self-occluded regions of complex in-the-wild scenes. We propose GeoUniPS, a universal photometric stereo network that integrates synthetic supervision with high-level geometric priors from large-scale 3D reconstruction models pretrained on massive in-the-wild data. Our key insight is that these 3D reconstruction models serve as visual-geometry foundation models, inherently encoding rich geometric knowledge of real scenes. To leverage this, we design a Light-Geometry Dual-Branch Encoder that extracts both multi-illumination cues and geometric priors from the frozen 3D reconstruction model. We also address the limitations of the conventional orthographic projection assumption by introducing the PS-Perp dataset with realistic perspective projection to enable learning of spatially varying view directions. Extensive experiments demonstrate that GeoUniPS delivers state-of-the-arts performance across multiple datasets, both quantitatively and qualitatively, especially in the complex in-the-wild scenes.

Published

2026-03-14

How to Cite

Tam, K.-M., Ikehata, S., Asano, Y., An, Z., & Kawakami, R. (2026). Geometry Meets Light: Leveraging Geometric Priors for Universal Photometric Stereo Under Limited Multi-Illumination Cues. Proceedings of the AAAI Conference on Artificial Intelligence, 40(11), 9287–9295. https://doi.org/10.1609/aaai.v40i11.37887

Issue

Section

AAAI Technical Track on Computer Vision VIII