Reconstruction Using the Invisible: Intuition from NIR and Metadata for Enhanced 3D Gaussian Splatting

Authors

  • Gyusam Chang Korea University University of California, Los Angeles
  • Tuan-Anh Vu University of California, Los Angeles
  • Vivek Alumootil University of California, Los Angeles
  • Harris Song University of California, Los Angeles
  • Deanna Pham University of California, Los Angeles
  • Sangpil Kim Korea University
  • M. Khalid Jawed University of California, Los Angeles

DOI:

https://doi.org/10.1609/aaai.v40i4.37259

Abstract

While 3D Gaussian Splatting (3DGS) has rapidly advanced, its application in agriculture remains underexplored. Agricultural scenes pose unique challenges for 3D reconstruction methods, notably uneven illumination, occlusions, and limited perspectives. To address these limitations, we introduce NTRPlant, a novel multimodal dataset encompassing Near-Infrared (NIR), RGB imagery, textual metadata, Depth, and LiDAR collected under varied indoor and outdoor lighting conditions. By integrating NIR data, our approach enhances robustness and extracts crucial botanical insights beyond visible spectra. Additionally, we leverage text-based metadata derived from vegetation indices, such as NDVI, NDWI, and chlorophyll index, significantly enriching the contextual understanding of complex agricultural environments. To fully exploit these modalities, we propose NIRSplat, an effective multimodal Gaussian splatting architecture employing a cross-attention mechanism combined with 3D point-based positional encoding, providing robust geometric priors. Comprehensive experiments demonstrate that NIRSplat outperforms existing state-of-the-art methods, including 3DGS and InstantSplat, highlighting its effectiveness in challenging agricultural scenarios.

Downloads

Published

2026-03-14

How to Cite

Chang, G., Vu, T.-A., Alumootil, V., Song, H., Pham, D., Kim, S., & Jawed, M. K. (2026). Reconstruction Using the Invisible: Intuition from NIR and Metadata for Enhanced 3D Gaussian Splatting. Proceedings of the AAAI Conference on Artificial Intelligence, 40(4), 2707–2715. https://doi.org/10.1609/aaai.v40i4.37259

Issue

Section

AAAI Technical Track on Computer Vision I