SkySplat: Generalizable 3D Gaussian Splatting from Multi-Temporal Sparse Satellite Images

Authors

  • Xuejun Huang School of Remote Sensing and Information Engineering, Wuhan University, Wuhan 430079, China
  • Xinyi Liu School of Remote Sensing and Information Engineering, Wuhan University, Wuhan 430079, China Technology Innovation Center for Collaborative Applications of Natural Resources Data in GBA, Ministry of Natural Resources, Guangzhou 510075, China Hubei LuoJia Laboratory, Wuhan 430079, China
  • Yi Wan School of Remote Sensing and Information Engineering, Wuhan University, Wuhan 430079, China Technology Innovation Center for Collaborative Applications of Natural Resources Data in GBA, Ministry of Natural Resources, Guangzhou 510075, China Hubei LuoJia Laboratory, Wuhan 430079, China
  • Zhi Zheng Department of Geography and Resource Management, The Chinese University of Hong Kong, Shatin 999077 Hong Kong
  • Bin Zhang China Railway Siyuan Survey and Design Group Co., LTD, Wuhan 430063, China
  • Mingtao Xiong School of Remote Sensing and Information Engineering, Wuhan University, Wuhan 430079, China
  • Yingying Pei School of Remote Sensing and Information Engineering, Wuhan University, Wuhan 430079, China
  • Yongjun Zhang School of Remote Sensing and Information Engineering, Wuhan University, Wuhan 430079, China Technology Innovation Center for Collaborative Applications of Natural Resources Data in GBA, Ministry of Natural Resources, Guangzhou 510075, China Hubei LuoJia Laboratory, Wuhan 430079, China

DOI:

https://doi.org/10.1609/aaai.v40i7.37430

Abstract

Three-dimensional scene reconstruction from sparse-view satellite images is a long-standing and challenging task. While 3D Gaussian Splatting (3DGS) and its variants have recently attracted attention for its high efficiency, existing methods remain unsuitable for satellite images due to incompatibility with rational polynomial coefficient (RPC) models and limited generalization capability. Recent advances in generalizable 3DGS approaches show potential, but they perform poorly on multi-temporal sparse satellite images due to limited geometric constraints, transient objects, and radiometric inconsistencies. To address these limitations, we propose SkySplat, a novel self-supervised framework that integrates the RPC model into the generalizable 3DGS pipeline, enabling more effective use of sparse geometric cues for improved reconstruction. SkySplat relies only on RGB images and radiometric-robust relative height supervision, thereby eliminating the need for ground-truth height maps. Key components include a Cross-Self Consistency Module (CSCM), which mitigates transient object interference via consistency-based masking, and a multi-view consistency aggregation strategy that refines reconstruction results. Compared to per-scene optimization methods, SkySplat achieves an 86 times speedup over EOGS with higher accuracy. It also outperforms generalizable 3DGS baselines, reducing MAE from 13.18 m to 1.80 m on the DFC19 dataset significantly, and demonstrates strong cross-dataset generalization on the MVS3D benchmark.

Downloads

Published

2026-03-14

How to Cite

Huang, X., Liu, X., Wan, Y., Zheng, Z., Zhang, B., Xiong, M., … Zhang, Y. (2026). SkySplat: Generalizable 3D Gaussian Splatting from Multi-Temporal Sparse Satellite Images. Proceedings of the AAAI Conference on Artificial Intelligence, 40(7), 5158–5166. https://doi.org/10.1609/aaai.v40i7.37430

Issue

Section

AAAI Technical Track on Computer Vision IV