PHOTONS: Pose-Free Human-Centric Photo-Realistic Real-Time Novel View Synthesis from Sparse Views

Authors

  • Yongyang Cheng International Business Division, China Telecom Cloud Technology Co., Ltd.
  • Boqin Qin International Business Division, China Telecom Cloud Technology Co., Ltd.
  • Zhao Hui International Business Division, China Telecom Cloud Technology Co., Ltd.
  • Xu Chen International Business Division, China Telecom Cloud Technology Co., Ltd.
  • Tao Zhang International Business Division, China Telecom Cloud Technology Co., Ltd.
  • Shang Sun International Business Division, China Telecom Cloud Technology Co., Ltd.
  • Haiquan Kang International Business Division, China Telecom Cloud Technology Co., Ltd.
  • Xiaojie Xu International Business Division, China Telecom Cloud Technology Co., Ltd.
  • Junwei Lv International Business Division, China Telecom Cloud Technology Co., Ltd.
  • Lei Yang International Business Division, China Telecom Cloud Technology Co., Ltd.
  • Xinyu Liu International Business Division, China Telecom Cloud Technology Co., Ltd.
  • Feng Jiang International Business Division, China Telecom Cloud Technology Co., Ltd.

DOI:

https://doi.org/10.1609/aaai.v40i48.42340

Abstract

We present PHOTONS (Pose-Free Human-Centric Photo-Realistic Real-Time Novel View Synthesis from Sparse Views), a real-time framework for novel view synthesis without requiring camera calibration. Our method reconstructs consistent 3D Gaussian point clouds and synthesizes 2K photo-realistic novel views from arbitrary numbers (>=2) of freely placed cameras. PHOTONS faithfully renders dynamic human bodies amid complex backgrounds, including interactive object manipulation and fine-grained details (e.g., hair strands), while maintaining 25 FPS throughput on commodity GPU like NVIDIA RTX 4090. By combining pose-free spatial point cloud reconstruction with Gaussian parameter estimation, our method demonstrates strong resilience to occlusions and camera perturbations. Additionally, we develop a 3D stereo system that drastically reduces setup complexity compared to existing solutions. Experiments on public and custom datasets show that PHOTONS outperforms state-of-the-art methods in both efficiency and visual quality.

Downloads

Published

2026-03-14

How to Cite

Cheng, Y., Qin, B., Hui, Z., Chen, X., Zhang, T., Sun, S., … Jiang, F. (2026). PHOTONS: Pose-Free Human-Centric Photo-Realistic Real-Time Novel View Synthesis from Sparse Views. Proceedings of the AAAI Conference on Artificial Intelligence, 40(48), 41562–41564. https://doi.org/10.1609/aaai.v40i48.42340