Sync-NeRF: Generalizing Dynamic NeRFs to Unsynchronized Videos

Authors

  • Seoha Kim Yonsei University
  • Jeongmin Bae Yonsei University
  • Youngsik Yun Yonsei University
  • Hahyun Lee Electronics and Telecommunications Research Institute
  • Gun Bang Electronics and Telecommunications Research Institute
  • Youngjung Uh Yonsei University

DOI:

https://doi.org/10.1609/aaai.v38i3.28057

Keywords:

CV: 3D Computer Vision, CV: Computational Photography, Image & Video Synthesis, General

Abstract

Recent advancements in 4D scene reconstruction using neural radiance fields (NeRF) have demonstrated the ability to represent dynamic scenes from multi-view videos. However, they fail to reconstruct the dynamic scenes and struggle to fit even the training views in unsynchronized settings. It happens because they employ a single latent embedding for a frame while the multi-view images at the same frame were actually captured at different moments. To address this limitation, we introduce time offsets for individual unsynchronized videos and jointly optimize the offsets with NeRF. By design, our method is applicable for various baselines and improves them with large margins. Furthermore, finding the offsets always works as synchronizing the videos without manual effort. Experiments are conducted on the common Plenoptic Video Dataset and a newly built Unsynchronized Dynamic Blender Dataset to verify the performance of our method. Project page: https://seoha-kim.github.io/sync-nerf

Published

2024-03-24

How to Cite

Kim, S., Bae, J., Yun, Y., Lee, H., Bang, G., & Uh, Y. (2024). Sync-NeRF: Generalizing Dynamic NeRFs to Unsynchronized Videos. Proceedings of the AAAI Conference on Artificial Intelligence, 38(3), 2777-2785. https://doi.org/10.1609/aaai.v38i3.28057

Issue

Section

AAAI Technical Track on Computer Vision II