PNeRFLoc: Visual Localization with Point-Based Neural Radiance Fields

Authors

  • Boming Zhao State Key Lab of CAD & CG, Zhejiang University
  • Luwei Yang Simon Fraser University
  • Mao Mao State Key Lab of CAD & CG, Zhejiang University
  • Hujun Bao State Key Lab of CAD & CG, Zhejiang University
  • Zhaopeng Cui State Key Lab of CAD & CG, Zhejiang University

DOI:

https://doi.org/10.1609/aaai.v38i7.28576

Keywords:

CV: 3D Computer Vision

Abstract

Due to the ability to synthesize high-quality novel views, Neural Radiance Fields (NeRF) has been recently exploited to improve visual localization in a known environment. However, the existing methods mostly utilize NeRF for data augmentation to improve the regression model training, and their performances on novel viewpoints and appearances are still limited due to the lack of geometric constraints. In this paper, we propose a novel visual localization framework, i.e., PNeRFLoc, based on a unified point-based representation. On one hand, PNeRFLoc supports the initial pose estimation by matching 2D and 3D feature points as traditional structure-based methods; on the other hand, it also enables pose refinement with novel view synthesis using rendering-based optimization. Specifically, we propose a novel feature adaption module to close the gaps between the features for visual localization and neural rendering. To improve the efficacy and efficiency of neural rendering-based optimization, we also developed an efficient rendering-based framework with a warping loss function. Extensive experiments demonstrate that PNeRFLoc performs the best on the synthetic dataset when the 3D NeRF model can be well learned, and significantly outperforms all the NeRF-boosted localization methods with on-par SOTA performance on the real-world benchmark localization datasets. Project webpage: https://zju3dv.github.io/PNeRFLoc/.

Published

2024-03-24

How to Cite

Zhao, B., Yang, L., Mao, M., Bao, H., & Cui, Z. (2024). PNeRFLoc: Visual Localization with Point-Based Neural Radiance Fields. Proceedings of the AAAI Conference on Artificial Intelligence, 38(7), 7450-7459. https://doi.org/10.1609/aaai.v38i7.28576

Issue

Section

AAAI Technical Track on Computer Vision VI