FPRF: Feed-Forward Photorealistic Style Transfer of Large-Scale 3D Neural Radiance Fields
DOI:
https://doi.org/10.1609/aaai.v38i3.28054
Keywords:
CV: 3D Computer Vision, CV: Applications, CV: Computational Photography, Image & Video Synthesis, CV: Scene Analysis & Understanding
Abstract
We present FPRF, a feed-forward photorealistic style transfer method for large-scale 3D neural radiance fields. FPRF stylizes large-scale 3D scenes with arbitrary, multiple style reference images without additional optimization while preserving multi-view appearance consistency. Prior methods required tedious per-style or per-scene optimization and were limited to small-scale 3D scenes. FPRF efficiently stylizes large-scale 3D scenes by introducing a style-decomposed 3D neural radiance field, which inherits AdaIN’s feed-forward stylization machinery and thereby supports arbitrary style reference images. Furthermore, FPRF supports multi-reference stylization through semantic correspondence matching and local AdaIN, which gives users diverse control over 3D scene styles. FPRF also preserves multi-view consistency by applying the semantic matching and style transfer processes directly to queried features in 3D space. In experiments, we demonstrate that FPRF achieves favorable photorealistic 3D scene stylization quality for large-scale scenes with diverse reference images.
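For readers unfamiliar with AdaIN, the following is a minimal sketch of adaptive instance normalization applied to per-point feature vectors. It is not the authors' implementation: the function name `adain`, the list-of-vectors feature layout, and the `eps` stabilizer are assumptions made for illustration. It only shows the standard AdaIN operation, which shifts and scales content features so that their per-channel mean and standard deviation match those of the style features. The local AdaIN mentioned in the abstract would presumably compute such statistics per semantically matched region rather than globally, which is likewise an assumption here.

```python
from statistics import fmean, pstdev


def adain(content, style, eps=1e-5):
    """Adaptive instance normalization on per-channel feature statistics.

    content: list of N feature vectors, each with C channels
             (hypothetical stand-in for features queried at 3D points)
    style:   list of M feature vectors, each with C channels
             (hypothetical stand-in for style reference features)
    Returns content features re-normalized to the style's per-channel
    mean and standard deviation.
    """
    channels = len(content[0])
    out = [[0.0] * channels for _ in content]
    for c in range(channels):
        content_col = [f[c] for f in content]
        style_col = [f[c] for f in style]
        mu_c, sigma_c = fmean(content_col), pstdev(content_col)
        mu_s, sigma_s = fmean(style_col), pstdev(style_col)
        for i, value in enumerate(content_col):
            # Normalize with content statistics, then apply style statistics.
            out[i][c] = sigma_s * (value - mu_c) / (sigma_c + eps) + mu_s
    return out


# Toy example: three content feature vectors, two style feature vectors.
content_feats = [[0.0, 1.0], [2.0, 3.0], [4.0, 5.0]]
style_feats = [[10.0, -1.0], [14.0, 1.0]]
stylized = adain(content_feats, style_feats)
```

Because the operation needs only feature statistics from the reference, a new style can be applied in a single forward pass, which is the property the abstract refers to as feed-forward stylization.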
Published
2024-03-24
How to Cite
Kim, G., Youwang, K., & Oh, T.-H. (2024). FPRF: Feed-Forward Photorealistic Style Transfer of Large-Scale 3D Neural Radiance Fields. Proceedings of the AAAI Conference on Artificial Intelligence, 38(3), 2750–2758. https://doi.org/10.1609/aaai.v38i3.28054
Section
AAAI Technical Track on Computer Vision II