VRVVC: Variable-Rate NeRF-Based Volumetric Video Compression

Qiang Hu; Houqiang Zhong; Zihan Zheng; Xiaoyun Zhang; Zhengxue Cheng; Li Song; Guangtao Zhai; Yanfeng Wang

doi:10.1609/aaai.v39i4.32370

Authors

Qiang Hu Shanghai Jiaotong University
Houqiang Zhong Shanghai Jiaotong University
Zihan Zheng Shanghai Jiao Tong University
Xiaoyun Zhang Shanghai Jiaotong University
Zhengxue Cheng Shanghai Jiaotong University
Li Song Shanghai Jiao Tong University
Guangtao Zhai Shanghai Jiao Tong University
Yanfeng Wang Shanghai Jiao Tong University

DOI:

https://doi.org/10.1609/aaai.v39i4.32370

Abstract

Neural Radiance Field (NeRF)-based volumetric video has revolutionized visual media by delivering photorealistic Free-Viewpoint Video (FVV) experiences that provide audiences with unprecedented immersion and interactivity. However, the substantial data volumes pose significant challenges for storage and transmission. Existing solutions typically optimize NeRF representation and compression independently or focus on a single fixed rate-distortion (RD) tradeoff. In this paper, we propose VRVVC, a novel end-to-end joint optimization variable-rate framework for volumetric video compression that achieves variable bitrates using a single model while maintaining superior RD performance. Specifically, VRVVC introduces a compact tri-plane implicit residual representation for inter-frame modeling of long-duration dynamic scenes, effectively reducing temporal redundancy. We further propose a variable-rate residual representation compression scheme that leverages a learnable quantization and a tiny MLP-based entropy model. This approach enables variable bitrates through the utilization of predefined Lagrange multipliers to manage the quantization error of all latent representations. Finally, we present an end-to-end progressive training strategy combined with a multi-rate-distortion loss function to optimize the entire framework. Extensive experiments demonstrate that VRVVC achieves a wide range of variable bitrates within a single model and surpasses the RD performance of existing methods across various datasets.

VRVVC: Variable-Rate NeRF-Based Volumetric Video Compression

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information