TinyNeRF: Towards 100 x Compression of Voxel Radiance Fields

Authors

  • Tianli Zhao School of Artifcial Intelligence, University of Chinese Academy of Sciences. Beijing, China Institute of Automation, Chinese Academy of Sciences, Beijing, China AIRIA. Nanjing, China Maicro.ai. Nanjing, China
  • Jiayuan Chen AIRIA. Nanjing, China Maicro.ai. Nanjing, China Southeast University. Nanjing, China
  • Cong Leng Institute of Automation, Chinese Academy of Sciences, Beijing, China AIRIA. Nanjing, China Maicro.ai. Nanjing, China
  • Jian Cheng Institute of Automation, Chinese Academy of Sciences, Beijing, China AIRIA. Nanjing, China Maicro.ai. Nanjing, China

DOI:

https://doi.org/10.1609/aaai.v37i3.25469

Keywords:

CV: 3D Computer Vision, CV: Computational Photography, Image & Video Synthesis, CV: Learning & Optimization for CV, ML: Learning on the Edge & Model Compression

Abstract

Voxel grid representation of 3D scene properties has been widely used to improve the training or rendering speed of the Neural Radiance Fields (NeRF) while at the same time achieving high synthesis quality. However, these methods accelerate the original NeRF at the expense of extra storage demand, which hinders their applications in many scenarios. To solve this limitation, we present TinyNeRF, a three-stage pipeline: frequency domain transformation, pruning and quantization that work together to reduce the storage demand of the voxel grids with little to no effects on their speed and synthesis quality. Based on the prior knowledge of visual signals sparsity in the frequency domain, we convert the original voxel grids in the frequency domain via block-wise discrete cosine transformation (DCT). Next, we apply pruning and quantization to enforce the DCT coefficients to be sparse and low-bit. Our method can be optimized from scratch in an end-to-end manner, and can typically compress the original models by 2 orders of magnitude with minimal sacrifice on speed and synthesis quality.

Downloads

Published

2023-06-26

How to Cite

Zhao, T., Chen, J., Leng, C., & Cheng, J. (2023). TinyNeRF: Towards 100 x Compression of Voxel Radiance Fields. Proceedings of the AAAI Conference on Artificial Intelligence, 37(3), 3588-3596. https://doi.org/10.1609/aaai.v37i3.25469

Issue

Section

AAAI Technical Track on Computer Vision III