TinyNeRF: Towards 100x Compression of Voxel Radiance Fields
DOI: https://doi.org/10.1609/aaai.v37i3.25469
Keywords: CV: 3D Computer Vision, CV: Computational Photography, Image & Video Synthesis, CV: Learning & Optimization for CV, ML: Learning on the Edge & Model Compression
Abstract
Voxel grid representations of 3D scene properties have been widely used to improve the training or rendering speed of Neural Radiance Fields (NeRF) while still achieving high synthesis quality. However, these methods accelerate the original NeRF at the expense of extra storage, which hinders their application in many scenarios. To address this limitation, we present TinyNeRF, a three-stage pipeline of frequency-domain transformation, pruning, and quantization that together reduce the storage demand of the voxel grids with little to no effect on their speed and synthesis quality. Based on the prior knowledge that visual signals are sparse in the frequency domain, we convert the original voxel grids into the frequency domain via block-wise discrete cosine transformation (DCT). Next, we apply pruning and quantization to enforce sparse and low-bit DCT coefficients. Our method can be optimized from scratch in an end-to-end manner and typically compresses the original models by two orders of magnitude with minimal sacrifice in speed and synthesis quality.
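As a rough illustration of the pipeline the abstract describes, the sketch below applies a block-wise 3D DCT to a voxel grid, prunes the smallest-magnitude coefficients, and uniformly quantizes the survivors before reconstructing the grid. The function name compress_voxel_grid and the block size, keep ratio, and bit width are illustrative assumptions, not the authors' implementation; in particular, the paper optimizes these representations end-to-end from scratch rather than compressing a trained grid post hoc.

```python
# Illustrative sketch only (assumed hyperparameters, not the TinyNeRF code):
# block-wise 3D DCT of a voxel grid, magnitude pruning of the coefficients,
# and uniform low-bit quantization, followed by reconstruction.
import numpy as np
from scipy.fft import dctn, idctn

def compress_voxel_grid(grid, block=8, keep_ratio=0.05, bits=8):
    """Compress a cubic voxel grid whose side length is divisible by `block`."""
    d = grid.shape[0]
    recon = np.zeros_like(grid)
    for x in range(0, d, block):
        for y in range(0, d, block):
            for z in range(0, d, block):
                blk = grid[x:x+block, y:y+block, z:z+block]
                coef = dctn(blk, norm='ortho')  # frequency-domain transform
                # Pruning: keep only the largest-magnitude DCT coefficients.
                k = max(1, int(keep_ratio * coef.size))
                thresh = np.partition(np.abs(coef).ravel(), -k)[-k]
                coef[np.abs(coef) < thresh] = 0.0
                # Quantization: uniform low-bit quantization of the survivors.
                scale = max(np.abs(coef).max(), 1e-8) / (2 ** (bits - 1) - 1)
                q = np.round(coef / scale).astype(np.int32)
                recon[x:x+block, y:y+block, z:z+block] = idctn(q * scale, norm='ortho')
    return recon

# Example: a random 64^3 density grid reduced to ~5% non-zero 8-bit coefficients.
grid = np.random.rand(64, 64, 64).astype(np.float32)
approx = compress_voxel_grid(grid)
print("reconstruction MSE:", float(np.mean((grid - approx) ** 2)))
```

Only the sparse integer coefficients (plus block indices and per-block scales) would need to be stored, which is where the order-of-magnitude storage reduction comes from.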
Published
2023-06-26
How to Cite
Zhao, T., Chen, J., Leng, C., & Cheng, J. (2023). TinyNeRF: Towards 100x Compression of Voxel Radiance Fields. Proceedings of the AAAI Conference on Artificial Intelligence, 37(3), 3588-3596. https://doi.org/10.1609/aaai.v37i3.25469
Issue
Vol. 37 No. 3 (2023): Proceedings of the AAAI Conference on Artificial Intelligence
Section
AAAI Technical Track on Computer Vision III