PNVC: Towards Practical INR-based Video Compression

Authors

  • Ge Gao University of Bristol
  • Ho Man Kwan University of Bristol
  • Fan Zhang University of Bristol
  • David Bull University of Bristol

DOI:

https://doi.org/10.1609/aaai.v39i3.32315

Abstract

Neural video compression has recently demonstrated significant potential to compete with conventional video codecs in terms of rate-quality performance. These learned video codecs are however associated with various issues related to decoding complexity (for autoencoder-based methods) and/or system delays (for implicit neural representation (INR) based models), which currently prevent them from being deployed in practical applications. In this paper, targeting a practical neural video codec, we propose a novel INR-based coding framework, PNVC, which innovatively combines autoencoder-based and overfitted solutions. Our approach benefits from several design innovations, including a new structural reparameterization-based architecture, hierarchical quality control, modulation-based entropy modeling, and scale-aware positional embedding. Supporting both low delay (LD) and random access (RA) configurations, PNVC outperforms existing INR-based codecs, achieving nearly 35%+ BD-rate savings against HEVC HM 18.0 (LD) - almost 10% more compared to one of the state-of-the-art INR-based codecs, HiNeRV and 5% more over VTM 20.0 (LD), while maintaining 20+ FPS decoding speeds for 1080p content. This represents an important step forward for INR-based video coding, moving it towards practical deployment.

Published

2025-04-11

How to Cite

Gao, G., Kwan, H. M., Zhang, F., & Bull, D. (2025). PNVC: Towards Practical INR-based Video Compression. Proceedings of the AAAI Conference on Artificial Intelligence, 39(3), 3068–3076. https://doi.org/10.1609/aaai.v39i3.32315

Issue

Section

AAAI Technical Track on Computer Vision II