DeNC++: Efficient Diffusion-Enhanced Neural Codec for End-to-end Semantic Streaming at the Edge

Authors

  • Qihua Zhou College of Computer Science and Software Engineering, Shenzhen University
  • Wangjiang Gong Hong Kong University of Science and Technology
  • Zili Meng Hong Kong University of Science and Technology
  • Yaxiong Xie State University of New York at Buffalo
  • Yaodong Huang College of Computer Science and Software Engineering, Shenzhen University
  • Junchen Jiang University of Chicago
  • Laizhong Cui College of Computer Science and Software Engineering, Shenzhen University

DOI:

https://doi.org/10.1609/aaai.v40i34.40135

Abstract

The neural-enhanced video streaming (NeVS) has been an emerging technique to integrate neural models into video codecs for higher streaming efficiency. The state-of-the-art methods, e.g., DeNC and Gemino, typically compress videos in RGB space and restore video quality via a neural enhancement model hosted on the external media server. However, these methods are not always accessible in resource-constrained edge environments due to their heavy reliance on the media server's computation, which undermines end-to-end performance and restricts NeVS's usage boundary. This limitation raises an interesting question: is it possible to make NeVS lightweight so that all neural codec operations can be handled directly by clients' edge devices? In this paper, we present the answer yes and develop a new plug-and-play module called DeNC++, which significantly improves the compression-restoration-overhead trade-off over existing methods. Our core design philosophy is to wrap all the codec operations within a latent semantic space, in which the original high-dimensional visual signals are efficiently embedded into low-dimensional semantic representations. With this fundamental transformation, DeNC++'s neural encoder introduces the triple semantic-bitwidth-resolution compression to effectively lower the streaming traffic. Meanwhile, we make DeNC++'s neural decoder aware of the perceptual loss caused by its encoder and design tiny generative models to guarantee high restoration quality. We also strictly restrict the runtime computational overhead and accelerate the neural enhancement process, making DeNC++ compatible with commodity edge devices. Real-world evaluations reveal that DeNC++ consistently provides higher restoration quality while achieving 24-55 times higher compression ratio and 5-7 times end-to-end speedup over the latest NeVS solutions.

Published

2026-03-14

How to Cite

Zhou, Q., Gong, W., Meng, Z., Xie, Y., Huang, Y., Jiang, J., & Cui, L. (2026). DeNC++: Efficient Diffusion-Enhanced Neural Codec for End-to-end Semantic Streaming at the Edge. Proceedings of the AAAI Conference on Artificial Intelligence, 40(34), 28991-28999. https://doi.org/10.1609/aaai.v40i34.40135

Issue

Section

AAAI Technical Track on Machine Learning XI