Learned Image Transmission with Hierarchical Variational Autoencoder

Authors

  • Guangyi Zhang Zhejiang University
  • Hanlei Li Zhejiang University
  • Yunlong Cai Zhejiang University
  • Qiyu Hu Zhejiang University
  • Guanding Yu Zhejiang University
  • Runmin Zhang Zhejiang University

DOI:

https://doi.org/10.1609/aaai.v39i12.33442

Abstract

In this paper, we introduce an innovative hierarchical joint source-channel coding (HJSCC) framework for image transmission, utilizing a hierarchical variational autoencoder (VAE). Our approach leverages a combination of bottom-up and top-down paths at the transmitter to autoregressively generate multiple hierarchical representations of the original image. These representations are then directly mapped to channel symbols for transmission by the JSCC encoder. We extend this framework to scenarios with a feedback link, modeling transmission over a noisy channel as a probabilistic sampling process and deriving a novel generative formulation for JSCC with feedback. Compared with existing approaches, our proposed HJSCC provides enhanced adaptability by dynamically adjusting transmission bandwidth, encoding these representations into varying amounts of channel symbols. Additionally, we introduce a rate attention module to guide the JSCC encoder in optimizing its encoding strategy based on prior information. Extensive experiments on images of varying resolutions demonstrate that our proposed model outperforms existing baselines in rate-distortion performance and maintains robustness against channel noise.

Downloads

Published

2025-04-11

How to Cite

Zhang, G., Li, H., Cai, Y., Hu, Q., Yu, G., & Zhang, R. (2025). Learned Image Transmission with Hierarchical Variational Autoencoder. Proceedings of the AAAI Conference on Artificial Intelligence, 39(12), 13215–13223. https://doi.org/10.1609/aaai.v39i12.33442

Issue

Section

AAAI Technical Track on Data Mining & Knowledge Management II