The Last Byte: Learning Just Enough for Machine-Oriented Image Compression

Wuyuan Xie; Zhenming Li; Ye Liu; Jian Jin; Yun Song; Miaohui Wang

doi:10.1609/aaai.v40i19.38635

Authors

Wuyuan Xie College of Computer Science & Software Engineering, Shenzhen University
Zhenming Li College of Computer Science & Software Engineering, Shenzhen University
Ye Liu School of Automation, Nanjing University of Posts and Telecommunications
Jian Jin Alibaba-NTU Joint Research Institute, Nanyang Technological University
Yun Song School of Computer Science and Technology, Changsha University of Science and Technology
Miaohui Wang Guangdong Key Laboratory of Intelligent Information Processing, Shenzhen University

DOI:

https://doi.org/10.1609/aaai.v40i19.38635

Abstract

Just recognizable distortion (JRD) has been introduced for image compression for machines, aiming to quantify the maximum coding distortion that can be tolerated by a specific perception model, thereby defining the upper bound of machine vision redundancy (MVR). However, existing JRD-based redundancy estimation methods face three key challenges: limited dataset annotation accuracy, low prediction efficiency, and insufficient perception accuracy, all of which hinder their practical deployment. To address these limitations, we propose a new MVR-Net, a frame-wise efficient JRD prediction method that generates the optimal encoding quantization map in a single inference pass. Furthermore, we refine the annotation standard for JRD datasets based on experimental insights, enhancing the precision of recognizable redundancy measurement. Compared to stateof-the-art methods, MVR-Net achieves a superior balance between bitrate reduction and perception accuracy in JRD-guided compression, while offering up to a 40,000× speed improvement, demonstrating its practicality and efficiency for real-world applications.

The Last Byte: Learning Just Enough for Machine-Oriented Image Compression

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information