The Last Byte: Learning Just Enough for Machine-Oriented Image Compression

Authors

  • Wuyuan Xie College of Computer Science & Software Engineering, Shenzhen University
  • Zhenming Li College of Computer Science & Software Engineering, Shenzhen University
  • Ye Liu School of Automation, Nanjing University of Posts and Telecommunications
  • Jian Jin Alibaba-NTU Joint Research Institute, Nanyang Technological University
  • Yun Song School of Computer Science and Technology, Changsha University of Science and Technology
  • Miaohui Wang Guangdong Key Laboratory of Intelligent Information Processing, Shenzhen University

DOI:

https://doi.org/10.1609/aaai.v40i19.38635

Abstract

Just recognizable distortion (JRD) has been introduced for image compression for machines, aiming to quantify the maximum coding distortion that can be tolerated by a specific perception model, thereby defining the upper bound of machine vision redundancy (MVR). However, existing JRD-based redundancy estimation methods face three key challenges: limited dataset annotation accuracy, low prediction efficiency, and insufficient perception accuracy, all of which hinder their practical deployment. To address these limitations, we propose a new MVR-Net, a frame-wise efficient JRD prediction method that generates the optimal encoding quantization map in a single inference pass. Furthermore, we refine the annotation standard for JRD datasets based on experimental insights, enhancing the precision of recognizable redundancy measurement. Compared to stateof-the-art methods, MVR-Net achieves a superior balance between bitrate reduction and perception accuracy in JRD-guided compression, while offering up to a 40,000× speed improvement, demonstrating its practicality and efficiency for real-world applications.

Published

2026-03-14

How to Cite

Xie, W., Li, Z., Liu, Y., Jin, J., Song, Y., & Wang, M. (2026). The Last Byte: Learning Just Enough for Machine-Oriented Image Compression. Proceedings of the AAAI Conference on Artificial Intelligence, 40(19), 16013–16021. https://doi.org/10.1609/aaai.v40i19.38635

Issue

Section

AAAI Technical Track on Data Mining & Knowledge Management III