The Last Byte: Learning Just Enough for Machine-Oriented Image Compression
DOI:
https://doi.org/10.1609/aaai.v40i19.38635Abstract
Just recognizable distortion (JRD) has been introduced for image compression for machines, aiming to quantify the maximum coding distortion that can be tolerated by a specific perception model, thereby defining the upper bound of machine vision redundancy (MVR). However, existing JRD-based redundancy estimation methods face three key challenges: limited dataset annotation accuracy, low prediction efficiency, and insufficient perception accuracy, all of which hinder their practical deployment. To address these limitations, we propose a new MVR-Net, a frame-wise efficient JRD prediction method that generates the optimal encoding quantization map in a single inference pass. Furthermore, we refine the annotation standard for JRD datasets based on experimental insights, enhancing the precision of recognizable redundancy measurement. Compared to stateof-the-art methods, MVR-Net achieves a superior balance between bitrate reduction and perception accuracy in JRD-guided compression, while offering up to a 40,000× speed improvement, demonstrating its practicality and efficiency for real-world applications.Downloads
Published
2026-03-14
How to Cite
Xie, W., Li, Z., Liu, Y., Jin, J., Song, Y., & Wang, M. (2026). The Last Byte: Learning Just Enough for Machine-Oriented Image Compression. Proceedings of the AAAI Conference on Artificial Intelligence, 40(19), 16013–16021. https://doi.org/10.1609/aaai.v40i19.38635
Issue
Section
AAAI Technical Track on Data Mining & Knowledge Management III