MetaMix: Meta-State Precision Searcher for Mixed-Precision Activation Quantization

Authors

  • Han-Byul Kim Seoul National University Google
  • Joo Hyung Lee Google
  • Sungjoo Yoo Seoul National University
  • Hong-Seok Kim Google

DOI:

https://doi.org/10.1609/aaai.v38i12.29212

Keywords:

ML: Optimization

Abstract

Mixed-precision quantization of efficient networks often suffer from activation instability encountered in the exploration of bit selections. To address this problem, we propose a novel method called MetaMix which consists of bit selection and weight training phases. The bit selection phase iterates two steps, (1) the mixed-precision-aware weight update, and (2) the bit-search training with the fixed mixed-precision-aware weights, both of which combined reduce activation instability in mixed-precision quantization and contribute to fast and high-quality bit selection. The weight training phase exploits the weights and step sizes trained in the bit selection phase and fine-tunes them thereby offering fast training. Our experiments with efficient and hard-to-quantize networks, i.e., MobileNet v2 and v3, and ResNet-18 on ImageNet show that our proposed method pushes the boundary of mixed-precision quantization, in terms of accuracy vs. operations, by outperforming both mixed- and single-precision SOTA methods.

Published

2024-03-24

How to Cite

Kim, H.-B., Lee, J. H., Yoo, S., & Kim, H.-S. (2024). MetaMix: Meta-State Precision Searcher for Mixed-Precision Activation Quantization. Proceedings of the AAAI Conference on Artificial Intelligence, 38(12), 13132–13141. https://doi.org/10.1609/aaai.v38i12.29212

Issue

Section

AAAI Technical Track on Machine Learning III