Predictive Exit: Prediction of Fine-Grained Early Exits for Computation- and Energy-Efficient Inference

Authors

  • Xiangjie Li Shanghai Jiao Tong Univerisity
  • Chenfei Lou Shanghai Jiao Tong University
  • Yuchi Chen Shanghai Jiao Tong University
  • Zhengping Zhu Shanghai Jiao Tong University
  • Yingtao Shen Shanghai Jiao Tong University
  • Yehan Ma Shanghai Jiao Tong University
  • An Zou Shanghai Jiao Tong University

DOI:

https://doi.org/10.1609/aaai.v37i7.26042

Keywords:

ML: Learning on the Edge & Model Compression, APP: Energy, Environment & Sustainability, ML: Deep Neural Architectures, ML: Deep Neural Network Algorithms

Abstract

By adding exiting layers to the deep learning networks, early exit can terminate the inference earlier with accurate results. However, the passive decision-making of whether to exit or continue the next layer has to go through every pre-placed exiting layer until it exits. In addition, it is hard to adjust the configurations of the computing platforms alongside the inference proceeds. By incorporating a low-cost prediction engine, we propose a Predictive Exit framework for computation- and energy-efficient deep learning applications. Predictive Exit can forecast where the network will exit (i.e., establish the number of remaining layers to finish the inference), which effectively reduces the network computation cost by exiting on time without running every pre-placed exiting layer. Moreover, according to the number of remaining layers, proper computing configurations (i.e., frequency and voltage) are selected to execute the network to further save energy. Extensive experimental results demonstrate that Predictive Exit achieves up to 96.2% computation reduction and 72.9% energy-saving compared with classic deep learning networks; and 12.8% computation reduction and 37.6% energy-saving compared with the early exit under state-of-the-art exiting strategies, given the same inference accuracy and latency.

Downloads

Published

2023-06-26

How to Cite

Li, X., Lou, C., Chen, Y., Zhu, Z., Shen, Y., Ma, Y., & Zou, A. (2023). Predictive Exit: Prediction of Fine-Grained Early Exits for Computation- and Energy-Efficient Inference. Proceedings of the AAAI Conference on Artificial Intelligence, 37(7), 8657-8665. https://doi.org/10.1609/aaai.v37i7.26042

Issue

Section

AAAI Technical Track on Machine Learning II