Layer Compression of Deep Networks with Straight Flows
DOI:
https://doi.org/10.1609/aaai.v38i11.29107
Keywords:
ML: Applications, ML: Deep Learning Algorithms, CV: Applications
Abstract
Very deep neural networks achieve significantly better performance on a variety of real-world tasks, but they typically suffer from slow inference and are hard to deploy on real-world devices. Reducing the number of layers to save memory and accelerate inference is therefore an appealing goal. In this work, we introduce an intermediate objective, a continuous-time network, before distilling deep networks into shallow ones. First, we distill a given deep network into a continuous-time neural flow model, which can be discretized with an ODE solver and whose inference requires passing through the network multiple times. By forcing the flow transport trajectories to be straight lines, we find that it is easier to compress this infinite-step model into a one-step neural flow model, which requires only a single pass through the flow network. Second, we refine the one-step flow model together with the final head layer via knowledge distillation, so that the one-step flow network can replace the given deep network. Empirically, we demonstrate that our method outperforms direct distillation and other baselines on different model architectures (e.g., ResNet, ViT) on image classification and semantic segmentation tasks. We also show that the distilled model naturally serves as an early-exit dynamic inference model.
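To illustrate the core idea from the abstract, here is a minimal NumPy sketch (not the authors' code) of why straight flows enable one-step inference: along a straight trajectory x_t = (1 - t)·x0 + t·x1, the velocity dx_t/dt = x1 - x0 is constant, so a model that predicts this velocity can jump from input x0 to output x1 with a single Euler step. The linear "teacher" and least-squares "velocity model" below are hypothetical stand-ins for the deep network and the neural flow in the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy "deep network" to compress: a fixed linear map standing in for the teacher.
W_teacher = rng.normal(size=(4, 4))
x0 = rng.normal(size=(256, 4))   # inputs = flow starting points
x1 = x0 @ W_teacher.T            # teacher outputs = flow endpoints

# Along the straight path x_t = (1 - t) * x0 + t * x1, the velocity is the
# constant x1 - x0. We fit a linear velocity model v(x) = x @ A by least
# squares; the paper instead trains a neural network conditioned on t.
v_target = x1 - x0
A, *_ = np.linalg.lstsq(x0, v_target, rcond=None)

# One-step inference: a single Euler step of size 1 from t = 0 to t = 1,
# replacing a multi-step ODE-solver traversal of the flow.
x1_hat = x0 + x0 @ A
```

In this linear toy case the one-step reconstruction is exact; with a nonlinear teacher, the straightness of the trajectories is what keeps the single-step error small.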
Published
2024-03-24
How to Cite
Gong, C., Du, X., Bhushanam, B., Wu, L., Liu, X., Choudhary, D., Kejariwal, A., & Liu, Q. (2024). Layer Compression of Deep Networks with Straight Flows. Proceedings of the AAAI Conference on Artificial Intelligence, 38(11), 12181-12189. https://doi.org/10.1609/aaai.v38i11.29107
Issue
Section
AAAI Technical Track on Machine Learning II