Adversarial Learning of Portable Student Networks

Authors

  • Yunhe Wang Peking University
  • Chang Xu University of Sydney
  • Chao Xu Peking University
  • Dacheng Tao University of Sydney

DOI:

https://doi.org/10.1609/aaai.v32i1.11667

Abstract

Effective methods for learning deep neural networks with fewer parameters are urgently required, since storage and computations of heavy neural networks have largely prevented their widespread use on mobile devices. Compared with algorithms which directly remove weights or filters for obtaining considerable compression and speed-up ratios, training thin deep networks exploiting the student-teacher learning paradigm is more flexible. However, it is very hard to determine which formulation is optimal to measure the information inherited from teacher networks. To overcome this challenge, we utilize the generative adversarial network (GAN) to learn the student network. In practice, the generator is exactly the student network with extremely less parameters and the discriminator is used as a teaching assistant for distinguishing features extracted from student and teacher networks. By simultaneously optimizing the generator and the discriminator, the resulting student network can produce features of input data with the similar distribution as that of features of the teacher network. Extensive experimental results on benchmark datasets demonstrate that the proposed method is capable of learning well-performed portable networks, which is superior to the state-of-the-art methods.

Downloads

Published

2018-04-29

How to Cite

Wang, Y., Xu, C., Xu, C., & Tao, D. (2018). Adversarial Learning of Portable Student Networks. Proceedings of the AAAI Conference on Artificial Intelligence, 32(1). https://doi.org/10.1609/aaai.v32i1.11667