Neural Inheritance Relation Guided One-Shot Layer Assignment Search

Authors

  • Rang Meng Zhejiang University
  • Weijie Chen Hikvision Research Institute
  • Di Xie Hikvision Research Institute
  • Yuan Zhang Hikvision Research Institute
  • Shiliang Pu Hikvision Research Institute

DOI:

https://doi.org/10.1609/aaai.v34i04.5959

Abstract

Layer assignment is seldom picked out as an independent research topic in neural architecture search. In this paper, for the first time, we systematically investigate the impact of different layer assignments to the network performance by building an architecture dataset of layer assignment on CIFAR-100. Through analyzing this dataset, we discover a neural inheritance relation among the networks with different layer assignments, that is, the optimal layer assignments for deeper networks always inherit from those for shallow networks. Inspired by this neural inheritance relation, we propose an efficient one-shot layer assignment search approach via inherited sampling. Specifically, the optimal layer assignment searched in the shallow network can be provided as a strong sampling priori to train and search the deeper ones in supernet, which extremely reduces the network search space. Comprehensive experiments carried out on CIFAR-100 illustrate the efficiency of our proposed method. Our search results are strongly consistent with the optimal ones directly selected from the architecture dataset. To further confirm the generalization of our proposed method, we also conduct experiments on Tiny-ImageNet and ImageNet. Our searched results are remarkably superior to the handcrafted ones under the unchanged computational budgets. The neural inheritance relation discovered in this paper can provide insights to the universal neural architecture search.

Downloads

Published

2020-04-03

How to Cite

Meng, R., Chen, W., Xie, D., Zhang, Y., & Pu, S. (2020). Neural Inheritance Relation Guided One-Shot Layer Assignment Search. Proceedings of the AAAI Conference on Artificial Intelligence, 34(04), 5158-5165. https://doi.org/10.1609/aaai.v34i04.5959

Issue

Section

AAAI Technical Track: Machine Learning