Growing Interpretable Part Graphs on ConvNets via Multi-Shot Learning

Authors

  • Quanshi Zhang University of California, Los Angeles
  • Ruiming Cao University of California, Los Angeles
  • Ying Nian Wu University of California, Los Angeles
  • Song-Chun Zhu University of California, Los Angeles

DOI:

https://doi.org/10.1609/aaai.v31i1.10924

Keywords:

Deep Learning, And-Or Graph, Convolutional Neural Network, Weakly-Supervised Learning

Abstract

This paper proposes a learning strategy that embeds object-part concepts into a pre-trained convolutional neural network (CNN), in an attempt to 1) explore explicit semantics hidden in CNN units and 2) gradually transform the pre-trained CNN into a semantically interpretable graphical model for hierarchical object understanding. Given part annotations on very few (e.g., 3-12) objects, our method mines certain latent patterns from the pre-trained CNN and associates them with different semantic parts. We use a four-layer And-Or graph to organize the CNN units, so as to clarify their internal semantic hierarchy. Our method is guided by a small number of part annotations, and it achieves superior part-localization performance (about 13%-107% improvement in part center prediction on the PASCAL VOC and ImageNet datasets)

Downloads

Published

2017-02-13

How to Cite

Zhang, Q., Cao, R., Wu, Y. N., & Zhu, S.-C. (2017). Growing Interpretable Part Graphs on ConvNets via Multi-Shot Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 31(1). https://doi.org/10.1609/aaai.v31i1.10924