Towards an Effective Orthogonal Dictionary Convolution Strategy

Authors

  • Yishi Li Xidian Univercity
  • Kunran Xu Xidian University
  • Rui Lai Xidian University
  • Lin Gu RIKEN,AIP The University of Tokyo

DOI:

https://doi.org/10.1609/aaai.v36i2.20037

Keywords:

Computer Vision (CV)

Abstract

Orthogonality regularization has proven effective in improving the precision, convergence speed and the training stability of CNNs. Here, we propose a novel Orthogonal Dictionary Convolution Strategy (ODCS) on CNNs to improve orthogonality effect by optimizing the network architecture and changing the regularized object. Specifically, we remove the nonlinear layer in typical convolution block “Conv(BN) + Nonlinear + Pointwise Conv(BN)”, and only impose orthogonal regularization on the front Conv. The structure, “Conv(BN) + Pointwise Conv(BN)”, is then equivalent to a pair of dictionary and encoding, defined in sparse dictionary learning. Thanks to the exact and efficient representation of signal with dictionaries in low-dimensional projections, our strategy could reduce the superfluous information in dictionary Conv kernels. Meanwhile, the proposed strategy relieves the too strict orthogonality regularization in training, which makes hyper-parameters tuning of model to be more flexible. In addition, our ODCS can modify the state-of-the-art models easily without any extra consumption in inference phase. We evaluate it on a variety of CNNs in small-scale (CIFAR), large-scale (ImageNet) and fine-grained (CUB-200-2011) image classification tasks, respectively. The experimental results show that our method achieve a stable and superior improvement.

Downloads

Published

2022-06-28

How to Cite

Li, Y., Xu, K., Lai, R., & Gu, L. (2022). Towards an Effective Orthogonal Dictionary Convolution Strategy. Proceedings of the AAAI Conference on Artificial Intelligence, 36(2), 1473-1481. https://doi.org/10.1609/aaai.v36i2.20037

Issue

Section

AAAI Technical Track on Computer Vision II