Efficient Compact Bilinear Pooling via Kronecker Product

Authors

  • Tan Yu Baidu Research
  • Yunfeng Cai Baidu Research
  • Ping Li Baidu Research

DOI:

https://doi.org/10.1609/aaai.v36i3.20225

Keywords:

Computer Vision (CV)

Abstract

Bilinear pooling has achieved excellent performance in fine-grained recognition tasks. Nevertheless, high-dimensional bilinear features suffer from over-fitting and inefficiency. To alleviate these issues, compact bilinear pooling (CBP) methods were developed to generate low-dimensional features. Although the low-dimensional features from existing CBP methods enable high efficiency in subsequent classification, CBP methods themselves are inefficient. Thus, the inefficiency issue of the bilinear pooling is still unsolved. In this work, we propose an efficient compact bilinear pooling method to solve the inefficiency problem inherited in bilinear pooling thoroughly. It decomposes the huge-scale projection matrix into a two-level Kronecker product of several small-scale matrices. By exploiting the ``vec trick'' and the tensor modal product, we can obtain the compact bilinear feature through the decomposed projection matrices in a speedy manner. Systematic experiments on four public benchmarks using two backbones demonstrate the efficiency and effectiveness of the proposed method in fine-grained recognition.

Downloads

Published

2022-06-28

How to Cite

Yu, T., Cai, Y., & Li, P. (2022). Efficient Compact Bilinear Pooling via Kronecker Product. Proceedings of the AAAI Conference on Artificial Intelligence, 36(3), 3170-3178. https://doi.org/10.1609/aaai.v36i3.20225

Issue

Section

AAAI Technical Track on Computer Vision III