EAN: An Efficient Attention Module Guided by Normalization for Deep Neural Networks

Authors

  • Jiafeng Li, East China Normal University
  • Zelin Li, New York University Shanghai
  • Ying Wen, East China Normal University

DOI:

https://doi.org/10.1609/aaai.v38i4.28093

Keywords:

CV: Learning & Optimization for CV, CV: Applications, CV: Object Detection & Categorization, CV: Other Foundations of Computer Vision, CV: Representation Learning for Vision, ML: Deep Learning Algorithms

Abstract

Deep neural networks (DNNs) have achieved remarkable success in various fields, and two powerful techniques, feature normalization and attention mechanisms, are widely used to enhance model performance. However, the two are usually treated as separate approaches or combined in a simplistic manner. In this paper, we investigate the intrinsic relationship between feature normalization and attention mechanisms and propose an Efficient Attention module guided by Normalization, dubbed EAN. Instead of using costly fully-connected layers for attention learning, EAN leverages the strengths of feature normalization and incorporates an Attention Generation (AG) unit to re-calibrate features. The proposed AG unit uses the normalization component as a measure of the importance of distinct features and generates an attention mask using GroupNorm, L2 Norm, and Adaptation operations. EAN is built from three stages (grouping, AG units, and aggregation), yielding a unified module that harnesses the advantages of both normalization and attention while keeping computational overhead minimal. Furthermore, EAN serves as a plug-and-play module that can be seamlessly integrated with classic backbone architectures. Extensive quantitative evaluations on various visual tasks demonstrate that EAN achieves highly competitive performance compared to current state-of-the-art attention methods while maintaining lower model complexity.
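
The abstract names the ingredients of the AG unit (GroupNorm, L2 Norm, Adaptation) and the overall grouping, AG unit, aggregation strategy, but not their exact formulation. The NumPy sketch below is only one plausible reading of that description, not the authors' implementation: the function names (`group_norm`, `attention_mask`, `recalibrate`), the per-channel `scale` and `shift` parameters standing in for the Adaptation step, the sigmoid gate, and the way the L2 norm is applied are all assumptions made for illustration.

```python
import numpy as np


def group_norm(x, num_groups, eps=1e-5):
    """Normalize an (N, C, H, W) array within each group of channels."""
    n, c, h, w = x.shape
    grouped = x.reshape(n, num_groups, c // num_groups, h, w)
    mean = grouped.mean(axis=(2, 3, 4), keepdims=True)
    var = grouped.var(axis=(2, 3, 4), keepdims=True)
    return ((grouped - mean) / np.sqrt(var + eps)).reshape(n, c, h, w)


def attention_mask(x, num_groups, scale, shift, eps=1e-5):
    """Hypothetical AG unit: GroupNorm -> L2 norm -> affine adaptation -> sigmoid."""
    normed = group_norm(x, num_groups, eps)
    # Assumed descriptor: L2 norm of each channel over its spatial positions.
    desc = np.sqrt((normed ** 2).sum(axis=(2, 3), keepdims=True))  # (N, C, 1, 1)
    # Assumed L2 normalization of the descriptors across channels.
    desc = desc / (np.sqrt((desc ** 2).sum(axis=1, keepdims=True)) + eps)
    # Assumed "Adaptation": a per-channel affine map (learnable in a real model).
    logits = scale.reshape(1, -1, 1, 1) * desc + shift.reshape(1, -1, 1, 1)
    return 1.0 / (1.0 + np.exp(-logits))  # mask values lie in (0, 1)


def recalibrate(x, num_splits, num_groups, scale, shift):
    """Split channels, gate each chunk with its own mask, then concatenate."""
    chunks = np.split(x, num_splits, axis=1)      # grouping
    scales = np.split(scale, num_splits)
    shifts = np.split(shift, num_splits)
    gated = [
        chunk * attention_mask(chunk, num_groups, s, b)  # AG unit per chunk
        for chunk, s, b in zip(chunks, scales, shifts)
    ]
    return np.concatenate(gated, axis=1)          # aggregation


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    features = rng.standard_normal((2, 8, 4, 4))  # (batch, channels, height, width)
    out = recalibrate(features, num_splits=2, num_groups=2,
                      scale=np.ones(8), shift=np.zeros(8))
    print(out.shape)  # (2, 8, 4, 4)
```

Under these assumptions the mask needs only two parameters per channel and no fully-connected layers, which is the kind of low overhead the abstract emphasizes; the published module may differ in every detail.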

Published

2024-03-24

How to Cite

Li, J., Li, Z., & Wen, Y. (2024). EAN: An Efficient Attention Module Guided by Normalization for Deep Neural Networks. Proceedings of the AAAI Conference on Artificial Intelligence, 38(4), 3100-3108. https://doi.org/10.1609/aaai.v38i4.28093

Issue

Vol. 38 No. 4 (2024)

Section

AAAI Technical Track on Computer Vision III