Relaxed Rotational Equivariance via G-Biases in Vision

Authors

  • Zhiqiang Wu East China Normal University
  • Yingjie Liu East China Normal University
  • Licheng Sun East China Normal University
  • Jian Yang Information Engineering University
  • Hanlin Dong East China Normal University
  • Shing-Ho J. Lin University of the Chinese Academy of Sciences
  • Xuan Tang East China Normal University
  • Jinpeng Mi Univerfity of Shanghai for Science and Technology
  • Bo Jin Tongji University
  • Xian Wei East China Normal University

DOI:

https://doi.org/10.1609/aaai.v39i8.32922

Abstract

Group Equivariant Convolution (GConv) can capture rotational equivariance from original data. It assumes uniform and strict rotational equivariance across all features as the transformations under the specific group. However, the presentation or distribution of real-world data rarely conforms to strict rotational equivariance, commonly referred to as Rotational Symmetry-Breaking (RSB) in the system or dataset, making GConv unable to adapt effectively to this phenomenon. Motivated by this, we propose a simple but highly effective method to address this problem, which utilizes a set of learnable biases called G-Biases under the group order to break strict group constraints and then achieve a Relaxed Rotational Equivariant Convolution (RREConv). To validate the efficiency of RREConv, we conduct extensive ablation experiments on the discrete rotational group Cn. Experiments demonstrate that the proposed RREConv-based methods achieve excellent performance compared to existing GConv-based methods in both classification and 2D object detection tasks on the natural image datasets.

Published

2025-04-11

How to Cite

Wu, Z., Liu, Y., Sun, L., Yang, J., Dong, H., Lin, S.-H. J., Tang, X., Mi, J., Jin, B., & Wei, X. (2025). Relaxed Rotational Equivariance via G-Biases in Vision. Proceedings of the AAAI Conference on Artificial Intelligence, 39(8), 8541-8549. https://doi.org/10.1609/aaai.v39i8.32922

Issue

Section

AAAI Technical Track on Computer Vision VII