GAM: Gradient Attention Module of Optimization for Point Clouds Analysis

Authors

  • Haotian Hu Zhejiang Leapmotor Technology CO., LTD.
  • Fanyi Wang OPPO Research Institute
  • Zhiwang Zhang The University of Sydney
  • Yaonong Wang Zhejiang Leapmotor Technology CO., LTD.
  • Laifeng Hu Zhejiang Leapmotor Technology CO., LTD.
  • Yanhao Zhang OPPO Research Institute

DOI:

https://doi.org/10.1609/aaai.v37i1.25162

Keywords:

CV: 3D Computer Vision, CV: Segmentation

Abstract

In the point cloud analysis task, the existing local feature aggregation descriptors (LFAD) do not fully utilize the neighborhood information of center points. Previous methods only use the distance information to constrain the local aggregation process, which is easy to be affected by abnormal points and cannot adequately fit the original geometry of the point cloud. This paper argues that fine-grained geometric information (FGGI) plays an important role in the aggregation of local features. Based on this, we propose a gradient-based local attention module to address the above problem, which is called Gradient Attention Module (GAM). GAM simplifies the process of extracting the gradient information in the neighborhood to explicit representation using the Zenith Angle matrix and Azimuth Angle matrix, which makes the module 35X faster. The comprehensive experiments on the ScanObjectNN dataset, ShapeNet dataset, S3DIS dataset, Modelnet40 dataset, and KITTI dataset demonstrate the effectiveness, efficientness, and generalization of our newly proposed GAM for 3D point cloud analysis. Especially in S3DIS, GAM achieves the highest index in the current point-based model with mIoU/OA/mAcc of 74.4%/90.6%/83.2%.

Downloads

Published

2023-06-26

How to Cite

Hu, H., Wang, F., Zhang, Z., Wang, Y., Hu, L., & Zhang, Y. (2023). GAM: Gradient Attention Module of Optimization for Point Clouds Analysis. Proceedings of the AAAI Conference on Artificial Intelligence, 37(1), 835-843. https://doi.org/10.1609/aaai.v37i1.25162

Issue

Section

AAAI Technical Track on Computer Vision I