PaRot: Patch-Wise Rotation-Invariant Network via Feature Disentanglement and Pose Restoration

Authors

  • Dingxin Zhang School of Computer Science, University of Sydney
  • Jianhui Yu School of Computer Science, University of Sydney
  • Chaoyi Zhang School of Computer Science, University of Sydney
  • Weidong Cai School of Computer Science, University of Sydney

DOI:

https://doi.org/10.1609/aaai.v37i3.25450

Keywords:

CV: 3D Computer Vision

Abstract

Recent interest in point cloud analysis has led rapid progress in designing deep learning methods for 3D models. However, state-of-the-art models are not robust to rotations, which remains an unknown prior to real applications and harms the model performance. In this work, we introduce a novel Patch-wise Rotation-invariant network (PaRot), which achieves rotation invariance via feature disentanglement and produces consistent predictions for samples with arbitrary rotations. Specifically, we design a siamese training module which disentangles rotation invariance and equivariance from patches defined over different scales, e.g., the local geometry and global shape, via a pair of rotations. However, our disentangled invariant feature loses the intrinsic pose information of each patch. To solve this problem, we propose a rotation-invariant geometric relation to restore the relative pose with equivariant information for patches defined over different scales. Utilising the pose information, we propose a hierarchical module which implements intra-scale and inter-scale feature aggregation for 3D shape learning. Moreover, we introduce a pose-aware feature propagation process with the rotation-invariant relative pose information embedded. Experiments show that our disentanglement module extracts high-quality rotation-robust features and the proposed lightweight model achieves competitive results in rotated 3D object classification and part segmentation tasks.

Downloads

Published

2023-06-26

How to Cite

Zhang, D., Yu, J., Zhang, C., & Cai, W. (2023). PaRot: Patch-Wise Rotation-Invariant Network via Feature Disentanglement and Pose Restoration. Proceedings of the AAAI Conference on Artificial Intelligence, 37(3), 3418-3426. https://doi.org/10.1609/aaai.v37i3.25450

Issue

Section

AAAI Technical Track on Computer Vision III