Pamba: Enhancing Global Interaction in Point Clouds via State Space Model

Authors

  • Zhuoyuan Li University of Science and Technology of China
  • Yubo Ai University of Science and Technology of China
  • Jiahao Lu University of Science and Technology of China
  • ChuXin Wang University of Science and Technology of China
  • Jiacheng Deng University of Science and Technology of China
  • Hanzhi Chang University of Science and Technology of China
  • Yanzhe Liang University of Science and Technology of China
  • Wenfei Yang University of Science and Technology of China
  • Shifeng Zhang Sangfor Technologies Inc.
  • Tianzhu Zhang University of Science and Technology of China

DOI:

https://doi.org/10.1609/aaai.v39i5.32540

Abstract

Transformers have demonstrated impressive results for 3D point cloud semantic segmentation. However, the quadratic complexity of transformer makes computation costs high, limiting the number of points that can be processed simultaneously and impeding the modeling of long-range dependencies between objects in a single scene. Drawing inspiration from the great potential of recent state space models (SSM) for long sequence modeling, we introduce Mamba, an SSM-based architecture, to the point cloud domain and propose Pamba, a novel architecture with strong global modeling capability under linear complexity. Specifically, to make the disorderness of point clouds fit in with the causal nature of Mamba, we propose a multi-path serialization strategy applicable to point clouds. Besides, we propose the ConvMamba block to compensate for the shortcomings of Mamba in modeling local geometries and in unidirectional modeling. Pamba obtains state-of-the-art results on several 3D point cloud segmentation tasks, including ScanNet v2, ScanNet200, S3DIS and nuScenes, while its effectiveness is validated by extensive experiments.

Downloads

Published

2025-04-11

How to Cite

Li, Z., Ai, Y., Lu, J., Wang, C., Deng, J., Chang, H., … Zhang, T. (2025). Pamba: Enhancing Global Interaction in Point Clouds via State Space Model. Proceedings of the AAAI Conference on Artificial Intelligence, 39(5), 5092–5100. https://doi.org/10.1609/aaai.v39i5.32540

Issue

Section

AAAI Technical Track on Computer Vision IV