msLPCC: A Multimodal-Driven Scalable Framework for Deep LiDAR Point Cloud Compression

Authors

  • Miaohui Wang Guangdong Key Laboratory of Intelligent Information Processing, Shenzhen University, Shenzhen 518060
  • Runnan Huang Guangdong Key Laboratory of Intelligent Information Processing, Shenzhen University, Shenzhen 518060
  • Hengjin Dong Guangdong Key Laboratory of Intelligent Information Processing, Shenzhen University, Shenzhen 518060
  • Di Lin College of Intelligence and Computing, Tianjin University, Tianjin 300072
  • Yun Song School of Computer & Communication Engineering, Changsha University of Science & Technology, Changsha 410004
  • Wuyuan Xie Guangdong Key Laboratory of Intelligent Information Processing, Shenzhen University, Shenzhen 518060

DOI:

https://doi.org/10.1609/aaai.v38i6.28362

Keywords:

CV: Other Foundations of Computer Vision, DMKM: Data Compression

Abstract

LiDAR sensors are widely used in autonomous driving, and the growing storage and transmission demands have made LiDAR point cloud compression (LPCC) a hot research topic. To address the challenges posed by the large-scale and uneven-distribution (spatial and categorical) of LiDAR point data, this paper presents a new multimodal-driven scalable LPCC framework. For the large-scale challenge, we decouple the original LiDAR data into multi-layer point subsets, compress and transmit each layer separately, so as to ensure the reconstruction quality requirement under different scenarios. For the uneven-distribution challenge, we extract, align, and fuse heterologous feature representations, including point modality with position information, depth modality with spatial distance information, and segmentation modality with category information. Extensive experimental results on the benchmark SemanticKITTI database validate that our method outperforms 14 recent representative LPCC methods.

Published

2024-03-24

How to Cite

Wang, M., Huang, R., Dong, H., Lin, D., Song, Y., & Xie, W. (2024). msLPCC: A Multimodal-Driven Scalable Framework for Deep LiDAR Point Cloud Compression. Proceedings of the AAAI Conference on Artificial Intelligence, 38(6), 5526-5534. https://doi.org/10.1609/aaai.v38i6.28362

Issue

Section

AAAI Technical Track on Computer Vision V