CompTrack: Information Bottleneck-Guided Low-Rank Dynamic Token Compression for Point Cloud Tracking

Authors

  • Sifan Zhou School of Automation, Southeast University Key Laboratory of Measurement and Control of Complex Systems of Engineering, Ministry of Education, Southeast University
  • Yichao Cao Central South University
  • Jiahao Nie Zhejiang University of Finance and Economics
  • Yuqian Fu INSAIT, Sofia University ”St. Kliment Ohridski”
  • Ziyu Zhao School of Automation, Southeast University Key Laboratory of Measurement and Control of Complex Systems of Engineering, Ministry of Education, Southeast University
  • Xiaobo Lu School of Automation, Southeast University Key Laboratory of Measurement and Control of Complex Systems of Engineering, Ministry of Education, Southeast University
  • Shuo Wang Mininglamp Technology

DOI:

https://doi.org/10.1609/aaai.v40i16.38385

Abstract

3D single object tracking (SOT) in LiDAR point clouds is a critical task in computer vision and autonomous driving. Despite great success having been achieved, the inherent sparsity of point clouds introduces a dual-redundancy challenge that limits existing trackers: (1) vast spatial redundancy from background noise impairs accuracy, and (2) informational redundancy within the foreground hinders efficiency. To tackle these issues, we propose CompTrack, a novel end-to-end framework that systematically eliminates both forms of redundancy in point clouds. First, CompTrack incorporates a Spatial Foreground Predictor (SFP) module to filter out irrelevant background noise based on information entropy, addressing spatial redundancy. Subsequently, its core is an Information Bottleneck-guided Dynamic Token Compression (IB-DTC) module that eliminates the informational redundancy within the foreground. Theoretically grounded in low-rank approximation, this module leverages an online SVD analysis to adaptively compress the redundant foreground into a compact and highly informative set of proxy tokens. Extensive experiments on KITTI, nuScenes and Waymo datasets demonstrate that CompTrack achieves top-performing tracking performance with superior efficiency, running at a real-time 90 FPS on a single RTX 3090 GPU.

Published

2026-03-14

How to Cite

Zhou, S., Cao, Y., Nie, J., Fu, Y., Zhao, Z., Lu, X., & Wang, S. (2026). CompTrack: Information Bottleneck-Guided Low-Rank Dynamic Token Compression for Point Cloud Tracking. Proceedings of the AAAI Conference on Artificial Intelligence, 40(16), 13773–13781. https://doi.org/10.1609/aaai.v40i16.38385

Issue

Section

AAAI Technical Track on Computer Vision XIII