Semi-supervised Class-Agnostic Motion Prediction with Pseudo Label Regeneration and BEVMix
DOI:
https://doi.org/10.1609/aaai.v38i6.28358Keywords:
CV: Vision for Robotics & Autonomous Driving, CV: 3D Computer VisionAbstract
Class-agnostic motion prediction methods aim to comprehend motion within open-world scenarios, holding significance for autonomous driving systems. However, training a high-performance model in a fully-supervised manner always requires substantial amounts of manually annotated data, which can be both expensive and time-consuming to obtain. To address this challenge, our study explores the potential of semi-supervised learning (SSL) for class-agnostic motion prediction. Our SSL framework adopts a consistency-based self-training paradigm, enabling the model to learn from unlabeled data by generating pseudo labels through test-time inference. To improve the quality of pseudo labels, we propose a novel motion selection and re-generation module. This module effectively selects reliable pseudo labels and re-generates unreliable ones. Furthermore, we propose two data augmentation strategies: temporal sampling and BEVMix. These strategies facilitate consistency regularization in SSL. Experiments conducted on nuScenes demonstrate that our SSL method can surpass the self-supervised approach by a large margin by utilizing only a tiny fraction of labeled data. Furthermore, our method exhibits comparable performance to weakly and some fully supervised methods. These results highlight the ability of our method to strike a favorable balance between annotation costs and performance. Code will be available at https://github.com/kwwcv/SSMP.Downloads
Published
2024-03-24
How to Cite
Wang, K., Wu, Y., Pan, Z., Li, X., Xian, K., Wang, Z., Cao, Z., & Lin, G. (2024). Semi-supervised Class-Agnostic Motion Prediction with Pseudo Label Regeneration and BEVMix. Proceedings of the AAAI Conference on Artificial Intelligence, 38(6), 5490-5498. https://doi.org/10.1609/aaai.v38i6.28358
Issue
Section
AAAI Technical Track on Computer Vision V