Semi-supervised Class-Agnostic Motion Prediction with Pseudo Label Regeneration and BEVMix

Authors

  • Kewei Wang Key Laboratory of Image Processing and Intelligent Control, Ministry of Education School of Artificial Intelligence and Automation, Huazhong University of Science and Technology S-Lab, Nanyang Technological University
  • Yizheng Wu Key Laboratory of Image Processing and Intelligent Control, Ministry of Education School of Artificial Intelligence and Automation, Huazhong University of Science and Technology S-Lab, Nanyang Technological University
  • Zhiyu Pan Key Laboratory of Image Processing and Intelligent Control, Ministry of Education School of Artificial Intelligence and Automation, Huazhong University of Science and Technology
  • Xingyi Li Key Laboratory of Image Processing and Intelligent Control, Ministry of Education School of Artificial Intelligence and Automation, Huazhong University of Science and Technology S-Lab, Nanyang Technological University
  • Ke Xian S-Lab, Nanyang Technological University
  • Zhe Wang SenseTime Research
  • Zhiguo Cao Key Laboratory of Image Processing and Intelligent Control, Ministry of Education School of Artificial Intelligence and Automation, Huazhong University of Science and Technology
  • Guosheng Lin S-Lab, Nanyang Technological University

DOI:

https://doi.org/10.1609/aaai.v38i6.28358

Keywords:

CV: Vision for Robotics & Autonomous Driving, CV: 3D Computer Vision

Abstract

Class-agnostic motion prediction methods aim to comprehend motion within open-world scenarios, holding significance for autonomous driving systems. However, training a high-performance model in a fully-supervised manner always requires substantial amounts of manually annotated data, which can be both expensive and time-consuming to obtain. To address this challenge, our study explores the potential of semi-supervised learning (SSL) for class-agnostic motion prediction. Our SSL framework adopts a consistency-based self-training paradigm, enabling the model to learn from unlabeled data by generating pseudo labels through test-time inference. To improve the quality of pseudo labels, we propose a novel motion selection and re-generation module. This module effectively selects reliable pseudo labels and re-generates unreliable ones. Furthermore, we propose two data augmentation strategies: temporal sampling and BEVMix. These strategies facilitate consistency regularization in SSL. Experiments conducted on nuScenes demonstrate that our SSL method can surpass the self-supervised approach by a large margin by utilizing only a tiny fraction of labeled data. Furthermore, our method exhibits comparable performance to weakly and some fully supervised methods. These results highlight the ability of our method to strike a favorable balance between annotation costs and performance. Code will be available at https://github.com/kwwcv/SSMP.

Published

2024-03-24

How to Cite

Wang, K., Wu, Y., Pan, Z., Li, X., Xian, K., Wang, Z., Cao, Z., & Lin, G. (2024). Semi-supervised Class-Agnostic Motion Prediction with Pseudo Label Regeneration and BEVMix. Proceedings of the AAAI Conference on Artificial Intelligence, 38(6), 5490-5498. https://doi.org/10.1609/aaai.v38i6.28358

Issue

Section

AAAI Technical Track on Computer Vision V