DriveLiDAR4D: Sequential and Controllable LiDAR Scene Generation for Autonomous Driving

Authors

  • Kaiwen Cai Li Auto Inc.
  • Xinze Liu Li Auto Inc.
  • Xia Zhou Li Auto Inc.
  • Hengtong Hu Li Auto Inc.
  • Jie Xiang Li Auto Inc.
  • Luyao Zhang Li Auto Inc.
  • Xueyang Zhang Li Auto Inc.
  • Kun Zhan Li Auto Inc.
  • Yifei Zhan Li Auto Inc.
  • Xianpeng Lang Li Auto Inc.

DOI:

https://doi.org/10.1609/aaai.v40i4.37239

Abstract

The generation of realistic LiDAR point clouds plays a crucial role in the development and evaluation of autonomous driving systems. Although recent methods for 3D LiDAR point cloud generation have shown significant improvements, they still face notable limitations, including the lack of sequential generation capabilities and the inability to produce accurately positioned foreground objects and realistic backgrounds. These shortcomings hinder their practical applicability. In this paper, we introduce DriveLiDAR4D, a novel LiDAR generation pipeline consisting of multimodal conditions and a novel sequential noise prediction model LiDAR4DNet, capable of producing temporally consistent LiDAR scenes with highly controllable foreground objects and realistic backgrounds. To the best of our knowledge, this is the first work to address the sequential generation of LiDAR scenes with full scene manipulation capability in an end-to-end manner. We evaluated DriveLiDAR4D on the nuScenes and KITTI datasets, where we achieved an FRD score of 743.13 and an FVD score of 16.96 on the nuScenes dataset, surpassing the current state-of-the-art (SOTA) method, UniScene, with an performance boost of 37.2% in FRD and 24.1% in FVD, respectively.

Downloads

Published

2026-03-14

How to Cite

Cai, K., Liu, X., Zhou, X., Hu, H., Xiang, J., Zhang, L., … Lang, X. (2026). DriveLiDAR4D: Sequential and Controllable LiDAR Scene Generation for Autonomous Driving. Proceedings of the AAAI Conference on Artificial Intelligence, 40(4), 2525–2533. https://doi.org/10.1609/aaai.v40i4.37239

Issue

Section

AAAI Technical Track on Computer Vision I