DriveFlow: Rectified Flow Adaptation for Robust 3D Object Detection in Autonomous Driving

Authors

  • Hongbin Lin Shenzhen Future Network of Intelligence Institute (FNii), The Chinese University of Hong Kong, Shenzhen School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen
  • Yiming Yang Shenzhen Future Network of Intelligence Institute (FNii), The Chinese University of Hong Kong, Shenzhen School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen
  • Chaoda Zheng Xpeng Motors
  • Yifan Zhang MiroMind AI
  • Shuaicheng Niu Nanyang Technological University
  • Zilu Guo Shenzhen Future Network of Intelligence Institute (FNii), The Chinese University of Hong Kong, Shenzhen School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen
  • Yafeng Li Baoji University of Arts and Sciences
  • Gui Gui Central South University
  • Shuguang Cui School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen Shenzhen Future Network of Intelligence Institute (FNii), The Chinese University of Hong Kong, Shenzhen
  • Zhen Li School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen Shenzhen Future Network of Intelligence Institute (FNii), The Chinese University of Hong Kong, Shenzhen

DOI:

https://doi.org/10.1609/aaai.v40i9.37628

Abstract

In autonomous driving, vision-centric 3D object detection recognizes and localizes 3D objects from RGB images. However, due to high annotation costs and diverse outdoor scenes, training data often fails to cover all possible test scenarios, known as the out-of-distribution (OOD) issue. Training-free image editing offers a promising solution for improving model robustness by training data enhancement without any modifications to pre-trained diffusion models. Nevertheless, inversion-based methods often suffer from limited effectiveness and inherent inaccuracies, while recent rectified-flow-based approaches struggle to preserve objects with accurate 3D geometry. In this paper, we propose DriveFlow, a Rectified Flow Adaptation method for training data enhancement in autonomous driving based on pre-trained Text-to-Image flow models. Based on frequency decomposition, DriveFlow introduces two strategies to adapt noise-free editing paths derived from text-conditioned velocities. 1) High-Frequency Foreground Preservation: DriveFlow incorporates a high-frequency alignment loss for foreground to maintain precise 3D object geometry. 2) Dual-Frequency Background Optimization: DriveFlow also conducts dual-frequency optimization for background, balancing editing flexibility and semantic consistency. Comprehensive experiments validate the effectiveness and efficiency of DriveFlow, demonstrating comprehensive performance improvements on all categories across OOD scenarios.

Published

2026-03-14

How to Cite

Lin, H., Yang, Y., Zheng, C., Zhang, Y., Niu, S., Guo, Z., … Li, Z. (2026). DriveFlow: Rectified Flow Adaptation for Robust 3D Object Detection in Autonomous Driving. Proceedings of the AAAI Conference on Artificial Intelligence, 40(9), 6943–6951. https://doi.org/10.1609/aaai.v40i9.37628

Issue

Section

AAAI Technical Track on Computer Vision VI