Improving Batch Normalization with Test-Time Adaptation for Robust Object Detection in Self-Driving

Authors

  • Dacheng Liao Beijing University of Posts and Telecommunications
  • Mengshi Qi Beijing University of Posts and Telecommunications
  • Liang Liu Beijing University of Posts and Telecommunications
  • Huadong Ma Beijing University of Posts and Telecommunications

DOI:

https://doi.org/10.1609/aaai.v40i9.37626

Abstract

In open real-world autonomous driving scenarios, challenges such as sensor failure and extreme weather hinder the generalization of current autonomous driving perception models to these unseen domain, due to the domain shifts between the test and training data. As the parameter scale of autonomous driving perception models grows, traditional test-time adaptation (TTA) methods become unstable and often degrade model performance in most scenarios. To address these challenges, this paper proposes two new robust methods to improve the Batch Normalization with TTA for object detection in autonomous driving: (1) We introduce a new LearnableBN layer based on Geometric Confidence Maximization and Entropy Minimization. Specifically, we modify the traditional BN layer by incorporating auxiliary learnable parameters, which enables the BN layer to dynamically update the statistics according to the different input data. (2) We propose a novel semantic-consistency based dual-stage adaptation strategy, which encourages the model to iteratively search for the optimal solution and eliminates unstable samples during the adaptation process. Extensive experiments on the NuScenes-C dataset shows that our method achieves a maximum improvement of about 10\% using BEVFormer as the baseline across six corruption types and three levels of severity.

Downloads

Published

2026-03-14

How to Cite

Liao, D., Qi, M., Liu, L., & Ma, H. (2026). Improving Batch Normalization with Test-Time Adaptation for Robust Object Detection in Self-Driving. Proceedings of the AAAI Conference on Artificial Intelligence, 40(9), 6925–6933. https://doi.org/10.1609/aaai.v40i9.37626

Issue

Section

AAAI Technical Track on Computer Vision VI