BEVSync: Asynchronous Data Alignment for Camera-based Vehicle-Infrastructure Cooperative Perception Under Uncertain Delays

Authors

  • Wentao Wang Shenzhen Campus of Sun Yat-sen University
  • Jiaqian Wang Shenzhen Campus of Sun Yat-sen University Peng Cheng Laboratory
  • Yuxin Deng Shenzhen Campus of Sun Yat-sen University
  • Guang Tan Shenzhen Campus of Sun Yat-sen University

DOI:

https://doi.org/10.1609/aaai.v39i14.33611

Abstract

Vehicle-to-infrastructure (V2I) cooperative perception systems can enhance the sensing abilities of autonomous vehicles. Existing V2I solutions often consider LiDARs devices instead of cameras, the most prevalent sensors with low cost and wide installation. In addition, a major challenge that has been underexplored is the time asynchrony between image frames from different sources. This asynchrony arises because of clock differences, varying times involved in data processing and transmission, causing uncertain delays that complicate data alignment and potentially reduce perception accuracy. We propose BEVSync, a camera-based V2I cooperative perception system that adaptively aligns frames from the ego-vehicle and infrastructure by compensating for motion deviations. Specifically, we develop an extractor-compensator model to extract and predict perceptual features using historical frames, thereby smoothing out the data misalignment. Experiments on the real-world dataset DAIR-V2X show that our approach surpasses existing methods in terms of performance and robustness.

Published

2025-04-11

How to Cite

Wang, W., Wang, J., Deng, Y., & Tan, G. (2025). BEVSync: Asynchronous Data Alignment for Camera-based Vehicle-Infrastructure Cooperative Perception Under Uncertain Delays. Proceedings of the AAAI Conference on Artificial Intelligence, 39(14), 14699–14707. https://doi.org/10.1609/aaai.v39i14.33611

Issue

Section

AAAI Technical Track on Intelligent Robots