Exploiting Polarized Material Cues for Robust Car Detection
DOI:
https://doi.org/10.1609/aaai.v38i2.27922Keywords:
CV: Multi-modal Vision, CV: Object Detection & CategorizationAbstract
Car detection is an important task that serves as a crucial prerequisite for many automated driving functions. The large variations in lighting/weather conditions and vehicle densities of the scenes pose significant challenges to existing car detection algorithms to meet the highly accurate perception demand for safety, due to the unstable/limited color information, which impedes the extraction of meaningful/discriminative features of cars. In this work, we present a novel learning-based car detection method that leverages trichromatic linear polarization as an additional cue to disambiguate such challenging cases. A key observation is that polarization, characteristic of the light wave, can robustly describe intrinsic physical properties of the scene objects in various imaging conditions and is strongly linked to the nature of materials for cars (e.g., metal and glass) and their surrounding environment (e.g., soil and trees), thereby providing reliable and discriminative features for robust car detection in challenging scenes. To exploit polarization cues, we first construct a pixel-aligned RGB-Polarization car detection dataset, which we subsequently employ to train a novel multimodal fusion network. Our car detection network dynamically integrates RGB and polarization features in a request-and-complement manner and can explore the intrinsic material properties of cars across all learning samples. We extensively validate our method and demonstrate that it outperforms state-of-the-art detection methods. Experimental results show that polarization is a powerful cue for car detection. Our code is available at https://github.com/wind1117/AAAI24-PCDNet.Downloads
Published
2024-03-24
How to Cite
Dong, W., Mei, H., Wei, Z., Jin, A., Qiu, S., Zhang, Q., & Yang, X. (2024). Exploiting Polarized Material Cues for Robust Car Detection. Proceedings of the AAAI Conference on Artificial Intelligence, 38(2), 1564-1572. https://doi.org/10.1609/aaai.v38i2.27922
Issue
Section
AAAI Technical Track on Computer Vision I