IGIANet: Illumination Guided Implicit Alignment Network for Infrared–Visible UAV Detection
DOI:
https://doi.org/10.1609/aaai.v40i4.37298Abstract
Visible-Infrared (RGB-IR) Unmanned Aerial Vehicle (UAV) object detection integrates complementary cues from visible and infrared sensors, offering broad application potential. However, due to sensor parallax, it still faces the challenge of weak spatial misalignment, which significantly limits its performance in UAV-based object detection. Existing methods emphasize strict alignment, overlooking spectral heterogeneity under varying illumination. To address these issues, we propose the Illumination Guided Implicit Alignment Network (IGIANet) to mitigate modality heterogeneity without explicit alignment. Specifically, we integrate three novel modules. First, we propose an illumination-guided frequency modulation module that adaptively allocates fusion weights to visible and infrared features based on global illumination estimation, effectively alleviating modality imbalance under varying lighting conditions. Second, we introduce a frequency-guided cross-modality differential enhancement module, which computes differential cues across frequency domains to enhance complementary information and highlight weakly aligned and low-contrast regions. Finally, we introduce an implicit alignment-driven dynamic fusion module that actively estimates offsets and generates dynamic, position-adaptive fusion kernels to align and fuse modalities. Extensive experiments demonstrate that IGIANet outperforms state-of-the-art models on various benchmarks, achieving 80.9% mAP on DroneVehicle, 57.1% mAP on VEDAI, and 49.4% mAP on FLIR.Downloads
Published
2026-03-14
How to Cite
Chen, X., Zhang, D., Zhao, L., Yang, C., Chen, Z., Lou, J., Zheng, Z., Jeon, S.-W., & Wang, H. (2026). IGIANet: Illumination Guided Implicit Alignment Network for Infrared–Visible UAV Detection. Proceedings of the AAAI Conference on Artificial Intelligence, 40(4), 3056-3064. https://doi.org/10.1609/aaai.v40i4.37298
Issue
Section
AAAI Technical Track on Computer Vision I