Adaptive Agent Selection and Interaction Network for Image-to-Point Cloud Registration

Authors

  • Zhixin Cheng University of Science and Technology of China
  • Xiaotian Yin University of Science and Technology of China
  • Jiacheng Deng University of Science and Technology of China
  • Bohao Liao University of Science and Technology of China
  • Yujia Chen University of Science and Technology of China
  • Xu Zhou Sangfor Technologies Inc
  • Baoqun Yin University of Science and Technology of China
  • Tianzhu Zhang University of Science and Technology of China; Deep Space Exploration Laboratory

DOI:

https://doi.org/10.1609/aaai.v40i5.37329

Abstract

Typical detection-free methods for image-to-point cloud registration leverage transformer-based architectures to aggregate cross-modal features and establish correspondences. However, they often struggle under challenging conditions, where noise disrupts similarity computation and leads to incorrect correspondences. Moreover, without dedicated designs, it remains difficult to effectively select informative and correlated representations across modalities, thereby limiting the robustness and accuracy of registration. To address these challenges, we propose a novel cross-modal registration framework composed of two key modules: the Iterative Agents Selection (IAS) module and the Reliable Agents Interaction (RAI) module. IAS enhances structural feature awareness with phase maps and employs reinforcement learning principles to efficiently select reliable agents. RAI then leverages these selected agents to guide cross-modal interactions, effectively reducing mismatches and improving overall robustness. Extensive experiments on the RGB-D Scenes v2 and 7-Scenes benchmarks demonstrate that our method consistently achieves state-of-the-art performance.

Downloads

Published

2026-03-14

How to Cite

Cheng, Z., Yin, X., Deng, J., Liao, B., Chen, Y., Zhou, X., … Zhang, T. (2026). Adaptive Agent Selection and Interaction Network for Image-to-Point Cloud Registration. Proceedings of the AAAI Conference on Artificial Intelligence, 40(5), 3335–3343. https://doi.org/10.1609/aaai.v40i5.37329

Issue

Section

AAAI Technical Track on Computer Vision II