MatchDet: A Collaborative Framework for Image Matching and Object Detection

Authors

  • Jinxiang Lai Tencent
  • Wenlong Wu Tencent
  • Bin-Bin Gao Tencent
  • Jun Liu Tencent
  • Jiawei Zhan Tencent
  • Congchong Nie Tencent
  • Yi Zeng Tencent
  • Chengjie Wang Tencent Shanghai Jiao Tong University

DOI:

https://doi.org/10.1609/aaai.v38i3.28066

Keywords:

CV: Vision for Robotics & Autonomous Driving, CV: Object Detection & Categorization

Abstract

Image matching and object detection are two fundamental and challenging tasks, while many related applications consider them two individual tasks (i.e. task-individual). In this paper, a collaborative framework called MatchDet (i.e. task-collaborative) is proposed for image matching and object detection to obtain mutual improvements. To achieve the collaborative learning of the two tasks, we propose three novel modules, including a Weighted Spatial Attention Module (WSAM) for Detector, and Weighted Attention Module (WAM) and Box Filter for Matcher. Specifically, the WSAM highlights the foreground regions of target image to benefit the subsequent detector, the WAM enhances the connection between the foreground regions of pair images to ensure high-quality matches, and Box Filter mitigates the impact of false matches. We evaluate the approaches on a new benchmark with two datasets called Warp-COCO and miniScanNet. Experimental results show our approaches are effective and achieve competitive improvements.

Downloads

Published

2024-03-24

How to Cite

Lai, J., Wu, W., Gao, B.-B., Liu, J., Zhan, J., Nie, C., Zeng, Y., & Wang, C. (2024). MatchDet: A Collaborative Framework for Image Matching and Object Detection. Proceedings of the AAAI Conference on Artificial Intelligence, 38(3), 2858-2865. https://doi.org/10.1609/aaai.v38i3.28066

Issue

Section

AAAI Technical Track on Computer Vision II