TopicFM: Robust and Interpretable Topic-Assisted Feature Matching

Authors

  • Khang Truong Giang KAIST
  • Soohwan Song Electronics and Telecommunications Research Institute (ETRI)
  • Sungho Jo KAIST

DOI:

https://doi.org/10.1609/aaai.v37i2.25341

Keywords:

CV: 3D Computer Vision, CV: Representation Learning for Vision, CV: Vision for Robotics & Autonomous Driving, ML: Probabilistic Methods

Abstract

This study addresses an image-matching problem in challenging cases, such as large scene variations or textureless scenes. To gain robustness to such situations, most previous studies have attempted to encode the global contexts of a scene via graph neural networks or transformers. However, these contexts do not explicitly represent high-level contextual information, such as structural shapes or semantic instances; therefore, the encoded features are still not sufficiently discriminative in challenging scenes. We propose a novel image-matching method that applies a topic-modeling strategy to encode high-level contexts in images. The proposed method trains latent semantic instances called topics. It explicitly models an image as a multinomial distribution of topics, and then performs probabilistic feature matching. This approach improves the robustness of matching by focusing on the same semantic areas between the images. In addition, the inferred topics provide interpretability for matching the results, making our method explainable. Extensive experiments on outdoor and indoor datasets show that our method outperforms other state-of-the-art methods, particularly in challenging cases.

Downloads

Published

2023-06-26

How to Cite

Truong Giang, K., Song, S., & Jo, S. (2023). TopicFM: Robust and Interpretable Topic-Assisted Feature Matching. Proceedings of the AAAI Conference on Artificial Intelligence, 37(2), 2447-2455. https://doi.org/10.1609/aaai.v37i2.25341

Issue

Section

AAAI Technical Track on Computer Vision II