Symmetry-Aware Transformer-Based Mirror Detection

Authors

  • Tianyu Huang Harbin Institute of Technology City University of Hong Kong
  • Bowen Dong Harbin Institute of Technology
  • Jiaying Lin City University of Hong Kong
  • Xiaohui Liu Harbin Institute of Technology
  • Rynson W.H. Lau City University of Hong Kong
  • Wangmeng Zuo Harbin Institute of Technology Peng Cheng Laboratory

DOI:

https://doi.org/10.1609/aaai.v37i1.25173

Keywords:

CV: Low Level & Physics-Based Vision

Abstract

Mirror detection aims to identify the mirror regions in the given input image. Existing works mainly focus on integrating the semantic features and structural features to mine specific relations between mirror and non-mirror regions, or introducing mirror properties like depth or chirality to help analyze the existence of mirrors. In this work, we observe that a real object typically forms a loose symmetry relationship with its corresponding reflection in the mirror, which is beneficial in distinguishing mirrors from real objects. Based on this observation, we propose a dual-path Symmetry-Aware Transformer-based mirror detection Network (SATNet), which includes two novel modules: Symmetry-Aware Attention Module (SAAM) and Contrast and Fusion Decoder Module (CFDM). Specifically, we first adopt a transformer backbone to model global information aggregation in images, extracting multi-scale features in two paths. We then feed the high-level dual-path features to SAAMs to capture the symmetry relations. Finally, we fuse the dual-path features and refine our prediction maps progressively with CFDMs to obtain the final mirror mask. Experimental results show that SATNet outperforms both RGB and RGB-D mirror detection methods on all available mirror detection datasets.

Downloads

Published

2023-06-26

How to Cite

Huang, T., Dong, B., Lin, J., Liu, X., W.H. Lau, R., & Zuo, W. (2023). Symmetry-Aware Transformer-Based Mirror Detection. Proceedings of the AAAI Conference on Artificial Intelligence, 37(1), 935-943. https://doi.org/10.1609/aaai.v37i1.25173

Issue

Section

AAAI Technical Track on Computer Vision I