MSML: Enhancing Occlusion-Robustness by Multi-Scale Segmentation-Based Mask Learning for Face Recognition

Authors

  • Ge Yuan Sun Yat-sen University
  • Huicheng Zheng Sun Yat-sen University
  • Jiayu Dong Sun Yat-Sen University

DOI:

https://doi.org/10.1609/aaai.v36i3.20228

Keywords:

Computer Vision (CV)

Abstract

In unconstrained scenarios, face recognition remains challenging, particularly when faces are occluded. Existing methods generalize poorly due to the distribution distortion induced by unpredictable occlusions. To tackle this problem, we propose a hierarchical segmentation-based mask learning strategy for face recognition, enhancing occlusion-robustness by integrating segmentation representations of occlusion into face recognition in the latent space. We present a novel multi-scale segmentation-based mask learning (MSML) network, which consists of a face recognition branch (FRB), an occlusion segmentation branch (OSB), and hierarchical elaborate feature masking (FM) operators. With the guidance of hierarchical segmentation representations of occlusion learned by the OSB, the FM operators can generate multi-scale latent masks to eliminate mistaken responses introduced by occlusions and purify the contaminated facial features at multiple layers. In this way, the proposed MSML network can effectively identify and remove the occlusions from feature representations at multiple levels and aggregate features from visible facial areas. Experiments on face verification and recognition under synthetic or realistic occlusions demonstrate the effectiveness of our method compared to state-of-the-art methods.

Downloads

Published

2022-06-28

How to Cite

Yuan, G., Zheng, H., & Dong, J. (2022). MSML: Enhancing Occlusion-Robustness by Multi-Scale Segmentation-Based Mask Learning for Face Recognition. Proceedings of the AAAI Conference on Artificial Intelligence, 36(3), 3197-3205. https://doi.org/10.1609/aaai.v36i3.20228

Issue

Section

AAAI Technical Track on Computer Vision III