Demodalizing Face Recognition with Synthetic Samples

Authors

  • Zhonghua Zhai Hikvision Research Institute Zhejiang University
  • Pengju Yang Hikvision Research Institute
  • Xiaofeng Zhang Hikvision Research Institute
  • Maji Huang Hikvision Research Institute
  • Haijing Cheng Hikvision Research Institute
  • Xuejun Yan Hikvision Research Institue
  • Chunmao Wang Hikvision Research Institute
  • Shiliang Pu Hikvision Research Institute

DOI:

https://doi.org/10.1609/aaai.v35i4.16439

Keywords:

Biometrics, Face, Gesture & Pose, Computational Photography, Image & Video Synthesis

Abstract

Using data generated by generative adversarial networks or three-dimensional (3D) technology for face recognition training is a theoretically reasonable solution to the problems of unbalanced data distributions and data scarcity. However, due to the modal difference between synthetic data and real data, the direct use of data for training often leads to a decrease in the recognition performance, and the effect of synthetic data on recognition remains ambiguous. In this paper, after observing in experiments that modality information has a fixed form, we propose a demodalizing face recognition training architecture for the first time and provide a feasible method for recognition training using synthetic samples. Specifically, three different demodalizing training methods, from implicit to explicit, are proposed. These methods gradually reveal a generated modality that is difficult to quantify or describe. By removing the modalities of the synthetic data, the performance degradation is greatly alleviated. We validate the effectiveness of our approach on various benchmarks of large-scale face recognition and outperform the previous methods, especially in the low FAR range.

Downloads

Published

2021-05-18

How to Cite

Zhai, Z., Yang, P., Zhang, X., Huang, M., Cheng, H., Yan, X., Wang, C., & Pu, S. (2021). Demodalizing Face Recognition with Synthetic Samples. Proceedings of the AAAI Conference on Artificial Intelligence, 35(4), 3278-3286. https://doi.org/10.1609/aaai.v35i4.16439

Issue

Section

AAAI Technical Track on Computer Vision III