Multi-Domain Incremental Learning for Face Presentation Attack Detection

Authors

  • Keyao Wang Baidu Inc.
  • Guosheng Zhang Baidu Inc.
  • Haixiao Yue Baidu Inc.
  • Ajian Liu University of Chinese Academy of Sciences
  • Gang Zhang Baidu Inc.
  • Haocheng Feng Baidu Inc.
  • Junyu Han Baidu Inc.
  • Errui Ding Baidu Inc.
  • Jingdong Wang Baidu Inc.

DOI:

https://doi.org/10.1609/aaai.v38i6.28359

Keywords:

CV: Biometrics, Face, Gesture & Pose

Abstract

Previous face Presentation Attack Detection (PAD) methods aim to improve the effectiveness of cross-domain tasks. However, in real-world scenarios, the original training data of the pre-trained model is not available due to data privacy or other reasons. Under these constraints, general methods for fine-tuning single-target domain data may lose previously learned knowledge, leading to a catastrophic forgetting problem. To address these issues, we propose a multi-domain incremental learning (MDIL) method for PAD, which not only learns knowledge well from the new domain but also maintains the performance of previous domains stably. Specifically, we propose an adaptive domain-specific experts (ADE) framework based on the vision transformer to preserve the discriminability of previous domains. Furthermore, an asymmetric classifier is designed to keep the output distribution of different classifiers consistent, thereby improving the generalization ability. Extensive experiments show that our proposed method achieves state-of-the-art performance compared to prior methods of incremental learning. Excitingly, under more stringent setting conditions, our method approximates or even outperforms the DA/DG-based methods.

Published

2024-03-24

How to Cite

Wang, K., Zhang, G., Yue, H., Liu, A., Zhang, G., Feng, H., Han, J., Ding, E., & Wang, J. (2024). Multi-Domain Incremental Learning for Face Presentation Attack Detection. Proceedings of the AAAI Conference on Artificial Intelligence, 38(6), 5499-5507. https://doi.org/10.1609/aaai.v38i6.28359

Issue

Section

AAAI Technical Track on Computer Vision V