Multi-Energy Guided Image Translation with Stochastic Differential Equations for Near-Infrared Facial Expression Recognition

Authors

  • Bingjun Luo Tsinghua University
  • Zewen Wang Tsinghua University
  • Jinpeng Wang Tsinghua University
  • Junjie Zhu Tsinghua University
  • Xibin Zhao Tsinghua University
  • Yue Gao Tsinghua University

DOI:

https://doi.org/10.1609/aaai.v38i1.27812

Keywords:

CMS: Affective Computing, APP: Humanities & Computational Social Science, CMS: Social Cognition And Interaction

Abstract

Illumination variation has been a long-term challenge in real-world facial expression recognition (FER). Under uncontrolled or non-visible light conditions, near-infrared (NIR) can provide a simple and alternative solution to obtain high-quality images and supplement the geometric and texture details that are missing in the visible (VIS) domain. Due to the lack of large-scale NIR facial expression datasets, directly extending VIS FER methods to the NIR spectrum may be ineffective. Additionally, previous heterogeneous image synthesis methods are restricted by low controllability without prior task knowledge. To tackle these issues, we present the first approach, called for NIR-FER Stochastic Differential Equations (NFER-SDE), that transforms face expression appearance between heterogeneous modalities to the overfitting problem on small-scale NIR data. NFER-SDE can take the whole VIS source image as input and, together with domain-specific knowledge, guide the preservation of modality-invariant information in the high-frequency content of the image. Extensive experiments and ablation studies show that NFER-SDE significantly improves the performance of NIR FER and achieves state-of-the-art results on the only two available NIR FER datasets, Oulu-CASIA and Large-HFE.

Published

2024-03-25

How to Cite

Luo, B., Wang, Z., Wang, J., Zhu, J., Zhao, X., & Gao, Y. (2024). Multi-Energy Guided Image Translation with Stochastic Differential Equations for Near-Infrared Facial Expression Recognition. Proceedings of the AAAI Conference on Artificial Intelligence, 38(1), 565-573. https://doi.org/10.1609/aaai.v38i1.27812

Issue

Section

AAAI Technical Track on Cognitive Modeling & Cognitive Systems