Enhancing Identity-Deformation Disentanglement in StyleGAN for One-Shot Face Video Re-Enactment

Authors

  • Qing Chang, State Key Lab of CAD&CG, Zhejiang University, Hangzhou, China
  • Yao-Xiang Ding, State Key Lab of CAD&CG, Zhejiang University, Hangzhou, China
  • Kun Zhou, State Key Lab of CAD&CG, Zhejiang University, Hangzhou, China

DOI

https://doi.org/10.1609/aaai.v39i2.32113

Abstract

The task of one-shot face video re-enactment is to generate a target face video that combines the identity of a single source frame with the facial deformation of a driving video. High-quality generation requires precisely disentangling identity-related from identity-independent characteristics while also building expressive features that preserve high-frequency facial details; existing approaches still leave both challenges unaddressed. To deal with these two challenges, we propose a two-stage generation model based on StyleGAN. Its key novel techniques are (1) better disentanglement of identity and deformation codes in the latent space through identity-based modeling, and (2) manipulation of intermediate StyleGAN features at the second stage to augment the facial details of the generated targets. To further improve identity consistency, we introduce a data augmentation method during training that enhances key identity-affecting features such as hair and wrinkles. Extensive experimental results demonstrate the superiority of our approach over state-of-the-art methods.

Published

2025-04-11

How to Cite

Chang, Q., Ding, Y.-X., & Zhou, K. (2025). Enhancing Identity-Deformation Disentanglement in StyleGAN for One-Shot Face Video Re-Enactment. Proceedings of the AAAI Conference on Artificial Intelligence, 39(2), 1247–1255. https://doi.org/10.1609/aaai.v39i2.32113

Section

AAAI Technical Track on Cognitive Modeling & Cognitive Systems