G2L-CariGAN: Caricature Generation from Global Structure to Local Features

Authors

  • Xin Huang Tongji University
  • Yunfeng Bai Tongji University
  • Dong Liang Tongji University
  • Feng Tian Duke Kunshan University
  • Jinyuan Jia Tongji University

DOI:

https://doi.org/10.1609/aaai.v38i3.28014

Keywords:

CV: Computational Photography, Image & Video Synthesis, CV: Applications

Abstract

Existing GAN-based approaches to caricature generation mainly focus on exaggerating a character’s global facial structure. This often leads to the failure in highlighting significant facial features such as big eyes and hook nose. To address this limitation, we propose a new approach termed as G2L-CariGAN, which uses feature maps of spatial dimensions instead of latent codes for geometric exaggeration. G2L-CariGAN first exaggerates the global facial structure of the character on a low-dimensional feature map and then exaggerates its local facial features on a high-dimensional feature map. Moreover, we develop a caricature identity loss function based on feature maps, which well retains the character's identity after exaggeration. Our experiments have demonstrated that G2L-CariGAN outperforms the state-of-arts in terms of the quality of exaggerating a character and retaining its identity.

Published

2024-03-24

How to Cite

Huang, X., Bai, Y., Liang, D., Tian, F., & Jia, J. (2024). G2L-CariGAN: Caricature Generation from Global Structure to Local Features. Proceedings of the AAAI Conference on Artificial Intelligence, 38(3), 2391-2399. https://doi.org/10.1609/aaai.v38i3.28014

Issue

Section

AAAI Technical Track on Computer Vision II