Preserving Structural Consistency in Arbitrary Artist and Artwork Style Transfer

Authors

  • Jingyu Wu Alibaba-Zhejiang University Joint Institute of Frontier Technologies, Zhejiang University
  • Lefan Hou Alibaba-Zhejiang University Joint Institute of Frontier Technologies, Zhejiang University
  • Zejian Li School of Software Technology, Zhejiang University Alibaba-Zhejiang University Joint Institute of Frontier Technologies, Zhejiang University
  • Jun Liao School of Big Data & Software Engineering, Chongqing University
  • Li Liu School of Big Data & Software Engineering, Chongqing University
  • Lingyun Sun Alibaba-Zhejiang University Joint Institute of Frontier Technologies, Zhejiang University Zhejiang-Singapore Innovation and AI Joint Research Lab

DOI:

https://doi.org/10.1609/aaai.v37i3.25384

Keywords:

CV: Computational Photography, Image & Video Synthesis

Abstract

Deep generative models are effective in style transfer. Previous methods learn one or several specific artist-style from a collection of artworks. These methods not only homogenize the artist-style of different artworks of the same artist but also lack generalization for the unseen artists. To solve these challenges, we propose a double-style transferring module (DSTM). It extracts different artist-style and artwork-style from different artworks (even untrained) and preserves the intrinsic diversity between different artworks of the same artist. DSTM swaps the two styles in the adversarial training and encourages realistic image generation given arbitrary style combinations. However, learning style from single artwork can often cause over-adaption to it, resulting in the introduction of structural features of style image. We further propose an edge enhancing module (EEM) which derives edge information from multi-scale and multi-level features to enhance structural consistency. We broadly evaluate our method across six large-scale benchmark datasets. Empirical results show that our method achieves arbitrary artist-style and artwork-style extraction from a single artwork, and effectively avoids introducing the style image’s structural features. Our method improves the state-of-the-art deception rate from 58.9% to 67.2% and the average FID from 48.74 to 42.83.

Downloads

Published

2023-06-26

How to Cite

Wu, J., Hou, L., Li, Z., Liao, J., Liu, L., & Sun, L. (2023). Preserving Structural Consistency in Arbitrary Artist and Artwork Style Transfer. Proceedings of the AAAI Conference on Artificial Intelligence, 37(3), 2830-2838. https://doi.org/10.1609/aaai.v37i3.25384

Issue

Section

AAAI Technical Track on Computer Vision III