Self-Supervised Sketch-to-Image Synthesis

Authors

  • Bingchen Liu Playform - Artrendex Inc., USA Department of Computer Science, Rutgers University
  • Yizhe Zhu Department of Computer Science, Rutgers University
  • Kunpeng Song Playform - Artrendex Inc., USA Department of Computer Science, Rutgers University
  • Ahmed Elgammal Playform - Artrendex Inc., USA Department of Computer Science, Rutgers University

DOI:

https://doi.org/10.1609/aaai.v35i3.16304

Keywords:

Computational Photography, Image & Video Synthesis, Art/Music/Creativity, (Deep) Neural Network Algorithms

Abstract

Imagining a colored realistic image from an arbitrary-drawn sketch is one of human capabilities that we eager machines to mimic. Unlike previous methods that either require the sketch-image pairs or utilize low-quantity detected edges as sketches, we study the exemplar-based sketch-to-image (s2i) synthesis task in a self-supervised learning manner, eliminating the necessity of the paired sketch data. To this end, we first propose an unsupervised method to efficiently synthesize line-sketches for general RGB-only datasets. With the synthetic paired-data, we then present a self-supervised Auto-Encoder (AE) to decouple the content/style features from sketches and RGB-images, and synthesize images both content-faithful to the sketches and style-consistent to the RGB-images. While prior works employ either the cycle-consistence loss or dedicated attentional modules to enforce the content/style fidelity, we show AE's superior performance with pure self-supervisions. To further improve the synthesis quality in high resolution, we also leverage an adversarial network to refine the details of synthetic images. Extensive experiments on $1024^2$ resolution demonstrate a new state-of-art-art performance of the proposed model on CelebA-HQ and Wiki-Art datasets. Moreover, with the proposed sketch generator, the model shows a promising performance on style mixing and style transfer, which the synthesized images are not only style-consistent but also semantically meaningful.

Downloads

Published

2021-05-18

How to Cite

Liu, B., Zhu, Y., Song, K., & Elgammal, A. (2021). Self-Supervised Sketch-to-Image Synthesis. Proceedings of the AAAI Conference on Artificial Intelligence, 35(3), 2073-2081. https://doi.org/10.1609/aaai.v35i3.16304

Issue

Section

AAAI Technical Track on Computer Vision II