Self-Supervised Sketch-to-Image Synthesis

Bingchen Liu; Yizhe Zhu; Kunpeng Song; Ahmed Elgammal

doi:10.1609/aaai.v35i3.16304

Authors

Bingchen Liu Playform - Artrendex Inc., USA Department of Computer Science, Rutgers University
Yizhe Zhu Department of Computer Science, Rutgers University
Kunpeng Song Playform - Artrendex Inc., USA Department of Computer Science, Rutgers University
Ahmed Elgammal Playform - Artrendex Inc., USA Department of Computer Science, Rutgers University

DOI:

https://doi.org/10.1609/aaai.v35i3.16304

Keywords:

Computational Photography, Image & Video Synthesis, Art/Music/Creativity, (Deep) Neural Network Algorithms

Abstract

Imagining a colored realistic image from an arbitrary-drawn sketch is one of human capabilities that we eager machines to mimic. Unlike previous methods that either require the sketch-image pairs or utilize low-quantity detected edges as sketches, we study the exemplar-based sketch-to-image (s2i) synthesis task in a self-supervised learning manner, eliminating the necessity of the paired sketch data. To this end, we first propose an unsupervised method to efficiently synthesize line-sketches for general RGB-only datasets. With the synthetic paired-data, we then present a self-supervised Auto-Encoder (AE) to decouple the content/style features from sketches and RGB-images, and synthesize images both content-faithful to the sketches and style-consistent to the RGB-images. While prior works employ either the cycle-consistence loss or dedicated attentional modules to enforce the content/style fidelity, we show AE's superior performance with pure self-supervisions. To further improve the synthesis quality in high resolution, we also leverage an adversarial network to refine the details of synthetic images. Extensive experiments on $1024^2$ resolution demonstrate a new state-of-art-art performance of the proposed model on CelebA-HQ and Wiki-Art datasets. Moreover, with the proposed sketch generator, the model shows a promising performance on style mixing and style transfer, which the synthesized images are not only style-consistent but also semantically meaningful.

Self-Supervised Sketch-to-Image Synthesis

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information