R3CD: Scene Graph to Image Generation with Relation-Aware Compositional Contrastive Control Diffusion

Authors

  • Jinxiu Liu South China University of Technology
  • Qi Liu South China University of Technology

DOI:

https://doi.org/10.1609/aaai.v38i4.28155

Keywords:

CV: Language and Vision, CV: Multi-modal Vision

Abstract

Image generation tasks have achieved remarkable performance using large-scale diffusion models. However, these models are limited in their ability to capture the abstract relations (viz., interactions excluding positional relations) among multiple entities in complex scene graphs. Two main problems exist: 1) they fail to depict concise and accurate interactions via abstract relations; 2) they fail to generate complete entities. To address these problems, we propose a novel Relation-aware Compositional Contrastive Control Diffusion method, dubbed R3CD, that leverages large-scale diffusion models to learn abstract interactions from scene graphs. A scene graph transformer based on node and edge encoding is first designed to perceive both local and global information from input scene graphs, whose embeddings are initialized by a T5 model. Then a joint contrastive loss based on attention maps and denoising steps is developed to control the diffusion model to understand and generate images whose spatial structures and interaction features are consistent with the a priori relations. Extensive experiments on two datasets, Visual Genome and COCO-Stuff, demonstrate that the proposed method outperforms existing models on both quantitative and qualitative metrics, generating more realistic and diverse images according to different scene graph specifications.
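
As a rough illustration of the contrastive idea mentioned in the abstract, the sketch below implements a generic InfoNCE-style loss in plain Python. It is not the R3CD loss itself, whose exact form is not given here: the function names (`cosine`, `info_nce`), the `temperature` parameter, and the use of flat feature vectors standing in for attention-map features are all assumptions made for illustration. The loss is small when an anchor feature is closer to a feature sharing the same relation (the positive) than to features of different relations (the negatives).

```python
import math


def cosine(u, v):
    """Cosine similarity between two non-zero vectors of equal length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def info_nce(anchor, positive, negatives, temperature=0.1):
    """Generic InfoNCE loss (an assumed stand-in, not the paper's loss).

    anchor, positive: feature vectors assumed to share the same relation.
    negatives: list of feature vectors assumed to carry other relations.
    """
    pos = math.exp(cosine(anchor, positive) / temperature)
    neg = sum(math.exp(cosine(anchor, n) / temperature) for n in negatives)
    # Negative log-probability of picking the positive among all candidates.
    return -math.log(pos / (pos + neg))


# Hypothetical toy features: the first two point in nearly the same direction.
same_relation_a = [1.0, 0.0]
same_relation_b = [1.0, 0.1]
other_relation = [0.0, 1.0]

aligned = info_nce(same_relation_a, same_relation_b, [other_relation])
misaligned = info_nce(same_relation_a, other_relation, [same_relation_b])
```

In this toy setup `aligned` comes out smaller than `misaligned`, which is the behavior a contrastive objective rewards: features for the same relation are pulled together and features for different relations are pushed apart.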

Published

2024-03-24

How to Cite

Liu, J., & Liu, Q. (2024). R3CD: Scene Graph to Image Generation with Relation-Aware Compositional Contrastive Control Diffusion. Proceedings of the AAAI Conference on Artificial Intelligence, 38(4), 3657-3665. https://doi.org/10.1609/aaai.v38i4.28155

Section

AAAI Technical Track on Computer Vision III