Shadow Generation with Decomposed Mask Prediction and Attentive Shadow Filling

Authors

  • Xinhao Tao Shanghai Jiao Tong University
  • Junyan Cao Shanghai Jiao Tong University
  • Yan Hong Ant Group
  • Li Niu Shanghai Jiao Tong University

DOI:

https://doi.org/10.1609/aaai.v38i6.28326

Keywords:

CV: Computational Photography, Image & Video Synthesis

Abstract

Image composition refers to inserting a foreground object into a background image to obtain a composite image. In this work, we focus on generating plausible shadows for the inserted foreground object to make the composite image more realistic. To supplement the existing small-scale dataset, we create a large-scale dataset called RdSOBA with rendering techniques. Moreover, we design a two-stage network named DMASNet with decomposed mask prediction and attentive shadow filling. Specifically, in the first stage, we decompose shadow mask prediction into box prediction and shape prediction. In the second stage, we attend to reference background shadow pixels to fill the foreground shadow. Abundant experiments prove that our DMASNet achieves better visual effects and generalizes well to real composite images.

Published

2024-03-24

How to Cite

Tao, X., Cao, J., Hong, Y., & Niu, L. (2024). Shadow Generation with Decomposed Mask Prediction and Attentive Shadow Filling. Proceedings of the AAAI Conference on Artificial Intelligence, 38(6), 5198–5206. https://doi.org/10.1609/aaai.v38i6.28326

Issue

Section

AAAI Technical Track on Computer Vision V