Infusing Definiteness into Randomness: Rethinking Composition Styles for Deep Image Matting

Authors

  • Zixuan Ye Huazhong University of Science and Technology
  • Yutong Dai The University of Adelaide
  • Chaoyi Hong Huazhong University of Science and Technology
  • Zhiguo Cao Huazhong University of Science and Technology
  • Hao Lu Huazhong University of Science and Technology

DOI:

https://doi.org/10.1609/aaai.v37i3.25432

Keywords:

CV: Segmentation, CV: Low Level & Physics-Based Vision

Abstract

We study the composition style in deep image matting, a notion that characterizes a data generation flow on how to exploit limited foregrounds and random backgrounds to form a training dataset. Prior art executes this flow in a completely random manner by simply going through the foreground pool or by optionally combining two foregrounds before foreground-background composition. In this work, we first show that naive foreground combination can be problematic and therefore derive an alternative formulation to reasonably combine foregrounds. Our second contribution is an observation that matting performance can benefit from a certain occurrence frequency of combined foregrounds and their associated source foregrounds during training. Inspired by this, we introduce a novel composition style that binds the source and combined foregrounds in a definite triplet. In addition, we also find that different orders of foreground combination lead to different foreground patterns, which further inspires a quadruplet-based composition style. Results under controlled experiments on four matting baselines show that our composition styles outperform existing ones and invite consistent performance improvement on both composited and real-world datasets. Code is available at: https://github.com/coconuthust/composition_styles

Downloads

Published

2023-06-26

How to Cite

Ye, Z., Dai, Y., Hong, C., Cao, Z., & Lu, H. (2023). Infusing Definiteness into Randomness: Rethinking Composition Styles for Deep Image Matting. Proceedings of the AAAI Conference on Artificial Intelligence, 37(3), 3259-3266. https://doi.org/10.1609/aaai.v37i3.25432

Issue

Section

AAAI Technical Track on Computer Vision III