Gated Fully Fusion for Semantic Segmentation

Authors

  • Xiangtai Li Peking University
  • Houlong Zhao DeepMotion
  • Lei Han Tecent AI lab
  • Yunhai Tong Peking University
  • Shaohua Tan Peking University
  • Kuiyuan Yang DeepMotion

DOI:

https://doi.org/10.1609/aaai.v34i07.6805

Abstract

Semantic segmentation generates comprehensive understanding of scenes through densely predicting the category for each pixel. High-level features from Deep Convolutional Neural Networks already demonstrate their effectiveness in semantic segmentation tasks, however the coarse resolution of high-level features often leads to inferior results for small/thin objects where detailed information is important. It is natural to consider importing low level features to compensate for the lost detailed information in high-level features. Unfortunately, simply combining multi-level features suffers from the semantic gap among them. In this paper, we propose a new architecture, named Gated Fully Fusion(GFF), to selectively fuse features from multiple levels using gates in a fully connected way. Specifically, features at each level are enhanced by higher-level features with stronger semantics and lower-level features with more details, and gates are used to control the propagation of useful information which significantly reduces the noises during fusion. We achieve the state of the art results on four challenging scene parsing datasets including Cityscapes, Pascal Context, COCO-stuff and ADE20K.

Downloads

Published

2020-04-03

How to Cite

Li, X., Zhao, H., Han, L., Tong, Y., Tan, S., & Yang, K. (2020). Gated Fully Fusion for Semantic Segmentation. Proceedings of the AAAI Conference on Artificial Intelligence, 34(07), 11418-11425. https://doi.org/10.1609/aaai.v34i07.6805

Issue

Section

AAAI Technical Track: Vision