Progressive Neighborhood Aggregation for Semantic Segmentation Refinement

Authors

  • Ting Liu Northwestern Polytechnical University
  • Yunchao Wei Beijing Jiaotong University
  • Yanning Zhang Northwestern Polytechnical University

DOI:

https://doi.org/10.1609/aaai.v37i2.25262

Keywords:

CV: Segmentation, CV: Applications, CV: Scene Analysis & Understanding, ML: Applications, ML: Deep Neural Architectures

Abstract

Multi-scale features from backbone networks have been widely applied to recover object details in segmentation tasks. Generally, the multi-level features are fused in a certain manner for further pixel-level dense prediction. Whereas, the spatial structure information is not fully explored, that is similar nearby pixels can be used to complement each other. In this paper, we investigate a progressive neighborhood aggregation (PNA) framework to refine the semantic segmentation prediction, resulting in an end-to-end solution that can perform the coarse prediction and refinement in a unified network. Specifically, we first present a neighborhood aggregation module, the neighborhood similarity matrices for each pixel are estimated on multi-scale features, which are further used to progressively aggregate the high-level feature for recovering the spatial structure. In addition, to further integrate the high-resolution details into the aggregated feature, we apply a self-aggregation module on the low-level features to emphasize important semantic information for complementing losing spatial details. Extensive experiments on five segmentation datasets, including Pascal VOC 2012, CityScapes, COCO-Stuff 10k, DeepGlobe, and Trans10k, demonstrate that the proposed framework can be cascaded into existing segmentation models providing consistent improvements. In particular, our method achieves new state-of-the-art performances on two challenging datasets, DeepGlobe and Trans10k. The code is available at https://github.com/liutinglt/PNA.

Downloads

Published

2023-06-26

How to Cite

Liu, T., Wei, Y., & Zhang, Y. (2023). Progressive Neighborhood Aggregation for Semantic Segmentation Refinement. Proceedings of the AAAI Conference on Artificial Intelligence, 37(2), 1737-1745. https://doi.org/10.1609/aaai.v37i2.25262

Issue

Section

AAAI Technical Track on Computer Vision II