Uncertainty Estimation via Response Scaling for Pseudo-Mask Noise Mitigation in Weakly-Supervised Semantic Segmentation

Authors

  • Yi Li The Hong Kong University of Science and Technology, SenseTime Research
  • Yiqun Duan University of Technology Sydney
  • Zhanghui Kuang Sensetime Ltd.
  • Yimin Chen SenseTime
  • Wayne Zhang SenseTime Research
  • Xiaomeng Li The Hong Kong University of Science and Technology

DOI:

https://doi.org/10.1609/aaai.v36i2.20034

Keywords:

Computer Vision (CV)

Abstract

Weakly-Supervised Semantic Segmentation (WSSS) segments objects without heavy burden of dense annotation. While as a price, generated pseudo-masks exist obvious noisy pixels, which result in sub-optimal segmentation models trained over these pseudo-masks. But rare studies notice or work on this problem, even these noisy pixels are inevitable after their improvements on pseudo-mask. So we try to improve WSSS in the aspect of noise mitigation. And we observe that many noisy pixels are of high confidences, especially when the response range is too wide or narrow, presenting an uncertain status. Thus, in this paper, we simulate noisy variations of response by scaling the prediction map in multiple times for uncertainty estimation. The uncertainty is then used to weight the segmentation loss to mitigate noisy supervision signals. We call this method URN, abbreviated from Uncertainty estimation via Response scaling for Noise mitigation. Experiments validate the benefits of URN, and our method achieves state-of-the-art results at 71.2% and 41.5% on PASCAL VOC 2012 and MS COCO 2014 respectively, without extra models like saliency detection. Code is available at https://github.com/XMed-Lab/URN.

Downloads

Published

2022-06-28

How to Cite

Li, Y., Duan, Y., Kuang, Z., Chen, Y., Zhang, W., & Li, X. (2022). Uncertainty Estimation via Response Scaling for Pseudo-Mask Noise Mitigation in Weakly-Supervised Semantic Segmentation. Proceedings of the AAAI Conference on Artificial Intelligence, 36(2), 1447-1455. https://doi.org/10.1609/aaai.v36i2.20034

Issue

Section

AAAI Technical Track on Computer Vision II