Unlocking the Power of Open Set: A New Perspective for Open-Set Noisy Label Learning

Wenhai Wan; Xinrui Wang; Ming-Kun Xie; Shao-Yuan Li; Sheng-Jun Huang; Songcan Chen

doi:10.1609/aaai.v38i14.29469

Authors

Wenhai Wan, Nanjing University of Aeronautics and Astronautics
Xinrui Wang, Nanjing University of Aeronautics and Astronautics
Ming-Kun Xie, Nanjing University of Aeronautics and Astronautics
Shao-Yuan Li, Nanjing University of Aeronautics and Astronautics
Sheng-Jun Huang, Nanjing University of Aeronautics and Astronautics
Songcan Chen, Nanjing University of Aeronautics and Astronautics

DOI:

https://doi.org/10.1609/aaai.v38i14.29469

Keywords:

ML: Classification and Regression, ML: Deep Learning Algorithms, ML: Other Foundations of Machine Learning, ML: Representation Learning, ML: Semi-Supervised Learning, ML: Unsupervised & Self-Supervised Learning

Abstract

Learning from noisy data has attracted much attention, but most methods focus on closed-set label noise. In real-world settings, however, open-set and closed-set noise commonly occur together. Existing methods typically identify and handle the two types of label noise separately, designing a specific strategy for each. Yet open-set examples can be hard to identify in practice, especially when the dataset is severely corrupted. Unlike previous work, we explore how models behave when faced with open-set examples and find that some open-set examples gradually become integrated into certain known classes, which benefits the separation among known classes. Motivated by this phenomenon, we propose CECL (Class Expansion Contrastive Learning), a novel two-step contrastive learning method that deals with both types of label noise by exploiting the useful information in open-set examples. Specifically, we incorporate some open-set examples into closed-set classes to enhance performance while treating others as delimiters to improve representation ability. Extensive experiments on synthetic and real-world datasets with diverse label noise demonstrate the effectiveness of CECL.
