Improving Label Noise Robustness with Data Augmentation and Semi-Supervised Learning (Student Abstract)

Kento Nishi; Yi Ding; Alex Rich; Tobias Höllerer

doi:10.1609/aaai.v35i18.17924

Authors

Kento Nishi Lynbrook High School, San Jose, CA 95129
Yi Ding University of California, Santa Barbara
Alex Rich University of California, Santa Barbara
Tobias Höllerer University of California, Santa Barbara

DOI:

https://doi.org/10.1609/aaai.v35i18.17924

Keywords:

Learning With Noisy Labels, Label Noise Robustness, Data Augmentation, Semi-Supervised Learning, Image Classification

Abstract

Modern machine learning algorithms typically require large amounts of labeled training data to fit a reliable model. To minimize the cost of data collection, researchers often employ techniques such as crowdsourcing and web scraping. However, web data and human annotations are known to exhibit high margins of error, resulting in sizable amounts of incorrect labels. Poorly labeled training data can cause models to overfit to the noise distribution, crippling performance in real-world applications. In this work, we investigate the viability of using data augmentation in conjunction with semi-supervised learning to improve the label noise robustness of image classification models. We conduct several experiments using noisy variants of the CIFAR-10 image classification dataset to benchmark our method against existing algorithms. Experimental results show that our augmentative SSL approach improves upon the state-of-the-art.

Improving Label Noise Robustness with Data Augmentation and Semi-Supervised Learning (Student Abstract)

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information