Robust Semi-Supervised Learning through Label Aggregation

Authors

  • Yan Yan University of Technology Sydney
  • Zhongwen Xu University of Technology Sydney
  • Ivor Tsang University of Technology Sydney
  • Guodong Long University of Technology Sydney
  • Yi Yang University of Technology Sydney

DOI:

https://doi.org/10.1609/aaai.v30i1.10276

Abstract

Semi-supervised learning is proposed to exploit both labeled and unlabeled data. However, as the scale of data in real world applications increases significantly, conventional semi-supervised algorithms usually lead to massive computational cost and cannot be applied to large scale datasets. In addition, label noise is usually present in the practical applications due to human annotation, which very likely results in remarkable degeneration of performance in semi-supervised methods. To address these two challenges, in this paper, we propose an efficient RObust Semi-Supervised Ensemble Learning (ROSSEL) method, which generates pseudo-labels for unlabeled data using a set of weak annotators, and combines them to approximate the ground-truth labels to assist semi-supervised learning. We formulate the weighted combination process as a multiple label kernel learning (MLKL) problem which can be solved efficiently. Compared with other semi-supervised learning algorithms, the proposed method has linear time complexity. Extensive experiments on five benchmark datasets demonstrate the superior effectiveness, efficiency and robustness of the proposed algorithm.

Downloads

Published

2016-03-02

How to Cite

Yan, Y., Xu, Z., Tsang, I., Long, G., & Yang, Y. (2016). Robust Semi-Supervised Learning through Label Aggregation. Proceedings of the AAAI Conference on Artificial Intelligence, 30(1). https://doi.org/10.1609/aaai.v30i1.10276

Issue

Section

Technical Papers: Machine Learning Methods