Learning with Marginalized Corrupted Features and Labels Together

Authors

  • Yingming Li University of Electronic Science and Technology of China
  • Ming Yang State Universityof New York at Binghamton
  • Zenglin Xu University of Electronic Science and Technology of China
  • Zhongfei Zhang State Universityof New York at Binghamton

DOI:

https://doi.org/10.1609/aaai.v30i1.10152

Abstract

Tagging has become increasingly important in many real-world applications noticeably including web applications, such as web blogs and resource sharing systems. Despite this importance, tagging methods often face difficult challenges such as limited training samples and incomplete labels, which usually lead to degenerated performance on tag prediction. To improve the generalization performance, in this paper, we propose Regularized Marginalized Cross-View learning (RMCV) by jointly modeling on attribute noise and label noise. In more details, the proposed model constructs infinite training examples with attribute noises from known exponential-family distributions and exploits label noise via marginalized denoising autoencoder. Therefore, the model benefits from its robustness and alleviates the problem of tag sparsity. While RMCV is a general method for learning tagging, in the evaluations we focus on the specific application of multi-label text tagging. Extensive evaluations on three benchmark data sets demonstrate that RMCV outstands with a superior performance in comparison with state-of-the-art methods.

Downloads

Published

2016-02-21

How to Cite

Li, Y., Yang, M., Xu, Z., & Zhang, Z. (2016). Learning with Marginalized Corrupted Features and Labels Together. Proceedings of the AAAI Conference on Artificial Intelligence, 30(1). https://doi.org/10.1609/aaai.v30i1.10152

Issue

Section

Technical Papers: Machine Learning Applications