Learning with Marginalized Corrupted Features and Labels Together

Yingming Li; Ming Yang; Zenglin Xu; Zhongfei Zhang

doi:10.1609/aaai.v30i1.10152

Authors

Yingming Li University of Electronic Science and Technology of China
Ming Yang State Universityof New York at Binghamton
Zenglin Xu University of Electronic Science and Technology of China
Zhongfei Zhang State Universityof New York at Binghamton

DOI:

https://doi.org/10.1609/aaai.v30i1.10152

Abstract

Tagging has become increasingly important in many real-world applications noticeably including web applications, such as web blogs and resource sharing systems. Despite this importance, tagging methods often face difficult challenges such as limited training samples and incomplete labels, which usually lead to degenerated performance on tag prediction. To improve the generalization performance, in this paper, we propose Regularized Marginalized Cross-View learning (RMCV) by jointly modeling on attribute noise and label noise. In more details, the proposed model constructs infinite training examples with attribute noises from known exponential-family distributions and exploits label noise via marginalized denoising autoencoder. Therefore, the model benefits from its robustness and alleviates the problem of tag sparsity. While RMCV is a general method for learning tagging, in the evaluations we focus on the specific application of multi-label text tagging. Extensive evaluations on three benchmark data sets demonstrate that RMCV outstands with a superior performance in comparison with state-of-the-art methods.

Learning with Marginalized Corrupted Features and Labels Together

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Developed By

Subscription