Semi-Supervised Matrix Completion for Cross-Lingual Text Classification


  • Min Xiao Temple University
  • Yuhong Guo Temple University



cross-lingual text classification


Cross-lingual text classification is the task of assigning labels to observed documents in a label-scarce target language domain by using a prediction model trained with labeled documents from a label-rich source language domain. Cross-lingual text classification is popularly studied in natural language processing area to reduce the expensive manual annotation effort required in the target language domain. In this work, we propose a novel semi-supervised representation learning approach to address this challenging task by inducing interlingual features via semi-supervised matrix completion. To evaluate the proposed learning technique, we conduct extensive experiments on eighteen cross language sentiment classification tasks with four different languages. The empirical results demonstrate the efficacy of the proposed approach, and show it outperforms a number of related cross-lingual learning methods.




How to Cite

Xiao, M., & Guo, Y. (2014). Semi-Supervised Matrix Completion for Cross-Lingual Text Classification. Proceedings of the AAAI Conference on Artificial Intelligence, 28(1).