Semi-Supervised Multi-View Correlation Feature Learning with Application to Webpage Classification

Authors

  • Xiao-Yuan Jing Wuhan University; Nanjing University of Posts and Telecommunications
  • Fei Wu Nanjing University of Posts and Telecommunications
  • Xiwei Dong Nanjing University of Posts and Telecommunications
  • Shiguang Shan Chinese Academy of Sciences (CAS)
  • Songcan Chen Nanjing University of Aeronautics and Astronautics

DOI:

https://doi.org/10.1609/aaai.v31i1.10741

Abstract

Webpage classification has attracted a lot of research interest. Webpage data is often multi-view and high-dimensional, and the webpage classification application is usually semi-supervised. Due to these characteristics, using semi-supervised multi-view feature learning (SMFL) technique to deal with the webpage classification problem has recently received much attention. However, there still exists room for improvement for this kind of feature learning technique. How to effectively utilize the correlation information among multi-view of webpage data is an important research topic. Correlation analysis on multi-view data can facilitate extraction of the complementary information. In this paper, we propose a novel SMFL approach, named semi-supervised multi-view correlation feature learning (SMCFL), for webpage classification. SMCFL seeks for a discriminant common space by learning a multi-view shared transformation in a semi-supervised manner. In the discriminant space, the correlation between intra-class samples is maximized, and the correlation between inter-class samples and the global correlation among both labeled and unlabeled samples are minimized simultaneously. We transform the matrix-variable based nonconvex objective function of SMCFL into a convex quadratic programming problem with one real variable, and can achieve a global optimal solution. Experiments on widely used datasets demonstrate the effectiveness and efficiency of the proposed approach.

Downloads

Published

2017-02-12

How to Cite

Jing, X.-Y., Wu, F., Dong, X., Shan, S., & Chen, S. (2017). Semi-Supervised Multi-View Correlation Feature Learning with Application to Webpage Classification. Proceedings of the AAAI Conference on Artificial Intelligence, 31(1). https://doi.org/10.1609/aaai.v31i1.10741

Issue

Section

Main Track: Machine Learning Applications