Unsupervised Feature Selection by Pareto Optimization

Authors

  • Chao Feng University of Science and Technology of China
  • Chao Qian University of Science and Technology of China
  • Ke Tang Southern University of Science and Technology

DOI:

https://doi.org/10.1609/aaai.v33i01.33013534

Abstract

Dimensionality reduction is often employed to deal with the data with a huge number of features, which can be generally divided into two categories: feature transformation and feature selection. Due to the interpretability, the efficiency during inference and the abundance of unlabeled data, unsupervised feature selection has attracted much attention. In this paper, we consider its natural formulation, column subset selection (CSS), which is to minimize the reconstruction error of a data matrix by selecting a subset of features. We propose an anytime randomized iterative approach POCSS, which minimizes the reconstruction error and the number of selected features simultaneously. Its approximation guarantee is well bounded. Empirical results exhibit the superior performance of POCSS over the state-of-the-art algorithms.

Downloads

Published

2019-07-17

How to Cite

Feng, C., Qian, C., & Tang, K. (2019). Unsupervised Feature Selection by Pareto Optimization. Proceedings of the AAAI Conference on Artificial Intelligence, 33(01), 3534-3541. https://doi.org/10.1609/aaai.v33i01.33013534

Issue

Section

AAAI Technical Track: Machine Learning