Feature Selection at the Discrete Limit

Authors

  • Miao Zhang, University of Texas at Arlington
  • Chris Ding, University of Texas at Arlington
  • Ya Zhang, Shanghai Jiao Tong University
  • Feiping Nie, University of Texas at Arlington

DOI:

https://doi.org/10.1609/aaai.v28i1.8919

Keywords:

Feature selection, sparse limit

Abstract

Feature selection plays an important role in many machine learning and data mining applications. In this paper, we propose to use the L2,p-norm for feature selection, with emphasis on small p. As p approaches 0, feature selection becomes a discrete feature selection problem. We provide two algorithms, a proximal gradient algorithm and a rank-one update algorithm, the latter of which is more efficient at large regularization. We give closed-form solutions of the proximal operator at p = 0 and p = 1/2. Experiments on real-life datasets show that features selected at small p consistently outperform features selected at p = 1 (the standard L2,1 approach) and by other popular feature selection methods.
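To make the abstract concrete, below is a minimal sketch (not the authors' code) of the row-wise proximal operators that arise in L2,p-regularized feature selection. For a weight matrix W with rows w_i, the regularizer sum_i ||w_i||_2^p makes the proximal step separable across rows. The sketch covers the p = 0 hard threshold (one of the closed forms the paper reports) and the p = 1 group soft threshold of the standard L2,1 baseline; the paper's p = 1/2 half-thresholding closed form is omitted here. All function names, the least-squares loss, and the step size are illustrative assumptions.

```python
import numpy as np

def prox_l20_row(v, lam):
    """Prox of lam * 1[w != 0] for one row: hard threshold on the row norm."""
    # Keeping the row costs lam; zeroing it costs ||v||^2 / 2,
    # so the row survives only when its energy exceeds 2*lam.
    return v.copy() if 0.5 * np.dot(v, v) > lam else np.zeros_like(v)

def prox_l21_row(v, lam):
    """Prox of lam * ||w||_2 for one row: group soft threshold (p = 1)."""
    norm = np.linalg.norm(v)
    scale = max(0.0, 1.0 - lam / norm) if norm > 0.0 else 0.0
    return scale * v

def prox_rows(W, lam, p):
    """Apply the row-wise proximal operator to every row of W."""
    prox = prox_l20_row if p == 0 else prox_l21_row
    return np.vstack([prox(w, lam) for w in W])

def prox_grad_step(W, X, Y, lam, step, p):
    """One proximal-gradient step on f(W) = 0.5 * ||XW - Y||_F^2 (assumed loss):
    gradient step on f, then the row-wise prox of the L2,p regularizer."""
    grad = X.T @ (X @ W - Y)
    return prox_rows(W - step * grad, step * lam, p)
```

After iterating such steps, the selected features are simply the rows of W with nonzero norm; at p = 0 the prox zeroes rows outright, which is what makes the small-p regime behave like discrete feature selection.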

Published

2014-06-21

How to Cite

Zhang, M., Ding, C., Zhang, Y., & Nie, F. (2014). Feature Selection at the Discrete Limit. Proceedings of the AAAI Conference on Artificial Intelligence, 28(1). https://doi.org/10.1609/aaai.v28i1.8919

Issue

Vol. 28 No. 1 (2014)

Section

Main Track: Machine Learning Applications