Feature selection for unsupervised learning through local learning

Jin Yao, Qi Mao, Steve Goodison, Volker Mai, Yijun Sun

Research output: Contribution to journalArticle

20 Scopus citations

Abstract

We consider the problem of feature selection for unsupervised learning and develop a new algorithm capable of identifying informative features supporting complex structures embedded in a high-dimensional space. The development of the algorithm is inspired by human learning in detecting complex data structures. We formulate it as an optimization problem with a well-defined objective function, and solve the problem by using an iterative approach. The algorithm can be easily implemented and is computationally very efficient. We use gap statistics to estimate the parameters so that the proposed method is completely parameter-free. We also develop a scheme based on permutation tests to estimate the statistical significance of the presence of a data structure. We demonstrate the effectiveness and versatility of the algorithm by comparing it with seven existing methods on a set of synthetic datasets with a wide variety of structures and cancer microarray gene expression datasets.

Original languageEnglish (US)
Pages (from-to)100-107
Number of pages8
JournalPattern Recognition Letters
Volume53
DOIs
StatePublished - Feb 1 2015

Keywords

  • Clustering
  • Feature selection
  • Manifold learning
  • Unsupervised learning

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Computer Vision and Pattern Recognition
  • Artificial Intelligence

Fingerprint Dive into the research topics of 'Feature selection for unsupervised learning through local learning'. Together they form a unique fingerprint.

  • Cite this