Abstract
Linear discrimination, from the point of view of numerical linear algebra, can be treated as solving an ill-posed system of linear equations. To produce a solution that is robust in the presence of noise, such systems require regularization. Here, we examine the ill-posedness involved in the linear discrimination of cancer gene expression data with respect to outcome and tumor subclasses. We show that a filter factor representation, based on the singular value decomposition, yields insight into the numerical ill-posedness of hyperplane-based separation when applied to gene expression data. We also show that this representation yields useful diagnostic tools for guiding the selection of classifier parameters, leading to improved performance.
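To make the filter factor idea concrete, the following is a minimal sketch and not the paper's implementation. It assumes Tikhonov (ridge) regularization, for which the standard filter factors are f_i = s_i^2 / (s_i^2 + lam^2), and it uses synthetic data in place of gene expression measurements. The function name `filter_factor_solution`, the parameter `lam`, and the data dimensions are illustrative assumptions.

```python
import numpy as np

def filter_factor_solution(X, y, lam):
    """Regularized least-squares weights written in SVD filter-factor form.

    Assumes Tikhonov regularization; other regularizers (e.g. truncated SVD)
    would use different filter factors.
    """
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    # Filter factors: close to 1 for large singular values, close to 0 for small ones.
    f = s**2 / (s**2 + lam**2)
    # Damped expansion coefficients f_i * (u_i^T y) / s_i, skipping zero singular values.
    beta = U.T @ y
    coeffs = np.zeros_like(s)
    nonzero = s > 0
    coeffs[nonzero] = f[nonzero] * beta[nonzero] / s[nonzero]
    w = Vt.T @ coeffs
    return w, f, s

# Synthetic stand-in for expression data: few samples, many features (hypothetical sizes).
rng = np.random.default_rng(0)
n_samples, n_genes = 20, 200
X = rng.standard_normal((n_samples, n_genes))
true_w = np.zeros(n_genes)
true_w[:5] = 1.0
y = np.sign(X @ true_w + 0.1 * rng.standard_normal(n_samples))  # class labels in {-1, +1}

lam = 1.0
w, f, s = filter_factor_solution(X, y, lam)
predicted_labels = np.sign(X @ w)  # side of the separating hyperplane for each sample
```

Inspecting `f` against `s` shows which singular directions the regularizer keeps and which it suppresses, which is one way such a representation can inform the choice of a regularization parameter like `lam`.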
Original language | English (US) |
---|---|
Pages (from-to) | 79-104 |
Number of pages | 26 |
Journal | Journal of Bioinformatics and Computational Biology |
Volume | 5 |
Issue number | 1 |
State | Published - Feb 2007 |
Keywords
- Cancer classification
- Gene expression
- Least squares
- Regression
- Regularization
- Singular value decomposition
ASJC Scopus subject areas
- Biochemistry
- Molecular Biology
- Computer Science Applications