A direct approach to sparse discriminant analysis in ultra-high dimensions

Qing Mai; Hui Zou; Ming Yuan

doi:10.1093/biomet/asr066

Statistics (Twin Cities)

Research output: Contribution to journal › Article › peer-review

140 Scopus citations

Abstract

Sparse discriminant methods based on independence rules, such as the nearest shrunken centroids classifier (Tibshirani et al., 2002) and features annealed independence rules (Fan & Fan, 2008), have been proposed as computationally attractive tools for feature selection and classification with high-dimensional data. A fundamental drawback of these rules is that they ignore correlations among features and thus could produce misleading feature selection and inferior classification. We propose a new procedure for sparse discriminant analysis, motivated by the least squares formulation of linear discriminant analysis. To demonstrate our proposal, we study the numerical and theoretical properties of discriminant analysis constructed via lasso penalized least squares. Our theory shows that the method proposed can consistently identify the subset of discriminative features contributing to the Bayes rule and at the same time consistently estimate the Bayes classification direction, even when the dimension can grow faster than any polynomial order of the sample size. The theory allows for general dependence among features. Simulated and real data examples show that lassoed discriminant analysis compares favourably with other popular sparse discriminant proposals.
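The abstract's central idea, a discriminant direction obtained from lasso penalized least squares on a coded class label, can be sketched in a few lines. This is a minimal illustration and not the authors' implementation. The following are assumptions not stated in the abstract: the response coding (-n/n1 for one class, n/n2 for the other), the cyclic coordinate descent solver, the penalty scaling, and the hypothetical names `soft_threshold`, `lasso_discriminant_direction`, and `lam`.

```python
import numpy as np


def soft_threshold(z, t):
    """Soft-thresholding operator used in lasso coordinate descent."""
    return np.sign(z) * max(abs(z) - t, 0.0)


def lasso_discriminant_direction(X, labels, lam, n_iter=200):
    """Sparse discriminant direction via lasso penalized least squares.

    Minimizes (1 / (2n)) * ||y - b0 - X b||^2 + lam * ||b||_1 by cyclic
    coordinate descent, where y is a numeric coding of the two classes.
    Returns the coefficient vector and the feature means used for centering.
    """
    X = np.asarray(X, dtype=float)
    labels = np.asarray(labels)
    n, p = X.shape
    n1 = np.sum(labels == 0)
    n2 = n - n1
    # Assumed coding: -n/n1 for class 0 and +n/n2 for class 1 (sums to zero).
    y = np.where(labels == 0, -n / n1, n / n2)

    # Centering X and y removes the intercept from the optimization.
    x_mean = X.mean(axis=0)
    Xc = X - x_mean
    yc = y - y.mean()

    col_sq = (Xc ** 2).sum(axis=0) / n  # per-feature curvature
    beta = np.zeros(p)
    resid = yc.copy()                   # residual y - X beta, kept up to date

    for _ in range(n_iter):
        for j in range(p):
            if col_sq[j] == 0.0:
                continue                # constant feature carries no signal
            # Covariance of feature j with the partial residual (beta_j removed).
            rho = Xc[:, j] @ resid / n + col_sq[j] * beta[j]
            new_bj = soft_threshold(rho, lam) / col_sq[j]
            resid -= Xc[:, j] * (new_bj - beta[j])
            beta[j] = new_bj
    return beta, x_mean
```

Features with a nonzero entry in `beta` are the selected discriminative features, and `(X_new - x_mean) @ beta` gives a one-dimensional score for new observations. How that score is thresholded into a class label, and how `lam` is chosen, are left open here because the abstract does not specify them.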

Original language	English (US)
Pages (from-to)	29-42
Number of pages	14
Journal	Biometrika
Volume	99
Issue number	1
DOIs	https://doi.org/10.1093/biomet/asr066
State	Published - Mar 2012

Bibliographical note

Funding Information:
The authors thank the editor, associate editor and referees for their helpful comments and suggestions. Mai is supported by an Alumni Fellowship from the School of Statistics at the University of Minnesota. Zou and Yuan are supported by the National Science Foundation, U.S.A.

Keywords

Discriminant analysis
Features annealed independence rule
Lasso
Nearest shrunken centroids classifier
Nonpolynomial-dimension asymptotics

Cite this

@article{94cbe6b295134b5d8b0c79b603cdbd95,

title = "A direct approach to sparse discriminant analysis in ultra-high dimensions",

abstract = "Sparse discriminant methods based on independence rules, such as the nearest shrunken centroids classifier (Tibshirani et al., 2002) and features annealed independence rules (Fan \& Fan, 2008), have been proposed as computationally attractive tools for feature selection and classification with high-dimensional data. A fundamental drawback of these rules is that they ignore correlations among features and thus could produce misleading feature selection and inferior classification. We propose a new procedure for sparse discriminant analysis, motivated by the least squares formulation of linear discriminant analysis. To demonstrate our proposal, we study the numerical and theoretical properties of discriminant analysis constructed via lasso penalized least squares. Our theory shows that the method proposed can consistently identify the subset of discriminative features contributing to the Bayes rule and at the same time consistently estimate the Bayes classification direction, even when the dimension can grow faster than any polynomial order of the sample size. The theory allows for general dependence among features. Simulated and real data examples show that lassoed discriminant analysis compares favourably with other popular sparse discriminant proposals.",

keywords = "Discriminant analysis, Features annealed independence rule, Lasso, Nearest shrunken centroids classifier, Nonpolynomial-dimension asymptotics",

author = "Qing Mai and Hui Zou and Ming Yuan",

note = "Funding Information: The authors thank the editor, associate editor and referees for their helpful comments and suggestions. Mai is supported by an Alumni Fellowship from the School of Statistics at the University of Minnesota. Zou and Yuan are supported by the National Science Foundation, U.S.A.",

year = "2012",

month = mar,

doi = "10.1093/biomet/asr066",

language = "English (US)",

volume = "99",

pages = "29--42",

journal = "Biometrika",

issn = "0006-3444",

publisher = "Oxford University Press",

number = "1",

}

TY - JOUR

T1 - A direct approach to sparse discriminant analysis in ultra-high dimensions

AU - Mai, Qing

AU - Zou, Hui

AU - Yuan, Ming

N1 - Funding Information: The authors thank the editor, associate editor and referees for their helpful comments and suggestions. Mai is supported by an Alumni Fellowship from the School of Statistics at the University of Minnesota. Zou and Yuan are supported by the National Science Foundation, U.S.A.

PY - 2012/3

Y1 - 2012/3

N2 - Sparse discriminant methods based on independence rules, such as the nearest shrunken centroids classifier (Tibshirani et al., 2002) and features annealed independence rules (Fan & Fan, 2008), have been proposed as computationally attractive tools for feature selection and classification with high-dimensional data. A fundamental drawback of these rules is that they ignore correlations among features and thus could produce misleading feature selection and inferior classification. We propose a new procedure for sparse discriminant analysis, motivated by the least squares formulation of linear discriminant analysis. To demonstrate our proposal, we study the numerical and theoretical properties of discriminant analysis constructed via lasso penalized least squares. Our theory shows that the method proposed can consistently identify the subset of discriminative features contributing to the Bayes rule and at the same time consistently estimate the Bayes classification direction, even when the dimension can grow faster than any polynomial order of the sample size. The theory allows for general dependence among features. Simulated and real data examples show that lassoed discriminant analysis compares favourably with other popular sparse discriminant proposals.

AB - Sparse discriminant methods based on independence rules, such as the nearest shrunken centroids classifier (Tibshirani et al., 2002) and features annealed independence rules (Fan & Fan, 2008), have been proposed as computationally attractive tools for feature selection and classification with high-dimensional data. A fundamental drawback of these rules is that they ignore correlations among features and thus could produce misleading feature selection and inferior classification. We propose a new procedure for sparse discriminant analysis, motivated by the least squares formulation of linear discriminant analysis. To demonstrate our proposal, we study the numerical and theoretical properties of discriminant analysis constructed via lasso penalized least squares. Our theory shows that the method proposed can consistently identify the subset of discriminative features contributing to the Bayes rule and at the same time consistently estimate the Bayes classification direction, even when the dimension can grow faster than any polynomial order of the sample size. The theory allows for general dependence among features. Simulated and real data examples show that lassoed discriminant analysis compares favourably with other popular sparse discriminant proposals.

KW - Discriminant analysis

KW - Features annealed independence rule

KW - Lasso

KW - Nearest shrunken centroids classifier

KW - Nonpolynomial-dimension asymptotics

UR - http://www.scopus.com/inward/record.url?scp=84857556385&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84857556385&partnerID=8YFLogxK

U2 - 10.1093/biomet/asr066

DO - 10.1093/biomet/asr066

M3 - Article

AN - SCOPUS:84857556385

SN - 0006-3444

VL - 99

SP - 29

EP - 42

JO - Biometrika

JF - Biometrika

IS - 1

ER -
