Multiclass sparse discriminant analysis

Qing Mai, Yi Yang, Hui Zou

Research output: Contribution to journalArticlepeer-review

35 Scopus citations

Abstract

In recent years several sparse linear discriminant analysis methods have been proposed for high-dimensional classification and variable selection. Most of these proposals focus on binary classification and are not directly applicable to multiclass classification problems. Some sparse discriminant analysis methods can handle multiclass classification problems, but their theoretical justifications remain unknown. In this paper, we propose a new multiclass sparse discriminant analysis method that estimates all discriminant directions simultaneously. We show that when applied to the binary case our proposal yields a classification direction that is equivalent to those attained by two successful binary sparse linear discriminant analysis methods, providing a unification of these seemingly unrelated proposals. Our method can be solved by an efficient algorithm that is implemented in an open R package msda available from CRAN. We offer theoretical justification of our method by establishing a variable selection consistency result and finding rates of convergence under the ultrahigh dimensionality setting. We further demonstrate the empirical performance of our method with simulations and data.

Original languageEnglish (US)
Pages (from-to)97-111
Number of pages15
JournalStatistica Sinica
Volume29
Issue number1
DOIs
StatePublished - 2019

Bibliographical note

Publisher Copyright:
© 2019 Institute of Statistical Science. All rights reserved.

Keywords

  • Discriminant analysis
  • High dimensional data
  • Multiclass classification
  • Rates of convergence
  • Variable selection

Fingerprint

Dive into the research topics of 'Multiclass sparse discriminant analysis'. Together they form a unique fingerprint.

Cite this