Variable selection for support vector machines in moderately high dimensions

Xiang Zhang, Yichao Wu, Lan Wang, Runze Li

Research output: Contribution to journal › Article › peer-review


Abstract

The support vector machine (SVM) is a powerful binary classification tool with high accuracy and great flexibility. It has achieved great success, but its performance can be seriously impaired if many redundant covariates are included. Some efforts have been devoted to studying variable selection for SVMs, but asymptotic properties, such as variable selection consistency, are largely unknown when the number of predictors diverges to ∞. We establish a unified theory for a general class of non-convex penalized SVMs. We first prove that, in ultrahigh dimensions, the objective function of non-convex penalized SVMs admits a local minimizer that possesses the desired oracle property. We further address the problem of non-unique local minimizers by showing that the local linear approximation algorithm is guaranteed to converge to the oracle estimator even in the ultrahigh dimensional setting, provided that an appropriate initial estimator is available. This condition on the initial estimator is verified to hold automatically as long as the dimensions are moderately high. Numerical examples provide supportive evidence.
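The algorithmic idea described in the abstract is the local linear approximation (LLA): at each step the non-convex penalty is replaced by a weighted L1 penalty whose weights come from the penalty's derivative at the current iterate, so every step reduces to a convex, weighted-L1-penalized SVM. The sketch below is not from the paper; it is a minimal illustration for a SCAD-penalized linear SVM, assuming the cvxpy library as the convex solver, the conventional SCAD parameter a = 3.7, and a zero start (so the first iteration is the lasso-penalized SVM).

    import numpy as np
    import cvxpy as cp

    def scad_derivative(beta_abs, lam, a=3.7):
        # SCAD penalty derivative (Fan & Li, 2001): equals lam for small
        # coefficients, decays linearly, and vanishes beyond a * lam.
        return np.where(
            beta_abs <= lam,
            lam,
            np.maximum(a * lam - beta_abs, 0.0) / (a - 1.0),
        )

    def lla_penalized_svm(X, y, lam, n_iter=3, a=3.7):
        # Illustrative LLA loop for a SCAD-penalized linear SVM; y in {-1, +1}.
        # Each iteration solves a convex weighted-L1 hinge-loss problem.
        n, p = X.shape
        beta_curr = np.zeros(p)  # zero start => iteration 1 is the lasso SVM
        b_curr = 0.0
        for _ in range(n_iter):
            w = scad_derivative(np.abs(beta_curr), lam, a)  # per-coefficient weights
            beta = cp.Variable(p)
            b = cp.Variable()
            hinge = cp.sum(cp.pos(1 - cp.multiply(y, X @ beta + b))) / n
            penalty = cp.sum(cp.multiply(w, cp.abs(beta)))
            cp.Problem(cp.Minimize(hinge + penalty)).solve()
            beta_curr, b_curr = beta.value, b.value
        return beta_curr, b_curr

Because the SCAD derivative is zero for large coefficients, the weighted L1 penalty leaves strong signals unpenalized after the first step, which is the mechanism behind the oracle property the paper establishes; the paper's theory additionally requires a suitable initial estimator, whereas this sketch simply starts from zero.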

Original language: English (US)
Pages (from-to): 53-76
Number of pages: 24
Journal: Journal of the Royal Statistical Society. Series B: Statistical Methodology
Volume: 78
Issue number: 1
DOIs
State: Published - 2016

Bibliographical note

Publisher Copyright:
© 2015 Royal Statistical Society

Keywords

  • Local linear approximation
  • Non-convex penalty
  • Oracle property
  • Support vector machines
  • Ultrahigh dimensions
  • Variable selection

