The doubly regularized support vector machine

Li Wang; Ji Zhu; Hui Zou

The doubly regularized support vector machine

Li Wang, Ji Zhu, Hui Zou

Statistics (Twin Cities)

Research output: Contribution to journal › Article › peer-review

206 Scopus citations

Abstract

The standard L ₂-norm support vector machine (SVM) is a widely used tool for classification problems. The L ₁-norm SVM is a variant of the standard Lanorm SVM, that constrains the Li-norm of the fitted coefficients. Due to the nature of the L ₁-norm, the L ₁-norm SVM has the property of automatically selecting variables, not shared by the standard L ₂-norm SVM. It has been argued that the L ₁-norm SVM may have some advantage over the L ₂-norm SVM, especially with high dimensional problems and when there are redundant noise variables. On the other hand, the L ₁-norm SVM has two drawbacks: (1) when there are several highly correlated variables, the L ₁-norm SVM tends to pick only a few of them, and remove the rest; (2) the number of selected variables is upper bounded by the size of the training data. A typical example where these occur is in gene microarray analysis. In this paper, we propose a doubly regularized support vector machine (DrSVM). The DrSVM uses the elastic-net penalty, a mixture of the L ₂-norm and the L ₁-norm penalties. By doing so, the DrSVM performs automatic variable selection in a way similar to the L ₁-norm SVM. In addition, the DrSVM encourages highly correlated variables to be selected (or removed) together. We illustrate how the DrSVM can be particularly useful when the number of variables is much larger than the size of the training data (p ≫ n). We also develop efficient algorithms to compute the whole solution paths of the DrSVM.

Original language	English (US)
Pages (from-to)	589-615
Number of pages	27
Journal	Statistica Sinica
Volume	16
Issue number	2
State	Published - Apr 1 2006

Keywords

Grouping effect
Quadratic programming
SVM
Variable selection
p ≫ n

OpenUrl availability

Full text

Cite this

@article{68aaee81e91841d3b823f44a856051f0,

title = "The doubly regularized support vector machine",

abstract = "The standard L 2-norm support vector machine (SVM) is a widely used tool for classification problems. The L 1-norm SVM is a variant of the standard Lanorm SVM, that constrains the Li-norm of the fitted coefficients. Due to the nature of the L 1-norm, the L 1-norm SVM has the property of automatically selecting variables, not shared by the standard L 2-norm SVM. It has been argued that the L 1-norm SVM may have some advantage over the L 2-norm SVM, especially with high dimensional problems and when there are redundant noise variables. On the other hand, the L 1-norm SVM has two drawbacks: (1) when there are several highly correlated variables, the L 1-norm SVM tends to pick only a few of them, and remove the rest; (2) the number of selected variables is upper bounded by the size of the training data. A typical example where these occur is in gene microarray analysis. In this paper, we propose a doubly regularized support vector machine (DrSVM). The DrSVM uses the elastic-net penalty, a mixture of the L 2-norm and the L 1-norm penalties. By doing so, the DrSVM performs automatic variable selection in a way similar to the L 1-norm SVM. In addition, the DrSVM encourages highly correlated variables to be selected (or removed) together. We illustrate how the DrSVM can be particularly useful when the number of variables is much larger than the size of the training data (p ≫ n). We also develop efficient algorithms to compute the whole solution paths of the DrSVM.",

keywords = "Grouping effect, Quadratic programming, SVM, Variable selection, p ≫ n",

author = "Li Wang and Ji Zhu and Hui Zou",

year = "2006",

month = apr,

day = "1",

language = "English (US)",

volume = "16",

pages = "589--615",

journal = "Statistica Sinica",

issn = "1017-0405",

publisher = "Institute of Statistical Science",

number = "2",

}

TY - JOUR

T1 - The doubly regularized support vector machine

AU - Wang, Li

AU - Zhu, Ji

AU - Zou, Hui

PY - 2006/4/1

Y1 - 2006/4/1

N2 - The standard L 2-norm support vector machine (SVM) is a widely used tool for classification problems. The L 1-norm SVM is a variant of the standard Lanorm SVM, that constrains the Li-norm of the fitted coefficients. Due to the nature of the L 1-norm, the L 1-norm SVM has the property of automatically selecting variables, not shared by the standard L 2-norm SVM. It has been argued that the L 1-norm SVM may have some advantage over the L 2-norm SVM, especially with high dimensional problems and when there are redundant noise variables. On the other hand, the L 1-norm SVM has two drawbacks: (1) when there are several highly correlated variables, the L 1-norm SVM tends to pick only a few of them, and remove the rest; (2) the number of selected variables is upper bounded by the size of the training data. A typical example where these occur is in gene microarray analysis. In this paper, we propose a doubly regularized support vector machine (DrSVM). The DrSVM uses the elastic-net penalty, a mixture of the L 2-norm and the L 1-norm penalties. By doing so, the DrSVM performs automatic variable selection in a way similar to the L 1-norm SVM. In addition, the DrSVM encourages highly correlated variables to be selected (or removed) together. We illustrate how the DrSVM can be particularly useful when the number of variables is much larger than the size of the training data (p ≫ n). We also develop efficient algorithms to compute the whole solution paths of the DrSVM.

AB - The standard L 2-norm support vector machine (SVM) is a widely used tool for classification problems. The L 1-norm SVM is a variant of the standard Lanorm SVM, that constrains the Li-norm of the fitted coefficients. Due to the nature of the L 1-norm, the L 1-norm SVM has the property of automatically selecting variables, not shared by the standard L 2-norm SVM. It has been argued that the L 1-norm SVM may have some advantage over the L 2-norm SVM, especially with high dimensional problems and when there are redundant noise variables. On the other hand, the L 1-norm SVM has two drawbacks: (1) when there are several highly correlated variables, the L 1-norm SVM tends to pick only a few of them, and remove the rest; (2) the number of selected variables is upper bounded by the size of the training data. A typical example where these occur is in gene microarray analysis. In this paper, we propose a doubly regularized support vector machine (DrSVM). The DrSVM uses the elastic-net penalty, a mixture of the L 2-norm and the L 1-norm penalties. By doing so, the DrSVM performs automatic variable selection in a way similar to the L 1-norm SVM. In addition, the DrSVM encourages highly correlated variables to be selected (or removed) together. We illustrate how the DrSVM can be particularly useful when the number of variables is much larger than the size of the training data (p ≫ n). We also develop efficient algorithms to compute the whole solution paths of the DrSVM.

KW - Grouping effect

KW - Quadratic programming

KW - SVM

KW - Variable selection

KW - p ≫ n

UR - http://www.scopus.com/inward/record.url?scp=33746154240&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=33746154240&partnerID=8YFLogxK

M3 - Article

AN - SCOPUS:33746154240

SN - 1017-0405

VL - 16

SP - 589

EP - 615

JO - Statistica Sinica

JF - Statistica Sinica

IS - 2

ER -

The doubly regularized support vector machine

Abstract

Keywords

OpenUrl availability

Other files and links

Fingerprint

Cite this