Practical selection of SVM parameters and noise estimation for SVM regression

Vladimir Cherkassky; Yunqian Ma

doi:10.1016/S0893-6080(03)00169-2

Practical selection of SVM parameters and noise estimation for SVM regression

Vladimir Cherkassky, Yunqian Ma

Electrical and Computer Engineering

Research output: Contribution to journal › Article › peer-review

1796 Scopus citations

Abstract

We investigate practical selection of hyper-parameters for support vector machines (SVM) regression (that is, ε-insensitive zone and regularization parameter C). The proposed methodology advocates analytic parameter selection directly from the training data, rather than re-sampling approaches commonly used in SVM applications. In particular, we describe a new analytical prescription for setting the value of insensitive zone ε, as a function of training sample size. Good generalization performance of the proposed parameter selection is demonstrated empirically using several low- and high-dimensional regression problems. Further, we point out the importance of Vapnik's ε-insensitive loss for regression problems with finite samples. To this end, we compare generalization performance of SVM regression (using proposed selection of ε-values) with regression using 'least-modulus' loss (ε=0) and standard squared loss. These comparisons indicate superior generalization performance of SVM regression under sparse sample settings, for various types of additive noise.

Original language	English (US)
Pages (from-to)	113-126
Number of pages	14
Journal	Neural Networks
Volume	17
Issue number	1
DOIs	https://doi.org/10.1016/S0893-6080(03)00169-2
State	Published - Jan 2004

Bibliographical note

Funding Information:
The authors thank Dr V. Vapnik for many useful discussions. This work was supported, in part, by NSF grant ECS-0099906.

Keywords

Complexity control
Loss function
Parameter selection
Prediction accuracy
Support vector machine regression
VC theory

Access

10.1016/S0893-6080(03)00169-2

OpenUrl availability

Full text

Cite this

@article{518d9e8e92f8464796d34f5f2da6a45c,

title = "Practical selection of SVM parameters and noise estimation for SVM regression",

abstract = "We investigate practical selection of hyper-parameters for support vector machines (SVM) regression (that is, ε-insensitive zone and regularization parameter C). The proposed methodology advocates analytic parameter selection directly from the training data, rather than re-sampling approaches commonly used in SVM applications. In particular, we describe a new analytical prescription for setting the value of insensitive zone ε, as a function of training sample size. Good generalization performance of the proposed parameter selection is demonstrated empirically using several low- and high-dimensional regression problems. Further, we point out the importance of Vapnik's ε-insensitive loss for regression problems with finite samples. To this end, we compare generalization performance of SVM regression (using proposed selection of ε-values) with regression using 'least-modulus' loss (ε=0) and standard squared loss. These comparisons indicate superior generalization performance of SVM regression under sparse sample settings, for various types of additive noise.",

keywords = "Complexity control, Loss function, Parameter selection, Prediction accuracy, Support vector machine regression, VC theory",

author = "Vladimir Cherkassky and Yunqian Ma",

note = "Funding Information: The authors thank Dr V. Vapnik for many useful discussions. This work was supported, in part, by NSF grant ECS-0099906.",

year = "2004",

month = jan,

doi = "10.1016/S0893-6080(03)00169-2",

language = "English (US)",

volume = "17",

pages = "113--126",

journal = "Neural Networks",

issn = "0893-6080",

publisher = "Elsevier Limited",

number = "1",

}

TY - JOUR

T1 - Practical selection of SVM parameters and noise estimation for SVM regression

AU - Cherkassky, Vladimir

AU - Ma, Yunqian

N1 - Funding Information: The authors thank Dr V. Vapnik for many useful discussions. This work was supported, in part, by NSF grant ECS-0099906.

PY - 2004/1

Y1 - 2004/1

N2 - We investigate practical selection of hyper-parameters for support vector machines (SVM) regression (that is, ε-insensitive zone and regularization parameter C). The proposed methodology advocates analytic parameter selection directly from the training data, rather than re-sampling approaches commonly used in SVM applications. In particular, we describe a new analytical prescription for setting the value of insensitive zone ε, as a function of training sample size. Good generalization performance of the proposed parameter selection is demonstrated empirically using several low- and high-dimensional regression problems. Further, we point out the importance of Vapnik's ε-insensitive loss for regression problems with finite samples. To this end, we compare generalization performance of SVM regression (using proposed selection of ε-values) with regression using 'least-modulus' loss (ε=0) and standard squared loss. These comparisons indicate superior generalization performance of SVM regression under sparse sample settings, for various types of additive noise.

AB - We investigate practical selection of hyper-parameters for support vector machines (SVM) regression (that is, ε-insensitive zone and regularization parameter C). The proposed methodology advocates analytic parameter selection directly from the training data, rather than re-sampling approaches commonly used in SVM applications. In particular, we describe a new analytical prescription for setting the value of insensitive zone ε, as a function of training sample size. Good generalization performance of the proposed parameter selection is demonstrated empirically using several low- and high-dimensional regression problems. Further, we point out the importance of Vapnik's ε-insensitive loss for regression problems with finite samples. To this end, we compare generalization performance of SVM regression (using proposed selection of ε-values) with regression using 'least-modulus' loss (ε=0) and standard squared loss. These comparisons indicate superior generalization performance of SVM regression under sparse sample settings, for various types of additive noise.

KW - Complexity control

KW - Loss function

KW - Parameter selection

KW - Prediction accuracy

KW - Support vector machine regression

KW - VC theory

UR - http://www.scopus.com/inward/record.url?scp=0346250790&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0346250790&partnerID=8YFLogxK

U2 - 10.1016/S0893-6080(03)00169-2

DO - 10.1016/S0893-6080(03)00169-2

M3 - Article

C2 - 14690712

AN - SCOPUS:0346250790

SN - 0893-6080

VL - 17

SP - 113

EP - 126

JO - Neural Networks

JF - Neural Networks

IS - 1

ER -

Practical selection of SVM parameters and noise estimation for SVM regression

Abstract

Bibliographical note

Keywords

Access

OpenUrl availability

Other files and links

Fingerprint

Cite this