On efficient large margin semisupervised learning: Method and theory

Junhui Wang; Xiaotong Shen; Wei Pan

On efficient large margin semisupervised learning: Method and theory

Research output: Contribution to journal › Article › peer-review

Abstract

In classification, semisupervised learning usually involves a large amount of unlabeled data with only a small number of labeled data. This imposes a great challenge in that it is difficult to achieve good classification performance through labeled data alone. To leverage unlabeled data for enhancing classification, this article introduces a large margin semisupervised learning method within the framework of regularization, based on an efficient margin loss for unlabeled data, which seeks efficient extraction of the information from unlabeled data for estimating the Bayes decision boundary for classification. For implementation, an iterative scheme is derived through conditional expectations. Finally, theoretical and numerical analyses are conducted, in addition to an application to gene function prediction. They suggest that the proposed method enables to recover the performance of its supervised counterpart based on complete data in rates of convergence, when possible.

Original language	English (US)
Pages (from-to)	719-742
Number of pages	24
Journal	Journal of Machine Learning Research
Volume	10
State	Published - Jan 2009

Keywords

Classification
Difference convex programming
Nonconvex minimization
Regulanzation
Support vectors

OpenUrl availability

Full text

Cite this

@article{3c156f0399d14e8394755afb655aa8b8,

title = "On efficient large margin semisupervised learning: Method and theory",

abstract = "In classification, semisupervised learning usually involves a large amount of unlabeled data with only a small number of labeled data. This imposes a great challenge in that it is difficult to achieve good classification performance through labeled data alone. To leverage unlabeled data for enhancing classification, this article introduces a large margin semisupervised learning method within the framework of regularization, based on an efficient margin loss for unlabeled data, which seeks efficient extraction of the information from unlabeled data for estimating the Bayes decision boundary for classification. For implementation, an iterative scheme is derived through conditional expectations. Finally, theoretical and numerical analyses are conducted, in addition to an application to gene function prediction. They suggest that the proposed method enables to recover the performance of its supervised counterpart based on complete data in rates of convergence, when possible.",

keywords = "Classification, Difference convex programming, Nonconvex minimization, Regulanzation, Support vectors",

author = "Junhui Wang and Xiaotong Shen and Wei Pan",

year = "2009",

month = jan,

language = "English (US)",

volume = "10",

pages = "719--742",

journal = "Journal of Machine Learning Research",

issn = "1532-4435",

publisher = "Microtome Publishing",

}

TY - JOUR

T1 - On efficient large margin semisupervised learning

T2 - Method and theory

AU - Wang, Junhui

AU - Shen, Xiaotong

AU - Pan, Wei

PY - 2009/1

Y1 - 2009/1

N2 - In classification, semisupervised learning usually involves a large amount of unlabeled data with only a small number of labeled data. This imposes a great challenge in that it is difficult to achieve good classification performance through labeled data alone. To leverage unlabeled data for enhancing classification, this article introduces a large margin semisupervised learning method within the framework of regularization, based on an efficient margin loss for unlabeled data, which seeks efficient extraction of the information from unlabeled data for estimating the Bayes decision boundary for classification. For implementation, an iterative scheme is derived through conditional expectations. Finally, theoretical and numerical analyses are conducted, in addition to an application to gene function prediction. They suggest that the proposed method enables to recover the performance of its supervised counterpart based on complete data in rates of convergence, when possible.

AB - In classification, semisupervised learning usually involves a large amount of unlabeled data with only a small number of labeled data. This imposes a great challenge in that it is difficult to achieve good classification performance through labeled data alone. To leverage unlabeled data for enhancing classification, this article introduces a large margin semisupervised learning method within the framework of regularization, based on an efficient margin loss for unlabeled data, which seeks efficient extraction of the information from unlabeled data for estimating the Bayes decision boundary for classification. For implementation, an iterative scheme is derived through conditional expectations. Finally, theoretical and numerical analyses are conducted, in addition to an application to gene function prediction. They suggest that the proposed method enables to recover the performance of its supervised counterpart based on complete data in rates of convergence, when possible.

KW - Classification

KW - Difference convex programming

KW - Nonconvex minimization

KW - Regulanzation

KW - Support vectors

UR - http://www.scopus.com/inward/record.url?scp=64149104410&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=64149104410&partnerID=8YFLogxK

M3 - Article

AN - SCOPUS:64149104410

SN - 1532-4435

VL - 10

SP - 719

EP - 742

JO - Journal of Machine Learning Research

JF - Journal of Machine Learning Research

ER -

On efficient large margin semisupervised learning: Method and theory

Abstract

Keywords

OpenUrl availability

Other files and links

Fingerprint

Cite this