Motivated by various applications in machine learning, the problem of minimizing a convex smooth loss function with trace norm regularization has received much attention lately. Currently, a popular method for solving such problems is the proximal gradient method (PGM), which is known to have a sublinear rate of convergence. In this paper, we show that for a large class of loss functions, the convergence rate of the PGM is in fact linear. Our result is established without any strong convexity assumption on the loss function. A key ingredient in our proof is a new Lipschitzian error bound for the aforementioned trace norm-regularized problem, which may be of independent interest.
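To make the setting concrete, below is a minimal sketch of the PGM for a trace norm-regularized problem min_X f(X) + λ‖X‖_*, where the proximal operator of the trace norm is singular value soft-thresholding. The names (`grad_f`, `prox_trace_norm`), the step-size choice, and the denoising example are illustrative assumptions, not the paper's own notation or experiments.

```python
import numpy as np

def prox_trace_norm(X, tau):
    """Proximal operator of tau * ||.||_* : soft-threshold the singular values."""
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    return U @ np.diag(np.maximum(s - tau, 0.0)) @ Vt

def proximal_gradient(grad_f, X0, lam, step, n_iters=500):
    """PGM iteration: X_{k+1} = prox_{step*lam*||.||_*}(X_k - step * grad_f(X_k)).

    `step` should be at most 1/L, where L is the Lipschitz constant of grad_f.
    """
    X = X0.copy()
    for _ in range(n_iters):
        X = prox_trace_norm(X - step * grad_f(X), step * lam)
    return X

# Hypothetical usage: matrix denoising with f(X) = 0.5 * ||X - M||_F^2,
# so grad_f(X) = X - M and L = 1.
if __name__ == "__main__":
    rng = np.random.default_rng(0)
    M = rng.standard_normal((20, 5)) @ rng.standard_normal((5, 20))  # low-rank signal
    M += 0.1 * rng.standard_normal((20, 20))                         # additive noise
    X_hat = proximal_gradient(lambda X: X - M, np.zeros_like(M), lam=1.0, step=1.0)
    print("rank of solution:", np.linalg.matrix_rank(X_hat, tol=1e-6))
```

The soft-thresholding in `prox_trace_norm` is what drives the iterates toward low-rank solutions, which is the typical reason trace norm regularization is used in the applications the abstract alludes to.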
Original language: English (US)
Journal: Advances in Neural Information Processing Systems
State: Published - Jan 1 2013
Event: 27th Annual Conference on Neural Information Processing Systems, NIPS 2013 - Lake Tahoe, NV, United States (Dec 5 2013 - Dec 10 2013)