An improved convergence analysis of cyclic block coordinate descent-type methods for strongly convex minimization

Xingguo Li; Tuo Zhao; Raman Arora; Han Liu; Mingyi Hong

Electrical and Computer Engineering

Research output: Contribution to conference › Paper › peer-review

9 Scopus citations

Abstract

The cyclic block coordinate descent-type (CBCD-type) methods have shown remarkable computational performance for solving strongly convex minimization problems. Typical applications include many popular statistical machine learning methods such as elastic-net regression, ridge penalized logistic regression, and sparse additive regression. Existing optimization literature has shown that the CBCD-type methods attain an iteration complexity of O(p · log(1/ϵ)), where ϵ is a pre-specified accuracy of the objective value and p is the number of blocks. However, this iteration complexity depends explicitly on p, and is therefore at least p times worse than that of gradient descent (GD) methods. To bridge this theoretical gap, we propose an improved convergence analysis for the CBCD-type methods. In particular, we first show that for a family of quadratic minimization problems, the iteration complexity of the CBCD-type methods matches that of the GD methods in terms of dependency on p (up to a log² p factor). Thus our complexity bounds are sharper than the existing bounds by at least a factor of p/log² p. We also provide a lower bound to confirm that our improved complexity bounds are tight (up to a log² p factor) if the largest and smallest eigenvalues of the Hessian matrix do not scale with p. Finally, we generalize our analysis to other strongly convex minimization problems beyond quadratic ones.
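For readers unfamiliar with the method class, the following is a minimal sketch of cyclic coordinate descent on a strongly convex quadratic, f(x) = ½ xᵀAx − bᵀx with A symmetric positive definite. It is an illustration only, not the authors' implementation: it assumes blocks of size one and exact minimization along each coordinate, and the function names, the matrix `A`, and the vector `b` are hypothetical choices made for this example.

```python
def cyclic_coordinate_descent(A, b, num_passes=100, tol=1e-12):
    """Minimize f(x) = 0.5 * x^T A x - b^T x by exact cyclic coordinate updates.

    Assumes A is symmetric positive definite (so f is strongly convex)
    and treats each coordinate as its own block.
    Returns the final iterate and the number of full passes performed.
    """
    p = len(b)
    x = [0.0] * p
    for sweep in range(num_passes):
        max_change = 0.0
        for j in range(p):  # fixed cyclic order 0, 1, ..., p-1
            # Setting the j-th partial derivative to zero gives
            # A[j][j] * x[j] = b[j] - sum_{k != j} A[j][k] * x[k].
            rhs = b[j] - sum(A[j][k] * x[k] for k in range(p) if k != j)
            new_xj = rhs / A[j][j]
            max_change = max(max_change, abs(new_xj - x[j]))
            x[j] = new_xj
        if max_change < tol:  # stop once a full pass barely moves x
            return x, sweep + 1
    return x, num_passes


def objective(A, b, x):
    """Evaluate f(x) = 0.5 * x^T A x - b^T x."""
    p = len(b)
    quad = sum(x[i] * A[i][j] * x[j] for i in range(p) for j in range(p))
    return 0.5 * quad - sum(b[i] * x[i] for i in range(p))


# Hypothetical 3-by-3 symmetric positive definite problem.
A = [[4.0, 1.0, 0.0],
     [1.0, 3.0, 1.0],
     [0.0, 1.0, 2.0]]
b = [1.0, 2.0, 3.0]

x_hat, sweeps = cyclic_coordinate_descent(A, b)
print("passes:", sweeps)
print("x_hat:", x_hat)
print("objective:", objective(A, b, x_hat))
```

Each pass over all p coordinates counts as one iteration in the sense used by the abstract, and the minimizer satisfies Ax = b, which gives a simple way to check the result.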

Original language	English (US)
Pages	491-499
Number of pages	9
State	Published - 2016
Event	19th International Conference on Artificial Intelligence and Statistics, AISTATS 2016 - Cadiz, Spain
Duration	May 9 2016 → May 11 2016

Conference

Conference	19th International Conference on Artificial Intelligence and Statistics, AISTATS 2016
Country/Territory	Spain
City	Cadiz
Period	5/9/16 → 5/11/16

Bibliographical note

Funding Information:
This research is supported by NSF DMS1454377-CAREER; NSF IIS 1546482-BIGDATA; NIH R01MH102339; NSF IIS1408910; NSF IIS1332109; NIH R01GM083084.

Publisher Copyright:
Copyright 2016 by the authors.

Cite this

@conference{3e70c6a6c2e5490ca7381340b02dd6ff,

title = "An improved convergence analysis of cyclic block coordinate descent-type methods for strongly convex minimization",

abstract = "The cyclic block coordinate descent-type (CBCD-type) methods have shown remarkable computational performance for solving strongly convex minimization problems. Typical applications include many popular statistical machine learning methods such as elastic-net regression, ridge penalized logistic regression, and sparse additive regression. Existing optimization literature has shown that the CBCD-type methods attain an iteration complexity of O(p · log(1/ϵ)), where ϵ is a pre-specified accuracy of the objective value and p is the number of blocks. However, this iteration complexity depends explicitly on p, and is therefore at least p times worse than that of gradient descent (GD) methods. To bridge this theoretical gap, we propose an improved convergence analysis for the CBCD-type methods. In particular, we first show that for a family of quadratic minimization problems, the iteration complexity of the CBCD-type methods matches that of the GD methods in terms of dependency on p (up to a log² p factor). Thus our complexity bounds are sharper than the existing bounds by at least a factor of p/log² p. We also provide a lower bound to confirm that our improved complexity bounds are tight (up to a log² p factor) if the largest and smallest eigenvalues of the Hessian matrix do not scale with p. Finally, we generalize our analysis to other strongly convex minimization problems beyond quadratic ones.",

author = "Xingguo Li and Tuo Zhao and Raman Arora and Han Liu and Mingyi Hong",

note = "Funding Information: This research is supported by NSF DMS1454377-CAREER; NSF IIS 1546482-BIGDATA; NIH R01MH102339; NSF IIS1408910; NSF IIS1332109; NIH R01GM083084. Publisher Copyright: Copyright 2016 by the authors. 19th International Conference on Artificial Intelligence and Statistics, AISTATS 2016; Conference date: 09-05-2016 through 11-05-2016",

year = "2016",

language = "English (US)",

pages = "491--499",

}

TY - CONF

T1 - An improved convergence analysis of cyclic block coordinate descent-type methods for strongly convex minimization

AU - Li, Xingguo

AU - Zhao, Tuo

AU - Arora, Raman

AU - Liu, Han

AU - Hong, Mingyi

N1 - Funding Information: This research is supported by NSF DMS1454377-CAREER; NSF IIS 1546482-BIGDATA; NIH R01MH102339; NSF IIS1408910; NSF IIS1332109; NIH R01GM083084. Publisher Copyright: Copyright 2016 by the authors.

PY - 2016

Y1 - 2016

N2 - The cyclic block coordinate descent-type (CBCD-type) methods have shown remarkable computational performance for solving strongly convex minimization problems. Typical applications include many popular statistical machine learning methods such as elastic-net regression, ridge penalized logistic regression, and sparse additive regression. Existing optimization literature has shown that the CBCD-type methods attain an iteration complexity of O(p · log(1/ϵ)), where ϵ is a pre-specified accuracy of the objective value and p is the number of blocks. However, this iteration complexity depends explicitly on p, and is therefore at least p times worse than that of gradient descent (GD) methods. To bridge this theoretical gap, we propose an improved convergence analysis for the CBCD-type methods. In particular, we first show that for a family of quadratic minimization problems, the iteration complexity of the CBCD-type methods matches that of the GD methods in terms of dependency on p (up to a log² p factor). Thus our complexity bounds are sharper than the existing bounds by at least a factor of p/log² p. We also provide a lower bound to confirm that our improved complexity bounds are tight (up to a log² p factor) if the largest and smallest eigenvalues of the Hessian matrix do not scale with p. Finally, we generalize our analysis to other strongly convex minimization problems beyond quadratic ones.

AB - The cyclic block coordinate descent-type (CBCD-type) methods have shown remarkable computational performance for solving strongly convex minimization problems. Typical applications include many popular statistical machine learning methods such as elastic-net regression, ridge penalized logistic regression, and sparse additive regression. Existing optimization literature has shown that the CBCD-type methods attain an iteration complexity of O(p · log(1/ϵ)), where ϵ is a pre-specified accuracy of the objective value and p is the number of blocks. However, this iteration complexity depends explicitly on p, and is therefore at least p times worse than that of gradient descent (GD) methods. To bridge this theoretical gap, we propose an improved convergence analysis for the CBCD-type methods. In particular, we first show that for a family of quadratic minimization problems, the iteration complexity of the CBCD-type methods matches that of the GD methods in terms of dependency on p (up to a log² p factor). Thus our complexity bounds are sharper than the existing bounds by at least a factor of p/log² p. We also provide a lower bound to confirm that our improved complexity bounds are tight (up to a log² p factor) if the largest and smallest eigenvalues of the Hessian matrix do not scale with p. Finally, we generalize our analysis to other strongly convex minimization problems beyond quadratic ones.

UR - http://www.scopus.com/inward/record.url?scp=85067565899&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85067565899&partnerID=8YFLogxK

M3 - Paper

AN - SCOPUS:85067565899

SP - 491

EP - 499

T2 - 19th International Conference on Artificial Intelligence and Statistics, AISTATS 2016

Y2 - 9 May 2016 through 11 May 2016

ER -
