Robust recovery of multiple subspaces by geometric lp minimization

Gilad Lerman; Teng Zhang

doi:10.1214/11-AOS914

Mathematics

Research output: Contribution to journal › Article › peer-review

67 Scopus citations

Abstract

We assume i.i.d. data sampled from a mixture distribution with K components along fixed d-dimensional linear subspaces and an additional outlier component. For p > 0, we study the simultaneous recovery of the K fixed subspaces by minimizing the lp-averaged distances of the sampled data points from any K subspaces. Under some conditions, we show that if 0 < p ≤ 1, then all underlying subspaces can be precisely recovered by lp minimization with overwhelming probability. On the other hand, if K > 1 and p > 1, then the underlying subspaces cannot be recovered or even nearly recovered by lp minimization. The results of this paper partially explain the successes and failures of the basic approach of lp energy minimization for modeling data by multiple subspaces.
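
The objective described in the abstract can be illustrated with a small sketch. It assumes one natural reading of "lp-averaged distances": each point contributes the p-th power of its distance to the nearest of the K candidate subspaces, and the contributions are averaged over the sample. The function names `dist_to_subspace` and `lp_energy`, the representation of a subspace as a list of orthonormal basis vectors, and the toy data are all illustrative assumptions, not taken from the paper.

```python
import math


def dist_to_subspace(x, basis):
    """Euclidean distance from point x to the span of an ORTHONORMAL basis.

    `basis` is a list of vectors (assumed orthonormal; not checked here).
    """
    residual = list(x)
    for b in basis:
        # Coefficient of x along b, then remove that component.
        coeff = sum(xi * bi for xi, bi in zip(x, b))
        residual = [ri - coeff * bi for ri, bi in zip(residual, b)]
    return math.sqrt(sum(ri * ri for ri in residual))


def lp_energy(points, subspaces, p):
    """Average over points of (distance to the nearest subspace) ** p.

    This is an assumed form of the lp energy, for illustration only.
    """
    if p <= 0:
        raise ValueError("p must be positive")
    total = 0.0
    for x in points:
        nearest = min(dist_to_subspace(x, basis) for basis in subspaces)
        total += nearest ** p
    return total / len(points)


# Toy data in the plane: four inliers on the two coordinate axes, one outlier.
points = [(1.0, 0.0), (2.0, 0.0), (0.0, 1.0), (0.0, 3.0), (1.0, 1.0)]
true_lines = [[(1.0, 0.0)], [(0.0, 1.0)]]  # K = 2 subspaces, each with d = 1

energy_p1 = lp_energy(points, true_lines, p=1.0)
energy_p2 = lp_energy(points, true_lines, p=2.0)
```

For the true pair of lines, only the outlier contributes to the energy, since every inlier lies exactly on one of the candidate subspaces. The sketch only evaluates the objective; it does not attempt the minimization over K-tuples of subspaces that the paper analyzes.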

Original language	English (US)
Pages (from-to)	2686-2715
Number of pages	30
Journal	Annals of Statistics
Volume	39
Issue number	5
DOIs	https://doi.org/10.1214/11-AOS914
State	Published - Oct 2011

Keywords

Clustering
Detection
Geometric probability
High-dimensional data
Hybrid linear modeling
Multiple subspaces
Optimization on the grassmannian
Robustness

Cite this

@article{231ee87917cf4466a7b4103379c79756,

title = "Robust recovery of multiple subspaces by geometric lp minimization",

abstract = "We assume i.i.d. data sampled from a mixture distribution with K components along fixed d-dimensional linear subspaces and an additional outlier component. For p > 0, we study the simultaneous recovery of the K fixed subspaces by minimizing the lp-averaged distances of the sampled data points from any K subspaces. Under some conditions, we show that if 0 < p ≤ 1, then all underlying subspaces can be precisely recovered by lp minimization with overwhelming probability. On the other hand, if K > 1 and p > 1, then the underlying subspaces cannot be recovered or even nearly recovered by lp minimization. The results of this paper partially explain the successes and failures of the basic approach of lp energy minimization for modeling data by multiple subspaces.",

keywords = "Clustering, Detection, Geometric probability, High-dimensional data, Hybrid linear modeling, Multiple subspaces, Optimization on the grassmannian, Robustness",

author = "Gilad Lerman and Teng Zhang",

year = "2011",

month = oct,

doi = "10.1214/11-AOS914",

language = "English (US)",

volume = "39",

pages = "2686--2715",

journal = "Annals of Statistics",

issn = "0090-5364",

publisher = "Institute of Mathematical Statistics",

number = "5",

}

TY - JOUR

T1 - Robust recovery of multiple subspaces by geometric lp minimization

AU - Lerman, Gilad

AU - Zhang, Teng

PY - 2011/10

Y1 - 2011/10

N2 - We assume i.i.d. data sampled from a mixture distribution with K components along fixed d-dimensional linear subspaces and an additional outlier component. For p > 0, we study the simultaneous recovery of the K fixed subspaces by minimizing the lp-averaged distances of the sampled data points from any K subspaces. Under some conditions, we show that if 0 < p ≤ 1, then all underlying subspaces can be precisely recovered by lp minimization with overwhelming probability. On the other hand, if K > 1 and p > 1, then the underlying subspaces cannot be recovered or even nearly recovered by lp minimization. The results of this paper partially explain the successes and failures of the basic approach of lp energy minimization for modeling data by multiple subspaces.

AB - We assume i.i.d. data sampled from a mixture distribution with K components along fixed d-dimensional linear subspaces and an additional outlier component. For p > 0, we study the simultaneous recovery of the K fixed subspaces by minimizing the lp-averaged distances of the sampled data points from any K subspaces. Under some conditions, we show that if 0 < p ≤ 1, then all underlying subspaces can be precisely recovered by lp minimization with overwhelming probability. On the other hand, if K > 1 and p > 1, then the underlying subspaces cannot be recovered or even nearly recovered by lp minimization. The results of this paper partially explain the successes and failures of the basic approach of lp energy minimization for modeling data by multiple subspaces.

KW - Clustering

KW - Detection

KW - Geometric probability

KW - High-dimensional data

KW - Hybrid linear modeling

KW - Multiple subspaces

KW - Optimization on the grassmannian

KW - Robustness

UR - http://www.scopus.com/inward/record.url?scp=84867092906&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84867092906&partnerID=8YFLogxK

U2 - 10.1214/11-AOS914

DO - 10.1214/11-AOS914

M3 - Article

AN - SCOPUS:84867092906

SN - 0090-5364

VL - 39

SP - 2686

EP - 2715

JO - Annals of Statistics

JF - Annals of Statistics

IS - 5

ER -
