A divide-and-conquer approach for solving singular value decomposition on a heterogeneous system

Ding Liu; Ruixuan Li; David J. Lilja; Weijun Xiao

doi:10.1145/2482767.2482813

Electrical and Computer Engineering

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

7 Scopus citations

Abstract

Singular value decomposition (SVD) is a fundamental linear algebra operation used in many applications, such as pattern recognition and statistical information processing. To accelerate this time-consuming operation, this paper presents a new divide-and-conquer approach for solving SVD on a heterogeneous CPU-GPU system. We carefully design our algorithm to match the mathematical requirements of SVD to the unique characteristics of a heterogeneous computing platform. This includes a high-performance solution to the secular equation with good numerical stability, overlapping the CPU and the GPU tasks, and leveraging the GPU bandwidth in a heterogeneous system. The experimental results show that our algorithm outperforms MKL's divide-and-conquer routine [18] with four cores (eight hardware threads) when the size of the input matrix is larger than 3,000. Furthermore, it is up to 33 times faster than LAPACK's divide-and-conquer routine [17], 3 times faster than MKL's divide-and-conquer routine with four cores, and 7 times faster than CULA on the same device, when the size of the matrix grows up to 14,000. Our algorithm is also much faster than previous SVD approaches on GPUs.
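The abstract does not describe how the authors solve the secular equation, so the following is only an illustrative sketch of the problem itself, not the paper's method. It uses the textbook symmetric form: the eigenvalues of D + rho * z z^T, with D = diag(d), are the roots of f(x) = 1 + rho * sum_i z_i^2 / (d_i - x), and exactly one root lies between each pair of consecutive d_i. The function name `secular_roots`, the use of plain bisection, and the assumptions that d is strictly increasing, every z_i is nonzero, and rho > 0 are choices made here for illustration; production solvers use faster rational-interpolation iterations.

```python
import numpy as np

def secular_roots(d, z, rho, max_iter=200):
    """Roots of f(x) = 1 + rho * sum(z**2 / (d - x)) by bisection.

    Assumes (for this sketch) d strictly increasing, all z nonzero, rho > 0.
    """
    d = np.asarray(d, dtype=float)
    z = np.asarray(z, dtype=float)
    # Root i lies in (d[i], d[i+1]); the last lies in (d[-1], d[-1] + rho*||z||^2].
    upper = np.append(d[1:], d[-1] + rho * np.dot(z, z))
    roots = np.empty(d.size)
    for i in range(d.size):
        lo, hi = d[i], upper[i]
        for _ in range(max_iter):
            mid = 0.5 * (lo + hi)
            if mid == lo or mid == hi:
                break  # interval has shrunk to floating-point resolution
            f = 1.0 + rho * np.sum(z**2 / (d - mid))
            # f is increasing on each interval, so f > 0 means the root is to the left.
            if f > 0.0:
                hi = mid
            else:
                lo = mid
        roots[i] = 0.5 * (lo + hi)
    return roots

if __name__ == "__main__":
    d = np.array([1.0, 2.0, 3.0, 4.0])
    z = np.array([0.5, 0.5, 0.5, 0.5])
    print(secular_roots(d, z, rho=1.0))
    # Reference values from a dense symmetric eigensolver:
    print(np.linalg.eigvalsh(np.diag(d) + np.outer(z, z)))
```

Bisection is used here only because it is easy to verify: each root is bracketed by construction, so the iteration cannot leave its interval. It converges far more slowly than the iterations used in tuned libraries.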

Original language	English (US)
Title of host publication	Proceedings of the ACM International Conference on Computing Frontiers, CF 2013
DOIs	https://doi.org/10.1145/2482767.2482813
State	Published - 2013
Event	2013 ACM International Conference on Computing Frontiers, CF 2013 - Ischia, Italy; Duration: May 14 2013 → May 16 2013

Keywords

Divide-and-conquer
Heterogeneous architecture
Performance evaluation
Singular value decomposition (SVD)

Cite this

A divide-and-conquer approach for solving singular value decomposition on a heterogeneous system. / Liu, Ding; Li, Ruixuan; Lilja, David J. et al.
Proceedings of the ACM International Conference on Computing Frontiers, CF 2013. 2013. (Proceedings of the ACM International Conference on Computing Frontiers, CF 2013).

Liu, D, Li, R, Lilja, DJ & Xiao, W 2013, A divide-and-conquer approach for solving singular value decomposition on a heterogeneous system. in Proceedings of the ACM International Conference on Computing Frontiers, CF 2013. Proceedings of the ACM International Conference on Computing Frontiers, CF 2013, 2013 ACM International Conference on Computing Frontiers, CF 2013, Ischia, Italy, 5/14/13. https://doi.org/10.1145/2482767.2482813

@inproceedings{1a6a4bbe535e4738ba796840a308eaf7,
title = "A divide-and-conquer approach for solving singular value decomposition on a heterogeneous system",

abstract = "Singular value decomposition (SVD) is a fundamental linear operation that has been used for many applications, such as pattern recognition and statistical information processing. In order to accelerate this time-consuming operation, this paper presents a new divide-and-conquer approach for solving SVD on a heterogeneous CPU-GPU system. We carefully design our algorithm to match the mathematical requirements of SVD to the unique characteristics of a heterogeneous computing platform. This includes a high-performance solution to the secular equation with good numerical stability, overlapping the CPU and the GPU tasks, and leveraging the GPU bandwidth in a heterogeneous system. The experimental results show that our algorithm has better performance than MKL's divide-and-conquer routine [18] with four cores (eight hardware threads) when the size of the input matrix is larger than 3000. Furthermore, it is up to 33 times faster than LAPACK's divide-and-conquer routine [17], 3 times faster than MKL's divide-and-conquer routine with four cores, and 7 times faster than CULA on the same device, when the size of the matrix grows up to 14,000. Our algorithm is also much faster than previous SVD approaches on GPUs",

keywords = "Divide-and-conquer, Heterogeneous architecture, Performance evaluation, Singular value decomposition (SVD)",
author = "Ding Liu and Ruixuan Li and Lilja, {David J.} and Weijun Xiao",
year = "2013",
doi = "10.1145/2482767.2482813",
language = "English (US)",
isbn = "9781450320535",
series = "Proceedings of the ACM International Conference on Computing Frontiers, CF 2013",
booktitle = "Proceedings of the ACM International Conference on Computing Frontiers, CF 2013",
note = "2013 ACM International Conference on Computing Frontiers, CF 2013 ; Conference date: 14-05-2013 Through 16-05-2013",
}

TY - GEN
T1 - A divide-and-conquer approach for solving singular value decomposition on a heterogeneous system
AU - Liu, Ding
AU - Li, Ruixuan
AU - Lilja, David J.
AU - Xiao, Weijun
PY - 2013
Y1 - 2013

AB - Singular value decomposition (SVD) is a fundamental linear operation that has been used for many applications, such as pattern recognition and statistical information processing. In order to accelerate this time-consuming operation, this paper presents a new divide-and-conquer approach for solving SVD on a heterogeneous CPU-GPU system. We carefully design our algorithm to match the mathematical requirements of SVD to the unique characteristics of a heterogeneous computing platform. This includes a high-performance solution to the secular equation with good numerical stability, overlapping the CPU and the GPU tasks, and leveraging the GPU bandwidth in a heterogeneous system. The experimental results show that our algorithm has better performance than MKL's divide-and-conquer routine [18] with four cores (eight hardware threads) when the size of the input matrix is larger than 3000. Furthermore, it is up to 33 times faster than LAPACK's divide-and-conquer routine [17], 3 times faster than MKL's divide-and-conquer routine with four cores, and 7 times faster than CULA on the same device, when the size of the matrix grows up to 14,000. Our algorithm is also much faster than previous SVD approaches on GPUs

KW - Divide-and-conquer
KW - Heterogeneous architecture
KW - Performance evaluation
KW - Singular value decomposition (SVD)
UR - http://www.scopus.com/inward/record.url?scp=84879529598&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84879529598&partnerID=8YFLogxK
U2 - 10.1145/2482767.2482813
DO - 10.1145/2482767.2482813
M3 - Conference contribution
AN - SCOPUS:84879529598
SN - 9781450320535
T3 - Proceedings of the ACM International Conference on Computing Frontiers, CF 2013
BT - Proceedings of the ACM International Conference on Computing Frontiers, CF 2013
T2 - 2013 ACM International Conference on Computing Frontiers, CF 2013
Y2 - 14 May 2013 through 16 May 2013
ER -
