SPLATT: Efficient and Parallel Sparse Tensor-Matrix Multiplication

Shaden Smith; Niranjay Ravindran; Nicholas D. Sidiropoulos; George Karypis

doi:10.1109/IPDPS.2015.27

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

161 Scopus citations

Abstract

Multi-dimensional arrays, or tensors, are increasingly found in fields such as signal processing and recommender systems. Real-world tensors can be enormous in size and often very sparse. There is a need for efficient, high-performance tools capable of processing the massive sparse tensors of today and the future. This paper introduces SPLATT, a C library with shared-memory parallelism for three-mode tensors. SPLATT contains algorithmic improvements over competing state-of-the-art tools for sparse tensor factorization. SPLATT has a fast, parallel method of multiplying a matricized tensor by a Khatri-Rao product, which is a key kernel in tensor factorization methods. SPLATT uses a novel data structure that exploits the sparsity patterns of tensors. This data structure has a small memory footprint similar to competing methods and allows for the computational improvements featured in our work. We also present a method of finding cache-friendly reorderings and utilizing them with a novel form of cache tiling. To our knowledge, this is the first work to investigate reordering and cache tiling in this context. SPLATT averages almost 30x speedup compared to our baseline when using 16 threads and reaches over 80x speedup on NELL-2.
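The kernel named in the abstract, multiplying a matricized tensor by a Khatri-Rao product, can be illustrated with a minimal sketch. For a three-mode tensor X and factor matrices B and C, the mode-1 product M = X_(1) (C ⊙ B) reduces to M[i, r] = Σ X[i, j, k] · B[j, r] · C[k, r] over the nonzeros of X. The code below is a naive, serial, coordinate-format version written for illustration only; it is not SPLATT's data structure, API, or algorithm, and the function name `mttkrp_mode1` and the list-of-tuples tensor layout are assumptions made for this example.

```python
def mttkrp_mode1(nonzeros, B, C, num_rows):
    """Naive mode-1 matricized-tensor times Khatri-Rao product.

    nonzeros: iterable of (i, j, k, value) coordinate entries of X
              (hypothetical layout chosen for this sketch).
    B, C:     dense factor matrices as lists of rows, `rank` columns each.
    num_rows: size of the first mode of X.
    """
    rank = len(B[0])
    M = [[0.0] * rank for _ in range(num_rows)]
    for i, j, k, value in nonzeros:
        row_b, row_c, out = B[j], C[k], M[i]
        # Each nonzero contributes value * (B[j, :] * C[k, :]) to row i.
        for r in range(rank):
            out[r] += value * row_b[r] * row_c[r]
    return M


# Tiny 2 x 2 x 2 tensor with three nonzeros and rank-2 factors.
X = [(0, 0, 0, 1.0), (0, 1, 1, 2.0), (1, 0, 1, 3.0)]
B = [[1.0, 2.0], [3.0, 4.0]]
C = [[5.0, 6.0], [7.0, 8.0]]
M = mttkrp_mode1(X, B, C, num_rows=2)
print(M)  # [[47.0, 76.0], [21.0, 48.0]]
```

This version never forms the Khatri-Rao product explicitly, so its cost scales with the number of nonzeros times the rank. The improvements the abstract describes (the sparsity-aware data structure, reordering, and cache tiling) concern how such a loop is organized and parallelized, and are not reproduced here.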

Original language	English (US)
Title of host publication	Proceedings - 2015 IEEE 29th International Parallel and Distributed Processing Symposium, IPDPS 2015
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	61-70
Number of pages	10
ISBN (Electronic)	9781479986484
DOIs	https://doi.org/10.1109/IPDPS.2015.27
State	Published - Jul 17 2015
Event	29th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2015 - Hyderabad, India Duration: May 25 2015 → May 29 2015

Publication series

Name	Proceedings - 2015 IEEE 29th International Parallel and Distributed Processing Symposium, IPDPS 2015

Other

Other	29th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2015
Country/Territory	India
City	Hyderabad
Period	5/25/15 → 5/29/15

Bibliographical note

Publisher Copyright:
© 2015 IEEE.

Keywords

CANDECOMP
CPD
PARAFAC
Sparse tensors

Cite this

Smith, S., Ravindran, N., Sidiropoulos, N. D., & Karypis, G. (2015). SPLATT: Efficient and Parallel Sparse Tensor-Matrix Multiplication. In Proceedings - 2015 IEEE 29th International Parallel and Distributed Processing Symposium, IPDPS 2015 (pp. 61-70). Article 7161496 (Proceedings - 2015 IEEE 29th International Parallel and Distributed Processing Symposium, IPDPS 2015). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/IPDPS.2015.27

@inproceedings{d611b62f1d4e480092b2a3a935801ae5,

title = "SPLATT: Efficient and Parallel Sparse Tensor-Matrix Multiplication",

abstract = "Multi-dimensional arrays, or tensors, are increasingly found in fields such as signal processing and recommender systems. Real-world tensors can be enormous in size and often very sparse. There is a need for efficient, high-performance tools capable of processing the massive sparse tensors of today and the future. This paper introduces SPLATT, a C library with shared-memory parallelism for three-mode tensors. SPLATT contains algorithmic improvements over competing state-of-the-art tools for sparse tensor factorization. SPLATT has a fast, parallel method of multiplying a matricized tensor by a Khatri-Rao product, which is a key kernel in tensor factorization methods. SPLATT uses a novel data structure that exploits the sparsity patterns of tensors. This data structure has a small memory footprint similar to competing methods and allows for the computational improvements featured in our work. We also present a method of finding cache-friendly reorderings and utilizing them with a novel form of cache tiling. To our knowledge, this is the first work to investigate reordering and cache tiling in this context. SPLATT averages almost 30x speedup compared to our baseline when using 16 threads and reaches over 80x speedup on NELL-2.",

keywords = "CANDECOMP, CPD, PARAFAC, Sparse tensors",

author = "Shaden Smith and Niranjay Ravindran and Sidiropoulos, {Nicholas D.} and George Karypis",

note = "Publisher Copyright: {\textcopyright} 2015 IEEE.; 29th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2015 ; Conference date: 25-05-2015 Through 29-05-2015",

year = "2015",

month = jul,

day = "17",

doi = "10.1109/IPDPS.2015.27",

language = "English (US)",

series = "Proceedings - 2015 IEEE 29th International Parallel and Distributed Processing Symposium, IPDPS 2015",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "61--70",

booktitle = "Proceedings - 2015 IEEE 29th International Parallel and Distributed Processing Symposium, IPDPS 2015",

}

TY - GEN

T1 - SPLATT

T2 - 29th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2015

AU - Smith, Shaden

AU - Ravindran, Niranjay

AU - Sidiropoulos, Nicholas D.

AU - Karypis, George

PY - 2015/7/17

Y1 - 2015/7/17

N2 - Multi-dimensional arrays, or tensors, are increasingly found in fields such as signal processing and recommender systems. Real-world tensors can be enormous in size and often very sparse. There is a need for efficient, high-performance tools capable of processing the massive sparse tensors of today and the future. This paper introduces SPLATT, a C library with shared-memory parallelism for three-mode tensors. SPLATT contains algorithmic improvements over competing state-of-the-art tools for sparse tensor factorization. SPLATT has a fast, parallel method of multiplying a matricized tensor by a Khatri-Rao product, which is a key kernel in tensor factorization methods. SPLATT uses a novel data structure that exploits the sparsity patterns of tensors. This data structure has a small memory footprint similar to competing methods and allows for the computational improvements featured in our work. We also present a method of finding cache-friendly reorderings and utilizing them with a novel form of cache tiling. To our knowledge, this is the first work to investigate reordering and cache tiling in this context. SPLATT averages almost 30x speedup compared to our baseline when using 16 threads and reaches over 80x speedup on NELL-2.

AB - Multi-dimensional arrays, or tensors, are increasingly found in fields such as signal processing and recommender systems. Real-world tensors can be enormous in size and often very sparse. There is a need for efficient, high-performance tools capable of processing the massive sparse tensors of today and the future. This paper introduces SPLATT, a C library with shared-memory parallelism for three-mode tensors. SPLATT contains algorithmic improvements over competing state-of-the-art tools for sparse tensor factorization. SPLATT has a fast, parallel method of multiplying a matricized tensor by a Khatri-Rao product, which is a key kernel in tensor factorization methods. SPLATT uses a novel data structure that exploits the sparsity patterns of tensors. This data structure has a small memory footprint similar to competing methods and allows for the computational improvements featured in our work. We also present a method of finding cache-friendly reorderings and utilizing them with a novel form of cache tiling. To our knowledge, this is the first work to investigate reordering and cache tiling in this context. SPLATT averages almost 30x speedup compared to our baseline when using 16 threads and reaches over 80x speedup on NELL-2.

KW - CANDECOMP

KW - CPD

KW - PARAFAC

KW - Sparse tensors

UR - http://www.scopus.com/inward/record.url?scp=84971377867&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84971377867&partnerID=8YFLogxK

U2 - 10.1109/IPDPS.2015.27

DO - 10.1109/IPDPS.2015.27

M3 - Conference contribution

AN - SCOPUS:84971377867

T3 - Proceedings - 2015 IEEE 29th International Parallel and Distributed Processing Symposium, IPDPS 2015

SP - 61

EP - 70

BT - Proceedings - 2015 IEEE 29th International Parallel and Distributed Processing Symposium, IPDPS 2015

PB - Institute of Electrical and Electronics Engineers Inc.

Y2 - 25 May 2015 through 29 May 2015

ER -
