Exploiting TLS parallelism at multiple loop-nest levels

Venkatesan Packirisamy; Antonia Zhai

doi:10.1109/ICPADS.2009.143

Computer Science and Engineering

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

1 Scopus citation

Abstract

As the number of cores integrated onto a single chip increases, architecture and compiler designers are challenged with the difficulty of utilizing these cores to improve the performance of a single application. Thread-level speculation (TLS) can potentially help by allowing possibly dependent threads to speculatively execute in parallel. Extracting speculative threads from sequential applications is key to efficient TLS execution. Previous work on thread extraction has focused on parallelizing iterations from a single loop-nest level or function continuation. However, the amount of parallelism available at a single loop-nest level is sometimes limited, and we are forced to look for parallelism across multiple loop-nest levels. In this paper, we propose SpecOPTAL, a compiler algorithm that statically allocates cores to threads extracted from different levels of loop nests. We show that a subset of SPEC 2006 benchmarks are able to benefit from the proposed technique.

Original language	English (US)
Title of host publication	ICPADS '09 - 15th International Conference on Parallel and Distributed Systems
Pages	205-212
Number of pages	8
DOIs	https://doi.org/10.1109/ICPADS.2009.143
State	Published - 2009
Event	15th International Conference on Parallel and Distributed Systems, ICPADS '09 - Shenzhen, Guangdong, China
Duration	Dec 8 2009 → Dec 11 2009

Publication series

Name	Proceedings of the International Conference on Parallel and Distributed Systems - ICPADS
ISSN (Print)	1521-9097

Other

Other	15th International Conference on Parallel and Distributed Systems, ICPADS '09
Country/Territory	China
City	Shenzhen, Guangdong
Period	12/8/09 → 12/11/09

Cite this

Exploiting TLS parallelism at multiple loop-nest levels. / Packirisamy, Venkatesan; Zhai, Antonia.
ICPADS '09 - 15th International Conference on Parallel and Distributed Systems. 2009. p. 205-212 5395253 (Proceedings of the International Conference on Parallel and Distributed Systems - ICPADS).

Packirisamy, V & Zhai, A 2009, Exploiting TLS parallelism at multiple loop-nest levels. in ICPADS '09 - 15th International Conference on Parallel and Distributed Systems., 5395253, Proceedings of the International Conference on Parallel and Distributed Systems - ICPADS, pp. 205-212, 15th International Conference on Parallel and Distributed Systems, ICPADS '09, Shenzhen, Guangdong, China, 12/8/09. https://doi.org/10.1109/ICPADS.2009.143

@inproceedings{cd669321ed11469a90de1b9d14f0114d,

title = "Exploiting TLS parallelism at multiple loop-nest levels",

abstract = "As the number of cores integrated onto a single chip increases, architecture and compiler designers are challenged with the difficulty of utilizing these cores to improve the performance of a single application. Thread-level speculation (TLS) can potentially help by allowing possibly dependent threads to speculatively execute in parallel. Extracting speculative threads from sequential applications is key to efficient TLS execution. Previous work on thread extraction has focused on parallelizing iterations from a single loop-nest level or function continuation. However, the amount of parallelism available at a single loop-nest level is sometimes limited, and we are forced to look for parallelism across multiple loop-nest levels. In this paper, we propose SpecOPTAL, a compiler algorithm that statically allocates cores to threads extracted from different levels of loop nests. We show that a subset of SPEC 2006 benchmarks are able to benefit from the proposed technique.",

author = "Venkatesan Packirisamy and Antonia Zhai",

year = "2009",

doi = "10.1109/ICPADS.2009.143",

language = "English (US)",

isbn = "9780769539003",

series = "Proceedings of the International Conference on Parallel and Distributed Systems - ICPADS",

pages = "205--212",

booktitle = "ICPADS '09 - 15th International Conference on Parallel and Distributed Systems",

note = "15th International Conference on Parallel and Distributed Systems, ICPADS '09 ; Conference date: 08-12-2009 Through 11-12-2009",

}

TY - GEN

T1 - Exploiting TLS parallelism at multiple loop-nest levels

AU - Packirisamy, Venkatesan

AU - Zhai, Antonia

PY - 2009

Y1 - 2009

N2 - As the number of cores integrated onto a single chip increases, architecture and compiler designers are challenged with the difficulty of utilizing these cores to improve the performance of a single application. Thread-level speculation (TLS) can potentially help by allowing possibly dependent threads to speculatively execute in parallel. Extracting speculative threads from sequential applications is key to efficient TLS execution. Previous work on thread extraction has focused on parallelizing iterations from a single loop-nest level or function continuation. However, the amount of parallelism available at a single loop-nest level is sometimes limited, and we are forced to look for parallelism across multiple loop-nest levels. In this paper, we propose SpecOPTAL, a compiler algorithm that statically allocates cores to threads extracted from different levels of loop nests. We show that a subset of SPEC 2006 benchmarks are able to benefit from the proposed technique.

AB - As the number of cores integrated onto a single chip increases, architecture and compiler designers are challenged with the difficulty of utilizing these cores to improve the performance of a single application. Thread-level speculation (TLS) can potentially help by allowing possibly dependent threads to speculatively execute in parallel. Extracting speculative threads from sequential applications is key to efficient TLS execution. Previous work on thread extraction has focused on parallelizing iterations from a single loop-nest level or function continuation. However, the amount of parallelism available at a single loop-nest level is sometimes limited, and we are forced to look for parallelism across multiple loop-nest levels. In this paper, we propose SpecOPTAL, a compiler algorithm that statically allocates cores to threads extracted from different levels of loop nests. We show that a subset of SPEC 2006 benchmarks are able to benefit from the proposed technique.

UR - http://www.scopus.com/inward/record.url?scp=77949634324&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=77949634324&partnerID=8YFLogxK

U2 - 10.1109/ICPADS.2009.143

DO - 10.1109/ICPADS.2009.143

M3 - Conference contribution

AN - SCOPUS:77949634324

SN - 9780769539003

T3 - Proceedings of the International Conference on Parallel and Distributed Systems - ICPADS

SP - 205

EP - 212

BT - ICPADS '09 - 15th International Conference on Parallel and Distributed Systems

T2 - 15th International Conference on Parallel and Distributed Systems, ICPADS '09

Y2 - 8 December 2009 through 11 December 2009

ER -
