The performance potential of fine-grain and coarse-grain parallel architectures

David J. Lilja; Pen Chung Yew

doi:10.1109/HICSS.1991.183902

Research output: Contribution to journal › Conference article › peer-review

4 Scopus citations

Abstract

Recent work has shown that pipelining and multiple instruction issuing are architecturally equivalent in their abilities to exploit parallelism, but there has been little work directly comparing the performance of these fine-grain parallel architectures with that of the coarse-grain multiprocessors. Using trace-driven simulations, the authors compare the performance of a superscalar processor and a pipelined processor using dynamic dependence checking with that of a shared memory multiprocessor. For very parallel programs, they find that the fine-grain processors must bypass an unrealistically large number of branches to match the performance of the multiprocessor. When executing programs with a wide range of potential parallelism, the best performance is obtained using a multiprocessor where each individual processor has a fine-grain parallelism of two to four.

Original language	English (US)
Article number	183902
Pages (from-to)	324-333
Number of pages	10
Journal	Proceedings of the Annual Hawaii International Conference on System Sciences
Volume	1
DOIs	https://doi.org/10.1109/HICSS.1991.183902
State	Published - 1991
Externally published	Yes
Event	24th Annual Hawaii International Conference on System Sciences, HICSS 1991 - Kauai, United States
Duration	Jan 8 1991 → Jan 11 1991

Bibliographical note

Funding Information:
This work was supported by the National Science Foundation under Grant No. NSF MIP-8410110, with additional support from NASA Ames Research Center Grant No. NASA NCC 2-559 (DARPA), National Science Foundation Grant No. NSF MIP-88-07775, and Department of Energy Grant No. DOE DE-FG02-85ER25001.

Publisher Copyright:
© 1991 IEEE.

Cite this

@article{cb917b1f4dd24e488663a007a6310e9f,

title = "The performance potential of fine-grain and coarse-grain parallel architectures",

abstract = "Recent work has shown that pipelining and multiple instruction issuing are architecturally equivalent in their abilities to exploit parallelism, but there has been little work directly comparing the performance of these fine-grain parallel architectures with that of the coarse-grain multiprocessors. Using trace-driven simulations, the authors compare the performance of a superscalar processor and a pipelined processor using dynamic dependence checking with that of a shared memory multiprocessor. For very parallel programs, they find that the fine-grain processors must bypass an unrealistically large number of branches to match the performance of the multiprocessor. When executing programs with a wide range of potential parallelism, the best performance is obtained using a multiprocessor where each individual processor has a fine-grain parallelism of two to four.",

author = "Lilja, {David J.} and Yew, {Pen Chung}",

note = "Funding Information: This work was supported by the National Science Foundation under Grant No. NSF MIP-8410110, with additional support from NASA Ames Research Center Grant No. NASA NCC 2-559 (DARPA), National Science Foundation Grant No. NSF MIP-88-07775, and Department of Energy Grant No. DOE DE-FG02-85ER25001. Publisher Copyright: {\textcopyright} 1991 IEEE. 24th Annual Hawaii International Conference on System Sciences, HICSS 1991; Conference date: 8 January 1991 through 11 January 1991",

year = "1991",

doi = "10.1109/HICSS.1991.183902",

language = "English (US)",

volume = "1",

pages = "324--333",

journal = "Proceedings of the Annual Hawaii International Conference on System Sciences",

issn = "1530-1605",

}

TY - JOUR

T1 - The performance potential of fine-grain and coarse-grain parallel architectures

AU - Lilja, David J.

AU - Yew, Pen Chung

N1 - Funding Information: This work was supported by the National Science Foundation under Grant No. NSF MIP-8410110, with additional support from NASA Ames Research Center Grant No. NASA NCC 2-559 (DARPA), National Science Foundation Grant No. NSF MIP-88-07775, and Department of Energy Grant No. DOE DE-FG02-85ER25001. Publisher Copyright: © 1991 IEEE.

PY - 1991

Y1 - 1991

AB - Recent work has shown that pipelining and multiple instruction issuing are architecturally equivalent in their abilities to exploit parallelism, but there has been little work directly comparing the performance of these fine-grain parallel architectures with that of the coarse-grain multiprocessors. Using trace-driven simulations, the authors compare the performance of a superscalar processor and a pipelined processor using dynamic dependence checking with that of a shared memory multiprocessor. For very parallel programs, they find that the fine-grain processors must bypass an unrealistically large number of branches to match the performance of the multiprocessor. When executing programs with a wide range of potential parallelism, the best performance is obtained using a multiprocessor where each individual processor has a fine-grain parallelism of two to four.

UR - http://www.scopus.com/inward/record.url?scp=84939350100&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84939350100&partnerID=8YFLogxK

U2 - 10.1109/HICSS.1991.183902

DO - 10.1109/HICSS.1991.183902

M3 - Conference article

AN - SCOPUS:84939350100

SN - 1530-1605

VL - 1

SP - 324

EP - 333

JO - Proceedings of the Annual Hawaii International Conference on System Sciences

JF - Proceedings of the Annual Hawaii International Conference on System Sciences

M1 - 183902

T2 - 24th Annual Hawaii International Conference on System Sciences, HICSS 1991

Y2 - 8 January 1991 through 11 January 1991

ER -
