@inproceedings{9dbd0e665f324889805dcd21cca8fb90,
  title     = {Multiprocessor Cache Design Considerations},
  abstract  = {Cache design is explored for large high-performance multiprocessors with hundreds or thousands of processors and memory modules interconnected by a pipelined multistage network. Multiprocessor conditions are identified and modeled, including: (1) the cost of a cache coherence enforcement scheme; (2) the effect of a high degree of overlap between cache miss services; (3) the cost of a pin-limited data path between shared memory and caches; (4) the effect of a high degree of data prefetching; (5) the program behavior of a scientific workload, as represented by 23 numerical subroutines; and (6) the parallel execution of programs. This model is used to show that the cache miss ratio is not a suitable performance measure in the multiprocessors of interest and to show that the optimal cache block size in such multiprocessors is much smaller than in many uniprocessors.},
  author    = {Lee, Roland L. and Yew, Pen-Chung and Lawrie, Duncan H.},
  year      = {1987},
  doi       = {10.1145/30350.30379},
  language  = {English (US)},
  isbn      = {0818607769},
  publisher = {IEEE},
  pages     = {253--262},
  booktitle = {Proceedings of the 14th Annual International Symposium on Computer Architecture ({ISCA} '87)},
}