TY - JOUR
T1 - Eliminating stale data references through array data-flow analysis
AU - Choi, Lynn
AU - Yew, Pen Chung
PY - 1996/1/1
Y1 - 1996/1/1
N2 - In this paper, we develop a compiler algorithm for detecting references to stale data in shared-memory multiprocessors. The algorithm consists of two key analysis techniques, stale reference detection and locality preserving analysis. While the stale reference detection finds the memory reference patterns that may violate cache coherence, the locality preserving analysis minimizes the number of such stale references by analyzing both temporal and spatial reuses. By computing the regions referenced by arrays inside loops, we extend the previous scalar algorithms [7, 9] for more precise analysis. We have implemented the algorithm on the Polaris parallelizing compiler [19], and using execution-driven simulations on Perfect Club benchmarks we demonstrate how unnecessary cache misses can be eliminated by the automatic stale reference detection.
AB - In this paper, we develop a compiler algorithm for detecting references to stale data in shared-memory multiprocessors. The algorithm consists of two key analysis techniques, stale reference detection and locality preserving analysis. While the stale reference detection finds the memory reference patterns that may violate cache coherence, the locality preserving analysis minimizes the number of such stale references by analyzing both temporal and spatial reuses. By computing the regions referenced by arrays inside loops, we extend the previous scalar algorithms [7, 9] for more precise analysis. We have implemented the algorithm on the Polaris parallelizing compiler [19], and using execution-driven simulations on Perfect Club benchmarks we demonstrate how unnecessary cache misses can be eliminated by the automatic stale reference detection.
UR - http://www.scopus.com/inward/record.url?scp=0029700415&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0029700415&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:0029700415
SN - 1063-6374
SP - 4
EP - 13
JO - IEEE Symposium on Parallel and Distributed Processing - Proceedings
JF - IEEE Symposium on Parallel and Distributed Processing - Proceedings
T2 - Proceedings of the 1996 10th International Parallel Processing Symposium
Y2 - 15 April 1996 through 19 April 1996
ER -