Efficient integration of compiler-directed Cache Coherence and Data Prefetching

Hock Beng Lim, Pen Chung Yew

Research output: Contribution to journalArticlepeer-review

2 Scopus citations

Abstract

Cache coherence enforcement and memory latency reduction and hiding are very important and challenging problems in the design of large-scale distributed shared-memory (DSM) multiprocessors. We propose an integrated approach to solve these problems through a compiler-directed cache coherence scheme called the Cache Coherence with Data Prefetching (CCDP) scheme. The CCDP scheme enforces cache coherence by prefetching the potentially stale references in a parallel program. It also prefetches the non-stale references to hide their memory latencies. To optimize the performance of the CCDP scheme, some prefetch hardware support is provided to efficiently handle these two forms of data prefetching operations. We also developed the compiler techniques utilized by the CCDP scheme for stale reference detection, prefetch target analysis, and prefetch scheduling. We evaluated the performance of the CCDP scheme via execution-driven simulations of several numerical applications from the SPEC CFP95 and the Perfect benchmark suites. The simulation results show that the CCDP scheme provides significant performance improvements for the applications studied, comparable to that obtained with a full-map hardware cache coherence scheme.

Original languageEnglish (US)
Pages (from-to)1775-1802
Number of pages28
JournalJournal of Parallel and Distributed Computing
Volume61
Issue number12
DOIs
StatePublished - 2001

Bibliographical note

Funding Information:
1This work is supported in part by the National Science Foundation under Grants MIP 93-07910, MIP 94-96320, CDA 95-02979, and MIP 96-10379. Additional support is provided by a gift from Cray Research, Inc and by a gift from Intel Corporation.

Keywords

  • Compiler-directed cache coherence
  • Data prefetching
  • Memory latency hiding
  • Shared-memory multiprocessors

Fingerprint

Dive into the research topics of 'Efficient integration of compiler-directed Cache Coherence and Data Prefetching'. Together they form a unique fingerprint.

Cite this