An evaluation of a compiler optimization for improving the performance of a coherence directory

Farnaz Mounes-Toussi, David J. Lilja, Zhiyuan Li

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

2 Scopus citations

Abstract

Both hardware-controlled and compiler-directed mechanisms have been proposed for maintaining cache coherence in large-scale shared-memory multiprocessors, but both of these approaches have significant limitations. We examine the potential performance improvement of a new software-hardware controlled cache coherence mechanism [18]. This approach augments the run-time information available to a directory-based coherence mechanism with compile-time analysis that statically identifies write references that cannot cause coherence problems and writes that should be written through to memory. These references are marked as not needing to send invalidation messages, thereby reducing the network traffic produced by the directory while maintaining cache consistency. For memory references that are ambiguous, for instance because of conditional branches or the need for complex data flow analysis, the compiler marks the references conservatively and relies on the hardware directory to ensure coherence. Trace-driven simulations are used to emulate the compile-time analysis on memory traces and to estimate the potential performance improvement that could be expected from a compiler performing this optimization on the Perfect Club benchmark programs. By reducing the number of invalidations, this optimized directory scheme is capable of reducing the processor-memory network traffic by up to 54 percent compared to an unoptimized directory mechanism. In addition, the overall miss ratio can be reduced by up to 42 percent due to a corresponding reduction in the number of write misses.
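
As a rough illustration of the idea described in the abstract, the toy Python sketch below replays a short memory trace through a simplified directory and counts invalidation messages with and without compiler marks. It is not the authors' simulator or compiler analysis. The trace format, the `marked` flag, the sample trace, and the function name `count_invalidations` are all assumptions made for this example.

```python
from collections import defaultdict

def count_invalidations(trace, use_marks):
    """Count invalidation messages a toy directory would send.

    trace: list of (cpu, op, block, marked) tuples, where op is "R" or "W"
    and marked=True stands for a write the compiler has (hypothetically)
    proven cannot cause a coherence problem.
    """
    sharers = defaultdict(set)  # block -> CPUs the directory believes hold a copy
    invalidations = 0
    for cpu, op, block, marked in trace:
        if op == "R":
            sharers[block].add(cpu)
            continue
        if use_marks and marked:
            # Assumed behavior: a marked write sends no invalidations.
            sharers[block].add(cpu)
            continue
        # Unmarked write: invalidate every other cached copy.
        invalidations += len(sharers[block] - {cpu})
        sharers[block] = {cpu}
    return invalidations

# Hypothetical trace: block "A" is actively shared, while CPU 1 never
# reads block "B" again after CPU 0 writes it.
TOY_TRACE = [
    (0, "R", "A", False),
    (1, "R", "A", False),
    (0, "W", "A", False),  # must invalidate CPU 1's copy
    (1, "R", "B", False),
    (0, "W", "B", True),   # marked: invalidation can be skipped
]

baseline = count_invalidations(TOY_TRACE, use_marks=False)
optimized = count_invalidations(TOY_TRACE, use_marks=True)
```

On this made-up trace the marked write saves one of the two invalidations. The percentages reported in the abstract come from the Perfect Club traces and cannot be reproduced by a sketch like this.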

Original language: English (US)
Title of host publication: Proceedings of the 8th International Conference on Supercomputing, ICS 1994
Publisher: Association for Computing Machinery
Pages: 75-84
Number of pages: 10
ISBN (Electronic): 0897916654
State: Published - Jul 16 1994
Event: 8th International Conference on Supercomputing, ICS 1994 - Manchester, United Kingdom
Duration: Jul 11 1994 to Jul 15 1994

Publication series

Name: Proceedings of the International Conference on Supercomputing
Volume: Part F129421

Other

Other: 8th International Conference on Supercomputing, ICS 1994
Country/Territory: United Kingdom
City: Manchester
Period: 7/11/94 to 7/15/94

Bibliographical note

Publisher Copyright:
© 1994 ACM.
