The exigency of benchmark and compiler drift: Designing tomorrow's processors with yesterday's tools

Joshua J. Yi; Hans Vandierendonck; Lieven Eeckhout; David J. Lilja

doi:10.1145/1183401.1183414

The exigency of benchmark and compiler drift: Designing tomorrow's processors with yesterday's tools

Joshua J. Yi, Hans Vandierendonck, Lieven Eeckhout, David J. Lilja

Electrical and Computer Engineering

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

10 Scopus citations

Abstract

Due to the amount of time required to design a new processor, one set of benchmark programs may be used during the design phase while another may be the standard when the design is finally delivered. Using one benchmark suite to design a processor while using a different, presumably more current, suite to evaluate its ultimate performance may lead to sub-optimal design decisions if there are large differences between the characteristics of the two suites and their respective compilers. We call this changes across time "drift". To evaluate the impact of using yesterday's benchmark and compiler technology to design tomorrow's processors, we compare common benchmarks from the SPEC 95 and SPEC 2000 benchmark suites. Our results yield three key conclusions. First, we show that the amount of drift, for common programs in successive SPEC benchmark suites, is significant. In SPEC 2000, the main memory access time is a far more significant performance bottleneck than in SPEC 95, while less significant SPEC 2000 performance bottlenecks include the L2 cache latency, the L1 I-cache size, and the number of reorder buffer entries. Second, using two different statistical techniques, we show that compiler drift is not as significant as benchmark drift. Third, we show that benchmark and compiler drift can have a significant impact on the final design decisions. Specifically, we use a one-parameter-at-a-time optimization algorithm to design two different year-2000 processors, one optimized for SPEC 95 and the other optimized for SPEC 2000, using the energy-delay product (EDP) as the optimization criterion. The results show that using SPEC 95 to design a year-2000 processor results in an 18.5% larger EDP and a 20.8% higher CPI than using the SPEC 2000 benchmarks to design the corresponding processor. Finally, we make a few recommendations to help computer architects minimize the effects of benchmark and compiler drift.

Original language	English (US)
Title of host publication	Proceedings of the 20th Annual International Conference on Supercomputing, ICS 2006
Pages	75-86
Number of pages	12
DOIs	https://doi.org/10.1145/1183401.1183414
State	Published - 2006
Event	20th Annual International Conference on Supercomputing, ICS 2006 - Cairns, Queensland, Australia Duration: Jun 28 2006 → Jul 1 2006

Publication series

Name	Proceedings of the International Conference on Supercomputing

Conference

Conference	20th Annual International Conference on Supercomputing, ICS 2006
Country/Territory	Australia
City	Cairns, Queensland
Period	6/28/06 → 7/1/06

Keywords

Benchmark drift
Compiler drift
Microprocessor design

Access

10.1145/1183401.1183414

OpenUrl availability

Full text

Cite this

Yi, J. J., Vandierendonck, H., Eeckhout, L., & Lilja, D. J. (2006). The exigency of benchmark and compiler drift: Designing tomorrow's processors with yesterday's tools. In Proceedings of the 20th Annual International Conference on Supercomputing, ICS 2006 (pp. 75-86). (Proceedings of the International Conference on Supercomputing). https://doi.org/10.1145/1183401.1183414

The exigency of benchmark and compiler drift: Designing tomorrow's processors with yesterday's tools. / Yi, Joshua J.; Vandierendonck, Hans; Eeckhout, Lieven et al.
Proceedings of the 20th Annual International Conference on Supercomputing, ICS 2006. 2006. p. 75-86 (Proceedings of the International Conference on Supercomputing).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Yi, JJ, Vandierendonck, H, Eeckhout, L & Lilja, DJ 2006, The exigency of benchmark and compiler drift: Designing tomorrow's processors with yesterday's tools. in Proceedings of the 20th Annual International Conference on Supercomputing, ICS 2006. Proceedings of the International Conference on Supercomputing, pp. 75-86, 20th Annual International Conference on Supercomputing, ICS 2006, Cairns, Queensland, Australia, 6/28/06. https://doi.org/10.1145/1183401.1183414

@inproceedings{cf41d9c636774a97b56c7354c245b32b,

title = "The exigency of benchmark and compiler drift: Designing tomorrow's processors with yesterday's tools",

abstract = "Due to the amount of time required to design a new processor, one set of benchmark programs may be used during the design phase while another may be the standard when the design is finally delivered. Using one benchmark suite to design a processor while using a different, presumably more current, suite to evaluate its ultimate performance may lead to sub-optimal design decisions if there are large differences between the characteristics of the two suites and their respective compilers. We call this changes across time {"}drift{"}. To evaluate the impact of using yesterday's benchmark and compiler technology to design tomorrow's processors, we compare common benchmarks from the SPEC 95 and SPEC 2000 benchmark suites. Our results yield three key conclusions. First, we show that the amount of drift, for common programs in successive SPEC benchmark suites, is significant. In SPEC 2000, the main memory access time is a far more significant performance bottleneck than in SPEC 95, while less significant SPEC 2000 performance bottlenecks include the L2 cache latency, the L1 I-cache size, and the number of reorder buffer entries. Second, using two different statistical techniques, we show that compiler drift is not as significant as benchmark drift. Third, we show that benchmark and compiler drift can have a significant impact on the final design decisions. Specifically, we use a one-parameter-at-a-time optimization algorithm to design two different year-2000 processors, one optimized for SPEC 95 and the other optimized for SPEC 2000, using the energy-delay product (EDP) as the optimization criterion. The results show that using SPEC 95 to design a year-2000 processor results in an 18.5% larger EDP and a 20.8% higher CPI than using the SPEC 2000 benchmarks to design the corresponding processor. Finally, we make a few recommendations to help computer architects minimize the effects of benchmark and compiler drift.",

keywords = "Benchmark drift, Compiler drift, Microprocessor design",

author = "Yi, {Joshua J.} and Hans Vandierendonck and Lieven Eeckhout and Lilja, {David J.}",

year = "2006",

doi = "10.1145/1183401.1183414",

language = "English (US)",

isbn = "1595932828",

series = "Proceedings of the International Conference on Supercomputing",

pages = "75--86",

booktitle = "Proceedings of the 20th Annual International Conference on Supercomputing, ICS 2006",

note = "20th Annual International Conference on Supercomputing, ICS 2006 ; Conference date: 28-06-2006 Through 01-07-2006",

}

TY - GEN

T1 - The exigency of benchmark and compiler drift

T2 - 20th Annual International Conference on Supercomputing, ICS 2006

AU - Yi, Joshua J.

AU - Vandierendonck, Hans

AU - Eeckhout, Lieven

AU - Lilja, David J.

PY - 2006

Y1 - 2006

N2 - Due to the amount of time required to design a new processor, one set of benchmark programs may be used during the design phase while another may be the standard when the design is finally delivered. Using one benchmark suite to design a processor while using a different, presumably more current, suite to evaluate its ultimate performance may lead to sub-optimal design decisions if there are large differences between the characteristics of the two suites and their respective compilers. We call this changes across time "drift". To evaluate the impact of using yesterday's benchmark and compiler technology to design tomorrow's processors, we compare common benchmarks from the SPEC 95 and SPEC 2000 benchmark suites. Our results yield three key conclusions. First, we show that the amount of drift, for common programs in successive SPEC benchmark suites, is significant. In SPEC 2000, the main memory access time is a far more significant performance bottleneck than in SPEC 95, while less significant SPEC 2000 performance bottlenecks include the L2 cache latency, the L1 I-cache size, and the number of reorder buffer entries. Second, using two different statistical techniques, we show that compiler drift is not as significant as benchmark drift. Third, we show that benchmark and compiler drift can have a significant impact on the final design decisions. Specifically, we use a one-parameter-at-a-time optimization algorithm to design two different year-2000 processors, one optimized for SPEC 95 and the other optimized for SPEC 2000, using the energy-delay product (EDP) as the optimization criterion. The results show that using SPEC 95 to design a year-2000 processor results in an 18.5% larger EDP and a 20.8% higher CPI than using the SPEC 2000 benchmarks to design the corresponding processor. Finally, we make a few recommendations to help computer architects minimize the effects of benchmark and compiler drift.

AB - Due to the amount of time required to design a new processor, one set of benchmark programs may be used during the design phase while another may be the standard when the design is finally delivered. Using one benchmark suite to design a processor while using a different, presumably more current, suite to evaluate its ultimate performance may lead to sub-optimal design decisions if there are large differences between the characteristics of the two suites and their respective compilers. We call this changes across time "drift". To evaluate the impact of using yesterday's benchmark and compiler technology to design tomorrow's processors, we compare common benchmarks from the SPEC 95 and SPEC 2000 benchmark suites. Our results yield three key conclusions. First, we show that the amount of drift, for common programs in successive SPEC benchmark suites, is significant. In SPEC 2000, the main memory access time is a far more significant performance bottleneck than in SPEC 95, while less significant SPEC 2000 performance bottlenecks include the L2 cache latency, the L1 I-cache size, and the number of reorder buffer entries. Second, using two different statistical techniques, we show that compiler drift is not as significant as benchmark drift. Third, we show that benchmark and compiler drift can have a significant impact on the final design decisions. Specifically, we use a one-parameter-at-a-time optimization algorithm to design two different year-2000 processors, one optimized for SPEC 95 and the other optimized for SPEC 2000, using the energy-delay product (EDP) as the optimization criterion. The results show that using SPEC 95 to design a year-2000 processor results in an 18.5% larger EDP and a 20.8% higher CPI than using the SPEC 2000 benchmarks to design the corresponding processor. Finally, we make a few recommendations to help computer architects minimize the effects of benchmark and compiler drift.

KW - Benchmark drift

KW - Compiler drift

KW - Microprocessor design

UR - http://www.scopus.com/inward/record.url?scp=34547429489&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=34547429489&partnerID=8YFLogxK

U2 - 10.1145/1183401.1183414

DO - 10.1145/1183401.1183414

M3 - Conference contribution

AN - SCOPUS:34547429489

SN - 1595932828

SN - 9781595932822

T3 - Proceedings of the International Conference on Supercomputing

SP - 75

EP - 86

BT - Proceedings of the 20th Annual International Conference on Supercomputing, ICS 2006

Y2 - 28 June 2006 through 1 July 2006

ER -

The exigency of benchmark and compiler drift: Designing tomorrow's processors with yesterday's tools

Abstract

Publication series

Conference

Keywords

Access

OpenUrl availability

Other files and links

Fingerprint

Cite this