Design and implementation of a lightweight dynamic optimization system

Jiwei Lu; Howard Chen; Pen Chung Yew; Wei Chung Hsu

Computer Science and Engineering

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

84 Scopus citations

Abstract

Many opportunities exist to improve micro-architectural performance due to performance events that are difficult to optimize at static compile time. Cache misses and branch mis-prediction patterns may vary for different micro-architectures using different inputs. Dynamic optimization provides an approach to address these and other performance events at runtime. This paper describes a software system of real implementation that detects performance problems of running applications and deploys optimizations to increase execution efficiency. We discuss issues of detecting performance bottlenecks, generating optimized traces and redirecting execution from the original code to the dynamically optimized code. Our current system speeds up many of the CPU2000 benchmark programs having large numbers of D-Cache misses through dynamically deployed cache prefetching. For other applications that don't benefit from our runtime optimization, the average cost is only 2% of execution time. We present this lightweight system as an example of using existing hardware and software to deploy speculative optimizations to improve a program's runtime performance.

Original language	English (US)
Title of host publication	Journal of Instruction-Level Parallelism
Volume	6
State	Published - Apr 2004

Cite this

@inproceedings{22c9f4de84354b7ba93b95ee3cebbf92,
title = "Design and implementation of a lightweight dynamic optimization system",

abstract = "Many opportunities exist to improve micro-architectural performance due to performance events that are difficult to optimize at static compile time. Cache misses and branch mis-prediction patterns may vary for different micro-architectures using different inputs. Dynamic optimization provides an approach to address these and other performance events at runtime. This paper describes a software system of real implementation that detects performance problems of running applications and deploys optimizations to increase execution efficiency. We discuss issues of detecting performance bottlenecks, generating optimized traces and redirecting execution from the original code to the dynamically optimized code. Our current system speeds up many of the CPU2000 benchmark programs having large numbers of D-Cache misses through dynamically deployed cache prefetching. For other applications that don't benefit from our runtime optimization, the average cost is only 2% of execution time. We present this lightweight system as an example of using existing hardware and software to deploy speculative optimizations to improve a program's runtime performance.",

author = "Jiwei Lu and Howard Chen and Yew, {Pen Chung} and Hsu, {Wei Chung}",
year = "2004",
month = apr,
language = "English (US)",
volume = "6",
booktitle = "Journal of Instruction-Level Parallelism",
}

TY - GEN
T1 - Design and implementation of a lightweight dynamic optimization system
AU - Lu, Jiwei
AU - Chen, Howard
AU - Yew, Pen Chung
AU - Hsu, Wei Chung
PY - 2004/4
Y1 - 2004/4

N2 - Many opportunities exist to improve micro-architectural performance due to performance events that are difficult to optimize at static compile time. Cache misses and branch mis-prediction patterns may vary for different micro-architectures using different inputs. Dynamic optimization provides an approach to address these and other performance events at runtime. This paper describes a software system of real implementation that detects performance problems of running applications and deploys optimizations to increase execution efficiency. We discuss issues of detecting performance bottlenecks, generating optimized traces and redirecting execution from the original code to the dynamically optimized code. Our current system speeds up many of the CPU2000 benchmark programs having large numbers of D-Cache misses through dynamically deployed cache prefetching. For other applications that don't benefit from our runtime optimization, the average cost is only 2% of execution time. We present this lightweight system as an example of using existing hardware and software to deploy speculative optimizations to improve a program's runtime performance.

AB - Many opportunities exist to improve micro-architectural performance due to performance events that are difficult to optimize at static compile time. Cache misses and branch mis-prediction patterns may vary for different micro-architectures using different inputs. Dynamic optimization provides an approach to address these and other performance events at runtime. This paper describes a software system of real implementation that detects performance problems of running applications and deploys optimizations to increase execution efficiency. We discuss issues of detecting performance bottlenecks, generating optimized traces and redirecting execution from the original code to the dynamically optimized code. Our current system speeds up many of the CPU2000 benchmark programs having large numbers of D-Cache misses through dynamically deployed cache prefetching. For other applications that don't benefit from our runtime optimization, the average cost is only 2% of execution time. We present this lightweight system as an example of using existing hardware and software to deploy speculative optimizations to improve a program's runtime performance.

UR - http://www.scopus.com/inward/record.url?scp=2942729643&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=2942729643&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:2942729643
VL - 6
BT - Journal of Instruction-Level Parallelism
ER -
