Algorithm Transformation Techniques for Concurrent Processors

Keshab K. Parhi

doi:10.1109/5.48830

Electrical and Computer Engineering

Research output: Contribution to journal › Article › peer-review

129 Scopus citations

Abstract

Progress in supercomputer technology leads to two major trends. First, many of the existing algorithms will need to be redesigned for efficient concurrent implementation using supercomputers. Second, a continuous increase will be apparent in the number of application-specific VLSI integrated circuits, which can provide the performance of supercomputers, using single chips or chipsets (at the expense of design time for algorithm and architecture development). Both of these approaches require considerable efforts in the development of algorithms for specific applications. This paper reviews four independent algorithm transformation methodologies: program unfolding, retiming, lookahead algorithms, and index mapping transformations. These transformation techniques exploit the available parallelism in iterative dataflow programs and create additional parallelism if necessary.
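The look-ahead idea named in the abstract can be illustrated with a minimal sketch. It is not taken from the paper: the first-order recursion x(n+1) = a*x(n) + u(n), the function names `run_direct` and `run_lookahead`, and the sample values are all assumptions chosen for illustration. Substituting the recursion into itself once gives x(n+2) = a^2*x(n) + a*u(n) + u(n+1), so each state depends on the state two steps back, and the even-indexed and odd-indexed states form two chains that no longer depend on each other.

```python
def run_direct(a, u, x0):
    """Iterate the assumed recursion x(n+1) = a*x(n) + u(n) one step at a time."""
    x = [x0]
    for u_n in u:
        x.append(a * x[-1] + u_n)
    return x


def run_lookahead(a, u, x0):
    """Iterate the one-step look-ahead form x(n+2) = a^2*x(n) + a*u(n) + u(n+1).

    Each new state uses the state two steps back, so the even and odd
    chains could in principle be computed concurrently.
    """
    if not u:
        return [x0]
    x = [None] * (len(u) + 1)
    x[0] = x0
    x[1] = a * x0 + u[0]  # one ordinary step seeds the odd-indexed chain
    a2 = a * a            # coefficient of the look-ahead recursion
    for n in range(len(u) - 1):
        x[n + 2] = a2 * x[n] + a * u[n] + u[n + 1]
    return x


if __name__ == "__main__":
    # Integer sample data (hypothetical) so both forms can be compared exactly.
    print(run_direct(2, [1, 2, 3, 4, 5, 6], 1))
    print(run_lookahead(2, [1, 2, 3, 4, 5, 6], 1))
```

Both functions return the same state sequence; the look-ahead form spends extra multiply-add work per step in exchange for a recursion loop that spans two delays instead of one.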

Original language	English (US)
Pages (from-to)	1879-1895
Number of pages	17
Journal	Proceedings of the IEEE
Volume	77
Issue number	12
DOIs	https://doi.org/10.1109/5.48830
State	Published - Dec 1989

Bibliographical note

Funding Information:
Manuscript received August 1, 1988; revised June 5, 1989. This research was supported in part by grants from the Advanced Research Project Agency monitored by the Naval Electronics Systems Command under Contract N00039-86-R-0365, the National Science Foundation under Contracts DCI-85-17339 and MIPS 89-08586, an IBM Graduate Fellowship, a University of California Regents Fellowship, and by the Electrical Engineering Department of the University of Minnesota. Portions of the research presented here were carried out when the author was with the University of California at Berkeley.

Concurrent processors provide the computing power needed to solve many computation-intensive problems, such as those found in signal and image processing, processing of geophysical and seismic data (which require solution of partial differential equations, and inverse problems), solution of very large scale systems, and recognition of words from a large vocabulary as required in speech recognition. A major challenge in programming these supercomputers lies in partitioning the tasks of an algorithm in a way that leads to better processor utilization (or reduced idle time). Often, the algorithms or programs may possess more concurrency, which may not be obvious at first examination. This hidden concurrency can be exploited by unfolding the program. Retiming and index mapping transformations are other approaches to unraveling the hidden concurrency. If the available concurrency in the algorithm is inadequate for the real-time speed constraints of the target application, then look-ahead computation can be used to create additional concurrency (for a class of recursive ...

Cite this

@article{256471fa3ae8435594f795e3c30fa9bb,

title = "Algorithm Transformation Techniques for Concurrent Processors",

abstract = "Progress in supercomputer technology leads to two major trends. First, many of the existing algorithms will need to be redesigned for efficient concurrent implementation using supercomputers. Second, a continuous increase will be apparent in the number of application-specific VLSI integrated circuits, which can provide the performance of supercomputers, using single chips or chipsets (at the expense of design time for algorithm and architecture development). Both of these approaches require considerable efforts in the development of algorithms for specific applications. This paper reviews four independent algorithm transformation methodologies: program unfolding, retiming, lookahead algorithms, and index mapping transformations. These transformation techniques exploit the available parallelism in iterative dataflow programs and create additional parallelism if necessary.",

author = "Parhi, {Keshab K.}",

note = "Funding Information: Manuscript received August 1, 1988; revised June 5, 1989. This research was supported in part by grants from the Advanced Research Project Agency monitored by the Naval Electronics Systems Command under Contract N00039-86-R-0365, the National Science Foundation under Contracts DCI-85-17339 and MIPS 89-08586, an IBM Graduate Fellowship, a University of California Regents Fellowship, and by the Electrical Engineering Department of the University of Minnesota. Portions of the research presented here were carried out when the author was with the University of California at Berkeley. Concurrent processors provide the computing power needed to solve many computation-intensive problems, such as those found in signal and image processing, processing of geophysical and seismic data (which require solution of partial differential equations, and inverse problems), solution of very large scale systems, and recognition of words from a large vocabulary as required in speech recognition. A major challenge in programming these supercomputers lies in partitioning the tasks of an algorithm in a way that leads to better processor utilization (or reduced idle time). Often, the algorithms or programs may possess more concurrency, which may not be obvious at first examination. This hidden concurrency can be exploited by unfolding the program. Retiming and index mapping transformations are other approaches to unraveling the hidden concurrency. If the available concurrency in the algorithm is inadequate for the real-time speed constraints of the target application, then look-ahead computation can be used to create additional concurrency (for a class of recursive ...",

year = "1989",

month = dec,

doi = "10.1109/5.48830",

language = "English (US)",

volume = "77",

pages = "1879--1895",

journal = "Proceedings of the IEEE",

issn = "0018-9219",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "12",

}

TY - JOUR

T1 - Algorithm Transformation Techniques for Concurrent Processors

AU - Parhi, Keshab K.

N1 - Funding Information: Manuscript received August 1, 1988; revised June 5, 1989. This research was supported in part by grants from the Advanced Research Project Agency monitored by the Naval Electronics Systems Command under Contract N00039-86-R-0365, the National Science Foundation under Contracts DCI-85-17339 and MIPS 89-08586, an IBM Graduate Fellowship, a University of California Regents Fellowship, and by the Electrical Engineering Department of the University of Minnesota. Portions of the research presented here were carried out when the author was with the University of California at Berkeley. Concurrent processors provide the computing power needed to solve many computation-intensive problems, such as those found in signal and image processing, processing of geophysical and seismic data (which require solution of partial differential equations, and inverse problems), solution of very large scale systems, and recognition of words from a large vocabulary as required in speech recognition. A major challenge in programming these supercomputers lies in partitioning the tasks of an algorithm in a way that leads to better processor utilization (or reduced idle time). Often, the algorithms or programs may possess more concurrency, which may not be obvious at first examination. This hidden concurrency can be exploited by unfolding the program. Retiming and index mapping transformations are other approaches to unraveling the hidden concurrency. If the available concurrency in the algorithm is inadequate for the real-time speed constraints of the target application, then look-ahead computation can be used to create additional concurrency (for a class of recursive ...

PY - 1989/12

Y1 - 1989/12

N2 - Progress in supercomputer technology leads to two major trends. First, many of the existing algorithms will need to be redesigned for efficient concurrent implementation using supercomputers. Second, a continuous increase will be apparent in the number of application-specific VLSI integrated circuits, which can provide the performance of supercomputers, using single chips or chipsets (at the expense of design time for algorithm and architecture development). Both of these approaches require considerable efforts in the development of algorithms for specific applications. This paper reviews four independent algorithm transformation methodologies: program unfolding, retiming, lookahead algorithms, and index mapping transformations. These transformation techniques exploit the available parallelism in iterative dataflow programs and create additional parallelism if necessary.

AB - Progress in supercomputer technology leads to two major trends. First, many of the existing algorithms will need to be redesigned for efficient concurrent implementation using supercomputers. Second, a continuous increase will be apparent in the number of application-specific VLSI integrated circuits, which can provide the performance of supercomputers, using single chips or chipsets (at the expense of design time for algorithm and architecture development). Both of these approaches require considerable efforts in the development of algorithms for specific applications. This paper reviews four independent algorithm transformation methodologies: program unfolding, retiming, lookahead algorithms, and index mapping transformations. These transformation techniques exploit the available parallelism in iterative dataflow programs and create additional parallelism if necessary.

UR - http://www.scopus.com/inward/record.url?scp=0024883413&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0024883413&partnerID=8YFLogxK

U2 - 10.1109/5.48830

DO - 10.1109/5.48830

M3 - Article

AN - SCOPUS:0024883413

SN - 0018-9219

VL - 77

SP - 1879

EP - 1895

JO - Proceedings of the IEEE

JF - Proceedings of the IEEE

IS - 12

ER -