On the scalability of FFT on parallel computers

Anshul Gupta, Vipin Kumar

Research output: Chapter in Book/Report/Conference proceedingConference contribution

10 Scopus citations

Abstract

The scalability of the parallel fast Fourier transform (FFT) algorithm on mesh- and hypercube-connected multicomputers is analyzed. The hypercube architecture provides linearly increasing performance for the FFT algorithm with an increasing number of processors and a moderately increasing problem size. However, there is a limit on the efficiency, which is determined by the communication bandwidth of the hypercube channels. Efficiencies higher than this limit can be obtained only if the problem size is increased very rapidly. Technology-dependent features, such as the communication bandwidth, determine the upper bound on the overall performance that can be obtained from a P-processor system. The upper bound can be moved up by either improving the communication-related parameters linearly or increasing the problem size exponentially. The scalability analysis shows that the FFT algorithm cannot make efficient use of large-scale mesh architectures. The addition of such features as cut-through routing and multicasting does not improve the overall scalability on this architecture.

Original languageEnglish (US)
Title of host publicationProc 3 Symp Front Massively Parallel Comput Frontiers 90
PublisherPubl by IEEE
Pages69-74
Number of pages6
ISBN (Print)0818620536
StatePublished - Dec 1 1990
EventProceedings of the 3rd Symposium on the Frontiers of Massively Parallel Computation - Frontiers '90 - College Park, MD, USA
Duration: Oct 8 1990Oct 10 1990

Publication series

NameProc 3 Symp Front Massively Parallel Comput Frontiers 90

Other

OtherProceedings of the 3rd Symposium on the Frontiers of Massively Parallel Computation - Frontiers '90
CityCollege Park, MD, USA
Period10/8/9010/10/90

Fingerprint

Dive into the research topics of 'On the scalability of FFT on parallel computers'. Together they form a unique fingerprint.

Cite this