Exploiting multiple heterogeneous networks to reduce communication costs in parallel programs

Jun Seong Kim, David J. Lilja

Research output: Chapter in Book/Report/Conference proceedingChapter

10 Scopus citations

Abstract

The different types of messages used by a parallel application program executing in a distributed system can each have unique characteristics so that no single communication network can produce the lowest latency for all messages. For instance, short control messages may be sent with the lowest overhead on one type of network, such as Ethernet, while bulk data transfers may be better suited to a different type of network, such as Fibre Channel or HiPPI. In this paper, we investigate how to exploit multiple heterogeneous communication networks that interconnect the same set of processing nodes by dynamically selecting the best (lowest latency) network for each message based on the message size. We also show how to aggregate these multiple parallel networks into a single virtual network to further reduce the latency and increase the available bandwidth. We test this multiplexing and aggregation on a cluster of SGI multiprocessors interconnected with both Fibre Channel and Ethernet. We find that multiplexing between Ethernet and Fibre Channel can substantially reduce communication overhead in a synthetic benchmark compared to using either network alone. Aggregating these two networks into a single virtual network can further reduce communication delays for applications with many large messages. The best choice of either multiplexing or aggregation depends on the mix of message sizes in the application program and the relative overheads of the two networks.

Original languageEnglish (US)
Title of host publicationProceedings of the Heterogeneous Computing Workshop, HCW
Editors Anon
PublisherIEEE
Pages83-95
Number of pages13
StatePublished - 1997
Externally publishedYes
EventProceedings of the 1997 6th Heterogeneous Computing Workshop, HCW'97 - Geneva, Switz
Duration: Apr 1 1997Apr 1 1997

Publication series

NameProceedings of the Heterogeneous Computing Workshop, HCW

Other

OtherProceedings of the 1997 6th Heterogeneous Computing Workshop, HCW'97
CityGeneva, Switz
Period4/1/974/1/97

Fingerprint

Dive into the research topics of 'Exploiting multiple heterogeneous networks to reduce communication costs in parallel programs'. Together they form a unique fingerprint.

Cite this