PermJoin: An efficient algorithm for producing early results in multi-join query plans

Justin J. Levandoski; Mohamed E. Khalefa; Mohamed F. Mokbel

doi:10.1109/ICDE.2008.4497580

PermJoin: An efficient algorithm for producing early results in multi-join query plans

Justin J. Levandoski, Mohamed E. Khalefa, Mohamed F. Mokbel

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

6 Scopus citations

Abstract

This paper introduces an efficient algorithm for Producing Early Results in Multi-join query plans (PermJoin, for short). While most previous research focuses only on the case of a single join operator, PermJoin takes a radical step by addressing query plans with multiple join operators. PermJoin is optimized to maximize the early overall throughput and to adapt to fluctuations in data arrival rates. PermJoin is a non-blocking operator that is capable of producing join results even if one or more data sources are blocked due to slow or bursty network behavior. Furthermore, PermJoin distinguishes itself from all previous techniques as it: (1) employs a new flushing policy to write in-memory data to disk, once memory allotment is exhausted, in a way that helps increase the probability of producing early result throughput in multi-join queries, and (2) employs a novel state manager module that adaptively switches operators between joining in-memory data and disk-resident data in order to maximize overall throughput.

Original language	English (US)
Title of host publication	Proceedings of the 2008 IEEE 24th International Conference on Data Engineering, ICDE'08
Pages	1433-1435
Number of pages	3
DOIs	https://doi.org/10.1109/ICDE.2008.4497580
State	Published - Oct 1 2008
Externally published	Yes
Event	2008 IEEE 24th International Conference on Data Engineering, ICDE'08 - Cancun, Mexico Duration: Apr 7 2008 → Apr 12 2008

Publication series

Name	Proceedings - International Conference on Data Engineering
ISSN (Print)	1084-4627

Other

Other	2008 IEEE 24th International Conference on Data Engineering, ICDE'08
Country/Territory	Mexico
City	Cancun
Period	4/7/08 → 4/12/08

Access

10.1109/ICDE.2008.4497580

OpenUrl availability

Full text

Cite this

Levandoski, J. J., Khalefa, M. E., & Mokbel, M. F. (2008). PermJoin: An efficient algorithm for producing early results in multi-join query plans. In Proceedings of the 2008 IEEE 24th International Conference on Data Engineering, ICDE'08 (pp. 1433-1435). Article 4497580 (Proceedings - International Conference on Data Engineering). https://doi.org/10.1109/ICDE.2008.4497580

PermJoin: An efficient algorithm for producing early results in multi-join query plans. / Levandoski, Justin J.; Khalefa, Mohamed E.; Mokbel, Mohamed F.
Proceedings of the 2008 IEEE 24th International Conference on Data Engineering, ICDE'08. 2008. p. 1433-1435 4497580 (Proceedings - International Conference on Data Engineering).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Levandoski, JJ, Khalefa, ME & Mokbel, MF 2008, PermJoin: An efficient algorithm for producing early results in multi-join query plans. in Proceedings of the 2008 IEEE 24th International Conference on Data Engineering, ICDE'08., 4497580, Proceedings - International Conference on Data Engineering, pp. 1433-1435, 2008 IEEE 24th International Conference on Data Engineering, ICDE'08, Cancun, Mexico, 4/7/08. https://doi.org/10.1109/ICDE.2008.4497580

@inproceedings{7f0d270d0b2e4f979a967fc498e017ce,

title = "PermJoin: An efficient algorithm for producing early results in multi-join query plans",

abstract = "This paper introduces an efficient algorithm for Producing Early Results in Multi-join query plans (PermJoin, for short). While most previous research focuses only on the case of a single join operator, PermJoin takes a radical step by addressing query plans with multiple join operators. PermJoin is optimized to maximize the early overall throughput and to adapt to fluctuations in data arrival rates. PermJoin is a non-blocking operator that is capable of producing join results even if one or more data sources are blocked due to slow or bursty network behavior. Furthermore, PermJoin distinguishes itself from all previous techniques as it: (1) employs a new flushing policy to write in-memory data to disk, once memory allotment is exhausted, in a way that helps increase the probability of producing early result throughput in multi-join queries, and (2) employs a novel state manager module that adaptively switches operators between joining in-memory data and disk-resident data in order to maximize overall throughput.",

author = "Levandoski, {Justin J.} and Khalefa, {Mohamed E.} and Mokbel, {Mohamed F.}",

year = "2008",

month = oct,

day = "1",

doi = "10.1109/ICDE.2008.4497580",

language = "English (US)",

isbn = "9781424418374",

series = "Proceedings - International Conference on Data Engineering",

pages = "1433--1435",

booktitle = "Proceedings of the 2008 IEEE 24th International Conference on Data Engineering, ICDE'08",

note = "2008 IEEE 24th International Conference on Data Engineering, ICDE'08 ; Conference date: 07-04-2008 Through 12-04-2008",

}

TY - GEN

T1 - PermJoin

T2 - 2008 IEEE 24th International Conference on Data Engineering, ICDE'08

AU - Levandoski, Justin J.

AU - Khalefa, Mohamed E.

AU - Mokbel, Mohamed F.

PY - 2008/10/1

Y1 - 2008/10/1

N2 - This paper introduces an efficient algorithm for Producing Early Results in Multi-join query plans (PermJoin, for short). While most previous research focuses only on the case of a single join operator, PermJoin takes a radical step by addressing query plans with multiple join operators. PermJoin is optimized to maximize the early overall throughput and to adapt to fluctuations in data arrival rates. PermJoin is a non-blocking operator that is capable of producing join results even if one or more data sources are blocked due to slow or bursty network behavior. Furthermore, PermJoin distinguishes itself from all previous techniques as it: (1) employs a new flushing policy to write in-memory data to disk, once memory allotment is exhausted, in a way that helps increase the probability of producing early result throughput in multi-join queries, and (2) employs a novel state manager module that adaptively switches operators between joining in-memory data and disk-resident data in order to maximize overall throughput.

AB - This paper introduces an efficient algorithm for Producing Early Results in Multi-join query plans (PermJoin, for short). While most previous research focuses only on the case of a single join operator, PermJoin takes a radical step by addressing query plans with multiple join operators. PermJoin is optimized to maximize the early overall throughput and to adapt to fluctuations in data arrival rates. PermJoin is a non-blocking operator that is capable of producing join results even if one or more data sources are blocked due to slow or bursty network behavior. Furthermore, PermJoin distinguishes itself from all previous techniques as it: (1) employs a new flushing policy to write in-memory data to disk, once memory allotment is exhausted, in a way that helps increase the probability of producing early result throughput in multi-join queries, and (2) employs a novel state manager module that adaptively switches operators between joining in-memory data and disk-resident data in order to maximize overall throughput.

UR - http://www.scopus.com/inward/record.url?scp=52649152956&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=52649152956&partnerID=8YFLogxK

U2 - 10.1109/ICDE.2008.4497580

DO - 10.1109/ICDE.2008.4497580

M3 - Conference contribution

AN - SCOPUS:52649152956

SN - 9781424418374

T3 - Proceedings - International Conference on Data Engineering

SP - 1433

EP - 1435

BT - Proceedings of the 2008 IEEE 24th International Conference on Data Engineering, ICDE'08

Y2 - 7 April 2008 through 12 April 2008

ER -

PermJoin: An efficient algorithm for producing early results in multi-join query plans

Abstract

Publication series

Other

Access

OpenUrl availability

Other files and links

Fingerprint

Cite this