TY - GEN
T1 - PermJoin
T2 - 2008 IEEE 24th International Conference on Data Engineering, ICDE'08
AU - Levandoski, Justin J.
AU - Khalefa, Mohamed E.
AU - Mokbel, Mohamed F.
PY - 2008/10/1
Y1 - 2008/10/1
N2 - This paper introduces an efficient algorithm for Producing Early Results in Multi-join query plans (PermJoin, for short). While most previous research focuses only on the case of a single join operator, PermJoin takes a radical step by addressing query plans with multiple join operators. PermJoin is optimized to maximize the early overall throughput and to adapt to fluctuations in data arrival rates. PermJoin is a non-blocking operator that is capable of producing join results even if one or more data sources are blocked due to slow or bursty network behavior. Furthermore, PermJoin distinguishes itself from all previous techniques as it: (1) employs a new flushing policy to write in-memory data to disk, once memory allotment is exhausted, in a way that helps increase the probability of producing early result throughput in multi-join queries, and (2) employs a novel state manager module that adaptively switches operators between joining in-memory data and disk-resident data in order to maximize overall throughput.
AB - This paper introduces an efficient algorithm for Producing Early Results in Multi-join query plans (PermJoin, for short). While most previous research focuses only on the case of a single join operator, PermJoin takes a radical step by addressing query plans with multiple join operators. PermJoin is optimized to maximize the early overall throughput and to adapt to fluctuations in data arrival rates. PermJoin is a non-blocking operator that is capable of producing join results even if one or more data sources are blocked due to slow or bursty network behavior. Furthermore, PermJoin distinguishes itself from all previous techniques as it: (1) employs a new flushing policy to write in-memory data to disk, once memory allotment is exhausted, in a way that helps increase the probability of producing early result throughput in multi-join queries, and (2) employs a novel state manager module that adaptively switches operators between joining in-memory data and disk-resident data in order to maximize overall throughput.
UR - http://www.scopus.com/inward/record.url?scp=52649152956&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=52649152956&partnerID=8YFLogxK
U2 - 10.1109/ICDE.2008.4497580
DO - 10.1109/ICDE.2008.4497580
M3 - Conference contribution
AN - SCOPUS:52649152956
SN - 9781424418374
T3 - Proceedings - International Conference on Data Engineering
SP - 1433
EP - 1435
BT - Proceedings of the 2008 IEEE 24th International Conference on Data Engineering, ICDE'08
Y2 - 7 April 2008 through 12 April 2008
ER -