TY - GEN
T1 - Towards optimizing wide-area streaming analytics
AU - Heintz, Benjamin
AU - Chandra, Abhishek
AU - Sitaraman, Ramesh K.
PY - 2015/1/1
Y1 - 2015/1/1
N2 - Modern analytics services require the analysis of large quantities of data derived from disparate geo-distributed sources. Further, the analytics requirements can be complex, with many applications requiring a combination of both real-time and historical analysis, resulting in complex tradeoffs between cost, performance, and information quality. While the traditional approach to analytics processing is to send all the data to a dedicated centralized location, an alternative approach would be to push all computing to the edge for in-situ processing. We argue that neither approach is optimal for modern analytics requirements. Instead, we examine complex tradeoffs driven by a large number of factors such as application, data, and resource characteristics. We present an empirical study using PlanetLab experiments with beacon data from Akamai's download analytics service. We explore key tradeoffs and their implications for the design of next-generation scalable wide-area analytics.
AB - Modern analytics services require the analysis of large quantities of data derived from disparate geo-distributed sources. Further, the analytics requirements can be complex, with many applications requiring a combination of both real-time and historical analysis, resulting in complex tradeoffs between cost, performance, and information quality. While the traditional approach to analytics processing is to send all the data to a dedicated centralized location, an alternative approach would be to push all computing to the edge for in-situ processing. We argue that neither approach is optimal for modern analytics requirements. Instead, we examine complex tradeoffs driven by a large number of factors such as application, data, and resource characteristics. We present an empirical study using PlanetLab experiments with beacon data from Akamai's download analytics service. We explore key tradeoffs and their implications for the design of next-generation scalable wide-area analytics.
UR - http://www.scopus.com/inward/record.url?scp=84944340013&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84944340013&partnerID=8YFLogxK
U2 - 10.1109/IC2E.2015.53
DO - 10.1109/IC2E.2015.53
M3 - Conference contribution
AN - SCOPUS:84944340013
T3 - Proceedings - 2015 IEEE International Conference on Cloud Engineering, IC2E 2015
SP - 452
EP - 457
BT - Proceedings - 2015 IEEE International Conference on Cloud Engineering, IC2E 2015
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2015 IEEE International Conference on Cloud Engineering, IC2E 2015
Y2 - 9 March 2015 through 12 March 2015
ER -