Inferring applications at the network layer using collective traffic statistics

Yu Jin; Nick Duffield; Patrick Haffner; Subhabrata Sen; Zhi-Li Zhang

doi:10.1145/1811099.1811082

Computer Science and Engineering

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

4 Scopus citations

Abstract

In this paper, we propose a novel technique for inferring the distribution of application classes present in the aggregated traffic flows between endpoints, which exploits both the statistics of the traffic flows, and the spatial distribution of those flows across the network. Our method employs a two-step supervised model, where the boot-strapping step provides initial (inaccurate) inference on the traffic application classes, and the graph-based calibration step adjusts the initial inference through the collective spatial traffic distribution. In evaluations using real traffic flow measurements from a large ISP, we show how our method can accurately classify application types within aggregate traffic between endpoints, even without the knowledge of ports and other traffic features. While the bootstrap estimate classifies the aggregates with 80% accuracy, incorporating spatial distributions through calibration increases the accuracy to 92%, i.e., roughly halving the number of errors.
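
The closing claim of the abstract can be checked with simple arithmetic. The following is a minimal sketch, assuming that "accuracy" means the fraction of aggregates classified correctly, so that the error rate is one minus the accuracy. The function name `error_reduction` is hypothetical and does not come from the paper.

```python
def error_reduction(acc_before: float, acc_after: float) -> float:
    """Return the relative reduction in error rate when accuracy improves.

    Assumption (not stated in the source): error rate = 1 - accuracy.
    """
    err_before = 1.0 - acc_before
    err_after = 1.0 - acc_after
    if err_before <= 0.0:
        raise ValueError("accuracy before must be below 1.0")
    return (err_before - err_after) / err_before


# Accuracies quoted in the abstract: 80% after bootstrapping, 92% after calibration.
reduction = error_reduction(0.80, 0.92)
print(f"relative error reduction: {reduction:.0%}")
```

Under that assumption, errors fall from 20% to 8% of aggregates, a relative reduction of about 60%, which is somewhat more than the "roughly halving" stated in the abstract.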

Original language	English (US)
Title of host publication	SIGMETRICS'10 - Proceedings of the 2010 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems
Pages	351-352
Number of pages	2
Edition	1 SPEC. ISSUE
DOIs	https://doi.org/10.1145/1811099.1811082
State	Published - 2010
Event	2010 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems, SIGMETRICS'10 - New York, NY, United States
Duration	Jun 14 2010 → Jun 18 2010

Publication series

Name	Performance Evaluation Review
Number	1 SPEC. ISSUE
Volume	38
ISSN (Print)	0163-5999

Conference

Conference	2010 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems, SIGMETRICS'10
Country/Territory	United States
City	New York, NY
Period	6/14/10 → 6/18/10

Keywords

Design
Measurement

Cite this

Jin, Y., Duffield, N., Haffner, P., Sen, S., & Zhang, Z.-L. (2010). Inferring applications at the network layer using collective traffic statistics. In SIGMETRICS'10 - Proceedings of the 2010 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems (1 SPEC. ISSUE ed., pp. 351-352). (Performance Evaluation Review; Vol. 38, No. 1 SPEC. ISSUE). https://doi.org/10.1145/1811099.1811082

Inferring applications at the network layer using collective traffic statistics. / Jin, Yu; Duffield, Nick; Haffner, Patrick et al.
SIGMETRICS'10 - Proceedings of the 2010 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems. 1 SPEC. ISSUE ed. 2010. p. 351-352 (Performance Evaluation Review; Vol. 38, No. 1 SPEC. ISSUE).

Jin, Y, Duffield, N, Haffner, P, Sen, S & Zhang, Z-L 2010, Inferring applications at the network layer using collective traffic statistics. in SIGMETRICS'10 - Proceedings of the 2010 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems. 1 SPEC. ISSUE edn, Performance Evaluation Review, no. 1 SPEC. ISSUE, vol. 38, pp. 351-352, 2010 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems, SIGMETRICS'10, New York, NY, United States, 6/14/10. https://doi.org/10.1145/1811099.1811082

Jin Y, Duffield N, Haffner P, Sen S, Zhang ZL. Inferring applications at the network layer using collective traffic statistics. In SIGMETRICS'10 - Proceedings of the 2010 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems. 1 SPEC. ISSUE ed. 2010. p. 351-352. (Performance Evaluation Review; 1 SPEC. ISSUE). doi: 10.1145/1811099.1811082

@inproceedings{837162a2bf9a409ea70e5faefb3c3d52,

title = "Inferring applications at the network layer using collective traffic statistics",

abstract = "In this paper, we propose a novel technique for inferring the distribution of application classes present in the aggregated traffic flows between endpoints, which exploits both the statistics of the traffic flows, and the spatial distribution of those flows across the network. Our method employs a two-step supervised model, where the boot-strapping step provides initial (inaccurate) inference on the traffic application classes, and the graph-based calibration step adjusts the initial inference through the collective spatial traffic distribution. In evaluations using real traffic flow measurements from a large ISP, we show how our method can accurately classify application types within aggregate traffic between endpoints, even without the knowledge of ports and other traffic features. While the bootstrap estimate classifies the aggregates with 80% accuracy, incorporating spatial distributions through calibration increases the accuracy to 92%, i.e., roughly halving the number of errors.",

keywords = "Design, Measurement",

author = "Yu Jin and Nick Duffield and Patrick Haffner and Subhabrata Sen and Zhi-Li Zhang",

note = "Copyright: Copyright 2010 Elsevier B.V., All rights reserved.; 2010 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems, SIGMETRICS'10 ; Conference date: 14-06-2010 Through 18-06-2010",

year = "2010",

doi = "10.1145/1811099.1811082",

language = "English (US)",

isbn = "9781450302111",

series = "Performance Evaluation Review",

volume = "38",

number = "1 SPEC. ISSUE",

pages = "351--352",

booktitle = "SIGMETRICS'10 - Proceedings of the 2010 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems",

edition = "1 SPEC. ISSUE",

}

TY - GEN

T1 - Inferring applications at the network layer using collective traffic statistics

AU - Jin, Yu

AU - Duffield, Nick

AU - Haffner, Patrick

AU - Sen, Subhabrata

AU - Zhang, Zhi-Li

PY - 2010

Y1 - 2010

AB - In this paper, we propose a novel technique for inferring the distribution of application classes present in the aggregated traffic flows between endpoints, which exploits both the statistics of the traffic flows, and the spatial distribution of those flows across the network. Our method employs a two-step supervised model, where the boot-strapping step provides initial (inaccurate) inference on the traffic application classes, and the graph-based calibration step adjusts the initial inference through the collective spatial traffic distribution. In evaluations using real traffic flow measurements from a large ISP, we show how our method can accurately classify application types within aggregate traffic between endpoints, even without the knowledge of ports and other traffic features. While the bootstrap estimate classifies the aggregates with 80% accuracy, incorporating spatial distributions through calibration increases the accuracy to 92%, i.e., roughly halving the number of errors.

KW - Design

KW - Measurement

UR - http://www.scopus.com/inward/record.url?scp=77954891495&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=77954891495&partnerID=8YFLogxK

U2 - 10.1145/1811099.1811082

DO - 10.1145/1811099.1811082

M3 - Conference contribution

AN - SCOPUS:77954891495

SN - 9781450302111

T3 - Performance Evaluation Review

SP - 351

EP - 352

BT - SIGMETRICS'10 - Proceedings of the 2010 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems

T2 - 2010 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems, SIGMETRICS'10

Y2 - 14 June 2010 through 18 June 2010

ER -
