Unsupervised learning based distributed detection of global anomalies

Junlin Zhou; Aleksandar Lazarevic; Kuo Wei Hsu; Jaideep Srivastava; Yan Fu; Yue Wu

doi:10.1142/S0219622010004172

Unsupervised learning based distributed detection of global anomalies

Junlin Zhou, Aleksandar Lazarevic, Kuo Wei Hsu, Jaideep Srivastava, Yan Fu, Yue Wu

Computer Science and Engineering

Research output: Contribution to journal › Article › peer-review

6 Scopus citations

Abstract

Anomaly detection has recently become an important problem in many industrial and financial applications. Very often, the databases from which anomalies have to be found are located at multiple local sites and cannot be merged due to privacy reasons or communication overhead. In this paper, a novel general framework for distributed anomaly detection is proposed. The proposed method consists of three steps: (i) building local models for distributed data sources with unsupervised anomaly detection methods and computing quality measure of local models; (ii) transforming local unsupervised local models into sharing models; and (iii) reusing sharing models for new data and combining their results by considering both quality and diversity of them to detect anomalies in a global view. In experiments performed on synthetic and real-life large data set, the proposed distributed anomaly detection method achieved prediction performance comparable or even slightly better than the global anomaly detection algorithm applied on the data set obtained when all distributed data set were merged.

Original language	English (US)
Pages (from-to)	935-957
Number of pages	23
Journal	International Journal of Information Technology and Decision Making
Volume	9
Issue number	6
DOIs	https://doi.org/10.1142/S0219622010004172
State	Published - Nov 2010

Bibliographical note

Funding Information:
The work was supported by NASA under award NNX08AC36A, by Natural Science Foundation of China (Nos. 60973120 and 60903073) and by the National High-Tech Research and Development Plan of China under Grant No. 2007AA01Z440.

Keywords

Distributed anomaly detection
combining models
global anomalies

Access

10.1142/S0219622010004172

OpenUrl availability

Full text

Cite this

@article{5d4bdde6c06f419c839382bdeb459044,

title = "Unsupervised learning based distributed detection of global anomalies",

abstract = "Anomaly detection has recently become an important problem in many industrial and financial applications. Very often, the databases from which anomalies have to be found are located at multiple local sites and cannot be merged due to privacy reasons or communication overhead. In this paper, a novel general framework for distributed anomaly detection is proposed. The proposed method consists of three steps: (i) building local models for distributed data sources with unsupervised anomaly detection methods and computing quality measure of local models; (ii) transforming local unsupervised local models into sharing models; and (iii) reusing sharing models for new data and combining their results by considering both quality and diversity of them to detect anomalies in a global view. In experiments performed on synthetic and real-life large data set, the proposed distributed anomaly detection method achieved prediction performance comparable or even slightly better than the global anomaly detection algorithm applied on the data set obtained when all distributed data set were merged.",

keywords = "Distributed anomaly detection, combining models, global anomalies",

author = "Junlin Zhou and Aleksandar Lazarevic and Hsu, {Kuo Wei} and Jaideep Srivastava and Yan Fu and Yue Wu",

note = "Funding Information: The work was supported by NASA under award NNX08AC36A, by Natural Science Foundation of China (Nos. 60973120 and 60903073) and by the National High-Tech Research and Development Plan of China under Grant No. 2007AA01Z440.",

year = "2010",

month = nov,

doi = "10.1142/S0219622010004172",

language = "English (US)",

volume = "9",

pages = "935--957",

journal = "International Journal of Information Technology and Decision Making",

issn = "0219-6220",

publisher = "World Scientific Publishing Co. Pte Ltd",

number = "6",

}

TY - JOUR

T1 - Unsupervised learning based distributed detection of global anomalies

AU - Zhou, Junlin

AU - Lazarevic, Aleksandar

AU - Hsu, Kuo Wei

AU - Srivastava, Jaideep

AU - Fu, Yan

AU - Wu, Yue

N1 - Funding Information: The work was supported by NASA under award NNX08AC36A, by Natural Science Foundation of China (Nos. 60973120 and 60903073) and by the National High-Tech Research and Development Plan of China under Grant No. 2007AA01Z440.

PY - 2010/11

Y1 - 2010/11

N2 - Anomaly detection has recently become an important problem in many industrial and financial applications. Very often, the databases from which anomalies have to be found are located at multiple local sites and cannot be merged due to privacy reasons or communication overhead. In this paper, a novel general framework for distributed anomaly detection is proposed. The proposed method consists of three steps: (i) building local models for distributed data sources with unsupervised anomaly detection methods and computing quality measure of local models; (ii) transforming local unsupervised local models into sharing models; and (iii) reusing sharing models for new data and combining their results by considering both quality and diversity of them to detect anomalies in a global view. In experiments performed on synthetic and real-life large data set, the proposed distributed anomaly detection method achieved prediction performance comparable or even slightly better than the global anomaly detection algorithm applied on the data set obtained when all distributed data set were merged.

AB - Anomaly detection has recently become an important problem in many industrial and financial applications. Very often, the databases from which anomalies have to be found are located at multiple local sites and cannot be merged due to privacy reasons or communication overhead. In this paper, a novel general framework for distributed anomaly detection is proposed. The proposed method consists of three steps: (i) building local models for distributed data sources with unsupervised anomaly detection methods and computing quality measure of local models; (ii) transforming local unsupervised local models into sharing models; and (iii) reusing sharing models for new data and combining their results by considering both quality and diversity of them to detect anomalies in a global view. In experiments performed on synthetic and real-life large data set, the proposed distributed anomaly detection method achieved prediction performance comparable or even slightly better than the global anomaly detection algorithm applied on the data set obtained when all distributed data set were merged.

KW - Distributed anomaly detection

KW - combining models

KW - global anomalies

UR - http://www.scopus.com/inward/record.url?scp=78149346488&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=78149346488&partnerID=8YFLogxK

U2 - 10.1142/S0219622010004172

DO - 10.1142/S0219622010004172

M3 - Article

AN - SCOPUS:78149346488

SN - 0219-6220

VL - 9

SP - 935

EP - 957

JO - International Journal of Information Technology and Decision Making

JF - International Journal of Information Technology and Decision Making

IS - 6

ER -

Unsupervised learning based distributed detection of global anomalies

Abstract

Bibliographical note

Keywords

Access

OpenUrl availability

Other files and links

Fingerprint

Cite this