Community detection with prior knowledge

Karthik Subbian; Charu C. Aggarwal; Jaideep Srivastava; Philip S. Yu

doi:10.1137/1.9781611972832.45

Community detection with prior knowledge

Karthik Subbian, Charu C. Aggarwal, Jaideep Srivastava, Philip S. Yu

Computer Science and Engineering

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

8 Scopus citations

Abstract

The problem of community detection is a challenging one because of the presence of hubs and noisy links, which tend to create highly imbalanced graph clusters. Often, these resulting clusters are not very intuitive and difficult to interpret. With the growing availability of network information, there is a significant amount of prior knowledge available about the communities in social, communication and several other networks. These community labels may be noisy and very limited, though they do help in community detection. In this paper, we explore the use of such noisy labeled information for finding high quality communities. We will present an adaptive density-based clustering which allows flexible incorporation of prior knowledge in to the community detection process. We use a random walk framework to compute the node densities and the level of supervision regulates the node densities and the quality of resulting density based clusters. Our framework is general enough to produce both overlapping and non-overlapping clusters. We empirically show that even with a tiny amount of supervision, our approach can produce superior communities compared to popular baselines.

Original language	English (US)
Title of host publication	Proceedings of the 2013 SIAM International Conference on Data Mining, SDM 2013
Editors	Joydeep Ghosh, Zoran Obradovic, Jennifer Dy, Zhi-Hua Zhou, Chandrika Kamath, Srinivasan Parthasarathy
Publisher	Siam Society
Pages	405-413
Number of pages	9
ISBN (Electronic)	9781611972627
DOIs	https://doi.org/10.1137/1.9781611972832.45
State	Published - 2013
Event	SIAM International Conference on Data Mining, SDM 2013 - Austin, United States Duration: May 2 2013 → May 4 2013

Publication series

Name	Proceedings of the 2013 SIAM International Conference on Data Mining, SDM 2013

Other

Other	SIAM International Conference on Data Mining, SDM 2013
Country/Territory	United States
City	Austin
Period	5/2/13 → 5/4/13

Bibliographical note

Publisher Copyright:
Copyright © SIAM.

Keywords

Clusters
Communities
Supervision

Access

10.1137/1.9781611972832.45

OpenUrl availability

Full text

Cite this

Subbian, K., Aggarwal, C. C., Srivastava, J., & Yu, P. S. (2013). Community detection with prior knowledge. In J. Ghosh, Z. Obradovic, J. Dy, Z.-H. Zhou, C. Kamath, & S. Parthasarathy (Eds.), Proceedings of the 2013 SIAM International Conference on Data Mining, SDM 2013 (pp. 405-413). (Proceedings of the 2013 SIAM International Conference on Data Mining, SDM 2013). Siam Society. https://doi.org/10.1137/1.9781611972832.45

Community detection with prior knowledge. / Subbian, Karthik; Aggarwal, Charu C.; Srivastava, Jaideep et al.
Proceedings of the 2013 SIAM International Conference on Data Mining, SDM 2013. ed. / Joydeep Ghosh; Zoran Obradovic; Jennifer Dy; Zhi-Hua Zhou; Chandrika Kamath; Srinivasan Parthasarathy. Siam Society, 2013. p. 405-413 (Proceedings of the 2013 SIAM International Conference on Data Mining, SDM 2013).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Subbian, K, Aggarwal, CC, Srivastava, J & Yu, PS 2013, Community detection with prior knowledge. in J Ghosh, Z Obradovic, J Dy, Z-H Zhou, C Kamath & S Parthasarathy (eds), Proceedings of the 2013 SIAM International Conference on Data Mining, SDM 2013. Proceedings of the 2013 SIAM International Conference on Data Mining, SDM 2013, Siam Society, pp. 405-413, SIAM International Conference on Data Mining, SDM 2013, Austin, United States, 5/2/13. https://doi.org/10.1137/1.9781611972832.45

Subbian K, Aggarwal CC, Srivastava J, Yu PS. Community detection with prior knowledge. In Ghosh J, Obradovic Z, Dy J, Zhou ZH, Kamath C, Parthasarathy S, editors, Proceedings of the 2013 SIAM International Conference on Data Mining, SDM 2013. Siam Society. 2013. p. 405-413. (Proceedings of the 2013 SIAM International Conference on Data Mining, SDM 2013). doi: 10.1137/1.9781611972832.45

Subbian, Karthik ; Aggarwal, Charu C. ; Srivastava, Jaideep et al. / Community detection with prior knowledge. Proceedings of the 2013 SIAM International Conference on Data Mining, SDM 2013. editor / Joydeep Ghosh ; Zoran Obradovic ; Jennifer Dy ; Zhi-Hua Zhou ; Chandrika Kamath ; Srinivasan Parthasarathy. Siam Society, 2013. pp. 405-413 (Proceedings of the 2013 SIAM International Conference on Data Mining, SDM 2013).

@inproceedings{eaba72cd03b443daa1b09f36747f9cb9,

title = "Community detection with prior knowledge",

abstract = "The problem of community detection is a challenging one because of the presence of hubs and noisy links, which tend to create highly imbalanced graph clusters. Often, these resulting clusters are not very intuitive and difficult to interpret. With the growing availability of network information, there is a significant amount of prior knowledge available about the communities in social, communication and several other networks. These community labels may be noisy and very limited, though they do help in community detection. In this paper, we explore the use of such noisy labeled information for finding high quality communities. We will present an adaptive density-based clustering which allows flexible incorporation of prior knowledge in to the community detection process. We use a random walk framework to compute the node densities and the level of supervision regulates the node densities and the quality of resulting density based clusters. Our framework is general enough to produce both overlapping and non-overlapping clusters. We empirically show that even with a tiny amount of supervision, our approach can produce superior communities compared to popular baselines.",

keywords = "Clusters, Communities, Supervision",

author = "Karthik Subbian and Aggarwal, {Charu C.} and Jaideep Srivastava and Yu, {Philip S.}",

note = "Publisher Copyright: Copyright {\textcopyright} SIAM.; SIAM International Conference on Data Mining, SDM 2013 ; Conference date: 02-05-2013 Through 04-05-2013",

year = "2013",

doi = "10.1137/1.9781611972832.45",

language = "English (US)",

series = "Proceedings of the 2013 SIAM International Conference on Data Mining, SDM 2013",

publisher = "Siam Society",

pages = "405--413",

editor = "Joydeep Ghosh and Zoran Obradovic and Jennifer Dy and Zhi-Hua Zhou and Chandrika Kamath and Srinivasan Parthasarathy",

booktitle = "Proceedings of the 2013 SIAM International Conference on Data Mining, SDM 2013",

}

TY - GEN

T1 - Community detection with prior knowledge

AU - Subbian, Karthik

AU - Aggarwal, Charu C.

AU - Srivastava, Jaideep

AU - Yu, Philip S.

PY - 2013

Y1 - 2013

N2 - The problem of community detection is a challenging one because of the presence of hubs and noisy links, which tend to create highly imbalanced graph clusters. Often, these resulting clusters are not very intuitive and difficult to interpret. With the growing availability of network information, there is a significant amount of prior knowledge available about the communities in social, communication and several other networks. These community labels may be noisy and very limited, though they do help in community detection. In this paper, we explore the use of such noisy labeled information for finding high quality communities. We will present an adaptive density-based clustering which allows flexible incorporation of prior knowledge in to the community detection process. We use a random walk framework to compute the node densities and the level of supervision regulates the node densities and the quality of resulting density based clusters. Our framework is general enough to produce both overlapping and non-overlapping clusters. We empirically show that even with a tiny amount of supervision, our approach can produce superior communities compared to popular baselines.

AB - The problem of community detection is a challenging one because of the presence of hubs and noisy links, which tend to create highly imbalanced graph clusters. Often, these resulting clusters are not very intuitive and difficult to interpret. With the growing availability of network information, there is a significant amount of prior knowledge available about the communities in social, communication and several other networks. These community labels may be noisy and very limited, though they do help in community detection. In this paper, we explore the use of such noisy labeled information for finding high quality communities. We will present an adaptive density-based clustering which allows flexible incorporation of prior knowledge in to the community detection process. We use a random walk framework to compute the node densities and the level of supervision regulates the node densities and the quality of resulting density based clusters. Our framework is general enough to produce both overlapping and non-overlapping clusters. We empirically show that even with a tiny amount of supervision, our approach can produce superior communities compared to popular baselines.

KW - Clusters

KW - Communities

KW - Supervision

UR - http://www.scopus.com/inward/record.url?scp=84937601996&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84937601996&partnerID=8YFLogxK

U2 - 10.1137/1.9781611972832.45

DO - 10.1137/1.9781611972832.45

M3 - Conference contribution

AN - SCOPUS:84937601996

T3 - Proceedings of the 2013 SIAM International Conference on Data Mining, SDM 2013

SP - 405

EP - 413

BT - Proceedings of the 2013 SIAM International Conference on Data Mining, SDM 2013

A2 - Ghosh, Joydeep

A2 - Obradovic, Zoran

A2 - Dy, Jennifer

A2 - Zhou, Zhi-Hua

A2 - Kamath, Chandrika

A2 - Parthasarathy, Srinivasan

PB - Siam Society

T2 - SIAM International Conference on Data Mining, SDM 2013

Y2 - 2 May 2013 through 4 May 2013

ER -

Community detection with prior knowledge

Abstract

Publication series

Other

Bibliographical note

Keywords

Access

OpenUrl availability

Other files and links

Fingerprint

Cite this