Farthest centroids divisive clustering

Haw Ren Fang; Yousef Saad

doi:10.1109/ICMLA.2008.141

Computer Science and Engineering

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

12 Scopus citations

Abstract

A method is presented to partition a given set of data entries embedded in Euclidean space by recursively bisecting clusters into smaller ones. The initial set is subdivided into two subsets whose centroids are farthest from each other, and the process is repeated recursively on each subset. An approximate algorithm is proposed to solve the original integer programming problem which is NP-hard. Experimental evidence shows that the clustering method often outperforms a standard spectral clustering method, albeit at a slightly higher computational cost. The paper also discusses improvements of the standard K-means algorithm. Specifically, the clustering quality resulting from the K-means technique can be significantly enhanced by using the proposed algorithm for its initialization.
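
The recursive bisection described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the approximate algorithm, so the split below is a stand-in heuristic and not the authors' method: it picks two far-apart seed points and assigns every point to the nearer seed. The function names (`centroid`, `scatter`, `bisect`, `divisive_clustering`) and the rule of splitting the cluster with the largest scatter first are assumptions made for illustration only.

```python
from math import dist


def centroid(points):
    """Component-wise mean of a non-empty list of equal-length tuples."""
    n = len(points)
    return tuple(sum(coords) / n for coords in zip(*points))


def scatter(points):
    """Sum of squared distances from each point to the cluster centroid."""
    c = centroid(points)
    return sum(dist(p, c) ** 2 for p in points)


def bisect(points):
    """Split points into two subsets around two far-apart seeds.

    Assumption: a two-pass farthest-point search stands in for the paper's
    approximate algorithm, which the abstract does not describe.
    """
    c = centroid(points)
    a = max(points, key=lambda p: dist(p, c))  # farthest from the centroid
    b = max(points, key=lambda p: dist(p, a))  # farthest from the first seed
    left = [p for p in points if dist(p, a) <= dist(p, b)]
    right = [p for p in points if dist(p, a) > dist(p, b)]
    return left, right


def divisive_clustering(points, k):
    """Bisect repeatedly until k clusters exist or no further split is possible."""
    clusters = [list(points)]
    while len(clusters) < k:
        # Assumption: the cluster with the largest scatter is split next.
        i = max(range(len(clusters)), key=lambda j: scatter(clusters[j]))
        left, right = bisect(clusters[i])
        if not left or not right:  # degenerate cluster, e.g. identical points
            break
        clusters[i:i + 1] = [left, right]
    return clusters


# Usage: the cluster centroids can serve as starting centers for K-means,
# which is the initialization idea mentioned in the abstract.
data = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
seeds = [centroid(c) for c in divisive_clustering(data, 2)]
```

Passing `seeds` as the initial centers of any K-means implementation reflects the initialization use case mentioned in the abstract, although the quality gains reported there apply to the authors' algorithm and not to this simplified heuristic.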

Original language	English (US)
Title of host publication	Proceedings - 7th International Conference on Machine Learning and Applications, ICMLA 2008
Pages	232-238
Number of pages	7
DOIs	https://doi.org/10.1109/ICMLA.2008.141
State	Published - 2008
Event	7th International Conference on Machine Learning and Applications, ICMLA 2008 - San Diego, CA, United States; Duration: Dec 11 2008 → Dec 13 2008

Publication series

Name	Proceedings - 7th International Conference on Machine Learning and Applications, ICMLA 2008

Other

Other	7th International Conference on Machine Learning and Applications, ICMLA 2008
Country/Territory	United States
City	San Diego, CA
Period	12/11/08 → 12/13/08

Bibliographical note

Funding Information:
Acknowledgments. This work was done in the context of the CHIST-ERA CAMOMILE project funded by the ANR (Agence Nationale de la Recherche, France) and the FNR (Fonds National de la Recherche, Luxembourg).

Cite this

Farthest centroids divisive clustering. / Fang, Haw Ren; Saad, Yousef.
Proceedings - 7th International Conference on Machine Learning and Applications, ICMLA 2008. 2008. p. 232-238, 4724980 (Proceedings - 7th International Conference on Machine Learning and Applications, ICMLA 2008).

Fang, HR & Saad, Y 2008, Farthest centroids divisive clustering. in Proceedings - 7th International Conference on Machine Learning and Applications, ICMLA 2008, 4724980, Proceedings - 7th International Conference on Machine Learning and Applications, ICMLA 2008, pp. 232-238, 7th International Conference on Machine Learning and Applications, ICMLA 2008, San Diego, CA, United States, 12/11/08. https://doi.org/10.1109/ICMLA.2008.141

@inproceedings{0b9cce5d32e84c5bbb5d7131e1b3f4de,

title = "Farthest centroids divisive clustering",

abstract = "A method is presented to partition a given set of data entries embedded in Euclidean space by recursively bisecting clusters into smaller ones. The initial set is subdivided into two subsets whose centroids are farthest from each other, and the process is repeated recursively on each subset. An approximate algorithm is proposed to solve the original integer programming problem which is NP-hard. Experimental evidence shows that the clustering method often outperforms a standard spectral clustering method, albeit at a slightly higher computational cost. The paper also discusses improvements of the standard K-means algorithm. Specifically, the clustering quality resulting from the K-means technique can be significantly enhanced by using the proposed algorithm for its initialization.",

author = "Fang, {Haw Ren} and Saad, Yousef",

note = "Funding Information: Acknowledgments. This work was done in the context of the CHIST-ERA CAMOMILE project funded by the ANR (Agence Nationale de la Recherche, France) and the FNR (Fonds National de la Recherche, Luxembourg).; 7th International Conference on Machine Learning and Applications, ICMLA 2008 ; Conference date: 11-12-2008 Through 13-12-2008",

year = "2008",

doi = "10.1109/ICMLA.2008.141",

language = "English (US)",

isbn = "9780769534954",

series = "Proceedings - 7th International Conference on Machine Learning and Applications, ICMLA 2008",

pages = "232--238",

booktitle = "Proceedings - 7th International Conference on Machine Learning and Applications, ICMLA 2008",

}

TY - GEN

T1 - Farthest centroids divisive clustering

AU - Fang, Haw Ren

AU - Saad, Yousef

N1 - Funding Information: Acknowledgments. This work was done in the context of the CHIST-ERA CAMOMILE project funded by the ANR (Agence Nationale de la Recherche, France) and the FNR (Fonds National de la Recherche, Luxembourg).

PY - 2008

Y1 - 2008

N2 - A method is presented to partition a given set of data entries embedded in Euclidean space by recursively bisecting clusters into smaller ones. The initial set is subdivided into two subsets whose centroids are farthest from each other, and the process is repeated recursively on each subset. An approximate algorithm is proposed to solve the original integer programming problem which is NP-hard. Experimental evidence shows that the clustering method often outperforms a standard spectral clustering method, albeit at a slightly higher computational cost. The paper also discusses improvements of the standard K-means algorithm. Specifically, the clustering quality resulting from the K-means technique can be significantly enhanced by using the proposed algorithm for its initialization.

AB - A method is presented to partition a given set of data entries embedded in Euclidean space by recursively bisecting clusters into smaller ones. The initial set is subdivided into two subsets whose centroids are farthest from each other, and the process is repeated recursively on each subset. An approximate algorithm is proposed to solve the original integer programming problem which is NP-hard. Experimental evidence shows that the clustering method often outperforms a standard spectral clustering method, albeit at a slightly higher computational cost. The paper also discusses improvements of the standard K-means algorithm. Specifically, the clustering quality resulting from the K-means technique can be significantly enhanced by using the proposed algorithm for its initialization.

UR - http://www.scopus.com/inward/record.url?scp=60649118355&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=60649118355&partnerID=8YFLogxK

U2 - 10.1109/ICMLA.2008.141

DO - 10.1109/ICMLA.2008.141

M3 - Conference contribution

AN - SCOPUS:60649118355

SN - 9780769534954

T3 - Proceedings - 7th International Conference on Machine Learning and Applications, ICMLA 2008

SP - 232

EP - 238

BT - Proceedings - 7th International Conference on Machine Learning and Applications, ICMLA 2008

T2 - 7th International Conference on Machine Learning and Applications, ICMLA 2008

Y2 - 11 December 2008 through 13 December 2008

ER -
