ScienceDirect® Home Skip Main Navigation Links
You have guest access to ScienceDirect. Find out more.
 
Home
Browse
My Settings
Alerts
Help
 Quick Search
 Search tips (Opens new window)
    Clear all fields    
advertisementadvertisement
Information Fusion
Volume 7, Issue 3, September 2006, Pages 264-275
 
Font Size: Decrease Font Size  Increase Font Size
 Abstract - selected
Article
Purchase PDF (331 K)

 
 
 
Related Articles in ScienceDirect
View More Related Articles
 
View Record in Scopus
 
doi:10.1016/j.inffus.2005.01.008    How to Cite or Link Using DOI (Opens New Window)
Copyright © 2005 Elsevier B.V. All rights reserved.

Moderate diversity for better cluster ensembles

Stefan T. Hadjitodorova, Corresponding Author Contact Information, E-mail The Corresponding Author, Ludmila I. Kunchevab and Ludmila P. Todorovaa

aCenter for Biomedical Engineering (CLBME), Bulgarian Academy of Sciences, “Acad G. Bonchev” Str., block 105, Sofia 1113, Bulgaria bSchool of Informatics, University of Wales—Bangor, Bangor, Gwynedd LL57 1UT, United Kingdom

Received 21 September 2004; 
revised 29 January 2005; 
accepted 29 January 2005. 
Available online 3 March 2005.

Purchase the full-text article



References and further reading may be available for this article. To view references and further reading you must purchase this article.

Abstract

Adjusted Rand index is used to measure diversity in cluster ensembles and a diversity measure is subsequently proposed. Although the measure was found to be related to the quality of the ensemble, this relationship appeared to be non-monotonic. In some cases, ensembles which exhibited a moderate level of diversity gave a more accurate clustering. Based on this, a procedure for building a cluster ensemble of a chosen type is proposed (assuming that an ensemble relies on one or more random parameters): generate a small random population of cluster ensembles, calculate the diversity of each ensemble and select the ensemble corresponding to the median diversity. We demonstrate the advantages of both our measure and procedure on 5 data sets and carry out statistical comparisons involving two diversity measures for cluster ensembles from the recent literature. An experiment with 9 data sets was also carried out to examine how the diversity-based selection procedure fares on ensembles of various sizes. For these experiments the classification accuracy was used as the performance criterion. The results suggest that selection by median diversity is no worse and in some cases is better than building and holding on to one ensemble.

Keywords: Pattern recognition; Machine learning; Multiple classifiers; Cluster ensembles; Diversity measures, Adjusted Rand index

Article Outline

1. Introduction
2. Cluster ensembles
3. Diversity measures for cluster ensembles
4. Experiments
5. Relationship between diversity-selection procedure and the ensemble size
6. Conclusions
Acknowledgements
References








Information Fusion
Volume 7, Issue 3, September 2006, Pages 264-275
 
Home
Browse
My Settings
Alerts
Help
Elsevier.com (Opens new window)
About ScienceDirect  |  Contact Us  |  Information for Advertisers  |  Terms & Conditions  |  Privacy Policy
Copyright © 2008 Elsevier B.V. All rights reserved. ScienceDirect® is a registered trademark of Elsevier B.V.