ScienceDirect® Home Skip Main Navigation Links
You have guest access to ScienceDirect. Find out more.
 
Home
Browse
My Settings
Alerts
Help
 Quick Search
 Search tips (Opens new window)
    Clear all fields    
advertisementadvertisement
Data & Knowledge Engineering
Volume 53, Issue 3, June 2005, Pages 243-262
 
Font Size: Decrease Font Size  Increase Font Size
 Abstract - selected
Article
Purchase PDF (666 K)

 
 
 
Related Articles in ScienceDirect
View More Related Articles
 
View Record in Scopus
 
doi:10.1016/j.datak.2004.09.001    How to Cite or Link Using DOI (Opens New Window)
Copyright © 2004 Elsevier B.V. All rights reserved.

Categorical data visualization and clustering using subjective factors

Chia-Hui ChangCorresponding Author Contact Information, E-mail The Corresponding Author and Zhi-Kai DingE-mail The Corresponding Author

Department of Computer Science and Information Engineering, National Central University, No. 300, Jhungda Road, Jhungli City, Taoyuan 320, Taiwan

Received 3 April 2004; 
accepted 1 September 2004. 
Available online 30 September 2004.

Purchase the full-text article



References and further reading may be available for this article. To view references and further reading you must purchase this article.

Abstract

Clustering is an important data mining problem. However, most earlier work on clustering focused on numeric attributes which have a natural ordering to their attribute values. Recently, clustering data with categorical attributes, whose attribute values do not have a natural ordering, has received more attention. A common issue in cluster analysis is that there is no single correct answer to the number of clusters, since cluster analysis involves human subjective judgement. Interactive visualization is one of the methods where users can decide a proper clustering parameters. In this paper, a new clustering approach called CDCS (Categorical Data Clustering with Subjective factors) is introduced, where a visualization tool for clustered categorical data is developed such that the result of adjusting parameters is instantly reflected. The experiment shows that CDCS generates high quality clusters compared to other typical algorithms.

Keywords: Data mining; Cluster analysis; Categorical data; Cluster visualization

Article Outline

1. Introduction
2. Related work
2.1. Categorical data clustering
2.2. Visualization methods
3. Categorical data clustering and visualization
3.1. Proximity measure for categorical data
3.2. Group merging
4. Visualization with CDCS
4.1. Principle of visualization
4.2. Building a coordinate system
4.3. Interactive visualization and analysis
5. Cluster validation
5.1. Clustering quality
5.2. Discussion on cluster numbers
5.3. Parameter setting for dynamic clustering
6. Conclusion and future work
Acknowledgements
References
Vitae











 
Home
Browse
My Settings
Alerts
Help
Elsevier.com (Opens new window)
About ScienceDirect  |  Contact Us  |  Information for Advertisers  |  Terms & Conditions  |  Privacy Policy
Copyright © 2008 Elsevier B.V. All rights reserved. ScienceDirect® is a registered trademark of Elsevier B.V.