ScienceDirect® Home Skip Main Navigation Links
You have guest access to ScienceDirect. Find out more.
 
Home
Browse
My Settings
Alerts
Help
 Quick Search
 Search tips (Opens new window)
    Clear all fields    
Computational Statistics & Data Analysis
Volume 44, Issue 4, 28 January 2004, Pages 625-638
 
Font Size: Decrease Font Size  Increase Font Size
 Abstract - selected
Article
Purchase PDF (239 K)

 
 
 
Related Articles in ScienceDirect
View More Related Articles
 
View Record in Scopus
 
doi:10.1016/S0167-9473(02)00280-3    How to Cite or Link Using DOI (Opens New Window)
Copyright © 2002 Elsevier B.V. All rights reserved.

Outlier detection in the multiple cluster setting using the minimum covariance determinant estimator

Johanna HardinCorresponding Author Contact Information, E-mail The Corresponding Author, a and David M. RockeE-mail The Corresponding Author, b

a Department of Mathematics, Pomona College, 610 N. College Ave., Claremont, CA 91711, USA b Center for Image Processing and Integrated Computing, University of California at Davis, USA

Received 1 January 2002; 
revised 1 August 2002. 
Available online 24 October 2002.

Purchase the full-text article



References and further reading may be available for this article. To view references and further reading you must purchase this article.

Abstract

Mahalanobis-type distances in which the shape matrix is derived from a consistent high-breakdown robust multivariate location and scale estimator can be used to find outlying points. Hardin and Rocke (http://www.cipic.ucdavis.edu/˜dmrocke/preprints.html) developed a new method for identifying outliers in a one-cluster setting using an F distribution. We extend the method to the multiple cluster case which gives a robust clustering method in conjunction with an outlier identification method. We provide results of the F distribution method for multiple clusters which have different sizes and shapes.

Author Keywords: Minimum covariance determinant; Robust clustering; Outlier detection

Article Outline

1. Introduction
2. Clustering methods
2.1. Robust optimization clustering
3. Robust estimators in a cluster setting
3.1. Affine equivariant estimators
3.2. Minimum covariance determinant
3.3. Estimating the MCD
4. Distance distributions
4.1. Single population distance distributions
4.2. Multiple population distance distributions
5. Results
5.1. Clean data
5.2. Contaminated data
5.2.1. Cluster outliers
5.2.2. Radial outliers
5.2.3. Diffuse outliers
5.3. Summary of results
6. Conclusion
Appendix A
References

 
Home
Browse
My Settings
Alerts
Help
Elsevier.com (Opens new window)
About ScienceDirect  |  Contact Us  |  Information for Advertisers  |  Terms & Conditions  |  Privacy Policy
Copyright © 2008 Elsevier B.V. All rights reserved. ScienceDirect® is a registered trademark of Elsevier B.V.