Abstract
The problem of clustering of multivariate random data is considered in presence of outliers. The hypothetical model of data is described by a mixture of regular m-parametric probability densities. Clustering of data is made by the often used in practice decision rule which is derived by substitution of ML-estimators (on the unclassified sample) of parameters for their unknown true values in Bayesian decision rule. Robustness of probability of classification error is evaluated. The new clustering algorithm with smoothing is presented. Illustration for the case of the Gaussian hypothetical model and for the Fisher's data under outliers is given.
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Huber, P.: Robust statistics. John Wiley and Sons, N.Y., (1981)
Hampel, F. et al.: Robust statistics. John Wiley and Sons, N.Y., (1986)
Rieder, H. (Ed.): Robust statistics, data analysis and computer intensive metods. Lecture Notes in Statistics 109 (1996)
Bock, H.: Probabilistic aspects in cluster analysis. Proc. 13th Conf. of Classif, Society, Springer-Verlag, N.Y. (1989) 12–44
Aivazyan, S. et al.: Applied Statistics, v.3. Fin. i Stat., Moscow (1989)
Kharin, Yu.: Robustness in statistical pattern recognition, Kluwer Academic Publishers, Dordrecht (1996)
McLachlan, G. et al.: Mixture Models: inference and applications to clustering. Marcel Dekker, N.Y. (1988)
Kharin, Yu., Zhuk, E.: Asymptotic robustness in cluster analysis for Tukey-Huber distortions. Information and Classification. Springer-Verlag, N.Y. (1993) 31–39
Chibisov, D.: Asymptotic expansion for a class of estimators including ML-estimators. Prob. Theory and its Appl. 18 (1973) 303–311
Fisher, R.: The use of multiple measurements in taxonomic problems. Ann. Eugen. 7 (1936) 179–188
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1997 Springer-Verlag
About this paper
Cite this paper
Kharin, Y. (1997). Robustness of clustering under outliers. In: Liu, X., Cohen, P., Berthold, M. (eds) Advances in Intelligent Data Analysis Reasoning about Data. IDA 1997. Lecture Notes in Computer Science, vol 1280. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0052866
Download citation
DOI: https://doi.org/10.1007/BFb0052866
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-63346-4
Online ISBN: 978-3-540-69520-2
eBook Packages: Springer Book Archive