Abstract
This paper addresses the problem of merging pairs of partition matrices. Such partition matrices may be produced by collaborative clustering. We assume that each subset in one partition matrix matches one of the subsets in the other partition matrix. To align the arbitrarily ordered rows in the partition matrices we use the memberships of a set of anchor points and maximize their pairwise similarities. Here, we consider various set-theoretic similarity measures. Experiments with a simplified version of the well-known BIRCH benchmark data set illustrate the effectivity of the approach and show that all considered similarity measures are well suited for partition merging.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Anderson, D.T., Bezdek, J.C., Popescu, M., Keller, J.M.: Comparing fuzzy, probabilistic, and possibilistic partitions. IEEE Trans. Fuzzy Syst. 18(5), 906–918 (2010)
Bezdek, J.C.: Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum Press, New York (1981)
Dubois, D., Prade, H.: Fuzzy Sets and Systems. Academic Press, London (1980)
Enta, Y.: Fuzzy decision theory. In: Int. Congress on Applied Systems Research and Cybernetics, Acapulco, Mexico, pp. 2980–2990 (1980)
Geva, A.B.: Hierarchical unsupervised fuzzy clustering. IEEE Trans. Fuzzy Syst. 7(6), 723–733 (1999)
Gregson, R.M.: Psychometrics of Similarity. Academic Press, New York (1975)
Hore, P., Hall, L.O.: Scalable clustering: a distributed approach. In: IEEE Int. Conf. on Fuzzy Syst., Budapest, Hungary, vol. 1, pp. 143–148 (2004)
Hüllermeier, E., Rifqi, M.: A fuzzy variant of the Rand index for comparing clustering structures. In: Joint IFSA World Congress and EUSFLAT Conference, Lisbon, Portugal, pp. 1294–1298 (2009)
Hüllermeier, E., Rifqi, M., Henzgen, S., Senge, R.: Comparing fuzzy partitions: A generalization of the Rand index and related measures. IEEE Trans. Fuzzy Syst. 20(3), 546–556 (2012)
Jaccard, P.: Étude comparative de la distribution florale dans une portion des alpes et des jura. Bulletin de la Société Vaudoise des Sciences Naturelles 37, 547–579 (1901)
Kaymak, U., Babuška, R.: Compatible cluster merging for fuzzy modelling. In: IEEE Int. Conf. on Fuzzy Syst., Yokohama, pp. 897–904 (1995)
Krishnapuram, R., Freg, C.P.: Fitting an unknown number of lines and planes to image data through compatible cluster merging. Pattern Recogn. 25(4), 385–400 (1992)
Pedrycz, W.: Collaborative fuzzy clustering. Pattern Recogn. Lett. 23, 1675–1686 (2002)
Pedrycz, W.: Collaborative and knowledge-based fuzzy clustering. Int. J. Innov. Comput. I 3(1), 1–12 (2007)
Rand, W.M.: Objective criteria for the evaluation of clustering methods. J. Am. Stat. Assoc. 66(336), 846–850 (1971)
Restle, F.: A metric and an ordering on sets. Psychometrica 24, 207–220 (1959)
Runkler, T.A.: Comparing Partitions by Subset Similarities. In: Hüllermeier, E., Kruse, R., Hoffmann, F. (eds.) IPMU 2010. LNCS, vol. 6178, pp. 29–38. Springer, Heidelberg (2010)
Runkler, T.A., Bezdek, J.C., Hall, L.O.: Clustering very large data sets: The complexity of the fuzzy c-means algorithm. In: European Symposium on Intelligent Technologies, Hybrid Systems and Their Implementation on Smart Adaptive Systems (eunite), Albufeira, pp. 420–425 (2002)
Schweizer, B., Sklar, A.: Associative functions and statistical triangle inequalities. Publ. Math–Debrecen. 8, 169–186 (1961)
Zhang, T., Ramakrishnan, R., Livny, M.: BIRCH: An efficient data clustering method for very large databases. In: ACM SIGMOD Int. Conf. on Management of Data, pp. 103–114 (1996)
Zwick, R., Carlstein, E., Budescu, D.V.: Measures of similarity among fuzzy concepts: A comparative analysis. Int. J. Approx. Reason. 1, 221–242 (1987)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Runkler, T.A. (2013). Merging Partitions Using Similarities of Anchor Subsets. In: Kruse, R., Berthold, M., Moewes, C., Gil, M., Grzegorzewski, P., Hryniewicz, O. (eds) Synergies of Soft Computing and Statistics for Intelligent Data Analysis. Advances in Intelligent Systems and Computing, vol 190. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33042-1_61
Download citation
DOI: https://doi.org/10.1007/978-3-642-33042-1_61
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33041-4
Online ISBN: 978-3-642-33042-1
eBook Packages: EngineeringEngineering (R0)