Abstract
Directed binary hierarchies have been introduced in order to give a graphical reduced representation of a family of association rules. This type of structure extends the classical binary hierarchical classification in a very specific way. In this paper an accurate formalization of this new structure is studied. A directed hierarchy is defined as a set of ordered pairs of subsets of the initial individual set satisfying specific conditions. A new notion of directed ultrametricity is studied. The main result consists in establishing a bijective correspondence between a directed ultrametric space and a directed binary hierarchy. Finally, an algorithm is proposed in order to transform a directed ultrametric structure into a graphical representation associated with a directed binary hierarchy.
Similar content being viewed by others
References
AGRAWAL, R., IMIELINSKY, T., and SWAMI, A. (1993), “Mining Association Rules Between Sets of Items in Large Databases”, in Proceedings of the ACM SIGMOD’93, pp. 207–216.
BENZÉCRI, J.P. (1973), L’Analyse des Données : La Taxinomie (Vol. 1), Paris: Dunod.
DURAND, C., and FICHET, B. (1988), “One to One Correspondence in Pyramidal Representation: An Unified Approach”, in Classification and Related Methods of Data Analysis, ed. H.H. Bock, Amsterdam: North-Holland, pp. 85–90.
GORDON, A.D. (1999), Classification, London: Chapman & Hall.
GRAS, R. (1979), Contribution à L’étude Expérimentale et L’analyse de Certaines Acquisitions Cognitives et de Certains Objectifs Didactiques en Mathématiques, Doctorat d’État, PhD thesis, Université de Rennes 1.
GRAS, R.,AG ALMOULOUD, S., BAILLEUL,M., LARHER, A.A., POLO,M., RATSIMBARAJOHN, H., and TOTOHASINA, A. (1996), L’implication Statistique - Nouvelle Méthode Exploratoire de Données, France: La Pensée Sauvage.
GRAS, R., GUILLET, F., SPAGNOLO, F., and SUZUKI, E. (eds.) (2008), Statistical Implicative Analysis, Studies in Computational Intelligence (Vol. 127), Berlin Heidelberg: Springer.
GRAS, R., and KUNTZ, P. (2005), “Discovering R-rules with a Directed Hierarchy”, Soft Computing, 10, 453–460.
GRAS, R., KUNTZ, P., and RÉGNIER, J-C. (2004), “Significativité des Niveaux d’une Hiérarchie Orientée en Analyse Statistique Implicative”, in 11-èmes Rencontres de la Société Francophone de Classification, ed. Université de Bordeaux, pp. 39–50.
GUILLET, F., and HAMILTON, H.J. (eds.) (2007), QualityMeasures in DataMining, Studies in Computational Intelligence (Vol. 43), Berlin Heidelberg: Springer.
JOHNSON, S.C. (1967), “Hierarchical Clustering Schemes”, Psychometrika, 32, 241–254.
KLEMENTTINEN,M.,MANILLA, H., RONKAINEN, P., TOIVONEN, H., and VERKAMO, A.I. (1994), “Finding Interesting Rules from Large Sets of Discovered Association Rules”, in Proceedings of the 3rd International Conference on Information and Knowledge Management, ed. ACM, pp. 401–407.
LENT, B., SWAMI, A.N., and WIDOW, J. (1997), “Clustering Association Rules”, in Proceedings of the 13th Conference on Data Engineering, IEEE Computer Society: Washington, pp. 220–231.
LERMAN, I-C. (1970), Les Bases de la Classification Automatique, Paris: Gauthier-Villars.
LERMAN, I-C. (1981), Classification et Analyse Ordinale des Données, Paris: Dunod.
LERMAN, I-C. (2007), “Sur les Différentes Expressions Formelles d’une Hiérarchie Binaire Symétrique ou Implicative”, in Rencontres de la Société Francophone de Classification, ed. SupTélécom Paris, pp. 139–142.
LERMAN, I-C. (2008), “Analyse Logique, Combinatoire et Statistique de la Construction d’une Hiérarchie Binaire Implicative; Niveaux et Noeuds Significatifs”, Mathématiques et Sciences Humaines, Mathematics and Social Sciences, 184, 47–103.
LERMAN, I-C., and AZÉ, J. (2007), “A New Probabilistic Measure of Interestingness for Association Rules, Based on the Likelihood of the Link”, in Quality Measures in Data Mining, Studies in Computational Intelligence (Vol. 43), eds. F. Guillet and H.J. Hamilton, Berlin Heidelberg: Springer, pp. 207–236.
LERMAN, I-C., GRAS, R., and ROSTAM, H. (1981), “Élaboration et Évaluation d’un Indice d’Implication pour des Données Binaires I et II”, Mathématiques et Sciences Humaines, 74–75, 5–35, 5–47.
LERMAN, I-C. (1991), “Foundations of the Likelihood Linkage Analysis (lla) Classification Method”, Stochastic Models and Data Analysis, 7, 63–76.
LOEVINGER, J. (1947), “A Systematic Approach to the Construction and Evaluation of Tests of Ability”, Psychological Monographs, 61, 1–49.
MURTAGH, F. (1984), “Counting Dendograms: A Survey”, Discrete Applied Mathematics, 7, 191–199.
TAKEUCHI, A., SAITO, T., and YADOHISA, H. (2007), “Asymmetric Agglomerative Hierarchical Clustering Algorithms and Their Evaluation”, Journal of Classification, 24, 123–143.
TOIVONEN, H., KLMENTTINEN,M., RONKAIREN, P., HATONEN, K., and MANILA, H. (1995), “Pruning and Grouping Discovered Association Rules”, in Workshop Notes of the ECML Workshop on Statistics, Machine Learning and Knowledge Discovering in Databases, Crete, Greece, pp. 47–52.
YADOHISA, H. (2002), “Formulation of Asymmetric Agglomerative Hierarchical Clustering and Graphical Representation of its Results”, Bulletin of the Computational Statistics of Japan, 15, 309–316.
ZIELMAN, B., and HEISER, W.J. (1996), “Models for Asymmetric Proximities”, British Journal of Mathematical and Statistical Psychology, 49, 127–146.
Author information
Authors and Affiliations
Corresponding author
Additional information
We are indebted to Professor Benjamin Enriquez of the University Louis Pasteur of Strasbourg for his relevant remarks and suggestions which have improved the presentation of a first version of this article.
We are also very indebted to the anonymous reviewers for their comments and suggestions which lead us to a more explicit and to a richer version.
Rights and permissions
About this article
Cite this article
Lerman, IC., Kuntz, P. Directed Binary Hierarchies and Directed Ultrametrics. J Classif 28, 272–296 (2011). https://doi.org/10.1007/s00357-011-9091-y
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00357-011-9091-y