Elsevier

Information Sciences

Volume 180, Issue 6, 15 March 2010, Pages 803-812
Information Sciences

Aggregation of asymmetric distances in Computer Science

https://doi.org/10.1016/j.ins.2009.06.020Get rights and content

Abstract

In this paper we provide a general description of how to combine a collection (not necessarily finite) of asymmetric distances in order to obtain a single one as output. To this end we introduce the notion of asymmetric distance aggregation function that generalizes the well-known one for distance spaces given by Borsik and Doboš [J. Borsik, J. Doboš, On a product of metricspaces, Math. Slovaca 31 (1981) 193–205]. Among other results, a characterization of such functions is obtained in terms of monotony and subadditivity. Finally, we relate our results to Computer Science. In particular we show that the mathematical formalism based on complexity distances, which has been introduced by Romaguera and Schellekens [S. Romaguera, M. Schellekens, Quasi-metric properties of complexity spaces, Topol. Appl. 98 (1999) 311–322] for the complexity analysis of programs and algorithms, can be obtained as a particular case of our new framework using appropriate asymmetric aggregation distance functions.

Introduction

Over the last years there has been a growing interest in the mathematical theory of information aggregation due to its wide range of applications to practical problems. In particular, for many processes that arise in applied sciences, such as image processing, decision making, control theory, medical diagnosis or biology, it is necessary to process incoming data that comes from sources of a different nature in order to obtain a conclusion. In such processes the pieces of information are symbolized via some numerical values. As a consequence the fusion methods that are based on numerical aggregation operators play a central role in the theory of information aggregation. A wide class of techniques of aggregation impose a constraint when selecting the most suitable aggregation operator for the problem to be solved. In general, this constraint consists of considering only those operators that provide the output data with the same properties as the inputs. An example of this type of situation is given when one wants to merge distances in order to obtain a new distance. Since the notion of distance plays a distinguished role in applied research, many authors have studied in depth which operators allow one to merge a collection of distances into a single one. In fact, in 1981 Borsik and Doboš profoundly studied the general problem of merging a collection of distances (not necessarily finite) into a single one [1]. Recently, Pradera et al. have provided, in the spirit of Borsik and Doboš, a general solution to the problem of merging data represented by means of a finite family of generalized distances and pseudodistances [17], [18], [19]. Several general techniques for merging a finite number of distances into another one have been studied by Casasnovas and Roselló in [3]. Specifically, they analysed the aggregation operators given by the weighted maximum and the weighted sum, and by the weighted Euclidean norm in order to apply some of their properties to the comparison of biological sequences. A related work, by the same authors, with applications to the diagnosis problem in medicine can be found in [4]. More recently, several connections between aggregation operators and distance functions have been given in [2], [15].

Asymmetric distances and other related structures have become efficient tools in some fields of Computer Science and in Bioinformatics. Metric tools based on asymmetric distances have been introduced and developed in order to provide an efficient framework to model processes, for instance, in complexity analysis of programs and algorithms [24], [21], [7], [8], [9], [10], [20], [16] and in logic programming [11], [22], [23]. In [25], [26], a natural correspondence has been obtained between similarity measures on biological (nucleotide or protein) sequences and asymmetric distances, giving practical applications to search in DNA and protein datasets. Motivated by the work of Rosselló and Casasnovas, a version of some results given in [4], [3] has been obtained for the case of aggregating asymmetric distances through the weighted sum in [5].

In this paper we focus our attention on providing a general description of how to combine a collection (not necessarily finite) of asymmetric distances in order to obtain a single one as output. We introduce the notion of asymmetric distance aggregation function which is a generalization of the one given by Borsik and Doboš in [1]. Moreover, we extend some results proved in the former reference in the context of asymmetric distances. In particular, we prove a characterization of asymmetric distance aggregation which differs to the given one for the classical case. More concretely it is shown that the class of such functions is exactly the set of all subadditive and monotone functions. Finally we connect the results obtained in our general framework with the complexity analysis of programs and algorithms in Computer Science. Hence we show that, in many instances, the asymmetric distances used in order to quantify the efficiency gained when an algorithm is substituted by another one can be retrieved by means of a family of distinguished asymmetric distance aggregation functions.

This document is organized as follows. Section 2 is devoted to the introduction of the basics of asymmetric distances, as well as a detailed exposition of the mathematical foundations, in the sense of Romaguera and Schellekens, of complexity analysis of programs and algorithms. In Section 3, we investigate the general problem of asymmetric distance aggregation, and in order to illustrate the obtained results we also show some connections between the developed theory and Computer Science in Section 4.

Section snippets

Preliminaries

Throughout this paper we shall use the letters R,R+, Z+ and N to denote the set of real numbers, the set of nonnegative real numbers, the set of nonnegative integer numbers and the set of positive integer numbers, respectively.

In order to fix the terminology let us recall a few concepts.

In our context by an asymmetric distance (quasi-metric in [13]) on a (nonempty) set X we mean a nonnegative real-valued function d on X×X such that for all x,y,zX:

  • (i)

    d(x,y)=d(y,x)=0x=y.

  • (ii)

    d(x,z)d(x,y)+d(y,z).

    Note

Aggregation of asymmetric distances

The problem of aggregating distances has been explored in depth by Borsik and Doboš [1]. In particular they provided a characterization of those functions which combine a collection (not necessarily finite) of distances into a single one (see Lemmas 2.3 and 2.4, and Theorem 2.6 in [1]). Such functions were called metric preserving (Definition 2.1 in [1]). In this section, we extend the work of Borsik and Doboš to the asymmetric context. The presented results will be key factors in modeling the

A connection with Computer Science: complexity distances via aggregation

In this section, our aim is to show that the complexity distances employed in the mathematical formalism of complexity analysis in Computer Science introduced in Section 2 can be retrieved as a particular case of our developed theory using a distinguished family of asymmetric aggregation distance functions. To this end we will discuss two cases.

Conclusion

The problem of how to combine a (not necessarily finite) collection of distances in order to obtain a single one as output has been studied in depth by many authors. Prompted by the fact that the mathematical structures used to analyse the complexity of algorithms and programs in Computer Science are based on the notion of asymmetric distances, we have proposed and studied the problem of how to merge a (not necessarily finite) collection of asymmetric distances in order to obtain a new

Acknowledgements

The authors are grateful to the referees for their useful comments that have allowed us to improve the article. The authors acknowledge the support of the Govern Balear under Grant PCTIB 2005GC1-07, and the Spanish Ministry of Education and Science and FEDER Grants MTM2006-08322 and MTM2006-14925-C02-01.

References (27)

  • LM. Garcı´a-Raffi et al.

    Weak topologies on asymmetric normed linear spaces and non-asymptotic criteria in the theory of complexity analysis of algorithms

    J. Anal. Appl.

    (2004)
  • L.M. Garcı´a-Raffi et al.

    The supremum asymmetric norm on sequence algebras: a general framework to measure complexity spaces

    Electron. Notes Theor. Comput. Sci.

    (2003)
  • L.M. Garcı´a-Raffi, S. Romaguera, E.A. Sánchez-Pérez, O. Valero, Normed semialgebras: a mathematical model for the...
  • Cited by (0)

    View full text