Globally and locally minimal weight spanning tree networks

doi:10.1016/S0378-4371(01)00430-7

Physica A: Statistical Mechanics and its Applications

Volume 301, Issues 1–4, 1 December 2001, Pages 601-619

https://doi.org/10.1016/S0378-4371(01)00430-7 Get rights and content

Abstract

The competition between local and global driving forces is significant in a wide variety of naturally occurring branched networks. We have investigated the impact of a global minimization criterion versus a local one on the structure of spanning trees. To do so, we consider two spanning tree structures—the generalized minimal spanning tree (GMST) defined by Dror et al. (Eur. J. Oper. Res. 120 (2000) 583) and an analogous structure based on the invasion percolation network, which we term the generalized invasive spanning tree (GIST). In general, these two structures represent extremes of global and local optimality, respectively. Structural characteristics are compared between the GMST and GIST for a fixed lattice. In addition, we demonstrate a method for creating a series of structures which enable one to span the range between these two extremes. Two structural characterizations, the occupied edge density (i.e., the fraction of edges in the graph that are included in the tree) and the tortuosity of the arcs in the trees, are shown to correlate well with the degree to which an intermediate structure resembles the GMST or GIST. Both characterizations are straightforward to determine from an image and are potentially useful tools in the analysis of the formation of network structures.

Introduction

The purpose of the present research is to detail a new method by which information extracted from a single, fixed network structure can be utilized to understand the physical processes which guided the formation of that structure. There are a variety of structures in nature and biology whose temporal development is difficult to observe. Accordingly, the principal data from which to understand the physics that drove the formation of these structures is the final structure itself.

An excellent example of the use of a final network structure to study the underlying physics is the work of Rodriguez–Iturbe and Rinaldo on river basins [2]. Detailed investigations of the structure of river basins combined with a variety of simulation and theoretical analysis support the conclusion that minimal energy dissipation is the driving force (both global and local) in the structure of river basins [3], [4], [5]. Similarly, natural complex branching patterns are observed in systems as diverse as retinal neurons [6], dielectric breakdown [7] and human vasculature [8]. Another recent example is the network of invading cells in malignant brain tumors observed in vitro [9].

All of these problems can be mapped to the language of spanning trees. For example, in the case of invading tumor cells, the tumor cells form branched chains, i.e., tree structures. The brain offers these invading cells a variety of pathways they can invade along (such as blood vessel and white fiber tracts) which may be interpreted as the edges of an underlying graph, with the various resistances along these pathways playing the role of edge weights. In many of these cases, the underlying physics behind the formation of the observed patterns are only beginning to be understood. The work presented here offers a useful tool in studying the driving forces in the formation of these structures.

Here we consider the class of structures called spanning trees. Formally, spanning trees are defined on graphs and, in the most basic definition, are a loopless, connected set of edges that connect all of the nodes in the underlying graph (see Fig. 1). Many different spanning trees can be generated for any given graph. Therefore, it is possible to introduce minimization criteria on the spanning-tree problem and select only those trees which satisfy the criteria. Thus, spanning trees represent an excellent test case for investigating the relation between individual structures and the minimization criteria that govern their formation.

A broadly useful class of spanning trees (for examples see [10], [11], [12]), is the minimal weight spanning tree (MST) [13], [14]. The MST is defined on an underlying graph whose edges each have some weight assigned to them. The MST is then the spanning tree (a subset of the edges in the underlying graph) that minimizes the total weight of the edges it includes. The minimal weight spanning tree represents a structure whose formation is guided by a global optimization principle. It is also possible to define other types of criteria for spanning trees. For example, it is possible to define a spanning tree such that only the lowest weight edges at each node are used (a detailed discussion of such a class of structures follows), giving a system with purely local criteria. Other types of criteria can also be imposed, such as the degree-constrained minimum spanning tree [15], [16], but they are not considered here.

One of the structures we study in this paper is the generalized minimal spanning tree (GMST), proposed by Dror et al. [1]. The GMST is useful in considering problems in which there are relevant length scales longer than a single edge. For example, a biological system is characterized by the diameter of a cell (mapped to a graph edge) as well as the length scale of diffusion in the system, which might be several cell diameters. As the name suggests, the GMST is a generalization of the MST. The GMST is defined on a graph in which the nodes have been partitioned into groups. The spanning condition for the GMST is redefined (relative to the MST) such that instead of requiring that every node in the graph be included in the tree, the inclusion of at least one node from each group is required. The GMST structure is the tree that meets this definition of spanning and minimizes the total weight of the edges it includes. When each group contains only one node, the GMST reduces to the MST.

The second class of trees considered in the present work is our generalization of the invasion percolation network [17] that we call the generalized invasive spanning tree (GIST). The invasion percolation network begins with a connected cluster of edges (in the simplest case, this could be just one edge). This cluster then “invades” the remaining edges by taking one edge from the boundary of the cluster and including it in the cluster. The edge that is included is the single edge, of those on the cluster boundary, with the lowest weight. Additional edges are then included, one at a time, in the same fashion until the cluster percolates (spans) across the system. The generalization of the invasion percolation network to the GIST, in analogy to that of the GMST, partitions the nodes into groups and modifies the percolation condition, such that one node from each group must be spanned.

For graphs in which each group is a single node (i.e., those graphs for which the GMST reduces to the MST), the GIST reduces to an acyclic invasion percolation network (i.e., an invasion percolation network without loops). It has been shown that the acyclic invasion percolation network is identical to the MST [19], [18]. Thus for graphs in which each group is a single node, the GMST and GIST are equivalent structures. Because of this equivalence, it is necessary to consider the GMST and GIST with groups of more than one node, rather than only the MST and invasion percolation, to understand the relation between local and global minimization criteria on tree structures.

The GMST and GIST structures were chosen because they generally offer extremes of global and local criteria. Both classes of trees have criteria which dictate the weight of edges chosen. The GMST structures choose the edges that minimize the total weight of the structure, even if that forces a higher weight edge to be chosen locally. In contrast, the GIST structures include the lowest weight edge locally, even if this results in a higher total weight for the entire tree. Except in the case of single-node groups noted above, each criterion results in a different final structure (though by definition both yield spanning trees). By comparing these structures, the effect of each type of criteria can be identified. Moreover, we provide a method to change a GIST structure incrementally into a more globally optimal GMST-like structure. This allows various structural features to be observed as a function of the degree to which either criterion is imposed. These intermediate structures can then serve as benchmarks for comparison when a real image is analyzed.

This paper is organized as follows. Section 2 is comprised of a description of the GMST and GIST structures. It also contains brief summaries of the protocols used to form these trees, as well as methods for transitioning from the GIST towards the GMST. Section 3 contains basic statistical descriptions of the structures generated for a given set of graph realizations. Section 4 introduces the statistical measurements of edge density and tortuosity and outlines how they may be used to study an experimentally observed image. Finally, Section 5 has some concluding remarks.

Section snippets

Models and algorithms

Spanning trees were generated for graphs with 250,000 nodes arrayed on a square lattice connected by 500,000 edges. The nodes were divided into groups of 64 nodes each, with each group containing the nodes within an 8×8 square. The size of the groups was chosen such that the groups were large enough to allow many possible subtrees within a group, but still small enough to allow many groups within the graph. Each edge was randomly assigned a weight uniformly distributed between 0 and 1. Twenty

Results and standard characterizations

For each graph realization, GMST and GIST structures were generated. The GIST was then pruned repeatedly yielding a backbone structure, which was in turn reduced to a more globally minimal structure using the conversion protocol. As noted previously, the GMST protocol requires choosing a starting group. While the best possible GMST (within the limits of the heuristic method employed) requires testing every group as a potential starting point, in practice for our graphs this proved unnecessary.

Analysis and discussion

While the differences between the various types of trees in standard characterizations such as total weight or included edge fraction are clear, they are also of little utility in analyzing a single given structure. These measurements rely on the complete knowledge of the graph, including all edge weights. When considering a physical problem such as the infiltration of tumor cells into a porous gel, this level of information is generally difficult, if not impossible, to obtain. Instead a

Conclusions

The results described above show that altering the criterion for including edges in a spanning tree from a global one (the GMST) to a local criterion (the GIST) has a measurable impact on the statistical characterization of the resulting trees. In particular, the total weight, occupied edge density, inclusion fraction distribution, and the tortuosity varied systematically with different types of spanning trees. Of these, however, the majority require extensive information about the graph under

Acknowledgements

This work has been supported in part by grants CA84509 and CA69246 from the National Institutes of Health. The work was also supported by the Engineering Research Program of the Office of Basic Energy Sciences at the Department of Energy (Grant DE-FG02-92ER14275). The authors would like to thank Dr. T.S. Deisboeck for valuable discussions.

References (22)

M Dror et al.
Generalized spanning trees
Eur. J. Oper. Res.
(2000)
D.R Kayser et al.
Active walker models: track and landscapes
Physica A
(1992)
S.C Narula et al.
Degree-constrained minimum spanning tree
Comput. Oper. Res.
(1980)
L.H Hsu et al.
Finding the most vital edge with respect to minimum spanning tree in weighted graphs
Inf. Process. Lett.
(1991)
I Rodriguez–Iturbe et al.
Fractal River Basins
(1997)
A Rinaldo et al.
Self-organized fractal river networks
Phys. Rev. Lett.
(1993)
T Sun et al.
Minimum energy dissipation model for river basin geometry
Phys. Rev. E
(1994)
A Maritan et al.
Scaling laws for river networks
Phys. Rev. E
(1996)
R.D Freimuth et al.
Active walker models for filamentary growth patterns
G.B West et al.
A general model for the origin of allometric scaling laws in biology
Science
(1997)

T.S Deisboeck et al.

Patterns of self-organization in tumour systems: complex growth dynamics in a novel brain tumour spheroid model

Cell Prolif.

(2001)

Cited by (13)

The generalized minimum spanning tree problem: An overview of formulations, solution procedures and latest advances
2020, European Journal of Operational Research
Citation Excerpt :
The problem thus defined aims at finding a minimum cost tree spanning a subset of ports or docks which includes exactly one from every island. The area of applications was extended to physics by Kansal and Torquato (2001), who developed a general method through which the data obtained from a given network can be applied for the understanding of the physical process that has lead the information of the specific structure. In the scientific literature, there have been introduced various types of linear integer programming formulations of the GMSTP.
In this paper, some of the main known results relative to the generalized minimum spanning tree problem are surveyed. The principal feature of this problem is related to the fact that the vertices of the graph are partitioned into a certain number of clusters and we are interested in finding a minimum-cost tree spanning a subset of vertices with precisely one vertex considered from every cluster. The paper is structured around the following main headings: problem definition, variations and practical applications, complexity aspects, integer programming formulations, exact and heuristic solution approaches developed for solving this problem. Furthermore, we also discuss some open problems and possible research directions.
A multi-operator genetic algorithm for the generalized minimum spanning tree problem
2016, Expert Systems with Applications
Citation Excerpt :
Fig. 1 presents a GMSTP example with 5 clusters and 17 vertices and shows that, for each cluster, only one node is chosen and that the cost of the objective function of the GMSTP corresponds to the sum of the costs. The GMSTP has several applications, which include the definition of automated watering systems (Dror, Haouari, & Chaouachi, 2000), the design of wireless networks and computer networks (Myung, Lee, & Tcha, 1995), and applications in physics (Kansal & Torquato, 2001). Although the problem is very similar to the minimum spanning tree problem (MSTP), which is solved by means of the well-known polynomial algorithms of Prim (1957) and Kruskal (1956), the inclusion of the clusters concept turns it into an NP-hard problem.
The generalized minimum spanning tree problem, with applications in the field of communication networks, is a computational challenge due essentially to its NP-hardness. The problem consists of finding a minimum cost spanning tree in an undirected graph whose vertices are grouped in clusters, such that the spanning tree contains only one vertex of each cluster. The algorithms that have provided the best results still do not optimally solve all instances in the literature. One of the most widely studied approaches to the problem is the use of genetic algorithms that, in all cases, use only single operators for crossover and mutation, disregarding the potential synergy of multi-operators. We present a multi-operator genetic algorithm of the genotype-phenotype class, in which the genotype is a chain of integers that represents a cluster's selected vertex. Therefore, the phenotype is a minimum cost spanning tree that is generated by means of Kruskal's algorithm and joins the vertices selected from each cluster. Two operators are used for crossover and five for mutation, three of which are local search operators. The performance of the resultant algorithm is evaluated using the most challenging instances in the literature, the results of which are compared with those of other mono-operator genetic algorithms and with the best existing results. With the 101 instances that are considered, an average error of 0.0142% is achieved, and in 83 instances, the best solution cost is obtained. Such performance is due both to the synergistic effect produced among the operators and the mutation operators working as local searches. Additionally, the results suggest that for many other combinatorial optimization problems, which have been addressed with a genetic algorithm, better results could possibly be obtained simply by using a greater number of variation operators.
Modelling and optimization of retrofitting residential energy systems at the urban scale
2014, Energy
Citation Excerpt :
Services in a city tend to be laid along the paths of the surfaced roads. The tortuosity factor κ takes account of the impact of the road density (or path length) and may be defined for two arbitrary nodes as the ratio of the path length to the Euclidean distance [27]. The resulting augmented distance ψ for an arc (i,i′), shown in Eq. (30), is the product of the straight-line distance ε and the tortuosity factor κ.
Local governments and property developers are increasingly seeking robust models for strategic planning of retrofitting residential energy systems. Strategic planning here infers making decisions on technology upgrades at the concept design stage. This represents a problem with many degrees of freedom. Optimization models offer a solution. The presentation of the tool RESCOM is the focus of this paper, a MILP (mixed integer linear program) that builds upon previous models of urban energy systems and extends these efforts by incorporating both demand side technologies, and explicit spatial and temporal resolution.
A test case for a London borough describes the result of applying spatial optimization and demonstrates the accessibility of this program to decision makers. Results provide the optimal configuration of supply side and demand side technologies required to satisfy thermal energy requirements for a range of scenarios. The presented approach solves retrofit problems at urban scale in an efficient and thorough manner, providing an expandable framework towards providing solutions for the selection and operation of complex energy systems.
A GRASP-based approach to the generalized minimum spanning tree problem
2012, Expert Systems with Applications
Citation Excerpt :
The GMSTP appears in the field of telecommunications (Myung et al., 1995), in the location of facilities such as distribution centers, warehouses, or stores (Shyu et al., 2003). Dror et al. (2000) describe an interesting application in the field of agricultural watering, while Kansal and Torquato (2001) extend the field of applications to physics. Considering a different objective function that contains prices associated with the nodes, Golden, Raghavan, and Stanojević (2008) study an application on the design of submarine cable networks.
Given a multipartite graph G the generalized minimum spanning tree problem is to find a tree of minimal cost that includes a vertex from each part. This paper proposes several versions of the GRASP metaheuristic for the problem. The GRASP approach is based on constructive heuristics as well as on additional improvement mechanisms such as path-relinking and iterated local search. Several computational experiments are performed over a set of existing instances. A cut generation algorithm is proposed that is able to find lower bounds, based on a formulation for Steiner’s problem in directed graphs. The computational results show that the best versions of the GRASP approach use improvement mechanisms. The solutions found are better than most of the known solutions in the literature and require significantly less computer time. Furthermore, a set of rules is defined for pre-processing the instances, based on the Bottleneck distance concept. Using those rules, it was possible to reduce the size of the instances to an average of 14% of the number of edges in relation to the original graphs.
Modeling the effects of vasculature evolution on early brain tumor growth
2006, Journal of Theoretical Biology
Citation Excerpt :
Typically, when a solid tumor is removed, these invasive cells are left behind and tumor recurrence is almost inevitable (Holland, 2000). We hope to extend the hybrid cellular automaton model in the future in order to address the impact that the tumor vasculature, cell–cell adhesion and long-range cell signaling (Kansal and Torquato, 2001) have on single-cell invasion and treatment. In particular, recent mathematical models (Cristini et al., 2005; Frieboes et al., 2006) have demonstrated that the invasive phenotype of tumor cells is favored in the presence of a heterogeneous distribution of oxygen and nutrients, while suppressed in the presence of a homogeneous oxygen distribution.
Mathematical modeling of both tumor growth and angiogenesis have been active areas of research for the past several decades. Such models can be classified into one of two categories: those that analyze the remodeling of the vasculature while ignoring changes in the tumor mass, and those that predict tumor expansion in the presence of a non-evolving vasculature. However, it is well accepted that vasculature remodeling and tumor growth strongly depend on one another. For this reason, we have developed a two-dimensional hybrid cellular automaton model of early brain tumor growth that couples the remodeling of the microvasculature with the evolution of the tumor mass. A system of reaction–diffusion equations has been developed to track the concentration of vascular endothelial growth factor (VEGF), Ang-1, Ang-2, their receptors and their complexes in space and time. The properties of the vasculature and hence of each cell are determined by the relative concentrations of these key angiogenic factors. The model exhibits an angiogenic switch consistent with experimental observations on the upregulation of angiogenesis. Particularly, we show that if the pathways that produce and respond to VEGF and the angiopoietins are properly functioning, angiogenesis is initiated and a tumor can grow to a macroscopic size. However, if the VEGF pathway is inhibited, angiogenesis does not occur and tumor growth is thwarted beyond 1–2 mm in size. Furthermore, we show that tumor expansion can occur in well-vascularized environments even when angiogenesis is inhibited, suggesting that anti-angiogenic therapies may not be sufficient to eliminate a population of actively dividing malignant cells.
Upper and lower bounding strategies for the generalized minimum spanning tree problem
2006, European Journal of Operational Research
We address the generalized minimum spanning tree problem (GMST) which requires spanning at least one vertex out of every set of disjoint vertices in a graph. We show that the geometric version of this problem is $NP$ -hard, and we propose two stochastic heuristics. The first one is a very fast randomized greedy search algorithm and the second one being a genetic algorithm. Also, we investigate some existing integer programming formulations and present an new one. A new Lagrangian based lower bound is proposed and implemented to assess the performance of the heuristics. Computational experiments performed on a large set of randomly generated instances with up to 1000 vertices and 10,000 edges provide evidence of the good performance of the proposed heuristics.

View all citing articles on Scopus

View full text

Globally and locally minimal weight spanning tree networks

Abstract

Introduction

Section snippets

Models and algorithms

Results and standard characterizations

Analysis and discussion

Conclusions

Acknowledgements

Eur. J. Oper. Res.

Physica A

Comput. Oper. Res.

Inf. Process. Lett.

Fractal River Basins

Self-organized fractal river networks

Phys. Rev. Lett.

Minimum energy dissipation model for river basin geometry

Phys. Rev. E

Scaling laws for river networks

Phys. Rev. E

Active walker models for filamentary growth patterns

A general model for the origin of allometric scaling laws in biology

Science

Patterns of self-organization in tumour systems: complex growth dynamics in a novel brain tumour spheroid model

Cell Prolif.