Greed is Good for Deterministic Scale-Free Networks

Chauhan, Ankit; Friedrich, Tobias; Rothenberger, Ralf

doi:10.1007/s00453-020-00729-z


Open access
Published: 19 June 2020

Volume 82, pages 3338–3389, (2020)

Algorithmica


Abstract

Large real-world networks typically follow a power-law degree distribution. To study such networks, numerous random graph models have been proposed. However, real-world networks are not drawn at random. Therefore, Brach et al. (27th symposium on discrete algorithms (SODA), pp 1306–1325, 2016) introduced two natural deterministic conditions: (1) a power-law upper bound on the degree distribution (PLB-U) and (2) power-law neighborhoods, that is, the degree distribution of neighbors of each vertex is also upper bounded by a power law (PLB-N). They showed that many real-world networks satisfy both properties and exploited them to design faster algorithms for a number of classical graph problems. We complement their work by showing that some well-studied random graph models exhibit both of the mentioned PLB properties. PLB-U and PLB-N hold with high probability for Chung–Lu Random Graphs and Geometric Inhomogeneous Random Graphs and almost surely for Hyperbolic Random Graphs. As a consequence, all results of Brach et al. also hold with high probability or almost surely for those random graph classes. In the second part we study three classical $\textsf {NP}$-hard optimization problems on PLB networks. It is known that on general graphs with maximum degree $\Delta$, a greedy algorithm, which chooses nodes in the order of their degree, only achieves an $\Omega (\ln \Delta )$-approximation for Minimum Vertex Cover and Minimum Dominating Set, and an $\Omega (\Delta )$-approximation for Maximum Independent Set. We prove that the PLB-U property with $\beta >2$ suffices for the greedy approach to achieve a constant-factor approximation for all three problems. We also show that these problems are APX-hard even if PLB-U, PLB-N, and an additional power-law lower bound on the degree distribution hold. Hence, a PTAS cannot be expected unless P = NP. Furthermore, we prove that all three problems are in MAX SNP if the PLB-U property holds.


1 Introduction

A wide range of real-world networks exhibit a degree distribution that resembles a power-law [4, 39]. This means that the number of vertices with degree k is proportional to $k^{-\beta }$, where $\beta >1$ is the power-law exponent, a constant intrinsic to the network. This applies to Internet topologies [24], the Web [7, 35], social networks [1], power grids [43], and literally hundreds of other domains [40]. Networks with a power-law degree distribution are also called scale-free networks and have been widely studied.

To capture the degree distribution and other properties of scale-free networks, a multitude of random graph models have been proposed. These models include Preferential Attachment [7], the Configuration Model [2], Chung–Lu Random Graphs [18] and Hyperbolic Random Graphs [34]. Despite this multitude of random models, none of them truly has the same set of properties as real-world networks.

This shortcoming of random graph models motivates studying deterministic properties of scale-free models, as these deterministic properties can be checked for real-world networks. To describe the properties of scale-free networks without the use of random graphs, Aiello et al. [2] define $(\alpha , \beta )$-Power Law Graphs. The problem with this model is that it essentially demands a perfect power-law degree distribution, whereas the degree distributions of real networks normally exhibit slight deviations from power laws. Therefore, $(\alpha , \beta )$-Power Law Graphs are too constrained and do not capture most real networks.

To allow for those deviations in the degree distribution Brach et al. [11] define buckets containing nodes of degrees $\left[ 2^i, 2^{i+1}\right)$. If the number of nodes in each bucket is at most as high as for a power-law degree sequence, a network is said to be power-law bounded, which we denote as a network with property PLB-U. They also define the property of PLB neighborhoods: A network has PLB neighborhoods if every node of degree k has at most as many neighbors of degree at least k as if those neighbors were picked independently at random with probability proportional to their degree. This property we abbreviate as PLB-N. PLB-U and PLB-N allow some degrees of parameterization: Both properties assume a power law distribution with power-law exponent $\beta >2$, a possible shift $t\geqslant 0$, and a scaling factor $c_1$ and $c_3$ respectively. A shift of t means that the number of nodes of degree k is proportional to $(k+t)^{-\beta }$. A formal definition of both properties can be found in Sect. 3. Brach et al. [11] showed experimentally that PLB-(U,N) properties hold for many real-world networks, which implies that the mentioned graph problems can be solved faster on these real-world networks than worst-case lower bounds for general graphs suggest.

2 Our Contribution

2.1 PLB Properties in Power-Law Random Graph Models

The PLB-(U,N) properties are designed to describe power-law graphs in a way that allows analyzing algorithms deterministically. As already mentioned, there is a multitude of random graph models [2, 7, 18, 34], which can be used to generate power-law graphs. Brach et al. [11] proved that the Erased Configuration Model [2] with a power-law degree distribution follows PLB-U and w. h. p. also PLB-N. In the Configuration Model a graph with a given degree sequence is sampled uniformly at random. This is done by generating $\deg (v)$ stubs for each node $v\in V$ and then matching these stubs independently and uniformly at random to create edges. In the Erased Configuration Model loops and multiple edges are removed in order to generate a simple graph. Since the Erased Configuration Model has a fixed degree sequence, it is relatively easy to prove the PLB-U property, but it is quite technical to prove the PLB-N property. There are other power-law random graph models, which are based on expected degree sequences, e.g. Chung–Lu Random Graphs [18]. Brach et al. argued that for showing the PLB-U property on these models, a typical concentration statement does not work, as it accumulates the additive error for each bucket. They leave it as a challenging open question, whether other random graph models also produce graphs with PLB-(U,N) properties with high probability^{Footnote 1}.

The models we consider in Sect. 4 are Geometric Inhomogeneous Random Graphs, Hyperbolic Random Graphs, and Chung–Lu Random Graphs.

Geometric Inhomogeneous Random Graphs satisfy PLB-(U,N): Geometric Inhomogeneous Random Graphs (GIRGs) [12, 13, 33] consider an expected degree vector and an underlying geometry.

In GIRGs, all nodes draw a position uniformly at random and each edge (i, j) exists independently with a probability depending on $\frac{w_i\cdot w_j}{W}$ and the distance of i and j in the underlying geometry. We show:

Theorem 4.11

Let G be a GIRG whose weight sequence $\mathbf {w}$ follows a general power-law with exponent $\beta '>2$. Then, for all $2<\beta <\beta '$ and $t=0$ there are constants $c_1$ and $c_3$ such that G fulfills PLB-U and PLB-N with high probability.

Hyperbolic Random Graphs satisfy PLB-(U,N): Hyperbolic Random Graphs (HRGs) [34] assume an underlying hyperbolic space. Each node is positioned uniformly at random in this space and connected to other nodes with a probability that depends on its hyperbolic distance to them. For Hyperbolic Random Graphs we show the following:

Theorem 4.14

Let G be an HRG with $\alpha _H>\frac{1}{2}$. Then, G almost surely fulfills PLB-U and PLB-N with $\beta =2\alpha _H+1-\eta$, $t=0$, any constant $\eta >0$, and some constants $c_1$ and $c_3$.

Chung–Lu Random Graphs satisfy PLB-(U,N): Chung–Lu Random Graphs (CLRGs) [18] assume a sequence of expected degrees $w_1,\ w_2,\ldots ,\ w_n$ and each edge (i, j) exists independently at random with probability $\min (1,\frac{w_i\cdot w_j}{W})$, where $W=\sum _{i=1}^{n}{w_i}$. We show the following theorem:

Theorem 4.16

Let G be a CLRG whose weight sequence $\mathbf {w}$ follows a general power-law with exponent $\beta '>2$. Then, for all $2<\beta <\beta '$ and $t=0$ there are constants $c_1$ and $c_3$ such that G fulfills PLB-U and PLB-N with high probability.
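The Chung–Lu edge rule stated above can be illustrated with a minimal sketch. The function name `chung_lu_graph` and the `seed` parameter are hypothetical, and the sketch assumes that only pairs with $i<j$ are considered (no loops); it uses the naive quadratic loop over all pairs rather than any optimized sampler.

```python
import random


def chung_lu_graph(weights, seed=None):
    """Sample a Chung-Lu style graph (illustrative sketch, not the paper's code).

    weights: list of expected degrees w_1, ..., w_n.
    Returns a list of edges (i, j) with i < j (0-based node indices).
    """
    rng = random.Random(seed)
    total_weight = sum(weights)  # W = sum of all w_i
    n = len(weights)
    edges = []
    for i in range(n):
        for j in range(i + 1, n):
            # Edge (i, j) exists independently with probability min(1, w_i * w_j / W).
            p = min(1.0, weights[i] * weights[j] / total_weight)
            if rng.random() < p:
                edges.append((i, j))
    return edges
```

Note that the expected degree of node $i$ is close to $w_i$ only when the cap at 1 is rarely active.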

2.2 Algorithmic Results

The above results imply that all results of Brach et al. [11] also hold w. h. p. for Geometric Inhomogeneous Random Graphs and Chung–Lu Random Graphs and almost surely for Hyperbolic Random Graphs. Therefore, the problems transitive closure, maximum matching, determinant, PageRank, matrix inverse, counting triangles and maximum clique have faster algorithms on Chung–Lu and Geometric Inhomogeneous Random Graphs w. h. p. and on Hyperbolic Random Graphs almost surely.

In this work we additionally consider the three classical $\textsf {NP}$-complete problems Minimum Dominating Set (MDS), Maximum Independent Set (MIS) and Minimum Vertex Cover (MVC) on PLB-U networks. For the first two problems, positive results are already known for $(\alpha , \beta )$-Power Law Graphs, which are a special case of graphs with the PLB-U property and an additional power law lower bound on the degree distribution (PLB-L). Note that this deterministic graph class is much more restrictive and does not cover typical real-world graphs. In contrast, our positive results only assume the PLB-U property. Our algorithmic results can therefore be applied to real-world networks after measuring the respective constants of the PLB-model. In Sect. 5 we prove our main lemma, Lemma 5.2 (the Potential Volume Lemma). Using the Potential Volume Lemma, we prove lower bounds for the size of MDS, MIS and MVC in the order of $\Theta (n)$ on PLB-U networks with exponent $\beta >2$. This essentially means that even taking all nodes as a solution gives a constant factor approximation. Furthermore, in Theorem 5.7 we prove that the greedy algorithm actually achieves a better constant approximation ratio for Minimum Dominating Set. The positive results from Sect. 5 also hold for $(\alpha , \beta )$-Power Law Graphs.

Brach et al. [11] proved that for PLB-(U,N) networks with $\beta >3$ finding a maximum clique is solvable in polynomial time. This result gives rise to the question whether the PLB-N property can be helpful in solving other NP-complete problems on power-law graphs in polynomial time. In Sect. 6 we consider the mentioned NP-complete problems MDS, MIS and MVC and prove that these problems are APX-hard even for PLB-(U,L,N) networks with $\beta >2$. Therefore, at least for the three problems we considered, even the PLB-N property is not enough to make those problems polynomial-time solvable. As a side product we also get a lower bound on the approximability of the respective problems under some complexity-theoretic assumptions. Since the negative results for $(\alpha , \beta )$-Power Law Graphs imply the same non-approximability on graphs with PLB-(U,L), we only consider simple graphs with PLB-(U,L,N) in Sect. 6.

Finally, we show that all three problems are in MAX-SNP for graphs with PLB-U and $\beta >2$. This implies that, if we reduce any of those problems to Max 3-Sat, there exists an FPT algorithm where the parameter is the maximum number of satisfiable clauses minus a lower bound on this number, which is linear in the total number of clauses. This parameter can be considerably smaller than the solution size of the original problem.

2.2.1 Dominating Set

Given a graph $G=(V,E)$, a Minimum Dominating Set (MDS) is a subset $S\subseteq {V}$ of minimum size such that for each $v\in {V}$ either v or a neighbor of v is in S. MDS cannot be approximated within a factor of $(1-\varepsilon )\,\ln |V|$ for any $\varepsilon >0$ [25] unless $\textsf {NP}\subseteq \textsf {DTIME}(|V|^{\log \log |V|})$, and not within a factor of $\ln \Delta - c\ln \ln \Delta$ for some $c>0$ [16] unless $\textsf {P}=\textsf {NP}$, although a simple greedy algorithm achieves an approximation ratio of $1+\ln \Delta$ [32]. We also know that even for sparse graphs, MDS cannot be approximated within a factor of $o(\ln (n))$, since we could have a graph with a star of $n-\sqrt{n}$ nodes to which an arbitrary graph of the $\sqrt{n}$ remaining nodes is attached [36]. Furthermore, if we parameterize Dominating Set by the size of the solution, it is $\textsf {W[2]}$-complete [21].

MDS has already been studied in the context of $(\alpha , \beta )$-Power Law Graphs. Ferrante, Pandurangan, and Park [26] showed that the problem remains $\textsf {NP}$-hard for $\beta >0$. Shen et al. [46] proved that there is no $\left( 1+\frac{1}{3120\zeta (\beta )3^\beta }\right)$-approximation^{Footnote 2} for $\beta >1$ unless $\textsf {P}=\textsf {NP}$. They also showed that the greedy algorithm achieves a constant approximation factor for $\beta >2$, showing that in this case the problem is APX-hard. Gast, Hauptmann, and Karpinski [29] also proved a logarithmic lower bound on the approximation factor when $\beta \leqslant 2$.

For graphs with the PLB-U property and power law exponent $\beta >2$ we will show a lower bound on the size of the minimum dominating set in the range of $\Theta (n)$, which already gives us a constant factor approximation by taking all nodes. This also means that any brute-force algorithm which runs in exponential time is in FPT when we take the solution size as a parameter.

In contrast to $(\alpha , \beta )$-Power Law Graphs the PLB-U property captures a wide range of real networks, making it possible to transfer our results to them. All our upper bounds are in terms of the following two expressions, which depend on the parameters $c_1$, $\beta$ and t of the PLB-U property (cf. Definition 3.1):

$$\begin{aligned} a_{\beta ,t}:=\left( 1+\frac{\beta -1}{\beta -2}\frac{1}{1-\left( \frac{t+2}{t+1}\right) ^{1-\beta }}\right) \text { and }\\ b_{c_1,\beta ,t}:=\left( c_1\frac{\beta -1}{\beta -2}\cdot 2^{\beta }\cdot (t+1)^{\beta -1}\right) ^{\frac{1}{\beta -2}}. \end{aligned}$$

In the rest of the paper we assume the parameters $c_1$, $\beta$ and t to be constants, which implies that $a_{\beta ,t}$ and $b_{c_1,\beta ,t}$ are constant as well.

Theorem 5.3

For a graph without loops and isolated vertices and with the PLB-U property with parameters $\beta >2$, $c_1>0$ and $t\geqslant 0$, the minimum dominating set is of size at least

$$\begin{aligned} \left( 2\cdot a_{\beta ,t}\cdot b_{c_1,\beta ,t}+1\right) ^{-1}n = \Theta (n). \end{aligned}$$
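To get a feeling for the size of these constants, the following minimal sketch evaluates $a_{\beta ,t}$, $b_{c_1,\beta ,t}$ and the lower-bound fraction of Theorem 5.3 numerically. The function names are hypothetical and chosen only for this illustration; the formulas are transcribed directly from the definitions above.

```python
def a_const(beta, t):
    """a_{beta,t} from the definition above; requires beta > 2 and t >= 0."""
    ratio = ((t + 2) / (t + 1)) ** (1 - beta)
    return 1 + (beta - 1) / (beta - 2) * 1 / (1 - ratio)


def b_const(c1, beta, t):
    """b_{c1,beta,t} from the definition above; requires beta > 2, c1 > 0, t >= 0."""
    base = c1 * (beta - 1) / (beta - 2) * 2 ** beta * (t + 1) ** (beta - 1)
    return base ** (1 / (beta - 2))


def mds_lower_bound_fraction(c1, beta, t):
    """Fraction of n that Theorem 5.3 guarantees for every dominating set."""
    return 1 / (2 * a_const(beta, t) * b_const(c1, beta, t) + 1)
```

For example, with $\beta =3$, $t=0$ and $c_1=1$ one gets $a_{\beta ,t}=11/3$ and $b_{c_1,\beta ,t}=16$, so the bound guarantees a dominating set of size at least $\frac{3}{355}n$. The fraction is a constant, but it can be small for realistic parameters.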

Furthermore, we will show that the greedy algorithm actually achieves a lower approximation factor than the one we get from the bound in Theorem 5.3 (see Fig. 3 for a comparison).

Theorem 5.7

For a graph without loops and isolated vertices and with the PLB-U property with parameters $\beta >2$, $c_1>0$ and $t\geqslant 0$, the classical greedy algorithm for Minimum Dominating Set (cf. [22]) has an approximation factor of at most

$$\begin{aligned} \log _{3}(5)\cdot a_{\beta ,t}\ln \left( b_{c_1,\beta ,t}+1\right) +1=\Theta (1). \end{aligned}$$
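As an illustration of the kind of algorithm Theorem 5.7 refers to, here is a minimal sketch of the textbook greedy rule for dominating sets: repeatedly pick the node whose inclusive neighborhood contains the most not-yet-dominated nodes. The adjacency-dictionary representation and the function name are assumptions made for this sketch, and it assumes a simple undirected graph; the exact variant analyzed in [22] may differ in details such as tie-breaking.

```python
def greedy_dominating_set(adj):
    """Greedy dominating set on a simple undirected graph (illustrative sketch).

    adj: dict mapping each node to the set of its neighbours.
    Returns the chosen nodes in the order in which they were picked.
    """
    undominated = set(adj)
    solution = []
    while undominated:
        # Pick the node that dominates the most currently undominated nodes.
        best = max(adj, key=lambda v: len((adj[v] | {v}) & undominated))
        solution.append(best)
        undominated -= adj[best] | {best}
    return solution
```

This straightforward version recomputes all gains in every round; it is meant to show the selection rule, not an efficient implementation.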

Note that in networks with PLB-U the maximum degree can be $\Delta =\Theta (n^{\frac{1}{\beta -1}})$. That means the simple bound for the greedy algorithm gives us only an approximation factor of $\ln (\Delta +1)=\Theta (\log n)$.

In Minimum Connected Dominating Set we are looking for a smallest dominating set S with the extra property that the induced subgraph of S in G is connected. For this related problem we prove the following constant approximation factor for the greedy algorithm introduced by Ruan et al. [44] (Table 1).

Table 1 Comparison of the approximation ratios achieved by greedy algorithms on networks with an upper bound on the power-law degree distribution (PLB-U) and exponent $\beta >2$ and on general graphs


Theorem 5.8

For a graph without loops and isolated vertices and with the PLB-U property with parameters $\beta >2$, $c_1>0$ and $t\geqslant 0$, the greedy algorithm for Minimum Connected Dominating Set (cf. [44]) has an approximation factor of at most

$$\begin{aligned} 2+\ln \left( 2 \cdot a_{\beta ,t} \cdot b_{c_1,\beta ,t}+1\right) =\Theta (1). \end{aligned}$$

Furthermore, we show that Minimum Dominating Set remains $\textsf {APX}$-hard on networks with PLB-U and $\beta >2$, even with the PLB-L and PLB-N property. Finally, we prove that on networks with PLB-U and $\beta >2$, Minimum Dominating Set is in MAX SNP (Table 2).

Table 2 Comparison of the approximation lower bounds for polynomial-time algorithms (assuming $\textsf {P}\ne \textsf {NP}$) on networks with an upper (PLB-U) and lower (PLB-L) bound on the power-law degree distribution and with PLB neighborhoods (PLB-N) with the approximation lower bounds on general graphs


2.2.2 Independent Set

For a graph $G=(V,E)$, Maximum Independent Set (MIS) consists of finding a subset $S\subseteq {V}$ of maximum size, such that no two different vertices $u,v\in S$ are connected by an edge. MIS cannot be approximated within a factor of $\Delta ^\varepsilon$ for some $\varepsilon >0$ unless $\textsf {P}=\textsf {NP}$ [6], although a simple greedy algorithm achieves an approximation factor of $\frac{\Delta +2}{3}$ [30]. We also know from Turán’s Theorem that every graph with an average degree of ${\bar{d}}$ has a maximum independent set of size at least $\frac{n}{{\bar{d}}+1}$. This lower bound can already be achieved by the same greedy algorithm [30, Theorem 1]. When we consider parameterized Independent Set with solution size as the parameter, it is W[1]-complete [21].
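The greedy approach for independent sets can be sketched as follows, assuming the rule is the common minimum-degree one: repeatedly take a node of minimum degree in the remaining graph and delete it together with its neighbors. The function name and the adjacency-dictionary input are hypothetical, and the sketch assumes a simple undirected graph.

```python
def greedy_independent_set(adj):
    """Minimum-degree greedy for independent sets (illustrative sketch).

    adj: dict mapping each node to the set of its neighbours.
    Returns a maximal independent set as a list.
    """
    # Work on a copy so that the caller's graph is not modified.
    remaining = {v: set(nbrs) for v, nbrs in adj.items()}
    independent = []
    while remaining:
        # Choose a node of minimum degree in the remaining graph.
        v = min(remaining, key=lambda u: len(remaining[u]))
        independent.append(v)
        removed = remaining[v] | {v}
        for u in removed:
            remaining.pop(u, None)
        # Drop the removed nodes from the neighbour sets that are left.
        for nbrs in remaining.values():
            nbrs -= removed
    return independent
```

On a small instance one can check the Turán-type guarantee directly by comparing the size of the returned set with $\frac{n}{{\bar{d}}+1}$.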

MIS has also been studied in the context of $(\alpha , \beta )$-Power Law Graphs. Ferrante et al. [26] showed that the problem remains $\textsf {NP}$-hard for $\beta >0$. Shen et al. [46] proved that for $\beta >1$ there is no $\left( 1+\frac{1}{1120\zeta (\beta )3^\beta }-\varepsilon \right)$-approximation unless $\textsf {P}=\textsf {NP}$ and Hauptmann and Karpinski [31] gave the first non-constant bound on the approximation ratio of MIS for $\beta \leqslant 1$.

Since the PLB-U property with $\beta >2$ induces a constant average degree, the greedy algorithm already gives us a constant approximation factor for Maximum Independent Set on networks with these properties. Although we cannot give better bounds for the maximum independent set, Theorem 5.3 immediately implies a lower bound for the size of all maximal independent sets.

Theorem 5.13

In a graph without loops and isolated vertices and with the PLB-U property with parameters $\beta >2$, $c_1>0$ and $t\geqslant 0$, every maximal independent set is of size at least

$$\begin{aligned} \left( 2\cdot a_{\beta ,t}\cdot b_{c_1,\beta ,t}+1\right) ^{-1}n=\Theta (n). \end{aligned}$$

It is easy to see that these lower bounds do not hold in sparse graphs in general, since in a star the center node also constitutes a maximal independent set.

Furthermore, we show that Maximum Independent Set remains $\textsf {APX}$-hard in networks with PLB-U and $\beta >2$, even with the PLB-L and PLB-N property. Finally, we prove that on networks with PLB-U and $\beta >2$, Maximum Independent Set is also in MAX SNP.

2.2.3 Vertex Cover

Given a graph $G=(V,E)$, Minimum Vertex Cover (MVC) consists of finding a subset $S\subseteq V$ of minimum size such that each edge $e\in E$ is incident to at least one node from S. MVC cannot be approximated within a factor of $10\sqrt{5}-21\approx 1.3606$ unless $\textsf {P}=\textsf {NP}$, whereas the simple algorithm which greedily constructs a maximal matching achieves an approximation ratio of 2 [41]. Unfortunately, the greedy algorithm based on node degrees only achieves an approximation factor of $\ln \Delta$ (Theorem 5.15).
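The matching-based 2-approximation mentioned above can be sketched in a few lines: scan the edges, and whenever an edge has both endpoints uncovered, add both endpoints to the cover. The edges picked this way form a maximal matching, and any vertex cover must contain at least one endpoint of each matching edge, which gives the factor 2. The function name and edge-list input are assumptions made for this illustration.

```python
def matching_vertex_cover(edges):
    """2-approximate vertex cover via a greedily built maximal matching (sketch).

    edges: iterable of pairs (u, v) with u != v.
    Returns a set of nodes covering every edge.
    """
    cover = set()
    for u, v in edges:
        # The edge joins the matching only if neither endpoint is covered yet.
        if u not in cover and v not in cover:
            cover.update((u, v))
    return cover
```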

MVC has also been studied in the context of $(\alpha , \beta )$-Power Law Graphs. Shen et al. [46] proved that there is no PTAS for $\beta >1$ under the Unique Games Conjecture.

We can show that in networks with PLB-U and without isolated vertices the minimum vertex cover has to have a size of at least $\Theta (n)$. This follows immediately from Theorem 5.3, since in a graph without isolated nodes every vertex cover is also a dominating set:

Theorem 5.16

In a graph without loops and isolated vertices and with the PLB-U property with parameters $\beta >2$, $c_1>0$ and $t\geqslant 0$, the minimum vertex cover is of size at least

$$\begin{aligned} \left( 2\cdot a_{\beta ,t} \cdot b_{c_1,\beta ,t}+1\right) ^{-1}n. \end{aligned}$$

Also, we show that Minimum Vertex Cover remains $\textsf {APX}$-hard in networks with PLB-U and $\beta >2$, even with the PLB-L and PLB-N property. After that we prove that on networks with PLB-U and $\beta >2$, Minimum Vertex Cover is in MAX SNP.

3 Preliminaries and Notation

We mostly consider undirected multigraphs $G=(V,E)$ without loops, where V denotes the set of vertices and E the multiset of edges with $n=|V|$. In the following we will refer to multigraphs as graphs and state explicitly if we talk about simple graphs. Throughout the paper we use $\deg (v)$ to denote the degree of node v. Furthermore, we use $d_{min}$ and $\Delta$ to denote the minimum and maximum degree of the graph respectively. For a set $S\subseteq V$, we let $\textsc {vol}(S)=\sum _{v\in S}\deg (v)$ denote the volume of S. We use $b_i$ to denote the set of nodes $v\in V$ with $\deg (v)\in [2^i,2^{i+1})$. For $v\in V$ we let $N(v)=\left\{ u\in V\mid \left\{ u,v\right\} \in E\right\}$ denote the exclusive neighborhood of v and we let $N^{+}(v)=\left\{ u\in V\mid u=v \vee \left\{ u,v\right\} \in E\right\}$ denote the inclusive neighborhood of v. Analogously, for a set $S\subseteq V$, we let $N^{+}(S)=\left\{ v\in V\mid \exists u\in S:v\in N^{+}(u)\right\}$ denote the inclusive neighborhood of S and we let $N(S)=N^{+}(S)\setminus S$ denote the exclusive neighborhood of S. Furthermore, for $v\in V$ we let $N^{r}(v)=\left\{ u\in V\mid {\text {dist}}(u,v)\leqslant r\right\}$ denote the r-closed neighborhood of v, where ${\text {dist}}(u,v)$ denotes the length of a shortest path from u to v in G. Analogously, for a set of nodes $S\subseteq V$ we let $N^{r}(S)=\bigcup _{v\in S}N^{r}(v)$ denote the r-closed neighborhood of S. If not stated otherwise $\log$ denotes the logarithm of base 2 and $\ln$ denotes the natural logarithm.
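The notation above maps directly onto small helper functions. The following minimal sketch assumes a simple graph stored as a dictionary from nodes to neighbor sets (the paper's multigraph setting would need edge multiplicities instead); all function names are hypothetical.

```python
from collections import defaultdict


def volume(adj, nodes):
    """vol(S): sum of the degrees of the nodes in S."""
    return sum(len(adj[v]) for v in nodes)


def buckets(adj):
    """Map i to the bucket b_i of nodes with degree in [2^i, 2^(i+1))."""
    result = defaultdict(set)
    for v, nbrs in adj.items():
        degree = len(nbrs)
        if degree >= 1:  # degree-0 nodes fall into no bucket
            result[degree.bit_length() - 1].add(v)  # floor(log2(degree))
    return dict(result)


def inclusive_neighborhood(adj, nodes):
    """N^+(S): the nodes of S together with all of their neighbours."""
    result = set(nodes)
    for v in nodes:
        result |= adj[v]
    return result


def exclusive_neighborhood(adj, nodes):
    """N(S) = N^+(S) without S itself."""
    return inclusive_neighborhood(adj, nodes) - set(nodes)
```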

Now we give a formal definition of the PLB-U, PLB-L and PLB-N properties.

Definition 3.1

(PLB-U [11]) Let G be an undirected n-vertex graph and let $c_1>0$ be a universal constant. We say that G is power law upper-bounded (PLB-U) for some parameters $1<\beta ={\mathcal {O}}(1)$ and $t\geqslant 0$ if for every integer $d\geqslant 0$, the number of vertices v, such that $\deg (v)\in \left[ 2^d,2^{d+1}\right)$ is at most

$$\begin{aligned} c_1n(t+1)^{\beta -1}\sum _{i=2^d}^{2^{d+1}-1}{(i+t)^{-\beta }}. \end{aligned}$$
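Definition 3.1 can be checked mechanically for a concrete degree sequence. The sketch below is one straightforward way to do so; the function names are hypothetical, and the parameters `c1`, `beta` and `t` must be supplied by the caller, since the definition does not say how to fit them.

```python
def plb_u_bucket_bound(n, d, c1, beta, t):
    """Upper bound of Definition 3.1 for the bucket of degrees [2^d, 2^(d+1))."""
    tail = sum((i + t) ** (-beta) for i in range(2 ** d, 2 ** (d + 1)))
    return c1 * n * (t + 1) ** (beta - 1) * tail


def satisfies_plb_u(degrees, c1, beta, t):
    """Check the PLB-U condition for a list of node degrees."""
    n = len(degrees)
    counts = {}
    for degree in degrees:
        if degree >= 1:  # degree 0 lies in no bucket [2^d, 2^(d+1)) with d >= 0
            d = degree.bit_length() - 1
            counts[d] = counts.get(d, 0) + 1
    # Empty buckets trivially satisfy the bound, so only occupied ones are checked.
    return all(count <= plb_u_bucket_bound(n, d, c1, beta, t)
               for d, count in counts.items())
```

For a ring on 10 nodes (all degrees equal to 2) with $\beta =3$ and $t=0$, the only occupied bucket has bound $c_1\cdot 10\cdot (2^{-3}+3^{-3})$, so the check fails for $c_1=1$ and succeeds for $c_1=7$.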

Definition 3.2

(PLB-L) Let G be an undirected n-vertex graph and let $c_2>0$ be a universal constant. We say that G is power law lower-bounded (PLB-L) for some parameters $1<\beta ={\mathcal {O}}(1)$ and $t\geqslant 0$ if for every integer $\left\lfloor \log d_{min}\right\rfloor \leqslant d \leqslant \left\lfloor \log \Delta \right\rfloor$, the number of vertices v, such that $\deg (v)\in \left[ 2^d,2^{d+1}\right)$ is at least

$$\begin{aligned} c_2n(t+1)^{\beta -1}\sum _{i=2^d}^{2^{d+1}-1}{(i+t)^{-\beta }}. \end{aligned}$$

Since the PLB-U property alone can capture a much broader class of networks, for example empty graphs and rings, this lower bound is important to restrict networks to real power-law networks. In the definition of PLB-L, $d_{min}$ and $\Delta$ are necessary because in real-world power-law networks there are no nodes of lower or higher degree, respectively.

It is also noteworthy that not all values of $c_1$ (respectively $c_2$) are eligible, given t, $\beta$, and n. These constants have to be big (small) enough for the buckets to encompass all n nodes. If this is the case, we call $c_1$ ($c_2$) admissible.

Definition 3.3

(PLB-N [11]) Let G be an undirected n-vertex graph with PLB-U for some parameters $1<\beta ={\mathcal {O}}(1)$ and $t\geqslant 0$. We say that G has PLB neighborhoods (PLB-N) if for every vertex v of degree k, the number of neighbors of v of degree at least k is at most $c_3\max \left( \log n, (t+1)^{\beta -2}k\sum _{i=k}^{n-1}{i(i+t)^{-\beta }}\right)$ for some universal constant $c_3>0$.
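The PLB-N condition of Definition 3.3 can likewise be tested on a concrete graph. The following minimal sketch assumes a simple graph given as a dictionary of neighbor sets, so that every degree is at most $n-1$; the function name is hypothetical. It precomputes suffix sums of $i(i+t)^{-\beta }$ to avoid re-evaluating the sum for every vertex, and uses the base-2 logarithm in line with the convention stated above.

```python
import math


def satisfies_plb_n(adj, c3, beta, t):
    """Check the PLB-N condition on a simple graph (illustrative sketch)."""
    n = len(adj)
    # suffix[k] = sum_{i=k}^{n-1} i * (i + t)^(-beta), for k >= 1.
    suffix = [0.0] * (n + 1)
    for i in range(n - 1, 0, -1):
        suffix[i] = suffix[i + 1] + i * (i + t) ** (-beta)
    for v, nbrs in adj.items():
        k = len(nbrs)
        if k == 0:
            continue  # an isolated node has no neighbours to count
        # Number of neighbours whose degree is at least the degree of v.
        high = sum(1 for u in nbrs if len(adj[u]) >= k)
        bound = c3 * max(math.log2(n), (t + 1) ** (beta - 2) * k * suffix[k])
        if high > bound:
            return False
    return True
```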

Brach et al. [11] assume that PLB-U and PLB-N for the same graph have the same values of t and $\beta$. This makes sense, since both bounds describe the same (power-law) degree sequence. Henceforth, in the rest of this paper, we will assume that PLB-U, PLB-N, and PLB-L for the same graph also have the same values of t and $\beta$. How well the degree distributions of real networks and randomly generated power-law graphs fit into the PLB-U and PLB-L bounds can be seen in Fig. 1.

Throughout the paper we will also make repeated use of the following Lemma, which is a more precise version of [11, Lemma 2.2]. It will help us relate power-law bounds to the bucketed bounds of the PLB properties.

Lemma 3.4

Let $1\leqslant a\leqslant b/2$, for $a,b\in \mathbb {N}$, and let $c>0$ be a constant. Then

$$\begin{aligned} a^{-c}\leqslant \frac{c}{1-2^{-c}}\sum _{i=a}^{b-1}{i^{-c-1}}. \end{aligned}$$

Proof

$$\begin{aligned} \sum _{i=a}^{b-1}{i^{-c-1}}\geqslant \int _{a}^{b} \! x^{-c-1} \, \mathrm {d}x=\frac{1}{c}\left( a^{-c}-b^{-c}\right) \geqslant \frac{1-2^{-c}}{c}\cdot a^{-c}. \end{aligned}$$

$\square$
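Lemma 3.4 is easy to sanity-check numerically. The sketch below evaluates both sides of the inequality for given $a$, $b$ and $c$; the function name is hypothetical, and such a check is of course only a plausibility test for selected values, not a substitute for the proof.

```python
def lemma_3_4_holds(a, b, c):
    """Evaluate the inequality of Lemma 3.4 for integers 1 <= a <= b/2 and c > 0."""
    lhs = a ** (-c)
    rhs = c / (1 - 2 ** (-c)) * sum(i ** (-c - 1) for i in range(a, b))
    return lhs <= rhs
```

For instance, with $a=1$, $b=2$ and $c=1$ the left-hand side is 1 and the right-hand side is $\frac{1}{1-1/2}\cdot 1=2$.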

Furthermore, we will use the following standard Chernoff Bounds (cf. [23, Theorem 1.1]) to show that our random models generate graphs with PLB-U and PLB-N.

Theorem 3.5

Let $X:=\sum _{i\in [n]}X_i$, where $X_i$ for $i\in [n]$ are independently distributed in [0, 1]. Then, for $0<\varepsilon <1$,

$$\begin{aligned} \Pr \left( X>(1+\varepsilon )\mathbb {E}\left[ X\right] \right) \leqslant \exp \left( -\frac{\varepsilon ^2}{3}\mathbb {E}\left[ X\right] \right) \end{aligned}$$

and

$$\begin{aligned} \Pr \left( X<(1-\varepsilon )\mathbb {E}\left[ X\right] \right) \leqslant \exp \left( -\frac{\varepsilon ^2}{2}\mathbb {E}\left[ X\right] \right) . \end{aligned}$$

If $t> 2\cdot e\cdot \mathbb {E}\left[ X\right]$, then

$$\begin{aligned} \Pr \left( X>t\right) \leqslant 2^{-t}. \end{aligned}$$

4 Power-Law Random Graphs and the PLB properties

In this section we analyze some well-known power law random graph models and prove that w. h. p. or almost surely graphs generated by these models have PLB-U and PLB-N properties. We consider $(\alpha ,\beta )$-Power Law Graphs, Chung–Lu Random Graphs, Geometric Inhomogeneous Random Graphs, and Hyperbolic Random Graphs, because they are common models and rather easy to analyze. Furthermore, they assume independence or some geometrically implied sparseness of edges, which is important for establishing the PLB-N property.

4.1 $(\alpha , \beta )$-Power Law Graph

First, we consider $(\alpha , \beta )$-Power Law Graphs. Note that $(\alpha , \beta )$-Power Law Graphs are not a random graph model. Instead, they are classes of graphs whose degree distributions follow a power-law with scaling $e^\alpha$ and exponent $\beta$. We show that this already ensures the PLB-U property. We will also see that they satisfy PLB-N if they are generated with the Erased Configuration Model. This follows from a result by Brach et al. [11].

Formally, $(\alpha ,\beta )$-Power Law Graphs are defined as follows.

Definition 4.1

($(\alpha ,\beta )$-Power Law Graph [3]) An $(\alpha ,\beta )$-Power Law Graph is an undirected multigraph with the following degree distribution depending on two given values $\alpha$ and $\beta$. For $1\leqslant i\leqslant \Delta =\left\lfloor e^{\alpha /\beta }\right\rfloor$ there are $y_i=\left\lfloor \frac{e^\alpha }{i^\beta }\right\rfloor$ nodes of degree i.
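The degree distribution in Definition 4.1 is fully determined by $\alpha$ and $\beta$ and can be tabulated directly. A minimal sketch, with a hypothetical function name:

```python
import math


def power_law_graph_degree_counts(alpha, beta):
    """Return {i: y_i} with y_i = floor(e^alpha / i^beta) for 1 <= i <= Delta."""
    max_degree = math.floor(math.exp(alpha / beta))  # Delta = floor(e^(alpha/beta))
    return {i: math.floor(math.exp(alpha) / i ** beta)
            for i in range(1, max_degree + 1)}
```

For example, $\alpha =5$ and $\beta =2.5$ give $\Delta =\lfloor e^{2}\rfloor =7$ and $y_1=\lfloor e^{5}\rfloor =148$. The total number of nodes is the sum of all $y_i$.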

Intuitively those graphs already satisfy PLB-U by definition. The following theorem confirms this intuition. Remember that throughout the paper $\zeta$ denotes the Riemann zeta function.

Theorem 4.2

The $(\alpha ,\beta )$-Power Law Graph with $\beta >1$ has the PLB-U property with $c_1=\frac{1}{\zeta (\beta )}$, $t=0$ and exponent $\beta$.

Proof

It holds that the number of nodes of degree between $2^d$ and $2^{d+1}-1$ is at most

$$\begin{aligned} e^\alpha \sum _{i=2^d}^{2^{d+1}-1}{i^{-\beta }} \leqslant \frac{n}{\zeta (\beta )}\sum _{i=2^d}^{2^{d+1}-1}{i^{-\beta }} \end{aligned}$$

due to the definition of the degree distribution and the fact that $n=\left\lfloor \zeta (\beta )e^\alpha \right\rfloor$ for $\beta >1$. $\square$

Since the degree sequence of those graphs follows a power-law, we can show that PLB-L holds as well.

Theorem 4.3

The $(\alpha ,\beta )$-Power Law Graph with $\beta >1$ has the PLB-L property with $c_2=\frac{1}{2\zeta (\beta )}$, $t=0$ and exponent $\beta$.

Proof

The number of nodes of degree i is exactly $\left\lfloor \frac{e^\alpha }{i^\beta }\right\rfloor$. Since $i\leqslant \left\lfloor e^{\alpha /\beta }\right\rfloor$, this number is at least one. Therefore $\left\lfloor \frac{e^\alpha }{i^\beta }\right\rfloor \geqslant \frac{1}{2}\frac{e^\alpha }{i^\beta }$. It now holds that the number of nodes of degree between $2^d$ and $2^{d+1}-1$ is at least

$$\begin{aligned} \frac{e^\alpha }{2}\sum _{i=2^d}^{2^{d+1}-1}{i^{-\beta }} = \frac{n}{2\zeta (\beta )}\sum _{i=2^d}^{2^{d+1}-1}{i^{-\beta }} \end{aligned}$$

due to the definition of the degree distribution and the fact that $n=\zeta (\beta )e^\alpha$ for $\beta >1$.$\square$

Since $(\alpha ,\beta )$-Power Law Graphs describe classes of graphs with given degree distributions, PLB-N is not satisfied automatically. However, one can sample those graphs randomly with the Erased Configuration Model [2] to guarantee PLB-N with high probability. The Configuration Model gets a degree sequence as input and randomly generates a multigraph with this sequence. This is done by creating $\deg (u)$ many stubs for each node u and then connecting stubs uniformly at random. In the Erased Configuration Model loops and multi-edges are erased after this process in order to generate simple graphs. Brach et al. [11] proved that random networks created by the Erased Configuration Model whose prescribed degree sequence follows PLB-U, also follow PLB-U and PLB-N with high probability. This yields the following lemma, which concludes our section on $(\alpha ,\beta )$-Power Law Graphs.

Lemma 4.4

([11]) A random $(\alpha ,\beta )$-Power Law Graph with $\beta >1$ created with the Erased Configuration Model has the PLB-U and PLB-N properties with high probability.
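The stub-matching procedure of the Erased Configuration Model described above can be sketched as follows. The function name and the `seed` parameter are hypothetical; the sketch assumes an even degree sum and obtains a uniformly random matching of the stubs by shuffling them and pairing consecutive entries.

```python
import random


def erased_configuration_model(degrees, seed=None):
    """Sample a simple graph from a degree sequence (illustrative sketch).

    degrees: list where degrees[v] is the prescribed degree of node v.
    Returns a set of edges (u, v) with u < v; loops and multi-edges are erased.
    """
    if sum(degrees) % 2 != 0:
        raise ValueError("the degree sum must be even to match all stubs")
    rng = random.Random(seed)
    # Create deg(v) stubs for each node v.
    stubs = [v for v, degree in enumerate(degrees) for _ in range(degree)]
    rng.shuffle(stubs)
    edges = set()
    for u, v in zip(stubs[::2], stubs[1::2]):
        if u != v:  # erase loops
            edges.add((min(u, v), max(u, v)))  # the set erases multi-edges
    return edges
```

Because of the erasure step, the realized degrees can be smaller than the prescribed ones.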

4.2 Geometric Inhomogeneous Random Graphs

In this section we consider the very general model of Geometric Inhomogeneous Random Graphs (GIRGs), which was introduced by Bringmann et al. [12]. In this model, nodes are distributed uniformly at random on some underlying ground space. The probability of creating an edge between two nodes then depends on given weights for those nodes and on their distance in the ground space. Formally, the model is defined as follows.

Definition 4.5

(Geometric Inhomogeneous Random Graphs (GIRGs) [12, 14]) A Geometric Inhomogeneous Random Graph is a simple graph $G=(V,E)$ with the following properties. For $|V|=n$ let $w=(w_1,\ldots , w_n)$ be a sequence of positive weights. Let $W=\sum _{i=1}^n w_i$ be the total weight. For any vertex v, draw a point $x_v$ uniformly at random from the d-dimensional torus ${\mathbb {T}}^d=\mathbb {R}^d/ \mathbb {Z}^d$ with $d\in \mathbb {N}^{+}$. We connect vertices $u\ne v$ independently with probability $p_{uv}=p_{uv}(r)$, which depends on the weights $w_u$, $w_v$ and on the positions $x_u$, $x_v$, more precisely, on the distance $r=\left\| x_u-x_v\right\|$. For some fixed constant $\alpha >1$ the edge probability is

$$\begin{aligned} p_{uv}=\Theta \Big (\min \Big \{\frac{1}{\left\| x_u-x_v\right\| ^{\alpha d}}\Big (\frac{w_u w_v}{W} \Big )^{\alpha },1\Big \} \Big ). \end{aligned}$$
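To make the definition concrete, the following Python sketch samples a small GIRG by brute force over all vertex pairs. It rests on assumptions that the definition leaves open: the hidden constant in $\Theta (\cdot )$ is set to 1, distances use the maximum norm on the torus $[0,1)^d$, and the names `torus_distance`, `edge_probability`, and `sample_girg` are hypothetical.

```python
import random


def torus_distance(x, y):
    """Max-norm distance on the torus [0,1)^d (assumed norm)."""
    return max(min(abs(a - b), 1.0 - abs(a - b)) for a, b in zip(x, y))


def edge_probability(w_u, w_v, W, r, alpha, d):
    """min{ r^(-alpha*d) * (w_u*w_v/W)^alpha, 1 }, Theta-constant assumed 1."""
    vol = r ** d
    if vol == 0.0:
        return 1.0
    ratio = (w_u * w_v / W) / vol
    # min{ratio^alpha, 1} equals 1 exactly when ratio >= 1 (alpha > 0).
    return 1.0 if ratio >= 1.0 else ratio ** alpha


def sample_girg(weights, d=2, alpha=2.0, seed=None):
    """Return (positions, edges) for one sample; edges are pairs u < v."""
    rng = random.Random(seed)
    n = len(weights)
    W = sum(weights)
    positions = [[rng.random() for _ in range(d)] for _ in range(n)]
    edges = set()
    for u in range(n):
        for v in range(u + 1, n):
            r = torus_distance(positions[u], positions[v])
            p = edge_probability(weights[u], weights[v], W, r, alpha, d)
            if rng.random() < p:  # independent coin flip per pair
                edges.add((u, v))
    return positions, edges
```

The quadratic loop is meant only to mirror the definition; it is not an efficient sampler.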

The definition of GIRGs lends itself to encompass a multitude of models with a high degree of freedom regarding expected degree distributions and underlying geometries. Thus, we need additional constraints to show that this model generates instances that are power-law bounded. One straightforward condition is that the expected node degrees are power-law distributed. The following definition captures this condition formally.

Definition 4.6

(General Power-law [12]) A weight sequence $\mathbf {w}$ is said to follow a general power-law with exponent $\beta > 2$ if $w_{\min }:=\min \left\{ w_v\mid v\in V\right\} =\Omega (1)$ and if there is a ${\bar{w}}={\bar{w}}(n)\geqslant n^{\omega (1/\log \log n)}$ such that for all constants $\eta >0$ there are $\varepsilon _1,\varepsilon _2>0$ with

$$\begin{aligned} \varepsilon _1\frac{n}{w^{\beta -1+\eta }}\leqslant \left| \left\{ v\in V\mid w_v\geqslant w\right\} \right| \leqslant \varepsilon _2\frac{n}{w^{\beta -1-\eta }}, \end{aligned}$$

where the first inequality holds for all $w_{\min }\leqslant w \leqslant {\bar{w}}$ and the second holds for all $w\geqslant w_{\min }$.

Note that the preceding definition states that for any choice of $\eta >0$ one can find constants $\varepsilon _1$ and $\varepsilon _2$ with the desired properties. However, for PLB-L, PLB-U, and PLB-N to hold it is sufficient to choose a fixed $\eta$ and constants $\varepsilon _1$ and $\varepsilon _2$ that satisfy the property. Another property implied by the general power-law is that the maximum degree is $\Delta =\mathcal {O}\left( n^{1/(\beta -\eta -1)}\right)$. We will use this property in the proof of Theorem 4.11.
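One way to see where a bound of this form comes from, sketched here for the weight sequence with a fixed $\eta$ and the corresponding $\varepsilon _2$: the upper tail inequality forces the number of vertices above a threshold w to drop below one once w is large enough.

$$\begin{aligned} \left| \left\{ v\in V\mid w_v\geqslant w\right\} \right| \leqslant \varepsilon _2\frac{n}{w^{\beta -1-\eta }}<1 \quad \text {whenever}\quad w>(\varepsilon _2 n)^{1/(\beta -1-\eta )}. \end{aligned}$$

Hence no weight exceeds $(\varepsilon _2 n)^{1/(\beta -1-\eta )}=\mathcal {O}\left( n^{1/(\beta -\eta -1)}\right)$. The same counting argument applies to any sequence that satisfies an upper tail bound of this shape, such as a degree sequence.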

We are now going to prove that GIRGs fulfill PLB-U and PLB-N. For this we need the following theorem and auxiliary lemmas by Bringmann et al. [13].

Theorem 4.7

([13]) Let G be a GIRG with a weight sequence that follows a general power-law with exponent $\beta$ and average degree $\Theta (1)$. Then, with high probability, the degree sequence of G follows a general power-law with exponent $\beta$ and average degree $\Theta (1)$, i.e., for all constants $\eta >0$ there exist constants $\varepsilon _3,\varepsilon _4 >0$ such that w. h. p.

$$\begin{aligned} \varepsilon _3\frac{n}{k^{\beta -1+\eta }}\leqslant \left| \left\{ v\in V\mid \deg (v)\geqslant k\right\} \right| \leqslant \varepsilon _4 \frac{n}{k^{\beta -1-\eta }}, \end{aligned}$$

where the first inequality holds for all $1\leqslant k\leqslant {\bar{w}}$ and the second holds for all $k\geqslant 1$.
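The two-sided tail condition can be spot-checked empirically on a concrete degree sequence. The Python sketch below tests both inequalities at a finite list of thresholds for user-chosen constants. It is only a sanity check at those thresholds, not a proof that the condition holds for all k, and the names `tail_count` and `check_tail_bounds` as well as the parameters `eps_lo`, `eps_hi`, and `cutoff` (standing in for the lower constant, the upper constant, and ${\bar{w}}$) are hypothetical.

```python
import bisect


def tail_count(sorted_values, k):
    """Return |{v : value_v >= k}| for an ascending sorted list."""
    return len(sorted_values) - bisect.bisect_left(sorted_values, k)


def check_tail_bounds(values, beta, eta, eps_lo, eps_hi, cutoff, thresholds):
    """Check eps_lo*n/k^(beta-1+eta) <= tail(k) <= eps_hi*n/k^(beta-1-eta).

    The upper bound is tested for every threshold k >= 1, the lower bound
    only for 1 <= k <= cutoff. Thresholds below 1 are skipped.
    """
    vals = sorted(values)
    n = len(vals)
    for k in thresholds:
        if k < 1:
            continue
        count = tail_count(vals, k)
        if count > eps_hi * n / k ** (beta - 1 - eta):
            return False
        if k <= cutoff and count < eps_lo * n / k ** (beta - 1 + eta):
            return False
    return True
```

For example, one could pass the degree sequence of a sampled graph together with thresholds such as powers of two up to the maximum degree.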

The following three lemmas are necessary to prove Theorem 4.11. Lemma 4.8 states that the marginal edge probability between two nodes u and v is essentially $\min \left\{ 1,\frac{w_u w_v}{W}\right\}$. Furthermore, even conditioned on a node’s position $x_u$, all edges between u and other nodes are present independently.

Lemma 4.8

([13]) Fix $u\in [n]$ and $x_u\in {\mathbb {T}}^d$. All edges $\left\{ u,\ v\right\}$, $u\ne v$, are independently present with probability

$$\begin{aligned} \Pr \left[ u\sim v\mid x_u\right] =\Theta (\Pr \left[ u\sim v\right] )=\Theta \left( \min \left\{ 1,\frac{w_u w_v}{W}\right\} \right) . \end{aligned}$$
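The order of magnitude of the marginal probability can be made plausible by a heuristic calculation. This is a sketch with all constants suppressed and not the proof from [13]. Write $\lambda =\frac{w_u w_v}{W}$, assume $\lambda \leqslant 1$, and let $r_0=\lambda ^{1/d}$ be the distance below which the edge probability is capped at 1. Integrating over the position of v, and using that a ball of radius r on the torus has volume $\Theta (r^d)$, gives

$$\begin{aligned} \Pr \left[ u\sim v\mid x_u\right]&=\Theta \left( r_0^{d}\right) +\mathcal {O}\left( \lambda ^{\alpha }\int _{r_0}^{\infty } r^{-\alpha d}\, r^{d-1}\,\mathrm {d}r\right) \\&=\Theta (\lambda )+\mathcal {O}\left( \frac{\lambda ^{\alpha }\, r_0^{d(1-\alpha )}}{d(\alpha -1)}\right) =\Theta (\lambda ). \end{aligned}$$

The integral converges because $\alpha >1$, and the result does not depend on $x_u$ since the torus looks the same from every point.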

In the proof of Theorem 4.11 we will only use the marginal edge probability from the preceding lemma. Lemma 4.9 bounds the expected node degrees asymptotically.

Lemma 4.9

([13]) For any $v\in [n]$ in a Geometric Inhomogeneous Random Graph, we have

$$\begin{aligned} {\mathbb {E}}[\deg (v)]=\Theta (w_v). \end{aligned}$$
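The upper bound in this statement can be read off the marginal edge probability of Lemma 4.8 by linearity of expectation. The following is a sketch under that lemma, not the argument from [13]:

$$\begin{aligned} {\mathbb {E}}[\deg (v)]=\sum _{u\ne v}\Pr \left[ u\sim v\right] =\Theta \left( \sum _{u\ne v}\min \left\{ 1,\frac{w_u w_v}{W}\right\} \right) \leqslant \mathcal {O}\left( \frac{w_v}{W}\sum _{u\in V} w_u\right) =\mathcal {O}(w_v). \end{aligned}$$

For a matching lower bound one can restrict the sum to vertices u with $w_u\leqslant W/w_v$, for which the minimum equals $\frac{w_u w_v}{W}$. The restricted sum is $\Omega (w_v)$ as long as these low-weight vertices carry a constant fraction of the total weight W, which is the kind of statement Lemma 4.10 provides for general power-law weight sequences.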

The preceding two lemmas imply that we can use standard Chernoff bounds to bound node degrees, but we also need the following auxiliary lemma. It bounds the expected volume of nodes with given maximum or minimum weights.

Lemma 4.10

([13]) Let $\mathbf {w}$ be a general power-law weight sequence with exponent $\beta$ and let $W_{\geqslant w}=\sum _{v\in V:\, w_v\geqslant w}{w_v}$ and $W_{\leqslant w}=\sum _{v\in V:\, w_v\leqslant w}{w_v}$. Then the total weight satisfies $W=\Theta (n)$. Moreover, for all sufficiently small $\eta >0$,

(i) $W_{\geqslant w}=\mathcal {O}(nw^{2-\beta +\eta })$ for all $w\geqslant w_{\min }$,
(ii) $W_{\geqslant w}=\Omega (nw^{2-\beta -\eta })$ for all $w_{\min }\leqslant w \leqslant {\bar{w}}$,
(iii) $W_{\leqslant w}=\mathcal {O}(n)$ for all w, and
(iv) $W_{\leqslant w}=\Omega (n)$ for all $w=\omega (1)$.
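Part (i) can be derived directly from the upper tail bound in Definition 4.6. The following sketch assumes $0<\eta <\beta -2$ and writes $N(x)=\left| \left\{ v\in V\mid w_v\geqslant x\right\} \right|$ for the tail count, a symbol introduced here only for illustration. Expressing each weight as $w_v=\int _0^{w_v}\mathrm {d}x$ and exchanging sum and integral yields

$$\begin{aligned} W_{\geqslant w}&=w\,N(w)+\int _w^{\infty }N(x)\,\mathrm {d}x\\&\leqslant \varepsilon _2\, n\, w^{2-\beta +\eta }+\varepsilon _2\, n\int _w^{\infty }x^{-(\beta -1-\eta )}\,\mathrm {d}x =\varepsilon _2\, n\, w^{2-\beta +\eta }\left( 1+\frac{1}{\beta -2-\eta }\right) . \end{aligned}$$

The integral converges precisely because $\beta -1-\eta >1$, which is why the statement is restricted to sufficiently small $\eta$.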

We are now ready to prove our first main theorem: For GIRGs whose weight sequence follows a general power-law with exponent $\beta '$, we can always find constants $c_1$ and $c_2$ such that PLB-U and PLB-N are satisfied for $t=0$ and any power-law exponent $2<\beta <\beta '$. Intuitively, this holds because PLB-U and PLB-N only demand upper bounds on the number of nodes in a bucket or in the neighborhood of a node. Decreasing the power-law exponent in PLB-U and PLB-N only makes these upper bounds less restrictive.

The proof works as follows. First, we show that Theorem 4.7 implies PLB-U. Second, we show PLB-N by bounding the ranges of node degrees depending on weights. We show that only nodes of sufficiently high weight can have a degree of at least k. It then suffices to consider those high-weight nodes as potential neighbors of a degree-k node. Since the edges between a degree-k node and its potential neighbors are drawn independently, we can use a Chernoff bound to show that the number of neighbors of degree at least k is concentrated around its expected value. All these statements hold with high probability. Thus, a union bound lets us add up the error probabilities without considering dependencies. This shows PLB-N.
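For reference, the two standard tools invoked in this outline can be stated as follows. The Chernoff bound is given in one common multiplicative form; the exact variant used in the proof may differ. Let $X=\sum _{i=1}^{m}X_i$ be a sum of independent random variables $X_i\in \{0,1\}$ with $\mu ={\mathbb {E}}[X]$. Then, for all $0<\delta \leqslant 1$,

$$\begin{aligned} \Pr \left[ X\geqslant (1+\delta )\mu \right] \leqslant \exp \left( -\frac{\delta ^2\mu }{3}\right) . \end{aligned}$$

The union bound states that for arbitrary, possibly dependent, events $A_1,\ldots ,A_m$,

$$\begin{aligned} \Pr \left[ \bigcup _{i=1}^{m}A_i\right] \leqslant \sum _{i=1}^{m}\Pr \left[ A_i\right] . \end{aligned}$$

In particular, if each of m bad events occurs with probability at most p, then the probability that at least one of them occurs is at most $m\cdot p$, regardless of how the events depend on each other.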