On Statistical Properties of a New Family of Geometric Random Graphs

Joglekar, Kedar; Joglekar, Pushkar; Shinde, Sandeep

doi:10.3390/engproc2024062024

Open Access Proceeding Paper

On Statistical Properties of a New Family of Geometric Random Graphs^†

by Kedar Joglekar ¹, Pushkar Joglekar ²,*,‡ and Sandeep Shinde ²,‡

¹ NVIDIA, Inc., Pune 411006, India

² Vishwakarma Institute of Technology, Pune 411037, India

* Author to whom correspondence should be addressed.

† Presented at the 2nd Computing Congress 2023, Chennai, India, 28–29 December 2023.

‡ These authors contributed equally to this work.

Eng. Proc. 2024, 62(1), 24; https://doi.org/10.3390/engproc2024062024

Published: 18 July 2024

(This article belongs to the Proceedings of The 2nd Computing Congress 2023)


Abstract: We define a new family of random geometric graphs, which we call random covering graphs, and study its statistical properties. To the best of our knowledge, this family of graphs has not been explored before. Our experimental results suggest striking deviations in the expected number of edges, the degree distribution, and the spectrum of the adjacency and normalized Laplacian matrices of the new family, compared with both the well-known Erdos–Renyi random graphs and the general random geometric graphs originally defined by Gilbert. In particular, the degree distribution of the graphs shows some interesting features in low dimensions. On the applied side, we believe that our random graph family might be effective in modelling some practically useful networks (world wide web, social networks, railway or road networks, etc.). The degree distributions of some complex networks arising in practice have been observed to follow a power law or log power law distribution; they tend to be right-skewed with a heavy tail, unlike the degree distributions of Erdos–Renyi graphs or general geometric random graphs (which have a sharp, exponentially decaying tail). The degree distribution of our random graph family deviates significantly from that of Erdos–Renyi graphs or general geometric random graphs and is closer to a right-skewed power law distribution with a heavy tail. Thus, we believe that this new family of graphs might be more effective in modelling the typical real-world networks mentioned above. The key contribution of the paper is the introduction of this new random graph family and an experimental study of some of its properties. Further investigation of these properties would be interesting from a purely mathematical perspective, and it might also be of practical interest for modelling real-world networks.

Keywords: geometric random graphs; degree distribution; Erdos–Renyi graphs; spectrum of a graph

1. Introduction

Graphs are used extensively to model real-world complex systems. The examples are as diverse as the world wide web, rail networks, electrical networks, communication networks, social media networks, protein graphs, and models of the human brain. Perhaps the first and simplest model for random graphs was proposed by Erdos and Renyi in the 1960s [1], in which, for any pair of vertices, an edge is added with probability p. The Erdos–Renyi model is a purely combinatorial model of random graphs, in which the edges are present independently of one another. Depending on the range in which the probability p lies, Erdos–Renyi graphs show different behavior in terms of the number of connected components, expansion properties, spectral distribution, etc. The Erdos–Renyi model has been extensively studied in the literature (see [2,3]). It is not very appropriate, however, when the connectivity of nodes is based on physical proximity. Gilbert initiated the study of random geometric graphs [4] in $\mathbb{R}^{2}$. The points are chosen in the 2D plane according to a Poisson process, and two points are joined by an edge if the Euclidean distance between them is less than r. In subsequent years, random geometric graphs were studied very extensively due to their wide applicability. They were studied in higher dimensions and with the ambient space changed to the hypercube $[0, 1]^{d}$ or to Euclidean spheres. Several variants of the model have been explored, such as soft RGGs, directed RGGs [5], dense RGGs [6], and translation-invariant RGGs, to name a few. The random geometric graph model has found application in a wide variety of areas such as wireless networks [7], consensus [8], robot motion planning [9], spread of viruses [10], and protein interaction [11]. Instead of giving a long list of related works, we point the reader to an excellent survey [12] for modern developments on the topic, and to [13] for more historical developments.

We landed on defining and exploring the random covering graph model from an entirely different route. The underlying covering process used in our definition of the RCG model is quite common in the study of integer lattices and geometry of numbers in general. Specifically, a similar process has been used by Ajtai–Kumar–Sivakumar [14] in the sieving algorithm for the shortest vector problem. While analyzing certain properties of the sieving process, we found it convenient to define the underlying graph as we did in the RCG definition.

Our definition of random covering graphs (RCGs) is similar to that of random geometric graphs (RGGs) but has a crucial difference. We pick points uniformly at random from the Euclidean ball of unit radius in n-dimensional space. We retain only those random points which are at a distance of at least r from the already chosen points (this is the crucial difference from the standard RGG model). To ensure that the above process terminates, we define a suitable stopping criterion based on the expected number of random points we need to pick before we find a point which is at a distance of at least r from the already chosen points. We put an edge between two chosen vertices if they are at a distance of at most $2r$. Clearly, no new point can be added to our collection exactly when every point inside the unit ball is at a distance of at most r from some chosen point, that is, when the balls of radius r centered at the chosen points cover the unit ball. That is the reason we name our model random covering graphs. We believe that putting a lower bound on the distance between the chosen points makes the model intrinsically linked with the geometry of the underlying Euclidean space. The model has both global aspects (e.g., it can be used to study random packing or random covering densities) and local aspects (e.g., we observe an interesting pattern in the degree distribution of the RCG which might be connected with the kissing number, a local property of the space).

First, we prove bounds on the number of vertices and edges of RCGs using a simple packing argument. Empirically, we observe that the log of the expected number of vertices and the log of the expected number of edges of RCGs with parameter r are linear functions of $1/r$. Next, we study the degree distribution of RCGs, varying the parameter r. We observe two distinct bulges in the degree distribution in dimensions 2, 3, 4 and 5. The second bulge is visible in dimension 6 on careful inspection, but it is not prominent. As r decreases, we observe that the second local maximum decreases and also shifts away from the first local maximum. It will be, mathematically, a very interesting problem to understand the reason behind this, particularly the two-lobe structure of the degree distribution.

Interestingly, the first local maxima are located at degrees which are close to the kissing number in the respective dimensions. The kissing number of n-dimensional Euclidean space is the largest number of unit-radius n-dimensional spheres one can place touching a central unit-radius sphere so that no two spheres overlap. The term kissing is derived from the game of billiards, where it is used to indicate the touching of balls. The kissing number problem has a very interesting history: the debate on the kissing number of 3-dimensional space dates back to Newton and Kepler. For 3-dimensional space, the answer was not known at that time; it was known only to be either 12 or 13. Even today, exact kissing numbers are known only for dimensions 1, 2, 3, 4, 8 and 24 [15,16,17], which are, respectively, 2, 6, 12, 24, 240 and 196,560. For other dimensions, only lower bounds and upper bounds are known (see [15,18] for the current best-known upper bounds and lower bounds). All of the above results except those for dimensions 1 and 2 rely on deep mathematics, and some of the proofs even require computer assistance. Clearly, the properties of RCGs are linked with the geometric properties of the underlying space. We believe that insisting on a lower bound on the distance between the chosen points strengthens this linkage, in contrast to general geometric random graphs.

The first local maxima of the degree distribution graph for RCG are close to kissing numbers in dimensions 2, 3, 4 and 5. Does it indicate some relation between RCG and the kissing number problem? Even if there might not be a direct connection with the kissing number, we believe that the mathematical study of this model certainly sheds some light on the local geometric properties of Euclidean space. Moreover, it appears that even to study the combinatorial properties of the RCGs such as planarity (in general, finding the genus of the graph), coloring number of the graphs, size of the sparsest cut, maximum flow, etc., the interplay between geometric and combinatorial techniques would be needed, which makes the model quite interesting.

It turns out that the degree distribution of RCGs as well as the distribution of the eigenvalues of RCGs deviate (which in itself is interesting, as RCGs too are geometric random graphs of a special kind) from that of Erdos–Renyi random graphs and general geometric random graphs. With reference to degree distribution, power law distribution or log power law distribution has been observed in several practically useful networks [19,20]. Moreover, the degree distribution of RCG demonstrates a “heavy tail”, right-skewed nature: a common feature of many practical networks [19,20,21]. The spectrum of RCGs is closer to the power law distribution unlike other random graph models, which have a spectrum close to Wigner’s semicircle distribution [22,23,24].

We note that a recent work [6] makes a point similar to ours about a geometric random graph model which deviates from Erdos–Renyi or general geometric random graphs in terms of various parameters. It will be interesting to further investigate the effectiveness of the RCG model in modeling complex networks that arise in practice. In the current work, our main aim is to introduce the model and motivate its further study. There is substantial scope for new experiments to evaluate various properties of the model, such as conductance, expansion properties, average number of triangles, clustering coefficients, etc.

The following is the organization of the paper. We start with the preliminary section, followed by a section in which we define the random covering graph model. In the next two sections, we summarize our experimental findings and compare and contrast them with those of other random graph models. We conclude with a summary and discussion.

2. Preliminaries

We begin with some basic definitions and preliminaries which are used throughout the paper. The definition of geometric random graphs, and of the new family of graphs we define, relies on sampling points from higher-dimensional Euclidean balls, so we first recall some related definitions. Let $\mathbb{R}$ denote the set of real numbers. For any k-tuple $x = (x_{1}, x_{2}, \dots, x_{k}) \in \mathbb{R}^{k}$ and real number $p \geq 1$, the $\ell_{p}$ norm of x, denoted by $\|x\|_{p}$, is defined as

$$\|x\|_{p} = \left(|x_{1}|^{p} + |x_{2}|^{p} + \dots + |x_{k}|^{p}\right)^{\frac{1}{p}},$$

where $|x_{i}|$ denotes the absolute value of $x_{i}$. $\mathbb{R}^{k}$ together with the $\ell_{p}$ norm forms a metric space; in particular, any points $x, y \in \mathbb{R}^{k}$ satisfy the triangle inequality $\|x + y\|_{p} \leq \|x\|_{p} + \|y\|_{p}$, with equality (for $1 < p < \infty$) only if the vectors $x, y$ are parallel. For $x, y \in \mathbb{R}^{k}$, the distance between x and y with respect to the $\ell_{p}$ norm is denoted by $d_{p}(x, y) = \|x - y\|_{p}$. For $x \in \mathbb{R}^{k}$ and a positive real number r, the ball of radius r with respect to the $\ell_{p}$ norm centered at x is defined as

$$B_{p}(x, r) = \{ y \in \mathbb{R}^{k} \mid \|x - y\|_{p} \leq r \},$$

so $B_{p}(x, r)$ is simply the collection of all points at distance at most r from x. In most of the paper, we work with Euclidean space and the associated $\ell_{2}$ norm. For ease of notation, unless stated otherwise, by $\|x\|$, $d(x, y)$, $B(x, r)$ we mean $\|x\|_{2}$, $d_{2}(x, y)$, $B_{2}(x, r)$, respectively.
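The definitions above translate directly into code. The following is a minimal sketch using only the Python standard library; the function names `lp_norm`, `lp_distance` and `in_ball` are illustrative choices and do not come from the paper.

```python
def lp_norm(x, p=2.0):
    """Return the l_p norm of the sequence x for a real number p >= 1."""
    if p < 1:
        raise ValueError("p must be at least 1")
    return sum(abs(xi) ** p for xi in x) ** (1.0 / p)


def lp_distance(x, y, p=2.0):
    """Return d_p(x, y) = ||x - y||_p for two points of equal dimension."""
    if len(x) != len(y):
        raise ValueError("points must have the same dimension")
    return lp_norm([xi - yi for xi, yi in zip(x, y)], p)


def in_ball(y, center, r, p=2.0):
    """Return True if y lies in the closed ball B_p(center, r)."""
    return lp_distance(center, y, p) <= r
```

With the default `p=2.0`, these correspond to the shorthand $\|x\|$, $d(x, y)$ and $B(x, r)$ used in the rest of the paper.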

There are several practical scenarios where data items of interest are naturally expressed as vectors in higher dimensions: for example, a product is associated with a tuple in which each entry is an attribute of the product, an image is represented by the vector of its features, and a word is expressed as a high-dimensional vector using word embeddings such as Word2Vec. Our geometric intuition is formed in 2 and 3 dimensions and is often misleading in higher dimensions. For example, if we pick a point uniformly at random in the d-dimensional unit ball centered at the origin, then with high probability the distance from the point to the origin is between $1 - \frac{c}{d}$ and 1, where c is an absolute constant independent of d. That is, with high probability, the point lies in the outer annular fringe of width $c/d$. To give another example, vectors chosen uniformly at random in higher dimensions are almost orthogonal to each other with very high probability, unlike our intuition in 2D or 3D. One has to be careful while geometrically interpreting higher-dimensional data. For a thorough treatment of various properties of higher-dimensional space, we refer to [25,26].
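The annulus statement can be made concrete with a short calculation. The sketch below assumes only that the volume of a d-dimensional ball of radius $\rho$, written $V_{d}(\rho)$, scales as $\rho^{d}$, and uses the inequality $1 - t \leq e^{-t}$; the symbol $\varepsilon$ is introduced here for illustration.

```latex
\Pr\bigl[\|x\| \le 1 - \varepsilon\bigr]
  = \frac{V_d(1-\varepsilon)}{V_d(1)}
  = (1-\varepsilon)^d
  \le e^{-\varepsilon d},
\qquad\text{so with } \varepsilon = \frac{c}{d}:\quad
\Pr\Bigl[\|x\| \le 1 - \frac{c}{d}\Bigr] \le e^{-c}.
```

So the probability of landing outside the fringe of width $c/d$ is at most $e^{-c}$, which does not depend on d and can be made small by choosing c large.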

The volume of the Euclidean ball of radius r in $\mathbb{R}^{n}$ is

$$V_{n}(r) = \frac{\pi^{n/2} r^{n}}{\Gamma\left(1 + \frac{n}{2}\right)},$$

where $\Gamma$ is Euler’s gamma function, which extends the usual factorial function to non-integer arguments. For a positive integer n, $\Gamma(n) = (n - 1)!$ and $\Gamma\left(n + \frac{1}{2}\right) = \left(n - \frac{1}{2}\right)\left(n - \frac{3}{2}\right) \cdots \frac{1}{2} \cdot \pi^{1/2}$. A simple method to sample points uniformly at random from a low-dimensional unit ball centered at the origin is as follows: choose each coordinate $x_{i}$ of the point uniformly at random in the range $[-1, 1]$. This amounts to choosing the point x uniformly at random from the box of side length 2 in each coordinate, centered at the origin. We discard the point if it falls outside the unit ball and regenerate the sample as described above until we obtain a point inside the unit ball. This simple Monte Carlo procedure works in low dimensions but soon becomes computationally infeasible, as it ends up discarding too many points: the unit ball occupies a vanishingly small fraction of the bounding box (indeed, $\lim_{n \to \infty} V_{n}(1) = 0$, while the volume of the box is $2^{n}$). There are several more sophisticated ways to efficiently sample points uniformly from the n-dimensional ball. We will discuss a suitable efficient sampling procedure in the next section.
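To see how quickly the simple rejection method degrades, the sketch below computes the fraction of the bounding box $[-1, 1]^{n}$ occupied by the unit ball, which is the probability that a single draw is accepted, and implements the rejection sampler itself. It uses only the Python standard library; the function names are illustrative and not taken from the paper.

```python
import math
import random


def unit_ball_volume(n):
    """Volume V_n(1) of the n-dimensional Euclidean unit ball."""
    return math.pi ** (n / 2) / math.gamma(1 + n / 2)


def acceptance_probability(n):
    """Probability that a uniform point of the box [-1, 1]^n lies in the unit ball."""
    return unit_ball_volume(n) / 2.0 ** n


def rejection_sample_ball(n, rng=random):
    """Draw uniform points from [-1, 1]^n until one falls inside the unit ball."""
    while True:
        x = [rng.uniform(-1.0, 1.0) for _ in range(n)]
        if sum(c * c for c in x) <= 1.0:
            return x
```

The expected number of draws per accepted point is `1 / acceptance_probability(n)`, which grows very rapidly with n; this is why the rejection method is practical only in low dimensions.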

We recall some basic probability distributions useful for the paper.

Definition 1 (Wigner’s Semicircle distribution). It is a probability distribution on $[-R, R]$ for a positive real number R whose probability density function is defined as

$$f(x) = \frac{2}{\pi R^{2}} \sqrt{R^{2} - x^{2}}$$

for $-R \leq x \leq R$, and $f(x)$ is defined to be 0 if $|x| > R$.

Definition 2 (Power Law Distribution). A power law distribution has the form $y = k x^{\alpha}$, where $x, y$ are the variables of interest, k is a constant, and $\alpha$ is the exponent.

Next, we recall some basic terminology related to graphs. An undirected graph G is a pair $G = (V, E)$, where the finite set V is called the set of vertices and E is a collection of unordered pairs of vertices. Elements of the set E are called edges. For an edge $e = \{i, j\} \in E$, the vertices $i, j$ are called the end points of the edge, and the edge e is said to be incident on $i, j$. The degree of a vertex $i \in V$ is the number of edges incident on i. The adjacency matrix of G is a $|V| \times |V|$ matrix whose $(i, j)^{th}$ entry is 1 if $\{i, j\} \in E$ and 0 otherwise. For undirected graphs, the adjacency matrix is symmetric and therefore has real eigenvalues. We denote the eigenvalues by $\lambda_{1} \geq \lambda_{2} \geq \dots \geq \lambda_{n}$, where $n = |V|$. The collection of eigenvalues is called the spectrum of the graph. The spectrum carries a lot of interesting information about the graph. For example, for d-regular graphs, $\lambda_{1} = d$, and $\lambda_{1} \neq \lambda_{2}$ if and only if the graph is connected; a connected graph is bipartite if and only if $\lambda_{n} = -\lambda_{1}$. For basic properties of the spectrum of a graph, we refer to [27]. Refer to [28] for applications to computer science.

Let D be the diagonal matrix whose $(i, i)^{th}$ entry is the degree of vertex i, and let A be the adjacency matrix. Let $\mathcal{A} = D^{-1/2} A D^{-1/2}$ be the normalized adjacency matrix. Let $L = D - A$ be the Laplacian matrix and $\mathcal{L} = D^{-1/2} L D^{-1/2}$ be the normalized Laplacian matrix [29].
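As a small illustration of these matrices, the sketch below builds the adjacency matrix, degree sequence, Laplacian, and the two normalized matrices for a graph given as an edge list. It uses the standard symmetric normalization $D^{-1/2} M D^{-1/2}$ and plain Python lists. The function name `graph_matrices` and the convention of setting entries involving an isolated vertex to 0 are assumptions made for this sketch.

```python
import math


def graph_matrices(n, edges):
    """Return (A, deg, L, normalized A, normalized L) for an undirected graph
    on vertices 0..n-1 given as a list of (i, j) pairs."""
    A = [[0.0] * n for _ in range(n)]
    for i, j in edges:
        A[i][j] = A[j][i] = 1.0
    deg = [sum(row) for row in A]
    # Laplacian L = D - A, where D is the diagonal degree matrix.
    L = [[(deg[i] if i == j else 0.0) - A[i][j] for j in range(n)]
         for i in range(n)]

    def normalize(M):
        # Entry (i, j) of D^{-1/2} M D^{-1/2}; isolated vertices give 0 (assumption).
        return [[M[i][j] / math.sqrt(deg[i] * deg[j])
                 if deg[i] > 0 and deg[j] > 0 else 0.0
                 for j in range(n)] for i in range(n)]

    return A, deg, L, normalize(A), normalize(L)
```

For a graph without isolated vertices, the normalized Laplacian returned here equals the identity matrix minus the normalized adjacency matrix.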

Next, we define random graph models useful for the discussion in the paper.

Definition 3 (Erdos–Renyi Random Graphs). The $G(n, p)$ model for random graphs due to Erdos and Renyi is defined as follows. Here n is the number of vertices. For each pair of vertices $i, j$ in the vertex set of G, there is an edge between i and j with probability p, independently of all other pairs.

Depending on certain thresholds on p, the graph may have several connected components, it may have one huge connected component, or it may be connected. We are more interested in the connected regime. It is well known that the degree distribution of the Erdos–Renyi graph follows a binomial distribution (see [25]) and that the eigenvalue distribution follows Wigner’s semicircle law (see [30]). For a thorough discussion of various properties of the model, we refer to Section 8 of [25].
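The $G(n, p)$ model is simple to simulate. The following is a minimal sketch using the Python standard library; the function name `erdos_renyi` and the adjacency-set representation are choices made for illustration.

```python
import random


def erdos_renyi(n, p, rng=None):
    """Sample G(n, p): each of the n(n-1)/2 vertex pairs becomes an edge
    independently with probability p. Returns a dict of adjacency sets."""
    if rng is None:
        rng = random.Random()
    adj = {v: set() for v in range(n)}
    for i in range(n):
        for j in range(i + 1, n):
            if rng.random() < p:
                adj[i].add(j)
                adj[j].add(i)
    return adj
```

The degree of vertex `v` is `len(adj[v])`, so an empirical degree histogram can be obtained by sampling the graph repeatedly and counting these values.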

Definition 4 (Random Geometric Graphs). The random geometric graph $G(n, r)$ is defined as follows. Choose n points uniformly at random from $B(0, 1)$ in dimension d, and put an edge between two vertices $x, y$ if $\|x - y\| \leq r$.

We refer to [13] for comprehensive treatment of geometric random graphs.

3. Random Covering Graphs

In this section, we define our random geometric graph family $G(d, r)$, where d is the dimension of the Euclidean space and r is a parameter which takes real values between 0 and 1.

Sampling from the unit n-ball: Before describing the new family of graphs, we first describe a standard efficient procedure to sample points uniformly from an n-dimensional ball [25]:

1. For $i = 1$ to n, let $x_{i}$ be chosen independently according to the Gaussian distribution $f(x) = \frac{1}{\sigma \sqrt{2 \pi}} e^{-\frac{1}{2} \left(\frac{x - \mu}{\sigma}\right)^{2}}$ with mean $\mu = 0$ and standard deviation $\sigma = 1$.
2. Choose u uniformly at random in $[0, 1]$.
3. Let $y = \frac{u^{1/n}}{\sqrt{x_{1}^{2} + x_{2}^{2} + \dots + x_{n}^{2}}} (x_{1}, x_{2}, \dots, x_{n})$.

The vector y is uniformly distributed inside the n-dimensional ball of radius 1 centered at the origin. The Gaussian vector x sampled in Step 1 has a uniformly random direction, so after normalization by its length it is a uniform point on the surface of the unit sphere centered at the origin. In Step 3, we scale this point by $u^{1/n}$ to obtain a point y inside the unit ball. Figure 1 demonstrates random samples chosen from the 2D ball.
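The three sampling steps can be sketched in a few lines of Python using only the standard library. The function name `sample_unit_ball` and the optional `rng` argument are illustrative. Redrawing in the (probability zero) event of an all-zero Gaussian vector is a defensive choice added for this sketch.

```python
import math
import random


def sample_unit_ball(n, rng=None):
    """Return a point distributed uniformly in the n-dimensional unit ball."""
    if rng is None:
        rng = random.Random()
    # Step 1: independent standard Gaussian coordinates (uniformly random direction).
    while True:
        x = [rng.gauss(0.0, 1.0) for _ in range(n)]
        norm = math.sqrt(sum(c * c for c in x))
        if norm > 0.0:
            break
    # Step 2: u uniform in [0, 1); u**(1/n) is the radius of the sampled point.
    u = rng.random()
    # Step 3: scale the unit direction x / ||x|| by u**(1/n).
    scale = u ** (1.0 / n) / norm
    return [scale * c for c in x]
```

The exponent $1/n$ reflects that the volume of a ball of radius $\rho$ scales as $\rho^{n}$: a fraction $\rho^{n}$ of the samples should land within distance $\rho$ of the origin, and $\Pr[u^{1/n} \leq \rho] = \rho^{n}$.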

Random covering graph construction: The random covering graph family $G(d, r)$ is defined as follows.

1. Let $S = \emptyset$.
2. Choose x uniformly at random from $B(0, 1)$.
3. Repeat Step 2 until $d(x, y) > r$ for all points $y \in S$. If Step 2 has to be repeated more than $2/r^{d}$ times without finding such a point, go to Step 5; otherwise go to Step 4.
4. Include x in S and go to Step 2.
5. Define the graph G whose vertex set is S, with an edge between two vertices $x, y$ of G if $d(x, y) \leq 2r$.
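The construction can be sketched as follows in Python (standard library only). This is one reading of the steps, not the authors' implementation: the process stops once more than $2/r^{d}$ consecutive draws fail to produce a point farther than r from every point of S. The names `sample_unit_ball` and `random_covering_graph` and the `seed` parameter are illustrative.

```python
import math
import random


def sample_unit_ball(d, rng):
    """Uniform point in the d-dimensional unit ball (Gaussian direction, radius u**(1/d))."""
    while True:
        x = [rng.gauss(0.0, 1.0) for _ in range(d)]
        norm = math.sqrt(sum(c * c for c in x))
        if norm > 0.0:
            break
    scale = rng.random() ** (1.0 / d) / norm
    return [scale * c for c in x]


def random_covering_graph(d, r, seed=None):
    """Return (S, edges): the chosen points and the index pairs at distance <= 2r."""
    rng = random.Random(seed)
    threshold = 2.0 / r ** d      # stopping threshold from Step 3
    S = []
    failures = 0                  # consecutive draws rejected since the last accepted point
    while failures <= threshold:
        x = sample_unit_ball(d, rng)
        if all(math.dist(x, y) > r for y in S):
            S.append(x)           # fresh point: keep it and restart the count
            failures = 0
        else:
            failures += 1
    edges = [(i, j) for i in range(len(S)) for j in range(i + 1, len(S))
             if math.dist(S[i], S[j]) <= 2.0 * r]
    return S, edges
```

Each candidate is compared against every point of S, so this sketch takes time quadratic in the number of vertices and is only suitable for small experiments.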

Basically, we repeatedly sample points uniformly in the unit ball until we obtain a fresh point, that is, a point which is at a distance of more than r from all the points chosen so far. To obtain a process with bounded running time, we need to identify the situation in which we are unlikely to obtain a fresh point anymore. To do this, we use the threshold in Step 3, which is based on a simple geometric distribution argument. Let V denote the union of all balls of radius r centered at the points chosen thus far, intersected with the unit ball centered at the origin, that is,

$$V = \left( \bigcup_{x \in S} B(x, r) \right) \cap B(0, 1).$$

Intuitively, if $Vol(B(0, 1) \setminus V) < Vol(B(0, r))$, where $Vol(T)$ denotes the volume of T, then it is unlikely that we will find a fresh point anymore. Indeed, suppose $Vol(B(0, 1) \setminus V) < Vol(B(0, r))$. Then the expected number of times we need to pick a point uniformly at random from $B(0, 1)$ until we obtain a fresh point is

$$\frac{Vol(B(0, 1))}{Vol(B(0, 1) \setminus V)} > \frac{1}{r^{d}}$$

(this follows from the basic properties of the usual geometric distribution). This gives an intuitive justification for the choice of our threshold in Step 3.
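For completeness, the geometric distribution argument can be written out as below. The symbols q (the probability that a single draw is fresh) and T (the number of draws until the first fresh point) are introduced here for illustration.

```latex
\begin{aligned}
q &= \frac{\mathrm{Vol}(B(0,1) \setminus V)}{\mathrm{Vol}(B(0,1))}, \qquad
\Pr[T = t] = (1-q)^{t-1}\, q, \qquad
\mathbb{E}[T] = \frac{1}{q}, \\
\mathrm{Vol}(B(0,1) \setminus V) &< \mathrm{Vol}(B(0,r)) = r^{d}\, \mathrm{Vol}(B(0,1))
\;\Longrightarrow\; q < r^{d}
\;\Longrightarrow\; \mathbb{E}[T] > \frac{1}{r^{d}}.
\end{aligned}
```

The threshold $2/r^{d}$ used in Step 3 is twice this lower bound on the expected waiting time.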

Note that if there are no fresh points left, then for every point $y \in B(0, 1)$ there is an $x \in S$ such that $y \in B(x, r)$, that is, $B(0, 1) \subseteq \bigcup_{x \in S} B(x, r)$: the union of the balls of radius r placed at the points in S covers the entire unit ball $B(0, 1)$. This is why we name our family the random covering graph family.

There are a couple of important features of the above construction. In general geometric graphs, two vertices are connected if they are at a small distance from each other. In random covering graphs, the vertices connected by an edge are likewise at a “small” distance (at most 2r), but the process ensures that they are not at “too small” a distance from each other, because any two chosen points are more than r apart. This is the key distinguishing feature of the covering graph family compared to general random geometric graphs, and it is the crucial reason behind the deviations observed in the degree distribution and eigenvalue distribution. Typically, for Erdos–Renyi graphs as well as general geometric random graphs, one is also interested in aspects such as the number of connected components, the threshold beyond which a large connected component appears in the graph, etc. We note that covering graphs are almost always connected. If we want to understand the behavior of our model in the regime where the graph may have several connected components, we can do so by setting a smaller threshold in Step 3 above. We summarize this in the observation below.

Observation 1. The random covering graph $G(d, r)$ is connected with very high probability. If we want to work with random covering graphs with multiple connected components, this can be achieved by setting a smaller threshold in Step 3 above.

Claim 1. The number of vertices of $G(d, r)$ and the degree of any vertex of $G(d, r)$ are upper-bounded by $e^{2d/r}$ and $5^{d}$, respectively.

Proof. For any two vertices $x, y$ of G, we know that $d(x, y) > r$, and all the vertices lie inside $B(0, 1)$. Place balls of radius $r/2$ centered at each vertex of G. From the triangle inequality, it follows that $B(x, r/2) \cap B(y, r/2) = \emptyset$. Again, as all vertices of G lie inside $B(0, 1)$, the triangle inequality gives $B(x, r/2) \subseteq B(0, 1 + r/2)$. That is, the small balls are pairwise disjoint and contained inside the ball of radius $1 + r/2$ centered at the origin. So a simple packing argument implies that the number of vertices of G is upper-bounded by

$$\frac{Vol(B(0, 1 + r/2))}{Vol(B(0, r/2))} = \frac{(1 + r/2)^{d}}{(r/2)^{d}} = \left(1 + \frac{2}{r}\right)^{d} \leq e^{2d/r},$$

where the last step uses $1 + t \leq e^{t}$ with $t = 2/r$.

Suppose x is a vertex of G and $y_{1}, y_{2}, \dots, y_{k}$ are the neighbors of x in G. So we have $d(x, y_{j}) \leq 2r$ for every j. Clearly, $d(y_{i}, y_{j}) > r$ for $i \neq j$ from the definition of G. Again using the triangle inequality, we see that the balls of radius $r/2$ placed at $y_{1}, y_{2}, \dots, y_{k}$ are pairwise disjoint and contained in $B(x, 2r + r/2)$. This implies $k \leq \frac{(5r/2)^{d}}{(r/2)^{d}} = 5^{d}$. □

So the number of vertices is upper-bounded by an exponential function of $1/r$, whereas the maximum degree is upper-bounded by $5^{d}$, which is a constant (independent of r).

We have experimentally computed the average number of vertices and the average number of edges for various values of r. To compute the average, we run the same experiment repeatedly for a “large” number of iterations with fresh random bits and compute the average number of vertices and edges across the iterations. We plot the logarithm of the average number of vertices against $1/r$; the plot suggests that $\log |V| \propto 1/r$, where $|V|$ is the average number of vertices. Similarly, the linearity of $\log |E|$ with respect to $1/r$ is visible in the plot in Figure 2, where $|E|$ is the average number of edges. The linear approximation becomes better as the dimension increases. Below, we give plots only for the 2D case, to demonstrate the nature of the relationship.

Observation 2. Let $G(d, r)$ be a random covering graph with vertex set V and edge set E. Then $\log |V|, \log |E| \propto 1/r$.

4. Degree Distribution

In this section, we contrast and compare the degree distribution of Erdos–Renyi random graphs as well as general geometric random graphs with the degree distribution of random covering graphs.

Let $G(n, p)$ be an Erdos–Renyi graph. Since p is the probability of an edge being present, the expected degree of any vertex is clearly $p(n - 1) \approx pn$. It follows easily that the degree distribution is given by

$$\Pr[\text{a vertex has degree } k] = \binom{n - 1}{k} p^{k} (1 - p)^{n - k - 1} \approx \binom{n}{k} p^{k} (1 - p)^{n - k}.$$

Using tail inequalities like the Chernoff bound, it can be shown that the above binomial distribution falls exponentially fast as we shift away from the mean. For the general geometric graphs too, similar behavior has been observed. We demonstrate in Figure 3 our experimental observation displaying the degree distribution for random geometric graphs by varying parameter r.
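The exact binomial degree distribution is easy to evaluate numerically, which makes the rapid decay away from the mean visible for any chosen n and p. The following is a minimal sketch; the function name `er_degree_pmf` is illustrative.

```python
import math


def er_degree_pmf(n, p, k):
    """Probability that a fixed vertex of G(n, p) has degree exactly k."""
    if not 0 <= k <= n - 1:
        return 0.0
    return math.comb(n - 1, k) * p ** k * (1.0 - p) ** (n - 1 - k)
```

Summing `k * er_degree_pmf(n, p, k)` over `k` recovers the expected degree $p(n - 1)$.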

To obtain the degree distribution, we run the random experiment a large number of times with fresh random bits. For every iteration, we compute the degree histogram by counting the number of vertices of each degree. Then, we take the average across the iterations. The number of iterations is chosen to be sufficiently large so that the distribution converges. We repeat the experiment for values of r in a certain interval, each time changing the value of r by a small constant. The specific interval is chosen so that it captures the interesting features of the distribution while the computation can still be performed in a reasonable amount of time. We cannot choose r to be too small since, from Observation 2, we know that the number of vertices grows exponentially with the reciprocal of r. In Figure 4, we demonstrate the distributions observed for dimensions 2 to 6.
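The averaging procedure described above can be sketched generically as follows. The helper takes a caller-supplied function `sample_degrees` that generates one random graph and returns the list of its vertex degrees; both names are hypothetical and introduced only for this sketch.

```python
from collections import Counter


def average_degree_histogram(sample_degrees, trials):
    """Average, over `trials` independent runs, the number of vertices of each degree.

    `sample_degrees` is a zero-argument callable returning the degree of every
    vertex of one freshly sampled graph.
    """
    totals = Counter()
    for _ in range(trials):
        totals.update(Counter(sample_degrees()))
    return {k: totals[k] / trials for k in sorted(totals)}
```

Any graph generator can be plugged in by wrapping it in a function that returns its degree list.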

As remarked in the introduction, the degree distributions of several graphs arising in practice do not exhibit sharp drops as one moves away from the mean degree. They rather drop slowly, resulting in a broader distribution, which is referred to as “heavy tail” or “fat tail” in the literature [24]. The degree distribution of random covering graphs is quite broad, as noted in Figure 4. This suggests the possibility of using covering graphs to model real-world networks. Even though covering graphs are almost always connected, if one wants them to model disconnected graphs, then one can set an appropriately smaller threshold in the stopping criterion in Step 3, as noted in Observation 1.

Shape of degree distribution and plausible connection with kissing number:

As observed in Figure 4, we see two lobes in the degree distribution of random covering graphs. It is a very interesting problem to understand the mathematical reason behind this shape. As we increase the dimension, the second lobe moves towards the right, and the local maximum corresponding to the second lobe drops. While defining the covering graphs, we have additionally required that the chosen points are not too close to each other. This must be crucially connected to the observed shape of the distribution. From Claim 1, we know that the degree of any vertex is upper-bounded by $5^{d}$, but the distribution still seems to be concentrated around quite small values (e.g., 7 to 8 in 2D, and 11 to 18 in 3D). It is interesting to note that the two bulges emerge in the region around the kissing number in the respective dimension. The kissing number in dimension d is the largest number of unit spheres one can place touching a central unit sphere such that no two spheres overlap. The problem has an interesting history, dating back to Newton and Kepler. We refer to an excellent book by Conway and Sloane [15] and the article [17] for the history and results related to the problem. For 3-dimensional space, the answer was not known at that time; it was known only to be either 12 or 13. Even today, the exact kissing numbers are known only for dimensions 1, 2, 3, 4, 8 and 24 [15,16,17], which are, respectively, 2, 6, 12, 24, 240 and 196,560. For other dimensions, only lower bounds and upper bounds are known (see [15,18] for the current best-known upper bounds and lower bounds). All of the above results except those for dimensions 1 and 2 rely on deep mathematics, and some of the proofs even require computer assistance.

Clearly, the properties of RCGs are linked with the geometric properties of the underlying space. We believe that insisting on a lower bound on the distance between the chosen points strengthens this linkage, in contrast to general geometric random graphs.

The first local maxima of degree distribution graph for RCG are close to the kissing numbers in dimensions 2, 3, 4 and 5. Does it indicate some relation between RCG and the kissing number problem? It might be quite speculative to say that there is a connection between the kissing number problem and the degree distribution of random covering graphs; nevertheless, it would be definitely worthwhile to explore the plausibility of such a connection. Even if there might not be a direct connection with the kissing number, we believe that the mathematical study of this model certainly sheds some light on the local geometric properties of Euclidean space. It will be an interesting problem from a purely mathematical perspective.

Observation 3.

The degree distributions of random covering graphs have two bulges unlike the distribution for general geometric random graphs. They are located near the kissing number in the respective dimensions for dimensions 2, 3, 4, 5.

5. Spectrum of RCG

In this section, we briefly comment on our experimental observations regarding the spectrum of RCGs in comparison with geometric random graphs and Erdos–Renyi graphs. The spectra of Erdos–Renyi graphs as well as geometric random graphs are well studied and follow Wigner’s semicircle distribution [30]. We see that the eigenvalue distribution for RCGs shows quite a bit of deviation from Wigner’s semicircle distribution; in fact, it resembles a power law distribution. We demonstrate in Figure 5 the eigenvalue distribution for RCGs of the normalized Laplacian matrix as well as the normalized adjacency matrix. The virtual dip observed at the end of the waveform is due to the averaging effect. In Figure 6, we show the fit of a degree-6 polynomial to the eigenvalue distribution for RCGs. In Figure 7, we demonstrate the spectrum for random geometric graphs.

Observation 4.

The eigenvalue distribution of RCGs resembles a power law distribution, so RCGs might be useful for modelling some networks arising in practice.

6. Application in Computer Science

We have observed that both the degree distribution and the eigenvalue distribution of RCGs deviate from those of Erdos–Renyi random graphs and general geometric random graphs. This is interesting in itself, since RCGs are also geometric random graphs of a special kind. Power law or log power law degree distributions have been observed in several practically useful networks [19,20]. Moreover, the degree distribution of RCGs is right skewed with a heavy tail, a common feature of many practical networks [19,20,21]. The spectrum of RCGs is closer to a power law distribution, unlike other random graph models whose spectra are close to Wigner’s semicircle distribution [22,23,24]. Our observations indicate that RCGs might be useful in modelling networks arising in practice.

7. Concluding Remarks

We have introduced a new family of geometric random graphs, studied its degree distribution and eigenvalue distribution, and compared our experimental findings with the respective distributions for Erdos–Renyi graphs and geometric random graphs. It will be interesting to find the mathematical reason for the emergence of two bulges in the degree distribution, and to explore a plausible connection between the kissing number and the degree distribution of RCGs.

We have explored a few basic properties of RCGs. It will be interesting to study other parameters, such as conductance, diameter, clustering coefficient, chromatic number, number of triangles and community structure, which are useful in the analysis of networks that arise in practice. Our current work was mainly experimental, so it would be interesting to analyze the studied parameters mathematically; we are pursuing this direction of research. So far, we have not performed any specific optimizations to reduce the computational time, but such optimizations are essential if the scope of the experiments is to be extended. An operation that must be performed repeatedly is checking whether the distance between a pair of points is within a threshold. Geometric space-partitioning data structures, possibly with hierarchical partitioning into larger regions, could be used to reduce the number of such comparisons.
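As an illustration of the idea of reducing distance comparisons, the sketch below uses a flat uniform grid with cells of side r, which is simpler than the hierarchical partitioning mentioned above and is not something implemented in this work. The function name `pairs_within` is hypothetical. Two points at distance at most r differ by at most r in every coordinate, so they always lie in the same cell or in adjacent cells, and only those cells need to be compared.

```python
import math
from collections import defaultdict
from itertools import product


def pairs_within(points, r):
    """Return sorted index pairs (i, j), i < j, whose Euclidean
    distance is at most r, using a uniform grid of cell side r."""
    dim = len(points[0])
    # Bucket every point by the integer coordinates of its grid cell.
    grid = defaultdict(list)
    for idx, p in enumerate(points):
        grid[tuple(math.floor(c / r) for c in p)].append(idx)
    # Offsets to the 3^dim cells surrounding (and including) a cell.
    offsets = list(product((-1, 0, 1), repeat=dim))
    found = set()
    for cell, members in grid.items():
        for off in offsets:
            neighbour = tuple(c + o for c, o in zip(cell, off))
            # .get avoids creating empty cells while iterating the grid.
            for j in grid.get(neighbour, ()):
                for i in members:
                    if i < j and math.dist(points[i], points[j]) <= r:
                        found.add((i, j))
    return sorted(found)


print(pairs_within([(0.0, 0.0), (0.5, 0.0), (2.0, 2.0)], 1.0))  # [(0, 1)]
```

The number of neighbouring cells grows as 3^d with the dimension d (243 cells for d = 5), so in higher dimensions a tree-based structure may be a better fit than a flat grid.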

Author Contributions

Conceptualization, P.J. and S.S.; mathematical formulation, K.J. and P.J.; methodology, K.J., P.J. and S.S.; programing, K.J. and P.J.; validation, K.J., P.J. and S.S.; formal analysis, P.J.; investigation, K.J., P.J. and S.S.; writing—original draft preparation, P.J. and S.S.; funding acquisition, P.J. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Science and Engineering Research Board (SERB), Government of India, grant number CRG/2020/001456. The authors would like to thank SERB for the funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data sharing is not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Erdos, P.; Renyi, A. On Random Graphs. Publ. Math. 1959, 6, 290–297.
2. Bollobas, B. Random Graphs; Academic Press: Cambridge, MA, USA, 1995.
3. Alon, N.; Spencer, J.H.; Erdös, P. The Probabilistic Method; Wiley: Hoboken, NJ, USA, 1995.
4. Gilbert, E.N. Random plane networks. J. Soc. Ind. Appl. Math. 1961, 9, 533–553.
5. Peralta-Martinez, K.; Méndez-Bermúdez, J.A. Directed random geometric graphs: Structural and spectral properties. J. Phys. Complex. 2022, 4, 439–561.
6. Adhikari, K.; Adler, R.; Bobrowski, O.; Rosenthal, R. On the Spectrum of Dense Random Geometric Graphs. arXiv 2022, arXiv:2004.04967.
7. Haenggi, M.; Andrews, J.G.; Baccelli, F.; Dousse, O.; Franceschetti, M. Stochastic geometry and random graphs for the analysis and design of wireless networks. IEEE J. Sel. Areas Commun. 2009, 27, 1029–1046.
8. Estrada, E.; Sheerin, M. Consensus dynamics on Random Rectangular Graphs. Phys. D Nonlinear Phenom. Nonlinear Dyn. Interconnected Netw. 2016, 323–324.
9. Solovey, K.; Salzman, O.; Halperin, D. New perspective on sampling-based motion planning via Random Geometric Graphs. Int. J. Robot. Res. 2016, 37.
10. Preciado, V.; Jadbabaie, A. Spectral analysis of virus spreading in random geometric networks. In Proceedings of the 48th IEEE Conference on Decision and Control (CDC) Held Jointly with 2009 28th Chinese Control Conference, Shanghai, China, 15–18 December 2009.
11. Higham, D.; Rasajski, M.; Przulj, N. Fitting a geometric graph to a protein-protein interaction network. Bioinformatics 2006, 24, 1093–1099.
12. Duchemin, Q.; de Castro, Y. Random Geometric Graph: Some recent developments and perspectives. arXiv 2022, arXiv:2203.15351.
13. Penrose, M. Random Geometric Graphs; Oxford University Press: Oxford, UK, 2002.
14. Ajtai, M.; Kumar, R.; Sivakumar, D. A sieve algorithm for the shortest lattice vector problem. In Proceedings of the 33rd Annual ACM Symposium on Theory of Computing, Heraklion, Crete, Greece, 6–8 July 2001; Vitter, J.S., Spirakis, P.G., Yannakakis, M., Eds.; ACM: New York, NY, USA, 2001; pp. 601–610.
15. Conway, J.H.; Sloane, N.J.A. Sphere Packings, Lattices and Groups; Springer: Berlin/Heidelberg, Germany, 1993.
16. Musin, O.R. The Kissing Number in 4 dimensions. Ann. Math. 2008, 168, 1–32.
17. Pfender, F.; Ziegler, G.M. Kissing Numbers, Sphere Packings, and Some Unexpected Proofs. Not. Am. Math. Soc. 2006, 51, 873–883.
18. Cohn, H. Kissing Numbers. 2023. Available online: https://cohn.mit.edu/kissing-numbers (accessed on 1 May 2024).
19. Barabasi, A.L.; Albert, R. Emergence of scaling in random networks. Science 1999, 286, 509–512.
20. Broido, A.; Clauset, A. Scale-free networks are rare. Nat. Commun. 2019, 10, 1017.
21. McGlohon, M.; Akoglu, L.; Faloutsos, C. Statistical Properties of Social Networks. In Social Network Data Analytics; Springer: Boston, MA, USA, 2011; pp. 17–42.
22. Hamidouche, M. Spectral Analysis of Random Geometric Graphs. Available online: https://theses.hal.science/tel-03135086/document (accessed on 1 May 2024).
23. Dettmann, C.P.; Georgiou, O.; Knight, G. Spectral statistics of random geometric graphs. arXiv 2016, arXiv:1608.01154v2.
24. Mihail, M.; Papadimitriou, C.H. On the Eigenvalue Power Law. In Proceedings of the Randomization and Approximation Techniques, 6th International Workshop, RANDOM 2002, Cambridge, MA, USA, 13–15 September 2002; Proceedings; Lecture Notes in Computer Science. Rolim, J.D.P., Vadhan, S.P., Eds.; Springer: Berlin/Heidelberg, Germany, 2002; Volume 2483, pp. 254–262.
25. Blum, A.; Hopcroft, J.; Kannan, R. Foundations of Data Science; Cambridge University Press: Cambridge, UK, 2021.
26. Arora, S. Theorist's Toolkit. 2005. Available online: https://www.cs.princeton.edu/~arora/pubs/toolkit.pdf (accessed on 1 May 2024).
27. Chung, F. Spectral Graph Theory; Springer: Berlin/Heidelberg, Germany, 1993.
28. Hoory, S.; Linial, N.; Wigderson, A. Expander graphs and their applications. Bull. Am. Math. Soc. 2006, 43, 439–561.
29. Williamson, D.P. Bridging Continuous and Discrete Optimization. 2019. Available online: https://people.orie.cornell.edu/dpw/orie6334/ (accessed on 1 May 2024).
30. Zhao, Y. Spectral distribution of Random Graphs. Available online: https://web.mit.edu/18.338/www/2012s/projects/yz_report.pdf (accessed on 1 May 2024).

Figure 1. Uniform random sampling from 2D ball.

Figure 2. Average number of nodes and edges in RCG (solid lines indicate the actual plots and dotted lines indicate the best-fitting linear function for the respective plots in Figure 2a,b).

Figure 3. Degree distribution for random geometric graphs in 2D.

Figure 4. Degree distribution for dimensions 2, 3, 4, 5, and 6, varying r.

Figure 5. Eigenvalue distribution for RCGs.

Figure 6. Degree 6 polynomial fitting on spectrum of RCGs (The solid line represents the spectrum of RCG and dotted line is best degree 6 polynomial approximation of it).

Figure 7. Eigenvalue distribution for random geometric graphs.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
