Local Phase Transitions in a Model of Multiplex Networks with Heterogeneous Degrees and Inter-Layer Coupling

Bayrakdar, Nedim; Gemmetto, Valerio; Garlaschelli, Diego

doi:10.3390/e25050828

Open AccessFeature PaperEditor’s ChoiceArticle

Local Phase Transitions in a Model of Multiplex Networks with Heterogeneous Degrees and Inter-Layer Coupling

by

Nedim Bayrakdar

¹,

Valerio Gemmetto

¹ and

Diego Garlaschelli

^1,2,3,*

¹

Lorentz Institute for Theoretical Physics, University of Leiden, 2333 CA Leiden, The Netherlands

²

IMT School of Advanced Studies Lucca, 55100 Lucca, Italy

³

INdAM-GNAMPA Istituto Nazionale di Alta Matematica, 00185 Rome, Italy

^*

Author to whom correspondence should be addressed.

Entropy 2023, 25(5), 828; https://doi.org/10.3390/e25050828

Submission received: 1 March 2023 / Revised: 6 May 2023 / Accepted: 9 May 2023 / Published: 22 May 2023

(This article belongs to the Special Issue Recent Trends and Developments in Econophysics)

Download

Browse Figures

Versions Notes

Abstract

:

Multilayer networks represent multiple types of connections between the same set of nodes. Clearly, a multilayer description of a system adds value only if the multiplex does not merely consist of independent layers. In real-world multiplexes, it is expected that the observed inter-layer overlap may result partly from spurious correlations arising from the heterogeneity of nodes, and partly from true inter-layer dependencies. It is therefore important to consider rigorous ways to disentangle these two effects. In this paper, we introduce an unbiased maximum entropy model of multiplexes with controllable intra-layer node degrees and controllable inter-layer overlap. The model can be mapped to a generalized Ising model, where the combination of node heterogeneity and inter-layer coupling leads to the possibility of local phase transitions. In particular, we find that node heterogeneity favors the splitting of critical points characterizing different pairs of nodes, leading to link-specific phase transitions that may, in turn, increase the overlap. By quantifying how the overlap can be increased by increasing either the intra-layer node heterogeneity (spurious correlation) or the strength of the inter-layer coupling (true correlation), the model allows us to disentangle the two effects. As an application, we show that the empirical overlap observed in the International Trade Multiplex genuinely requires a nonzero inter-layer coupling in its modeling, as it is not merely a spurious result of the correlation between node degrees across different layers.

Keywords:

multiplex networks; maximum entropy models; World Trade Multiplex; mean-field Ising model

1. Introduction

The wide variety of different phenomena that occur around us are often the result of systems that emerge and (self-)organize dynamically. These systems consist of a multitude of basic constituents interacting with each other in complicated ways and forming complex patterns. Many of these systems can be represented as networks sustaining various processes. Examples of such systems include social networks, transportation networks, biological networks, financial networks, and technological networks. In particular, social, financial, and economic networks are an important class of systems that, in the wake of recent global crises (such as the 2007–2008 financial crisis, the COVID-19 pandemic, and the ongoing Ukraine crisis), have been attracting attention given the possibility of studying the propagation of shocks among their constituents. Generally, individuals, banks, firms, or countries can be represented as nodes, and the relationships among them can be represented as links [1,2,3]. Other types of economic and financial networks are obtained as some form of projection from time series data [3,4,5,6,7]. The study of these networks may increase our understanding of a variety of processes that take place through them, such as the spreading of diseases, the diffusion of (mis)information, the stability of financial markets, and the resilience of the economy.

The simplest approach is to map each constituent within a system onto a single node and to map each interaction between pairs of constituents onto a link of a single type, regardless of the nature of the interaction. In this approach, all the links in a network are treated on an equal footing, making it a single-layer network representation, which might, however, lead to an oversimplification that fails to capture the details of a multirelational system. For instance, production and trade networks are the result of the functioning of global supply chains, involving the exchange of multiple products between firms and countries, which determines nontrivial dependencies between product-specific layers of the network. In order to realistically follow the propagation of shocks in the economy, knowledge of the nature of the links is essential. The inability to properly represent multirelational systems using single-layer networks has lead to the introduction of so-called multilayer networks [8,9,10,11,12]. Multilayer networks allow us to describe multirelational systems by representing each type of relationship in a separate layer of the network, where each node is present in all layers, and the different types of connections are reported in the corresponding layers. Returning to the example of social networks, the different types of relationships between people, such as kinship, friendship, coworkership, etc., would each be represented by links in a different layer [13], and could be analyzed in their mutual dependencies.

However, in order to assess true dependencies across layers, one should use proper null models. In recent years, there has been an increase in attention towards null models of networks constructed as random graph ensembles [14,15,16,17,18,19,20]. A class of such models is the so-called Exponential Random Graph Models (ERGMs) [17,18,19,20,21,22,23,24,25,26,27]. ERGMs are used commonly within the social network analysis community, and have been more recently re-derived within a statistical physics maximum entropy framework [19,20,27]. This has allowed researchers to utilize techniques that are common in statistical physics. In the ERGM framework, one chooses the probability distribution on graphs such that it maximizes the entropy. This maximization is performed while the expected values of certain chosen graph properties are constrained to be equal to desired values.

Real-world multilayer networks have been compared against null ERGMs with independent layers [28,29]. This comparison has highlighted various properties of real multilayer networks that result from the interdependence of layers. Two such properties are the overlap and the multiplexity [9,28]. The overlap and the multiplexity essentially contain similar information and capture the correlation of a node’s connectivity across two or more layers. For example, in a social network, people may communicate with their friends through multiple means of communication, such as talking on the phone, sending emails, or sending instant text messages. In this example, the layer that represents communication through email has a significant overlap with the layer of communication through text messages. A more specific example is a study of the so-called World Trade Multiplex (representing international trade in different commodities among countries [30]), which showed that, despite the fact that each layer of the multiplex is separately well described by a maximum entropy model with given node degrees [31,32,33], the observed trade overlap across different commodity-specific layers is significantly different from the overlap predicted by a null model with independent layers [28]. This result is not unexpected, since one can imagine that the trade of a certain product between two countries may increase/decrease the possibility of the trade of a different product between the same two countries. Other examples of networks displaying a significant overlap are airport networks, on-line social games, collaboration networks, and citation networks [34,35,36].

An important conclusion that has been reached after comparing real-world multiplexes against null models with independent layers is that a significant part of the observed overlap in many real networks could actually be spuriously created by the correlations among node degrees across different layers, even if the latter are conditionally independent of each other, instead of resulting from genuine inter-layer dependencies [28,29]. Indeed, if node degrees are correlated among layers, then there will be an increased probability of a link between two nodes being present in multiple layers, while the probability of a link occurring in one layer will not necessarily influence the presence of a link occurring in another layer. The measured overlap of the network therefore consists of a part resulting from ‘spurious’ coupling between the layers and of a part resulting from genuine coupling between the layers. This spurious coupling increases as the density and/or heterogeneity of the degrees of the network increases. Real-world networks are often dense and have strongly heterogeneous degrees; therefore, the assessment of inter-layer coupling in these real-world networks will be severely affected.

The focus of this paper is the introduction of interdependencies between the layers of a multilayer network in the ERGM through the explicit inclusion of the overlap as an extra constraint. This inclusion of the overlap in the ERGM will aid us in understanding which (higher-order) properties of the network structure may be (highly) dependent on the overlap. Additionally, it will help us distinguish between the overlap in the network due to the correlation of single-node properties across layers and the overlap due to a genuine coupling between the layers. Finally, it will allow us to generate null models with the desired amount of spurious overlap and genuine overlap. It turns out that this problem is mathematically identical to solving the Ising model on a complete graph (which is also known as the mean-field Curie–Weiss model) and leads to a phase transition between a ‘multiplexed’ (magnetized) and a ‘non-multiplexed’ (non-magnetized) phase. However, the problem is more general because the locality of the constraints on the degrees of nodes will imply different parameter values, and hence different properties for the phase transitions relative to different pairs of nodes. For instance, it will, in general, not be possible to enforce a ‘zero-field’ spontaneous symmetry breaking condition for all pairs of nodes simultaneously. Therefore, for a given specification of the constraints, different pairs of nodes may realize different symmetry-broken values of their contribution to the overall inter-layer overlap. Crucially, this property arises only from the simultaneous presence of the two constraints (on the global overlap and on the heterogeneous local degrees), and would not be realized in the absence of one of them.

The rest of the paper is organized as follows: In Section 2, we mathematically define quantities and models that are relevant to this paper. This includes the derivation of a benchmark model, where the layers of the multiplex network are independent. In Section 3, we introduce, and solve analytically, our new model, where the layers of the multiplex are interdependent due to the inclusion of the overlap. Section 4 contains a discussion regarding the possible local phase transitions of the model. In Section 5, we explore our model by using various numerical methods. In Section 6, we briefly analyze the World Trade Multiplex, and show that the empirical overlap in this real-world network is not merely the result of the heterogeneity of the network, but requires a nonzero coupling between the layers in its modeling. Finally, we provide some concluding remarks in Section 7, and some technical details in Appendix A and Appendix B.

2. Background Theory

This section contains some background notions, definitions, and models.

2.1. Single-Layer Network Definitions

We will limit our discussion to the case of binary and undirected networks. A binary undirected network can be defined as a graph that is an ordered pair

G = (V, E)

, where

V = {v_{1}, v_{2}, . . ., v_{N}}

is a set of N vertices or nodes, and E is a set of unordered pairs of different vertices called edges or links. Note that the definition of E depends on the relevant class of relations between the constituents of the system. The vertex

v_{i} \in V

will be referred to simply as i throughout the rest of the paper. If

(i, j) \in E

, the vertices i and j are said to be connected, and may be referred to as neighbors of each other. The number of links L of the graph is given by the cardinality of E:

L = | E |

.

Matrix Representation

A graph G is represented by its adjacency matrix

G = {g_{i j}}

. This is an

N \times N

matrix where

g_{i j} = \{\begin{matrix} 1 & if (i, j) \in E, \\ 0 & otherwise . \end{matrix}

(1)

We define E as containing pairs of distinct vertices, which means that a vertex cannot have a connection to itself (self-loop). It is then natural to define the diagonal elements as

g_{i i} \equiv 0

. Since we limit our discussion to undirected graphs, the adjacency matrix is always symmetric,

g_{i j} = g_{j i}

, and it therefore contains

N (N - 1) / 2

independent elements that fully specify the matrix and ultimately the graph.

Degrees and Degree Distribution

One of the main topics in the analysis of complex networks is the identification of the different roles that nodes play [37]. For instance, there are a variety of measures that characterize the structural importance of a node in a network. The degree

k_{i} (G)

of the graph G is defined as the number of connections node i has to other nodes in the network.

k_{i} (G) = \sum_{j = 1}^{N} g_{i j}

(2)

The list

{k_{i} (G)}_{i = 1}^{N}

of degrees is called the degree sequence of the graph G. The degree distribution

P (k)

is defined as the fraction of nodes in the network with degree k. Real-world networks systematically show a degree distribution with heavy tails, where the degrees vary over a broad range, often spanning several orders of magnitude [38,39]. The majority of the vertices of these real-world networks have a small number of links to other vertices, while a few vertices have a relatively high number of links to other vertices, which are also referred to as ‘hubs’. An example is the World Wide Web, where some pages are incredibly popular and are pointed to by thousands of other pages, while generally, most pages are almost unknown. The heavy tails of real-world degree distributions can often be, but not necessarily, approximated by power laws of the form

P (k) \sim k^{- γ}

. In any case, vertices with a degree much larger than the average degree

〈 k 〉

occur with a non-negligible probability. This is a signature of a high level of statistical heterogeneity in real-world networks. Encoding this heterogeneity will be a crucial ingredient of our models.

2.2. Multiplex Network Definitions

A binary undirected multiplex network can be defined in terms of the previously defined single-layer networks. A multiplex network is a set

\vec{G} = {G^{α}}_{α = 1}^{M}

of M undirected binary graphs

G^{α} = (V, E^{α})

that share the same set of N nodes. In the context of multilayer networks,

G^{α}

is called a layer of

M

, and will be referred to simply as

α

throughout the rest of the paper. Note that a multiplex network is a type of multilayer network that does not allow inter-layer connections between two layers

α

and

β

where

α \neq β

.

Matrix Representation

The layer

G^{α}

and its intra-layer links can then be represented by the adjacency matrix

G^{α} = {g_{i j}^{α}}

. This is an

N \times N

matrix where

g_{i j}^{α} = \{\begin{matrix} 1 & if (i, j) \in E^{α}, \\ 0 & otherwise . \end{matrix}

(3)

Multilinks in Multiplex Networks

In order to capture the information regarding the presence of the links between the pair of nodes

(i, j)

in any of the M layers, we define the object

m_{i j} \equiv (g_{i j}^{1}, g_{i j}^{2}, \dots, g_{i j}^{M})

(4)

which is also known as the multilink of

(i, j)

. Additionally, we define the set

M_{i j}

as the set that contains all the

2^{M}

possible configurations of

m_{i j}

.

Multidegrees

The multidegree of a node

i \in V

of a multiplex network

\vec{G}

is the object

{\vec{k}}_{i} (\vec{G}) \equiv (k_{i}^{1} (\vec{G}), k_{i}^{2} (\vec{G}), \dots, k_{i}^{M} (\vec{G}))

(5)

where

k_{i}^{α} (\vec{G}) = \sum_{j \neq i}^{N} g_{i j}^{α}

(6)

is the degree of the node i in the layer

α

[9,40]. From the vector definition of the multidegree, one can obtain a scalar quantity defined as the layer-averaged degree:

{\bar{k}}_{i} (\vec{G}) = \frac{1}{M} \sum_{α = 1}^{M} k_{i}^{α} (\vec{G}),

(7)

which is the degree of node i averaged over all the M layers. Note that, in each layer

α

, the total layer-specific degree of all nodes equals twice the number of links in that layer, which we denote as

L^{α}

:

\sum_{i = 1}^{N} k_{i}^{α} (\vec{G}) = \sum_{i < j} g_{i j}^{α} = 2 L^{α} (\vec{G}) .

(8)

Summing the above relationship for the M layers, we get

M \sum_{i = 1}^{N} {\bar{k}}_{i} (\vec{G}) = \sum_{α = 1}^{M} \sum_{i < j} g_{i j}^{α} = 2 \sum_{α = 1}^{M} L^{α} (\vec{G}) = 2 L (\vec{G}),

(9)

where

L (\vec{G})

denotes the total number of links over the entire multiplex:

L (\vec{G}) = \sum_{α = 1}^{M} \sum_{i < j} g_{i j}^{α} .

(10)

Overlap

There are many properties that encode the interdependence between the layers of a multilayer network, but we will limit our discussion to one such property: the overlap. The overlap

O^{α β} (\vec{G})

between two layers

α

and

β

of the multiplex

\vec{G}

is defined as the number of links that appear in both layers

α

and

β

[34,41]:

O^{α β} (\vec{G}) = \sum_{i < j} g_{i j}^{α} g_{i j}^{β}

(11)

where, throughout the paper, using

\sum_{a < b}

and

\prod_{a < b}

, we denote a double sum and a double product for all possible (unrepeated) pairs of values of the two indices, a and b (with

a \neq b

), respectively. The global overlap

O (\vec{G})

is defined as the sum of

O^{α β} (\vec{G})

for all pairs of layers:

O (\vec{G}) = \sum_{α < β} \sum_{i < j} g_{i j}^{α} g_{i j}^{β} .

(12)

As the names of these properties suggest, they are a measure of how overlapping the layers of the multiplex network are.

2.3. Exponential Random Graph Models for Multiplexes

ERGMs are ensemble models, which means that they are defined as probability distributions over many possible (multiplex) networks. Given the observed (or desired) value

C_{i}^{*} \equiv C_{i} (\vec{G} *)

for K graph properties

{C_{i} (\vec{G})}_{i = 1}^{K}

defined on each possible multiplex

\vec{G}

(where

\vec{G} *

represents a particular, e.g., real-world, multiplex of interest), an ERGM generates a probability distribution

P (\vec{G})

over multiplex networks that maximizes the entropy, under the constraint that the expected value of

C_{i} (\vec{G})

equals

C_{i}^{*}

, for all

i = 1, K

. This method provides us with a general framework for modeling maximally random (maximum entropy) multiplex networks, to be used as null models that can be compared against the empirical multiplex

\vec{G} *

to detect higher-order patterns that are irreducible to the K enforced constraints. Maximizing the entropy subject to a set of constraints is also widely used in problems with incomplete information [42,43].

Let

G_{N}^{M}

be the set of (binary undirected) multiplex networks consisting of N vertices and M layers (note that this set includes single-layer networks for

M = 1

), let

\vec{G} = {G_{1}, G_{2}, . . ., G_{M}} \in G_{N}^{M}

be a multiplex network in that set, and let

P (\vec{G})

be the sought-for probability of

\vec{G}

within the ensemble. We want

P (\vec{G})

to be such that the expectation value of each graph observable

C_{i} (\vec{G})

(in the chosen set of K observables) is equal to the corresponding observed or desired value

C_{i}^{*}

. This type of probability distribution is also referred to as a canonical ensemble. The ideal probability distribution is the one that maximizes the Gibbs–Shannon entropy

S = - \sum_{\vec{G} \in G_{N}^{M}} P (\vec{G}) ln P (\vec{G})

(13)

under the normalization condition

\sum_{\vec{G} \in G_{N}^{M}} P (\vec{G}) = 1

(14)

and the other K constraints

C_{i}^{*} = 〈 C_{i} 〉, i = 1, \dots, K,

(15)

where

〈 C_{i} 〉 \equiv \sum_{\vec{G} \in G_{N}^{M}} P (\vec{G}) C_{i} (\vec{G}) .

(16)

The maximization of the entropy is achieved by introducing a global Lagrange multiplier

η

for the normalization condition and a specific multiplier

θ_{i}

for each constraint

〈 C_{i} 〉 = C_{i}^{*}

,

i = 1, \dots, K

. This leads to the parametric solution

P (\vec{G}, \vec{θ}) = \frac{e^{- H (\vec{G}, \vec{θ})}}{Z (\vec{θ})}

(17)

where

H (\vec{G}, \vec{θ})

is the graph Hamiltonian

H (\vec{G}, \vec{θ}) \equiv \sum_{i = 1}^{K} θ_{i} C_{i} (\vec{G}) = \vec{θ} \cdot \vec{C} (\vec{G})

(18)

and

Z (\vec{θ})

is the partition function determined by the normalization condition

Z (\vec{θ}) \equiv e^{η + 1} = \sum_{\vec{G} \in G_{N}^{M}} e^{- H (\vec{G}, \vec{θ})} .

(19)

The parametric form of

P (\vec{G}, \vec{θ})

, if inserted back into Equation (13), leads to the explicit expression for the entropy:

S (\vec{θ}) = - \sum_{\vec{G} \in G_{N}^{M}} P (\vec{G}, \vec{θ}) ln P (\vec{G}, \vec{θ}) = \vec{θ} \cdot 〈 \vec{C} 〉 + ln Z (\vec{θ}) .

(20)

2.4. Maximum Likelihood Parameter Estimation

Equations (17)–(19) fully define the ERGM, apart from the specification of the parameters

\vec{θ}

. In principle, by treating these Lagrange multipliers as free parameters, one can study the effects that the specification of certain graph observables

{C_{i}}

has on other aspects of network structure [27,44,45,46,47]. This approach, however, does not allow one to consider ERGMs as null models of a particular real network [17,19]. In the latter case, maximum likelihood parameter estimation leads to the unique (given the choice of constraints) ERGM representing a null model for a particular real (multiplex) network

\vec{G} *

, and hence, enforcing Equation (15) exactly, as we briefly recall below. This null model can then be used to detect statistically significant deviations of empirical structural properties of

\vec{G} *

from the ensemble.

The log-likelihood of the particular multiplex

\vec{G} *

is

L (\vec{G} *, \vec{θ}) = ln P (\vec{G} *, \vec{θ}) = - \sum_{i = 1}^{K} θ_{i} C_{i}^{*} - ln Z (\vec{θ}) .

(21)

This function has the following properties [19]:

\frac{\partial L (\vec{G} *, \vec{θ})}{\partial θ_{i}} = 〈 C_{i} 〉 - C_{i}^{*}

(22)

\begin{matrix} \frac{\partial^{2} L (\vec{G} *, \vec{θ})}{\partial θ_{i} \partial θ_{j}} & = - 〈 C_{i} C_{j} 〉 + 〈 C_{i} 〉 〈 C_{j} 〉 . \end{matrix}

(23)

Equation (22) means that the stationary points

\vec{θ} = \vec{θ} *

of

L

are precisely those that satisfy the constraints (15), i.e.,

{〈 C_{i} 〉}_{\vec{θ} *} = \sum_{\vec{G} \in G_{N}^{M}} C_{i} (\vec{G}) P (\vec{G}, \vec{θ} *) = \sum_{\vec{G} \in G_{N}^{M}} C_{i} (\vec{G}) \frac{e^{- \sum_{j = 1}^{K} θ_{j}^{*} C_{j} (\vec{G})}}{Z (\vec{θ})} = C_{i} (\vec{G} *), i = 1, \dots, K

(24)

where

{〈 C_{i} 〉}_{\vec{θ} *}

indicates that the ensemble average is evaluated at the values

\vec{θ} *

. Equation (23) indicates that

L

is concave, since the matrix with entries

\partial^{2} L / \partial θ_{i} \partial θ_{j}

has the form of a negative covariance matrix, and must therefore be non-positive definite [48]. The solutions

\vec{θ} *

of the coupled equations

{〈 C_{i} 〉}_{\vec{θ} *} = C_{i}^{*}

in Equation (15) can therefore be found by maximizing the log-likelihood

L

. If

\partial^{2} L / \partial θ_{i} \partial θ_{j}

is negative definite, which will be true if the functions

C_{i} (\vec{G})

are linearly independent [48] (i.e., the chosen constraints are non-redundant), then there will be, at most, one solution, and it will be the unique maximum of

L

. Maximizing a concave function is generally easier than solving the system of coupled nonlinear equations in Equation (24). Once the solution

\vec{θ} = \vec{θ} *

is found, it can be used to generate a null model of

\vec{G} *

. Moreover, inserting the value

θ^{*}

back into Equation (21) and using Equation (20), we obtain the important relation

\begin{matrix} L (\vec{G} *, \vec{θ} *) & = & ln P (\vec{G} *, \vec{θ} *) \\ = & - \sum_{i = 1}^{K} θ_{i}^{*} C_{i}^{*} - ln Z (\vec{θ} *) \\ = & - \sum_{i = 1}^{K} θ_{i}^{*} {〈 C_{i} 〉}_{\vec{θ} *} - ln Z (\vec{θ} *) \\ = & - S (\vec{θ} *), \end{matrix}

(25)

i.e., the maximized log-likelihood equals minus the entropy for the particular value

\vec{θ} *

, which in turn represents the ‘entropy of the data’ given the chosen constraints. This result allows one to easily calculate the entropy of the data

S (\vec{θ} *) = - L (\vec{G} *, \vec{θ} *)

automatically as part of the likelihood maximization procedure, rather than as a much more complicated formal sum of all configurations, as in the general definition (13).

2.5. Benchmark: Independent Layers Model

As anticipated in the Introduction, our goal is that of considering how the empirical overlap between links in different layers of a multiplex is jointly determined by both a ‘genuine’ coupling between the M layers and a ‘spurious’ correlation resulting from the heterogeneous (and correlated across layers) degrees of the N nodes. As a null benchmark before inserting both components in an ERGM of a multiplex, we first consider only the layer-averaged degrees of all vertices as constraints, as defined in Equation (7). We can therefore create a null model of a real multiplex

\vec{G} *

using the ERGM in combination with the maximum likelihood method. This model will be referred to as the Average Configuration Model (ACM), and will allow us to study the sole effects of correlated heterogeneous degrees on the inter-layer overlap. The Hamiltonian of this model, denoted as

H_{0}

, since it represents a benchmark for a more complicated model to be defined later, is

H_{0} (\vec{G}, \vec{θ}) = M \sum_{i = 1}^{N} θ_{i} {\bar{k}}_{i} (\vec{G}) = \sum_{α = 1}^{M} \sum_{i < j} (θ_{i} + θ_{j}) g_{i j}^{α}

(26)

where we have reparametrized by exposing M for convenience. The partition function is

\begin{matrix} Z_{0} (\vec{θ}) & = \sum_{\vec{G} \in G_{N}^{M}} e^{- \sum_{α = 1}^{M} \sum_{i < j} (θ_{i} + θ_{j}) g_{i j}^{α}} \\ = \sum_{\vec{G} \in G_{N}^{M}} \prod_{α = 1}^{M} \prod_{i < j} e^{- (θ_{i} + θ_{j}) g_{i j}^{α}} \\ = \prod_{α = 1}^{M} \prod_{i < j} \sum_{g_{i j}^{α} = 0}^{1} e^{- (θ_{i} + θ_{j}) g_{i j}^{α}} \\ = \prod_{α = 1}^{M} \prod_{i < j} [1 + e^{- (θ_{i} + θ_{j})}] \\ = \prod_{i < j} {[1 + e^{- (θ_{i} + θ_{j})}]}^{M} . \end{matrix}

(27)

The probability distribution over the ensemble is then given by

P_{0} (\vec{G}, \vec{θ}) = \prod_{α = 1}^{M} \prod_{i < j} \frac{e^{- (θ_{i} + θ_{j}) g_{i j}^{α}}}{1 + e^{- (θ_{i} + θ_{j})}},

(28)

from which we see that pairs of nodes and pairs of layers are all independent of each other, each entry

g_{i j}^{α}

being an independent Bernoulli random variable with success probability

p_{i j}^{α} (\vec{θ})

and expected value

{〈 g_{i j}^{α} 〉}_{\vec{θ}}

given by

p_{i j}^{α} (\vec{θ}) = {〈 g_{i j}^{α} 〉}_{\vec{θ}} = \frac{e^{- (θ_{i} + θ_{j})}}{1 + e^{- (θ_{i} + θ_{j})}} \equiv p_{i j} (\vec{θ}) .

(29)

Clearly,

p_{i j}^{α} (\vec{θ}) = p_{i j} (\vec{θ})

is the probability that a link occurs between node i and j in layer

α

, which turns out to be independent of

α

given our choice of the layer-averaged (not layer-specific) degree as a constraint.

The log-likelihood of the multiplex

\vec{G} *

is

L_{0} (\vec{G} *, \vec{θ}) = - M \sum_{i = 1}^{N} θ_{i} {\bar{k}}_{i}^{*} - M \sum_{i < j} ln [1 + e^{- (θ_{i} + θ_{j})}],

(30)

where

{\bar{k}}_{i}^{*} = {\bar{k}}_{i} ({\vec{G}}^{*})

. The parameter value

θ_{m}^{*}

maximizing the log-likelihood must satisfy

\begin{matrix} {\frac{\partial L_{0} (\vec{G} *, \vec{θ})}{\partial θ_{m}}|}_{\vec{θ} = \vec{θ} *} & = - M {\bar{k}}_{m}^{*} + M \sum_{j \neq m} \frac{e^{- (θ_{m}^{*} + θ_{j}^{*})}}{1 + e^{- (θ_{m}^{*} + θ_{j}^{*})}} = 0 \forall m \end{matrix}

(31)

or equivalently,

{\bar{k}}_{i}^{*} = \sum_{j \neq i} \frac{e^{- (θ_{i}^{*} + θ_{j}^{*})}}{1 + e^{- (θ_{i}^{*} + θ_{j}^{*})}} \forall i .

(32)

The above results show that, as expected from the general result reported in Equation (24), according to the maximum likelihood principle, the empirical layer-averaged degree

{\bar{k}}_{i}^{*} = {\bar{k}}_{i} (\vec{G} *)

of the real multiplex

\vec{G} *

is equal to the ensemble average

{〈 {\bar{k}}_{i} 〉}_{\vec{θ} *}

:

\begin{matrix} {\bar{k}}_{i}^{*} & = \sum_{j \neq i} p_{i j} (\vec{θ} *) \\ = \frac{1}{M} \sum_{α = 1}^{M} \sum_{j \neq i} p_{i j}^{α} (\vec{θ} *) \\ = \frac{1}{M} \sum_{α = 1}^{M} \sum_{j \neq i} {〈 g_{i j}^{α} 〉}_{\vec{θ} *} \\ = {〈 {\bar{k}}_{i} 〉}_{\vec{θ} *} . \end{matrix}

(33)

The probability distribution

P_{0} (\vec{G}, \vec{θ} *)

can then be written as a product of the layers:

P_{0} (\vec{G}, \vec{θ} *) = \prod_{α = 1}^{M} P_{0}^{α} (G^{α}, \vec{θ} *)

(34)

where

P_{0}^{α}

is the probability distribution over a single layer, i.e.,

P_{0}^{α} (G^{α}, \vec{θ} *) = \prod_{i < j} {[p_{i j} (\vec{θ} *)]}^{g_{i j}^{α}} {[1 - p_{i j} (\vec{θ} *)]}^{1 - g_{i j}^{α}} .

(35)

This means that each layer

α

can be generated by using the link probability

p_{i j} (\vec{θ} *)

that is equal throughout the layers. This is again a consequence of exclusively constraining properties defined as the overall averages of the layers. This null model can be used as a benchmark to determine the expected value of the inter-layer overlap

O (\vec{G})

defined in Equation (12), which is due solely to the correlation between the degree of the same node i across the M layers, and not to any genuine inter-layer dependency. This expected value is

{〈 O 〉}_{\vec{θ} *} = \sum_{α < β} \sum_{i < j} {〈 g_{i j}^{α} g_{i j}^{β} 〉}_{\vec{θ} *} = \sum_{α < β} \sum_{i < j} {〈 g_{i j}^{α} 〉}_{\vec{θ} *} {〈 g_{i j}^{β} 〉}_{\vec{θ} *} = \sum_{α < β} \sum_{i < j} p_{i j}^{2} (\vec{θ} *),

(36)

where we have used the independence

{〈 g_{i j}^{α} g_{i j}^{β} 〉}_{\vec{θ} *} = {〈 g_{i j}^{α} 〉}_{\vec{θ} *} 〈 g_{i j}^{β} 〉

between layers

α \neq β

. Deliberately, we have chosen the layer-averaged degree as the only constraint so that the expected degree of a node is the same across all layers, thereby creating a strong correlation between degrees in different layers, while keeping the layers themselves independent. Using Equations (25) and (30), we can calculate the entropy of the data, given the model, as

S_{0} (\vec{θ} *) = - L_{0} (\vec{θ} *) = - ln P_{0} (\vec{G} *, \vec{θ} *) = M \sum_{i = 1}^{N} θ_{i}^{*} {\bar{k}}_{i}^{*} + M \sum_{i < j} ln [1 + e^{- (θ_{i}^{*} + θ_{j}^{*})}],

(37)

which only requires the knowledge of

\vec{θ} *

and of the layer-averaged degrees

{\bar{k}}_{i} (\vec{G} *)

,

i = 1, N

.

3. The Overlapping Average Configuration Model

Having illustrated all the ingredients that are necessary to define and model basic properties of multiplex networks within a maximum entropy framework, in this section, we introduce a model of multiplex networks with genuinely interdependent layers. To this end, we incorporate the overlap as an extra constraint in the ERGM, and study the model in combination with the maximum likelihood method. This model is a generalization of the previous ACM benchmark, and will therefore be referred to as the Overlapping Average Configuration Model (OACM), as it includes not only the intra-layer degrees, but also the inter-layer coupling, as building blocks.

3.1. Constructing the Hamiltonian

We want to define a model of a multiplex with M layers, N vertices, and given expected layer-averaged degrees (as defined in Equation (7)) and global inter-layer overlap (as defined in Equation (12)). The Hamiltonian of our ERGM is, in this case,

H (\vec{G}, \vec{θ}, J) = M \sum_{i = 1}^{N} θ_{i} {\bar{k}}_{i} (\vec{G}) - \frac{4 J}{M} O (\vec{G}) = \sum_{i < j} \sum_{α = 1}^{M} (θ_{i} + θ_{j}) g_{i j}^{α} - \frac{4 J}{M} \sum_{i < j} \sum_{α < β} g_{i j}^{α} g_{i j}^{β}

(38)

where

(\vec{θ}, J)

are the Lagrange multipliers coupled to the

N + 1

constraints. We have defined the Lagrange multiplier for the overlap as

- 4 J / M

for later convenience. Clearly,

H (\vec{G}, \vec{θ}, J) = H_{0} (\vec{G}, \vec{θ})

where

H_{0}

is the benchmark Hamiltonian of the ACM without overlap defined in Equation (26). Using the multilink

m_{i j}

defined in Equation (4) and defining

θ_{i j} \equiv θ_{i} + θ_{j},

(39)

the Hamiltonian in Equation (38), this can be written as a sum of the pairs of vertices:

H (\vec{G}, \vec{θ}, J) = \sum_{i < j} h_{i j} (m_{i j}, θ_{i j}, J)

(40)

where

h_{i j} (m_{i j}, θ_{i j}, J) \equiv (θ_{i} + θ_{j}) \sum_{α = 1}^{M} g_{i j}^{α} - \frac{4 J}{M} \sum_{α < β} g_{i j}^{α} g_{i j}^{β}

(41)

will be referred to as the pair Hamiltonian. As we shall see in a moment, the pair Hamiltonian can be mapped exactly to a mean-field Ising model coupling the M layers homogeneously. To arrive at this mapping, we transform the Boolean variables

g_{i j}^{α} \in {0, 1}

to new ‘spin’ variables

σ_{i j}^{α} \in {- 1, 1}

, as follows:

g_{i j}^{α} = \frac{1}{2} (σ_{i j}^{α} + 1) .

(42)

From now on, we assume that M is large (multiplex with several layers) and expand expressions accordingly. By defining

s_{i j} \equiv {σ_{i j}^{1}, σ_{i j}^{2}, \dots, σ_{i j}^{M}}

(43)

as the multilink for the node pair

(i, j)

in terms of the

σ_{i j}^{α} = \pm 1

variables, we see that Equation (42) can be used to transform Equation (41) into

h_{i j} (s_{i j}, θ_{i j}, J) = (\frac{θ_{i j}}{2} - J) \sum_{α = 1}^{M} σ_{i j}^{α} - \frac{J}{M} \sum_{α < β} σ_{i j}^{α} σ_{i j}^{β} - \frac{J M}{2} + \frac{M θ_{i j}}{2} .

(44)

If we define

B_{i j} \equiv J - \frac{θ_{i j}}{2},

(45)

v_{i j} \equiv - M B_{i j} + \frac{J M}{2},

(46)

then the pair Hamiltonian finally reduces to

h_{i j} (s_{i j}, B_{i j}, J) = - B_{i j} \sum_{α = 1}^{M} σ_{i j}^{α} - \frac{J}{M} \sum_{α < β} σ_{i j}^{α} σ_{i j}^{β} + v_{i j} .

(47)

From the above expression, we see that, for every specific pair of nodes

(i, j)

, the variables

σ_{i j}^{α}

can be thought of as Ising spins residing in the M nodes of a fully connected graph, where every Ising spin interacts with every other

M - 1

spins and is coupled to a ‘field’

B_{i j}

. In terms of the multiplex networks being modeled, this means that for every specific pair of nodes

(i, j)

, the edges connecting i and j throughout the M layers are all coupled to a common ‘external’ field

B_{i j}

, and are also coupled to each other with a homogeneous interaction strength

J / M

. A positive coupling

J > 0

favors more overlap (i.e., more alignment between links in different layers), while

J < 0

disfavors the overlap. The term

v_{i j}

is an inessential overall shift in energy independent of the spin configuration. This model is identical to the mean-field Ising or Curie–Weiss model. This exact mapping is what we use in Appendix in order to solve the model analytically, and in particular, to show the existence, for each pair of nodes, of a phase transition separating a ‘magnetized’ phase and a ‘non-magnetized’ phase, which here represent a ‘multiplexed’ phase (where links in different layers tend to ‘align’ to each other) and a ‘non-multiplexed’ phase, respectively.

The full Hamiltonian (40) is a summation of the Hamiltonians of non-interacting Ising systems, each for a distinct pairs of nodes. Note, however, that despite the independence of different pairs of nodes, the pair Hamiltonians

h_{i j} (s_{i j}, B_{i j}, J)

share some parameters: J is common to all such Hamiltonians, and

h_{i j} (s_{i j}, B_{i j}, J)

and (say)

h_{i k} (s_{i k}, B_{i k}, J)

also share the parameter

θ_{i}

, because the latter appears in both

B_{i j}

and

B_{i k}

. This is the result of the original constraint on the degree of each node, which results in the same Lagrange multiplier

θ_{i}

appearing in all pair Hamiltonians involving the same node i. These common parameters imply that, even if all pairs of nodes are independent, the control parameters of all pair Hamiltonians cannot be chosen independently, resulting in a correlated phenomenology for the various pairs of nodes. In particular, as we shall see, each pair of nodes can undergo locally the typical phase transition of the mean-field Ising model, but the features of these pair-specific phase transitions are all nontrivially related to each other.

We also note, from Equations (44) and (47), that if

J = θ_{i j} / 2

(or equivalently,

B_{i j} = 0

), then the pair Hamiltonian (hence the graph probability) becomes invariant upon a global ‘spin flip’ (

σ_{i j}^{α} \to - σ_{i j}^{α}

\forall α

), which here corresponds to the replacement of each existing link with a missing link (

g_{i j}^{α} = 1 \to g_{i j}^{α} = 0

\forall α

) and, vice versa, of each missing link with an existing link (

g_{i j}^{α} = 0 \to g_{i j}^{α} = 1

\forall α

). This is due to the vanishing of the ‘external field’

B_{i j}

that, when present, selects a preferred ‘spin direction’ (up versus down), which here means a preferred density (high versus low). We expect that with the parameter choice

J = θ_{i j} / 2

, the pair of nodes

(i, j)

gains an expected

1 / 2

density of links across the M layers, i.e., an expected number of links equal to

M / 2

, corresponding to half the maximum number of links for that node pair. Additionally, if J is smaller than the critical value, this expected number of links is also the typical value, and basically, the model is not fundamentally different from a model without constraints, where the intermediate density is produced as a result of a completely uniform probability distribution for the multilink. However, if J exceeds the critical value, the intermediate average density is no longer the typical one realized by individual graphs sampled from the model: rather, it is the ensemble average of two typical (high and low) values of the realized density, just like in the equivalent spin system, below the Curie temperature, and without an external field one would typically observe, with the same probability, overall positive and negative magnetization with a zero ensemble average. The numerical simulations access the typical realized values, while the equations still govern the expected value. This situation corresponds to a ‘symmetry-broken’ phase, where the typical realizations are less symmetric than the Hamiltonian that generates them. However, here, the heterogeneity of the degrees implies different values of the external field

B_{i j} = J - θ_{i j} / 2

, which means that the zero-field spontaneous symmetry breaking condition cannot, in general, be realized for all pairs of nodes simultaneously, leading to a phenomenology governed by the interplay between the values of J and

{θ_{i}}_{i = 1}^{N}

, and ultimately between the values of the inter-layer overlap and the node degrees.

3.2. Calculating the Partition Function

The partition function defined in (19) can be written as the product

Z (\vec{θ}, J) = \sum_{\vec{G} \in G_{N}^{M}} e^{- H (\vec{G}, \vec{θ}, J)} = \sum_{\vec{G} \in G_{N}^{M}} \prod_{i < j} e^{- h_{i j} (s_{i j}, θ_{i j}, J)} = \prod_{i < j} z_{i j} (θ_{i j}, J),

(48)

where

z_{i j} (θ_{i j}, J)

is the pair partition function, which is a sum of the set

S_{i j}

of all

2^{M}

possible multilinks for

(i, j)

:

z_{i j} (θ_{i j}, J) \equiv \sum_{s_{i j} \in S_{i j}} e^{- h_{i j} (s_{i j}, θ_{i j}, J)} .

(49)

The multiplex probability can be written in terms of the multilink probabilities

P_{i j} (s_{i j}, θ_{i j}, J)

:

P (\vec{G}, \vec{θ}, J) = \prod_{i < j} P_{i j} (s_{i j}, θ_{i j}, J)

(50)

where

P_{i j} (s_{i j}, θ_{i j}, J) \equiv \frac{e^{- h_{i j} (s_{i j}, θ_{i j}, J)}}{z_{i j} (θ_{i j}, J)} .

(51)

The complete partition function and multiplex probability can therefore be obtained as products of pair-specific quantities, where each multilink can be regarded as a configuration of a Curie–Weiss system. To obtain an explicit expression for

z_{i j} (θ_{i j}, J)

, we use a Hubbard–Stratonovich transformation and the Laplace theorem [49] in the limit

M \to \infty

. The details are provided in Appendix A and are a generalization of the approach used in [50]. The final result is

z_{i j} (θ_{i j}, J) = 2^{M} e^{- \frac{M}{2} θ_{i j} - 2 J M u_{i j} (u_{i j} - 1)} {cosh}^{M} (2 J u_{i j} - \frac{θ_{i j}}{2}),

(52)

where

u_{i j}

is the solution to the equation

u_{i j} = \frac{1}{2} + \frac{1}{2} tanh (2 J u_{i j} - \frac{θ_{i j}}{2}) .

(53)

The solutions to the above equation will be discussed in the next section.

Now, given a particular real multiplex network

\vec{G} *

, the log-likelihood, as defined, in general, in Equation (21), is

L (\vec{θ}, J) = ln P (\vec{G} *, \vec{θ}, J) = \sum_{i < j} [- h_{i j} (s_{i j}^{*}, θ_{i j}, J) - ln z_{i j} (θ_{i j}, J)] .

(54)

At a stationary point of

L

, the derivatives of

L

with respect to every Lagrange multiplier must equal zero. As we show in Appendix B, this leads to the maximum likelihood equations

\sum_{j \neq i}^{N} \sum_{α = 1}^{M} {g_{i j}^{*}}^{α} = M \sum_{j \neq i}^{N} u_{i j}^{*} \forall i

(55)

\frac{4}{M} \sum_{i < j} \sum_{α < β} {g_{i j}^{*}}^{α} {g_{i j}^{*}}^{β} = 2 M \sum_{i < j} {(u_{i j}^{*})}^{2}

(56)

where

u_{i j}^{*}

, being the solution to Equation (53) with

(\vec{θ}, J)

replaced by

(\vec{θ} *, J^{*})

, is implicitly related to the maximum likelihood parameters

(\vec{θ} *, J^{*})

. Note that the quantities on the LHS of Equations (55) and (56) are precisely the quantities that we constrained from the start, namely,

M {\bar{k}}_{i}^{*}

and

4 O^{*} / M

, respectively. According to the maximum likelihood principle, these empirical quantities must equal their respective ensemble averages,

M {〈 {\bar{k}}_{i} 〉}_{θ^{*}, J^{*}}

and

4 {〈 O 〉}_{θ^{*}, J^{*}} / M

, which appear on the RHS. The quantity

u_{i j}^{*}

can therefore be considered as an average probability of a link occurring between the nodes i and j, which is equal throughout the M layers and is, therefore, a measure of the density of links in the multilink

m_{i j}

. This is similar to how we identified

p_{i j}

to be the connection probability in the ACM, which was based solely on the constraints

{\bar{k}}_{i}

. In support of this idea, we see that, in the case

J^{*} = 0

, the Lagrange multipliers

\vec{θ} *

reduce it to the value

{\vec{θ}}^{0} \equiv \vec{θ} * |_{J^{*} = 0}

, such that

{u_{i j}^{*}|}_{J^{*} = 0} = \frac{1}{2} [1 + tanh (- \frac{θ_{i}^{0} + θ_{j}^{0}}{2})] = \frac{e^{- (θ_{i}^{0} + θ_{j}^{0})}}{1 + e^{- (θ_{i}^{0} + θ_{j}^{0})}} = p_{i j} ({\vec{θ}}^{0})

(57)

which is identical to the expression in Equation (29), providing the link probability

p_{i j}

obtained in Section 2.5 in the absence of the constraint for the overlap. The quantity

u_{i j}^{*}

can therefore possibly be interpreted as a mean-field quantity that globally incorporates the layer interdependence that was introduced through the overlap

O^{*}

, but locally treats the layers as if they were independent. A characteristic of mean-field theories is that the effects of all elements of a system on a given element are approximated by a single, average effect.

Formally, we can calculate the entropy of the data, given the model, as the maximized likelihood using Equations (25) and (54):

\begin{matrix} S (\vec{θ} *, J^{*}) & = & - L (\vec{θ} *, J^{*}) \\ = & - ln P (\vec{G} *, \vec{θ} *, J^{*}) \\ = & H (\vec{G} *, \vec{θ} *, J^{*}) + \sum_{i < j} ln z_{i j} (θ_{i j}^{*}, J^{*}) \\ = & M \sum_{i = 1}^{N} θ_{i}^{*} {\bar{k}}_{i}^{*} - \frac{4 J^{*}}{M} O^{*} + \sum_{i < j} ln z_{i j} (θ_{i j}^{*}, J^{*}), \end{matrix}

(58)

which requires the knowledge of the parameters

\vec{θ} *

and

J^{*}

(which are, however, defined only implicitly through

u_{i j}^{*}

). Comparing the above expression with Equation (37), we see that

S ({\vec{θ}}^{0}, 0) = S_{0} ({\vec{θ}}^{0})

, as expected, i.e., the model with

J^{*} = 0

has the same entropy as the equivalent ACM with no overlap, for the same value of

{\vec{θ}}^{0}

. Similarly,

L ({\vec{θ}}^{0}, 0) = L_{0} ({\vec{θ}}^{0})

for the maximized likelihood in the two models. In order to understand the relationship between the entropies of the two models when

J^{*} \neq 0

, let us first note that a positive (resp. negative) coupling strength

J^{*}

means that the empirical overlap

O^{*}

is larger (resp. smaller) than the expected overlap under the null model with

J^{*} = 0

, i.e.,

O^{*} ≶ {〈 O 〉}_{{\vec{θ}}^{0}} \Leftrightarrow J^{*} ≶ 0

(59)

where we have used the notation in Equation (36). However, one should not naively conclude from the combination of Equations (58) and (59) that the entropy of the model with

J^{*} < 0

is larger than the entropy of the model with

J^{*} = 0

, because the two partition functions are different, and also because the two entropies are calculated for different Lagrange multipliers, i.e.,

{\vec{θ}}^{*} \neq {\vec{θ}}^{0}

when

J^{*} \neq 0

. In fact, we can actually show that the entropy of the model with

J^{*} \neq 0

is always smaller than the one for the model with

J^{*} = 0

. To see this, we introduce the relative entropy (or Kullback–Leibler divergence) between the two models, as follows:

R ({\vec{θ}}^{0}, \vec{θ} *, J^{*}) \equiv \sum_{\vec{G} \in G_{N}^{M}} P (\vec{G}, \vec{θ} *, J^{*}) ln \frac{P (\vec{G}, \vec{θ} *, J^{*})}{P_{0} (\vec{G}, {\vec{θ}}^{0})} \geq 0,

(60)

where the last inequality is a well-known property of the relative entropy, and the equality is realized if, and only if,

P_{0} (\vec{G}, {\vec{θ}}^{0})

and

P (\vec{G}, \vec{θ} *, J^{*})

are identical, which, in turn, requires

J^{*} = 0

, yielding

{\vec{θ}}^{0} = \vec{θ} *

and

R ({\vec{θ}}^{0}, {\vec{θ}}^{0}, 0) = 0

. For

J^{*} \neq 0

, we can write

\begin{matrix} R ({\vec{θ}}^{0}, \vec{θ} *, J^{*}) & = & \sum_{\vec{G} \in G_{N}^{M}} P (\vec{G}, {\vec{θ}}^{*}, J^{*}) ln P (\vec{G}, \vec{θ} *, J^{*}) - \sum_{\vec{G} \in G_{N}^{M}} P (\vec{G}, \vec{θ} *, J^{*}) ln P_{0} (\vec{G}, {\vec{θ}}^{0}) \\ = & - S (\vec{θ} *, J^{*}) + \sum_{\vec{G} \in G_{N}^{M}} P (\vec{G}, \vec{θ} *, J^{*}) [H_{0} (\vec{G}, {\vec{θ}}^{0}) + ln Z_{0} ({\vec{θ}}^{0})] \\ = & - S (\vec{θ} *, J^{*}) + \sum_{\vec{G} \in G_{N}^{M}} P_{0} (\vec{G}, {\vec{θ}}^{0}) [H_{0} (\vec{G}, {\vec{θ}}^{0}) + ln Z_{0} ({\vec{θ}}^{0})] \\ = & - S (\vec{θ} *, J^{*}) + \sum_{\vec{G} \in G_{N}^{M}} P_{0} (\vec{G}, {\vec{θ}}^{0}) ln P_{0} (\vec{G}, {\vec{θ}}^{0}) \\ = & - S (\vec{θ} *, J^{*}) + S_{0} ({\vec{θ}}^{0}), \end{matrix}

(61)

where we have used the fact that

H_{0} (\vec{G}, {\vec{θ}}^{0}) = M \sum_{i = 1}^{N} θ_{i}^{0} {\bar{k}}_{i} (\vec{G})

has the same expectation value, equal to

M \sum_{i = 1}^{N} θ_{i}^{0} {\bar{k}}_{i} (\vec{G} *)

, under both

P (\vec{G}, \vec{θ} *, J^{*})

and

P_{0} (\vec{G}, {\vec{θ}}^{0})

:

\begin{matrix} \sum_{\vec{G} \in G_{N}^{M}} P (\vec{G}, \vec{θ} *, J^{*}) H_{0} (\vec{G}, {\vec{θ}}^{0}) & = & M \sum_{i = 1}^{N} θ_{i}^{0} [\sum_{\vec{G} \in G_{N}^{M}} P (\vec{G}, \vec{θ} *, J^{*}) {\bar{k}}_{i} (\vec{G})] \\ = & M \sum_{i = 1}^{N} θ_{i}^{0} {\bar{k}}_{i} (\vec{G} *) \\ = & M \sum_{i = 1}^{N} θ_{i}^{0} [\sum_{\vec{G} \in G_{N}^{M}} P_{0} (\vec{G}, {\vec{θ}}^{0}) {\bar{k}}_{i} (\vec{G})] \\ = & \sum_{\vec{G} \in G_{N}^{M}} P_{0} (\vec{G}, {\vec{θ}}^{0}) H_{0} (\vec{G}, {\vec{θ}}^{0}) . \end{matrix}

(62)

Now, applying the inequality

R ({\vec{θ}}^{0}, \vec{θ} *, J^{*}) \geq 0

in Equation (60) to Equation (62), we get

0 \leq S (\vec{θ} *, J^{*}) \leq S_{0} ({\vec{θ}}^{0}),

(63)

confirming that the entropy of the model with

J^{*} \neq 0

is always smaller than the one for the model with

J^{*} = 0

, consistent with the fact that the former is more constrained than the latter.

4. Local Phase Transitions in the Model

The number of solutions of Equation (53) depends on the values of the parameters

θ_{i j} = θ_{i} + θ_{j}

and J. We illustrate this fact in Figure 1, where both the LHS and the RHS of Equation (53) are plotted as a function of

u_{i j}

for various values of

θ_{i j}

and J. The appearance of multiple solutions signals the existence of phase transitions in the limit when the number M of layers diverges, which determine abrupt changes in the value of

u_{i j}

and, therefore, also in the properties of the multilink

m_{i j}

and the structure of the multiplex as a whole. The configurations for

m_{i j}

that are separated by a phase transition are the phases of the multilink. The point where multiple solutions appear or vanish is the bifurcation point.

Figure 1 shows that, at the interval

0 \leq u_{i j} \leq 1

, there can be either one, two, or three solutions, and that for

θ_{i j} \to + \infty

or

θ_{i j} \to - \infty

there is always one solution, namely,

u_{i j} = 0

or

u_{i j} = 1

, respectively. The number of solutions depends on whether the slope (derivative) of the RHS (which depends on the parameters) exceeds the slope of the LHS (which is always equal to 1) of Equation (53) at their intersection. From now on, we will consider only the case

J \geq 0

, which corresponds to a tendency to create an increased inter-layer overlap compared with the model with

J = 0

. The case

J < 0

corresponds to the opposite case where the overlap is suppressed, which we do not discuss here. New solutions appear or vanish at the point where Equation (53) is satisfied and the derivatives of the LHS and RHS of Equation (53) are equal:

1 = J [1 - {tanh}^{2} (2 J u_{i j} - \frac{θ_{i j}}{2})] .

(64)

Equation (64) cannot be satisfied if

0 \leq J \leq 1

, since

0 \leq {tanh}^{2} (x) < 1

for

x \in R

, and, therefore, if

J \leq 1

, a phase transition is impossible, and there is a unique solution for

u_{i j}

. When

J > 1

, Equation (64) gives us two potential solution branches,

u_{i j}^{\pm} = \frac{1}{2} \pm \frac{1}{2} \sqrt{1 - 1 / J}

, where we have used

2 u_{i j} - 1 = tanh (2 J u_{i j} - θ_{i j} / 2)

. Equation (53) can be written as

θ_{i j} = 4 J u_{i j} - ln [u_{i j} / (1 - u_{i j})]

using the identity

{tanh}^{- 1} x = \frac{1}{2} ln [(1 + x) / (1 - x)]

. By then substituting

u_{i j}^{\pm}

into this expression for

θ_{i j}

, we obtain the equations for the two curves in the

(J, θ_{i j})

plane that mark the points where additional solutions appear or vanish:

θ_{i j}^{+} (J) = \frac{2 \sqrt{J}}{\sqrt{J} - \sqrt{J - 1}} - ln (\frac{\sqrt{J} + \sqrt{J - 1}}{\sqrt{J} - \sqrt{J - 1}}),

(65)

θ_{i j}^{-} (J) = \frac{2 \sqrt{J}}{\sqrt{J} + \sqrt{J - 1}} - ln (\frac{\sqrt{J} - \sqrt{J - 1}}{\sqrt{J} + \sqrt{J - 1}}),

(66)

as shown in Figure 2. In the region between the two curves, there are three solutions to Equation (53). Note that the ‘zero-field’ condition

θ_{i j} = 2 J

is always in that region when

J > 1

. This means that the condition

J > 1

is sufficient to ensure that the system is in the magnetized (symmetry-broken) phase when in the absence of the external field. However, when

θ_{i j} \neq 2 J

, the condition

J > 1

is necessary but not sufficient. In particular, generally, it may happen that, for a given value of

J > 1

, different pairs of nodes will be in different (magnetized or non-magnetized) phases depending on the value of

θ_{i j}

. This shows that the system can undergo a multitude of separate phase transitions if the parameters

{θ_{i j}}

remain fixed and J is varied.

In the magnetized phase, the phenomenon of symmetry breaking will occur: the typical realized values of the ‘magnetization’ will not coincide with the corresponding ensemble average. In the zero-field case (

θ_{i j} = 2 J

), the symmetry breaking is ‘spontaneous’, i.e., not induced by any field pointing in a preferred direction, while in the nonzero-field case, the symmetry is broken by the field itself. This well-known property of the Ising model has specific implications for our problem here. Indeed, while certain values of

θ_{i j}, J

may solve the maximum likelihood Equations (55) and (56), the corresponding solutions to Equation (53) may not necessarily maximize the likelihood, and are therefore not ‘valid’ (or stable). Once the values

θ_{i j}^{*}

and

J^{*}

that solve the maximum likelihood equations are found, the graph probability corresponding to this set of values can be written as a function of the configuration of the graph (or the collection of configurations of the multilinks

m_{i j}

), and one can check which typical configurations (those minimizing the Hamiltonian) arise. As Figure 1 suggests, in the regime where there are three solutions,

u_{i j}

, one value will be relatively high (which corresponds to a relatively high density of links in

m_{i j}

), another value will be relatively low (which corresponds to a relatively low density of links in

m_{i j}

), and the third value will be between the other two, corresponding to an intermediate density of links in

m_{i j}

. By inspecting the (pair) Hamiltonian in Equation (47) in terms of the

σ_{i j}^{α} = 2 g_{i j}^{α} - 1

variable, it becomes clear which of the three solutions

u_{i j}^{*}

are viable (stable). In the case where

B_{i j} = 0

, or equivalently, when

θ_{i j} = 2 J

, the (pair) Hamiltonian is symmetric with respect to a change in sign,

σ_{i j}^{α} \to - σ_{i j}^{α}

, which means that the high- and low-density solutions are equal. This is the symmetry-broken situation we have discussed in Section 3.1. In this case, the intermediate-density solution will result in a lower value for the Hamiltonian than the high- and low-density solutions. The viable (stable) solutions are therefore the high- and low-density ones. In the case where

B_{i j} \neq 0

, it is clear that the high-density solution minimizes the Hamiltonian when

B_{i j} > 0

and maximizes it when

B_{i j} < 0

. The low-density solution minimizes the Hamiltonian when

B_{i j} < 0

and maximizes it when

B_{i j} > 0

. The intermediate solution will, however, never minimize the Hamiltonian when

B \neq 0

, and is therefore never viable (stable). From these considerations, it becomes clear that a phase transition, corresponding to a sudden change in

u_{i j}

, may only happen when we cross from a negative (positive)

B_{i j}

to a positive (negative)

B_{i j}

(when

J > 1

). Figure 3 shows the symmetric stable solutions

u_{i j}

in the case where

B_{i j} = 0

, with the bifurcation occurring at

J = 1

. In case of the positive field

B_{i j} = + 1

, it shows a single stable solution curve, which is the high-density solution (in the case where

B_{i j} = - 1

, this image would be flipped with respect to the

u_{i j}^{*} = 1 / 2

axis). The right panel in Figure 3 shows that the value of the stable solution

u_{i j}

jumps when

B_{i j}

crosses from positive to negative, as expected.

Combining the above considerations for all multilinks simultaneously, and adding the other constraint on the layer-averaged degrees, the multiplex will undergo a sequence of phase transitions, determining a hierarchy of increasingly ordered (magnetized, or rather ‘multiplexed’ in this case) phases where, for an increasing number of pairs of nodes, the links in different layers will tend to ‘align’ to each other (for

J > 1

). The separations between these phase transitions will depend on the values of the enforced layer-averaged degrees, which determine

\vec{θ} *

. The fully ordered phase, where all pairs of nodes are multiplexed, is the one where all the M layers of the multiplex are perfectly aligned, and are, therefore, basically an identical copy of each other. We might say that, in this case, the effective number of independent layers is

M_{eff} \approx 1

, and the expected overlap is maximal and proportional to the expected number

{〈 L 〉}_{\vec{θ} *, J^{*}} = \sum_{α = 1}^{M} \sum_{i < j} u_{i j}^{*}

of links in the entire multiplex:

{〈 O 〉}_{\vec{θ} *, J^{*}} \approx \sum_{α < β} \sum_{i < j} u_{i j}^{*} = (M / 2) {〈 L 〉}_{\vec{θ} *, J^{*}},

(67)

since

{〈 g_{i j}^{α} g_{i j}^{β} 〉}_{\vec{θ} *, J^{*}} \approx {〈 g_{i j}^{α} 〉}_{\vec{θ} *, J^{*}} = u_{i j}^{*}

for most pairs, i.e.,

α, β

, of layers. In the opposite extreme, we have a fully disordered phase where no pair of nodes is multiplexed (for instance, if

J < 1

), so the effective number of independent layers is maximal (

M_{eff} \approx M

), and the expected overlap is basically of the order of that given by Equation (36) for the model with

J^{*} = 0

, i.e.,

{〈 O 〉}_{\vec{θ} *, J^{*}} \approx \sum_{α < β} \sum_{i < j} {(u_{i j}^{*})}^{2},

(68)

since

{〈 g_{i j}^{α} g_{i j}^{β} 〉}_{\vec{θ} *, J^{*}} \approx {〈 g_{i j}^{α} 〉}_{\vec{θ} *, J^{*}} {〈 g_{i j}^{β} 〉}_{\vec{θ} *, J^{*}} = {(u_{i j}^{*})}^{2}

for most pairs of layers. The relationship between

{〈 O 〉}_{\vec{θ} *, J^{*}}

and

{〈 L 〉}_{\vec{θ} *, J^{*}}

will depend on the specific values of

{u_{i j}^{*}}_{i < j}

, so ultimately, on the enforced degree sequence. Between these two extremes, if the phases are well separated (which here means that the enforced degrees of different nodes have very different values), there will be intermediate regimes where

{〈 O 〉}_{\vec{θ} *, J^{*}}

and

{〈 L 〉}_{\vec{θ} *, J^{*}}

scale in a way that is between the two limiting scalings. All these general considerations will be confirmed in the next sections with numerical, analytical, and empirical analyses.

5. Numerical Analysis

Equations (53), (55), and (56) are the key equations of our OACM model. These equations are generally, however, very difficult to solve. Therefore, before creating a null model for a real-world network by solving the maximum likelihood equations to find the Lagrange multipliers, we shall first treat the Lagrange multipliers as free parameters in order to explore and analyze the properties of the model as a function of these parameters. This analysis shall be performed by utilizing the Metropolis–Hastings algorithm [51]. This algorithm can be used to sample the exponential probability distribution defined by the Hamiltonian of the model. By sampling the distribution, we numerically obtain various properties of the graph ensemble, which may then be compared to our analytical results in order to test the validity of the latter. Note that the sampling of the exponential distribution defined by a specific Hamiltonian may also be regarded as the simulation of a multiplex that corresponds to that Hamiltonian.

5.1. Exploring the Parameter Space

In order to explore the space of parameters, we are primarily interested in the difference between statistically homogeneous networks and statistically heterogeneous ones. To this end, we will explore the parameter space

(θ_{1}, \dots, θ_{N}, J)

of the model by specifying a value for J and sampling certain transformed parameters

x_{1}, \dots, x_{N}

from a distribution for each class, where

x_{i} \equiv e^{- θ_{i}}

. The quantity

x_{i}

will be referred to as the ‘fitness’, or ‘hidden variable’, of node i. The broader the distribution of the fitness, the more heterogeneous the resulting network structure.

5.1.1. Homogeneous Fitness: Erdős–Rényi Graphs with Overlap

The simplest distribution from which we can sample

x_{1}, \dots, x_{N}

is the delta distribution centered at x, such that

x_{1} = x_{2} = \dots = x_{N} \equiv x

and, therefore,

θ_{1} = θ_{2} = \dots = θ_{N} \equiv θ = - ln x

, resulting in statistically homogeneous networks. With this choice of parameters, our model is an extension of the Erdős–Rényi model, which is a random graph model that can be derived within the ERGM by solely constraining the total number of links in the network, and where all links occur with the same probability. As we shall see, the extension derives from the fact that the extra constraint on the overlap can lead to a symmetry-breaking phase transition, although the broken symmetry might not manifest at first sight. Indeed, since the parameters are the same for all pairs of nodes, the condition for the existence of multiple solutions is also the same, and, therefore, there is a unique phase transition where, depending on the values of

θ

and J, pairs of nodes are either all ‘magnetized’ or all ‘non-magnetized’. Similarly, since here

θ_{i j} = θ_{i} + θ_{j} = 2 θ

\forall i, j

, the spontaneous symmetry-breaking condition discussed in Section 3.1 for the vanishing of the external field is the same for all pairs of nodes, and given by

J = θ

. In the symmetry-broken (magnetized) phase, for all pairs of nodes, the expected value of

\sum_{α = 1}^{M} g_{i j}^{α}

(or equivalently, of the ‘magnetization’

\sum_{α = 1}^{M} σ_{i j}^{α}

) is the same, and is always between the two typical (high-density and low-density) realized values. However, since all pairs are independent, the actual realized values of

\sum_{α = 1}^{M} g_{i j}^{α}

are also independent across pairs, so on average, over the entire network, the magnetization will realize both the low-density and high-density values, with equal probability. In other words, different pairs of nodes are i.i.d. realizations of the same system. This is a peculiar situation where the realized values of L and O (which represent sums of all pairs of nodes) will still coincide with their expected values as if no symmetry breaking was present, even if different pairs of nodes actually realize different symmetry-broken values that are individually different from the expected value. The net result is an expected number of links

〈 L 〉 = M N (N - 1) / 4

) equal to half the maximum one, or equivalently, an average zero magnetization in the associated spin system. Similar considerations apply to the case

J \neq θ

, with the difference that, in that case, the symmetry is not broken spontaneously, but by the direction of the external field (value of

θ

), which implies that the two typical realized values of the magnetization for a given pair of nodes are no longer symmetric around the expected value. Still, both typical values will be realized, independently and with their probabilities, across the entire network, because different pairs of nodes are still independent. So, irrespective of the value of J and

θ

, we expect to observe realized values of L and O that correspond again to what one would observe without symmetry breaking, using the ensemble averages for each pair, irrespective of the phase of the system. All these considerations are confirmed below.

By looking at Equation (38), we can see that a uniform

θ

essentially means that instead of constraining the average layer degrees

{\bar{k}}_{i}

, we constrain the total number of links L in the multiplex network. In this case, the combined maximum entropy and maximum likelihood equations become

u = \frac{1}{2} + \frac{1}{2} tanh (2 J^{*} u - θ^{*})

(69)

\sum_{i < j}^{N} \sum_{α = 1}^{M} {g_{i j}^{*}}^{α} = \frac{M N (N - 1)}{2} u^{*} = {〈 L 〉}_{θ^{*}, J^{*}}

(70)

\frac{4}{M} \sum_{i < j} \sum_{α < β} {g_{i j}^{*}}^{α} {g_{i j}^{*}}^{β} = M N (N - 1) {(u^{*})}^{2} = \frac{4}{M} {〈 O 〉}_{θ^{*}, J^{*}}

(71)

where

u^{*} = u (θ^{*}, J^{*})

is the solution to Equation (69). Note that we now have a single equation for u, confirming the existence of a single global phase transition across the multiplex network, rather than separate local phase transitions for every multilink

m_{i j}

. Additionally, we note that if

u^{*}

can be considered as the density (and the link probability) of the network, then the value of

u^{*}

is exactly the same as the value of the density p in the Erdős–Rényi model [14,27], which solely constrains the number of links in the network. The difference between our model and the Erdős–Rényi model is that our model contains the possibility of a phase transition. However, since the number of links

〈 L 〉

also determines the overlap

〈 O 〉

, the two quantities cannot be tuned independently of each other.

By using the Metropolis–Hastings algorithm, we have sampled our ERGM for multiplexes with

M = 100

layers and

N = 100

nodes for various values of

θ

and/or J. If we repeat the simulations for

J = 1.5

and

θ = 1.4

,

θ = 1.5

, and

θ = 1.6

, the system must undergo a phase transition as per Figure 3. We expect an abrupt change in the value of

u^{*}

, and according to Equations (70) and (71), we therefore expect an abrupt change in the equilibrium value of both L and O. Figure 4 shows simulations for

θ \in {1.4, 1.5, 1.6}

confirming the transition from a relatively high to a low density as the value of the field

B = J - θ

changes sign. These simulations have been repeated for different combinations of values for J and

θ

around the point where B changes sign, confirming the results shown here. Note that the middle plot in Figure 4 shows that the algorithm converges to multiplexes with a density of

1 / 2

, confirming that, when

B = 0

, L is approximately half of the total amount of possible links in the multiplex, as we expected above.

In Figure 5 we test the prediction, given by Equations (70) and (71), of the quadratic relationship

〈 O 〉 = {〈 L 〉}^{2} / N^{2}

. Note that this quadratic trend is predicted irrespective of the value of

J > 0

, and even coincides with what Equation (68) predicts in the case

J = 0

for a homogeneous multiplex with constant

θ

, as considered here. So, in this case, the expected relationship between

〈 O 〉

and

〈 L 〉

is not informative regarding the phase transition, although the specific values picked up by the system along the curve are. Indeed, we again simulate multiplexes with

M = 100

layers,

N = 100

nodes, and a variety of values for

θ

and J. Each simulation results in a value for

〈 L 〉

and a value for

〈 O 〉

, which we plot against each other. These points are then compared to the theoretical points predicted by Equations (69)–(71) for the chosen parameter values, and added to Figure 5. We see that the relationship between simulated quantities is in agreement with the one predicted by the model. As we had anticipated, this is the result of the fact that different pairs of nodes are i.i.d. realizations of the same system, so that the ensemble average is realized as a sample average of the pairs of nodes across the network, even if in the symmetry-broken phase, the ensemble average of

\sum_{α = 1}^{M} g_{i j}^{α}

is not representative of any of the values realized locally for individual pairs of nodes. Therefore, the only scaling we observe coincides with the one given in Equation (68) for the ‘non-magnetized’ regime in the case where

θ

is the same for all nodes. The only, although very important, signature of the phase transition we see in Figure 5 is the fact that, for

J > 1

and

θ \neq J

, both the simulated data and the corresponding theoretical predictions ‘drift away’ from the intermediate values of

〈 L 〉

(which are still obtained for

θ = J

) towards either low (

θ > J

) or high (

θ < J

) values of

〈 L 〉

. This is because the realized multiplex networks are either low-density or high-density, which is an indication of a phase transition occurring when increasing the value of J, exactly as predicted by Figure 3.

We conclude our discussion of the homogeneous case by noting that, given an empirical multiplex

\vec{G} *

of interest, the entropy of the data given, in general, by Equation (58) reduces, in this case, to

\begin{matrix} S (θ^{*}, J^{*}) & = & M θ^{*} \sum_{i = 1}^{N} {\bar{k}}_{i}^{*} - \frac{4 J^{*}}{M} O^{*} + \sum_{i < j} ln z_{i j} (2 θ^{*}, J^{*}) \\ = & 2 θ^{*} L^{*} - \frac{4 J^{*}}{M} O^{*} + \frac{N (N - 1)}{2} ln z (2 θ^{*}, J^{*}), \end{matrix}

(72)

where we have used Equation (9) (denoting, via

L^{*} = L (\vec{G} *)

, the total number of links in the multiplex, which also equals the expected value

{〈 L 〉}_{θ^{*}, J^{*}}

) and the fact that the pair partition function

z_{i j}

, given by Equation (52), has the same value

z (2 θ^{*}, J^{*}) \equiv z_{i j} (2 θ^{*}, J^{*})

for all the

N (N - 1) / 2

pairs of nodes. From Equation (72), we see that the entropy is determined, as expected, by both

L^{*}

and

O^{*}

. At the same time, we know that

O^{*}

depends uniquely and quadratically on

L^{*}

in this homogeneous model. The values achieved by the entropy are, therefore, bound by the relationship between

L^{*}

and

O^{*}

, which here is the same irrespective of the value of

J^{*}

, including when

J^{*} = 0

. In any case, the entropy also depends on the specific values of

(θ^{*}, J^{*})

, and Equation (63) guarantees that an upper bound for

S (θ^{*}, J^{*})

is given by the entropy

S_{0} (θ^{0})

of the ACM model with

J^{*} = 0

and

θ^{*} = θ^{0}

(clearly, the homogeneity implies that

θ_{i}^{0} = θ^{0}

for all

i = 1, N

in the ACM model as well).

5.1.2. Power-Law-Distributed Fitness: Scale-Free Networks with Overlap

We now move away from the homogeneous case and consider a situation where the fitness values

{x_{i}}_{i = 1}^{N}

are drawn from a heavy-tailed distribution, in particular, a power law. This choice will produce a high degree of heterogeneity. In the ACM (see Section 2.5), the expected degree distribution is determined by the Lagrange multipliers

θ_{i}

, or equivalently, the transformed hidden variables

x_{i} = e^{- θ_{i}}

. If x is distributed according to a power law, the expected degree distribution shall be distributed according to a power law as well, with the modulo as an upper cut-off. Since our OACM is an extension of the ACM, we will still sample

x_{i}

from a power law distribution

P (x) \sim x^{- γ}

for various values of

γ

, even though the expected degree distribution is not solely determined by the hidden variables

{x_{i}}

, but depends on J as well. In any case, a higher level of heterogeneity in the hidden variables

x_{i}

will lead to a higher level of heterogeneity in the degrees. Since the parameter space is rather large (

N + 1

-dimensional), we define

x_{i} = z x_{0, i}

(73)

where z is a scaling factor. We sample

x_{0, i}

only once from every chosen distribution. The value of

x_{i}

is varied by varying the scaling factor z. The parameter space to be explored will then be

(z, J)

, which is 2-dimensional. We deduce that

θ_{i} = - ln (z x_{0, i})

(74)

which shows that an increasing z leads to a decreasing

θ_{i}

. In the ACM, we have shown that the link probability is equal to

p_{i j} = x_{i} x_{j} / (1 + x_{i} x_{j})

, which means that larger values of

x_{i}

lead to a larger expected degree, so that increasing all the fitness values will increase the density in the network. This qualitative relationship still holds with the addition of the constraint on the expected overlap (for fixed J).

The complexity of Equations (53), (55), and (56) does not allow us to easily derive the expected relationship between the overlap and the number of links in the network, as was the case when

θ_{i}

was constant. It is, however, possible to visualize the relationship between the overlap and the number of links by using the Metropolis–Hastings algorithm. Figure 6 shows this relationship, where

x_{i}

is sampled from power law distributions with various values of

γ

, alongside the expected quadratic term previously observed to occur for homogeneous values of the fitness

x_{i}

(delta distribution). We see that the overlap for a given number of links is higher in the cases where x is drawn from a power law distribution than when x is drawn from a delta distribution, even though the coupling parameter J is kept constant. The cause of this difference lies in the level of heterogeneity of the fitness distribution: unlike the homogeneous case, now different pairs of nodes have very different values of

θ_{i j} = θ_{i} + θ_{j}

, and, therefore, the condition

J = θ_{i j} / 2

for the vanishing of the ‘external field’

B_{i j}

(spontaneous symmetry-breaking condition) cannot be realized simultaneously by all pairs. The figure also shows the effect of different exponents of the power law distributions of the fitness. A smaller value of

γ

leads to a higher overlap for a given number of links. By increasing the value of

γ

, the power law distribution becomes more sharply peaked, and will therefore lead to more homogeneous networks. Note, however, that increasing the value of the coupling parameter J itself also leads to an increase in the overlap for a given number of links for the same distribution.

Importantly, the phase transition now occurs for different pairs of nodes as J is varied. Some pairs of nodes will be in the non-magnetized phase, while others will be in the magnetized phase. The effective number

M_{eff}

of independent layers will, in general, depend on the choice of parameters. Among the magnetized pairs, the realized values of the overlap are no longer those corresponding to the ensemble average (as in the homogeneous case), but typically to the symmetry-broken solution with lower energy (hence dictated by the value of

θ_{i j}

), because no other pair of nodes will, in general, exist with the same parameters and such that the two symmetry-broken values are averaged by the resulting value of the realized overlap. In particular, while for

0 < J < 1

all node pairs are in the non-magnetized phase, as J increases from 1 towards larger values, the pairs of nodes that first undergo the phase transition are the ones with values

θ_{i} + θ_{j}

that fall between the limits set by Equations (65) and (66). As those equations and Figure 2 show, there are more and more combinations

θ_{i} + θ_{j}

entering the magnetized phase as J increases. When J is sufficiently large, all pairs will be magnetized. Clearly, for any two pairs of nodes,

(i, j)

and

(i, k)

, that share the same node, i, the values of

θ_{i} + θ_{j}

and

θ_{i} + θ_{k}

will be correlated, as they share the same term

θ_{i}

. This means that the pairs of nodes entering the magnetized phase typically have nodes in common, even if it would be incorrect to say that individual nodes enter the magnetized phase ‘one by one’, while this is certainly correct for individual node pairs, if the sum

θ_{i} + θ_{j}

is different across all of them.

Figure 6 indeed shows the effect of the changing number of magnetized node pairs as J increases above 1. We note that, for larger and larger J, the relationship between

〈 O 〉

and

〈 L 〉

tends towards the ‘maximally multiplexed’ linear extreme (shown as a straight line) given in Equation (67). At the same time, we see that the ‘non-multiplexed’ case (

J < 1

) described by Equation (68) now realizes values of the overlap that are very different from the quadratic trend achieved by the homogeneous model (also shown as a solid curve in Figure 6), which now turns out to represent a lower bound. We can ‘zoom in’ to better see this difference by looking at Figure 7, where, by using Equations (53), (55), and (56), we additionally calculate the theoretically predicted values of

〈 O 〉

and

〈 L 〉

and compare them to the simulation data, where

x_{0, i}

is sampled from a power law distribution with

γ = 1

(the results for

γ \in {2, 3, 4}

are qualitatively similar and are therefore not shown here). The figure confirms a strong deviation from the curve for the homogeneous model, even when

J = 0

(signaling a much higher but spurious overlap, arising only from the rising correlation among node degrees across different layers), and a close agreement with the maximally overlapping value in Equation (67) already for

J = 1.5

(corresponding to a further increase in overlap, arising from an additional, genuine coupling between layers).

5.1.3. Log-Normally Distributed Fitness

The delta and power law distributions we have considered so far represent examples of completely homogeneous and extremely heterogeneous (especially for

γ = 1

) distributions, respectively. We now consider the log-normal distribution as a third example between these two extremes. This analysis will indeed lead to results that are in some sense intermediate between what we have observed so far, and useful for interpreting the real-world case that we will present later on. A log-normal distribution is the distribution of a random variable whose logarithm is normally distributed (i.e., if the random variable x is log-normally distributed, then

y = ln x

follows a normal distribution). The probability density for a log-normal distribution is

P (x) = \frac{1}{x σ \sqrt{2 π}} e^{- {(ln x - μ)}^{2} / (2 σ)},

(75)

where

μ

and

σ

correspond to the mean and the standard deviation of the normal distribution of

ln x

. We will vary the value of

x_{i}

by again introducing a scaling factor that can be changed such that

x_{i} = z x_{0, i}

and

θ_{i} = - ln (z x_{0, i})

, where we sample

x_{0, i}

once from the log-normal distribution for a variety of values for

μ

and

σ

.

The log-normal distribution allows us to inspect the transition in the relationship between the overlap and the number of links from the quadratic lower limit to the linear upper limit by varying the value of

σ

. Indeed, when

0 < σ ≪ 1

, the normal distribution of

ln x_{0, i}

is sharply peaked. By decreasing the value of

σ

towards 0,

ln x_{0, i}

(and, therefore,

x_{0, i}

as well) shall approach a delta distribution. This is the distribution that led us to the quadratic lower limit for the relationship between the overlap and the number of links in the network. Conversely, when

σ ≫ 1

, the log-normal distribution approaches a distribution with a power law tail with

γ = 1

. This distribution led us to the linear upper limit between the overlap and the number of links in the network (when J was sufficiently large). By increasing the value of

σ

from 0 to a sufficiently large value (e.g.,

σ = 10

), we can therefore increase the heterogeneity of the network from a completely homogeneous network achieving the quadratic lower limit to an extremely heterogeneous network close to the linear upper limit relationship in the simulation data.

Figure 8 shows the relationship between the average overlap and the number of links in the network with simulation data that were obtained by using the Metropolis–Hastings algorithm for a variety of values for J and

σ

. Again, the linear upper limit is illustrated as a straight line and the quadratic lower limit as a solid curve. The figure confirms that in the case where

J = 0

, the data points that correspond to

x_{0, i}

being sampled from a log-normal distribution with a relatively low value for

σ

are either on or close to the quadratic lower limit curve. On the other hand, the case where

σ = 10

results in data points where the overlap in the network for a given number of links is almost maximal, and therefore approaches the linear upper limit. This first set of results confirms the strong role of node heterogeneity in determining increased correlations between the degrees of the same node across different layers, which, in turn, increase the inter-layer overlap even without any explicit coupling (

J = 0

), and hence, in a ‘spurious’ manner. On the other hand, when we increase the value of J, the data points corresponding to relatively low values of

σ

(e.g.,

σ = 10^{- 5}

and

σ = 10^{- 3}

) stay on or close to the quadratic lower limit, a finding similar to the results in Section 5.1.1, showing that the symmetry-broken values realized by different pairs of nodes, when averaged across the network, restore the ensemble average because the node pairs are all independent and (almost) identically distributed. Remarkably this means that, in a certain sense, node homogeneity ‘suppresses’ the effects of the true inter-layer coupling (

J > 0

) on the realized overlap. For the intermediate value

σ = 1.0

, the data are distributed close to the quadratic lower limit curve only for low values of J, while increasing the value of J leads to a more linear trend, eventually approaching the linear upper limit. In this case, the coupling is effective in producing a higher realized overlap. In the case where

σ = 10

, the linear trend is instead achieved already for

J = 0.0

(although the points are aligned below it); hence, increasing the value of J barely influences the value of the overlap for a given number of links.

Therefore the effect of increasing J in networks with a moderate heterogeneity is a transition from multiplex configurations with densities of all levels towards multiplex configurations with either low or high density, which is a result of the phase transition. It also shows that a very high level of heterogeneity leads to an overlap in the network that is already close to maximal for a given number of links, irrespective of the phase transition and the value of J. However, in the case where we have an intermediate level of heterogeneity (

σ = 1.0

), we observe that the effect of the coupling can be relatively strong, and we can therefore construct networks with a combination of the overlap and number of links falling between the extreme linear upper limit and the quadratic lower limit in a controlled, systematic manner. Note that Figure 8 also shows that, as J increases above 1, the (symmetry-broken) realized data start to ‘drift away’ from the intermediate densities, in a way similar to what we observed in Figure 5, but in a more pronounced manner. This is due to the fact that, as J increases, a larger number of multilinks shall be either in the low-density or high-density phase.

Again, in Figure 9 (which is the counterpart of Figure 7), we ‘zoom in’, and, using Equations (53), (55), and (56), we show the theoretically predicted values of

〈 O 〉

and

〈 L 〉

and compare them to the simulation data, where

x_{0, i}

is sampled from a log-normal distribution with

σ = 1

, for

J = 0

and

J = 1.5

. The results for

σ \in {10^{- 5}, 10^{- 3}, 10^{- 1}, 10^{1}}

are not shown here since relatively low and high values for

σ

lead to results similar to those we have shown in Section 5.1.1 and Section 5.1.2, respectively. Figure 9 confirms that the theoretical predictions are in good agreement with the simulation data, apart from the expected ‘drifting away’ of symmetry-broken values from the corresponding ensemble average.

6. Analysis of the World Trade Multiplex

In this section, we finally consider an application of the model to a real-world economic network. Since our models lead to multiplex networks with independent pairs of nodes (i.e., independent multilinks) even when links are correlated across layers, it is important that the real-world network is consistent with this assumption. For instance, networks constructed from time series data [3,4,5] are not viable, because the known (and strong) correlations between the time series corresponding to different vertices generate dependencies between pairs of nodes (and higher-order patterns) through the triangular inequality [6,7]. For this reason, we select the World Trade Multiplex as an ideal case study for the present analysis, because each separate layer of that network has been successfully modeled in the past via maximum entropy models of networks with given degrees [31,32,33]. At the same time, it has been shown that certain structural properties of commodity-specific layers are very similar across the different layers of the multiplex [30], and that this similarity (in particular, the correlation among the degrees of the same node in different layers) generates a large spurious component of the inter-layer overlap [28,29], which is not necessarily due to a genuine coupling. In this sense, our analysis here will add a natural novel aspect to the modeling of the network, namely, the explicit comparison with a model with nontrivial coupling among layers, which has not been considered so far. We use the UN-COMTRADE dataset that represents the multiplex network of international trade (https://comtradeplus.un.org, accessed on 2 September 2019). The different layers of this multiplex network represent different commodities. The vertices in this network represent different countries, and a link exists between two countries in a given layer if there is trade between them in that commodity. The data include

N = 206

countries and

M = 96

commodities. Some examples of traded commodities are meat, fish, dairy products, coffee, and tobacco [30,33].

Using the international trade data, we wish to identify a possibly nontrivial overlap by creating

(L, O)

plots similar to the ones depicted in Figure 6 and Figure 7 or Figure 8. We therefore repeatedly filter the network such that each layer

α

has the same number of links

L^{α} \equiv L^{0}

(where

α = 1, \dots, M

), and calculate the corresponding overlap O for the specified value of

L^{0}

(note that this means that the total number of links in the entire multiplex is

L = M L^{0}

). The criterion we follow is choosing the

L^{0}

strongest (highest weight) links in every layer to obtain data with comparable degrees across layers, as in our models. Note that, by using this filtering method, the highest possible density we can achieve is limited by the density of the sparsest layer in the unfiltered network. The results are shown in Figure 10, which indicates that the overlap for a given number of links appears to be around halfway between the quadratic lower limit curve and the linear upper limit curve. This suggests that the degree of heterogeneity of the network is intermediate, similar to that realized by log-normally distributed fitness values, as in our example considered above.

As anticipated, we are currently unable to solve the maximum likelihood equations in order to obtain the joint values of all the Lagrange multipliers in the full OACM model with

J \neq 0

. However, after filtering the original empirical network such that every layer has

L^{0}

links, we can use the values of the hidden variables

x_{i}^{*}

for the null model corresponding to the absence of inter-layer coupling, i.e., to

J^{*} = 0

. As we have shown in Equations (57), this assumption reduces our model to the ACM discussed in Section 2.5. The maximum likelihood equations in this case are much easier to solve, and can be found using one of the numerical algorithms available at https://meh.imtlucca.it (accessed on 1 May 2023). This procedure is repeated for a range of values for

L^{0}

. The cumulative distribution of the hidden variables

x_{i}^{*}

are plotted in Figure 10 for various values of

L^{0}

. The figure qualitatively shows that the shape of the cumulative distribution of x is fat-tailed and indeed similar to the one for a log-normal distribution. Moreover, it does not vary with

L^{0}

, apart from an overall change of scale.

The null model with

J^{*} = 0

, when compared to the data for the same choice of

L_{0}

, allows us to detect the presence of nontrivial coupling among the layers, when present. Indeed, from Figure 10, we see that the filtered networks have a relatively high overlap, the data points being distributed along a similar trend as the one corresponding to a nonzero J in our previous heterogeneous examples. By using the values of the hidden variables for the model with

J^{*} = 0

, we can calculate the corresponding expected number of links and the expected overlap under the null hypothesis of no coupling between the layers, but the same average degree sequence in the real network. The results are shown in Figure 10, alongside the curve corresponding to the empirical data. We see that the assumption

J = 0

leads to an insufficiently overlapping multiplex, demonstrating the necessity of a model that introduces dependencies between the layers of a network. The difference between the two curves can be quantified by fitting both to the curve

O = A L^{α}

(76)

where A is a proportionality factor and

α

is an exponent (not to be confused with the label of a layer of the multiplex). For the empirical data, we find a steeper increase characterized by an exponent

α_{empirical} = 1.19

, while for the predictions from the ACM, we find

α_{CM} = 1.06

(see Figure 10). The difference between the two values implies that the difference between the realized and expected overlap increases as L increases, confirming that the observed overlap in the WTM is not only the spurious result of the correlated heterogeneity of the degrees of countries, but reflects genuine (

J^{*} > 0

) inter-layer dependencies.

We conclude with a discussion about the entropy in the heterogeneous case, analogous to the one we made in Section 5.1.1 in the homogeneous case. Here we note that, given a multiplex

\vec{G} *

of interest, the entropy

S (\vec{θ} *, J^{*})

of the data, given the OACM model, is the one given by Equation (58), which in the heterogeneous case cannot be, in general, reduced to a simpler formula. However, if we define the minimum and maximum values of the hidden variables as

θ_{\min}^{*} \equiv min_{i = 1, N} {θ_{i}^{*}}, θ_{\max}^{*} \equiv max_{i = 1, N} {θ_{i}^{*}},

(77)

respectively, we can bound the entropy as follows:

S_{\min} (\vec{θ} *, J^{*}) \leq S (\vec{θ} *, J^{*}) \leq S_{\max} (\vec{θ} *, J^{*})

(78)

where we have defined

\begin{matrix} S_{\min} (\vec{θ} *, J^{*}) & \equiv & 2 θ_{\min}^{*} L^{*} - \frac{4 J^{*}}{M} O^{*} + \sum_{i < j} ln z_{i j} (θ_{i}^{*} + θ_{j}^{*}, J^{*}), \end{matrix}

(79)

\begin{matrix} S_{\max} (\vec{θ} *, J^{*}) & \equiv & 2 θ_{\max}^{*} L^{*} - \frac{4 J^{*}}{M} O^{*} + \sum_{i < j} ln z_{i j} (θ_{i}^{*} + θ_{j}^{*}, J^{*}) . \end{matrix}

(80)

The bounds in Equation (78) are alternative to the general ones in Equation (63), and arguably more useful to characterize how the entropy is effectively constrained by, once again, the relationship between

L^{*}

and

O^{*}

. The latter, unlike the homogeneous case, is not necessarily quadratic, and can follow the diverse trends we have shown in Figure 6, Figure 8 and Figure 10. In particular, the power law relationship captured by Equation (76) for the empirical WTM provides a convenient way of bounding

S (\vec{θ} *, J^{*})

via Equations (78)–(80).

7. Conclusions

In this paper we have introduced a maximum entropy model, or ERGM, of multiplex networks with given degrees and inter-layer overlap. The model allowed us to separately control the effects of the correlations between node degrees across different layers (which lead to a spurious overlap) and that of a genuine inter-layer coupling. The nature of the enforced constraints is such that different pairs of nodes are statistically independent, even if the parameters governing them are correlated via those of the nodes they share.

For each pair of nodes, the model can be mapped exactly to a mean-field Ising model featuring a magnetization-like phase transition, which includes the possibility of (spontaneous) symmetry breaking. Given the difficulty of solving the maximum likelihood equations to obtain the values of the Lagrange multipliers corresponding to a particular real network, we first treated the Lagrange multipliers as free parameters in order to explore and analyze the properties of multiplex systems as a function of these parameters using numerical methods. Additionally, the numerical results were compared to our analytical results in order to test the validity of the latter. We have shown that the analytical equations are highly accurate. The combined result, at the level of the entire multiplex, of the properties of all node pairs is nontrivial and crucially depends on the values of the node-specific parameters, which ultimately depend on the enforced degrees.

In the fully homogeneous case, the phase transition occurs at the same critical point for all node pairs simultaneously, because the parameters are identical for all nodes. However, the independence of different node pairs implies that, even in the magnetized phase, the realized values of the inter-layer overlap and total number of links coincide with the ensemble average. This happens because different node pairs realize all the possible symmetry-broken values independently, so that an average of the realized values for a large number of independent node pairs asymptotically equals the ensemble average. The value of J has little effect on the relationship connecting the overlap to the number of links, which remains similar to what we observed for the case

J = 0

, showing that node homogeneity suppresses the effects of a genuine inter-layer coupling.

In the heterogeneous case, the phenomenology is very different since, despite the fact that node pairs are still independent, they are now governed by different parameters, and the ensemble average for a given pair can no longer be realized as an average of the realized values of pairs with the same parameters. This implies that the observed overlap and number of links will depend on the realized symmetry-broken values, whose typical value does not coincide in general with the ensemble average, and is determined by the node-specific parameters (hence, ultimately by the degrees). Moreover different pairs of nodes are, in general, found in different phases, so the multiplex displays, as a function of the parameters, a hierarchy of phase transitions. We have found that increasing the value of the coupling parameter J generally increases the (genuine) overlap for a given number of links, if there is enough node heterogeneity. However, we have also shown that increasing the heterogeneity of the network increases the (spurious) overlap for a given number of links as well. This is a consequence of the presence of large hubs that appear in a correlated manner across layers, due to the increased heterogeneity of the network. Additionally, every multilink that is connected to these hubs has a relatively low critical threshold for the coupling parameter J. Therefore, these multilinks have a higher probability to be in the high density phase, which leads to a higher overlap as well, which corresponds to increasing the amount of genuine correlation. In general, the overlap for a given number of links can be increased by increasing either the heterogeneity of the network or the value of the coupling parameter, with a subtle interplay between the two. In principle, this can be used in order to create multiplexes with a specific degree of overlap for a given of number of links, provided their combination is within the theoretical limits discussed in Section 5.

Finally, by using a dataset that represents the empirical multiplex network of international trade in several commodity-specific layers, we have used the model to disentangle the spurious overlap arising from the documented strong correlation of node degrees across layers [28,29] from the genuine overlap arising from actual inter-layer coupling. We have found that the assumption that there is no coupling between the layers (

J = 0

), which reduces our model to the ACM, results in a multiplex with insufficient inter-layer overlap. This means that the empirical overlap is not merely the spurious result of the correlated heterogeneity of the network, but requires a true nonzero coupling between layers.

Our results demonstrate the subtleties of the interplay between node heterogeneity and inter-layer dependencies in multiplex networks, highlighting the need for null models that can control these factors separately. In this paper, we have introduced perhaps the simplest, although already very rich, model of this type. Our model can be seen as a minimal one, to be further generalized in the future.

Author Contributions

Conceptualization, D.G. and V.G.; methodology, D.G., V.G. and N.B.; software, N.B.; data curation, V.G. and N.B.; writing—original draft preparation, N.B.; writing—review and editing, V.G. and D.G.; visualization, N.B.; supervision, D.G.; project administration, V.G.; funding acquisition, D.G. All authors have read and agreed to the published version of the manuscript.

Funding

The APC was funded by Stichting Econophysics, Leiden, The Netherlands.

Data Availability Statement

For the empirical analysis of the World Trade Multiplex, the publicly available UN-COMTRADE dataset was analyzed in this study. The data are available at http://comtrade.un.org/ (accessed on 2 September 2019). The codes used for the numerical calculation of the parameters maximizing the likelihood are available at https://meh.imtlucca.it (accessed on 1 May 2023).

Acknowledgments

This work is supported by the European Union—Horizon 2020 Program under the scheme ‘INFRAIA-01-2018-2019—Integrating Activities for Advanced Communities’, Grant Agreement n.871042, ‘SoBigData++: European Integrated Infrastructure for Social Mining and Big Data Analytics’ (http://www.sobigdata.eu, accessed on 1 May 2023). This work is also supported by the European Union-NextGenerationEU-National Recovery and Resilience Plan (Piano Nazionale di Ripresa e Resilienza, PNRR), project ‘SoBigData.it-Strengthening the Italian RI for Social Mining and Big Data Analytics’-Grant IR0000013 (3264, 28/12/2021). We also acknowledge support from the project NetRes—‘Network analysis of economic and financial resilience’, Italian DM n. 289, 25-03-2021 (PRO3 Scuole), CUP D67G22000130001 (https://netres.imtlucca.it, accessed on 1 May 2023).

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Hubbard–Stratonovich Transform

The pair Hamiltonian of our OACM in Equation (47) can be rewritten as

h_{i j} (s_{i j}, B_{i j}, J) = - B_{i j} \sum_{α = 1}^{M} σ_{i j}^{α} - \frac{J}{2 M} {(\sum_{α = 1}^{M} σ_{i j}^{α})}^{2} + \frac{J}{2} + v_{i j} .

(A1)

We want to obtain an expression for the pair partition function:

\begin{matrix} z_{i j} (B_{i j}, J) & = \sum_{s_{i j} \in S_{i j}} e^{- h_{i j} (s_{i j}, B_{i j}, J)} \\ = \sum_{s_{i j} \in S_{i j}} exp [\frac{J}{2 M} {(\sum_{α = 1}^{M} σ_{i j}^{α})}^{2} + B_{i j} \sum_{α = 1}^{M} σ_{i j}^{α} - \frac{J}{2} - v_{i j}] \\ = e^{- J / 2} e^{- v_{i j}} \sum_{s_{i j} \in S_{i j}} exp [{(\sqrt{\frac{J}{2 M}} \sum_{α = 1}^{M} σ_{i j}^{α})}^{2} + B_{i j} \sum_{α = 1}^{M} σ_{i j}^{α}] . \end{matrix}

(A2)

The argument of the exponent in the above expression can be linearized by using the Gaussian integral

e^{a^{2}} = \frac{1}{\sqrt{2 π}} \int_{- \infty}^{\infty} d ξ_{i j} e^{- ξ_{i j}^{2} / 2 + \sqrt{2} a ξ_{i j}} .

(A3)

In our case, by choosing

a = \sqrt{J / (2 M)} \sum_{α = 1}^{M} σ_{i j}^{α}

the partition function factorizes with respect to the individual summations of

σ_{i j}^{α}

:

\begin{matrix} z_{i j} (B_{i j}, J) & = & \frac{e^{- J / 2 - v_{i j}}}{\sqrt{2 π}} \sum_{s_{i j} \in S_{i j}} \int_{- \infty}^{\infty} d ξ_{i j} e^{- ξ_{i j}^{2} / 2} exp [\sum_{α = 1}^{M} σ_{i j}^{α} (\sqrt{\frac{J}{M}} ξ_{i j} + B_{i j})] \\ = & \frac{e^{- J / 2 - v_{i j}}}{\sqrt{2 π}} \int_{- \infty}^{\infty} d ξ_{i j} e^{- ξ_{i j}^{2} / 2} \sum_{σ_{i j}^{1} \in {- 1, 1}} \dots \sum_{σ_{i j}^{M} \in {- 1, 1}} \prod_{α = 1}^{M} exp [σ_{i j}^{α} (\sqrt{\frac{J}{M}} ξ_{i j} + B_{i j})] \\ = & \frac{e^{- J / 2 - v_{i j}} 2^{M}}{\sqrt{2 π}} \int_{- \infty}^{\infty} d ξ_{i j} e^{- ξ_{i j}^{2} / 2} {[cosh (\sqrt{\frac{J}{M}} ξ_{i j} + B_{i j})]}^{M} . \end{matrix}

(A4)

Performing the change of variable

\sqrt{J / M} ξ_{i j} = J y_{i j}

we obtain

z_{i j} (B_{i j}, J) = 2^{M} \sqrt{\frac{J M}{2 π}} e^{- J / 2} e^{- v_{i j}} \int_{- \infty}^{\infty} d ξ_{i j} {[Φ_{J, B_{i j}} (y_{i j})]}^{M}

(A5)

where

Φ_{J, B_{i j}} \equiv e^{- J y_{i j}^{2} / 2} cosh (J y_{i j} + B_{i j}) .

(A6)

We are interested in the large M limit. To proceed in the calculation of

z_{i j} (B_{i j}, J)

, it is useful to define the quantity

f_{i j} (B_{i j}, J) \equiv - lim_{M \to \infty} \frac{1}{M} ln z_{i j} (B_{i j}, J) = - lim_{M \to \infty} ln z_{i j}^{1 / M} (B_{i j}, J)

(A7)

which is the free energy per layer. By inserting the result (A5) into (A7), we obtain

\begin{matrix} f_{i j} (B_{i j}, J) & = & - ln 2 - lim_{M \to \infty} \frac{1}{M} [ln (e^{- J / 2} \sqrt{\frac{J M}{2 π}}) - v_{i j} + ln (\int_{- \infty}^{\infty} d y_{i j} {[Φ_{J, B_{i j}} (y)]}^{M})] \\ = & - ln 2 + \frac{J}{2} - B_{i j} - ln [lim_{M \to \infty} {(\int_{- \infty}^{\infty} d y_{i j} {[Φ_{J, B_{i j}} (y)]}^{M})}^{1 / M}] . \end{matrix}

(A8)

In order to obtain a more explicit form of the function

f_{i j (B_{i j}, J)}

, we use the Laplace theorem [49]. Let

ϕ (y)

and

ψ (y)

be continuous and positive functions within a range

c \leq y \leq d

, then

lim_{M \to \infty} {[\int_{c}^{d} ψ (y) {(ϕ (y))}^{M}]}^{1 / M} = max_{c \leq y \leq d} ϕ (y) .

(A9)

For

ψ (y) = 1

and

ϕ (y) = Φ_{J, B_{i j}} (y)

, this results in

f_{i j} (B_{i j}, J) = - ln 2 + \frac{J}{2} - B_{i j} - ln [max_{- \infty \leq y_{i j} \leq \infty} Φ_{J, B_{i j}} (y_{i j})] .

(A10)

The derivative of

Φ_{J, B_{i j}} (y_{i j})

with respect to

y_{i j}

is zero at its maximum:

\frac{d Φ_{J, B_{i j}} (y_{i j})}{d y_{i j}} = J e^{- J y_{i j}^{2} / 2} sinh (J y_{i j} + B_{i j}) - J y_{i j} e^{- J y_{i j}^{2} / 2} cosh (J y_{i j} + B_{i j}) = 0 .

(A11)

The variable

y_{i j}

therefore obeys the equation

y_{i j} = tanh (J y_{i j} + B_{i j}) .

(A12)

Note that this equation is identical to the one obtained for the magnetization in the Ising Model, and, depending on the values of J and

B_{i j}

, there are either one or three solutions that satisfy Equation (A12). The free energy

f_{i j}

can now be written as a function of J and

B_{i j}

:

f_{i j} (B_{i j}, J) = - ln 2 + \frac{J}{2} - B_{i j} + \frac{J}{2} {(y_{i j})}^{2} - ln [cosh (J y_{i j} + B_{i j})] .

(A13)

We then finally arrive at the pair partition function

\begin{matrix} z_{i j} (B_{i j}, J) & = & e^{- M f_{i j}} \\ = & 2^{M} e^{- v_{i j}} e^{- J M {(y_{i j})}^{2} / 2} {cosh}^{M} (J y_{i j} + B_{i j}) \end{matrix}

(A14)

which, returning to the variables

θ_{i j}

, coincides with Equation (52) in the main text, where

u_{i j}

is the solution to Equation (53).

Appendix B. Maximum Likelihood

To determine the parameters

(\vec{θ} *, J^{*})

that maximize the log-likelihood of the OACM given in Equation (54), we first calculate the derivatives

\begin{matrix} - \frac{\partial L (\vec{θ}, J)}{\partial θ_{k}} & = & \sum_{i < j} \frac{\partial h_{i j} (m_{i j}^{*}, θ_{i j}, J)}{\partial θ_{k}} + \sum_{i < j} \frac{\partial ln z_{i j} (θ_{i j}, J)}{\partial θ_{k}}, k = 1, \dots, N \end{matrix}

(A15)

\begin{matrix} - \frac{\partial L (\vec{θ}, J)}{\partial J} & = & \sum_{i < j} \frac{\partial h_{i j} (m_{i j}^{*}, θ_{i j}, J)}{\partial J} + \sum_{i < j} \frac{\partial ln z_{i j} (θ_{i j}, J)}{\partial J} . \end{matrix}

(A16)

We then set the derivatives with respect to

θ_{k}

to zero:

\begin{matrix} - {\frac{\partial L (\vec{θ}, J)}{\partial θ_{k}}|}_{\vec{θ} *, J^{*}} & = \sum_{α = 1}^{M} \sum_{j \neq k} {g_{j k}^{*}}^{α} - M \sum_{j \neq k} u_{j k}^{*} = 0 \end{matrix}

(A17)

where we utilize the fact that

g_{i j}^{α}

and

u_{i j}

are symmetric with respect to the indices

i, j

, i.e.,

\sum_{i < j} g_{i j}^{α} δ_{i}^{k} = \sum_{j = k + 1}^{N} g_{j k}^{α}, \sum_{i < j} g_{i j}^{α} δ_{j}^{k} = \sum_{j = 1}^{k - 1} g_{j k}^{α} .

(A18)

Similarly, we set the derivative with respect to J to zero:

\begin{matrix} - {\frac{\partial L (\vec{θ}, J)}{\partial J}|}_{\vec{θ} *, J^{*}} = \sum_{i < j} (- \frac{4}{M} \sum_{α < β} {g_{i j}^{*}}^{α} {g_{i j}^{*}}^{β} + 2 M {(u_{i j}^{*})}^{2}) = 0 . \end{matrix}

(A19)

Taken together, the above calculations lead to the maximum likelihood Equations (55) and (56) in the main text.

References

Krackhardt, D. Cognitive social structures. Soc. Netw. 1987, 9, 109–134. [Google Scholar] [CrossRef]
Padgett, J.F.; Ansell, C.K. Robust Action and the Rise of the Medici, 1400–1434. Am. J. Sociol. 1993, 98, 1259–1319. [Google Scholar] [CrossRef]
Bardoscia, M.; Barucca, P.; Battiston, S.; Caccioli, F.; Cimini, G.; Garlaschelli, D.; Saracco, F.; Squartini, T.; Caldarelli, G. The physics of financial networks. Nat. Rev. Phys. 2021, 3, 490–507. [Google Scholar] [CrossRef]
Lacasa, L.; Luque, B.; Ballesteros, F.; Luque, J.; Nuno, J.C. From time series to complex networks: The visibility graph. Proc. Natl. Acad. Sci. USA 2008, 105, 4972–4975. [Google Scholar] [CrossRef] [PubMed]
Tsiotas, D.; Magafas, L.; Argyrakis, P. An electrostatics method for converting a time-series into a weighted complex network. Sci. Rep. 2021, 11, 11785. [Google Scholar] [CrossRef]
MacMahon, M.; Garlaschelli, D. Community Detection for Correlation Matrices. Phys. Rev. X 2015, 5, 021006. [Google Scholar] [CrossRef]
Anagnostou, I.; Squartini, T.; Kandhai, D.; Garlaschelli, D. Uncovering the mesoscale structure of the credit default swap market to improve portfolio risk modelling. Quant. Financ. 2021, 21, 1501–1518. [Google Scholar] [CrossRef]
De Domenico, M.; Solé-Ribalta, A.; Cozzo, E.; Kivelä, M.; Moreno, Y.; Porter, M.A.; Gómez, S.; Arenas, A. Mathematical formulation of multilayer networks. Phys. Rev. X 2013, 3, 041022. [Google Scholar] [CrossRef]
Battiston, F.; Nicosia, V.; Latora, V. Structural measures for multiplex networks. Phys. Rev. E 2014, 89, 032804. [Google Scholar] [CrossRef]
Kivelä, M.; Arenas, A.; Barthelemy, M.; Gleeson, J.P.; Moreno, Y.; Porter, M.A. Multilayer networks. J. Complex Netw. 2014, 2, 203–271. [Google Scholar] [CrossRef]
Battiston, F.; Nicosia, V.; Latora, V. The new challenges of multiplex networks: Measures and models. Eur. Phys. J. Spec. Top. 2017, 226, 401–416. [Google Scholar] [CrossRef]
Boccaletti, S.; Bianconi, G.; Criado, R.; Del Genio, C.I.; Gómez-Gardenes, J.; Romance, M.; Sendina-Nadal, I.; Wang, Z.; Zanin, M. The structure and dynamics of multilayer networks. Phys. Rep. 2014, 544, 1–122. [Google Scholar] [CrossRef] [PubMed]
Verbrugge, L.M. Multiplexity in adult friendships. Soc. Forces 1979, 57, 1286–1309. [Google Scholar] [CrossRef]
Erdos, P.; Rényi, A. On the evolution of random graphs. Publ. Math. Inst. Hung. Acad. Sci 1960, 5, 17–60. [Google Scholar]
Albert, R.; Barabási, A.L. Statistical mechanics of complex networks. Rev. Mod. Phys. 2002, 74, 47. [Google Scholar] [CrossRef]
Watts, D.J.; Strogatz, S.H. Collective dynamics of’small-world’networks. Nature 1998, 393, 440. [Google Scholar] [CrossRef]
Squartini, T.; Garlaschelli, D. Analytical maximum-likelihood method to detect patterns in real networks. New J. Phys. 2011, 13, 083001. [Google Scholar] [CrossRef]
Squartini, T.; Mastrandrea, R.; Garlaschelli, D. Unbiased sampling of network ensembles. New J. Phys. 2015, 17, 023052. [Google Scholar] [CrossRef]
Squartini, T.; Garlaschelli, D. Maximum-Entropy Networks: Pattern Detection, Network Reconstruction and Graph Combinatorics; Springer: Berlin/Heidelberg, Germany, 2017. [Google Scholar]
Cimini, G.; Squartini, T.; Saracco, F.; Garlaschelli, D.; Gabrielli, A.; Caldarelli, G. The statistical physics of real-world networks. Nat. Rev. Phys. 2019, 1, 58–71. [Google Scholar] [CrossRef]
Holland, P.W.; Leinhardt, S. An exponential family of probability distributions for directed graphs. J. Am. Stat. Assoc. 1981, 76, 33–50. [Google Scholar] [CrossRef]
Besag, J. Spatial interaction and the statistical analysis of lattice systems. J. R. Stat. Soc. Ser. B (Methodol.) 1974, 36, 192–236. [Google Scholar] [CrossRef]
Frank, O.; Strauss, D. Markov graphs. J. Am. Stat. Assoc. 1986, 81, 832–842. [Google Scholar] [CrossRef]
Contractor, N.S.; Wasserman, S.; Faust, K. Testing multitheoretical, multilevel hypotheses about organizational networks: An analytic framework and empirical example. Acad. Manag. Rev. 2006, 31, 681–703. [Google Scholar] [CrossRef]
Wasserman, S.; Faust, K. Social Network Analysis: Methods and Applications; Cambridge University Press: Cambridge, UK, 1994; Voluem 8. [Google Scholar]
Carrington, P.J.; Scott, J.; Wasserman, S. Models and Methods in Social Network Analysis; Cambridge University Press: Cambridge, UK, 2005; Volume 28. [Google Scholar]
Park, J.; Newman, M.E. Statistical mechanics of networks. Phys. Rev. E 2004, 70, 066117. [Google Scholar] [CrossRef]
Gemmetto, V.; Garlaschelli, D. Multiplexity versus correlation: The role of local constraints in real multiplexes. Sci. Rep. 2015, 5, 9120. [Google Scholar] [CrossRef]
Gemmetto, V.; Squartini, T.; Picciolo, F.; Ruzzenenti, F.; Garlaschelli, D. Multiplexity and multireciprocity in directed multiplexes. Phys. Rev. E 2016, 94, 042316. [Google Scholar] [CrossRef]
Barigozzi, M.; Fagiolo, G.; Garlaschelli, D. Multinetwork of international trade: A commodity-specific analysis. Phys. Rev. E 2010, 81, 046104. [Google Scholar] [CrossRef]
Squartini, T.; Fagiolo, G.; Garlaschelli, D. Randomizing world trade. I. A binary network analysis. Phys. Rev. E 2011, 84, 046117. [Google Scholar] [CrossRef]
Fagiolo, G.; Squartini, T.; Garlaschelli, D. Null models of economic networks: The case of the world trade web. J. Econ. Interact. Coord. 2013, 8, 75–107. [Google Scholar] [CrossRef]
Mastrandrea, R.; Squartini, T.; Fagiolo, G.; Garlaschelli, D. Reconstructing the world trade multiplex: The role of intensive and extensive biases. Phys. Rev. E 2014, 90, 062804. [Google Scholar] [CrossRef]
Szell, M.; Lambiotte, R.; Thurner, S. Multirelational organization of large-scale social networks in an online world. Proc. Natl. Acad. Sci. USA 2010, 107, 13636–13641. [Google Scholar] [CrossRef] [PubMed]
Cardillo, A.; Gómez-Gardenes, J.; Zanin, M.; Romance, M.; Papo, D.; Del Pozo, F.; Boccaletti, S. Emergence of network features from multiplexity. Sci. Rep. 2013, 3, 1344. [Google Scholar] [CrossRef] [PubMed]
Menichetti, G.; Remondini, D.; Panzarasa, P.; Mondragón, R.J.; Bianconi, G. Weighted multiplex networks. PLoS ONE 2014, 9, e97857. [Google Scholar] [CrossRef] [PubMed]
Freeman, L.C. A set of measures of centrality based on betweenness. Sociometry 1977, 40, 35–41. [Google Scholar] [CrossRef]
Clauset, A.; Shalizi, C.R.; Newman, M.E. Power-law distributions in empirical data. SIAM Rev. 2009, 51, 661–703. [Google Scholar] [CrossRef]
Barabási, A.L.; Albert, R. Emergence of scaling in random networks. Science 1999, 286, 509–512. [Google Scholar] [CrossRef]
Berlingerio, M.; Coscia, M.; Giannotti, F.; Monreale, A.; Pedreschi, D. Foundations of multidimensional network analysis. In Proceedings of the Advances in Social Networks Analysis and Mining (ASONAM), 2011 International Conference, Kaohsiung, Taiwan, 25–27 July 2011; pp. 485–489. [Google Scholar]
Bianconi, G. Statistical mechanics of multiplex networks: Entropy and overlap. Phys. Rev. E 2013, 87, 062806. [Google Scholar] [CrossRef]
Jaynes, E.T. Information theory and statistical mechanics. Phys. Rev. 1957, 106, 620. [Google Scholar] [CrossRef]
Jaynes, E.T. On the rationale of maximum-entropy methods. Proc. IEEE 1982, 70, 939–952. [Google Scholar] [CrossRef]
Garlaschelli, D.; Loffredo, M.I. Multispecies grand-canonical models for networks with reciprocity. Phys. Rev. E 2006, 73, 015101. [Google Scholar] [CrossRef]
Newman, M.E.; Girvan, M. Finding and evaluating community structure in networks. Phys. Rev. E 2004, 69, 026113. [Google Scholar] [CrossRef] [PubMed]
Anand, K.; Bianconi, G. Entropy measures for networks: Toward an information theory of complex topologies. Phys. Rev. E 2009, 80, 045102. [Google Scholar] [CrossRef] [PubMed]
Garlaschelli, D.; Loffredo, M.I. Generalized bose-fermi statistics and structural correlations in weighted networks. Phys. Rev. Lett. 2009, 102, 038701. [Google Scholar] [CrossRef]
Coolen, T.; Annibale, A.; Roberts, E. Generating Random Networks and Graphs; Oxford University Press: Oxford, UK, 2017. [Google Scholar]
Pólya, G.; Szegő, G. Problems and Theorems in Analysis: Series, Integral Calculus, Theory of Functions; Aeppli, D., Translator; Springer: Berlin/Heidelberg, Germany, 1972. [Google Scholar]
Park, J.; Newman, M.E. Solution of the two-star model of a network. Phys. Rev. E 2004, 70, 066146. [Google Scholar] [CrossRef]
Hastings, W.K. Monte Carlo sampling methods using Markov chains and their applications. Biometrika 1970, 57, 97–109. [Google Scholar] [CrossRef]

Figure 1. A graphical illustration of the solution(s) of Equation (53). The solid lines show the RHS of Equation (53) as a function of

u_{i j}

for the different parameters

θ_{i j} \in {- 12, - 8, - 4, - 2, 0, 2, 4, 8, 12}

, while the dashed line shows the LHS, which equals

u_{i j}

itself. For a given parameter value, the solutions of Equation (53) are the intersection between the dashed and the corresponding solid line. Each panel corresponds to a different value of J (in the rest of the paper, we will consider only

J \geq 0

).

Figure 1. A graphical illustration of the solution(s) of Equation (53). The solid lines show the RHS of Equation (53) as a function of

u_{i j}

for the different parameters

θ_{i j} \in {- 12, - 8, - 4, - 2, 0, 2, 4, 8, 12}

, while the dashed line shows the LHS, which equals

u_{i j}

itself. For a given parameter value, the solutions of Equation (53) are the intersection between the dashed and the corresponding solid line. Each panel corresponds to a different value of J (in the rest of the paper, we will consider only

J \geq 0

).

Figure 2. The upper (blue) and lower (red) curves correspond to Equations (65) and (66), respectively, which delimit the region of phase space (yellow area), for which Equation (53) has three solutions. Note that the ‘zero-field’ condition

θ_{i j} = 2 J

is always in the yellow area when

J > 1

, so the condition

J > 1

is sufficient to ensure that the system in zero field is in the magnetized (symmetry-broken) phase.

Figure 2. The upper (blue) and lower (red) curves correspond to Equations (65) and (66), respectively, which delimit the region of phase space (yellow area), for which Equation (53) has three solutions. Note that the ‘zero-field’ condition

θ_{i j} = 2 J

is always in the yellow area when

J > 1

, so the condition

J > 1

is sufficient to ensure that the system in zero field is in the magnetized (symmetry-broken) phase.

Figure 3. Solutions for

u_{i j}

as a function of

θ_{i j}

for different parameter values. The blue and red segments of the curve(s) correspond to the stable and unstable solutions of Equation (53), respectively. Left panel:

B_{i j} = 0

(with J varying accordingly). Middle panel:

B_{i j} = 1

(with J varying accordingly). Right panel: constant value of

J = 1.5

, which translates to a non-constant

B_{i j}

.

Figure 3. Solutions for

u_{i j}

as a function of

θ_{i j}

for different parameter values. The blue and red segments of the curve(s) correspond to the stable and unstable solutions of Equation (53), respectively. Left panel:

B_{i j} = 0

(with J varying accordingly). Middle panel:

B_{i j} = 1

(with J varying accordingly). Right panel: constant value of

J = 1.5

, which translates to a non-constant

B_{i j}

.

Figure 4. Total number of links L (top panels) and inter-layer overlap O (bottom panels) as a function of simulation time using the Metropolis–Hastings algorithm for

J = 1.5

,

N = 100

,

M = 100

. Left panels:

θ = 1.4

. Middle panels:

θ = 1.5 = J

(symmetry-broken case). Right panels:

θ = 1.6

. For fixed J, varying

θ

determines a phase transition from a high-density phase to a low-density phase.

Figure 4. Total number of links L (top panels) and inter-layer overlap O (bottom panels) as a function of simulation time using the Metropolis–Hastings algorithm for

J = 1.5

,

N = 100

,

M = 100

. Left panels:

θ = 1.4

. Middle panels:

θ = 1.5 = J

(symmetry-broken case). Right panels:

θ = 1.6

. For fixed J, varying

θ

determines a phase transition from a high-density phase to a low-density phase.

Figure 5. Relationship between the expected inter-layer overlap

〈 O 〉

and the total number of links

〈 L 〉

in homogeneous multiplexes with

N = 100

,

M = 100

, and

θ_{i} = θ

for all

i = 1, N

. The blue points correspond to simulations obtained via the Metropolis–Hastings algorithm for

J \in {0.0, 0.3, 0.6, 0.9, 1.2, 1.5}

and

θ \in [0.05, 2.00]

in steps of

Δ θ = 0.05

. The open red circles are the corresponding theoretically predicted points. The solid curve corresponds to the quadratic trend

〈 O 〉 = {〈 L 〉}^{2} / N^{2}

predicted for all

J \geq 0

. Multiple solutions for

u_{i j}^{*}

first appear when

J > 1

, but the system keeps following the quadratic trend, albeit drifting away from the central point obtained for the zero-field case

θ = J

(corresponding to a spontaneously broken symmetry).

Figure 5. Relationship between the expected inter-layer overlap

〈 O 〉

and the total number of links

〈 L 〉

in homogeneous multiplexes with

N = 100

,

M = 100

, and

θ_{i} = θ

for all

i = 1, N

. The blue points correspond to simulations obtained via the Metropolis–Hastings algorithm for

J \in {0.0, 0.3, 0.6, 0.9, 1.2, 1.5}

and

θ \in [0.05, 2.00]

in steps of

Δ θ = 0.05

. The open red circles are the corresponding theoretically predicted points. The solid curve corresponds to the quadratic trend

〈 O 〉 = {〈 L 〉}^{2} / N^{2}

predicted for all

J \geq 0

. Multiple solutions for

u_{i j}^{*}

first appear when

J > 1

, but the system keeps following the quadratic trend, albeit drifting away from the central point obtained for the zero-field case

θ = J

(corresponding to a spontaneously broken symmetry).

Figure 6. Relationship between the expected inter-layer overlap

〈 O 〉

and the total number of links

〈 L 〉

in heterogeneous multiplexes with

N = 100

,

M = 100

, and

x_{0, i}

sampled from a power law distribution with different values for

γ

. The colored points correspond to simulations obtained via the Metropolis–Hastings algorithm for

J \in {0.0, 0.3, 0.6, 0.9, 1.2, 1.5}

and

z \in [0.05, 2.00]

in steps of

Δ z = 0.05

. The straight line corresponds to the upper limit

〈 O 〉 = M 〈 L 〉 / 2

calculated in Equation (67). The solid curve corresponds to the quadratic trend

〈 O 〉 = {〈 L 〉}^{2} / N^{2}

(achieved by homogeneous multiplexes with constant

x_{i}

), which here turns out to mark a lower bound. For increasing values of J, and especially as

J > 1

, the system moves closer to the upper bound. For

J = 1.5

, we see that the points are concentrating towards high-density and low-density (symmetry-broken) regimes, drifting away from the intermediate values, like in the homogeneous case. However, this is now the combined result of the behavior of statistically different pairs of nodes, each having a different zero-field condition

θ_{i} + θ_{j} = 2 J

, so the spontaneous symmetry breaking cannot be realized for all node pairs simultaneously.

Figure 6. Relationship between the expected inter-layer overlap

〈 O 〉

and the total number of links

〈 L 〉

in heterogeneous multiplexes with

N = 100

,

M = 100

, and

x_{0, i}

sampled from a power law distribution with different values for

γ

. The colored points correspond to simulations obtained via the Metropolis–Hastings algorithm for

J \in {0.0, 0.3, 0.6, 0.9, 1.2, 1.5}

and

z \in [0.05, 2.00]

in steps of

Δ z = 0.05

. The straight line corresponds to the upper limit

〈 O 〉 = M 〈 L 〉 / 2

calculated in Equation (67). The solid curve corresponds to the quadratic trend

〈 O 〉 = {〈 L 〉}^{2} / N^{2}

(achieved by homogeneous multiplexes with constant

x_{i}

), which here turns out to mark a lower bound. For increasing values of J, and especially as

J > 1

, the system moves closer to the upper bound. For

J = 1.5

, we see that the points are concentrating towards high-density and low-density (symmetry-broken) regimes, drifting away from the intermediate values, like in the homogeneous case. However, this is now the combined result of the behavior of statistically different pairs of nodes, each having a different zero-field condition

θ_{i} + θ_{j} = 2 J

, so the spontaneous symmetry breaking cannot be realized for all node pairs simultaneously.

Figure 7. Relationship between the expected inter-layer overlap

〈 O 〉

and the total number of links

〈 L 〉

in heterogeneous multiplexes with

N = 100

,

M = 100

, and

x_{0, i}

sampled from a power law distribution with

γ = 1

. The blue points correspond to simulations obtained via the Metropolis–Hastings algorithm for

z \in [0.05, 2.00]

in steps of

Δ z = 0.05

with

J = 0

(left panel) and

J = 1.5

(right panel). The red open circles are the theoretically predicted values corresponding to the same parameters used in the simulations. The straight line corresponds to the upper limit

〈 O 〉 = M 〈 L 〉 / 2

calculated in Equation (67). The solid curve corresponds to the quadratic trend

〈 O 〉 = {〈 L 〉}^{2} / N^{2}

(achieved by homogeneous multiplexes with constant

x_{i}

), which here turns out to mark a lower bound. We see that, compared with the homogeneous lower bound, the heterogeneity of nodes increases the overlap dramatically, even in the absence of true coupling (

J = 0

). When coupling is present, the overlap is additionally increased and already approaches the upper bound for

J = 1.5

.

Figure 7. Relationship between the expected inter-layer overlap

〈 O 〉

and the total number of links

〈 L 〉

in heterogeneous multiplexes with

N = 100

,

M = 100

, and

x_{0, i}

sampled from a power law distribution with

γ = 1

. The blue points correspond to simulations obtained via the Metropolis–Hastings algorithm for

z \in [0.05, 2.00]

in steps of

Δ z = 0.05

with

J = 0

(left panel) and

J = 1.5

(right panel). The red open circles are the theoretically predicted values corresponding to the same parameters used in the simulations. The straight line corresponds to the upper limit

〈 O 〉 = M 〈 L 〉 / 2

calculated in Equation (67). The solid curve corresponds to the quadratic trend

〈 O 〉 = {〈 L 〉}^{2} / N^{2}

(achieved by homogeneous multiplexes with constant

x_{i}

), which here turns out to mark a lower bound. We see that, compared with the homogeneous lower bound, the heterogeneity of nodes increases the overlap dramatically, even in the absence of true coupling (

J = 0

). When coupling is present, the overlap is additionally increased and already approaches the upper bound for

J = 1.5

.

Figure 8. Relationship between the expected inter-layer overlap

〈 O 〉

and the total number of links

〈 L 〉

in heterogeneous multiplexes with

N = 100

,

M = 100

, and

x_{0, i}

sampled from a log-normal distribution with different values for

σ

. The colored points correspond to simulations obtained via the Metropolis–Hastings algorithm for

J \in {0.0, 0.3, 0.6, 0.9, 1.2, 1.5}

and

z \in [0.05, 2.00]

in steps of

Δ z = 0.05

. The straight line corresponds to the upper limit

〈 O 〉 = M 〈 L 〉 / 2

calculated in Equation (67). The solid curve corresponds to the quadratic trend

〈 O 〉 = {〈 L 〉}^{2} / N^{2}

(achieved by homogeneous multiplexes with constant

x_{i}

), which here marks a lower bound achieved when

σ \to 0^{+}

. For increasing values of J (genuine coupling) and

σ

(spurious coupling), the system moves closer to the upper bound. For

J > 1

, we see that, starting from the multiplexes with smaller values of

σ

, the points are concentrating towards high-density and low-density (symmetry-broken) regimes, drifting away from the intermediate values, like in the homogeneous and power law cases. To realize this separation for larger values of

σ

, a larger value of J is required.

Figure 8. Relationship between the expected inter-layer overlap

〈 O 〉

and the total number of links

〈 L 〉

in heterogeneous multiplexes with

N = 100

,

M = 100

, and

x_{0, i}

sampled from a log-normal distribution with different values for

σ

. The colored points correspond to simulations obtained via the Metropolis–Hastings algorithm for

J \in {0.0, 0.3, 0.6, 0.9, 1.2, 1.5}

and

z \in [0.05, 2.00]

in steps of

Δ z = 0.05

. The straight line corresponds to the upper limit

〈 O 〉 = M 〈 L 〉 / 2

calculated in Equation (67). The solid curve corresponds to the quadratic trend

〈 O 〉 = {〈 L 〉}^{2} / N^{2}

(achieved by homogeneous multiplexes with constant

x_{i}

), which here marks a lower bound achieved when

σ \to 0^{+}

. For increasing values of J (genuine coupling) and

σ

(spurious coupling), the system moves closer to the upper bound. For

J > 1

, we see that, starting from the multiplexes with smaller values of

σ

, the points are concentrating towards high-density and low-density (symmetry-broken) regimes, drifting away from the intermediate values, like in the homogeneous and power law cases. To realize this separation for larger values of

σ

, a larger value of J is required.

Figure 9. Relationship between the expected inter-layer overlap

〈 O 〉

and the total number of links

〈 L 〉

in heterogeneous multiplexes with

N = 100

,

M = 100

, and

x_{0, i}

sampled from a log-normal distribution with

σ = 1

. The blue points correspond to simulations obtained via the Metropolis–Hastings algorithm for

z \in [0.05, 2.00]

in steps of

Δ z = 0.05

with

J = 0

(left panel) and

J = 1.5

(right panel). The red open circles are the theoretically predicted values corresponding to the same parameters used in the simulations.

Figure 9. Relationship between the expected inter-layer overlap

〈 O 〉

and the total number of links

〈 L 〉

in heterogeneous multiplexes with

N = 100

,

M = 100

, and

x_{0, i}

sampled from a log-normal distribution with

σ = 1

. The blue points correspond to simulations obtained via the Metropolis–Hastings algorithm for

z \in [0.05, 2.00]

in steps of

Δ z = 0.05

with

J = 0

(left panel) and

J = 1.5

(right panel). The red open circles are the theoretically predicted values corresponding to the same parameters used in the simulations.

Figure 10. Comparison of the empirical World Trade Multiplex (WTM) with the zero-coupling (

J^{*} = 0

) benchmark provided by the Average Configuration Model (ACM). The WTM consists of

N = 206

nodes, each representing a country, and

M = 96

layers, each representing a commodity group. The filtered data were obtained by retaining the same number

L^{0}

of strongest links in each layer (hence

L = M L^{0}

links in the entire multiplex), and varying

L^{0}

. Top left: relationship between the expected inter-layer overlap

〈 O 〉

and the total number of links

〈 L 〉

in the WTM (blue), compared with the upper limit

〈 O 〉 = M 〈 L 〉 / 2

calculated in Equation (67) (purple straight line) and the quadratic trend

〈 O 〉 = {〈 L 〉}^{2} / N^{2}

achieved by homogeneous multiplexes (black solid curve). Top right: zoomed-in version of the top left panel, showing that the empirical data follow an intermediate scaling between the two extremes. Center left: cumulative distributions reporting the number

F (x)

of nodes with hidden variable larger than x in the ACM, obtained for different values of

L^{0}

(see legend). Center right: same as the top right panel with the addition of the relationship produced by the ACM benchmark, showing that the empirical WTM (blue) has a higher overlap than the corresponding null model having zero inter-layer coupling but the same degree heterogeneity (orange). Bottom left: log–log plot of the relationship between the overlap and the number of links in the empirical WTM, along with a power law fit of the form

O = A L^{α}

, where the fitted exponent is

α = 1.19

. Bottom right: log–log plot of the same relationship in the ACM benchmark with no coupling, along with a power law fit of the form

O = A L^{α}

, where the fitted exponent is

α = 1.06

.

Figure 10. Comparison of the empirical World Trade Multiplex (WTM) with the zero-coupling (

J^{*} = 0

) benchmark provided by the Average Configuration Model (ACM). The WTM consists of

N = 206

nodes, each representing a country, and

M = 96

layers, each representing a commodity group. The filtered data were obtained by retaining the same number

L^{0}

of strongest links in each layer (hence

L = M L^{0}

links in the entire multiplex), and varying

L^{0}

. Top left: relationship between the expected inter-layer overlap

〈 O 〉

and the total number of links

〈 L 〉

in the WTM (blue), compared with the upper limit

〈 O 〉 = M 〈 L 〉 / 2

calculated in Equation (67) (purple straight line) and the quadratic trend

〈 O 〉 = {〈 L 〉}^{2} / N^{2}

achieved by homogeneous multiplexes (black solid curve). Top right: zoomed-in version of the top left panel, showing that the empirical data follow an intermediate scaling between the two extremes. Center left: cumulative distributions reporting the number

F (x)

of nodes with hidden variable larger than x in the ACM, obtained for different values of

L^{0}

(see legend). Center right: same as the top right panel with the addition of the relationship produced by the ACM benchmark, showing that the empirical WTM (blue) has a higher overlap than the corresponding null model having zero inter-layer coupling but the same degree heterogeneity (orange). Bottom left: log–log plot of the relationship between the overlap and the number of links in the empirical WTM, along with a power law fit of the form

O = A L^{α}

, where the fitted exponent is

α = 1.19

. Bottom right: log–log plot of the same relationship in the ACM benchmark with no coupling, along with a power law fit of the form

O = A L^{α}

, where the fitted exponent is

α = 1.06

.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Bayrakdar, N.; Gemmetto, V.; Garlaschelli, D. Local Phase Transitions in a Model of Multiplex Networks with Heterogeneous Degrees and Inter-Layer Coupling. Entropy 2023, 25, 828. https://doi.org/10.3390/e25050828

AMA Style

Bayrakdar N, Gemmetto V, Garlaschelli D. Local Phase Transitions in a Model of Multiplex Networks with Heterogeneous Degrees and Inter-Layer Coupling. Entropy. 2023; 25(5):828. https://doi.org/10.3390/e25050828

Chicago/Turabian Style

Bayrakdar, Nedim, Valerio Gemmetto, and Diego Garlaschelli. 2023. "Local Phase Transitions in a Model of Multiplex Networks with Heterogeneous Degrees and Inter-Layer Coupling" Entropy 25, no. 5: 828. https://doi.org/10.3390/e25050828

APA Style

Bayrakdar, N., Gemmetto, V., & Garlaschelli, D. (2023). Local Phase Transitions in a Model of Multiplex Networks with Heterogeneous Degrees and Inter-Layer Coupling. Entropy, 25(5), 828. https://doi.org/10.3390/e25050828

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Local Phase Transitions in a Model of Multiplex Networks with Heterogeneous Degrees and Inter-Layer Coupling

Abstract

1. Introduction

2. Background Theory

2.1. Single-Layer Network Definitions

2.2. Multiplex Network Definitions

2.3. Exponential Random Graph Models for Multiplexes

2.4. Maximum Likelihood Parameter Estimation

2.5. Benchmark: Independent Layers Model

3. The Overlapping Average Configuration Model

3.1. Constructing the Hamiltonian

3.2. Calculating the Partition Function

4. Local Phase Transitions in the Model

5. Numerical Analysis

5.1. Exploring the Parameter Space

5.1.1. Homogeneous Fitness: Erdős–Rényi Graphs with Overlap

5.1.2. Power-Law-Distributed Fitness: Scale-Free Networks with Overlap

5.1.3. Log-Normally Distributed Fitness

6. Analysis of the World Trade Multiplex

7. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A. Hubbard–Stratonovich Transform

Appendix B. Maximum Likelihood

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI