1. Introduction and Motivation
The study of complex networks is an interdisciplinary topic that covers several areas of knowledge, such as computer science, mathematics, physics, biology, and sociology; see [
1,
2,
3,
4] and references therein. Concerning the emergence and importance of social networks in recent times, networks play a central role in representing the way ideas and information spread in modern society. A considerable part of our opinions and decisions are directly or indirectly determined by our social contacts. Social influence is a critical factor when we adopt a certain behavior, make a decision, adopt an innovation, or formalize our cultural, political, and religious ideologies; see [
5].
The study of dynamic networks seeks to explain the evolution of propagation phenomena that are influenced by peers, such as the spread of epidemics, dissemination of information, and popularization of ideas and choices. The networks used to represent these connections can be created from real data or by using previously defined networks from the literature, such as classical random networks, small-world networks, and scale-free networks, among others; see [
6]. Thus, it is possible to apply dynamic diffusion models to the chosen networks in order to understand these phenomena. These propagation processes are referred to as social contagion, as they are reminiscent of how a disease is transmitted between the individuals in a population. Several authors have presented different approaches to this theme over the past decades; see [
7] for a complete and updated compilation of references in this field.
Influence maximization was first proposed in [
8], which was later interpreted as a discrete optimization problem. This typification was a landmark in the research on influence maximization, and was soon considered an NP-hard problem (nondeterministic polynomial time hardness, which is a class of problems that have at least exponential complexity; see [
9]) for the case when the given information propagation model is an independent cascade or a linear threshold model. On the one hand, a large number of publications on this topic have been presented, including two categories of classical influence maximization algorithms, namely, greedy algorithms and heuristic algorithms. While the greedy algorithm has high accuracy, it is computationally time-consuming; on the other hand, the heuristic algorithm is more efficient at the cost of accuracy. Most of the existing studies are based on static networks, although a few researchers have devoted themselves to the study of influence maximization in dynamic networks. Several of these works consider the dynamics in the information dissemination process, such as the dynamic activation probability, dynamic threshold, and dynamic perception, while others consider the dynamics of the network topology as links are added or removed over time. On this subject, see [
7,
10,
11,
12].
In the present work we study the linear threshold model in random networks, specifically in Erdös-Rényi networks; see [
13,
14]. The linear threshold model explains how an individual can be influenced depending on a previously established threshold value. This means that a node is only activated if the influence applied on it by its neighbours exceeds a certain value. Generally, the threshold value is randomly chosen by taking into account a certain range of values; see [
15,
16] and references therein. In our case, the threshold value is estimated taking into account the underlying network parameters, particularly from the expected value for the network nodes degree.
The rest of this work is organized as follows. In
Section 2, the definitions of the activation threshold and influence activation mapping for Erdös-Rényi random networks are presented. The activation threshold expresses the network topology and its global dynamics. On the other hand, the influence activation mapping is defined by relying on the local dynamics of each node of the network under analysis. As a consequence of the previous definitions, the seed set for implementation of the diffusion process on Erdös-Rényi networks is characterized.
Section 3 elaborates on the establishment of the theoretical procedure for the linear threshold model over the Erdös-Rényi networks. This procedure mathematically formalizes the diffusion process of the linear threshold model for this type of random network; in this section, we highlight the generality of the conditions required for its implementation.
In
Section 4, we analyze the influence maximization dynamics for the linear threshold model, i.e., we intend to state the conditions that will trigger this diffusion process. In particular, our focus is on the activation of the diffusion process considering the local and global dynamics of the networks (population dynamics). From these results, analytical properties for the probability of activating the influence diffusion process can be obtained for a randomly chosen seed, and these properties are related to the nodes with greater centrality degree in the Erdös-Rényi networks. The results revealed in this section highlight the complexity of the chosen approach as well as the richness and usefulness of the centrality measures.
Section 5 is devoted to the study of a topological order for the Erdös-Rényi networks, which is established at the expense of the definition of topological entropy for this type of random network. In this context, asymptotic behaviour of the spectral radius of the adjacency matrix of the Erdös-Rényi networks is used. This approach to the topological entropy, provided through the spectral radius of the corresponding adjacency matrices, characterizes the topological dynamics of Erdös-Rényi networks. Sufficient conditions are presented for this definition of topological entropy to be characterized by the expected value of the node degree of the considered random networks. Consequently, several properties are proved and a relation between the activation threshold and the topological entropy is established.
In
Section 6, several numerical case studies are performed according to different choices for the seed set considering random or chosen seeds within the centrality measures, namely, degree, closeness, and betweenness. Finally, in
Section 7 we discuss the work, provide our conclusions, and outline future research.
2. Influence Activation Mapping for the Linear Threshold Model
In this section we introduce an influence activation mapping for the linear threshold model which incorporates an activation threshold
; see [
15]. These characterizations allow us to present a definition for the seed set of the linear threshold model for the type of diffusion networks analyzed. Throughout this work, the considered type of diffusion networks are Erdös-Rényi random networks. An Erdös-Rényi random network consists of
n nodes, where
and each pair of nodes is linked by a connection or an edge with a certain probability
. The edges between the nodes are independently and randomly generated with the same probability
p; see [
13,
14]. Erdös-Rényi random networks are usually represented by
, where
is the node set and represents the individuals,
indicates that there are
individuals, and the edge set
E represents the connections between different individuals. Information propagates through the connections in the network. Throughout this paper, we denote by
the neighbourhood of
, i.e., the nodes of
V that are linked to
, and by
the degree of node
for
. For more details on Erdös-Rényi networks, see for example [
1,
2,
3,
4] and references therein.
Concerning the classical models of influence, we consider the linear threshold model, where every node at each instant has one of two possible states, namely, inactive or active; see [
17,
18]. We deem the network as unweighted, i.e., with a constant threshold value for every node. The choice of the threshold value
is a very important feature in terms of linear threshold based influence maximization. This one of two vital points, along with the degree distribution of the network; see [
16]. While there is no specific guideline or property for the choice of a given threshold, our purpose in this article is to define values that allow diffusion. These considerations lead us to reintroduce the following definition (for more, see [
19]).
Definition 1. Let be an Erdös-Rényi random network. The activation threshold θ of the network is defined bywhere is the expected value for the node degree of the network. Note that this activation threshold is dependent on the network size n and the connection probability p; thus, this invariant reflects the network topology and its global dynamics.
In the linear threshold model, a node becomes active if and only if the ratio between the active neighbours of at time t (denoted by ) and the degree of that node (denoted by ) is higher than the threshold value . Thus, in the following definition we introduce the concept of influence activation mapping in a network, in particular for the Erdös-Rényi random network .
Definition 2. Let be an Erdös-Rényi random network, with being a node at discrete time for and θ the being the activation threshold provided by Equation (1). The influence activation mapping , whereis defined as follows: We remark here that the influence activation mapping
f is defined acoording to the local dynamics of each node
of the network
under analysis. Further, note that when considering the influence activation mapping
f provided by Equation (
2) we can recursively define the mapping of the active neighbours of a node
,
, in terms of the influence activation mapping
f in the following way:
with
for
and for each discrete time
, where
represents the initial instant of the diffusion process. It should be emphasized again that in the approach explored in this work, the topological structure of the network
is considered in the definitions of the influence activation mapping
f and the active neighbours mapping
provided by Equations (
2) and (
3), respectively.
In order to further explain this procedure, we can consider the seed set as the subset of
V that specifies the initial conditions for the recursion mapping of the active neighbours, as provided by Equation (
3). This determines which nodes are activated at the initial state of the diffusion process. It is our purpose to first choose these nodes according to different properties, then analyze and compare the diffusion effects of these choices as concerns the influence maximization problem. The concept of a seed set already exists; see for example [
17,
18] and references therein. However, here we present a formal definition for the seed set characterized according to the influence activation mapping
f at the initial instant
.
Definition 3. Let be a contact Erdös-Rényi network and let be the node state at the initial instant , with . The seed set of the network is defined bywhere . 3. Theoretical Procedure for the Linear Threshold Model
In this section, we introduce a theoretical procedure for the linear threshold model applied to diffusion networks of the Erdös-Rényi random network type. In the linear threshold model, a node is activated if and only if the influence exerted on it by its neighbours exceeds a certain value. This means that if the value exceeds the threshold
for a specific node, the node becomes active. This indicates that it has adopted an idea, obtained information, changed its behavior, etc.; see [
18] and references therein.
Procedure 1. Let be an Erdös-Rényi random network, let f be the influence activation mapping provided by Equation (2), and let be the seed set provided by Definition 3. The theoretical procedure of the linear threshold model for the network follows the next steps: - 1.
At the initial instant , the nodes in the seed set are all active, i.e., - 2.
At instant , if node is activated, then node remains activated, i.e., - 3.
If at instant a node is not activated, then the influence activation function is verified as provided in Equation (2); - 4.
At each instant , processes 2 and 3 are repeated until reaching the following stopping criteria:i.e., at instant t the vector at iteration t is equal to the vector in the previous iteration.
The procedure formalized above corresponds to a finite number of steps, which are described as follows. In step 1, it is stated that the seed set corresponds to the active nodes. Step 2 guarantees that an active node remains active throughout the procedure. Step 3 corresponds to the nodes that are not activated and can change their state. Finally, step 4 provides a stopping condition for the procedure.
Remark 1. Note that if in step 3 this situation corresponds to the case where the nodes in the seed set are the only nodes that are active when the procedure ends.
Remark 2. The former theoretical procedure is presented with no restrictions on the values of the linear threshold model. Moreover, there are no restrictions on the underlying contact networks. Nevertheless, it is important to emphasize the following features:
- 1.
The connectivity of Erdös-Rényi networks depends on the value of the connection probability p and the size of the network n. If , then there is no giant component. Consequently, there will be no diffusion regardless of the propagation model considered.
- 2.
Concerning the linear threshold model, the theoretical procedure applies to every value of the threshold θ.
4. Influence Activation Dynamics for the Linear Threshold Model
Considering Remark 1 from the previous section, we want to analyze under which conditions there are active nodes other than the seed set
. In another words, we intend to state the conditions that trigger the diffusion process for the linear threshold model. First, however, we need to recall certain features regarding connectivity of the Erdös-Rényi network
according to the values of its parameters
n and
p. For the extreme values of
p,
, and
we have a disconnected network and a complete network, respectively, while for
we have the emergence of a giant component, which is an example of a phase transition in Erdös-Rényi networks; see [
20]. In this case, this rather abrupt change characterizes the transition between the most disconnected and most connected networks for the values of
p; see [
2].
More specifically, for an Erdös-Rényi random network with
n nodes and connection probability
p, the supercritical regime is defined as the range of
p values such that the expected size of the largest connected component scales with
n, meaning that there is a very high probability of there being no nodes outside the giant component. This regime is characterized by the fact that the probability of there being a giant component goes to one as the network size increases. In contrast, in the subcritical regime, i.e.,
, the probability of a giant component goes to zero as the network size increases, while in the critical regime, for
the probability of a giant component approaches a non-zero constant as the network size becomes large. For more details on topological and phase transitions in Erdös-Rényi networks, see for example [
1,
4].
Throughout this section, we consider a single seed, i.e.,
, with the value of the connection probability
p satisfying
, in order to ensure that the network has a giant component and that
is the activation threshold provided by Equation (
1). Suppose, without loss of generality, that
with
and that
lies in the giant component of the network
. Let us further suppose that
has degree one, i.e.,
. Let
be the only node in
. According to the influence activation mapping
f provided by Equation (
2), at instant
node
is active if and only if
. From Definition 1, we know that
, meaning that node
is activated if and only if,
where
is the expected value for the node degree of the network
. On the other hand, the degree distribution of the network
follows a binomial law with the same parameters; thus, we can write
In conclusion, if a seed corresponds to a pendent node on the giant component of the network , then the set of the active nodes through the diffusion procedure is equal to the seed set with probability .
Let us now consider a more general case. Suppose that
with
, where
is a node in the giant component of an Erdös-Rényi network
, and further suppose that
has degree
, i.e.,
, with
. For each
, to simplify the notation, we denote by
the degree of node
for
. It is our purpose here to analyze the case where
can activate other nodes in its neighbourhood. Using the same arguments as in the previous case, node
does not activate any neighbour if and only if the following condition holds,
Let us now compute the probability of the previous event,
The first equality results from the fact that the degree distribution of each node is independent. Considering the above explanations, we are now in a condition to state the following result.
Proposition 1. Let be an Erdös-Rényi random network, f the activation mapping provided by Equation (2), the initial instant, and the seed set provided by Definition 3, where and . If with , thenfor . In Proposition 1, we obtained the probability of initiating the diffusion process for a randomly chosen seed for the linear threshold model over Erdös-Rényi networks.
Now, let us consider , where is the node with a greater degree of centrality in the network . Under these conditions, we need to introduce the following definition.
Definition 4. Let be an Erdös-Rényi random network, f the activation mapping provided by Equation (2), the initial instant, and the seed set provided by Definition 3, where and . The influence activation probability mapping is defined as follows:for . We are now in a position to establish the corollary that is a consequence of the previous proposition and definition.
Corollary 1. Under the assumption in Proposition 1 and Definition 4, the influence activation probability mapping g attains its maximum if and only if v is the node with greater centrality degree in the Erdös-Rényi network .
These remarks lead us to a very important and well-known topic, namely, the influence maximization problem, which asks for the best seed set to provide the maximum number of activated nodes at the final stage (see [
9,
18]). In this particular case, we can ask whether the best choice of a seed to initiate the diffusion process is also the best choice for having more active nodes at the final stage. In other words, is the best trigger also the best spreader? If not, what happens if we choose a seed considering other centrality measures for detecting influential nodes, such as the closeness or betweenness centrality, or even other measures?
5. Topological Order in Erdös-Rényi Random Networks and Activation Threshold
Spectral techniques have a very important role in recent graph theory, in particular concerning random graphs; see for example [
1,
2,
3,
4] and references therein. The results obtained from spectral graph theory, along with bounds or specific values for graph eigenvalues, provide tools for several graph algorithms. Erdös-Rényi networks, or random graphs
have an adjacency matrix
A that can be analyzed as a random symmetric matrix with a diagonal equal to zero and the remaining entries being ones with probability
p and zeroes with probability
. This allow us to relate the behaviour of these matrices using the well-known behaviour of symmetric matrices and their spectra.
Within this framework, we can study a topological invariant associated with the dynamics between the nodes of the Erdös-Rényi networks
, which we refer to as the network topological entropy. Usually, the network entropy is the entropy of a stochastic matrix associated with the adjacency matrix
A; see [
21] and references therein. Considering
,
as the adjacency matrix of the Erdös-Rényi network
, where
denotes the spectral radius of
A, let
be the corresponding leading eigenvector. It has been shown that
satisfies a variational principle and that the supremum over all possible stochastic matrices is attained for the unique stochastic matrix
,
, defined by
with stationary distribution
; see [
21,
22,
23,
24,
25] and references therein. Under these conditions, the network entropy of this dynamical process is characterized as follows:
From the previous results, we establish the definition of network entropy for the networks .
Definition 5. Let be an Erdös-Rényi network and let A be the adjacency matrix associated with the spectral radius λ. The topological entropy of is defined by Note that the approach taken in this work is the network topological entropy concept used in [
25,
26,
27,
28]. In this context, the topological entropy of the Erdös-Rényi network
characterizes the topological dynamics of this type of network through the spectral radius
of the corresponding adjacency matrices
A. The topological entropy established in Definition 5 defines a topological order on the Erdös-Rényi random network
.
We can say that a graph property holds
almost surely (or a.s.) in
if the probability that
has
is
. In this section, we use the asymptotic behaviour of the spectral radius
of the adjacency matrix
A of
. Let us denote by
the maximum degree in
. There is a general result stating that the spectral radius of
satisfies
where
, as
. For more details on this subject, see [
29] and references therein. This result was improved in [
30], where several values were considered for the probability
p. Suppose that we have
and that the probability
p satisfies the following condition:
for a sufficiently large network size
n; then, the spectral radius of the Erdös-Rényi network
is almost certain to be provided by
Considering the results established in Definition 5 and Equation (
8), we are now in a position to establish the following result.
Proposition 2. Let be an Erdös-Rényi network and let A be the adjacency matrix associated with spectral radius λ. If p satisfies Equation (7) and the network size n is sufficiently large, then the topological entropy of is provided by Clearly, the topological entropy of the Erdös-Rényi network
is characterized by the expected value of the node degree
, with
p satisfying Equation (
7) and
n being sufficiently large. Therefore, this topological invariant is dependent on the network size
n and the connection probability
p, which reflects the topology of the Erdös-Rényi networks under analysis. In the following results, several properties concerning the topological entropy of
are established.
Proposition 3. Let be an Erdös-Rényi network and let be the topological entropy of provided by Equation (9). The following properties hold true: - (P1)
If the network size n is fixed and sufficiently large and if p satisfies Equation (7) with and , then the topological entropy increases with the increase of p and is verified: - (P2)
If the network size and if p satisfies Equation (7), then .
Proof. Considering the characterization of the topological entropy of
provided by Equation (
9), and attending to the monotony of logarithmic function, we are able to conclude that
increases with the growth of
p, with
p satisfying Equation (
7) and the network size
n being fixed and sufficiently large. On the other hand, again following the hypothesis that
p satisfies Equation (
7), the inequalities of Equation (
14) are established using the equality provided by Equation (
9). Thus, item
(P1) is proved.
The proof of the result of item
(P2) follows from Equation (
9) and the behaviour of the logarithmic function when
and
p satisfies Equation (
7). This completes the proof of the proposition. □
The above results provide properties of the topological entropy of the Erdös-Rényi network
, which are typified as a dependence of the network parameters
n and
p, i.e., they reflect the topological structure of the network under study.
Figure 1a illustrates the results of Proposition 3.
Because the expression for the topological entropy
only holds when the values of
p lie in the interval
for
, it is our purpose to perform a deeper analysis concerning the possible variation in the values of the probability
p as
n becomes larger. Consider the following real functions defined in the positive real numbers:
Computing the derivatives of each function
and
, we obtain the following expressions:
Through analysis of each derivative function
and
, we can find that for
, considering the natural numbers
, both sequences
and
decrease and both sequences tend to zero as
. In particular, we can establish the following relationship between the variation intervals of the connection probability
p provided by Equation (
7) between the network sizes
n and
.
Property 1. Let be a probability and a natural number such that . Under the assumption in Proposition 2, the following inequalities hold true: Moreover, for every such that , which corresponds to the maximum value for the probability p.
Activation Threshold Dynamics and Topological Entropy
The definition of the activation threshold
in the Erdös-Rényi network
provided in Definition 1 has no restrictions on the network topology, i.e., on the
n and
p parameters, and as such can be improved by taking into account Equation (
8). However, as set out in
Section 5, the parameters
n and
p have restrictions:
p must satisfy Equation (
7), and the network size
n must be sufficiently large. With these considerations, the definition of the activation threshold
can be rewritten.
Definition 6. Let be an Erdös-Rényi network and let A be the adjacency matrix associated with the spectral radius λ. Considering a value of p that satisfies Equation (7) and sufficiently large network size n, the activation threshold θ of the network is defined by Figure 1b illustrates the comparison between Definition 1 and Definition 6 of the activation threshold
.
The characterization of the activation threshold of
provided by Equation (
12) allows us to reinterpret the results set out in Proposition 3. Considering the definition of topological entropy provided by Proposition 2 and attending to the definition of the activation threshold
for the network
provided by Equation (
12), we are able to conclude that
Therefore, the activation threshold in the linear threshold model for under the conditions in Definition 6 can be defined through the topological entropy of the network, as proven above.
Proposition 4. Let be an Erdös-Rényi network, θ the activation threshold of provided by Equation (6), and the topological entropy of provided by Equation (9). Then, the following properties hold true: - (P1)
If the network size n is fixed and sufficiently large and if p satisfies Equation (7) with and , then the activation threshold decreases with the increase of p and is verified: - (P2)
If the network size and if p satisfies Equation (7), then the activation threshold .
The proof of the above results follows from Equation (
13) and from arguments analogous to the proof of Proposition 3.
Figure 1b illustrates the results of Proposition 4.
Remark 3. The results proved in this section allow us to redefine the influence activation mapping f provided by Equation (2) in Definition 2 using the relationship between the local dynamics of the nodes (measured through the active neighbours of a node at time t along with the degree of the node) and the global dynamics of the network (measured by the topological entropy of ), i.e., 6. Numerical Studies
In this section, it is our purpose to present numerical simulations concerning a number of different goals. On one hand, we intend to illustrate the results proved in the previous sections. On the other, we wish to analyze the results of our simulations, where it is not possible to state theoretical results, with the aim of addressing the influence maximization problem. We present four case studies that have the same parameters for the underlying network and the linear threshold model, while differing in the choice of the seed between random, degree, closeness, and betweenness centrality measures. First, let us recall a few definitions.
Definition 7. Let be an Erdös-Rényi random network and let with :
- 1.
The degree centrality is , where is the degree of the node
- 2.
The closeness centrality is , where represents the distance from to
- 3.
The betweenness centrality is , where is the number of paths between and , and is the number of paths between and that contain .
The choice of the seed set
is related to the detection of influential nodes on a contact network. On one hand, the degree centrality concerns the network’s local dynamics, while on the other different centrality measures such as closeness and betweenness take into account the network’s global dynamics; see [
31].
For each study case presented in the following sections, we intend to compare the effect of the seed choice on the dynamics of influence maximization. We want to illustrate two different scenarios from Proposition 1: first when the process is activated, and second when the theoretical procedure reaches step 4 at . Because activation is a necessary condition that is not sufficient for diffusion, we simulate two different situations concerning the diffusion process.
Because the influence maximization problem is NP-hard (see [
9]), numerical simulations are essential to comparing and discussing this phenomenon. In this section we present results for the linear threshold model by comparing the simulations in a family of forty Erdös-Rényi networks
for
. Among other features, the diffusion relies on the network connectivity and the threshold value. Concerning the connectivity, we perform numerical studies considering the two most connected regimes in the Erdös-Rényi networks, i.e., where a giant connected component is present. These regimes are the supercritical regime for
and the connected regime for
. Recall that in the supercritical regime the giant component may have cycles and the other connected components are mainly trees, while in the connected regime the order of the giant component is almost
n. All the simulations presented were generated in python.
6.1. Case Study 1: Random Seeds
In this section, we present the numerical simulations of two case studies with the seed set random and . For the implementation of Procedure 1, the theoretical procedure for the linear threshold model which is mathematically formalized in this work, we use an Erdös-Rényi network with nodes, connection probability , and activation threshold .
In
Figure 2a, the Erdös-Rényi network
is plotted at the initial instant
; node
is the active node or seed, which was chosen randomly. In this case, the theoretical procedure is not activated, i.e., for every instant
the procedure verifies step 4 (the stopping condition); see
Figure 2b.
In
Figure 3a, the same network is activated by node
, which again was randomly selected. There remains no diffusion of influence, and step 4 in the theoretical procedure is verified at instant
with only two active nodes; see
Figure 3b. In conclusion, in
Figure 2 the theoretical procedure is not activated according to Proposition 1, while
Figure 3 the procedure is activated. Neither simulation case involves diffusion of influence.
When considering the numerical simulations of the 40 Erdös-Rényi networks, we chose a larger value for the network size at
. In the supercritical regime, represented by
Figure 4a, we have
and
, while in the connected regime in
Figure 4b we have
and
. Note that we consider these values throughout all the subsequent subsections. In the first regime, most of the cases are not activated, and for those that are there is no diffusion. Conversely, in the connected regime, when activated we obtain influence maximization. In this case, the growth curve of the activated or influenced individuals is of the sigmoid type (the red graphics in
Figure 4a), while the curve of nonactivated individuals is an inverse sigmoid function (the green graphics in
Figure 4a).
6.2. Case Study 2: Seeds With Higher Degree Centrality
In order to compare the different choices for the only seed, throughout the cases studies we preserve the network topology and the activation threshold
established in case study 1.
Figure 5a plots the Erdös-Rényi network
at the initial instant
, where node
is the node with higher degree. As can be observed in
Figure 5b, not only is the theoretical procedure activated, the total maximum influence is obtained after nine iterations. Note that the only inactive node
is outside the giant component of the network.
For the degree centrality, it can be observed that a great many networks are activated in
Figure 6a, even though the maximum of active nodes is approximately 25. In
Figure 6b, when a network is activated maximum influence is obtained with the activation of almost all of the 200 nodes.
6.3. Case Study 3: Seeds With Higher Closeness Centrality
Figure 7a plots the Erdös-Rényi network
at the initial instant
, where node
is the node with higher closeness centrality. As in the previous case study, it can be verified that the theoretical procedure is activated in
Figure 7b and that influence maximization is fulfilled. The only inactive node,
, is outside the giant component of the network. The stopping condition is achieved rather quickly at instant
. Under these conditions, influence maximization is achieved over the entire connected component of the network
, to which the seed node belongs.
Again, similar to the previous subsection, there is no influence maximization for the network topology defined by
Figure 8a, while either almost complete activation or no activation is obtained in the connected regime in
Figure 8b. The influence spreads somewhat slower than in the previous case.
6.4. Case Study 4: Seeds with Higher Betweenness Centrality
In our last case study, the seed is chosen according to the betweenness centrality attained in node
. In
Figure 9a, the Erdös-Rényi network
is plotted at the initial instant
, while in
Figure 9b the theoretical procedure leads to the situation of maximum influence. Again, regarding the network topology, the only remaining inactive node
is disconnected from all of the other nodes. In this case, the influence maximization is faster than in case study 3.
The differences between the two regimes are very deep. In these simulations, when the seed is chosen considering the betweenness centrality, as in
Figure 10a, we obtain the highest number of active nodes in the supercritical regime. In
Figure 10b it is possible to identify two limit situations: the procedure stops at instant
, or the number of active nodes is almost
.
In conclusion, as referred to previously, in general the influence maximization problem has no theoretical solution even when numerical approaches are considered. In this section we have investigated different choices of seeds, which provide very similar simulation results; however, the network topology is preserved, and it may be the case that this situation leads to the obtained results.
7. Discussion and Conclusions
In this work, we have considered the linear threshold model over the family of Erdös-Rényi random networks , where n is the network size and p is the probability of two nodes being connected. Throughout this paper, it has been our intention to analyze and discuss the influence, activation, and maximization dynamics in a specific population, as well as to identify and characterize certain properties of the underlying network. Our starting point was the mathematical formalization of the linear threshold diffusion model by reference to the definition of the influence activation mapping in Definition 2. This formalization led us to the establishment of a theoretical procedure for the linear threshold model, provided by Procedure 1. In particular, we have highlighted the general conditions under which this procedure is established, as explained in Remark 2. As a consequence, an expression for the activation probability can be obtained under this model considering the case of a unique seed, as provided Proposition 1. At this point, it is possible to discuss the maximum value of the influence activation probability mapping depending on the centrality degree of the network, raising the question of whether other centrality measures can be used as the criterion of choice for seed or activation nodes.
Regarding the topology of Erdös-Rényi random networks, by using the spectral radius of the asymptotically defined associated adjacency matrices, we obtained an explicit expression for the topological entropy for this type of random network by considering different values of
p provided by Definition 5. In particular, this definition allowed us to characterize the topological entropy through the expected value of the network node degree. This result, provided by Proposition 2, determines the topological invariant under study via the behaviour of the global dynamics of the network. According to this definition, a topological order can be presented for Erdös-Rényi random networks. By taking a dynamical systems point of view, the approach to the topic of influence maximization discussed in this paper allowed us to prove topological entropy properties in Erdös-Rényi random networks as provided by Proposition 3. Furthermore, a relation between the activation threshold and the topological entropy was derived considering Definition 6. Consequently, it was possible for us to redefine the influence activation mapping in Equation (
2) provided by Definition 2 through the relation between the local dynamics of the nodes (measured through the active neighbours of a node at time
t and the degree of the node) and the global dynamics of the network (measured by the topological entropy of
). The culmination of this result is certainly the most original contribution of the present work, relating discrete dynamical systems theory and influence maximization in Erdös-Rényi networks via the linear threshold model.
In general, the influence maximization problem is theoretically difficult. Moreover, it is known that the spread of influence is computationally complex under any diffusion model for a given seed set. In particular, the linear threshold model is in fact an NP-hard problem; see [
9]. Because there are no analytical results that provide a good seed set, we have performed numerical case studies in order to analyze different choices for the activation nodes, which are exhibited and discussed in
Section 6.
In fact, the subject under discussion is highly complex and interdisciplinary, and the approaches and topics to be analyzed can be very diverse. As a future perspective, network stability analysis is very important in many practical applications; for an example, see the approach presented in [
32].