A Multi-Information Dissemination Model Based on Cellular Automata

Shao, Changheng; Shao, Fengjing; Liu, Xin; Yang, Dawei; Sun, Rencheng; Zhang, Lili; Jiang, Kaiwen

doi:10.3390/math12060914

Open AccessArticle

A Multi-Information Dissemination Model Based on Cellular Automata

by

Changheng Shao

¹,

Fengjing Shao

¹,

Xin Liu

²,

Dawei Yang

²,

Rencheng Sun

^1,*,

Lili Zhang

¹ and

Kaiwen Jiang

¹

College of Computer Science and Technology, Qingdao University, Qingdao 266071, China

²

College of Computer Science and Technology, China University of Petroleum, Qingdao 266580, China

^*

Author to whom correspondence should be addressed.

Mathematics 2024, 12(6), 914; https://doi.org/10.3390/math12060914

Submission received: 22 December 2023 / Revised: 24 February 2024 / Accepted: 16 March 2024 / Published: 20 March 2024

(This article belongs to the Special Issue Multi-attribute Decision Making and Intelligent Computing in Smart Governance)

Download

Browse Figures

Versions Notes

Abstract

Significant public opinion events often trigger pronounced fluctuations in online discourse. While existing models have been extensively employed to analyze the propagation of public opinion, they frequently overlook the intricacies of information dissemination among heterogeneous users. To comprehensively address the implications of public opinion outbreaks, it is crucial to accurately predict the evolutionary trajectories of such events, considering the dynamic interplay of multiple information streams. In this study, we propose a SEInR model based on cellular automata to simulate the propagation dynamics of multi-information. By delineating information dissemination rules that govern the diverse modes of information propagation within the network, we achieve precise forecasts of public opinion trends. Through the concurrent simulation and prediction of multi-information game and evolution processes, employing Weibo users as nodes to construct a public opinion cellular automaton, our experimental analysis reveals a significant similarity exceeding 98% between the proposed model and the actual user participation curve observed on the Weibo platform.

Keywords:

public opinion; cellular automata; multi-information game; propagation dynamics

MSC:

94-10

1. Introduction

Numerous components within complex systems exhibit intricate connectivity patterns and dynamic processes, often characterized by mutual dependencies among heterogeneous units or the coupling of multiple layers, resulting in the coexistence of diverse relationships among homogeneous units [1,2]. In the realm of a complex and interconnected network, public opinion is influenced by a myriad of factors encompassing unique individual values, beliefs, cultural contexts, and social affiliations. Consequently, a multitude of information dissemination phenomena coalesce within this intricate network, where various pieces of information interact dynamically throughout the dissemination process, giving rise to instances of either reinforced or attenuated dissemination which reflect the intricate dynamics of public opinion propagation.

Data from the China Internet Network Information Center [3] reveals that the number of internet users and the penetration rate in China has reached 1.079 billion and 76.4%, respectively, underscoring the widespread adoption of the Internet. However, users navigating social networks often lack a comprehensive understanding of public opinion and are frequently exposed to rumors and misinformation, posing threats to public safety and the social order [4]. Consequently, this phenomenon often leads to a cascading and secondary spread of public opinion.

Hence, public opinion dissemination analysis has become a pressing concern among scholars, necessitating an exploration of the mechanisms governing public opinion dissemination, especially with the coexistence of multiple pieces of information. In the broader context of research on the evolution of online public opinion abroad, a multidisciplinary approach prevails. Scholars commonly employ three established strategies to delve into the intricacies of public opinion dissemination: complex networks, propagation dynamics and cellular automata. In detail, complex networks provide insights into network structures and connectivity patterns facilitating the understanding of topology and evolution [5,6,7]; propagation dynamics modeling offers insights into consensus emergence and information cascades, but it may suffer from limited predictive power and sensitivity to initial conditions [8,9,10,11,12]; cellular automata offers a simplified and interpretable strategy for simulating complex system dynamics, although they may oversimplify real-world dynamics and require refinement for accurate modeling [13,14,15]. Notably, research focusing on public opinion dissemination based on cellular automata has emerged as a prominent area in public opinion analysis.

Consequently, there is a need to develop more sophisticated propagation mechanisms based on the cellular automata model to enhance the accuracy of public opinion trend prediction. In this paper, we propose the SEInR model that incorporates different states to capture the complexities of public opinion dynamics based on the classical SEIR (Susceptible-Exposed-Infectious-Recovered) model. By implementing the SEInR model within a cellular automaton framework, we capture the nuanced interactions between multi-information dissemination, thereby enhancing the reliability and precision of our predictions. Finally, the reliability and accuracy of the model are validated through experiments.

2. Related Works

2.1. Propagation Dynamics

The utilization of social networks facilitates the exchange of diverse information, leading to multiple emotional expressions. Users heavily rely on friendships within online social networks to express and propagate information opinions. The dissemination of online public opinion information follows a systematic, dynamic and complex life cycle, necessitating a comprehensive study of intricate mechanisms. With complex topology feature, online social networks contain diverse collection of various popular online sentiments [16,17,18].

Propagation dynamics are a crucial facet of information dissemination which explore how information spreads and evolves across diverse social contexts, encompassing vehicular transportation systems, urban structures, viral infections and public opinion dynamics. Understanding user behavior and psychology dynamics is essential for comprehending public opinion formation. This field primarily explores how information dissemination occurs, how messages are interpreted, and how communication influences network dynamics. Research in transmission dynamics often adopts a multidisciplinary approach, drawing insights from communication theory, sociology, psychology, anthropology and related disciplines.

In the realm of information dissemination, several prevalent models exist, including the following:

(1): The Susceptible–Infectious (SI) model [19] represents the simplest model that can capture the transition from susceptible to infectious states. Shao et al. introduced a SIn model for multi-information spreading and demonstrated its ability to imitate and predict dynamic behaviors [20].
(2): The Susceptible–Infectious–Susceptible (SIS) model [21] extends the SI model, allowing for repeating or recurring infections with infected individuals returning to the susceptible state. Xuan et al. proposed a network continuous-time SIS model coupled with individual opinion dynamics [22].
(3): The Susceptible–Infectious–Recovered (SIR) model [23,24] is a generic epidemiological model that describes the transmission of infectious diseases through individuals who transit between susceptible, infectious, and recovered states. Han et al. analyzed the impact of human activity patterns on information diffusion using the SIR model [25].
(4): The Susceptible–Knowledgeable–Infectious–Recovered (SKIR) model extends the traditional SIR model by introducing a “knowledgeable” state, where individuals have been exposed to both the disease or information and counteracting knowledge. Xiao et al. proposed an SKIR rumor propagation model to describe the propagation of rumors and the dynamic changes in the influence of anti-rumor information [26].
(5): The Susceptible–Exposed–Infectious–Recovered (SEIR) model [26,27] introduces an “exposed” state between being susceptible and infectious, representing individuals who have been exposed to the disease or information but are not yet infectious. This model is particularly relevant for diseases with an incubation period. Li et al. proposed a public opinion evolution HK–SEIR model which combines the opinion fusion HK and the epidemic transmission SEIR models [28].

2.2. Cellular Automata

A cellular automaton (CA) is a discrete grid-based dynamic model capable of simulating the spatiotemporal evolution processes of complex systems, wherein spatial interactions and temporal causality are localized. By iteratively applying predefined rules based on the states of neighboring cells, the CA drives the evolution of these systems [29,30,31]. Its versatility spans diverse fields, including image processing [32,33,34] and traffic management [35,36,37], underscoring its profound influence on scientific inquiry and technological innovation.

Meanwhile, a CA is utilized in the context of information dissemination [38,39] to model the spread of information through a network of interconnected cells or nodes. Each cell represents an individual or entity within the network, and the state of each cell evolves over discrete time steps and states of neighboring cells according to predefined rules. By simulating the interactions and dynamics between cells, it provides insights into the propagation of information, the interpretation of messages and the influence of communication on networks. The model helps researchers to understand the complex mechanisms underlying information dissemination and predict trends in public opinion dynamics. As such, public opinion prediction based on CA has been a hot topic recently. Liu et al. [40,41] proposed public opinion cellular automata for situation deduction to predict the possible trending of public events. It has been proved that the cellular automata-based prediction of the number of participants and emotional trending are more accurate than other methods.

While cellular automata have demonstrated promise in modeling the dissemination of public opinion, existing research has predominantly concentrated on two-dimensional relationships, thus constraining their ability to capture the intricacies of opinion dissemination within networked environments. Addressing this gap, our paper introduces the SEInR model, a novel application of a CA to the multi-state of public opinion dissemination. By employing meticulous analysis and empirical validation, our goal is to enhance our understanding of public opinion dynamics and advance computational modeling methodologies in the social sciences.

3. Detailed Model

3.1. Model Characteristic

3.1.1. Model Definition

Definition 1.

Node State. Although existing models have been widely used in the spread of public opinion, they usually do not consider the reality of information dissemination among multiple infected users. Here, an infected user is defined as a user who has been exposed to public opinion and influenced by it, forming a certain attitude or opinion that is resistant to further influence. In reality, the interaction of multiple pieces of information within a network is complex, leading to various states such as negative and positive. However, there is a lack of comprehensive models to describe the propagation process and interactions between multiple pieces of information with different infected states.

Given the premise assumption of the diffusion process for multiple pieces of public opinion information and the transfer process for individual states, it becomes necessary to construct a multi-information dissemination mechanism based on cA. This paper proposes an SEInR model with features for multi-information public opinion propagation, aiming to analyze the propagation process of public opinion in crowds with high accuracy and objectivity.

In the enhanced cellular automata model, node states are categorized into

n + 3

types:

{S, E, I_{1}, \dots, I_{n}, R}

. This classification is illustrated by Figure 1 and Table 1:

Definition 2.

State Transition. In the cellular automata model, a state is usually transited to a different state. When the state of a node transits from the

Γ^{t - 1} (n o d e)

to the

Γ^{t} (n o d e)

, the

Γ^{t - 1} (n o d e)

stands as the source state of the node in

t - 1

time and the

Γ^{t} (n o d e)

stands as the current state of the node in t time.

Definition 3.

Node model. Within the cellular automata model, nodes serve as representations of users within the network’s public opinion space. These nodes undergo state transitions based on varying conditions. For example, nodes initially uninvolved in a topic may transit to either the “Exposed State” or the “Infectious State” upon encountering public opinion information. Similarly, nodes in an Engaged state may shift to the “Exposed State” when influenced by neighboring nodes in the propagating state. Nodes in the “Exposed State” are susceptible to influence from neighboring nodes in different states, potentially leading to state transformation into other types. Over time, nodes in the “Exposed State” or “Infectious State” may recover to the “Recovered State” due to various factors. This dynamic framework encapsulates the evolving feature of public opinion dissemination within the network, shedding light on the intricate interplay between different states and the impact of neighboring nodes on state transitions. The differential equation of node model is defined as follows:

\frac{d s (t)}{d t} = - (β_{12} s (t) e (t) + β_{13} s (t) i_{1} (t) + \dots + β_{1 (m - 1)} s (t) i_{n} (t))

(1)

\frac{d e (t)}{d t} = β_{12} s (t) e (t) - (β_{23} e (t) i_{1} (t) + \dots + β_{2 (m - 1)} e (t) i_{n} (t) + β_{2 m} e (t) r (t))

(2)

\frac{d i_{1} (t)}{d t} = β_{13} s (t) i_{1} (t) + \dots + β_{(m - 1) 3} i_{n} (t) i_{1} (t) - β_{3 m} i_{1} (t) r (t) - β_{32} i_{1} (t) i_{n} (t)

(3)

\frac{d r (t)}{d t} = β_{2 m} e (t) r (t) + β_{3 m} r (t) i_{1} (t) + \dots + β_{(m - 1) 3} i_{n} (t) i_{1} (t)

(4)

Here,

β

corresponds to transitions of probability from the

Γ^{t - 1} (n o d e)

to the

Γ^{t} (n o d e)

. And

s (t)

,

i_{n} (t)

,

e (t)

,

r (t)

represent the proportions of nodes in state S, In, E and R, respectively.

Definition 4.

Inference rule. In the cellular automata model, the state transition of user nodes and propagation is expressed by the following:

Γ {(n o d e)}^{t} \in S, E, I_{1}, \dots, I_{n}, R

(1): S→E Transition:

$Γ {(n o d e)}^{t} = E, i f (N_{p} - N_{n} δ_{1}) + (W_{p} - W_{n} δ_{2}) + (N_{s} - N_{n} δ_{3}) > 1 a n d Γ {(n o d e)}^{t - 1} = S .$

(5)

where $N_{p}$ represents the number of user nodes in the neighborhood space of a user node expressing a certain opinion, $N_{n}$ represents the total number of user nodes in the neighborhood space of a user node, $W_{p}$ represents the total number of user nodes in the entire space expressing a certain opinion, $W_{n}$ represents the total number of user nodes in the entire space and $β_{N} s$ represents the total number of user nodes in the neighborhood expressing a certain opinion; $δ_{i}$ represents the transition factor.
(2): S,E→I Transition: When users engage in discussions on public opinion topics, they may exhibit a proactive or reactive behavior when expressing their viewpoints. These behaviors must be processed with different conditions. Active Propagation: If a cellular node is actively participating, there exists a probability that it will contribute relevant commentary in the subsequent time step. This probability is determined by $I_{v}$ the “Independent Opinion Index” of the node.
Passive Propagation: If a cellular node is in the participating state, neighboring nodes exhibit a propagation influence greater than that of the node at the previous time step; then, it may passively transition to a propagation state (I) with a certain probability.
Continuation of Participation: If the node remains in a participating state without entering either the active or passive propagation states, it will persist in this state until it meets the conditions for exiting.
(3): E,I→R Transition: When both the permanent exit time limit and the temporary exit time limit are less than 0, the user will transit to the exit state (R).

3.1.2. Model Properties

3.1.2.1. Balance State. In the process dissemination of information, the network is in equilibrium when the values of

s (t), e (t), i_{1} (t), \dots, i_{n} (t), r (t)

remain unchanged.

3.1.2.2. Dissemination influence. In order to enable user nodes to reflect the situation of the entire network’s public opinion space, each node contains eight attributes. These are the user’s node influence I, the independent opinion index

I v

, the forwarding index

S c

, opinion firmness

S p

, opinion interest

I p

, emotional inclination

M u

, frequency of idle remarks

I s

and interest list

I l

. The value of the attributes will affect the user’s response to different public opinion information, such as whether they are interested in the topic and which emotional attitude they hold. The attribute values of each user node are determined based on their own historical comments. In addition to attributes, user nodes will also be in different states during the deduction process. The changes to cellular state nodes are introduced as follows:

The dissemination influence I of node V reflects the importance of the opinion subject in the online public opinion space. The more influential the node is, the more users will accept the opinion published by the node, which can even affect the emotional and interest tendencies of its fan nodes. In the model proposed in this article, the node influence of the user node is based on the user’s activity degree

W_{1}

and the dissemination degree

W_{2}

. Here, the activity degree

W_{1}

is computed based on the total number of user comments

X_{1}

and the total number of original comments

X_{2}

. While dissemination degree is comprehensively computed based on the number of reposts

X_{3}

, the number of responses

X_{4}

, reposts of original content

X_{5}

, responses to original content

X_{6}

and likes received.

Based on the weight ratio of each part in Table 2, the formula for calculating node dissemination influence is shown in the following equation:

I = (C W_{1} + (1 - C) W_{2})

(6)

The calculation formula for user activity

W_{1}

and the calculation formula for the activity of user comments

W_{2}

are shown in the following equations, respectively.

\begin{matrix} W_{1} = λ_{1} ln (X_{1} + 1) + λ_{2} ln (X_{2} + 1) \\ λ_{1} + λ_{2} = 1 \\ 0 \leq λ_{i} \leq 1 \end{matrix}

(7)

\begin{matrix} W_{2} = α_{1} ln (X_{3} + 1) + α_{2} ln (X_{4} + 1) + α_{3} ln (X_{5} + 1) \\ + α_{4} ln (X_{6} + 1) + α_{5} ln (X_{7} + 1) \end{matrix}

(8)

\begin{matrix} α_{1} + α_{2} + α_{3} + α_{4} + α_{5} = 1 \\ 0 \leq α_{i} \leq 1 \end{matrix}

(9)

By collecting historical comments from users, and distinguishing between forwarded and original comments, we can obtain the number of reposts, responses and likes per post. Based on the above formula, the user’s node influence can be calculated. The larger the value of the dissemination influence, the greater the user’s influence in the online public opinion space.

3.1.2.3. Independent Opinion Index. In the online public opinion space, some users enjoy exploring others’ comments and forwarding them, but rarely actively express certain opinions themselves. At the same time, some users are the main publishers of online public opinions. Furthermore, these users will actively express their opinions to participate in opinion dissemination when public opinion events occur. The user’s independent opinion index “Iv” is used to measure the probability of users actively expressing their own opinions when participating in online public opinion events. In the cellular automata modeling, the independent opinion index will be used to determine whether a user node will make comments about public opinion events or not. The calculation formula is shown as follows:

I v = \frac{X_{2}}{X_{1}}

(10)

where

X_{2}

represents the number of original user comments and

X_{1}

represents the number of all comments made by the user. The higher the user’s independent opinion index, the more frequently the user will make original comments when participating in public opinion events.

3.1.2.4. Forwarding index. The user’s forwarding index

S c

is used to measure the likelihood of users actively disseminate information when participating in online public opinion events. In the network public opinion inference model, the forwarding index will be used to determine whether user nodes should disseminate their opinions during public opinion events or not. The calculation formula is shown as follows:

S c = 1 - I v

(11)

And the sum of the user’s forwarding index and the user’s independent opinion index is one. The higher the user’s forwarding index is, the greater the probability that the user will forward relevant opinions when participating in public opinion events.

3.1.2.5. Opinion firmness. In the online public opinion space, users have a certain level of judgment ability regarding the comments they receive, making it difficult for the received comments to affect their original views. The firmness of opinion

S p

signifies the extent to which individuals’ personal opinions are susceptible to external influence from neighboring users and the surrounding environment. The following steps are designed to calculate opinion firmness. We denote this process in Algorithm 1.

In the described algorithm, the initial step involves the extraction of keywords from all user comments and forwards within a specified public opinion event, leading to the creation of an opinion keyword matrix. Subsequently, the Principal Component Analysis (PCA) dimensionality reduction algorithm is applied to compress the vectors within this matrix into two-dimensional representations. During this reduction process, two-dimensional vectors that meet predefined criteria for local density and relative distance between centers are identified, effectively serving as clustering centers among the reduced vectors. A higher count of such clustering centers indicates a more dispersed distribution of user opinions during the event, reflecting a decreased level of conviction or firmness in their expressions. This analytical methodology yields valuable insights into the dynamics of public opinion dissemination and user engagement within the investigated context.

3.1.2.6. Topic Initiation Ability. Certain users exhibit a keen interest in exploring a diverse array of public opinion topics and demonstrate a high sensitivity to trending internet discussions. Typically, these users engage in public opinion events not by relying on interactions with neighboring nodes but rather by monitoring hot search topic rankings to stay abreast with events and actively participate in relevant discussions. The frequency of idle comments made by users serves as a metric to gauge their responsiveness to public opinion hotspots within the broader online environment. The subsequent procedure outlined herein is tailored to compute the frequency of idle comments made by users. We denote this process in Algorithm 2.

Algorithm 1: Firmness of opinion

Algorithm 2: Topic Initiation Ability

Input: All comments posted and forwarded by the user as the user’s opinion dataset

D s

Output: The frequency of idle comments by this user

I s

;

1: Obtain all comments posted and forwarded by the user as the user’s opinion dataset $D s$ ;
2: Perform steps 2 to 8 in the above Algorithm 1;
3: Set the discrete local density threshold $L d_{t}^{'}$ and the discrete relative distance threshold $R d_{t}^{'}$ , and count the number of vectors $C n$ in $W_{P C A}$ with local density less than $L d_{t}^{'}$ and relative distance greater than $R d_{t}^{'}$ ;
4: Return( $\frac{C n}{X_{1}}$ )

Based on Algorithm 2, the vectors in

W_{P C A}

with a local density less than the discrete local density threshold

L d_{t}^{'}

and relative distance greater than the discrete relative distance threshold

R d_{t}^{'}

which can be considered as discrete points in all user comment data. The greater the number of discrete points, the broader the range of public opinion topics that users are engaged with, indicating a heightened sensitivity to events within the online public opinion space. Such users are predisposed to readily participate in topic events without relying on information dissemination from neighboring nodes.

In the online public opinion space, each user maintains their own set of interests. Despite the topic’s overall popularity, users tend to be more receptive to comments within their interest areas, while showing less engagement with topics outside those areas. Consequently, user participation becomes challenging when topics fall beyond their interests.

The interest list, denoted as

I l

, comprises a q-dimensional vector. Each dimension represents a distinct field of interest, with values ranging between 0 and 1. A higher value indicates greater interest in the corresponding field, while a lower value reflects lesser interest.

To establish the interest list, we compute the occurrences of keywords from different fields across all user comments. The frequency of keywords in a particular field directly correlates with the user’s interest level in that field. The value of the i-th dimension in the interest list is calculated as follows:

I l_{i} = \frac{K w_{i}}{K w_{m a x}}

(12)

where

K w_{i}

represents the number of times a keyword in the i-th interest field appears in the user’s entire opinion, and

K w_{m a x}

represents the maximum number of occurrences of keywords in each of the q domains across all user comments. Therefore, the dimension corresponding to the domain of interest that users are most interested in is set to one. Throughout the inference process, an interest list serves as an effective way of demonstrating the user’s level of interest in comments across various fields.

3.1.3. Sentiment Orientation

The sentiment orientation refers to the degree of intensity with which a subject expresses positive or negative emotions towards an object. These varying degrees of emotion are typically conveyed through different emotional words or tones. To accurately capture this phenomenon, it is common practice to assign different weights to each emotional word.

Furthermore, user sentiment orientation refers to the inclination of users to align themselves more closely with specific emotional orientations when interacting with particular public opinion topics. It also encompasses the emotional orientations they are prone to express when providing comments or opinions.

To improve the accuracy of computing user sentiment orientation, this study extends the existing sentiment lexicon. By leveraging this enhanced sentiment lexicon, following is proposed in this paper to accurately compute user sentiment orientation, thereby offering a deeper insight into how users express and engage with emotions in the context of public opinion discussions. It is computed as following:

Collect a vocabulary of positive words, negative words, negative words and degree adverbs;
Obtain all comments posted and forwarded by the user, and initialize the emotional value $M = 0$ for each comment;
Use a new word-discovery algorithm based on the association confidence of the word segmentation of each user’s opinion data;
Traverse through the word sequence obtained from step 3 for each statement. If a keyword appears in the positive word library, determine whether the previous word is a definite or degree adverb. If it is a negative word, reduce the value of M by one; If it is a degree adverb, increase the value of M by two; If it is not a negative word or a degree adverb, increase the value of “M” by one; If a keyword appears in the negative word vocabulary, determine whether the previous word is a definite or degree adverb; If it is a negative word, increase the value of M by one; If it is a degree adverb, decrease the value of M by two; If it is not a negative word or degree adverb, decrease the value of M by one.
Based on step 4, calculate the emotional value of each comment from the user, and the user’s sentiment orientation is $M u = \frac{1}{n} \sum_{i = 1}^{n} M_{i}$ , where n is the total number of comments made by the user, $M_{i}$ is the sentiment orientation value of the user’s i-th comment.

Compute the average emotional value of a user’s opinion using the above procedure to determine the user’s sentiment orientation. This orientation influences the emotional bias of users’ comments during the deduction process. A higher sentiment orientation indicates a predominantly positive emotional stance, whereas a lower orientation suggests a more negative emotional disposition.

3.2. Model Definition

3.2.1. Preliminary Segmentation

To better address the segmentation and computation of short texts in public opinion and enable an accurate assessment of the firmness of user node comments, this paper proposes a novel word discovery algorithm based on association confidence. In order to prevent the occurrence of accidental word units and their left and right adjacent words with a 100% association confidence, it is necessary to merge the short text before preliminary word segmentation to increase the length of the text to be segmented. Assuming a total of n text data to be segmented, merge each m text datum to obtain the merged text segment

T e x t^{'} = T e x t_{1} + T e x t_{2} + \dots + T e x t_{m}

, resulting in a total of

⌈ n m ⌉

merged text segments. For each merged text segment, use the precise mode from the Jieba tool library to segment, and obtain multiple segmentation results

T c^{'}

. For each segmentation result

T c^{'}

, proceed to the next step of processing.

3.2.2. Correlation Confidence

The correlation confidence level of the association between each word unit and its left and right adjacent word units is determined based on each segmentation result, denoted as

T c

. Candidate new words are obtained by merging multiple word units that satisfy the correlation confidence threshold

T h

, forming the candidate new word set W.

There are two scenarios with candidate new words in the candidate new word set: one involves consolidating overly fragmented words to obtain correct new word results, while the other pertains to phrases formed from excessive merging. If combinations of multiple word units appear multiple times and repetitively in the text, they are merged due to meeting the correlation confidence threshold. Some phrases may represent longer text segments formed by the combination of multiple correct words, while others may denote lengthier named entities, such as network terminologies or foreign names. Consequently, it is necessary to filter the candidate new words in the candidate new word set, retaining only those with practical significance and smaller.

3.2.3. Splitting Conjunctions

To refine the granularity of newly identified words, it is crucial to address cases of excessive merging within phrases. This necessitates further segmentation of phrases, especially when conjunctions are detected within candidate new words. Specific measures are taken to ensure optimal segmentation when candidate new words contain conjunctions. It is computed as follows:

Compute the average correlation confidence for each connecting word $w_{i}$ and its adjacent word $w_{j}$ to the left or right; this is the average value of $R C o n f (w_{i} \to w_{j})$ and $R C o n f (w_{j} \to w_{i})$ ;
If the average correlation confidence values differ between a connecting word and its adjacent word units, the candidate new word undergoes splitting. The split point is determined between the connecting word and the adjacent word units with lower average correlation confidence values.
When the average correlation confidence value is consistent among a connecting word and its adjacent word units, maintain the merging state of the two word units. Proceed to identifying the next connecting word in the candidate new word.
Following the aforementioned steps of connecting word splitting, the resulting word unit sequence represents the final word segmentation outcome of the text segment. This sequence encompasses both newly formed words by combining word units and separated connecting words.
By implementing the splitting of connecting words within candidate new words, the phrase blocks formed from merging multiple word units can be dismantled. This process effectively reduces the granularity of the final new word result while ensuring semantic coherence. Consequently, the accuracy of the new word result is enhanced.

Drawing upon the newly proposed new word discovery algorithm, a comprehensive word segmentation process can be executed on both pre-existing historical data and recently acquired public opinion texts, facilitating the precise identification of words imbued with distinct semantic nuances within Chinese sentences. Subsequently, established keyword extraction techniques such as TF-IDF can be harnessed to distill pivotal terms from the textual corpus. Furthermore, the utilization of an incremental association rule mining model enables the exploration of association patterns between keywords entrenched in the historical dataset and those introduced in the newly acquired corpus. This integrated approach fosters a nuanced comprehension of semantic structures and evolving trends within the corpus, thereby augmenting the efficacy of text analysis and information retrieval methodologies.

3.2.4. Emotional Calculation

The calculation rule for emotional value information is as follows:

When there are no neighboring users posting comments, the user’s emotional value and information entropy are updated to the emotional value and information entropy of the received comments.
When neighboring users make comments, the user’s emotional value is represented by the product of the impact of their idle comments and the average emotional value of all users neighboring comments; the impact of idle user comments is represented by the product of the average information entropy of all user neighbor comments.

The exit topic deduction rule is the following:

Establish a fixed duration for users’ participation in an event, setting a permanent exit time limit. This limit decreases by 1 after each round of user engagement in the event. When the permanent exit time limit reaches or falls below 0, the user discontinues their involvement in the event discussion, opting out permanently.
Upon a user’s initial participation in an event, assign a temporary exit time limit. With each successive round of user activity in the event, this limit decreases by 1 in the absence of any comments from the user. If the temporary exit time limit drops to or below 0, the user’s departure from the event discussion is subject to reconsideration based on the participation discussion rule.

4. Experiments and Results

The experiments were conducted on a 64 bit Windows 10 operating system, employing Python 3.7 as the programming language within the development environment PyCharm 2020.

Before conducting the experiments, we initialized the inferred public opinion event information, representing domains that are relevant to public opinion events using topic vectors. Each dimension of the topic vector corresponds to a distinct domain, with values ranging from 0 to 1 indicating the relevance of the public opinion event to the represented domain content. The initial public opinion event comprises emotional values, entropy of multiple information pieces and the set of nodes initially employed for information dissemination.

To evaluate the impact of different factors on the inference process and results, we conducted comparative experiments for the selection of the initial propagation nodes and performed inference simulations. Through systematic experimentation and analysis, we aim to provide insights into the effectiveness and versatility in deducing public opinion dynamics.

4.1. SEInR Model

In this paper, we used a game inference cellular automaton model. We conducted an experimental comparison between the ordinary inference cellular automaton model and the game inference cellular automaton model (SEInR) constructed in this article, shown in the following Figure 2:

The conventional cellular automaton inference model relies solely on basic inference rules. In contrast, the SEInR model concurrently processes positive, negative, and neutral information, providing a comprehensive view of the information dissemination dynamics. In the diagram above, the “Number of participants” depicted by the red line represents the total number of individuals engaging in the topic. The “Number of active users” illustrated by the dark blue line signifies those involved in discussions pertaining to positive information, while the “Number of passive users” represented by the light green line denotes individuals participating in discussions regarding negative information. Additionally, the “Number of neutral users” indicated by the orange line reflects participation in discussions centered on neutral information.

Empirical analysis of the comparative experiment depicted in Figure 2 reveals a significantly higher number of users engaged in discussions on negative and neutral information topics in the right figure compared to the left figure. This observation underscores the enhanced effectiveness of the game-based cellular automaton model (SEInR) in information dissemination, thereby enabling a more realistic simulation of information spread. These findings suggest that the SEInR model presented in this study outperforms the conventional cellular automaton model in multi-information dissemination scenarios.

4.2. Information Dissemination with Different Strategy

In order to explore the impact of information placement at different time points on the dissemination of user node information, this article conducted a comparative experimental analysis based on the SEInR model. In the experiment, this article used user nodes at different time points as initial propagation nodes, and used different information dissemination strategies to compare the impact of different time points and information dissemination strategies to verify the method.

4.3. Dissemination with Different Time

A comparative experiment was conducted to assess the impact of information placement at different time points on information dissemination. It was designed as follows: (1) Simultaneous dissemination of Information: Positive, negative, and neutral types of information are concurrently disseminated. (2) Delayed dissemination of information (Positive): Initially, negative and neutral information is disseminated, with the dissemination of positive information being delayed.

The left portion of Figure 3a illustrates the results of the experiment where positive, negative, and neutral types of information were simultaneously delivered to observe information dissemination. Conversely, Figure 3b demonstrates the scenario where negative and neutral information were delivered first, followed by a three-round delay in the placement of positive information, thereby facilitating the observation of changes in information dissemination.

Through our comparative experiments and empirical analysis, it was found that the timing of information delivery plays a regulatory role in the game-like propagation of information. There is a certain correlation between the timing of delivering different types of information and their propagation speed. Additionally, mutual interactions among different types of information were observed. Our experimental results vividly demonstrate the complexity of the information game process, providing valuable insights for a deeper understanding of information dissemination mechanisms.

4.4. Dissemination with Different Thresholds

A comparative experiment was conducted to assess the impact of different thresholds on information dissemination. It was designed as follows:

The transition threshold of state transition was set to 70%. In order to observe the impact of threshold on information dissemination in SEInR model for users transferring from a state of not participating in a topic to a state of participating in a topic, two comparative experiments were conducted by adjusting the values of transition threshold. Figure 4 and Figure 5 were created by simultaneously introducing both positive, negative, and neutral information with transition thresholds of 0.2, 0.3, 0.7 and 0.8.

As shown by Figure 4, it was found that the adjustment of the transition threshold had a moderating effect on the speed and duration of information dissemination, so a lower transition threshold can lead to more people participating in the topic and a longer duration of information dissemination. Moreover, as shown by Figure 5, it was found that adjusting parameter a has a certain moderating effect on the speed of information dissemination, but has no effect on the duration of information dissemination. In conclusion, combining the two experiments, it is shown that transition threshold has significant impact on the speed of information dissemination at lower values and also has a certain impact on the duration of information dissemination. While a higher transition threshold has a slight impact on the speed of information dissemination.

4.5. Inference

This article verifies the reliability of the deduction model through an empirical analysis of a public opinion event to verify the performance of the model. A comparative experiment was conducted between the evolution of positive and negative information user participation change curves using SEInR and the positive and negative information user participation change curves from real events. As shown by Figure 6, the cosine similarity between the curve of the true positive user participation and the evolution of positive user participation is 99.56%, while the cosine similarity between the curve of the real negative user participation curve and the evolution of negative user participation curve is 98.35%. It is proved that the game propagation model has a high accuracy in predicting the evolution trend of positive and negative participants.

In all, it is found that the principal factors influencing information dissemination in the online public opinion space included the domain of opinion, the node influence of the initial propagation node and the timing of information placement. These findings align with the propagation patterns observed in real online public opinion spaces, validating the reliability of the model proposed in this study through comparisons with historical event data. The model presented herein facilitates real-time monitoring of the popularity of topic events in the public opinion space, user emotional tendencies, user engagement in topic discussions and opinion dissemination status. By inferring the model’s evolution process, predictions regarding the impact of public opinion topic events on cyberspace can be made, including assessments of the event’s scope and its effects on user emotions.

5. Conclusions

This paper is grounded in the principles of cellular automata and successfully constructs a sophisticated network model. It comprehensively addresses the dynamics of public opinion information dissemination and individual emotional tendencies. Through experimental verification using real user data from Weibo, it is demonstrated that the inference model (SEInR) proposed herein effectively captures changes in the state of user nodes within the cellular space and the dynamics between information during the inference process. The proposed model accurately mirrors real shifts in user behavior during public opinion event dissemination and offers a degree of predictability regarding event development, thereby creating an effective solution for complex network research.

In future endeavors, there is a potential to broaden the scope of the model, considering a more comprehensive array of factors both within and beyond the cellular space. Aiming to enhance the fidelity of simulations to real-world network public opinion dissemination scenarios. Moreover, there is room for enriching and refining the intricacies of information deduction rules, thereby shedding light on the intricate interactions among different types of information. Ultimately, this research endeavored to provide deeper insights into the field of public opinion analysis and to furnish a more scientifically grounded basis for formulating information dissemination strategies in cyberspace.

Author Contributions

C.S., F.S. and R.S. developed the initial idea for the study. C.S., X.L. and D.Y. designed the research methodology, including data collection, experimental procedures. L.Z. and K.J. created figures, tables of the presentation of results. C.S., F.S. and R.S. wrote the initial version of the manuscript. R.S. provided oversight and guidance throughout the research. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

All data generated or analyzed during this study are included in this published article.

Conflicts of Interest

All authors certify that they have no affiliations with or involvement in any organization or entity with any financial interest or non-financial interest in the subject matter or materials discussed in this manuscript.

References

De Domenico, M. More is different in real-world multilayer networks. Nat. Phys. 2023, 19, 1247–1262. [Google Scholar] [CrossRef]
Jin, C.; Yin, C.; Jin, X.; Min, Y.; Li, Y.; Chen, N.; Huang, J. Group-based rewiring rules of binary opinion competition dynamics. Sci. Rep. 2018, 8, 14423. [Google Scholar] [CrossRef] [PubMed]
China Internet Network Information Center. Statistical Report on the Development of China Internet Network; China Internet Network Information Center: Beijing, China, 2023. [Google Scholar]
Geng, L.; Zheng, H.; Qiao, G.; Geng, L.; Wang, K. Online public opinion dissemination model and simulation under media intervention from different perspectives. Chaos Solitons Fractals 2023, 166. [Google Scholar] [CrossRef]
Luo, Y.; Ma, J. The influence of positive news on rumor spreading in social networks with scale-free characteristics. Mod. Phys. C 2018, emph29. [Google Scholar] [CrossRef]
Dai, L.; Shi, L.; Xie, G. Public opinion analysis of complex network information of local similarity clustering based on intelligent fuzzy system. Intell. Fuzzy Syst. 2020, 39, 1693–1700. [Google Scholar]
Jin, N. The Research on the Public Opinion Dissemination in Universities based on the Two-layer Coupling Network. In Proceedings of the International Conference on Frontiers of Electronics, Information and Computation Technologies (ICFEICT 2021), Changsha, China, 21–23 May 2021; p. 109. [Google Scholar]
Xiao, Y.; Chen, D.; Wei, S.; Li, Q.; Wang, H.; Xu, M. Rumor propagation dynamic model based on evolutionary game and anti-rumor. Nonlinear Dyn. 2018, 95, 523–539. [Google Scholar] [CrossRef]
Wu, B.; Yuan, T.; Qi, Y.; Dong, M. Public Opinion Dissemination with Incomplete Information on Social Network: A Study Based on the Infectious Diseases Model and Game Theory. Complex Syst. Model. Simul. 2021, 1, 109–121. [Google Scholar] [CrossRef]
Morita, S. Six Susceptible-Infected-Susceptible Models on Scale-free Networks. Sci. Rep. 2016, 6, 22506. [Google Scholar] [CrossRef] [PubMed]
Xiao, Y.; Zhang, L.; Li, Q.; Liu, L. MM-SIS: Model for multiple information spreading in multiplex network. Phys. A Stat. Mech. Its Appl. 2019, 513, 135–146. [Google Scholar] [CrossRef]
Heng, K.; Althaus, C.L. The approximately universal shapes of epidemic curves in the Susceptible-Exposed-Infectious-Recovered (SEIR) model. Sci. Rep. 2020, 10, 19365. [Google Scholar] [CrossRef] [PubMed]
Chen, H.; Song, Y.; Dan, L. Research on Cellular Automata Network Public Opinion Transmission Model Based on Combustion Theory. J. Phys. Conf. Ser. 2020, 95, 20–22. [Google Scholar] [CrossRef]
Liu, X.; Zhao, Q.; Wang, X.; Dong, X.; Li, Y.; Tian, Y. Iteratively Tracking Hot Topics on Public Opinion Based on Parallel Intelligence. IEEE J. Radio Freq. Identif. 2023, 7, 158–162. [Google Scholar] [CrossRef]
Liu, S.; Zhang, X. The Research of Public Opinion Inversion Based on Swarm Intelligence and Emergence. In Proceedings of the 2023 16th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI), Taizhou, China, 28–30 October 2023; pp. 1–6. [Google Scholar]
Barabási, A.L.; Albert, R. Emergence of scaling in random networks. Science 1999, 286, 509–512. [Google Scholar] [CrossRef]
Barabási, A.L.; Oltvai, Z.N. Network biology: Understanding the cell’s functional organization. Nat. Rev. Genet. 2004, 5, 101–113. [Google Scholar] [CrossRef]
Watts, D.J.; Strogatz, S.H. Collective dynamics of ‘small-world’ networks. Nature 1998, 393, 440–442. [Google Scholar] [CrossRef]
Pastor Satorras, R.; Vespignani, A. Epidemic Spreading in Scale-Free Networks. Phys. Rev. Lett. 2001, 86, 3200–3203. [Google Scholar] [CrossRef] [PubMed]
Shao, F.; Sun, R.; Li, S. A model of Multi-Information Dissemination with suppressed action. Complex Syst. Complex. Sci. 2010, 7, 47–51. [Google Scholar]
Allen, L.J.S.; Bolker, B.M.; Lou, Y.; Nevai, A.L. Asymptotic profiles of the steady states for an SIS epidemic reaction-diffusion model. Disc. Cont. Dyn. Syst. 2008, 21, 1–20. [Google Scholar] [CrossRef]
Xuan, W.; Ren, R.; Pare, P.E.; Ye, M.; Ruf, S.; Liu, J. On a Network SIS Model with Opinion Dynamics. IFAC Pap. Online 2020, 53, 2582–2587. [Google Scholar] [CrossRef]
Lv, X.; Fan, D.; Li, Q.; Wang, J.; Zhou, L. Simplicial SIR rumor propagation models with delay in both homogeneous and heterogeneous networks. Phys. A Stat. Mech. Its Appl. 2023, 627, 129131. [Google Scholar] [CrossRef]
Zhang, L.; Su, C.; Jin, Y.; Goh, M.; Wu, Z. Cross-network dissemination model of public opinion in coupled networks. Inf. Sci. 2018, 451, 240–252. [Google Scholar] [CrossRef]
Han, S.C.; Liu, Y.; Chen, H.L.; Zhang, Z.J. Influence Model of User Behavior Characteristics on Information Dissemination. Int. J. Comput. Commun. Control 2016, 11, 209–223. [Google Scholar] [CrossRef]
He, S.; Peng, Y.; Sun, K. SEIR modeling of the COVID-19 and its dynamics. Nonlinear Dyn. 2020, 101, 1667–1680. [Google Scholar] [CrossRef] [PubMed]
Bjørnstad, O.N.; Shea, K.; Krzywinski, M.; Altman, N. The SEIRS model for infectious disease dynamics. Nat. Methods 2020, 17, 557–558. [Google Scholar] [CrossRef] [PubMed]
Li, Q.; Du, Y.; Li, Z.; Yan, H.; Hu, J.; Hu, R.; Lv, B.; Jia, P. HK–SEIR model of public opinion evolution based on communication factors. Eng. Appl. Artif. Intell. 2021, 100, 104192. [Google Scholar] [CrossRef]
Wolfram, S. Cellular Automata as Simple Self-Organizing Systems. Comput. Sci. Phys. 1982. [Google Scholar]
Ilachinski, A. Cellular Automata: A Discrete Universe; World Scientific: Singapore, 2001. [Google Scholar]
Mitchell, M. The Genesis of Cellular Automata; Oxford University Press: Oxford, UK, 1993. [Google Scholar]
Su, Y.; Wo, Y.; Han, G. Reversible cellular automata image encryption for similarity search. Signal Process. Image Commun. 2019, 72, 134–147. [Google Scholar] [CrossRef]
Ping, P.; Wu, J.; Mao, Y.; Xu, F.; Fan, J. Design of image cipher using life-like cellular automata and chaotic map. Signal Process. 2018, 150, 233–247. [Google Scholar] [CrossRef]
Xu, X.; Fan, C.; Wang, L. A deep analysis of the image and video processing techniques using nanoscale quantum-dots cellular automata. Optik 2022, 260, 169036. [Google Scholar] [CrossRef]
Sun, Z.; Chen, Z.; Zheng, J. Ship interaction in narrow water channels: A two-lane cellular automata approach. Phys. A. 2015, 431, 46–51. [Google Scholar] [CrossRef]
Qi, L.; Zheng, Z.; Gang, L. Marine traffic model based on cellular automation: Considering the change of the ship’s velocity under the influence of the weather and sea. Phys. A 2017, 483, 480–494. [Google Scholar] [CrossRef]
Qi, L.; Ji, L.; Balling, R.; Xu, W. A cellular automaton-based model of ship traffic flow in busy waterways. J. Navig. 2021, 74, 605–618. [Google Scholar] [CrossRef]
Hegselmann, R.; Krause, U. Opinion dynamics and bounded confidence models, analysis, and simulation. J. Artif. Soc. Soc. Simul. 2002, 5, 1–33. [Google Scholar]
Krause, U. A discrete nonlinear and non-autonomous model of consensus formation. Commun. Differ. Differ. Equ. 2002, 7, 227–236. [Google Scholar]
Liu, X.; Cao, S.; Zheng, L.; Gong, F.; Wang, X.; Zhou, J. POCA4SD: A Public Opinion Cellular Automata for Situation Deduction. IEEE Trans. Comput. Soc. Syst. 2020, 8, 201–213. [Google Scholar] [CrossRef]
Liu, X.; Cao, S.; Cao, Y.; He, J.; Zhang, W.; Wang, X.; Zheng, L. Online Public Opinion Deduction Based on an Innovative Cellular Automata. In Proceedings of the Cyberspace Data and Intelligence, and Cyber-Living, Syndrome, and Health, Beijing, China, 16–18 December 2019; Volume 1137, pp. 141–160. [Google Scholar]

Figure 1. The transition of S-E-I1-I2…-In-R.

Figure 2. Comparison between ordinary inference cellular automaton model and the enhanced cellular automaton model.

Figure 3. Comparison between simultaneous and delayed dissemination of information (Positive).

Figure 4. Comparison between threshold (0.2) and threshold (0.3).

Figure 5. Comparison between threshold (0.7) and threshold (0.8).

Figure 6. Comparison between true user participation and the evolution of user participation.

Table 1. Cell node state types.

Numbers	States	Notations
1	Susceptible State (S)	Nodes not involved in public opinion topics
2	Exposed State (E)	Nodes involved in public opinion topics
3	Infectious State of Information 1 (I1)	Nodes that have been exposed to information of state 1 and have been affected
…	…	…
$m - 1$	Infectious State of Information n (In)	Nodes that have been exposed to information of state n and have been affected
m	Recovered State (R)	Nodes previously participated in a topic have withdrawn from the topic and no longer participate in discussions or dissemination

Table 2. Weight ratio for influence calculation.

Activity degree $W_{1}$	Total number of comments $X_{1}$ ( $λ_{1}$ )
Activity degree $W_{1}$	Total number of original comments $X_{2}$ ( $λ_{2}$ )
Dissemination degree $W_{2}$	Total number of reposts $X_{3}$ ( $α_{1}$ )
	Total number of responses $X_{4}$ ( $α_{2}$ )
	Total number of original reposts $X_{5}$ ( $α_{3}$ )
	Total number of original responses $X_{6}$ ( $α_{4}$ )
	Total number of likes $X_{7}$ ( $α_{5}$ )

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Shao, C.; Shao, F.; Liu, X.; Yang, D.; Sun, R.; Zhang, L.; Jiang, K. A Multi-Information Dissemination Model Based on Cellular Automata. Mathematics 2024, 12, 914. https://doi.org/10.3390/math12060914

AMA Style

Shao C, Shao F, Liu X, Yang D, Sun R, Zhang L, Jiang K. A Multi-Information Dissemination Model Based on Cellular Automata. Mathematics. 2024; 12(6):914. https://doi.org/10.3390/math12060914

Chicago/Turabian Style

Shao, Changheng, Fengjing Shao, Xin Liu, Dawei Yang, Rencheng Sun, Lili Zhang, and Kaiwen Jiang. 2024. "A Multi-Information Dissemination Model Based on Cellular Automata" Mathematics 12, no. 6: 914. https://doi.org/10.3390/math12060914

APA Style

Shao, C., Shao, F., Liu, X., Yang, D., Sun, R., Zhang, L., & Jiang, K. (2024). A Multi-Information Dissemination Model Based on Cellular Automata. Mathematics, 12(6), 914. https://doi.org/10.3390/math12060914

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Multi-Information Dissemination Model Based on Cellular Automata

Abstract

1. Introduction

2. Related Works

2.1. Propagation Dynamics

2.2. Cellular Automata

3. Detailed Model

3.1. Model Characteristic

3.1.1. Model Definition

3.1.2. Model Properties

3.1.3. Sentiment Orientation

3.2. Model Definition

3.2.1. Preliminary Segmentation

3.2.2. Correlation Confidence

3.2.3. Splitting Conjunctions

3.2.4. Emotional Calculation

4. Experiments and Results

4.1. SEInR Model

4.2. Information Dissemination with Different Strategy

4.3. Dissemination with Different Time

4.4. Dissemination with Different Thresholds

4.5. Inference

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI