5.1. Simulation Process Description
The simulation operation flow of the model is shown in
Figure 6. In the process of the simulation experiment, the initial setting of the model is completed according to the basic parameters such as actual individuals, place information, and infectious disease-related parameters. Then, specific parameters are selected to adjust according to actual simulation problems. The preceding stages are all the setting stages of the model. After the setting is completed, the formal operation stage of the model is entered. After the end of the operation cycle, infection-related data are collected and the infection results are analyzed.
When the model is officially run, it uses the basic unit of days. The health status of infected individuals on the previous day is first transformed every day, the process of which is mainly divided into two parts: First, infected individuals will reach recovery or death states, which do not affect other individuals, so these individuals directly enter the final-state statistics. Second, an individual in a latent state may develop disease and enter an infected state. If control is considered at this time, the infected individual may have a change in the state related to prevention and control and be transferred to the hospital, and the associated individuals and places will also have a corresponding change in state. If the latent person does not change to the state of infection, then it is shown that they are not a positive case in reality, so they will not be controlled, and their individual travel conditions will continue to be considered. The choice of travel place and co-existence in the place occurs, and further consideration of whether the situation of transmission occurs.
After the change of health state is completed, individuals who are not restricted to travel can choose an unenclosed place for multiple mobile interactions. There may be transmission among individuals who have a co-existing relationship every day. If transmission occurs, the health state of individuals becomes latent, and transmission is recorded in the corresponding place. Regardless of whether transmission occurs, the attributes related to the exposure state of the individual and the place change, and the individual exposed to an infection source is recorded. On the basis of these changes, individual-state statistics are carried out, which mainly count the values of all individuals in different health states and further divide them into different types of individuals such as infection patients and latent quarantine personnel through their travel restriction attributes. The model runs in a cycle of days until the preset running period of the model is reached. This paper realizes further simulation work based on NetLogo 6.1.1 software, which has been applied to difficult and complex problems in reality in multi-agent modeling and simulation experiments [
17].
5.2. Model Parameter Description and Initial Setup
Experimental parameters can be divided into three categories: agent parameters, transmission parameters, and prevention and control parameters. Agent parameters are mainly structural parameters that need to be considered when abstractly generating agent models based on relevant data of real individuals and places. Transmission parameters are mainly related to the transmission characteristics of infectious diseases, including the epidemiological characteristics of infectious diseases and the number of initial infectious sources. The prevention and control parameters mainly reflect the type and intensity of relevant prevention and control behaviors in the model.
The following describes the parameters and analyzes their initial values:
- (1)
Agent parameter description
The total number of places in the abstract area is preset to be 400 (20 × 20). In real societies, urban individuals have many social attributes, and there are differences among individuals and different resistances to viruses. According to the age structure of Chinese residents in existing research and the age structure of the national population in the data of China’s seventh census, all urban agents are divided into three types: students, office workers, and retired people. So, the proportions are roughly set as students 18%, office workers 63%, and retirees 19% [
18].
This paper mainly uses POI data (Point of Interest) to set specific places in the region; POI data correspond to geographical entities closely related to human life, such as communities, schools, supermarkets, restaurants, etc., and the name, category, latitude, and longitude of these entities are displayed in the form of geographical points. In geography, they are often used to study the functional classification of specific urban spaces. This paper mainly studies the number statistics of different types of places according to their subdivisions and specific names and sets the proportion of ideal regional places with the help of real data from various places.
With reference to the results of China’s seventh population census, two representative cities were selected as a reference in the five categories of megacities: Cities XL, Type I metropolises, Type II metropolises, and small- and medium-sized cities, namely, Beijing, Shanghai, Nanjing, Shenyang, Changchun, Suzhou, Yangzhou, Jilin, Yingkou and Heihe. By obtaining the resident population and POI data of the corresponding six types of places, it is found that the ratio of the resident population to the number of POI points in most cities such as Beijing, Shanghai, Dalian, and Yingkou is about 35:1, and the ratio may fluctuate in a few other cities, but the median is about 35:1. Therefore, this paper sets the ratio of population to place in the ideal area as 35:1 and sets the number of resident agents as 14,000. We used POI data to classify different places according to their functions. The distribution of places in the above typical cities is shown in
Table 8. Therefore, when setting the initial value of the model, the average value of each city is taken in this paper, and the final proportion is 6% for residence, 16% for office and study, 48% for catering and shopping, 4% for leisure and entertainment, 21% for life service, and 5% for medical places.
- (2)
Description of transmission parameters
Most of the transmission parameters can be directly evaluated, and the values are described as follows: During the setting of the initial infection source in the region,
resident individuals are randomly selected as latent individuals to simulate the situation of the initial infection source in the region brought by the transmission and gestation stage, which is generally set as 1. The relevant parameters of infectious disease epidemiology are mainly calculated by referring to relevant research data on COVID-19: the value of
is 2.5 [
19], the average incubation period is about 5 days [
20], and μ is taken as 1/5. Considering that in relevant studies, the average interval from onset to diagnosis and treatment is 5 days [
20], that is, the time for an infected individual to infect others is 5 days, the infectious period is set to τ. The value of τ is set as the sum of the average incubation period and the infection period of an infected individual, that is, 10 days [
21]. The average length of hospitalization is 16 days [
22], the value of γ is 1/16, and the value of ω is 0.002 [
23].
Two probability values related to intra-place transmission need to be calculated: the probability
of intra-place basic transmission without control and without considering any interference factors is mainly related to the basic virus regeneration number
and the infectious period τ. The basic regeneration number
is the average number of people a patient can infect when everyone is susceptible without intervention. If an individual will co-exist with n susceptible individuals on average in the same place during the infection period τ days, then the possibility of each susceptible individual being infected is
/n [
20]. That is, the value of the basic propagation probability
within the place is as follows:
In the initial model environment, an individual agent is randomly selected to label its daily co-existing individuals, the n value in the initial model environment can be obtained through multiple experimental calculations, and the calculated value
is 0.067%. By substituting the model test, it can be verified that the average number of infections originating from an individual is about 2.5 within 10 days of the infection period. The other transmission probability
represents the actual transmission probability value considering control and other factors, and its value is equal to the product of the basic transmission probability
and the place risk coefficient r, which is in flux.
- (3)
Description of prevention and control parameter
Among the prevention and control parameters, the risk coefficient of the place mainly reflects the transmission situation in the place under the condition of taking appropriate measures; the risk coefficient of the place in the closed state is 0; the admission rate of infected persons reflects the proportion of treatment and isolation of infected individuals every day; and the control rate reflects the proportion of investigation and control of infected individuals and places, and the three changes with different prevention and control measures. If there is no prevention and control, transmission may occur during brief co-existence, and there is no difference in contact modes between places. The risk coefficient of various places is 1, and the admission rate and control rate of infected persons are 0 regardless of the detection and treatment of infected persons and the control of personnel in relevant places. Considering the preventive measures under prevention and control, mainly masks, temperature measurement, personnel flow restriction, and other protective measures, the risk coefficient of various types of places will be reduced to a certain extent. Considering the emergency management and control stage under prevention and control, measures such as epidemiological investigation, close screening, and large-scale nucleic acid testing will occur; that is, the admission and treatment of infected persons, relevant close contacts, and the control of locus risk places will be considered. At this time, the admission and control rate will change, and the risk coefficient of different places will also change according to the specific prevention and control measures.
In the initial conditions, the default is the no prevention and control scenario, the risk coefficient of all types of places is 1, and the admission rate of infected persons and the control rate of infected individuals and places are 0.
5.3. Model Results under Initial Settings
Based on the basic parameters set in the previous section, the initial number of the latent stage was set to 1, the model running cycle was set to 120 days, and the initial simulation run was started. Due to the randomness of the model, the average value of 100 experiments was selected for the results. The final results of different state groups of people are shown in
Figure 7. In the first 20 days, the number of latent and infected people increased slowly, the number of infected people increased rapidly in 30–40 days, and the number of infected people reached its peak around 45 days.
In order to compare the differences in infection conditions under different parameter settings, the cumulative number of infected persons is introduced here to describe the transmission results of infectious diseases: The cumulative number of infected persons per day is the sum of the number of people in the three states of infection, recovery, and death, and its value mainly reflects the change in the overall infection scale of the epidemic, and the steepness of its curve can reflect the speed of transmission and spread of the epidemic. The cumulative number of infected persons in the model under the initial setting is shown in
Figure 7b, corresponding to the results in
Figure 7a. The number of infected persons increased slowly in the first 20 days, and the curve was relatively gentle; the number of infected persons increased significantly in 30–40 days, and the curve was steep. After 45 days, the growth rate of cumulative infected persons gradually slowed down until about the 65th day of operation of the model, when almost all individuals in the region were infected. In the simulation of different problems, comparative analysis was carried out mainly using the cumulative number of infected persons, which is shown in
Figure 7b.
The average results of the cumulative number of infections in the repeated experiments of 10, 20, 50, 100, and 200 times were calculated, respectively, and the values can be obtained as shown in
Figure 8. It can be found that the variation in the cumulative number of infections is consistent when the number of repetitions exceeds 20, and it can be considered that the average results are representative after the number of experimental repetitions exceeds 20.
5.4. Simulation Experiment of Isolation Control for Close Contacts
In the face of such infectious diseases with an incubation period, considering increasing the isolation and control of the close contacts of infected persons can better control latent individuals and the spread of the epidemic. Through literature analysis, it is found that the subsequent incidence probability of all close contacts under isolation and control in real cases is about 1% [
24], and 6% of uninfected individuals with co-existing relationships are judged as close contacts. Under the transmission rate of the model, the number of infected people in the same place at this time accounts for about 1% of all close contacts in the place. The quarantine and control period for healthy individuals who are not infected is set at 14 days in accordance with the relevant data on COVID-19 prevention and control.
In the real world, there are asymptomatic infected people similar to the novel coronavirus, and in influenza outbreaks, early symptoms are mild and cannot be diagnosed in time, so the actual timely admission rate of infected people reaching 100% is basically impossible. Therefore, in the four cases of admission rates of 20%, 40%, 60%, and 80%, respectively, five control rates of 20%, 40%, 60%, 80%, and 100% were set to control close contacts. In order to distinguish the cases of both individual and place control, the isolation control rate is hereinafter referred to. The experimental results are shown in
Figure 9. In the figure, 20-0 represents the admission rate of 20% of infected persons, and the isolation control rate is 0, which corresponds to the results of the admission of infected persons only.
Through data analysis, it can be found that the addition of close contact control has a significant impact on the final infection result. For each group with a certain admission rate, there was a large difference in the data between no isolation control and 20% isolation control, and the difference in the final number of infected people decreased with the continuous improvement of the isolation control rate. From the perspective of the time cycle, the cumulative number of infected persons with or without isolation control is very different in about 30 days, because at this time, the number of infected persons in the model is large, and the number of associated latent persons is also large. The measures to control close contacts can timely control more latent persons, so the growth rate of accumulated infected persons in the subsequent time can be reduced.
At the same time, it can be found that the admission rate of infected persons has a great impact on the final effect of isolation control measures as shown in
Figure 10.When the admission rate of infected persons is only 20%, even if the isolation control rate reaches 100%, the final number of infected persons is much higher than other results, which indicates that the effect of isolation control measures for close contacts is largely limited by the admission situation of infected persons. This is consistent with reality because the isolation control measures rely on the detection of relevant infected persons, and only when the proportion of timely detection of infected persons increases can the isolation control measures have a better effect, timely control most of the latent persons, and avoid the further spread of the epidemic.
5.6. Model Validation–Simulation Analysis Based on Real-World Cases
From July to September 2021, in Yangzhou City, Jiangsu Province, a total of 570 people were infected by the local transmission and spread of an imported latent infection. The entire development process of the epidemic mainly occurred in Yangzhou City, which is a typical example of the evolution of regional epidemic transmission. Therefore, this case was selected for simulation and analysis in this paper. Taken from the relevant notification and press conference on the website of Yangzhou Municipal People’s Government, the data of various population groups during the entire epidemic period are shown in
Figure 12. The cumulative confirmed cases are calculated by adding up the number of newly confirmed cases every day. The confirmed cases include asymptomatic infected patients, symptomatic patients, and the infected and admitted persons in the corresponding model. The number of recovered persons is the cumulative number of discharged persons published daily; the number of confirmed cases, that is, those who have been diagnosed but have not recovered, is calculated by subtracting the cumulative number of confirmed cases from the number of discharged patients.
In order to simulate the case, firstly, agent parameters and transmission parameters were set according to the real data of Yangzhou City and the data of Delta, the strain of this epidemic. The specific values and descriptions are shown in
Table 9. The parameters related to prevention and control changed, which were specifically explained in the simulation design. During the experiment, it was found that with a recovery rate of 1/16, the recovery of the confirmed individuals was significantly different from the real situation in Yangzhou. The average recovery period was calculated to be 20 days, and the actual hospitalization days were more than 13 days. Therefore, corresponding to the actual situation, the recovery period is divided into two parts: the first 13 days are the default treatment period, the recovery period begins after 13 days, and the probability of reaching a recovered state is 1/7 per day.
According to official data, the key information about the development of the entire epidemic is as follows: On 21 July, a latent person entered Yangzhou and lived on the mainland. On 28 July, the first infection was confirmed, and the city began to take control measures such as the treatment of infected people, the control of close contacts, and the control of trajectory places. On 3 August, the control of Yangzhou City was upgraded, and all communities in the main urban area were closed. Based on the above information, the following simulation design was made: on day 0, a latent patient is set to appear in the area, and on day 7, the infected person is admitted and treated, and the related individuals and places of infected persons are controlled. The admission rate at this stage is relatively low, which is set as the first-stage admission rate, receive_rate1. On the 13th day of the experiment, intra-area control was upgraded and the potential risk control strategy mentioned above was adopted: learning, entertainment, and leisure places were closed; the risk coefficient of catering, shopping, life service, and medical places was adjusted to 0.3, 0.5 and 0.5; and the scope of individual activities in the region was limited to 2. Meanwhile, the admission rate of infected persons was relatively improved and set as the second-stage admission rate, receive_rate2. In reality, in the process of a single epidemic transmission, the screening methods and screening personnel of infected individuals and locus do not change, and the control rate does not change in the whole process, which is a constant value.
Since the actual epidemic data published are all related data of confirmed patients, that is, corresponding to the situation of patients admitted with infections, the data of cumulative confirmed patients were mainly used here to evaluate the fitting effect of the model. The admission rate and control rate were adjusted through multiple experiments and compared with the results of polynomial fitting; the set of values with better fitting effect was as follows: The first-stage admission rate receive_rate1 was 20%, the second-stage admission rate receive_rate2 was 40%, and the control rate of close contacts and related locus places was 40%. At this time, the data results of cumulative confirmed patients, existing confirmed patients, and recovered patients were compared, as shown in
Figure 13,
Figure 14 and
Figure 15. In the existing diagnosis model, the health state of the individual is the infection state and the travel limit is to receive treatment. The cumulative number of diagnosed persons corresponds to the individuals in the infection treatment and recovery state, and the recovered person corresponds to the individual in the recovery state. On the whole, the operation results of the model are more consistent with the overall development trend of the epidemic situation in Yangzhou in terms of cycle advancement and specific number changes, indicating that the model can correspond to the development scenario of the actual epidemic situation to a certain extent.
Furthermore, the following indicators were used to verify the fitting performance of the model.
- (1)
MAE (Mean Absolute Error): This is the average of the absolute errors. It can better reflect the actual predicted value error.
- (2)
RMSE (Root Mean Square Error): It represents the sample standard deviation of the difference between the predicted value and the observed value.
- (3)
R2 (R Squared, coefficient of determination): It reflects the accuracy of the model-fitting data, generally ranging from 0 to 1. The closer the value is to 1, the stronger the explanatory ability of the variables of the equation is for y, and the better the model fits the data.
where
represents the real data and
represents the predicted data.
Through calculation, we obtain the specific value of the model-fitting performance evaluation index, as shown in
Table 10.
It can be proven that this model can better predict the actual occurrence of epidemics through the results of the evaluation index.