Mathematical Modelling of Virus Spreading in COVID-19

Luo, Liaofu; Lv, Jun

doi:10.3390/v15091788

Open AccessArticle

Mathematical Modelling of Virus Spreading in COVID-19

by

Liaofu Luo

^1,*

and

Jun Lv

^2,*

¹

Faculty of Physical Science and Technology, Inner Mongolia University, 235 West College Road, Hohhot 010021, China

²

College of Science, Inner Mongolia University of Technology, 49 Aymin Street, Hohhot 010051, China

^*

Authors to whom correspondence should be addressed.

Viruses 2023, 15(9), 1788; https://doi.org/10.3390/v15091788

Submission received: 3 August 2023 / Revised: 20 August 2023 / Accepted: 21 August 2023 / Published: 23 August 2023

(This article belongs to the Section SARS-CoV-2 and COVID-19)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

A mathematical model is proposed to analyze the spreading dynamics of COVID-19. By using the parameters of the model, namely the basic reproduction number (R₀) and the attenuation constant (k), the daily number of infections (DNI) and the cumulative number of infections (CNI) over time (m) are deduced and shown to be in good agreement with experimental data. This model effectively addresses three key issues: (1) inferring the conditions under which virus infections die out for a specific strain given R₀; (2) explaining the occurrence of second waves of infection and developing preventive measures; and (3) understanding the competitive spread of two viruses within a region and devising control strategies. The findings highlight the potential of this simple mathematical framework in comprehensively addressing these challenges. The theoretical insights derived from this model can guide the evaluation of infection wave severity and the formulation of effective strategies for controlling and mitigating epidemic outbreaks.

Keywords:

virus spread; COVID-19; mathematical model; daily number of infections; cumulative number of infections

1. Introduction

The ongoing COVID-19 pandemic has posed several unresolved questions. Firstly, our aim is to understand the reason behind the waveform pattern observed in the daily number of infections (DNI) and to predict the scale and duration of an infection wave, including the cumulative number of infections (CNI) and the time until infection ends. Secondly, we investigate why the second wave of infection typically follows the first wave and how to predict its occurrence. Lastly, we explore how virus spreading in a region is influenced by the emergence of new strains and develop methods to control their spread. Existing epidemic models have limitations in addressing these problems comprehensively. Since the 1920s, differential equations were used to model the population distribution of disease spread, including susceptible, infected, and recovered/dead pools [1,2]. Such models could not examine important social and behavioural factors, such as the behavioural responses of individuals to policy measures, and the effect of heterogeneous social contacts on diffusion patterns. Next, since the 1990s, agent-based simulations were proposed that included some important sources of population heterogeneity and explored the structure and dynamics of transmission networks. However, none of the agent-based models are based on explicit empirical/theoretical assumptions of individual behaviour, social transmission mechanisms, and social structure constraints [3,4,5]. Recently, several network models simulating the spread of epidemics in the population were proposed, such as the Susceptible-Infectious-Susceptible model on random networks [6], and the epidemic spreading on modular networks [7]. Although these specific models have solved some aspects of the complexity of infectious diseases and obtained meaningful results, they still fail to comprehensively answer the aforementioned unresolved questions.

As the Chinese idiom goes, “the greatest truths are the simplest”. Starting from the principle of natural selection—the interaction and compromise between the virus and its host (human)—we propose a mathematical model based on insights from experimental data. The model is based on two fundamental assumptions. First, each viral strain has a basic reproduction number (R₀), which represents the average number of infections caused by an initial infectious person in a completely susceptible population [8]. R₀ is a measure of the transmission potential of a particular infectious disease. Second, virus infections undergo a series of events due to the presence of population immunity or intervention that makes the instantaneous reproduction number, R_t, decreasing step by step, described by an attenuation parameter k (k < 1) [9,10]. The parameter k is influenced by social contacts and policy measures, where looser contacts and measures result in higher values of k. It is worth noting that public health measures in a given region usually undergo significant changes over a longer period of time compared to the duration of a virus wave [11]. The stringency index of policy measures can be found at https://ourworldindata.org/covid-stringency-index (accessed on 13 August 2023). Therefore, it is reasonable to assume that the attenuation parameter k is approximately constant during a single wave of virus spread. Based on two parameters, R₀ and k, we can deduce the general formula for the daily number of infections (DNI) and the cumulative number of infections (CNI) over m steps. The CNI is denoted by F(R₀, k; m). These formulas are in good agreement with experiments and provide a generalized framework to simulate single and multiple virus infections. By introducing F₁ and F₂ we can also study the competitive spread of two viruses in a region, including scenarios with the introduction of a new virus strain. It is worth noting that the loosening of public health measures and/or the emergence of new viruses can lead to subsequent waves of virus transmission.

2. Materials and Methods

The data of daily virus infections of COVID-19 were taken from WHO (https://covid19.who.int/data, accessed on 12 May 2023) for the UK and from the public database (https://ourworldindata.org/coronavirus, accessed on 12 May 2023) for Hong Kong. Since Gauss discovered the least square method and successfully applied it to astronomical observation, the ordinary least square is recognized as one of the best methods for curvilinear regression. In the present study, we used the least square simulation of observational data of the virus infection to determine the parameters in each COVID-19 pandemic. The goodness of the nonlinear least square fitting (NLLSF) is tested by R², i.e.,

R^{2} = 1 - \frac{\sum_{i = 1}^{n} {(y_{i} - {\hat{π}}_{i})}^{2}}{\sum_{i = 1}^{n} {(y_{i} - \bar{y})}^{2}},

where y_i are the observation values,

\bar{y}

their average, and

{\hat{π}}_{i}

the predictions of the model. The goodness R² and the p-value of Prob(F-statistic) are calculated for each simulation. The F-statistic is defined as

F - s t a t i s t i c = \frac{(\sum_{i = 1}^{n} {(y_{i} - \bar{y})}^{2} - \sum_{i = 1}^{n} {(y_{i} - {\hat{π}}_{i})}^{2}) / (k - 1)}{\sum_{i = 1}^{n} \frac{{(y_{i} - {\hat{π}}_{i})}^{2}}{n - k}},

(n–number of samples, ν–number of parameters) that can be calculated from the experimental data. The p-value is decided by the percentile of the Fisher’s F-distribution F_ν_−1,n−ν(z), namely,

p - v a l u e = t h e p r o b a b i l i t y o f F_{ν - 1, n - ν} (z > F - s t a t i s t i c)

The goodness R², the p-value, and the root mean squared error (RMSE) for each simulation are given in the figures.

3. Results

3.1. Derivation of Formulas for Cumulative Number of Infections (CNI) and Daily Number of Infections (DNI)

The potential for infection of a given virus strain is determined by the basic reproduction number known as R₀. For the virus to spread, it must continually re-infect the rest of the susceptible population. However, countermeasures by humans must aim to limit these subsequent infections. Essentially, this implies that the interaction and compromise between the virus and host (humans) necessitates the j-th reproduction number R_j to be smaller than the (j − 1)-th reproduction R_j₋₁, as expressed in the equation

R_{j} = k R_{j - 1}, (k < 1) .

The reason R_j changes with j is twofold. Firstly, as the virus continues to spread, the susceptible population decreases, leading to a decrease in the transmission potential. Secondly, ongoing public health measures are being implemented to control the spread of the virus, further contributing to the reduction in the instantaneous reproduction number. The attenuation constant ‘k’ in this scenario is influenced by social contact rates and policy measures. After m rounds of infection spread, the total number of infections in a specific area can be represented as

F (R_{0}, k; m) = R_{0} + R_{0} R_{1} + \dots + R_{0} R_{1} R_{2} \dots R_{m - 1} = R_{0} + k R_{0}^{2} + \dots + k^{m (m - 1) / 2} R_{0}^{m}

(1)

This function F(R₀, k; m) describes the cumulative number of infections (CNI), which is an increasing function of m. Assuming the total number of infected individuals to be N, solving the equation F(R₀, k; m) = N will produce ‘m’, the transmission number of the virus strain. The parameter m grows with time t. Assuming each transmission takes place every q days, we have

m = (t + t_{0}) / q

(2)

where t₀ is a time shift parameter linked to the start of infection (the time of m = 1). Equations (1) and (2) provide the cumulative number of infections (CNI) for each branch of the epidemic. If various generations of different branches exist simultaneously, the total CNI is simply the sum of the CNIs of each branch. The CNI can be observed experimentally. Inserting (2) into (1), we obtain a formula of CNI vs. t represented by four parameters R₀, k, q, and t₀ that can be used to simulate the change of a cumulative number of infections with time in a region. As seen in Figure 1, our simulations of the UK-alpha strain (November 2020 to April 2021) and the Hong Kong-delta/omicron strain (February 2022 to April 2022) spreading are well fitted to the data obtained from COVID-19 pandemic updates. The high accuracy of the simulation shows that the assumption of the constancy of k is reasonable. It is important to note that our model not only allows for simulation of COVID-19 pandemics but also enables predictions of virus spread over longer periods by utilizing parameter data derived from a particular phase of the outbreak.

Applying Equations (1) and (2), the daily number of infections (DNI) is derived:

d F / d t = (d F / d m) / q, d F / d m = k^{m (m - 1) / 2} R_{0}^{m} .

(3)

From Equation (3), it is easy to understand why DNI first rises then falls, considering that R₀ > 1 (for most infectious viruses) and k < 1. Zero DNI arrives when dF/dm = 0, which requires m to be large enough that

k^{(m - 1) / 2} R_{0} < 1 o r R_{0} < {(1 / k)}^{(m - 1) / 2}

(4)

This is a condition for the end of an infection wave.

3.2. Insights from Typical Figures of CNI vs. m

Utilizing Equation (1), we plotted the change of CNIs with variables k and m for the given R₀ (Figure 2). Figure 2A–E show typical cases that help in understanding the development and termination of an infection wave.

From Figure 2, one can observe that the curve of F(R₀, k; m) increases with m and approaches a stable value when m > m_st for each k < k_th. We denote F(R₀, k_th; m_st) as N_th. Calculating for m_st = 15 in Figure 2A–E, we obtained the threshold k_th and the corresponding CNI N_th for each given R₀ (each virus strain). On the other hand, CNI attaining a stable value means the DNI is approaching zero. By defining E_km = (1/k_th)^(mst−1)/2, Equation (4) can be written as R₀ < E_km. The threshold k_th, the corresponding CNI N_th, and the parameter E_km are listed in Table 1. We found the relation R₀ < E_km to hold well across all data.

The daily increasing number dF/dm = 0 means the virus infection is dying out. The above result shows the higher the basic reproduction number R₀, the stricter the public health measure is required to be to increase E_km to satisfy Equation (4) and terminate the wave of virus infection effectively. From Table 1, we found the cumulative infection numbers, N_th, of many virus strains (except Omicron with R₀ = 18.6) are lower than 10⁵ if an appropriate k (lower than k_th) is introduced.

For the virus strain of given R₀, if F(R₀, k; m) (for all m) exceeds a threshold N_max, then the spread of this strain would lead to a wave of COVID-19 infection in a region with a population larger than N_max. In order to end this spread, the necessary condition would be

F (R_{0}, k; m) < N_{m a x} (f o r a l l m)

(5)

Equation (5) provides a constraint on k for strains with R₀. The critical value of k is denoted as k_cr. Taking N_max = 10⁷, the values of k_cr are also listed in Table 1. Note that the parameter k_th is the threshold value of k required for ending the spread in the 15th generation, but the parameter k_cr is that value for ending the spread in an arbitrary generation. The latter constraint is looser than the former. Therefore, k_cr is larger than k_th.

Virus strains with low R₀ values have higher k_cr and k_th values. As a result, they can spread in regions with smaller populations under looser public health measures. This theoretical prediction can explain why strains like SARS-CoV in 2003 only spread in restricted regions and soon disappeared globally. On the contrary, virus strains with high R₀ values, such as Omicron, have lower k_cr and k_th values. To satisfy Equations (4) and (5), the parameter k needs to meet very strict constraints. In cases where social management measures are not stringent enough, the number of infected individuals will quickly surpass the N_max limit, triggering a global pandemic wave, possibly culminating in a type of coexistence between viruses and humans.

In summary, the termination of an infection wave is determined by the condition dF/dm = 0 on the CNI diagram (Figure 2), which requires a sufficiently large m and the condition in Equation (4) satisfied, i.e., R₀ < E_km = (1/k_th)⁷. Additionally, the necessary condition for a pandemic to die out in a population of N_max is expressed by Equation (5), imposing a limitation on k, namely k < k_cr.

3.3. Prediction on the Second Wave of Pandemics

Early models based on previous pandemics such as SARS, MERS, and the 2009 H1N1 outbreak can effectively predict the occurrence of the first wave of a disease. However, their predictive power decreases when it comes to anticipating the possibility of a second wave [12]. This raises the question: why does the second wave of COVID-19 infections often follow the first wave? The present model aims to address this issue.

As shown in Figure 2, there are multiple curves (F versus m) for a given R₀ value. These curves differ from each other by the parameter k. When the number of daily new infections (DNI) approaches zero, any fluctuation in k significantly influences the spread of the virus. Therefore, changes in public health measures can induce variations in virus spread along different curves. Generally, as public health measures are relaxed, the parameter k increases, causing the virus to transition from one curve of F to another steeper curve. This signifies the onset of a new wave of virus spread. For instance, in Figure 2A, the spread of the Omicron variant (with an R₀ of 18.6) is plotted. Assuming the initial spread is along the curve with k equal to 0.613, a stable state is reached at m = 15. At this point, the k value increases to 0.722. In response to this change, the virus begins spreading along a new curve with k = 0.722, starting from m = 4, as the value of F(18.6, 0.722; m = 4) is equal to the original CNI value, F(18.6, 0.613; m = 15). This example explains how the second wave of virus infection occurs. However, in cases where the parameter k decreases when the DNI approaches zero, the curve of F will transition to a flatter one. This indicates that the first wave of viral infection will end soon, and no second wave will occur.

By examining Figure 2A–E, we can analyze how the occurrence of a second wave depends on the R₀ value of the virus. For instance, as k changes from 0.85 to 0.9, the CNI (at m = 15) for high R₀ viruses increases hundreds of times, whereas it only increases tens of times for low R₀ viruses. Therefore, our model predicts that multiwave infections are more likely to occur with viruses that have high R₀ values.

In the aforementioned discussions, we have assumed that no virus mutation occurs and that only one type of virus is spreading. In reality, changes in public health measures may be accompanied by virus mutations. In this case, the change in public health measures would cause the jump of F not only between different curves with a given R₀ but also between different R₀ values, providing more opportunities for the occurrence of a second wave during virus spread.

Another crucial point to consider is that the change in k can simultaneously induce a change in q, as the eigen-time (inherent time) m depends on q (Equation (2)). In the case of a single wave, the parameter m can be used to represent time dependence, and q can be simply set to 1 (known as normalization). However, when studying two continuous waves, the dependence on q should be clearly indicated due to the different eigen-times m. We have mentioned that the dependence of F on the change in k results in a jump from one curve to another. Meanwhile, the dependence of F on the change in q only affects the lengthening or shortening of the abscissa of the graph without altering the shape of the curve. When the public health measures change and a jump between curves occurs, the abscissa of the graph of the second wave simultaneously lengthens or shortens.

The occurrence of the second wave of infection is more likely when public health measures are relaxed. This change in the second wave is accompanied by a change in the q value. These predictions align with experimental data. For example, in the UK from May to September 2021, public health measures were relaxed, and a second wave of infection followed the first wave (Figure 1A). Similarly, in Hong Kong several months after May 2022, the looser public health measures led to a second wave occurring after the first wave (Figure 1B). (The data on the change of public health measures can be found at https://www.bsg.ox.ac.uk/research/covid-19-government-response-tracker, accessed on 13 August 2023).

In summary, the dependence of the CNI on k (given a specific R₀) is determined by the jump between different curves on the graph F(R₀, k; m) versus m, referred to as k-transformation. The dependence of the CNI on the duration of m is obtained by stretching the abscissa m on the graph, known as q-transformation. The k-transformation and q-transformation, occurring when the first wave is nearing its end, are the causes of a continuous second wave. The change in k is attributed to the modification of public health measures, while the modification of q is due to the change in physical time within a unit of m. The changes in k and q provide an explanation for the experimental data on continuous multiwave infections.

3.4. Cross-Spread of Two Viruses: Discriminant Function

Viral infections often involve the simultaneous presence of two or more viruses. To accurately simulate the cross-spread of two viruses, it is necessary to consider the differences in eigen-times (m₁ and m₂) between these viruses and their relationship with the physical time t. Consequently, additional parameters q and t₀ (as given in Equation (2)) must be taken into account. By utilizing the four parameters R₀, k, q, and t₀, the CNI F(R₀, k; m) can be expressed as:

F (R_{0}, k; m (t)) = F (t; a, b, c, d), (a = R_{0}, b = k, c = q, d = t_{0})

(6)

When the CNIs of two virus strains F₁(t; a₁, b₁, c₁, d₁) and F₂(t; a₂, b₂, c₂, d₂) intersect at t_cr,

F_{1} (t_{c r}; a_{1}, b_{1}, c_{1}, d_{1}) = F_{2} (t_{c r}; a_{2}, b_{2}, c_{2}, d_{2})

(7)

and

\begin{matrix} F_{1} (t; a_{1}, b_{1}, c_{1}, d_{1}) > F_{2} (t; a_{2}, b_{2}, c_{2}, d_{2}) a s t < t_{c r}, \\ F_{1} (t; a_{1}, b_{1}, c_{1}, d_{1}) < F_{2} (t; a_{2}, b_{2}, c_{2}, d_{2}) a s t > t_{c r} . \end{matrix}

This represents a transition in the population of virus strains from F₁ to F₂. As it is challenging to directly solve the intersection equation (Equation (7)), we introduce a function:

D_{21} = l o g [(d F_{2} / d t) / (d F_{1} / d t)]

(8)

which satisfies:

\begin{matrix} D_{21} > 0 as d F_{2} / d t > d F_{1} / d t, \\ D_{21} < 0 as d F_{2} / d t < d F_{1} / d t . \end{matrix}

(9)

By utilizing Equations (2), (3), (6) and (8), D₂₁ can be formulated as a simple quadratic function of time

D_{21} = α t^{2} + β t + γ

(10)

where

\begin{matrix} α = \log b_{2} / (2 c_{2}^{2}) - \log b_{1} / (2 c_{1}^{2}), \\ β = (2 d_{2} - c_{2}) \log b_{2} / (2 c_{2}^{2}) - (2 d_{1} - c_{1}) \log b_{1} / (2 c_{1}^{2}) + \log a_{2} / c_{2} - \log a_{1} / c_{1}, \\ γ = d_{2} (d_{2} - c_{2}) \log b_{2} / (2 c_{2}^{2}) - d_{1} (d_{1} - c_{1}) \log b_{1} / (2 c_{1}^{2}) + d_{2} \log a_{2} / c_{2} - d_{1} \log a_{1} / c_{1} + \log (c_{1} / c_{2}) . \end{matrix}

In order to determine if a real root of D₂₁ exists, we define:

∆ = β^{2} - 4 α γ

(11)

The value of Δ determines the existence of the real root in the quadratic form D₂₁. This form, known as the discriminant function, serves as a tool to identify the occurrence of t_cr and the domain of its existence. The prediction rules can be summarized as follows:

Rule 1: When Δ > 0, the quadratic form intersects with the t axis at t_s and t_m (t_m > t_s),

t_{s} = \{\begin{matrix} \frac{- β - \sqrt{∆}}{2 α} (for α > 0) \\ \frac{- β + \sqrt{∆}}{2 α} (for α < 0) \end{matrix}, t_{m} = \{\begin{matrix} \frac{- β + \sqrt{∆}}{2 α} (for α > 0) \\ \frac{- β - \sqrt{∆}}{2 α} (for α < 0) \end{matrix} .

These values partition the time into three distinct domains: t < t_s in the first domain, t_s < t < t_m in the second domain, and t > t_m in the third domain.

Rule 2: In the first domain, t_a = qm₀ − t₀ is defined as the initial time where m₀ >> 1, indicating m_1,2 = (t + d_1,2)/c_1,2 >> 1. If D₂₁ > 0, there will be no intersection of F₁(t) and F₂(t) in the domain between t_a and t_s when the initial values of F at t_a satisfy F₁(t_a) < F₂(t_a), but there may be one t_cr (the number of t_cr is either 1 or 0) in the domain when F₁(t_a) > F₂(t_a). If D₂₁ < 0, there will be no intersection of F₁(t) and F₂(t) in the domain when the initial values satisfy F₁(t_a) > F₂(t_a), but there may be one t_cr (the number of t_cr is either 1 or 0) when F₁(t_a) < F₂(t_a).

Rule 3: In the second domain, if D₂₁ > 0 there will be no intersection of F₁(t) and F₂(t) in the domain when the F-values at t = t_s satisfy F₁(t_s) < F₂(t_s), but there may be one t_cr (the number of t_cr is either 1 or 0) when F₁(t_s) > F₂(t_s). If D₂₁ < 0 there will be no intersection of F₁(t) and F₂(t) in the domain when the F-values at t = t_s satisfy F₁(t_s) > F₂(t_s), but there may be one t_cr (i.e., the number of t_cr is either 1 or 0) when F₁(t_s) < F₂(t_s). The F-values at t = t_s are determined by F₁(t) and F₂(t) in the first domain.

Rule 4: In the third domain, if D₂₁ > 0 there will be no intersection of F₁(t) and F₂(t) in the domain when the F-values at t = t_m satisfy F₁(t_m) < F₂(t_m), but there may be one t_cr (the number of t_cr is either 1 or 0) when F₁(t_m) > F₂(t_m). If D₂₁ < 0 there will be no intersection of F₁(t) and F₂(t) in the domain when the F-values at t = t_m satisfy F₁(t_m) > F₂(t_m), but there may be one t_cr (i.e., the number of t_cr is either 1 or 0) when F₁(t_m) < F₂(t_m). The F-values at t = t_m are determined by F₁(t) and F₂(t) in the second domain.

Rule 5: There can be at most one t_cr in a given domain because the symbol of D₂₁ is definite in any domain. The magnitude of the domain is an important factor to predict the occurrence of a t_cr. For example, in the second domain the magnitude is t_m − t_s = Δ^1/2/(2|α|), and the necessary condition for the occurrence of t_cr is a large enough (t_m − t_s) or Δ.

Rule 6: When Δ < 0, the quadratic form D₂₁ does not intersect with t axis. In this case, there is only one domain, and the rule is the same as that in the first domain given by Rule 2.

Figure 3 presents examples of cross-spread of two virus strains, 1 and 2. The left panel shows the discriminant function, and the right panel displays the cross-spread of the two strains. The influence of the change in parameters (as an example, we only assume the R₀ value of strain 1 changes) on the intersection of two strains is shown in the figure. In Figure 3A, there is no intersection. In Figure 3B,C, there are two intersections in the second and third domain, respectively. In Figure 3D, there is one intersection in the second domain. These intersection occurrences are in agreement with the aforementioned prediction rules.

In the first domain we have introduced t_a = qm₀ − t₀ as the initial time. To study the cross-spread in the time interval between m = 1 and t_a where dF/dt in D₂₁ is difficult to be defined, one should use the F(t)-ladder method as follows:

The step size of CNIs of the first few steps in ladders 1 and 2 are given by a₁, b₁a₁², b₁³a₁³, b₁⁶a₁⁴, b₁¹⁰a₁⁵, b₁¹⁵a₁⁶, b₁²¹a₁⁷, b₁²⁸a₁⁸, b₁³⁶a₁⁹, etc., and a₂, b₂a₂², b₂³a₂³, b₂⁶a₂⁴, b₂¹⁰a₂⁵, b₂¹⁵a₂⁶, b₂²¹a₂⁷, b₂²⁸a₂⁸, b₂³⁶a₂⁹, etc., respectively (Equation (1)). Strain 1 spreads from the 1st step a₁ at t = c₁ − d₁ on ladder 1 and strain 2 spreads from the 1st step a₂ at t = c₂ − d₂ on ladder 2. The CNI and arrival time t of the two strains on their respective ladders are listed in Table 2. For example, let us take c₁ = 7, d₁ = 15, c₂ = 5, d₂ = 10. The earlier arrival times, in turn, are t = c₁ − d₁ = −8, c₂ − d₂ = −5, 2c₁ − d₁ = −1, 2c₂ − d₂ = 0, 3c₂ − d₂ = 5, 3c₁ − d₁ = 6, etc. By using Table 2, one can easily calculate the CNI at each arrival time as the parameters a₁, b₁, a₂, b₂ are given, compare the CNI values on two ladders and obtain the information on the cross-spread of two strains.

3.5. Examples of the Cross-Spread of Two Viruses

The epidemics that occurred in the UK from November 2020 to February 2022 provide a clear demonstration of the cross-spread phenomenon involving multiple viruses. This process can be divided into five stages, namely: (A) the alpha epidemic, (B) the delta invasion and cross-spread of two strains, (C) the delta dominant stage, (D) the omicron invasion and cross-spread of delta and omicron, and (E) the omicron dominant stage. In this section, we will specifically focus on studying the cross-spread of two strains during stages B and D.

Firstly, let us examine the cross-spread between the alpha strain (designated as strain (1) and the delta strain (designated as strain (2) during stage B. The spread of the alpha strain during stage A has been represented in Figure 1A, where we obtained the parameter a₁ = R₀⁽¹⁾ = 3.9. The spread of the delta strain during stage C has been illustrated in Figure 4A, and we derived the parameter a₂ = R₀⁽²⁾ = 5.1 based on this data. By using R₀⁽¹⁾ and R₀⁽²⁾ as inputs, we simulated the cross-spread of the two strains, as depicted in Figure 4B. Furthermore, utilizing all the parameters, a_i, b_i, c_i, d_i (i = 1,2), obtained from Figure 1A and Figure 4A,B, we constructed the discriminant function of the cross-spread and plotted the intersection of the two strains during stage B in Figure 4C. Interestingly, we discovered that one t_cr occurs at t = 71 in the region where t > t_m (t_m = 44.5), which is consistent with the prediction outlined in Rule 4.

Similarly, we investigated the cross-spread between the delta strain (strain 1) and the omicron strain (strain 2) during stage D. The results of this analysis are shown in Figure 5A–C. From Figure 5C, we observed that a t_cr appears at t = 42 in the region where t > t_m (t_m = 28.5), which is in agreement with the prediction made by Rule 4.

In both simulations, we assumed April 1 was the initial time for stage B, and November 11 was the initial time for stage D. Due to the uncertainty surrounding the exact timing of the delta invasion in stage B and the omicron invasion in stage D, we shifted the initial times t of stages B and D by several days. Remarkably, we found that the same values of k₁, k₂, q₁, q₂, t₀⁽¹⁾ were obtained, and only t₀⁽²⁾ varied while still maintaining the invariant t + t₀⁽²⁾.

4. Discussion

4.1. On the Simulation of COVID-19 Cases

The traditional SIR-type epidemic models depict the exponential growth of the number of infected individuals. However, empirical data have demonstrated that COVID-19 outbreaks do not exhibit exponential growth, but rather follow a three-parameter Gompertz growth function [13,14]. A new compartment model, known as the broken link model, has been proposed in the literature to explain the mechanism of Gompertz growth [15]. However, our proposed model is logically simple. In this model, we suggest that the spread of the virus depends on four parameters: R₀, k, q, and t₀. R₀ describes the inherent infectious ability of the virus, k represents the strictness of social management, q represents the time needed for one step of infection, and t₀ is an additional parameter that aligns with the starting date of the experimental data. By incorporating these four parameters, our four-parameter simulation accurately fits all existing COVID-19 epidemic data and will contribute to the prediction of future outbreaks. Moreover, the mechanism of Gompertz growth has been elucidated by our model.

4.2. On the Mutation of the SARS-CoV-2 Virus

The SARS-CoV-2 virus continuously undergoes mutations, giving rise to new strains. This ongoing mutation process is the reason why the pandemic has persisted for more than three years. As a result of natural selection, these new mutant strains possess higher infectious ability but lower lethality rates. Although the lethality rate of the new strains may be lower, it still results in a significant number of deaths. Therefore, since mutation occurrence is inevitable and costly, the key focus should be on reducing the epidemic probability of mutants. While humans cannot prevent the virus from mutating, they can hinder the mutant strains from dominating the competition. According to six prediction rules on the cross-spreading of two strains, it is highly unlikely for the intersection point t_cr to manifest under the following conditions: (1) when Δ > 0, there are three domains and in this case there will be no occurrence of t_cr within a domain if the period of this domain is short enough (i.e., t_s − t_a small, Δ small, or t_m large) or if the symbol of D₂₁ within a domain does not align with the symbol of F₁ − F₂ at the initial time of this domain; (2) when Δ < 0, there is only one domain and in this case there will be no t_cr if the period of virus spread is short enough or if the symbol of D₂₁ does not align with the symbol of F₁ − F₂ at the initial time.

An example to highlight the phenomenon of no occurrence of t_cr is the winter epidemic in China in 2022. Among tens of millions of SARS-CoV-2 cases in one city, no new mutant strain emerged or triumphed over the competition, with the exception of the original omicron strand. This suggests that the pandemic was effectively terminated within a short period, lasting only one month.

In summary, the strategies proposed by our model to control an epidemic have two main aspects. First, it is critical to prevent the occurrence of a second wave. Second, one should aim to avoid competition among mutant strains. In the latter scenario, minimizing the duration of virus spread is an efficient approach.

4.3. Dependence of Virus Infection Potential on Temperature and Humidity

The conformational equilibrium between the open and closed conformations of the receptor binding domain (RBD) of the spike (S) protein can be analyzed using first-principle techniques for a susceptible individual. The RBD can exist in either the open or closed position, known as the up or down conformation, respectively. The population of conformational states can be determined based on the free energy change during conformational transitions of the S protein. Let us denote the closed conformation as state A and the open conformation as state B. The Gibbs free energies of states A and B are represented by G_A and G_B, respectively. Generally, if G_A is lower than G_B, the RBD will primarily assume the inactive conformation A. Conversely, in order to initiate the infectious process, the equilibrium should shift towards the open conformation, implying that G_B should be lower than G_A [16]. The free energy G in a given conformation is related to the partition function Z, which can be expressed as follows:

G = - k_{B} T l n Z, Z = \sum_{n} e^{- β E_{n}} (β = \frac{1}{k_{B} T}) .

(12)

Here E_n represents the energy level of a given conformation,

{(E_{n})}_{A, B} = V_{A, B} + (n + 1 / 2) ℏ ω_{A, B}

(13)

V_A,B represents two minima of conformational potential, respectively, and (n + 1/2)ħω_A,B represents the corresponding vibrational energy around the minimum of the conformational potential. By the summation of Boltzmann factor over vibration states one has

\frac{Z_{A}}{Z_{B}} = e^{- β (V_{A} - V_{B})} Y_{A / B}, Y_{A / B} = \frac{e^{\frac{1}{2} β ℏ ω_{B}} - e^{- \frac{1}{2} β ℏ ω_{B}}}{e^{\frac{1}{2} β ℏ ω_{A}} - e^{- \frac{1}{2} β ℏ ω_{A}}}

(14)

Let T_C represent the phase transition temperature, which can be determined by ΔG = G_B − G_A = 0. From Equations (12) and (14) we obtain a simplified equation for T_C

\frac{2 (V_{B} - V_{A})}{ℏ {(ω}_{A} - ω_{B})} = c o t h \frac{ℏ ω_{A}}{2 k_{B} T_{C}} (a s |\frac{ω_{A} - ω_{B}}{ω_{A}}| ≪ 1)

(15)

and ΔG > 0 for T > T_C, ΔG < 0 for T < T_C. Therefore, if the environmental temperature decreases to T < T_C, the conformational transition from the closed conformation to the open conformation occurs rapidly. This explains why the virus has higher transmission rates during the winter and the entrance to host cells is prioritized. Conversely, the summer season provides the most favorable conditions for virus elimination. In addition to temperature, the conformational equilibrium is influenced by humidity. The virus can be modeled as a charged sphere, and through the application of electrostatics principles to salty solutions, one can derive an expression for the potential at the surface of the charged sphere, where the dielectric constant ε is incorporated [17]. This implies that the elastic frequency ω² should be replaced by ω²/ε. Considering water has a dielectric constant of ε = 80, the frequency parameter takes a reduced value of ω/9 in the presence of a fully salty solution rather than ω in a vacuum. Consequently, this provides a quantitative estimation of the virus’s infection potential, which is strongly dependent on humidity.

The above discussions on the susceptible individual can be extended to the population level, providing evidence that R₀ is influenced by temperature and humidity. The point will be discussed in detail in the future work.

5. Conclusions

(1): We propose a logically simple model for analyzing the spread of the COVID-19 virus. The model is based on two fundamental assumptions: each viral strain possesses a basic reproduction number R₀, which quantifies its transmission potential, and the instantaneous reproduction number (R_t or R_j) decreases gradually due to factors such as population immunity and interventions, which can be represented by an attenuation constant k (k < 1).
(2): The daily number of infections (DNI) and the cumulative number of infections (CNI) versus time (m) are deduced based on the aforementioned two assumptions. By utilizing the explicit relation m(t) where t is the physical time, our simulations with the four parameters (R₀, k, q, t₀) demonstrate excellent agreement with all experimental data.
(3): Insights obtained from typical plots of CNI vs. m (i.e., typical plots of F(R₀, k; m)) provide valuable information regarding the conditions required for the decline of a viral infection wave, as well as an explanation for the occurrence of continuous second waves of infection and its preventive measures.
(4): The persistence of the SARS-CoV-2 pandemic for more than three years can be attributed to frequent mutations. We thoroughly examine the cross-spread of two strains within a region and lay a theoretical foundation for designing strategies to avoid competition among mutant strains or hindering the mutant strains from dominating the competition.

Author Contributions

Conceptualization, L.L.; validation, J.L. and L.L.; investigation, J.L.; writing—original draft preparation, L.L.; writing—review and editing, J.L.; visualization, J.L.; supervision, L.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

Kermack, W.O.; McKendrick, A.G. A contribution to the mathematical theory of epidemics. Proc. R. Soc. Lond. A 1927, 115, 700–721. [Google Scholar] [CrossRef]
Murray, J.D. Mathematical Biology, 2nd ed.; Springer-Verlag GmbH: Berlin/Heidelberg, Germany; New York, NY, USA, 1993; pp. 1–92. [Google Scholar] [CrossRef]
Squazzoni, F.; Polhill, J.G.; Edmonds, B.; Ahrweiler, P.; Antosz, P.; Scholz, G.; Chappin, E.; Borit, M.; Verhagen, H.; Giardini, F.; et al. Computational models that matter during a global pandemic outbreak: A call to action. JASSS 2020, 23, 10. [Google Scholar] [CrossRef]
Adam, D. Special report: The simulations driving the world’s response to COVID-19. Nature 2020, 580, 316–318. [Google Scholar] [CrossRef] [PubMed]
Sridhar, D.; Majumder, M.S. Modelling the pandemic. BMJ 2020, 369, m1567. [Google Scholar] [CrossRef] [PubMed]
Parshani, R.; Carmi, S.; Havlin, S. Epidemic threshold for the susceptible-infectious-susceptible model on random networks. Phys. Rev. Lett. 2010, 104, 258701. [Google Scholar] [CrossRef] [PubMed]
Valdez, L.D.; Braunstein, L.A.; Havlin, S. Epidemic spreading on modular networks: The fear to declare a pandemic. Phys. Rev. E 2020, 101, 032309. [Google Scholar] [CrossRef] [PubMed]
Wan, S.; Liu, J.; Liu, M. Progress on the basic reproduction number of SARS-CoV-2. Chin. Sci. Bull. 2020, 65, 2334–2341. [Google Scholar] [CrossRef]
Bugalia, S.; Tripathi, J.P.; Wang, H. Estimating the time-dependent effective reproduction number and vaccination rate for COVID-19 in the USA and India. Math. Biosci. Eng. 2023, 20, 4673–4689. [Google Scholar] [CrossRef]
Hasan, A.; Susanto, H.; Tjahjono, V.; Kusdiantara, R.; Putri, E.; Nuraini, N.; Hadisoemarto, P. A new estimation method for COVID-19 time-varying reproduction number using active cases. Sci. Rep. 2022, 12, 6675. [Google Scholar] [CrossRef]
Hale, T.; Angrist, N.; Goldszmidt, R.; Kira, B.; Petherick, A.; Phillips, T.; Webster, S.; Cameron-Blake, E.; Hallas, L.; Majumdar, S.; et al. A global panel database of pandemic policies (Oxford COVID-19 Government Response Tracker). Nat. Hum. Behav. 2021, 5, 529–538. [Google Scholar] [CrossRef]
Costris-Vas, C.; Schwartz, E.J.; Smith, R. Predicting COVID-19 using past pandemics as a guide: How reliable were mathematical models then, and how reliable will they be now? Math. Biosci. Eng. 2020, 17, 7502–7518. [Google Scholar] [CrossRef]
Levitt, M.; Scaiewicz, A.; Zonta, F. Predicting the trajectory of any COVID19 epidemic from the best straight line. medRxiv 2020, 2020.06.26.20140814. [Google Scholar] [CrossRef]
Ohnishi, A.; Namekawa, Y.; Fukui, T. Universality in COVID-19 spread in view of the Gompertz function. Prog. Theor. Exp. Phys. 2020, 12, 123J01. [Google Scholar] [CrossRef]
Ikeda, Y.; Sasaki, K.; Nakano, T. A new compartment model of COVID-19 transmission: The broken-link model. Int. J. Environ. Res. Public. Health 2022, 19, 6864. [Google Scholar] [CrossRef] [PubMed]
Luo, L.F.; Zuo, Y.C. Spike conformation transition in SARS-CoV-2 infection. arXiv 2021, arXiv:2009.11288. [Google Scholar]
Phillips, R.; Kondev, J.; Theriot, J.; Garcia, H.G. Physical Biology of the Cell, 2nd ed.; Garland Science: New York, NY, USA, 2012; pp. 355–382. [Google Scholar]

Figure 1. COVID-19 pandemics in the UK (A) and Hong Kong (B).

Figure 2. CNI functions F(R₀, k; m) for several typical R₀ values. (A): R₀ = 18.6 (for Omicron BA.4, BA.5 in South Africa, 2022-1), (B): R₀ = 9.5 (for Omicron B.1.1.529 in many countries, 2021-11), (C): R₀ = 6.5 (for Delta in India, R0 = 5–8, 2020-10), (D): R₀ = 4.5 (for Alpha in the UK, 2020-9, Beta in South Africa, 2020-5, Gamma in Brazil, 2020-11, R₀ = 4–5), and (E): R₀ = 2 (for SARS-CoV 2003, R₀ = 2–3).

Figure 3. Discriminant Function and intersection occurrence.

Figure 4. CNI simulation in the UK from November 2020 to November 2021. (A): CNI simulation of delta spread in stage C, (B): CNI simulation of alpha/delta spread in stage B, and (C): discriminant function and intersection of alpha/delta spread in stage B (left panel gives discriminant function and right panel the cross-spread of two strains). Note: CNI simulation of alpha spread in stage A from November 2020 to April 2021 has been plotted in Figure 1A.

Figure 5. CNI simulation in the UK from November 2021 to February 2022. (A): CNI simulation of omicron spread in stage E (only the data of first omicron peak are used.); (B): CNI simulation of delta/omicron spread in stage D; and (C): Discriminant function and intersection of delta/omicron spread in stage D (left panel gives discriminant function and right panel the cross-spread of two strains).

Table 1. Parameters related to virus infection dying out.

R₀	k_th (m_st = 15)	E_km = (1/k_th)⁷	N_th	k_cr (N_max = 10⁷)
18.6	0.613	30.7	10⁵	0.722
9.5	0.722	9.78	∼10⁵	0.8
6.5	0.75	7.49	∼10⁴	0.85
4.5	0.8	4.77	∼10³	0.9
2	0.9	2.09	∼10²	>0.95

Table 2. CNI and arrival time t of two strains on F(t)-ladder.

t	CNI
c_i − d_i¹	a_i
2c_i − d_i	a_i + b_ia_i²
3c_i − d_i	a_i + b_ia_i² + b_i³a_i³
4c_i − d_i	a_i + b_ia_i² + b_i³a_i³ + b_i⁶a_i⁴
5c_i − d_i	a_i + b_ia_i² + b_i³a_i³ + b_i⁶a_i⁴ + b_i¹⁰a_i⁵
6c_i − d_i	a_i + b_ia_i² + b_i³a_i³ + b_i⁶a_i⁴ + b_i¹⁰a_i⁵ + b_i¹⁵a_i⁶

¹ i = 1, 2 refer to two strains, respectively, to save space only six steps of the F(t)-ladder are listed.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Luo, L.; Lv, J. Mathematical Modelling of Virus Spreading in COVID-19. Viruses 2023, 15, 1788. https://doi.org/10.3390/v15091788

AMA Style

Luo L, Lv J. Mathematical Modelling of Virus Spreading in COVID-19. Viruses. 2023; 15(9):1788. https://doi.org/10.3390/v15091788

Chicago/Turabian Style

Luo, Liaofu, and Jun Lv. 2023. "Mathematical Modelling of Virus Spreading in COVID-19" Viruses 15, no. 9: 1788. https://doi.org/10.3390/v15091788

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Mathematical Modelling of Virus Spreading in COVID-19

Abstract

1. Introduction

2. Materials and Methods

3. Results

3.1. Derivation of Formulas for Cumulative Number of Infections (CNI) and Daily Number of Infections (DNI)

3.2. Insights from Typical Figures of CNI vs. m

3.3. Prediction on the Second Wave of Pandemics

3.4. Cross-Spread of Two Viruses: Discriminant Function

3.5. Examples of the Cross-Spread of Two Viruses

4. Discussion

4.1. On the Simulation of COVID-19 Cases

4.2. On the Mutation of the SARS-CoV-2 Virus

4.3. Dependence of Virus Infection Potential on Temperature and Humidity

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI