Abstract
We consider systems of differential equations with polynomial and rational nonlinearities and with a dependence on a discrete parameter. Such systems arise in biological and ecological applications, where the discrete parameter can be interpreted as a genetic code. The genetic code defines system responses to external perturbations. We suppose that these responses are defined by deep networks. We investigate the stability of attractors of our systems under sequences of perturbations (for example, stresses induced by environmental changes), and we introduce a new concept of biosystem stability via gene regulation. We show that if the gene regulation is absent, then biosystems sooner or later collapse under fluctuations. By a genetic regulation, one can provide attractor stability for large times. Therefore, in the framework of our model, we prove the Gromov–Carbone hypothesis that evolution by replication makes biosystems robust against random fluctuations. We apply these results to a model of cancer immune therapy.
1. Introduction
In this paper, we consider systems of differential equations with polynomial and rational nonlinearities and with a dependence on a discrete parameter (hybrid systems). Such systems arise in biological and ecological applications, where the discrete parameter allows us to incorporate in the model a genetic regulation by a genetic code. We investigate the stability of attractors of these systems under stress. The attractors define system functioning regimes, while stresses are sudden changes of system parameters (for example, heat shocks). Our aim is to investigate the attractor stability under infinite sequences of shocks. It is inspired by the following idea of M. Gromov and A. Carbone: “Homeostasis of an individual cell cannot be stable for a long time as it would be destroyed by random fluctuations within and out of cell. There is no adequate mathematical formalism to express the intuitively clear idea of replicative stability of dynamical systems” ([1], p. 40). It is clear that replicative stability is important for the evolution of cancer cells, viruses, for example, such as COVID 19, and bacteria such as E. coli. There is no doubt that replication helps viruses and bacteria survive in changing environments (see, for example, Lensky’s experiment with E. coli [2]). However, there is a non-trivial question: how to estimate chances that such replicative evolution can continue eternally even under action of therapeutic agents? For COVID-19, it is an important question of will we have fourth or fifth pandemic waves, etc., or can the virus vanish? In this paper, however, we consider cancer cells as an example because our analyses are more applicable to this case.
Using a model, proposed in this paper, one can formulate the Gromov–Carbone idea in rigorous mathematical terms and, under some assumptions, to prove this hypothesis. Our model exploits ideas of artificial intelligence theory; namely, we suppose that the response to stress is defined by deep gene networks (DGN), which are similar to deep neural networks (DNNs). It allows us to use estimates obtained recently in DNN theory [3,4,5]. Note that deep networks were recently applied as interpretable models of the gene network regulations in [6].
To describe replicative stability, we incorporate into our model a regulation by the discrete parameter (for biosystems and ecosystems it is a gene regulation) and consider how the stochastic stability depends on regulation. Our results help us to explain, for example, why cancer cells exhibit a long survival time even under a strong therapy. Indeed, cancer cells give us an example of evolving biosystem affected by a sequence of stresses (for example, immune therapy or chemotherapy). It is worth noting that replicative stability for biochemical systems, introduced in this paper, is a new idea for the theory of stability (robustness) of these systems. Earlier, a number of works (see fundamental papers [7,8] and references therein) considered a more usual approach, which investigates reactions of the biochemical system to the small disturbances of kinetic rates or reagent concentrations. We also investigate the stability in the vicinity of small changes in some states (attractors). However, we take into account variations of the system genotype. We suppose that these variations compensate for a significant part of external perturbations that reinforces the system stability. The gene regulation system evolves in such a way that this compensation mechanism becomes more and more effective, which provides stability under fluctuations for large times.
Let us first outline the concept of stochastic stability. We consider organisms as biochemical machines, which can be modeled by a system of ordinary differential equations. A regime of our biosystem functioning is defined by an attractor . Let us denote by the probability that the system state lies in a -neighborhood of the attractor within the time interval . The parameter defines the size of a homeostasis domain, where the system stays viable (here we follow viability theory, see, for example, [9]). In the framework of that viability approach, the Gromov–Carbone hypothesis reduces to two claims. The first assertion is that for biological systems with fixed parameters and a fixed genetic code,
This means that all biosystem states, sooner or later, will be destroyed by fluctuations. The second assertion is that a sequence of modifications of the system genotype can lead to eternal stability, where the viability probability stays positive for all times:
where . The constant can be interpreted as follows: imagine a large population of cancer cells, bacteria or viruses. Let this population be affected by consecutive therapeutic attacks. The constant describes the relative fraction of cells ( bacteria, viruses) that will survive.
In other words, systems with fixed parameters are unstable under environment fluctuations but the evolution of the genetic code can stabilize them. This replicative stability is considered in [10] for some network models; see [11] for an overview. For some classes of hybrid dynamical systems involving a dependence on discrete genotypes , which can evolve in time, it is shown that inequality (2) implies
i.e., the genotype length L has a tendency to increase. Here we assume that the genotype s is a Boolean string and the length L of s is N. However, it is well known that the genotype length does not correlate directly with the morphological complexity and that evolution does not always lead to a genotype length increase. For example, Drosophila, nematode and human have numbers of coding genes of the same order (although bacteria genomes are much shorter than the nematode genome). Nonetheless, it is well known that the gene regulation system has become more and more complex throughout evolution. In this paper, we use deep networks (DNs) to describe gene regulation and show that DN network growth makes biosystems stable for long time periods.
Let us outline the model in more detail. In contrast to some earlier works [12,13,14], which used either formidable computations [12,13], or sophisticated nonlinear asymptotics [14], we use deep networks. These networks consist of, in general, many layers. The depth of a network is the number of hidden layers, and the size is the total number of units. In practice, networks with depth can be considered as shallow, while deep networks typically have many layers. Connections between units in these networks depend on coding sequences , of Boolean genes, which could be switched off () or switched on (). The inputs of the DN’s are the stress parameter values and the system states. The DN produces an output, which depends on unit interconnections and thus on the genetic code, which determines these interconnections. The main idea behind our model of regulation is that the DN output and the stress impact cancel out each other.
To describe stress impacts, we use a model of subsequent environmental shocks. This idea is consistent with the Gould–Eldredge concept [15], where evolution is represented as a sequence of intermittent short bursts and longtime stasis periods, the evolution bursts appear in response to stresses induced by environment. Let be an infinite increasing sequence of time moments . These moments can be considered as moments of environment shocks (stresses), when the environment suddenly changes. At these moments, some system parameters can change. A well-known example is a heat shock connected with temperature, which affects kinetic rates in biochemical systems and produces mutations [16]. For cancer cells, these shocks can describe a therapy. Our aim is to estimate the probability that a local attractor is robust under all those environmental shocks within a large (maybe, infinite) time interval.
Let us outline our results. The first result (Theorem 1) concerns a connection between gene regulation complexity and uncertainty of environment. The important measure of genetic regulation complexity is the number of equilibria of gene regulation network . This question on the number of the equilibria has received the attention of many works; in particular, it was studied for the famous NK model (pioneered in [17]). The NK model is a mathematical model describing a “tunably rugged” fitness landscape, depending on two parameters, N and K. The parameter N is the length of a Boolean genotype s length, and K defines the topology of fitness landscape via the number of genes involved in the trait control. The NK model has the main applications in evolutionary biology. First, it was supposed that is a relatively small, , where an exponent . In particular, for the Hopfield model with symmetrical interaction [18], this relation holds with . However, later it was shown that in the NK model, can be exponentially large [19]. A similar result is obtained for another simple model, which describes a special class of Hopfield models with asymmetric interaction and a special interaction topology [20]. Experimental data [21,22] show that in gene interaction networks there are strongly connected nodes that can be named hubs, or centers. The hubs communications go via a number of weakly connected nodes. This centralized connectivity has been found in gene–gene and protein–protein interaction networks [21,22], and such networks can support many equilibrium states.
To explain why they have a number of equilibria, we define an entropy of environment uncertainty, depending on the robustness level and gene regulation. This quantity is the Kolmogorov -entropy (see, for example, [23,24]) on the metric space of perturbations with a special metric. We show that if an attractor describing biosystem functioning does not collapse under all perturbations (conserves its topological structure) then there exists a connection between and the entropy :
This means that entropy of regulation must be no less than the environment uncertainty.
This result, in particular, shows that, to kill cancer cells, we should use a combined therapy (for example, chemotherapy together with a heat therapy). Hyperthermia therapy is heat treatment for cancer that can be a powerful tool when used in combination with chemotherapy or radiation. Chemotherapy resistance develops over time as the tumors adapt (due to replicative stability). Chemoresistance against several anticancer drugs could be reversed by the addition of heat therapy.
Our results show that before all, it is necessary to attack hubs of the gene regulation network. In fact, critically depends on the gene interaction topology. Even a small increase in the number of hubs in gene regulation networks leads to a sharp increase in [20]. These results are consistent with the conclusions of [25,26], where it was shown that network hubs buffer environmental variations and also make switches in regulatory networks. Our results are supported by some experimental data, at least qualitatively [27]. A number of hub genes are identified in cancer cell regulation. According to these results, drugs attacking the hub genes should be especially effective. A list of such candidate drugs targeting hub genes can be found in Table 3 from [27].
There is also a connection with basic results of hard combinatorial problem theory. Fundamental results [28] on Boolean functions and the K-SAT problem assert that to satisfy many constraints, we should use a number of Boolean variables. Our result (4) can be considered as an analog of these results. If we have a genotype s of the length N, then ; thus, for Boolean regulation models, (4) implies that the genome length N should be more than the entropy .
Furthermore, we show that it is possible to provide stochastic attractor robustness by an appropriate genotype choice at each environment shock . If, in contrast, the genotype is fixed, then the stochastic stability tends to zero as . For cancer these results are confirmed by numerical simulations, where we use a model from [29] extended to take into account the evolution of cancer cells. If the cancer cells do not evolve, immune therapy may be effective, otherwise cancer wins sooner or later.
To conclude this introduction, let us note that the estimates of stochastic stability, obtained in the paper (see Theorem 3), allows us to find the main factors, which determine the organism fate under the stress stability: the sparsity of reaction network, the stress sensitivity, the stress dimension (the number of perturbed external parameters affecting the rates of biochemical reactions) and the size of gene regulation network.
The paper is organized as follows. In the next section, we describe models with random perturbations and gene regulation. In Section 4, we state the model of environmental shocks and formulate some assumptions needed for the mathematical tractability of the problem. In Section 5, we formulate some definitions, and further, we find a necessary condition for the attractor stability.
In Section 7, we consider attractor robustness under a sequence of environment shocks. In this section, we apply DNN’s that finally give Theorem 3, which states an estimate of replicative stability. The last section contains a discussion and conclusions. To simplify reading, the technical parts of demonstrations are relegated into Appendix A.
2. Materials and Methods
We use standard methods of mathematical analysis and theory of differential equations. Moreover, we apply the known results for approximations of Lipshitz smooth functions by deep neural networks [3,4,5]. For numerical simulations, and to produce Figure 1, we used Matlab and the standard Euler method.
Figure 1.
This plot shows dynamics of concentration of tumor cancer cells in three cases: no therapy (the curve 1), therapy in the case when tumor cells do not evolve ( the curve 3) and therapy in the case when tumor cells mutate with the probability (the curve 2). The parameters are taken from [30]: , . We have a suite of 20 shocks, which at the time moments , . For the increasing rate, the tumor cell rate increment is . For the decreasing and increasing curve, the infusion dose is .
3. Model with Random Parameters
We describe our models in two steps, first we formulate a model without gene regulations and then with regulation.
We consider systems of differential equations with a dependence on a random process . They have the form
where , where D is a compact domain in with a smooth boundary , are biochemical species concentrations, , the reaction terms are sufficiently smooth functions of v, for example, multivariate polynomials in v, and also smooth functions of . The functions are random processes of a trajectory, which are piecewise constant in t and take values in a compact subset of , where the quantity p is a positive integer (dimension of stress parameter ). These processes describe multiplicative noise. The source of this noise is an environment of a biosystem. Other assumptions of will be described later.
We complement the system of stochastic Equation (5) by initial conditions
To define f we use the standard models of chemical kinetics and population dynamics. The simplest choice is a polynomial model, which can be derived by the law of mass action:
where is a multi-index with integers , are finite subsets of and are coefficients that determine kinetic rates. Note that the well-known Lotka–Volterra and generalized Lotka–Volterra systems (which have ecological and economical applications, see [31,32]) are also included in the class of systems defined by (7).
In real biochemical applications, we are often dealing with binary reactions, where only two exponents , and sparse , where, although the reagent number n may be large, for each i, the number of non-zero coefficients in each row is bounded, . In a more sophisticated case (which arises when in a slow-fast polynomial system we express fast variables via slow ones) we have rational v nonlinearities, for example,
where and are polynomials. We suppose that are separated from zero:
for all and . Such systems appear in biochemical kinetics, population dynamics and cancer modeling. As an example, let us consider the following model of cancer immunotherapy [29,30]:
where denotes the concentration of cells with antitumor activity in the tumor site (CTL cells), and is the concentration of tumor cells. This model thus concerns two cell populations: normal cells and cancer ones. Parameters are positive coefficients, which can be explained as follows. The parameter is the normal rate of immune cell flow into the tumor site, is the immune cells death rate due to interaction with tumor cells, is the fraction of tumor cells killed by immune cells, is the tumor cells carrying capacity, r is the tumor cells growth rate, is a natural death rate of immune cells, is the steepness coefficient of immune cell recruitment and is the maximal immune cells recruitment rate [29]. For this model and its generalizations are investigated in [30], the term describes a therapy by infusion of CTL cells (it is proposed in [29]).
To provide the existence and uniqueness of solutions to the Cauchy problem (5) and (6) for all , we assume the following. Let
where is a unit normal vector directed outside D at the point . Then solutions of the Cauchy problem (5) and (6) exist and are unique for all .
It is worth noting that polynomial and rational functions may not satisfy conditions (12). Since we further investigate a local stability of local attractors, we can restrict ourselves to considering narrow neighborhoods of those attractors . Therefore, we can truncate nonlinearities in setting for instance , where outside of an open neighborhood W of .
3.1. Extended Model with Regulation
Different complicated models of gene regulation were proposed, for example, [33,34] for Drosophila morphogenesis, however, we apply another approach using DNN’s and inspired by recent biological works (for example, [6]).
Let be a time dependent gene expression vector. We have N Boolean genes , which take values in the set , . The i-th gene may be switch on (when ), or switch off (when ). The Boolean string s encodes interconnections in the networks of gene regulation (see Section 7.2). We incorporate genes s into system (5) using the following
Assumption GRSuppose that coefficients depend on the genetic code s:
and they are Lipshitz in ξ for each s.
This assumption means that regulation goes via a genetic control of kinetic rates. Moreover, we assume that the network finds a genotype s, which produces an optimal answer to stress, almost instantly. We obtain thus that s is involved in the vector field via kinetic coefficients : .
Let us introduce vector fields describing perturbations. Consider, for example, a heat shock. All kinetic rates depend on the temperature T: , sometimes in a complicated manner. However, typically we have a reference value corresponding to an optimal functioning regime (for example, if T is the human body temperature, then is about 36 degrees). Let , where is a temperature increment corresponding to a heat shock. Then in the case (7) one obtains that the perturbations of the have the form
where
In the general case, let be an optimal parameter value corresponding to a normal system functioning. Then we denote by
the perturbation of f, which appears as a result of a parameter variation. Let us denote the non-perturbed field by . Then we have the two following systems: the non-perturbed one
and the perturbed system:
where, in general case, and s can depend on t.
3.2. Attractors and Semiflows
Let us denote by a compact invariant locally attracting set (a local attractor) of semiflow generated by the Cauchy problem for system (17) for given fixed s and . The semiflow generated by the non-perturbed system (16) will be denoted by and the corresponding attractors by . We consider mainly two cases: is a stable hyperbolic equilibrium or a stable hyperbolic limit cycle with a period . (A reminder that hyperbolicity for an equilibrium means that the spectrum of the corresponding linearized operator does not intersect an imaginary axis, while for cycles it means that the corresponding Floquet multiplicators do not lie on the unit circle of the complex plane. For the general theory of hyperbolic sets, see [35,36]). We are going to estimate the probability that a local attractor (or an invariant set) of our system is stable under those shocks. We define that probability in the next subsection.
4. Stochastic Stability under Shocks
4.1. Environmental Shocks
To describe stresses, we use the model of subsequent environmental shocks. Let be an infinite increasing sequence of time moments such that , where and are characteristic times of gene regulation functioning and biochemical model dynamics, respectively. At the moments we have environment shocks, when an interaction between the environment and the biosystem changes. We suppose that within interval the parameter , where at each step j we choose randomly as follows in two steps. The first step is a random choice of the set of perturbations affecting the system at j-th interval . We choose the set randomly from a countable set . The index k in defines a kind of interaction between the organism and its environment. Each set is equipped with a probabilistic measure .
The second step is a random choice of . The shocks are independent, and at each step j and each k, we sample according to the measure . Therefore, we obtain a sequence of independent random variables , , which describes subsequent random changes of our stress environment.
4.2. Attractor Stochastic Stability
To formulate the stochastic stability problem, we use the viability approach [9]. Let be an attractor under consideration. For we introduce a small tubular neighborhood of that attractor by
where the distance between a state v and a subset is defined as
where is the standard Euclidean norm in .
Let us introduce probability that the state of our system lies in within the time interval :
Naturally we suppose that the initial value . The probability depends on that initial value. Since our aim to obtain estimates of uniform in we will omit dependence on . The probability can be considered as a measure of attractor stochastic stability.
5. Robustness with Gene Regulation
5.1. Definitions
Let us formulate certain definitions. We consider ergodic attractors . The statement of ergodic attractor’s theory can be found in [37]. For such attractors and sets, the time average can be replaced by averages over the attractor taken by an invariant measure defined on the attractor. Let us introduce the time averages along trajectories
where is a smooth function, and is a trajectory on the attractor. For ergodic attractors these averages exist and do not depend on the choice of the trajectory on the attractor for almost all with respect to the measure . Of course rest point and limit cycle attractors are ergodic, but theory [37] also concerns the non-trivial situation, where may be chaotic.
Let us denote by the set of all smooth functions defined on D. We denote by the Lipshitz constant of :
Definition 1.
We say that a local attractor of the semiflow defined by Equation (17) with is ε-robust with respect to the stress parameter ξ and the genotype s if
i.e., the time averages over the perturbed and unperturbed attractors are ϵ-close.
Note that if the attractor is hyperbolic and is small, then due to the persistence of hyperbolic sets [35,36], the perturbed attractor is topologically equivalent to the non-perturbed one.
Definition 2.
A local attractor of the non-perturbed flow defined by system (16) is ϵ-robust with gene regulation with respect to the stress environment if for each there is an appropriate genotype such that
where is the local attractor of the semiflow .
Due to the definition of robustness under gene regulation for each field we have a genotype such that trajectory averages over the perturbed attractor are -close to ones for non-perturbed attractor of system (16). We have thus the map from the set into the set of all genotypes .
5.2. Robustness Condition
Let us formulate a claim.
Proposition 1.
Let a local attractor of a non-perturbed system (16) be ergodic and ε-robust with respect to and . Then
where the constant is uniform in .
6. Gene Regulation and Robustness
In this section, we describe a connection between robustness and the gene regulation.
Environment Uncertainty
In this subsection, we introduce a measure of external perturbation uncertainty. Let be a number. Consider the set of perturbations . Let us introduce a special distance between perturbations depending on the attractor and g:
where denotes the set of all genotypes.
By we denote a -covering of consisting of the sets , :
where the diameter of a bounded set B is defined by
The formula for the diameter can be rewritten as
We define as the Kolmogorov -entropy of :
where we take the infimum over all -coverings.
Remark 1.
The introduced entropy depends on the parameter δ, and decreases in . Moreover, depends on the unperturbed attractor (we omit that dependence in our notation).
Remark 2.
The Kolmogorov δ-entropy has numerous applications for coding, image processing and neural networks, see [23,24].
It is difficult to compute ; however, we can find a rough estimate of this entropy via the dimension of the stress parameter. Suppose that the diameter of is bounded: where is a positive constant, which can be interpreted as a maximal absolute value of the stress parameter variations. Then we obtain
If the set is not compact then .
There holds
Theorem 1.
Let an ergodic attractor of a non-perturbed system (16) be ε-robust with gene regulation. Then for sufficiently small one has
where , where is a constant uniform in ε as .
For a proof, see Appendix A. The idea is simple: using Proposition 1 one can show that if the same genotype provides the stability for all perturbations from a set B, then .
By results of [20] we can find a connection between the number of centers (hubs) in the gene regulation network and the entropy . To find a rough estimate, we suppose that we have a starlike network with centers and each center is connected to satellites. Then the equilibrium number has the order [20] and
Equation (30) shows that the main factor determining the effectiveness of the gene response to stresses is the number of hubs involved in the corresponding gene regulation, and (we use here that ln is a very slow increasing function of its argument). The estimate (28) then gives the following topological condition of the stability:
i.e., roughly speaking, to support stability, the number of the gene network hubs should be proportional to the dimension of the stress parameter.
7. Stability under Many Shocks
In this section, we use the model of gene regulation from Section 4.1. We are going to prove the Gromov–Carbone hypothesis and show that the complexity growth of the gene regulation network can lead to a successful evolution, where attractors stay robust under all shocks. First, in the coming subsection, we study the case when the gene regulation does not work and the genotype s is fixed.
7.1. Instability for Systems with Fixed Parameters
Let us prove a claim. Remember that on the set of possible perturbations , a probabilistic measure is defined.
Theorem 2.
Let be a hyperbolic equilibrium for system (17) with . Suppose that for each s the range of the map contains a closed ball of radius centered at 0, and the measure μ on is continuous with respect to the Lebesgue measure on .
Then for sufficiently small
where .
Proof.
Using the theorem assumptions, we choose such a perturbation such that
If is small enough then by implicit function theorem and using that our equilibrium is hyperbolic, one can show that the equation
has a solution close to : . This solution is a hyperbolic equilibrium attractor for system (17). If a trajectory stays inside the -neighborhood of for all , this trajectory converges to :
Therefore, . However, according to Proposition 1, one has
which creates a contradiction with (33) for sufficiently small . Therefore, we have a non-zero probability to find a such that all trajectories , which start in a -neighborhood of , leave that neighborhood within a sufficiently large interval . Now we use the fact that the parameters are i.i.d. random variables. In fact, consider the intervals . The probability to find such that the state leaves within is not less than . All these leaving events are independent, because are independent. Therefore, the probability of leaving within is not less than , and as a result, we obtain (32). □
This result confirms Gromov–Carbone’s hypothesis on the fundamental instability of biosystems under large parameter variations. However, if we suppose that the gene regulation system can evolve, it is possible that inequality (2) holds for a . Then, evolution can continue within a large time frame, with a positive probability. We consider that evolution in the coming subsection.
7.2. Evolution and DNN’s
To investigate the evolution problem in more detail, we apply recent results of the deep networks theory [4,24,38]. Recently, people started to apply ReLU networks to model gene regulation [3,6,39]. The deep networks with ReLU activators allow us to overcome the curse of dimensionality [38].
Let the gene regulation be defined by deep networks with ReLU activator functions. Remember that the depth of a network is the number of hidden layers, and the size is the total number of units. Let us denote by the maximal number of units in the network layers.
The following estimate follows from results [4]. Let be a Lipshitz function defined on a compact set . Then there is a network with layers and units in each layer such that
where is the Lipshitz constant of G.
Note that this approximation is made by a network with real-valued synaptic weights, for example, for a single hidden layer perceptron, we have
where are real valued coefficients. We however need a network whose parameters depend on a binary string s. For ReLU sigmoids , this binary coding problem can be resolved as follows. Consider for simplicity the case (35) (a multilayered case can be considered in a similar way). We first find values of real valued coefficients , providing the needed approximation. Each we approximate by M bits within a precision :
where
The ReLU function is Lipshitz with the constant 1. Therefore,
for bounded inputs and some constants , and denotes the network of the same structure as but with the weights . The same procedure with the replacement of to can be performed for coefficients .
Remark. If the network has the size , then we use genes to construct the network encoded by a binary string .
7.3. Result on Replicative Stability
We assume that assumption GR holds and consider the non-perturbed system (16). Let be a local attractor of the semiflow generated by this system. The following theorem describes the probability of stochastic robustness of this attractor with gene regulation under a sequence of shocks , . Together with (16) we consider the same systems with gene regulation terms defined by DNN’s (which are considered in the previous subsection):
Within each interval , the system under an environmental shock is defined by the parameter , , where is the k-th set of external perturbations. The shocks are independent and we choose the set randomly from a countable set . The index k defines a kind of interaction between the organism and its environment. At the k-th step the corresponding gene regulation networks have the size , the depth and units in each layer. The weights of connections in that network are encoded by a genotype as it is explained in the previous subsection. We suppose that this network finds the correct choice instantly. These networks solve the following approximation problem:
Approximation problem APFor a given target function
of v to find a deep network such that the error
is minimal.
Further, to formulate the next theorem, let us introduce important parameters permitting to estimate the Lipshitz constants of the target functions . The first parameter is the sparsity of the reaction network , which is the number of elements in . We introduce the sparsity of the whole biochemical system by . The second parameter is a sensitivity of kinetic rates under the perturbation of k-th type defined as follows. Let
Then the k-th stress sensitivity is
The third parameter is the productivity of the network, or in other words, the maximal (over all components and all reactions) reaction output
Then the Lipshitz constant of perturbation g as a function of can be estimated via the introduced quantities:
An estimate obtained in the next theorem is rough; nonetheless, it allows us to identify the main factors, which determine the organism fate under the stress stability: the sparsity of reaction network, the stress sensitivity, and the size of the gene regulation network.
Theorem 3.
Suppose that is either a stable hyperbolic equilibrium or a stable hyperbolic limit cycle for the semiflow defined by non-perturbed system (16), and we have stresses at the moments .
Then there is a sequence of networks , , such that for sufficiently small positive the stochastic stability satisfies
where
and is a constant uniform in δ as .
Let us make some comments.
(i) The main idea is to provide stability as follows: at each time we choose resolving the approximation problem AP.
(ii) If the attractor is chaotic, the attractor contains unstable trajectories and a priori estimates from the proof fail. The standard ideas connected with hyperbolicity do not work because our perturbation is not autonomous in time and moreover it is not sufficiently smooth ( are Lipshitz functions but they are not ).
(iii) We suppose that the number of coding genes in the layer may depend on the evolution step k: , where is the number of genes in each layer at the k-th evolution step. The dependence on k is very important: it determines the evolution of the regulation network, and according to assertion 2, without that evolution, it is impossible to reach the robustness for all times.
(iv) From the biological point of view, assumptions of the theorem are not quite realistic because we suppose here that the gene regulation system instantly finds an optimal genotype , but it is hard to study a completely realistic model. If the time lag between action of environmental stress and the system adaptive answer is sufficiently small with respect to a characteristic time of biochemical evolution, then one can expect that our results are still valid (although the mathematical analysis becomes more sophisticated). However, if that time lag is not small, our results are not applicable and the biochemical system can lose its adaptivity.
(v) For infinite sequences of perturbations , the stability for all times is possible if the series converges.
(vi) Due to results of Section 7.2 (see the remark at the end of that subsection), the number of genes needed to encode a deep gene regulations network can be estimated as , i.e., it is more than the network size up to a logarithmic factor. This is roughly consistent with experimental data. For example, for the E. coli metabolic network includes 744 reactions catalyzed by 607 enzymes and the E. coli genome contains 4288 protein coding genes [40].
Note that Theorems 1 and 3 lead to the following interesting corollary: if , then, to provide eternal robustness (2), the size of the gene regulation network cannot be bounded as : .
Proof.
Note that are non-smooth (although Lipshitz bounded) functions of v; therefore, we cannot use ideas connected with -shadowing, hyperbolicity, etc. Instead of this, we use rough straight forward estimates. Let us consider the Cauchy problems
where and . Let . Then for one has
where and denote the derivative of with respect to v.
Lemma 1.
Let the errors of approximation problemAPsatisfy
for each k. Then for sufficiently small one has
where is uniform in .
For the proof, which is quite standard, see Appendix A.
8. Numerical Simulations
To illustrate our analytical results on evolution, systems (10) and (11) are considered. To describe the evolution of cancer cells, we extend the model in [29]. Note that among all model parameters, r is a key parameter that defines the growth of the cancer cell. It is natural thus to assume that mutations increasing this parameter play the main role, and these mutations support replicative stability. We introduce a small mutation probability . We solve systems (10) and (11) by the Euler method with a time step , and at each time step positive mutations can occur, increasing the growth parameter r. At each time step, the growth parameter either increases by a small quantity, or that parameter stays the same. The growth event occurs with the probability . We suppose that at these mutation time moments, we change the growth parameter r by an increment , where be a positive parameter. Therefore, r subsequently takes the values .
Following [29], we suppose that has the form of a sequence of periodical impulses:
where is a parameter (the therapy intensity), and stands for the Dirac delta function. Such a choice corresponds to jumps in concentrations at time moments .
We consider the three cases:
- (i)
- No therapy and no evolution of cancer cells, ;
- (ii)
- Therapy and no evolution of cancer cells, and ;
- (iii)
- Therapy and evolution of cancer cells, and .
Figure 2.
This plot shows how the replicative stability affects the abundance of cancer cells. The quantity F on the graph is the fraction, where the numerator is the abundance of the cancer cells for the replicative case and the denominator is the same abundance in the case when the mutations are absent. We see that F is almost constant for a large therapy dosage . The parameters are selected in the same way as in the previous figure.
We see that despite therapy, the cancer cell population starts to grow sooner or later, and the increasing curve nicely describes such a growth.
One can conclude that it is necessary to take into account cancer cell adaptivity and evolution, which allows us to describe cancer dormancy, an important effect in cancer treatment [41]. Many patients relapse due to dormant cancer cells. These rare and elusive cells can hide in specialized niches in distant organs before being reactivated to cause disease relapse [41]. The cancer dormancy is a very complex problem, and we postpone this issue for future articles.
9. Discussion
In this paper, the idea of replicative stability proposed by M. Gromov and A. Carbone is investigated for systems that are standard models for metabolic kinetics and population dynamics. To this end, systems of ODE’s are considered, which involve terms describing environmental shocks controlled by random parameters and gene regulation terms defined by networks. As a model of gene regulation, we used deep neural networks (DNN) here with ReLU activators. Since the deep networks overcome the curse of dimensionality, the problem of existence of genetic response to environmental stress can be resolved effectively.
In a framework of such approach, a connection is found between the uncertainty (entropy) of external stresses induced by a random environment and the topology of regulation networks. To compensate for the stress impact, the gene regulation networks should have a sufficient number of the hubs. Here the main result is an estimate of the hub number in regulation networks via the stress parameter dimension. Roughly speaking, to support stability, the number of the regulation network hubs should be not less than the dimension of the stress impact (this dimension can be simply understood as the number of different threats to the stability of the system). We think that this result is quite general and it is applicable to many systems. In fact, biological, ecological and economic systems are clearly becoming more complex in the process of evolution. However, it is not so easy to formalize these intuitive ideas on the complexity growth in precise mathematical terms. In this paper, using Kolmogorov’s entropy, deep networks, and the Gromov–Carbone idea of replicative stability, we shed light on this hard problem. The main idea is that more complex systems support homeostasis in a more effective manner. The fluctuations of system internal parameters could be essentially less for systems having a regulation. For example, the DNA replication in such systems can be performed in a more reliable way and therefore more complicated systems stabilize their genetic code better, which give them a selective advantage.
From the biological point of view, the main result of this paper is that replicative stability provides a stability of populations within a large time period and even maybe an eternal stability (a sufficiently unpleasant conclusion, COVID-19 forever!). We show that it is possible due to gene regulation network evolution. We also present models that allow us to estimate the fraction of cancer cells (viruses, bacteria) that still survive after many stages of therapy.
Author Contributions
Conceptualization, D.G. and S.V.; writing—original draft preparation, S.V.; writing—review and editing, D.G. and S.V. All authors have read and agreed to the published version of the manuscript.
Funding
The first author was supported by the Ministry of Science and Higher Education of the Russian Federation by the Agreement N 075-15-2020-933 dated 13 November 2020 on the provision of a grant in the form of subsidies from the federal budget for the implementation of state support for the establishment and development of the world-class scientific center Pavlov center “Integrative physiology for medicine, high-tech healthcare, and stress- resilience technologies”.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
Not applicable.
Conflicts of Interest
The authors declare no conflict of interest.
Appendix A
Appendix A.1. Proof of Lemma 1
Proof.
Let us consider the Cauchy problem defined by the linear equation
The solutions of this equation define the semigroup as follows:
Then we can rewrite (40) as an integral equation:
Let us consider first the case when is a hyperbolic equilibiria . Then the semigroup induced by linear operator . It satisfies
where are positive constants. Equation (A1) and that last estimate imply the integral inequality
where is a positive constant uniform in . Let and let us denote by the integral operator defined by the right hand side of the last inequality:
We consider the operator on the space of bounded non-negative functions defined on with the norm if is small enough then for an appropriate the operator maps the subset into itself. This implies (42) for the case of an equilibrium.
In order to obtain (42) in the case of the limit cycle , we slightly modify our arguments because then the semigroup is not exponentially decreasing in . We proceed as follows. Let be a cycle. At the cycle, we represent the solution as , where is defined by the condition that distance between the cycle and the solution is minimal. For we obtain two differential equations, and when we restrict on the subspace of w such that for all t the semigroup satisfies (A2). □
Appendix A.2. Proof of Theorem 1
Appendix A.2.1. Auxiliary Lemma
Let us prove a preliminary Lemma.
Lemma A1.
Let
where and is an appropriate constant uniform in ε as . Then
Appendix A.2.2. End of Demonstration
To conclude the proof of Theorem 1, we just use the definition of . The proof is complete.
References
- Gromov, M.; Carbone, A. Mathematical Slices of Molecular Biology; Preprint IHES/M/01/03 (2001); Societe Mathematique de France: Paris, France, 2001. [Google Scholar]
- Lenski, R.E.; Wiser, M.J.; Ribeck, N.; Blount, Z.D.; Nahum, J.R.; Morris, J.J.; Zaman, L.; Turner, C.B.; Wade, B.D.; Maddamsetti, R.; et al. Sustained fitness gains and variability in fitness trajectories in the long-term evolution experiment with Escherichia coli. Proc. R. Soc. Biol. Sci. 2015, 282, 20152292. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Kishan, K.; Rui, L.; Cui, F.; Yu, Q.; Haake, A.R. GNE: A deep learning framework for gene network inference by aggregating biological information. BMC Syst. Biol. 2019, 13, 38. [Google Scholar]
- Shen, Z.; Yang, H.; Zhang, S. Nonlinear Approximation via Compositions. Neural Netw. 2019, 119, 74–84. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Weinan, E.; Wang, Q. Exponential convergence of the deep neural network approximation for analytic functions. China Sci. Math. 2018, 61, 1733–1740. [Google Scholar]
- Liu, Y.; Barr, K.; Reinitz, J. Fully interpretable deep learning model of transcriptional control. Bionformatics 2020, 36, i499–i507. [Google Scholar] [CrossRef] [PubMed]
- Mochizuki, A.; Bernold Fiedler, B. Sensitivity of chemical reaction networks: A structural approach. 1. Examples and the carbon metabolic network. J. Theor. Biol. 2015, 21, 189–202. [Google Scholar] [CrossRef] [Green Version]
- Brehm, B.; Fiedler, B. Sensitivity of chemical reaction networks: A structural approach. arXiv 2017, arXiv:1606.00279v3. [Google Scholar]
- Aubin, J.P. Dynamic Economic Theory: A Viability Approach; Springer: Berlin/Heidelberg, Germany, 1997. [Google Scholar]
- Vakulenko, S.; Grigoriev, D. Stable growth of complex systems. In Proceedings of the Fifth Workshop on Simulation, St. Petersburg, Russia, 26 June–2 July 2005; pp. 705–709. [Google Scholar]
- Vakulenko, S. Complexity and Evolution of Dissipative Systems; de Gruyter: Berlin, Germany, 2014. [Google Scholar]
- Jiang, P.; Ludwig, M.L.; Kreitman, M.; Reinitz, J. Natural variation of the expression pattern of the segmentation gene even-skipped in Drosophila melanogaster. Dev. Biol. 2015, 405, 173–181. [Google Scholar] [CrossRef] [Green Version]
- Surkova, S.; Spirov, A.V.; Gursky, V.V.; Janssens, H.; Kim, A.R.; Radulescu, O.; Vanario-Alonso, C.E.; Sharp, D.H.; Samsonova, M.; Reinitz, J. Canalization of Gene Expression in the Drosophila Blastoderm by Gap Gene Cross Regulation. PLoS Biol. 2009, 7, e1000049. [Google Scholar] [CrossRef] [Green Version]
- Vakulenko, S.; Radulescu, O.; Manu; Reinitz, J. Size Regulation in the Segmenta- tion of Drosophila: Interacting Interfaces between Localized Domains of Gene Expression Ensure Robust Spatial Patterning. Phys. Rev. Lett. 2009, 103, 168102–168106. [Google Scholar] [CrossRef]
- Gould, S.J. The Structure of Evolutionary Theory; The Belknap Press of Harvard University Press: Cambridge, MA, USA, 2002. [Google Scholar]
- Rutherford, S.L.; Lindquist, S. Hsp90 as a capacitor for morphological evolution. Nature 1998, 396, 336–342. [Google Scholar] [CrossRef]
- Kauffman, S.A. Metabolic stability and epigenesis in randomly constructed genetic nets. J. Theor. Biol. 1969, 22, 437–467. [Google Scholar] [CrossRef]
- Hopfield, J.J. Neural networks and physical systems with emergent collective computational abilities. Proc. Natl. Acad. Sci. USA 1982, 79, 2554–2558. [Google Scholar] [CrossRef] [Green Version]
- Samuelsson, B.; Troein, C. Superpolynomial Growth in the Number of Attractors in Kauffman Networks. Phys. Rev. Lett. 2003, 90, 098701. [Google Scholar] [CrossRef] [Green Version]
- Vakulenko, S.; Morozov, I.; Radulescu, O. Maximal switchability of centralized networks. Nonlinearity 2016, 29, 2327. [Google Scholar] [CrossRef] [Green Version]
- Jeong, H.; Tombor, B.; Albert, R.; Oltvai, Z.N.; Barabási, A.L. The large-scale organization of metabolic networks. Nature 2000, 407, 651–654. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Jeong, H.; Mason, S.P.; Barabási, A.L.; Oltvai, Z.N. Lethality and centrality in protein networks. Nature 2001, 411, 41–42. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Cohen, A.; Dahmen, W.; Daubechies, I.; DeVore, R. Tree Approximation and Optimal Encoding. Appl. Comput. Harmon. Anal. 2001, 11, 199–226. [Google Scholar] [CrossRef] [Green Version]
- Grohs, P.; Perekrestenko, D.; Elbrächter, D.; Bolcskei, H. Deep Neural Network Approximation Theory. arXiv 2019, arXiv:1901.022202v4. [Google Scholar]
- Levy, S.; Siegal, M. Network hubs buffer environmental variation in Saccharomyces cerevisiae. PLoS Biol. 2008, 6, e264. [Google Scholar] [CrossRef] [Green Version]
- Seo, C.H.; Kim, J.R.; Kim, M.S.; Cho, K.H. Hub genes with positive feedbacks function as master switches in developmental gene regulatory networks. Bioinformatics 2009, 25, 1898–1904. [Google Scholar] [CrossRef] [Green Version]
- Yang, W.; Zhao, X.; Han, Y.; Duan, L.; Lu, X.; Wang, X.; Zhang, Y.; Zhou, W.; Liu, J.; Zhang, H.; et al. Identification of hub genes and therapeutic drugs in esophageal squamous cell carcinoma based on integrated bioinformatics strategy. Cancer Cell Int. 2019, 19, 142. [Google Scholar] [CrossRef] [PubMed]
- Friedgut, E. Sharp thresholds of graph properties, and the k-sat problem. J. Am. Math. Soc. 1999, 12, 1017–1055. [Google Scholar] [CrossRef]
- Pang, L.; Chen, L.; Zhao, Z. Mathematical Modelling and Analysis of the Tumor Treatment Regimens with Pulsed Immunotherapy and Chemotherapy. Comput. Math. Methods Med. 2016, 2016, 6260474. [Google Scholar] [CrossRef] [Green Version]
- Kuznetsov, V.; Makalkin, I.; Taylor, M.; Perelson, A. Nonlinear dynamics of immunogenic tumors: Parameter estimation and global bifurcation analysis. Bull. Math. Biol. 1994, 56, 295–321. [Google Scholar] [CrossRef]
- Hofbauer, J.; Sigmund, K. Evolutionary Games and Population Dynamics; Cambridge University Press: Cambridge, UK, 1998. [Google Scholar]
- Takeuchi, Y. Global Dynamical Properties of Lotka-Volterra Systems; World Scientific Pub Co., Inc.: Singapore, 1996. [Google Scholar]
- Mjolsness, E.; Sharp, D.H.; Reinitz, J. A connectionist model of development. J. Theor. Biol. 1991, 152, 429–453. [Google Scholar] [CrossRef]
- Reinitz, J.; Sharp, D.H. Mechanism of formation of eve stripes. Mech. Dev. 1995, 49, 133–158. [Google Scholar] [CrossRef]
- Katok, A.; Hasselblatt, B. Introduction to the Modern Theory of Dynamical Systems; Cambridge University Press: Cambridge, UK, 1997. [Google Scholar]
- Ruelle, D. Elements of Differentiable Dynamics and Bifurcation Theory; Elsevier: Amsterdam, The Netherlands, 2014. [Google Scholar]
- Eckmann, J.P.; Ruelle, D. Ergodic theory of strange attractors. Rev. Mod. Phys. 1985, 57, 617–656. [Google Scholar] [CrossRef]
- Montanelli, H.; Yang, H.; Du, Q. Deep ReLU networks overcome the curse of dimensionality for generalized bandlimited functions. arXiv 2020, arXiv:1903.00735. [Google Scholar] [CrossRef]
- Martinez, C.A.; Barr, K.A.; Kim, A.; Reinitz, J. A synthetic biology approach to the development of transcriptional regulatory models and custom enhancer design. Methods 2013, 62, 91–98. [Google Scholar] [CrossRef] [Green Version]
- Ouzounis, C.A.; Peter, D.; Karp, P.D. Global Properties of the Metabolic Map of Escherichia coli. Genome Res. 2000, 10, 568–576. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Phan, T.G.; Croucher, P.I. The dormant cancer cell life cycle. Nat. Rev. Cancer 2020, 20, 398–411. [Google Scholar] [CrossRef] [PubMed]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

