Abstract
In this paper, the design of a data-rate constrained observer for a dynamical system is presented. This observer is designed to function both in discrete time and continuous time. The system is connected to a remote location via a communication channel which can transmit limited amounts of data per unit of time. The objective of the observer is to provide estimates of the state at the remote location through messages that are sent via the channel. The observer is designed such that it is robust toward losses in the communication channel. Upper bounds on the required communication rate to implement the observer are provided in terms of the upper box dimension of the state space and an upper bound on the largest singular value of the system’s Jacobian. Results that provide an analytical bound on the required minimum communication rate are then presented. These bounds are obtained by using the Lyapunov dimension of the dynamical system rather than the upper box dimension in the rate. The observer is tested through simulations for the Lozi map and the Lorenz system. For the Lozi map, the Lyapunov dimension is computed. For both systems, the theoretical bounds on the communication rate are compared to the simulated rates.
1. Introduction
Ever since the widespread usage of wireless technologies, there has been a focus on data-rate problems for dynamical systems. These problems arise when networked technologies are employed in configurations where sensors, actuators, and controllers are placed at locations that are remote from one another. Extra complications arise from uncertainties in the system’s parameters, initial conditions, sensor measurements, communication channels, and dynamics, such as in the form of exogenous disturbances. These necessitate communication strategies that are efficient in terms of data rate and robust against all kinds of uncertainty. In this paper, the focus will be placed on uncertainties in the initial conditions, and also on issues of losses in the communication channel.
Up until now, the main focus in the relevant control-oriented literature was on state estimation and stabilization problems. In the early 2000’s, most of the research dealt with linear systems, for which many results have been obtained (see [1,2,3,4] for extended surveys).
Early results on nonlinear systems (exemplified by [5,6]) typically assumed special properties of the considered systems, and were also based on these properties. Proper adapting and extending techniques, originally developed for linear plants, opened the door for handling generic nonlinear systems and permitted the establishment of lower data-rate bounds sufficient for the observability and stabilizability of such systems; see, e.g., [7]. Another research trend was concerned with intrinsic characteristics of nonlinear systems that provide a somewhat exhaustive description of the bit-rate of information transmission under which a certain dynamic property (such as stability, invariance, observability) can be achieved. As a result, there appeared a whole series of extensions and modifications of classical topological entropy [8], such as feedback topological entropy [9], invariance entropy [10,11], topological entropy with regard to dynamic uncertainties [12] and to uncertainties in the initial conditions [13], estimation entropy [14], and others (see, e.g., [15,16,17,18]). Finally, some papers such as [19] have relied on passivity-based methods to provide bit-rate bounds.
One of the objectives of the paper is to use non-Euclidean concepts of the set dimension as an alternative to the aforementioned notions of entropy. Among the non-Euclidean concepts of set dimensions, the best known is, maybe, the Hausdorf dimension [20]. Another related characteristic is the upper box dimension [21], which is sometimes referred to as the limit capacity [22]. These two dimensions—the entropy and Lyapunov exponent—were proven to be closely related to one another in [23,24,25]. Both dimensions are based on covering a set with balls of infinitesimally small size (a technique which is very similar to the idea of partitioning the state-space and using symbolic dynamics to describe the dynamical system—see, e.g., [26]); both can assume non-integer values, and both may be smaller than the dimension of the hosting Euclidean space. These concepts have been much inspired by studies of fractals and research on chaotic attractors of dynamical systems. The latter is a primary incentive for our interest in these dimensions, which may be a non-integer for chaotic attractors and provide a somewhat deeper insight into the issue of their dimensionality than the ordinary Euclidean dimension.
Unfortunately, there are still no general analytical techniques for computing the two aforementioned dimensions for chaotic attractors. Numerical methods remain the main tool used by scientists and engineers to estimate these dimensions [27]. In this paper, an alternative to this numerical approach is developed. The alternative is based on the so-called Lyapunov dimension, in which the upper bounds are the above two dimensions [28]. Moreover, its advantage resides in the fact that the Lyapunov dimension can be computed analytically by using the second Lyapunov method [29,30], which leads to analytical upper bounds. By following this alternative, we obtain a fully analytical lower bound on the communication data-rate under which reliable estimation of the system’s state becomes feasible. When doing so, we consider a generic nonlinear system and focus attention on its behavior within a given invariant set, which may be a chaotic attractor, for example.
Apart from this bound, the design of a particular observer that ensures such an observability is also presented. The observer is composed of a sampler, a quantizer, a data-rate constrained channel, and a decoder. All components interact in order to build estimates of the state at a remote location in real time. The observer can ensure arbitrarily high precision of estimation with a communication rate that remains below the channel capacity. Moreover, the proposed observer is robust against delays and losses in the communication channel, which is a valuable property for applications where delays and losses are a common occurrence. This robustness is achieved without any feedback in the communication channel, which is atypical for most data-rate constrained observers in the current literature [31,32,33,34] and constitutes the novel contribution of the paper.
This paper is both a generalization and an extension of [35,36]. We provide a unified solution for both continuous and discrete-time systems. In addition to providing proofs for all the results that were presented in the two aforementioned conference papers, the problem statement is extended to also include delays in the communication channel.
In Section 2, we define the types of systems to be observed, as well as the observer notations, and provide a definition for observability with data-rate constraints. Section 3 introduces the proposed observer. In Section 4, preliminary criteria for observability of the plant are offered, which are converted into a fully analytical form in Section 5. Section 6 illustrates the general theory via handling two examples: the Lozi map and the Lorenz system. For both systems, the necessary data-rates are computed and simulations that confirm the theoretical results are provided.
2. Problem Statement
In this section, we introduce the problem statement. The general setting is that of a dynamical system and two peers connected together via a communication channel. Both peers have full knowledge about the dynamics of the system. Meanwhile, only one of them has direct access to the system’s current state and fully measures it. The task is to provide estimates of the state to the other peer by sending messages through the communication channel. This channel is discrete (i.e., the variety of transmittable messages is finite) and has delays, losses, and limited data-rate. The effects due to data-rate constraints and delays are explicitly modeled in this section, whereas the issue of message losses is discussed separately in Remark 1. Two types of delays will be considered in turn. A processing delay is also incurred, since the channel can transmit only a given and finite number of bits c per unit time, and so a B-bits message can wholly arrive at the receiving end of the channel not earlier than time units after the transmission of this message is commenced. A transmission delay is caused by holding up the progress in the ideal routine of bits transfer, which may occur as a result of, e.g., resolving competition with third parties for shared resources of the communication medium or network.
In order to solve the stated problem, we will develop a particular type of observer. In this section, we will only introduce notations concerned with the observer. Operation of its components will be described in the next section.
2.1. Observed Dynamical System
We consider a dynamical system on an open set , paying special attention to a certain subset . Here,
- T is the set of time periods, which is either or ;
- is the evolution function that gives the system state at time , provided that the initial state is ;
- is the focus of our interest in the system.
Specifically, we are interested only in trajectories that start in and remain there afterwards:
Assumption 1.
The dynamical system at hands is time-invariant: .
Assumption 2.
The set is a bounded forward invariant , its closure lies in S.
Our main interest is in systems for which complex and possibly chaotic long-horizon dynamics arise from rather regular short-horizon behavior. The last feature is partly substantiated by the following.
Assumption 3.
For any , the evolution function is continuously differentiable.
Depending on the “time-set” option, two types of dynamical systems will be considered.
(1) Discrete time systems: , and the system evolves as follows:
where is a given mapping. In this case,
Assumption 1 holds, and Assumption 3 is met if, and only if is continuously differentiable.
(2) Continuous time systems: and the evolution of the system is described by an ordinary differential equation (ODE):
where is a continuously differentiable vector field. So for any , the solution of the Cauchy problem for the ODE (2) exists, and is unique; it can be extended to the right on the maximal interval . However, Equation (2) not ineluctably defines a dynamical system on S, since, not necessarily, for all . Insofar as the right-hand side of Equation (2) is not defined outside S, this extendability means, in particular, that the solution never attempts to leave the set S; i.e., the set S is forward invariant. So when dealing with ODE, we always assume that all its solutions that start in S at can be extended on while remaining in S.
The following proposition can be proved by retracing the arguments from Section 2.2 in [37].
Proposition 1.
Whenever the vector field is smooth (i.e., continuously differentiable) and the ODE (2) has the just-stated extendability and invariance properties, this ODE gives rise to a dynamical system on S (), which satisfies Assumptions 1 and 3. Moreover, and its first derivatives, with respect to t and x, are continuous functions of t and x.
2.2. Architecture of the Observer, Notations, and General Traits of the Communication Channel
We assumed that the current state was observed in full at a certain measurement site but is needed at time t at a remote location, where data can be communicated only via a discrete channel. The channel is discrete in the sense that first, it is constrained to carry messages that are drawn from a finite set, and second, the messages can be communicated only one at a time and, while the channel is busy transmitting a previous message, it is closed for the next transmission.
The purpose of the observer is to arrange and manage transmissions across the channel and to finally build, at time t and at the remote location, an estimate of the current state with a pre-specified exactness. The formal definition of the last notion is as follows.
Definition 1.
A number is called an “exactness of observation”, and if there exists such that
As is illustrated in Figure 1, an observer is defined as a composition consisting of a sampler, quantizer, and decoder; the sampler and quantizer together form a coder:
Figure 1.
Structure of the observer.
- The sampler and quantizer are built at the measurement site and have access to the dynamics of the system, the set , the current state , and the desired exactness of observation .
- The decoder is built at the remote site and has access to the system dynamics , the set , the desired exactness of observation , and the messages transmitted across the channel.
The roles and structures of the observer components are as follows.
The Sampler generates the (sampling) instants (where at every one of these instants , transmission of another message is initiated):
Also, the sampler builds a finite alphabet from which the message should be drawn at time for subsequently communicating across the channel:
The alphabet is thus permitted to depend on .
The Quantizer forms the message to be dispatched
The Decoder generates state estimates based on the previously received messages:
where is the time when the message arrives at the remote site and . If no message has arrived yet, and the meaningless is replaced by an arbitrarily pre-specified symbol, e.g., . The observer has to fit the constraints and capabilities of the channel, which are as follows.
- (c.1)
- The channel correctly transfers any message to the receiving end provided that the message processing time and the size of the message are in balance:Here, is a channel-dependent function that gives the number of bits processable by the channel during any time period of length .
- (c.2)
- As the processing time increases to infinity, the average number of bits transmittable per unit of time stabilizes and converges to a certain value , called the (bit-rate) channel capacity:
- (c.3)
- The channel is closed for the next message until all bits of the current message have been processed, but is open afterwards.
- (c.4)
- On its way to the destination point , any message incurs a transmission delay :where is the time when the whole of the message arrives at , and the processing time plays the role of a processing delay here.
- (c.5)
- The transmission delays are upper-bounded: .
To correctly transmit messages, the sampler should balance the chosen alphabet and the message processing time in accordance with Equation (7), and respect the requirement:
2.3. Observability via Channels with Limited Bit-Rate Capacity
The objective of the observer is to guarantee observability, as defined in the following definition.
Definition 2.
Observability is classically defined as a property of the system itself. However, in the current context, finite data rate makes observability critically dependable on the employed communication channel. So, by following [18,35,36,38], observability is introduced as a property of the pair “system + channel”. In Definition 2, the reference to existence of an observer in fact conveys the idea of most effectively utilizing the properties of the system and the potentialities of the channel, where if their clever use may result in a reliable and exact state estimate at the receiving end of the channel, the pair is sealed with the stamp, “observable”.
3. Design of the Proposed Observer
Since we are interested only in trajectories satisfying Equation (1), our discussion of the observer design is confined to the case where in Equations (3)–(5).
We will introduce an observer that is determined by the following four entities:
- (e.1)
- —The period between consecutive dispatches of messages via the channel;
- (e.2)
- P—A symmetric and positive definite -matrix;
- (e.3)
- —A function of and for which
- (e.4)
- —A finite covering of the compact set with balls (with respect to the norm ) centered in and with a radius of each.
Here, the centers may also depend on .
Lemma 1.
Let Assumptions 2 and 3 hold, and for any , the derivatives of are bounded over and x from some τ-dependent neighborhood of . Then, a function with the property (10) exists.
The proof of this lemma simply follows from the continuous differentiability of and the boundedness of of the derivates.
The proposed observer operates as follows.
Procedure 1. (Observer)
- (o.1)
- (o.2)
- The quantizer finds an element of the covering from (e.4) that contains and sends its index k over the channel:
- (o.3)
- The decoder performs the following operations at time :
- -
- Extracts the index k from the last message received at a time , where (If no message has been received yet, k is assigned an arbitrarily pre-specified value, e.g., 1.).
- -
- By using the centers from (e.4), forms the current state estimate
Several comments on this observer are as follows:
- In (o.2), we do not address the case due to the reason stated at the onset of the section.
- The proposed design assumes that both the coder and decoder have access to from (e.1) and the covering from (e.4).
- The observer uses a fixed alphabet , which is shared by the coder and the decoder.
- The quantizer sends data about the estimate of not the current , but the forward-time state , which is computed from the measured by using the known transition map .
- The idea behind this relies on the expectation that these data will be received prior to and put in use at due time, . Then, the exactness of estimation will be at this time.
- These data are also used to estimate the state on the subsequent time interval via applying the matching transition map to the just-discussed estimate at time . By Equation (10), this guarantees the exactness of estimation on this interval.
In order for the proposed observer to be able to operate correctly via a given communication channel, the message initiated at time should be fully processed and received prior to . (This, in particular, implies that the messages arrive in order: .) Due to (c.1), (c.4), and (c.5), correct operation occurs whenever there exists a solution to the following two inequalities:
We recall that is an upper bound on the transmission delay. In the typical case where is an increasing function of and modulo the possibility to choose , Equation (12) reduces to only one inequality:
Anyhow, the inequalities depend on both the system (via ) and channel (via ). This means that correct operation in fact requests a certain level of conformity between the system and the channel.
The conditions for correct operation will be fleshed out in the next section. We conclude the section with a comment on observability and a remark on how the observer proceeds when a loss occurs.
Observation 1.
The following statements are true:
- (i)
- Let the proposed observer correctly operate for a given and . Then, for any trajectory satisfying Equation (1), the desired exactness of observation ϵ is ensured with respect to the norm ;
- (ii)
- Let a communication channel be given. Also, let any small enough be coupled with some so that the proposed observer with these ϵ and operates correctly via the channel at hand. Then, the system is observable on the set via this communication channel.
All observability conditions that will be established in this paper are nothing but implications of ii) in this observation. This means that these conditions ensure the correct operation of the proposed observer modulo’s proper and feasible choice of its parameters. In other words, whenever these conditions are satisfied, a reliable state estimate can be obtained by means of this observer.
Remark 1.
Suppose that messages may be lost when transmitting over the communication channel. If a loss does occur, the message which was last received is used in Equation (11) not only during the intended time interval (from to ), but also during the subsequent time intervals until the next successful transmission. Certainly, there is no guarantee that the estimation accuracy will be within the desired ϵ on these extra intervals. However, as soon as a new message arrives, this accuracy is restored due to the very design of the observer. This robustness against losses is achieved without any feedback in the communication channel (i.e., the coder is not notified when losses occur on the channel), unlike many competing schemes [31,32,33,34].
This remark extends on the situation where the message is not lost, but corrupted so that an incorrect is occasionally used in Equation (11).
4. Criteria for Observability of the System
A problem with the conditions (12) and (13) is that they use the function from (e.4), for which there is a lack of constructive techniques to compute, or at least to assess it from its “parents”: the dynamics , and the set . In this section, we make a first step to overcome this deficit; whereas the function is a by-product of the coalesce of the dynamics and set, we re-master the conditions into a form where separate characteristics of the dynamics and the set are employed.
4.1. The Size of Finite Covering
Inspired by (e.4), we start with the question: How many balls of a common radius are needed to cover a given bounded set? Though not articulated thus far, our interest in fact focuses on the high exactness of estimation . This, in turn, motivates asymptotical analysis as . A response to these concerns is partly given by the concept of an upper box-counting dimension , which is defined as follows.
Definition 3
([21]). The upper box-counting dimension of a bounded set is given by
Here, can be defined in any of the following ways, with all of them resulting in a common value (14):
- (i)
- The smallest number of closed balls of radius δ that cover F;
- (ii)
- The smallest number of closed balls of radius δ and centers in F that cover F;
- (iii)
- The smallest number of cubes of side δ that cover F;
- (iv)
- The number of δ-mesh cubes that intersect F;
- (v)
- The smallest number of sets of diameter at most δ that cover F;
- (vi)
- The largest number of disjoint balls of radius δ with centers in F.
It follows that for arbitrarily small , the number of -balls with centers in F that are needed to cover F does not exceed for all sufficiently small .
As is well-known [21], for any bounded set and if the interior of F is not empty, , and , . The box-counting dimension may assume non-integer values; for example, for the middle-thirds of the Cantor set .
Our particular interest is in dynamical systems and their invariant sets with ; this case does hold for some chaotic systems and complex attractors .
4.2. Balance between the Initial and Forthcoming Estimation Exactness, Respectively
Now, we are going to study relations between the initial exactness of the state x estimate and the implied forthcoming exactness during the time horizon of duration . This study is aimed at building the component (e.3) of which the proposed observer is composed, among others. We recall that this component is a function for which Equation (10) holds.
The growth rate of the system on the set is defined to be:
is the Jacobian matrix of the map at point x and . It is well-defined for all sufficiently small , since in Equation (15), thanks to the following.
Lemma 2.
There exists such that for any .
The proof of this lemma is trivial and thus omitted from this document.
In Equation (15), the limit exists since the subsequent quantity decays as decreases. Since all norms in the space of -matrices are equivalent, it is easy to see that does not depend on the choice of the norm.
Among other components, the proposed observer uses a function with a special property described in (e.3). Now, we show how such a function can be built from .
Lemma 3.
Let . For any and any positive definite -matrix P, there exists a function with the property (e.3) that is given by
for all sufficiently small and sufficiently large .
The proof of this lemma is provided in Appendix A.
4.3. Correct Operation of the Observer and a Criterion for Observability
By bringing the pieces together, we arrive at the following.
Proposition 2.
Suppose that Assumptions 1–3 hold and the system has a finite growth rate on the set . Consider a communication channel with capacity c. If
the system is observable on the set via this communication channel in the sense of Definition 2.
The proof of this proposition is provided in Appendix A. The previous inequality strongly resembles other inequalities in the context of entropy in dynamical systems that link dimensions, Lyapunov exponents, and entropy (see [23,24,25]).
5. Constructive Estimates and Analytical Bounds
In this section, we make the next and final step for obtaining tractable conditions for observability. The road to this is via the development of techniques for assessing growth rate and the box-counting dimension. A technique will be employed in both these cases that is similar in spirit to the second Lyapunov method.
5.1. Lyapunov-Like Function
The characteristic trait of the classic Lyapunov function is its decay along the trajectories of the system. In the current context, we are not interested in such a decay. Instead, our interest is focused on the rate at which an infinitesimally small ball is expanded under the transition mapping . The smallness implies that this mapping is well-approximated by the first two terms of its Taylor series, and so the rate in question is nothing but the expansion rate due to the Jacobian matrix defined in Equation (15). The deformation of a ball under a linear mapping A is described by the singular values of A; in particular, the maximal of them is the norm of A and may be used in Equation (15). If P is a symmetric positive definite matrix and is endowed with the P-related norm , these values are the square roots of the solutions of the algebraic equation repeated in accordance with their algebraic multiplicities and ordered from large to small. With these in mind, we introduce a function with special properties whose description uses the t-step increment of this function:
Assumption 4.
There exist , a bounded function , constant , and symmetric positive definite matrix , such that
where are the roots of the algebraic equation
repeated in accordance with their algebraic multiplicities and ordered from large to small, and .
In the discrete-time case, only is concerned in Equation (19). In the continuous-time case, Equation (19) is imposed only within the finite time horizon of duration 1.
With , Equation (20) reduces to and are the squares of the standard singular values of the Jacobian matrix . For a generic P, the roots can also be reduced to standard singular values. Indeed, let U be the symmetric and positive definite “square root” of the symmetric and positive definite matrix . The solutions of Equation (20) are evidently identical to those of , and so the ’s are the squares of the ordinary singular values of the matrix . This matrix is similar to , and so these two matrices represent a common linear transformation in various bases. Thus, the role of P is, in fact, that of a linear coordinate transformation in pursuit of ease of building and .
Assumption 4 will be utilized for assessment of both quantities that we are interested in. Specifically, it will be used with and arbitrary to upper-estimate the growth rate (15) of the system; this estimate is given by . With and some , it will be used to establish an upper bound on the upper box dimension of the invariant set ; this bound is given by d.
In the case of a continuous time system, the next proposition provides an alternative to computing the transition maps and checking infinitely many inequalities (19), each for its own , when verifying Assumption 4. To state this proposition, we introduce the Jacobian matrix of the right-hand side in Equation (2):
Proposition 3.
Let there exist , a continuously differentiable bounded function , constant , and a symmetric positive definite matrix , such that
where and are the solutions of the algebraic equation
ordered from largest to smallest and repeated in accordance with their algebraic multiplicity. Then, Assumption 4 holds with the particular P and d of Equation (22), and .
This result is proved in Appendix B.
5.2. Analytical Upper Bound on the System’s Growth Rate and Related Conditions for Observability
Proposition 4.
Let Assumptions 1–4 hold with and in the last of them. Then, the growth rate (15) of the system on obeys the following bound:
The proof of this proposition is provided in Appendix B.
By combining Propositions 2 and 4, we arrive at the following.
Theorem 1.
Suppose that Assumptions 1–4 hold with and in the last of them, and consider a communication channel with capacity c. If
the system is observable on the set via this communication channel in the sense of Definition 2.
The observation schemes proposed in [39,40] can sometimes work under the channel rates smaller than that given in Theorem 1. This improved rate comes at a price: these schemes are not robust against losses in the communication channel.
In [7], the observer requires some feedback in the channel and a channel rate of the form , where L is the Lipschitz constant of the mapping . The estimate (23) is less conservative, both because (if L is related to a norm of the form ) and , with in some cases. Moreover, the scheme from [7] does not enjoy robustness against losses in the communication channel.
Finally, Corollary 6.2.1 of [29] provides an estimate for the topological entropy by using a result from [41], which is identical to our estimate of the rate c with identical assumptions.
5.3. Analytical Bounds on the Upper Box Dimension and Final Conditions for Observability
A drawback of Theorem 1 is that it uses the upper box dimension, whereas there are no general techniques to compute this dimension analytically. To compensate for this drawback, we will use results from [28,30] to replace the upper box dimension by its upper estimate in the form of another well-known kind of dimension, i.e., the so-called Lyapunov dimension. The benefit from this is that the latter can be estimated analytically.
We start by introducing the necessary definitions, including those of the Lyapunov dimension of a map in a point, of a map over a set, and of a dynamical system. Next, we will recall the required results from [28,30], and finally, we will provide the general results of this paper, which offers analytical conditions for observability under a finite communication bit-rate.
Definition 4.
For any , the singular value function of of order at point is defined as
Here are the singular values of the -matrix A.
Definition 5
([30]). For any , the Lyapunov dimension of the map at the point is given by
Definition 6
([30]). For any , the Lyapunov dimension of the map with respect to the invariant set is given by
Definition 7
([30]). The Lyapunov dimension of the dynamical system with respect to the invariant set , is defined as
For the sake of completeness, we provide the results that we borrowed from [28,30].
Theorem 2
([28]). Let Assumptions 1–3 hold. Then,
Corollary 1
([28]). Let the hypotheses of Theorem 2 be true. Then for all ,
The following proposition is essentially a reformulation of Theorem 2 from [30].
Proposition 5.
Let Assumptions 1–3 be true. Suppose also that Assumption 4 holds with some and . Then, for sufficiently large , the following inequality is valid:
The proof of this proposition is provided in Appendix B.
In some cases, the inequalities in Equation (24) take place as equalities. Specifically, the following proposition is valid, which is a reformulation of Proposition 3 and Corollary 3 from [30].
Proposition 6
([30]). Suppose that at one of the equilibrium points of the dynamical system , the matrix has the simple real eigenvalues . Let us consider a non-singular matrix U, such that
where , which matrix does exist thanks to the first assumption of the proposition. Let be the transition mapping after the linear coordinate change. Suppose that Assumption 4 holds with some d and , and additionally, we have
Then, for any compact invariant set of , the following equation holds
Now we are in a position to state the main result of the paper, which is clear from Theorem 1 and Proposition 5.
Theorem 3.
Let Assumptions 1–3 be true. Suppose also that Assumption 4 holds twice: first, with and some and second, with and some . Consider a communication channel with capacity c. If
the system is observable on the set via this communication channel in the sense of Definition 2.
6. Examples
In this section, we apply the previous theory to two celebrated prototypical chaotic systems: the smoothened Lozi map and the Lorenz system. For the smoothened Lozi map, we will compute the Lyapunov dimension and provide a bound on the channel rate above where the associated dynamical system is observable via the channel at hand. We will then test this bound via computer simulations of the proposed observer to show that the established theoretical rates are close to the actual practical rates. For the Lorenz system, we borrow upper estimates of the Lyapunov dimension and the largest singular value of the Jacobian from [39,42], respectively, to provide a bound on the channel rate by using Theorem 3. Like in the previous example, we will also test this bound via computer simulations.
6.1. The Smoothened Lozi Map
The Lozi map [43,44] is a modification of the Henon map. The Lozi map is not continuously differentiable, and so does not meet Assumption 3. We examine its continuously differentiable analog introduced in [45] by smoothing the Lozi map at the fracture point. The respective smoothened map acts according to the following formula
where a, b, and are positive parameters and
If and , the smoothened Lozi map has an equilibrium
If and in addition to the previous inequalities, there exists a second equilibrium
In this section, we adopt the following.
Assumption 5.
The following inequalities hold:
This assumption implies that two equilibria exist, and that they are unstable. Moreover, ensures for the associated discrete-time dynamical system. We start by giving more insight into the Lyapunov dimension of the smoothened Lozi map.
Theorem 4.
The proof of this theorem is provided in Appendix C.
In order to use the observer from Section 3 for the smoothened Lozi map (27), we need to choose a compact and invariant set . For the original (i.e., non-smooth) Lozi map, such a set exists whenever the following conditions are met: [46]
Moreover, when the previous inequalities hold, the set is the closure of the unstable manifold of the unstable equilibrium . It is still unknown whether they guarantee the same for the smoothened Lozi map (27). To the best of the authors’ knowledge, no conditions that guarantee the existence of such a set for the map (27) are available in the literature. In the following, we will assume that Equations (30)–(34) are sufficient to ensure the existence of a compact and invariant set . Our simulations with the parameters and , , which verify Equations (30)–(34), provide evidence, illustrated in Figure 2, in favor of this hypothesis.
Figure 2.
Typical trajectory with 10,000 steps of the smoothened Lozi map with , , and .
By combining Theorem 3 with Theorem 4 and an estimate of the largest singular value of the concerned Jacobian given in [40], we arrive at the following.
Corollary 2.
Let Assumption 5 hold, and let be a compact invariant set of the smoothened Lozi map (27). Then, the associated dynamical system is observable on the set via any communication channel whose capacity
Proof.
Theorem 13 from [18] yields that Assumption 4 holds with and
Theorems 3 and 4 complete the proof. □
Corollary 2 implies that for these parameters, the associated dynamical system is observable for any channel rate above bits/s. To verify whether this lower bound on the channel rate can be improved, we employed the observer from Section 3 whose parameters were experimentally tuned to ensure a pre-specified exactness of observation during the first 1000 steps. The following values were considered: . An accompanying objective of experimentally tuning was to minimize the size of the alphabet K employed for data encoding, or, in other words, the channel capacity requested by the observer. The best values of the capacity can be found in Table 1. It shows that for high exactness, the system can be observed with a channel rate slightly below the theoretical estimate. For the lowest exactness, the experimental rate barely exceeds the theoretical bound. However, in any case, this bound seems to be pretty close to the experimental result.
Table 1.
Results of the simulations on the smoothened Lozi map.
6.2. The Lorenz System
In this section, we apply our previous theoretical results to the Lorenz system. The Lorenz system [47] is a well-known example of a continuous-time system where, for certain values of its parameters, it displays chaotic behavior. The system equations are:
where , and are positive parameters. If , the system has a single globally asymptotically stable equilibrium: the origin. For , this equilibrium becomes a hyperbolically unstable saddle-point. In addition, two equilibria appear. In this paper, we assume . We will apply our findings to the system with a chaotic attractor as an invariant set. As is well-known [48], the conditions on the parameters ( suffice to ensure the presence of a compact invariant set. Moreover, to compute the Lyapunov dimension, we adopt the following assumption which is taken from [42].
Assumption 6.
Let the following hold:
and either
or the following equation has two distinct solutions, ν
and
where is the largest root of Equation (36).
Any solution of the Lorenz system that starts at can be extended on [29], and thus has the extendability property discussed just after Equation (2). Hence, the differential Equations (35) give rise to a dynamical system on in the sense of Section 2.1.
Proposition 7.
Let Assumption 6 hold, and let be a compact invariant set of the Lorenz system. Then, this system is observable on the set via any communication channel whose capacity is:
Proof.
Since the right-hand side of the equations in Equation (35) are polynomial, Assumptions 1 and 3 hold by Proposition 1; Assumption 2 is true due to the choice of . It is easy to see by inspection of the proof of Theorem 4.3 from [39] that the assumptions of Proposition 3 hold with ,
and the matrix P defined by (16) in [39]. So Proposition 3 guarantees that Assumption 4 holds with and . As is shown in Section 4 from [42], Assumption 4 also holds with and
Theorem 3 completes the proof. □
We have performed simulation studies similar to those carried out for the previous example. Starting from various initial conditions in , we have simulated the observer for various chosen , each with its own chosen and associated covering. In our simulations, we used , , , which verify Assumption 6. For these parameters, the theoretical rate bound is bit/s. The results of the simulations can be seen in Table 2. Once again, it can be seen that the experimentally found rate is below or very close to the theoretical rate. This confirms that our theoretical results correctly predict the rate.
Table 2.
Results of the simulations on the Lorenz system.
7. Conclusions
In this paper, we have presented an observer for both discrete and continuous-time nonlinear systems. We have provided bounds on the necessary data-rates to implement the observer. We have proven that this observer can be implemented on any channel with a finite delay parameter and a channel rate , where is an upper bound on the largest singular value of the Jacobian and , the upper box dimension of the compact invariant set of the system. By combining results from several other papers, we have provided an analytical bound to the channel rate that depends on the Lyapunov dimension, rather than the upper box dimension. These analytical bounds have been computed for the smoothened Lozi map and the Lorenz system. For the smoothened Lozi map, we computed the Lyapunov dimension. Simulations of the observer on both of these systems have proven that the theoretical rate is closely related to the actual rate required to implement the observer.
Author Contributions
Conceptualization, Q.V., A.Y.P. and A.S.M.; methodology, Q.V., A.Y.P. and A.S.M.; software, Q.V.; validation, Q.V., A.Y.P., A.S.M. and H.N.; formal analysis, Q.V., A.Y.P. and A.S.M.; investigation, Q.V.; resources, A.Y.P. and A.S.M.; data curation, Q.V.; writing—original draft preparation, Q.V., A.Y.P. and A.S.M.; writing—review and editing, Q.V., A.Y.P., A.S.M. and H.N.; visualization, Q.V. and A.S.M.; supervision, A.Y.P. and H.N.; project administration, H.N.; funding acquisition, A.Y.P. and H.N.
Funding
This paper was elaborated in the UCoCoS project which has received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement No. 675080.
Acknowledgments
We dedicate this paper to the memory of Gennady A. Leonov, who passed away on the April 23, 2018. The authors acknowledge the many fruitful discussions they held with Professor Leonov.
Conflicts of Interest
The authors declare no conflict of interest.
Abbreviations
The following notations are used in this manuscript:
| : | the set of nonnegative integers; |
| : | the set of nonnegative real numbers; |
| : | the absolute value of x; |
| : | the cardinality of a set F; |
| =; | |
| : | the identity matrix of dimension ; |
| : | the conjugate transpose of the matrix M; |
| : | the inverse of the transpose of the matrix M; |
| = ; | |
| = , with P symmetric and positive definite; | |
| : | the ball in of radius centered in x; |
| : | the ball in of radius centered in x. |
Appendix A. Proofs of Section 4
In this appendix, we present the proofs of of the results stated in Section 4.
Appendix A.1. Proof of Lemma 3
We introduce the P-associated spectral norm of a -matrix: . Based on Equation (15), Lemma 2, and the remark following that lemma, we first pick such that and
Based on this inequality, we then pick so that for all , we have
Now, we consider , , and defined in Equation (16). Let the context of Equation (10) holds, i.e., and . Then, the segment lies in the balls , and so in S by the choice of . By using the mean value inequality Theorem 9.19 from [49], we have for any ,
Now, we note that the conditions from Equation (A1) are fulfilled for , and . Hence
Thus, we see that Equation (10) is true. It remains to extend the function from on all and by putting □
Appendix A.2. Proof of Proposition 2
We first pick and so close to and 0, respectively, that
We also pick a positive definite -matrix P and borrow the function from Lemma 3. We also consider and so small and large, respectively, that Equation (16) holds. As was remarked just after Definition 3, the set can be covered by no more than -balls centered in for all small enough . By reducing , if necessary, we make small enough in this sense irrespective of . Then, no more than
-balls centered in are needed to cover . Hence, the integer floor of the right-hand side of this equation is the function from e.4). So, the condition (13) for the correct operation of the observer from Section 3 takes the form
Appendix B. Proofs of Section 5
Appendix B.1. Proof of Proposition 3
For the dynamical system given by Equation (2), the matrices defined in Equation (15) obey the equations
Hence,
where as . Moreover, this convergence is uniform over x from any compact subset K of S. Indeed, for , Equation (A3) yields that
where . Due to the last claim of Proposition 1, the function is continuous. Hence it is uniformly continuous on the compact set and so
It remains to note that for all ,
By invoking Equation (A4), we see that
where
uniformly over due to the foregoing since the continuous function is bounded on the compact set K. Let be the positive definite square root of P. Equation (20) with can be rewritten by virtue of Equation (A5) as follows
Thus we see that are the ordinary eigenvalues of the symmetric matrix , which goes to as uniformly over . Meanwhile the eigenvalues continuously depend of the symmetric matrix by Corollary 6.3.8 [50]. It follows that
where as uniformly over and are the solutions for the eigenvalue problem
The last equation is identical to Equation (22), and so . By using the previous equations together, we see that
Now we invoke d from Proposition 3. For any matrix B, we denote by the roots of the algebraic equation
enumerated from large to small, and put
By combining the generalized Horn inequality [29] with Lemma 8.1 in [39] (which relates to the roots of Equation (A7) with the concept of the singular value of a matrix), we infer that for any matrices . It follows that for any sequence of such matrices
Now we pick an arbitrary and denote for any natural r and . Since the system is time-invariant, we have
As a result, we see that in Equation (19),
We proceed by using the elementary inequality and the quantity defined in Equation (A6) for the compact set , which contains all points of the form ,
where by the definition of . Thus
Now we estimate by employing Equation (21)
Here, the sum is the Riemann sum of the continuous function of , and does not depend on r. So be letting and by invoking that , we get
Thus we have arrived at Equation (19) modulo the definitions of and from Proposition 3. It remains to note that the function is bounded since has this property by the assumptions of Proposition 3. □
Appendix B.2. Proof of Proposition 4
We first note that Equation (19) with means that whenever , we have
We are going to show that for any natural r, Equation (A8) holds whenever , arguing by induction on r. As a result, we will show that Equation (A8) is valid for all .
For , this claim is initially given. Suppose that it is true for some r, and consider . If , Equation (A8) is true by the induction hypothesis. Let . Then , where . By putting and invoking Assumption 1 and Equation (15), we see that
where the start of the second line is due to Equation (18). By using these relations, we arrive at Equation (A8) via adding Equation (A8) with to the inequality
which holds by the induction hypothesis. Thus, Equation (A8) is true whenever .
Now we introduce a finite upper bound on the bounded function . Let . Then and so Equation (18) yields that . By Equation (A8),
As was remarked, does not depend on the matrix norm in Equation (15). Meanwhile, Lemma 2 ensures that for all small enough . Hence
□
Appendix B.3. Proof of Proposition 5
Proof.
From Assumption 4, we have that are the roots of the equation
Since P is positive definite and symmetric, we can decompose it as
where U is nonsingular. Equation (A9) can thus be rewritten as
If we premultiply with and postmultiply with , the solutions are unchanged and the equation becomes
We thus know that are the eigenvalues of the matrix or the square of the singular values of the matrix . From Equation (4) with the same d and , we have that
which thus also implies that
where e is Euler’s number. This can be rewritten as
We now apply Theorem 2 from [30] with , , and to obtain
for T sufficiently large. □
Appendix C. Proofs of Section 6
Proof of Theorem 4
Proof.
Proposition 5 yields that to prove the first sentence from the conclusion of Theorem 4, it suffices to justify Assumptions 1–3 and Assumption 4 with and defined in Equation (29) for the discrete-time dynamical system associated with the smoothened Lozi map. Assumptions 1–3 do hold since the map (27) is smooth and is a compact invariant set by the assumptions of Theorem 4. It remains to check Assumption 4, where in Equation (19) since we are in discrete time now.
We are going to justify Assumption 4 with . Since we have that
Equation (20) becomes
To simplify the notations, we introduce . Equation (A10) admits two solutions
Since by Equation (28),
given Assumption 5, we know that is always larger than one (in particular, for all , ). This implies that and that . Note that the latter inequality is strict as a result of Assumption 5. After some computations and due to Equation (A11), the left-hand side of Equation (19) becomes
Which, using Equation (A12) can be upper bounded in the following way
To satisfy Assumption 4 with some d and , we are looking for such that the left-hand side of the previous equation is negative. We thus obtain the following sufficient condition
which, using Assumption 5, can be rewritten as
Note that from the conditions from Assumption 5, the following is always true . Since Assumption 4 with and is verified, the proof is completed by applying Proposition 5 which yields
To prove the second part of this theorem, we will use Proposition 6. We consider the following diagonalizing coordinate change matrix
which is nonsingular and well-defined for all parameters satisfying Assumption 5.
To compute , we first compute
From Assumption 4 with and , we have and which means we have . We thus compute
This latter equation can be rewritten as
Since we assumed , all conditions of Proposition 5 are verified. We thus obtain
□
References
- Elia, N.; Mitter, S. Stabilization of linear systems with limited information. IEEE Trans. Autom. Control 2001, 46, 1384–1400. [Google Scholar] [CrossRef]
- Nair, G.; Fagnani, F.; Zampieri, S.; Evans, R. Feedback Control Under Data Rate Constraints: An Overview. Proc. IEEE 2007, 95, 108–137. [Google Scholar] [CrossRef]
- Baillieul, J.; Antsaklis, P.J. Control and Communication Challenges in Networked Real-Time Systems. Proc. IEEE 2007, 95, 9–28. [Google Scholar] [CrossRef]
- Andrievsky, B.; Matveev, A.; Fradkov, A. Control and Estimation under Information Constraints: Toward a Unified Theory of Control, Computation and Communications. Autom. Remote Control 2010, 71, 572–633. [Google Scholar] [CrossRef]
- De Persis, C. A Note on Stabilization via Communication Channel in the presence of Input Constraints. In Proceedings of the 42nd IEEE Conference on Decision and Control, Maui, HI, USA, 9–12 December 2003; pp. 187–192. [Google Scholar]
- Baillieul, J. Data-rate requirements for nonlinear feedback control. In Proceedings of the 6th IFAC Symposium on Nonlinear Control Systems (NOLCOS 2004), Stuttgart, Germany, 1–3 September 2004; pp. 1277–1282. [Google Scholar]
- Liberzon, D.; Hespanha, J. Stabilization of nonlinear systems with limited information feedback. IEEE Trans. Autom. Control 2005, 50, 910–915. [Google Scholar] [CrossRef]
- Adler, R.L.; Konheim, A.G.; McAndrew, M. Topological entropy. Trans. Am. Math. Soc. 1965, 114, 309–319. [Google Scholar] [CrossRef]
- Nair, G.N.; Evans, R.J.; Mareels, I.M.; Moran, W. Topological feedback entropy and nonlinear stabilization. IEEE Trans. Autom. Control 2004, 49, 1585–1597. [Google Scholar] [CrossRef]
- Kawan, C. Invariance Entropy for Deterministic Control Systems an Introduction; Springer: Cham, Switzerland, 2009. [Google Scholar]
- Colonius, F.; Kawan, C.; Nair, G. A note on topological feedback entropy and invariance entropy. Syst. Control Lett. 2013, 62, 377–381. [Google Scholar] [CrossRef]
- Savkin, A.V. Analysis and synthesis of networked control systems: Topological entropy, observability, robustness and optimal control. Automatica 2006, 42, 51–62. [Google Scholar] [CrossRef]
- Kawan, C.; Yüksel, S. On optimal coding of non-linear dynamical systems. IEEE Trans. Inf. Theory 2018, 64, 6816–6829. [Google Scholar] [CrossRef]
- Liberzon, D.; Mitra, S. Entropy and Minimal Bit Rates for State Estimation and Model Detection. IEEE Trans. Autom. Control 2018, 63, 3330–3340. [Google Scholar] [CrossRef]
- Matveev, A.S.; Savkin, A.V. Estimation and Control over Communication Networks; Birkhauser: Basel, Switzerland, 2009. [Google Scholar]
- Kawan, C. Exponential state estimation, entropy and Lyapunov exponents. Syst. Control Lett. 2018, 113, 78–85. [Google Scholar] [CrossRef]
- Sibai, H.; Mitra, S. Optimal Data Rate for State Estimation of Switched Nonlinear Systems. In Proceedings of the 19th International Conference on Hybrid Systems: Computation and Control, Pittsburgh, PA, USA, 18–20 April 2017; pp. 71–80. [Google Scholar]
- Pogromsky, A.; Matveev, A. Data rate limitations for observability of nonlinear systems. IFAC-PapersOnLine 2016, 49, 119–124. [Google Scholar] [CrossRef]
- Fradkov, A.L.; Andrievsky, B.; Evans, R.J. Synchronization of nonlinear systems under information constraints. Chaos 2008, 18, 037109. [Google Scholar] [CrossRef]
- Douady, A.; Oesterle, J. Dimension de Hausdorff des attracteurs. Comptes Rendus Acad. Sci. Ser. A 1980, 290, 1135–1138. [Google Scholar]
- Falconer, K. Fractal Geometry: Mathematical Foundations and Applications; John Wiley & Sons: Hoboken, NJ, USA, 1997. [Google Scholar]
- Takens, F. Detecting strange attractors in turbulence. In Dynamical Systems and Turbulence, Warwick; Rand, D., Young, L.S., Eds.; Springer: Berlin/Heidelberg, Germany, 1980; pp. 366–381. [Google Scholar]
- Kawan, C. Upper and lower estimates for invariance entropy. Discret. Contin. Dyn. Syst. 2011, 30, 169–186. [Google Scholar] [CrossRef]
- Young, L.S. Entropy, Lyapunov Exponents, and Hausdorff Dimension in Differentiable Dynamical Systems. IEEE Trans. Circuits Syst. 1983, 30, 599–607. [Google Scholar] [CrossRef]
- Ledrappier, F.; Young, L.S. The Metric Entropy of Diffeomorphisms: Part II: Relations between Entropy, Exponents and Dimension. Ann. Math. 1985, 122, 540–574. [Google Scholar] [CrossRef]
- Stojanovski, T.; Kocarev, L.; Harris, R. Applications of Symbolic Dynamics in Chaos Synchronization. IEEE Trans. Circuits Syst. 1997, 44, 1014–1018. [Google Scholar] [CrossRef]
- Siegmund, S.; Taraba, P. Approximation of box dimension of attractors using the subdivision algorithm. Dyn. Syst. 2006, 21, 1–24. [Google Scholar] [CrossRef]
- Hunt, B.R. Maximum local Lyapunov dimension bounds the box dimension of chaotic attractors. Nonlinearity 1996, 9, 845–852. [Google Scholar] [CrossRef]
- Boichenko, V.A.; Leonov, G.A.; Reitmann, V. Dimension Theory for Ordinary Differential Equations; Teubner Verlag: Leipzig, Germany, 2005. [Google Scholar]
- Kuznetsov, N.V. The Lyapunov dimension and its estimation via the Leonov method. Phys. Lett. A 2016, 380, 2142–2149. [Google Scholar] [CrossRef]
- Sahai, A.; Mitter, S. The Necessity and Sufficiency of Anytime Capacity for Stabilization of a Linear System over a Noisy Communication Link—Part I: Scalar Systems. IEEE Trans. Inf. Theory 2006, 52, 3369–3395. [Google Scholar] [CrossRef]
- Martins, N.C.; Dahleh, M.A.; Elia, N. Feedback Stabilization of Uncertain Systems in the Presence of a Direct link. IEEE Trans. Autom. Control 2006, 51, 438–447. [Google Scholar] [CrossRef]
- Simsek, T.; Jain, R.; Varaiya, P. Scalar Estimation and Control With Noisy Binary Observations. IEEE Trans. Autom. Control 2004, 49, 1598–1603. [Google Scholar] [CrossRef]
- Matveev, A.S.; Savkin, A.V. An Analogue of Shannon Information Theory for Detection and Stabilization via Noisy Discrete Communication Channels. SIAM J. Control Optim. 2007, 46, 1323–1367. [Google Scholar] [CrossRef]
- Voortman, Q.; Pogromsky, A.; Matveev, A.S.; Nijmeijer, H. Continuous Time Observers of Nonlinear Systems with Data-Rate Constraints. In Proceedings of the 5th IFAC Conference on Analysis and Control of Chaotic Systems, Eindhoven, The Netherlands, 30 October–1 November 2018. [Google Scholar]
- Voortman, Q.; Pogromsky, A.Y.; Matveev, A.S.; Nijmeijer, H. A Data Rate Constrained Observer for Discrete Nonlinear Systems. In Proceedings of the 57th IEEE Conference on Decision and Control, Miami Beach, FL, USA, 17–19 December 2018; pp. 3355–3360. [Google Scholar]
- Khalil, H. Nonlinear Systems, 3rd ed.; Prentice Hall: Upper Saddle River, NJ, USA, 2002. [Google Scholar]
- Matveev, A.; Pogromsky, A. Two Lyapunov methods in nonlinear state estimation via finite capacity communication channels. IFAC-PapersOnLine 2017, 50, 4132–4137. [Google Scholar] [CrossRef]
- Pogromsky, A.; Matveev, A. Estimation of the topological entropy via the direct Lyapunov method. Nonlinearity 2011, 24, 1937–1959. [Google Scholar] [CrossRef]
- Matveev, A.; Pogromsky, A. Observation of nonlinear systems via finite capacity channels: Constructive data rate limits. Automatica 2016, 70, 217–229. [Google Scholar] [CrossRef]
- Ito, S. An Estimate from above for the Entropy and the Topological Entropy of a C1-diffeomorphism. Proc. Jpn. Acad. 1970, 46, 226–230. [Google Scholar] [CrossRef]
- Leonov, G.A.; Kuznetsov, N.V.; Korzhemanova, N.A.; Kusakin, D.V. Lyapunov dimension formula for the global attractor of the Lorenz system. Commun. Nonlinear Sci. Numer. Simul. 2016, 41, 84–103. [Google Scholar] [CrossRef]
- Lozi, R. Un Attracteur Étrange du Type de Hénon. J Phys. Colloq. 1978, 39, C5-9–C5-10. (In French) [Google Scholar] [CrossRef]
- Elhadj, Z. Lozi Mappings: Theory and Applications; CRC Press: Boca Raton, FL, USA, 2013. [Google Scholar]
- Aziz-Alaoui, M.A.; Robert, C.; Grebogi, C. Dynamics of a Henon-Lozi-type map. Chaos Solitons Fractals 2001, 12, 2323–2341. [Google Scholar] [CrossRef]
- Misiurewicz, M. Strange attractors for the Lozi mappings. Ann. N. Y. Acad. Sci. 1980, 357, 348–358. [Google Scholar] [CrossRef]
- Lorenz, E.N. Deterministic Nonperiodic Flow. J. Atmos. Sci. 1963, 20, 130–141. [Google Scholar] [CrossRef]
- Sparrow, C. The Lorenz Equations: Bifurcations, Chaos, and Strange Attractors; Springer: New York, NY, USA, 1982. [Google Scholar]
- Rudin, W. Principles of Mathematical Analysis, 3rd ed.; MaGraw Hill: New York, NY, USA, 1976. [Google Scholar]
- Horn, R.; Johnson, C. Matrix Analysis; Cambridge University Press: New York, NY, USA, 2013. [Google Scholar]
© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

