Abstract
We consider, in this paper, the problem of state estimation for a class of dynamical systems governed via continuous-time McKean–Vlasov stochastic differential equations. The estimation problem is stated and solved under an norm setting. We adopt a Riccati-based approach in order to solve the optimal estimation problem.
1. Introduction
McKean–Vlasov stochastic differential equations (SDEs) are a powerful tool used to reflect some kind of mean-field-type interaction phenomena in large-scale dynamical systems. For some pioneering works on theoretical, as well as practical, aspects of such kinds of SDEs, one can refer to [1,2]. As the present article is a control-oriented one, we restricted our bibliographical efforts accordingly. Several contributions have been made over these last few years in different control theoretical fields related to the class of McKean–Vlasov SDEs [3,4,5,6,7,8]. More recently, LQ control problems, as well as LQ games, have gained a lot of interest among the control community; one can refer, for example, to [9,10,11,12,13,14,15]. In these works, the authors studied a class of McKean–Vlasov SDEs that are similar to the one we consider in the present article.
We consider, in this paper, the problem of state estimation for a class of continuous-time, time-varying McKean–Vlasov SDEs. The metric used as an optimality measure of the proposed estimation scheme belongs to the -type norm setting. More specifically, we introduce a performance criterion expressed in terms of the mean square of the deviation of the estimated signal from the value of the signal that must be estimated. To the best of the authors’ knowledge, it seems that the problem of state estimation for McKean–Vlasov SDEs has not yet been addressed in the literature, or at least it has been addressed only very marginally when compared to the control counterpart. The main objective of the present work is to initiate a filtering research axis for this class of systems. We adopted a Riccati-based approach in order to solve the optimal estimation problem. The present work could be viewed as a generalization of our previous work on filtering for Itô-type SDEs [16] in the case where the dynamic equations describing the evolution of the state variable incorporate its mathematical expectation (see Equation (1)).
This paper is organized as follows: Section 2 presents the mathematical model of the considered class of systems and describes the problem setting. Section 3 provides some auxiliary results that will be used in Section 4 in order to obtain the formulae for the computation of the performance value. The main results are presented in Section 5. Some numerical experiments are included in Section 6.
2. Problem Formulation
We consider the dynamical system () with the state space representation described as follows:
, where is the system-state vector, and are the measurements, while is the remote signal that must be estimated.
In (1), the sequences , , are stochastic processes defined on a given probability space, , satisfying the following assumptions:
Assumption 1.
- (i)
- is a r-dimensional, standard Wiener process with a zero mean, and, is the identity matrix of size ;
- (ii)
- is a -dimensional Wiener process with a zero mean and, where is a known matrix;
- (iii)
- and are independent stochastic processes.
Throughout the paper, stands for the mathematical expectation, and the superscript ⊤ denotes the transposition of a vector or a matrix.
Regarding the coefficients of the system (1), we make the following assumption:
Assumption 2.
- (i)
- , , , , and are continuous matrix-valued functions that are periodic with period ;
- (ii)
- The mean field stochastic linear differential equation (MF-SLDE),is exponentially stable in the mean square sense (ESMS).
Our aim is to design a dynamic linear system () named filter that, fed with the measurements and generates at its output a signal, , which must be the best estimation of the remote signal generated via the dynamical system ().
In our approach, the family of admissible filters consists of all dynamical systems that have the state space representation of the following form:
where are the state parameters of the filter . In (3), , , are arbitrary continuous and -periodic matrix-valued functions.
It is worth mentioning that the dimension of the state space of an admissible filter (3) is not prefixed.
In what follows, denotes the family of all dynamic systems () of the form (3) of arbitrary dimension that satisfy the following additional condition:
- The ordinary linear differential equation (OLDE):is exponentially stable (ES).
In order to measure the quality of the estimation achieved via a filter () , we introduce the following performance criterion:
In Section 4, we derive an explicit formula for the computation of the value of . We show that this value does not depend either upon the initial time, , or the initial states of the system (1) and the filter (3). In Section 5, we provide a set of conditions that guarantee the existence of a filter, , minimizing the cost (5) over the set . We also provide a state-space representation of the optimal filter.
3. Some Preliminary Issues
If Assumption 2 (ii) holds, then there exist and , with the property that the solutions of the MF-SLDE (2) satisfy the following:
for all . From the relation , we deduce via (6) that
for any solution, , of (2).
On the other hand, from (2), one obtains confirmation that solves the OLDE in :
Since (8) in an OLDE with periodic coefficients, we deduce via (7) that there exist and such that
If , , is an arbitrary solution to (2), we set the following:
We have confirmation that and for all . Employing the properties of the stochastic Itô-type integrals, we deduce that solves the following:
where
. From (6), (9), and (10), we may infer that the SLDE (11) is ESMS if the MF-SLDE (2) is ESMS. In other words, if (2) is ESMS, then there exist and such that the following applies:
for all . Let . We recall that, if is a natural number, then denotes the vector space of symmetric matrices of size . The elements of the vector space are pairs of symmetric matrices. In , we consider the following inner product:
for all and from . Equipped with the inner product (14), becomes a finite-dimensional, real Hilbert space. Moreover, is an ordered Hilbert space with the ordering relation ”⩾” induced via the convex cone , where is the convex cone of positive semidefinite matrices.
We consider the linear operator defined as follows: , where
for all . Based on the adjoint operator definition, we obtain that the adjoint operator (with respect to the inner product (14)) of the operator introduced via (15) is given via , where:
for all .
Let be an arbitrary solution to the SLDE (11). We set . One sees the following:
where
On the other hand, the Itô formula applied in the case of the solutions of (11) allows us to deduce that solves the OLDE in the space :
Employing (12), (17), and (18), we obtain confirmation that (19) is equivalent to the following linear differential equation on the Hilbert space :
where .
Lemma 1.
Under Assumptions 1 and 2, the following are equivalent:
The next result is used in the development of the next sections.
Proposition 1.
Under Assumptions 1 and 2, the following are equivalent:
- (i)
- The MF-SLDE (2) is ESMS;
- (ii)
- The SLDEis ESMS, and the OLDEis exponentially stable.
Proof.
According to the equivalence from Lemma 1, we may infer that the MF-SLDE (2) is ESMS if and only if the linear differential Equation (20) is exponentially stable. On the other hand, from (15), we deduce via Theorem 2.6.1 from [17] that (20) is a linear differential equation that generates a positive evolution in the ordered space , i.e., if , , then , , where is a solution to (20).
Invoking the equivalence from Theorem 2.4.2 of [17], we obtain that (20) is exponentially stable if and only if the non-homogeneous linear differential equation in ,
has a unique bounded solution, , with the property that there exist positive constants, , such that
for all , where .
4. Computation of the Performance Value
Let be a solution to (1). Using (10), written for this solution, we obtain the following equivalent form of the state-space representation of the dynamical system (1):
, where and are described in (12) and
Applying a filter, , from to the dynamical system described by (27), one obtains the following closed-loop system:
, where and:
. Let and be arbitrary but fixed. Let be the solution to (29a) with the initial value . With these notations, we obtain confirmation that:
where
for all . Thus, (5) becomes
Using the Itô formula [18] in the case of the stochastic process , one obtains confirmation that defined according to (31) is the solution to the following problem with a given initial value in the space ():
where is described as follows:
for all .
Remark 1.
Even if the closed-loop system (29) works for , it follows from (30) that, under the assumption H2) ii) and the definition of an admissible filter (3), the coefficients of the closed-loop system are θ-periodic functions. Hence, the coefficients of (29), (33a) and (34) can be extended via periodicity to the whole real axis. Particularly, we have for all .
Regarding the system of SLDE,
obtained from (29a) when , , we have the following:
Proposition 2.
Under the Assumptions 1 and 2, for any filter, , from , the corresponding SLDE (35) is ESMS.
Proof.
According to (30a), (30b), we obtain the following partition of (35):
, where we denote the following:
Since (36a) is just (11), it follows that its solutions satisfy (13). Thus, defined according to (37) satisfies the assumptions of Theorem 3.6.1 from [17]. Bearing in mind the exponential stability of (4), one concludes via Theorem 3.6.1 from [17] applied to the affine SLDE (36b) that
Combining (13) and (38), we may infer that the solutions of (35) satisfy the following:
Finally, using the fact that (35) has periodic coefficients, we conclude via Theorem 3.2.5 from [17] that the closed-loop SLDE (35) is ESMS. Thus, the proof is complete. □
The operator-valued function introduced in (34) defines the following linear differential equation in the Hilbert space :
Let be the linear evolution operator in generated via the linear differential Equation (39). We recall that, if is the solution to (39) with the initial value , then , for all and .
As a consequence of Proposition 2, we obtain the following:
Corollary 1.
Under the Assumptions 1 and 2, for any filter, , from , the corresponding linear differential Equation (39) is exponentially stable, that is, the following applies:
where and do not depend upon t and .
Proof.
Let be the fundamental matrix solution to the SLDE (35). It satisfies the following:
Applying Theorem 3.1.1 from [17] in the special case , we obtain that , being the adjoint operator of with respect to the inner product of :
for any . If is an arbitrary vector with , we obtain via Proposition 2 that
for all , , where and do not depend upon . This leads to , for all , , and being the Euclidian norm of a matrix. Further, we obtain via Corollary 2.1.7 (i) and Theorem 2.1.10 from [17], applied in the case of the linear evolution operator , that , .
From the equivalence between the norms of the finite-dimensional Hilbert space , we deduce that , , being the operator norm induced via the norm generated through the inner product (41).
Finally, from the equality , we obtain confirmation that (40) holds. This ends the proof. □
Proposition 3.
Under the Assumptions 1 and 2, for each admissible filter, , the corresponding non-homogeneous linear differential Equation (33a) has a unique solution that is bounded. This solution has the following representation formula:
for all . Additionally, this solution is a θ-periodic function, and it satisfies .
Proof.
First, let us show that the integral from the right-hand side of Equation (42) is convergent. To this end, we use Equation (42) in order to obtain the following:
, where . Hence, is well defined according to (42). The fact that is a solution to (33a) can be directly deduced using the rule of differentiation of an integral. In order to prove the unicity of the solution defined according to (42), let us assume that is another bounded solution to (33a). By using the constant-variation formula for any , we obtain the following:
Employing (40) again, we obtain the following:
where . Letting in Equation (43), we obtain via Equation (44) that ,. Hence, any bounded solution to (33a) coincides with defined according to (42).
In order to check the periodicity property of , we write the following:
Thus, the periodicity property of the bounded solution is confirmed.
Finally, Theorem 2.6.1 from [17], applied in the case of the Lyapunov-type linear differential Equation (33a), allows us to infer that , . This leads to , because is a closed convex cone. Thus, the proof is complete. □
The main result of this section is given in the following theorem:
Theorem 1.
Proof.
Using the version of (5) given in (32), we may write the following:
Since , , is a solution to the linear differential Equation (39), it has the following representation formula:
Employing (33b) and (40), we obtain the following:
where . Further, (47) allows us to deduce that . Substituting this last equality in (46), we get
On the other hand, the boundedness of the function allows us to obtain the equality
Plugging (49) in (48), we get
which is just the right-hand side of (45). For the last equality, we have taken into account that the integration is a -periodic function. Thus, the proof is complete. □
Remark 2.
From (45), one sees that the value of the performance achieved using a filter, , from does not depend on either the initial time instance, , or the initial states, and , of the dynamical system and the filter , respectively.
5. The Optimal Filter
We consider the linear differential equation in :
or, equivalently,
where is defined in (15).
Proposition 4.
Under the Assumptions 1 and 2, the linear differential Equation (50) admits a unique bounded solution, . Additionally, it is a θ-periodic function, and it has the following structure:
where is the unique bound of the solution to the linear differential equation in :
Proof.
We look for a solution, , to the Equation (50), which is bounded on and has the following structure:
with . Employing (12), together with (28a), we obtain the following partition of the Equation (50) written for the solution :
. If the MF-SLDE (2) is ESMS, then according to from Proposition 1, we obtain confirmation that the OLDE (22) is ES. In this case, the linear differential Equation (54d) has a unique bound in the solution, and this solution is , .
Hence, (54a) reduces to
Invoking again the implication from Proposition 1, we deduce that the SLDE (21) is ESMS. In this case, the linear differential equation of the Lyapunov type associated with (21),
defines a positive evolution, and it is exponentially stable in . Hence, the non-homogeneous Equation (55) has a unique bound in the solution, and this solution is a -periodic function that satisfies , .
Since (55) coincides with (53), it follows that and , which is the unique bound in the solution to Equation (53). The need to search for the bound in the solutions of (54c) remains. First, let us remark that, if the SLDE (21) is ESMS, then the OLDE
is exponentially stable. Let be the monodromy matrix of the differential Equation (57). That is, , where is the matrix solution to (57), which satisfies . The eigenvalues of the monodromy matrix are located in the disk because (57) is ES. Similarly, the eigenvalues of the monodromy matrix of the differential Equation (22) are located in the disk . If is the monodromy operator of the Equation (54c), it is obtained via .
The eigenvalues of are, then, of the form , where is an eigenvalue of , and is an eigenvalue of . Hence, . We conclude that Equation (54c) may have only one bound in the solution (see, for example, Theorem 2.3.7 from [17]). This means that , is the unique bound in the solution to (54c). This ends the proof. □
Let
Using (28b) and (30), we obtain confirmation that (58) becomes
Now, we introduce the following forward Riccati differential equation (F-RDE):
where
Employing (12), (28a)–(28c), and (52), we obtain confirmation that:
with
We recall now the following definition:
Definition 1.
A globally defined solution, , of the F-RDE (60) is named a stabilizing solution if the OLDE on ,
is exponentially stable, where
Remark 3.
- (a)
- From (58) and (61), one obtains confirmation that
- (b)
- Via a direct calculation, one obtains confirmation that the F-RDE (60), verified via its stabilizing solution, , can be written in the following form:where , . In this case, (66) may have only one θ-periodic solution, and such a solution is a positive semidefinite because the OLDE (63) is exponentially stable, and (65) holds. Hence, the stabilizing and bounded solution to (60), if any, satisfies the following condition:
- (c)
- (d)
- Substracting (60) (written for and replaced with ) from (50) (written for and replaced with ), we obtain confirmation that is a bound in the solution to the following linear differential equation:Bearing in mind that the OLDE (68) is ES and for all , we conclude that (69) has a unique bound in the solution, and additionally, this solution is positive and semidefinite. Under the Assumptions 1 and 2, the stabilizing and bounded solution to the F-RDE, if any, necessarily satisfies the following constraint:for all .
- (e)
- Let be the partition of the matrix , such that , . Employing (52) and (70), we conclude that we necessarily have and , . Thus, under the Assumptions 1 and 2, the bounded and stabilizing solution of the F-RDE (60), if any, has the following structure:where solves the F-RDE of the lower dimension,where and are defined in (62).
By adapting Definition 1 to the case of the F-RDE (72), we say that a solution, , of (72) is a stabilizing solution if the following OLDE on ,
is ES, where
The following result establishes a relationship between the -periodic and stabilizing solution to the F-RDE (72) and the -periodic and stabilizing solution to the F-RDE (60).
Lemma 2.
Proof.
Remark 4.
Under the Assumptions 1 and 2, in order to test the existence of the stabilizing solution to the F-RDE (60), it is sufficient to test the existence of the stabilizing solution to the F-RDE of lower dimension (72). To this end, one can use the set of necessary and sufficient existence conditions proposed in [16].
The main result of this work is given below:
Theorem 2.
Assume the following:
- (a)
- The Assumptions 1 and 2 are fulfilled.
- (b)
- and the F-RDE (72) have a stabilizing solution, , which is θ-periodic.
Let be the feedback gain associated with the stabilizing solution via (74). Let and , , be defined as in (71) and (75), respectively. Consider the filter with dimension and the following state space representation:
. Under the considered assumptions, the filter lies in and minimizes the cost functional (5) over . The minimal value achieved via the performance index (5) is as follows:
Proof.
The fact that the filter (76) lies in is obvious because, in this case, the corresponding system (4) coincides with (63), and therefore, it is ES. In order to check that the filter (76) provides the minimal value of the cost (5), let us consider an arbitrary but fixed filter, . Let , , be the partition of the -periodic solution of the corresponding linear differential equation of type (33a), such that and . Based on (30a)–(30c), we obtain the following partition of the Equation (33a) written for its -periodic solution:
One sees that the Equation (78a) coincides with (50). From the uniqueness of the -periodic solution to the non-homogeneous, Lyapunov-type linear differential equation associated with a mean square, exponentially stable, stochastic linear differential equation, we deduce that .
Let . Via a direct calculation involving (60), together with (78) (for which is replaced with ), we obtain confirmation that is a -periodic solution to the following Lyapunov-type differential equation:
where , and was introduced in (58). Since the differential Equation (39) defines a positive evolution on the ordered Hilbert space , and (40) holds, we deduce via the equivalence in Theorem 2.4.2 from [17], applied in the case of the linear differential Equation (39), that there exists a -matrix-valued function satisfying
and
This means that the OLDE on ,
is ES. This allows us to conclude that (79) has a unique -periodic and positive semidefinite solution. This leads to the following:
Further, we rewrite (45) as follows:
Combining (80) and (81), we deduce that
for all . It remains to show that, in the special case of the filter introduced via (76), the inequality (82) becomes equal. To this end, let us remark that, in the case of the filter (76), we have the following:
with being the 11-block of the matrix , . Via a direct calculation involving the differential Equation (79), we obtain confirmation that is the -periodic solution to the linear differential equation
Since the differential Equation (63) is ES, we conclude that the differential Equation (84) has a unique -periodic solution, namely , .
6. Numerical Experiments
It appears from Theorem 2 that the synthesis of the optimal -filter for system (1) relies on the resolution to the periodic F-RDE (72) whose coefficients depend on the unique -periodic solution to the non-homogeneous, Lyapunov-type linear differential Equation (50). Hence, in order to synthesize the optimal filter of type (76), one has first to solve the equations (50) and (72), respectively.
In what follows, we generate an artificial -periodic linear system of the McKean–Vlasov type. To this end, we generalize the procedure proposed in [19]. First, we generate a stochastic linear time-invariant (LTI) system of the McKean–Vlasov type:
Next, the LTI system (85) is transformed into a stochastic-periodic linear system via the change in the system coordinate , where
for a given . This results in the following stochastic -periodic McKean–Vlasov-type system:
where
and . For the numerical application, we take the following:
and .
6.1. Solution to the Lyapunov-Type Linear Differential Equation (50)
First, we solve the following Lyapunov-like algebraic equation:
This is done by rewriting (87) as follows:
where ⊗ is the Kronecker product, and for a matrix, , one has . Hence, we get .
The exact unique -periodic solution to the Equation (50) is then given via .
6.2. Solution to the F-RDE (72)
Here, we follow a similar procedure as in the previous subsection. First, we compute the unique stabilizing solution to the following algebraic Riccati equation (ARE):
where
with P being the unique solution to (87), using existing stable solvers such as ICARE in MATLAB. The -periodic stabilizing solution to the F-RDE (72) is then given via , with Y being the stabilizing solution to the ARE (89).
Figure 1 shows the evolution of the estimation error for one realization of the stochastic processes.
Figure 1.
Estimation error: .
7. Conclusions
In this paper, we have considered the problem of optimal state estimation for a class of continuous-time, time-varying McKean–Vlasov SDEs. The solution to the considered optimization problem has been expressed in terms of the stabilizing solution to a suitably defined, generalized Riccati differential equation. Some numerical experiments have been provided to show the effectiveness of the proposed method.
Author Contributions
The authors contributed equally in all fields. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Institutional Review Board Statement
Not applicable.
Data Availability Statement
Data is contained within the article.
Conflicts of Interest
The authors declare no conflicts of interest.
References
- Kac, M. Foundations of kinetic theory. In Proceedings of the Third Berkeley Symposium on Mathematical Statistics and Probability, III; University of California Press: Berkeley, CA, USA, 1956; pp. 171–197. [Google Scholar]
- McKean, H.P. Propagation of chaos for a class of non-linear parabolic equations. Lect. Ser. Differ. Equ. 1967, 7, 41–57. [Google Scholar]
- Ahmed, N.U.; Ding, X. Controlled McKean–Vlasov equations. Commun. Appl. Anal. 2001, 5, 183–206. [Google Scholar]
- Ahmed, N.U. Nonlinear diffusion governed by McKean–Vlasov equation on Hilbert space and optimal control. SIAM J. Control Optim. 2007, 46, 356–378. [Google Scholar] [CrossRef]
- Andersson, D.; Djehiche, B. A maximum principle for SDEs of mean-field type. Appl. Math. Optim. 2011, 63, 341–356. [Google Scholar] [CrossRef]
- Buckdahn, R.; Djehiche, B.; Li, J. A general maximum principle for SDEs of mean-field type. Appl. Math. Optim. 2011, 64, 197–216. [Google Scholar] [CrossRef]
- Meyer-Brandis, T.; Oksendal, B.; Zhou, X.Y. A mean-field stochastic maximum principle via Malliavin calculus. Stochastics 2012, 84, 643–666. [Google Scholar] [CrossRef]
- Park, J.Y.; Balasubramaniam, P.; Kang, Y.H. Controllability of McKean–Vlasov stochastic integrodifferential evolution equation in Hilbert spaces. Numer. Funct. Anal. Optim. 2008, 29, 1328–1346. [Google Scholar] [CrossRef]
- Aberkane, S.; Dragan, V. An addendum to the problem of zero-sum LQ stochastic mean-field dynamic games. Automatica 2023, 153, 111007. [Google Scholar] [CrossRef]
- Huang, J.; Li, X.; Yong, J. A linear-quadratic optimal control problem for mean-field stochastic differential equations in infinite horizon. Math. Control Relat. Fields 2015, 5, 97–139. [Google Scholar] [CrossRef]
- Moon, J. Linear-quadratic mean field stochastic zero-sum differential games. Automatica 2020, 120, 109067. [Google Scholar] [CrossRef]
- Sun, J.; Wang, H.; Wu, Z. Mean-Field Stochastic Linear-Quadratic Optimal Control Problems: Open-Loop Solvabilities. ESAIM Control. Optim. Calc. Var. 2017, 23, 1099–1127. [Google Scholar] [CrossRef]
- Sun, J.; Wang, H.; Wu, Z. Mean-Field Linear-Quadratic Stochastic Differential Games. J. Differ. Equ. 2021, 296, 299–334. [Google Scholar] [CrossRef]
- Tian, R.; Yu, Z.; Zhang, R. A closed-loop saddle point for zero-sum linear-quadratic stochastic differential games with mean-field type. Syst. Control. Lett. 2020, 136, 104624. [Google Scholar] [CrossRef]
- Yong, J. Linear-Quadratic Optimal Control Problems for Mean-Field Stochastic Differential Equations. SIAM J. Control Optim. 2013, 51, 2809–2838. [Google Scholar] [CrossRef]
- Dragan, V.; Aberkane, S. H2 optimal filtering for continuous-time periodic linear stochastic systems with state-dependent noise. Syst. Control. Lett. 2014, 66, 35–42. [Google Scholar] [CrossRef]
- Dragan, V.; Morozan, T.; Stoica, A.M. Mathematical Methods in Robust Control of Linear Stochastic Systems, 2nd ed.; Springer: New York, NY, USA, 2013. [Google Scholar]
- Oksendal, B.K. Stochastic Differential Equations: An Introduction with Applications; Springer: Berlin, Germany, 2003. [Google Scholar]
- Gusev, S.; Johansson, S.; Kagström, B.; Shiriaev, A.; Varga, A. A numerical evaluation of solvers for the periodic Riccati differential equation. BIT Numer. Math. 2010, 50, 301–329. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
