1. Preliminaries on Weighted Finite Automata and Zeroing Neural Networks
Simulations between WFA ensure its containment, while bisimulations ensure the equivalence of WFA. As a result of the transition from various boolean to quantitative systems, both simulations and bisimulations become quantitative. Corresponding models are based on the use of matrices whose entries supply a quantitative measurement of the relationship between states of underlying systems.
Hereafter, denotes the field of real numbers, and denotes the set of natural numbers without zero, while the set of all positive real numbers is denoted by . Additionally, is a non-empty finite set with k elements, where , called an alphabet, while is the set of all finite sequences of elements of X, which are called words over the alphabet X, and , where is a symbol that denotes the empty word of length 0. With respect to the conventional concatenation operation on words (sequences), forms a semigroup, while is a structure representing a monoid with the identity element .
A weighted finite automaton over the field of real numbers and the alphabet X is defined as a quadruple , where denotes the dimension of A; , are the initial vector and terminal vector, respectively, and is a collection of transition matrices. The initial vector is treated as a row vector, while the terminal vector is treated as a column vector. The behavior of a weighted finite automaton is expressed as the product , representing the initial weights, matrices representing the weights of the transitions induced by input letters, and the column vector representing the terminal weights.
The collection
is extended up to a collection
of
compound transition matrices expressed as
where
denotes the
identity matrix. The matrices
,
, defined in (
1), are known as the
compound transition matrices of
A. The multiplication of transition matrices carry numerical values over
, known as weights. A function
is called a
word function. In particular, each weighted finite automaton
gives rise to a word function
defined as follows:
The word function
defined in (
2) is called the
behavior of
A, or a
word function computed by A. The behavior of an automaton is a mapping that relates a weight to words over a semiring.
Consider the weighted finite automata (WFA) and over the field of real numbers and X. The following notations are used:
, for every ;
, for every .
WFA A and B over and the alphabet X are said to be equivalent if . On the other hand, if , then A is said to be contained in B. The problem of determining whether WFA are equivalent is called the equivalence problem, and the problem of determining whether one of two WFA is contained in another is called the containment problem. A solution to the equivalence problem decides whether two WFA compute the same word function. On the other hand, a solution to the containment problem determines whether the word function computed by one WFA is less than or equal to the word function corresponding to the other WFA
A matrix (resp. vector) is said to be a positive matrix (resp. positive vector) if all its entries are positive real numbers, and a weighted finite automaton A is said to be a positive automaton if its initial and terminal vectors, as well as all its transition matrices, are positive.
Weighted automata have been applied to describe quantitative properties in various systems, as well as to represent probabilistic models, image compression, speech recognition, and finite representations of formal languages. Context–free grammars are used in the development of programming languages as well as in artificial intelligence.
The theoretical foundations of current investigations involve two types of simulations and two types of bisimulations defined in [
1], in the general context of WFA over a semiring. The approach we use consists of defining quantitative simulations and bisimulations as matrices that are solutions to certain systems of matrix inequations. Such an approach was introduced in [
2], where quantitative simulations and bisimulations between fuzzy finite automata were introduced and their basic properties were examined. Algorithms for testing their existence were developed in [
3]. The same algorithms compute the greatest simulations and bisimulations in cases when they exist. Then, the same approach was applied to the study of bisimulations and simulations for non-deterministic automata [
4], WFA over an additively idempotent semiring [
5], and max-plus automata [
6], as well as for WFA over an arbitrary semiring [
1,
7], which encompass all the previous ones. It turns out that an almost identical methodology can also be applied to social networks [
8]. In [
9], it was proven that two probabilistic finite automata are equivalent if and only if there is a bisimulation between them, where the bisimulation is defined as a classical binary relation between the vector spaces corresponding to those automata.
In the present paper, we investigate forward and backward simulations and bisimulations for WFA over the field of real numbers. It is worth noting that there are some very important specifics in this case. For most WFA types, the
problem of equivalence (determining whether two automata compute the same word function) and the
minimization problem (determining an automaton with the minimal number of states equivalent to a given automaton) are computationally hard. In these cases, bisimulations have two very important roles. The first role is to provide an efficient procedure for witnessing the existence of the equivalence of two automata, and the second one is to provide an efficient way to construct an automaton equivalent to a given one, with a not necessarily minimal but reasonably smaller number of states. However, it is not the case with WFA over the field of real numbers, for which there are efficient algorithms for testing the equivalence and performing minimization. Despite this observation, the importance of bisimulations for these automata is not diminished. Bisimulations are still needed as a means of determining the measure of similarity between the states of different automata, which algorithms for testing the equivalence are unable to do. In the context of weighted automata over the field of real numbers, such measures have already been studied in [
10] by means of bisimulation seminorms and pseudometrics, and in [
11] by means of linear bisimulations; in our upcoming research, we will deal with the relationships between bisimulation seminorms, linear bisimulations, and our concepts of bisimulations.
Following the definitions of simulations and bisimulations over various algebraic structures, an analogous approach has been used in defining simulations and bisimulations for WFA over the field of real numbers. The problem of simulations and bisimulations for WFA over the field of real numbers reduces to the system of two vector inequations and a number of matrix inequations. There is a notable lack of numerical methods for solving simulation and bisimulationproblems. Urabe and Hasuo proposed the idea of reducing the problem of testing the existence of simulations to the problem of linear programming (LP) and implemented it in [
7] (Section 5). Seen more generally, the research described in this paper shows that the ZNN design is usable in solving systems of matrix and vector inequations in linear algebra. Our goal is to show that the zeroing neural network (ZNN) dynamics are an effective tool to decide on the containment or equivalence between WFA. A comparison between the ZNN and LP approach is presented.
On the other hand, the application of dynamical systems is a robust tool for solving various matrix algebra problems, primarily owing to the global exponential convergence, parallel distributed essence, convenience of hardware implementation, suitability for online computations involving TV objects, and possibility of providing convergence in a finite time frame [
12,
13]. First, ZNN models have been used to solve the TV matrix inversion problem [
14]. Standard and finite-time convergent ZNN dynamical systems aimed at solving time-varying (TV) linear matrix equations have been widely investigated [
12,
15,
16,
17,
18]. The applications of ZNN design, mainly focusing on robot manipulator path tracking, motion planning, and chaotic systems, were surveyed in [
19]. ZNN dynamical systems for solving TV linear matrix–vector inequalities (TVLMVI) and TV linear matrix inequalities (TVLMI) have been broadly investigated [
12,
15,
20,
21,
22,
23,
24,
25,
26,
27]. Moreover, various ZNN models for solving TVLMI have been applied, mainly in obstacle avoidance for redundant robots and robot manipulator control [
12,
28,
29]. Typically, TVLMVI and TVLMI of type “≤” are solved by utilizing an additional matrix or vector of appropriate dimensions with non-negative entries. A TV matrix inequality of the Stein form
was considered in [
21]. A TVLMVI problem of the general form
was considered in [
24,
26,
27]. Two ZNN models for solving systems of two TVLMVI were developed in [
15]. In [
22], the authors proposed ZNNs for solving TV nonlinear inequalities. Finite-time dynamics for solving general TVLMVI
were proposed in [
25]. A comparison between ZNN and gradient-based networks for solving
was investigated in [
23]. The computational time for solving TV equations increases due to the large number of calculations of TV requirements [
30].
The problem under consideration is more complex because it requires us to solve systems of linear matrix and vector inequations. The structure of ZNN models developed in the current research is based on composite models with a prescribed number of error functions in matrix form and two in vector form. The ZNN dynamics aim to force the convergence of the involved error functions to zero over the considered time interval [
13]. But the ZNN model in this paper aims to solve several matrix–vector equations that are inconsistent in the general case. Our strategy is to utilize ZNN neurodynamics to generate simulations between two WFA with weights over real numbers. In this way, our objective involves the topic of numerical linear algebra.
This research is aimed at the development and analysis of four novel ZNN models for addressing the systems of matrix and vector inequalities involved in simulations between WFA. The problem considered in this paper is specific and complex, and it requires solving a system of two vector inequations and a couple of matrix inequations. Using positive slack matrices, matrix and vector inequalities are transformed into corresponding equalities. In this case, it is useful to utilize the development of ZNN dynamics based on several inequalities and Zhang error functions. ZNN algorithms established upon a few error functions have been investigated in several studies, such as [
31,
32,
33,
34]. Our motivation for the application of ZNN arises from a verified fact that it is a powerful tool for solving various matrix algebra models, possessing global exponential convergence and a parallel distributed structure [
12,
13]. Therefore, it is interesting to construct the ZNN evolution for such a problem and study its behavior. A detailed convergence analysis is considered. Numerical examples are performed with different initial state matrices.
The main results are emphasized as follows.
- (1)
Two types of quantitative simulations and two types of bisimulations between WFA are determined as solutions to particular systems of several matrix and two vector inequations over .
- (2)
The approach used to solve the problem of simulations and bisimulations in this research is unique and based on the application of the ZNN dynamical evolution in solving underlying matrix and vector inequations.
- (3)
A detailed convergence analysis of the proposed ZNN dynamics is presented.
- (4)
Numerical examples are performed under different initial state matrices, and a comparison between the ZNN and LP approach is presented.
The overall organization of the sections is as follows. Preliminaries on WFA and ZNN are presented in
Section 1. Global results are highlighted in the same section. Two types of simulations and four types of bisimulations proposed in [
1] in the general context of WFA over a semiring are generalized in the context of WFA over the field of real numbers in
Section 2. ZNN designs for simulations and bisimulations of WFA over real numbers are presented in
Section 3.
Section 4 is aimed at testing the developed ZNN dynamical systems and making comparisons with the LP solver. Concluding remarks are given in
Section 5.
2. Simulations and Bisimulations of WFA over Real Numbers
As a continuation of the research presented in [
1], here we correspondingly introduce definitions of two types of simulations and two types of bisimulations in the context of WFA over
. For this purpose, consider two WFA
and
over the field of real numbers
and the alphabet
X. A matrix
is called a
forward simulation between
A and
B if it satisfies the following conditions with respect to
U:
and it is termed as
backward simulation between
A and
B if it fulfills
Our intention is to apply the notion of transposed automaton from [
35] to reverse the transitions’ flow direction. If both
U and
are forward simulations between
A and
B and vice versa, i.e., if they fulfil
then
U is termed as a
forward bisimulation between
A and
B, and if both
U and
are backward simulations between
A and
B and vice versa, i.e., if they satisfy
then
U is known as a
backward bisimulation between
A and
B.
It is important to note that, for any , the conditions (-1), (-2), and (-3) can be treated a system of matrix inequations with the unknown matrix U, and simulations or bisimulations of type are precisely solutions to this system. This is extremely important because simulations between weighted automata over the field of real numbers are searched for by solving the corresponding systems of matrix inequalities.
Another important note is that the main role of simulations is to witness containment between automata A and B, while the main role of bisimulations is to witness equivalence between A and B. However, forward and backward simulations and bisimulations are defined by matrix inequations. On that note, in order to prove that simulations achieve containment and bisimulations achieve equivalence, we need the inequations to be preserved by multiplying, on either side, by the transition matrices, as well as by the initial and terminal vectors. Multiplication by matrices and vectors containing negative entries can violate inequalities, and, therefore, in order for simulations and bisimulations defined by systems of inequations to make full sense, we consider these types of bisimulations and simulations only between positive automata.
Theorem 1 is a modified version of [
1] (Theorem 1).
Theorem 1. The following statements are valid for positive WFA A and B over :
- (a)
For , if there is a simulation of type ω between A and B, then .
- (b)
For , if there is a bisimulation of type ω between A and B, then .
The modification is reflected in the following. A slightly different version of Theorem 1 was proved in [
1] [Theorem 1] for WFA over a positive semiring. Theorem 1 could also be formulated for
A and
B as WFA over the positive semiring
of nonnegative real numbers, but such a formulation would mean that the simulations and bisimulations between
A and
B should also be over the semiring
, that is, they should be positive matrices, which is not necessary. Namely, for positive WFA over an arbitrary ordered semiring (not necessarily positive), the proof of [
1] (Theorem 1) also holds for simulations and bisimulations that contain negative entries, and Theorem 1 is formulated to allow for such simulations and bisimulations as well.
As this article is primarily concerned with solving systems of matrix inequations, nothing important will change if we consider the more general case and allow the transition matrices, as well as the initial and terminal vectors, to have negative entries, which is performed below. On the other hand, in some applications of simulations and bisimulations, for example in the dimensionality reduction for WFA, there is a need to find positive solutions of the considered systems of matrix inequations. For this reason, we consider systems with an additional condition requiring the positivity of the solution. It should be noted that the proposed procedures for solving the systems remain valid even in the case when this condition is omitted, and in the same way, in that case we obtain solutions that do not have to be positive.
3. ZNN Designs for Simulations and Bisimulations of WFA over Real Numbers
This section defines and analyzes four novel ZNN models for addressing the systems of inequations (
3)–(
6). For the remainder of this section, let
and
be two WFA over
, where
,
,
and
,
,
with
.
Also, it is crucial to mention that the process of building a ZNN model usually involves two primary steps. The error matrix equation’s (EME) function,
, must be initially declared. Secondly, the dynamic system represented by the continuous differential equation of the general form
needs to be employed. The dynamical evolution (
7) relates the time derivative
to
in proportion to the positive real coefficient
. The convergence rate of the dynamical system (
7) is altered by manipulating the parameter
. More precisely, with increasing values of
, any ZNN model converges even faster [
13,
36,
37]. The primary goal of the dynamics (
7) is to force
to approach 0 as
. The continuous learning principle that emerges from the EME’s construction in Equation (
7) is used to manage this goal. EME is, therefore, considered as a tracking indication in the context of the ZNN model’s learning.
Special attention should be paid to a few notations that are used in the remainder of this work. The matrices with all ones and all zeros as entries are indicated by and , whereas the matrices with all ones and all zeros as entries are indicated by and . Furthermore, the identity matrix is indicated by , whereas , and stand for the vectorization process, the Kronecker product, the Hadamard (or elementwise) product, the Hadamard exponential, pseudoinversion, and the matrix Frobenius norm, respectively. Finally, denotes an matrix whose entries consist of random numbers.
3.1. The ZNN-fs Model
In line with (
3), the following group of inequations must be satisfied:
with respect to an unknown matrix
. Utilizing the vectorization in conjunction with the Kronecker product, the system (
8) is reformulated into the vector inequations form
To calculate
more efficiently, (
9) must be simplified. Thus, the vectorization-related Lemma 1 derived from [
38] is given.
Lemma 1. The vectorization of the transpose of is defined bywhere is a constant permutation matrix that depends on the number of columns n and number of rows m in W. The algorithmic procedure for generating the permutation matrix
P in (
10) is presented in the following Algorithm 1.
Algorithm 1 The permutation matrix P formation. |
Input: The number of rows m and columns n of a matrix .
- 1:
procedure Perm_Mat() - 2:
Put eye and reshape - 3:
return reshape - 4:
end procedure Output: P |
Using the permutation matrix
P for generating
, inequations (
9) can be rewritten in the form
wherein the last constraint imposes non-negativity on the solution. The corresponding block matrix form of (
11) is given by
such that
and
Then, considering the vector of slack variables
, the inequation (
12) can be converted into the corresponding equation
in which
is the time-varying term with secured non-negative entries.
Thereafter, the ZNN approach considers the following EME, which is based on (
12), to simultaneously satisfy all the inequations in (
8):
where
and
are the unknown matrices that need to be found. The ZNN design (
7) exploits the first time derivative of (
15)
Combining Equations (
15) and (
16) with the generic ZNN design (
7), we obtain
As a result, setting
the next system of linear equations with respect to
is obtained:
The ZNN dynamics are applicable in solving (
18) if the mass matrix
is invertible. To avoid this restriction, it is appropriate to use the pseudoinverse (best approximate) solution
An appropriate
ode MATLAB R2022a solver can be used to handle the ZNN dynamics (
19), additionally referred to as the ZNN-fs model. The ZNN-fs model’s convergence and stability investigation is shown in Theorem 2.
Theorem 2. Let and be the WFA over and the alphabet , where , , and , , with . The dynamics (17) in linewith the ZNN method (7) lead to the theoretical solution (TSOL), determined by , which is stable according to Lyapunov. Proof. Let
Using vectorization, Kronecker product, and the permutation matrix
P for constructing
, defined by Algorithm 1, the system (
20) is reformulated as
The equivalent form of (
21) is
where
and
are declared in Equation (
13). Then, considering the slack variable
, the inequation (
22) can be converted into the equation
in which
is always a non-negative time-varying term.
The substitution
gives
The 1st derivative of
is equal to
As a result, after substituting (
14) for
, the following holds
or
where
and
are declared in (
13). Then, the following results follow from (
7):
or equivalently
Next, for confirming the convergence, we choose the plausible Lyapunov function
The following is confirmed for
:
Because of (
24), the following is valid:
With
being the equilibrium point of the system (
23), we have
It appears that the equilibrium state
is stable in accordance with Lyapunov theory. Afterwards, when
, the following holds:
which finalizes the proof. □
Theorem 3. Let and be the WFA over and , where , , and , , with . Beginning from any initial point , the ZNN-fs model of (19) converges exponentially to , which refers to the TSOL of (3). Proof. Firstly, the system of (
8) is considered to find the solution
that is affiliated to the time-varying backward-forward bisimulation between
A and
B of (
3). Secondly, the system of (
8) is reformulated into the system of (
9) utilizing vectorization and the Kronecker product and, then, into the system of (
12) utilizing the operational permutation matrix
P for
. Thirdly, considering the slack variable
, the inequality constraint of the system of (
12) is converted into an equality constraint in the system of (
14). Fourthly, the EME of (
15) is constructed, in keeping with the ZNN technique and the system of (
14), to generate the solution
that is affiliated with the system of (
3). Fifthly, the model of (
17) is yielded in accordance to the ZNN technique of (
7) for zeroing (
15). According to Theorem 2, the EME of (
15) converges to zero as
. Consequently, the solution of (
19) converges to
as
. Furthermore, it is obvious that (
19) is (
17) in a different form because of the derivation process. After that, the proof is accomplished. □
3.2. The ZNN-bs Model
In line with (
4), the following group of inequations must be satisfied:
where
denotes the unknown matrix to be found. Utilizing vectorization and the Kronecker product, the system of inequations (
25) is rewritten in the equivalent form
and its corresponding matrix form is
where
Then, considering the slack variable
, the inequation (
26) can be converted into the equation
where
is always a non-negative time-varying term.
Thereafter, the ZNN approach considers the following EME, which is based on (
27), for simultaneously satisfying all the inequations in (
25):
where
and
are the unknown matrices to be found. The first time derivative of (
28) is
Then, combining Equations (
28) and (
29) with the ZNN design (
7), we obtain
As a result, setting
the next model is obtained:
Since the ZNN dynamics in solving (
31) requires invertibility of the mass matrix
, it is practical to use the best approximate solution to (
31), which leads to
An appropriate
ode MATLAB solver can be used to handle the ZNN model of (
32), additionally referred to as the ZNN-bs flow. The ZNN-bs model’s convergence and stability investigation is shown in Theorem 4.
Theorem 4. Let and be WFA over , where , , and , , with . The dynamics (30) in line with the ZNN method of (7) lead to the TSOL, shown by , which is stable according to Lyapunov. Proof. The proof is omitted since it is similar to the proof of Theorem 2. □
Theorem 5. Let and be WFA over , where , , and , , with . Beginning from any initial point , the ZNN-bs design (32) converges exponentially to , which refers to the TSOL of (4). Proof. The proof is omitted since it is similar to the proof of Theorem 3. □
3.3. The ZNN-fb Model
In line with (
5), the following group of inequations must be satisfied:
where
implies the unknown matrix to be generated. Utilizing vectorization in combination with the Kronecker product, the system of (
33) is reformulated as
Using the permutation matrix
P for
, (
34) is rewritten as
and its corresponding matrix form is
where
and
Then, considering the slack variables vector
, the inequation (
35) is converted into the equation
Thereafter, the ZNN approach considers the following EME, which is based on (
35), for simultaneously satisfying all the inequations in (
33):
where
and
are the unknown matrices to be found. The first time derivative of (
36) is equal to
Then, combining Equations (
36) and (
37) with the ZNN design (
7), we obtain the following:
As a result, setting
(
38) is transformed into the model
whose pseudoinverse solution is equal to
An appropriate
ode MATLAB solver can be used to handle the ZNN model (
39), additionally referred to as the ZNN-fb model. The ZNN-fb model’s convergence and stability investigation is shown in the next theorem.
Theorem 6. Let and be the WFA over , where , , and , , with . The dynamics of (38) in line with the ZNN method of (7) lead to the TSOL, shown by , which is stable according to Lyapunov. Proof. The proof is omitted since it is similar to the proof of Theorem 2. □
Theorem 7. Let and be the WFA over , where , , and , , with . Beginning from any initial point , the ZNN-bs model of (39) converges exponentially to , which refers to the TSOL of (5). Proof. The proof is similar to the proof of Theorem 3. □
3.4. The ZNN-bb Model
In line with (
6), the following group of inequations must be satisfied:
where
stands for the unknown matrix. The system of (
40) is reformulated as follows:
Using the permutation matrix
P for generating
, (
41) is rewritten as
and its corresponding matrix form is the following:
where
Then, considering the slack variable
, the inequation (
42) can be converted into the equation
in which
is always a non-negative time-varying term.
Thereafter, the ZNN approach considers the following EME, which is based on (
42), for simultaneously satisfying all the equations in (
40):
where
and
are the unknown matrices to be found. The first time derivative of (
43) is given as
Then, combining Equations (
43) and (
44) with the ZNN design of (
7), we can obtain
As a result, setting
the next model is obtained:
or an equivalent:
An appropriate
ode MATLAB solver can be used to handle the ZNN model of (
46), additionally referred to as the ZNN-bb model. The ZNN-bb model’s convergence and stability investigation is shown in the next theorem.
Theorem 8. Let and be the WFA over , where , , and , , with . The dynamics of (45) in line with the ZNN method of (7) lead to the TSOL, shown by , which is stable according to Lyapunov. Proof. The proof is omitted since it is similar to the proof of Theorem 2. □
Theorem 9. Let and be the WFA over , where , , and , , with . Beginning from any initial point , the ZNN-bb model of (46) converges exponentially to , which refers to the TSOL of (6). Proof. The proof is similar to the proof of Theorem 3. □