Modeling and Optimal Supervisory Control of Networked Discrete-Event Systems and Their Application in Traffic Management

Hou, Yunfeng; Shen, Yanni; Li, Qingdu; Ji, Yunfeng; Li, Wei

doi:10.3390/math11010003

Open AccessArticle

Modeling and Optimal Supervisory Control of Networked Discrete-Event Systems and Their Application in Traffic Management

by

Yunfeng Hou

¹

,

Yanni Shen

²,

Qingdu Li

¹

,

Yunfeng Ji

¹

and

Wei Li

^3,4,*

¹

Institute of Machine Intelligence, University of Shanghai for Science and Technology, Shanghai 200093, China

²

Department of Trade Union, Shanghai Publishing and Printing College, Shanghai 200093, China

³

School of Finance, Shanghai Lixin University of Accounting and Finance, Shanghai 201209, China

⁴

Postdoctoral Station of Applied Economics, Fudan University, Shanghai 200433, China

^*

Author to whom correspondence should be addressed.

Mathematics 2023, 11(1), 3; https://doi.org/10.3390/math11010003

Submission received: 13 November 2022 / Revised: 6 December 2022 / Accepted: 15 December 2022 / Published: 20 December 2022

Download

Browse Figures

Versions Notes

Abstract

In this paper, we investigate the modeling and control of networked discrete-event systems (DESs), where a supervisor is connected to the plant via an observation channel and the control commands issued by the supervisor are delivered to the actuator of the plant via a control channel. Communication delays exist in both the observation channel and the control channel. First, a novel modeling framework for the supervisory control of DESs subject to observation delays and control delays is presented. The framework explicitly models the interaction process between the plant and the supervisor over the communication channels. Compared with the previous work, a more accurate “dynamics” of the closed-loop system is specified. Under this framework, we further discuss how to estimate the states of the closed-loop system in the presence of observation delays and control delays. Based on the state estimation, we synthesize an optimal supervisor on the fly to maximize the controlled behaviors while preventing the system from leaving the desired behaviors under communication delays. We compare the proposed supervisor with the supervisor proposed in the literature and show that the proposed supervisor is more permissive. As an application, we show how the proposed approach can be applied to manage vehicles in a signal intersection. Finally, we show how to extend the proposed framework to model a system whose actuators and sensors are distributed at different sites.

Keywords:

networked DESs; communication delays; state estimation; online supervisor synthesis; signal intersection

MSC:

93C65

1. Introduction

The dynamics of DESs are driven by sequences of asynchronous events. The main control theory developed for DESs is the supervisory control theory, where a supervisor is desired to disable events that lead to some undesirable event sequences. Since the supervisor cannot control and observe all the events, the desired behaviors (control objective) could be unachievable. The necessary and sufficient conditions for the existence of a supervisor are characterized as controllability [1] and observability [2]. Since then, the supervisory control is extended in several directions, such as decentralized supervisory control [3], robust supervisory control [4], asynchronous supervisory control [5], and quantitative supervisory control [6].

Nowadays, in many industrial applications, the supervisor is usually connected to the plant via communication networks. Such a network structure provides efficient ways for controlling DESs. However, the communication delays existing in the observation channel and the control channel pose significant challenges to the supervisory control of DESs [7,8,9,10,11,12,13,14]. Thus, networked DESs have drawn much attention in the past few years [15,16,17,18,19,20] Most of the current works on networked supervisory control focus on verifying if a given control objective can be achieved under observation delays and control delays [20,21,22,23,24,25,26,27,28,29,30], which is known as the supervisor existence problem. When the desired language cannot be exactly achieved, one would compute a safe control policy online or offline, known as the supervisor synthesis problem [31,32,33,34,35,36]. In this paper, we focus on solving the maximally-permissive supervisor synthesis problem under observation delays and control delays. In particular, based on the infinite observed sequence of events, an online algorithm is presented in this paper to calculate a maximal supervisor under observation delays and control delays. The calculated online supervisor is optimal because (i) the system is prevented from leaving the desired language even if communication delays exist in both the observation channel and the control channel, and (ii) given (i) is satisfied, the language generated by the closed-loop system is maximized.

In the supervisor synthesis, state estimation is a crucial step in determining a valid control action after each new observation. The state estimation problem can be briefly stated as follows: estimate all of the states of the closed-loop system that may be under communication delays, given that all future control decisions are unknowable. To synthesize an optimal supervisor under communication delays, the authors in [22,24,37,38] compute the state estimate based on the open-loop system without using the information of the controls imposed on the system. As stated in [39], the state estimate calculated in [22,24,37,38] contains some states that have been prevented from reaching. Therefore, the solutions computed in [22,24,37,38] are suboptimal for the unrestricted domain of observed event sequences. In [29], the state estimates are computed based on the assumption that the control delays and the observation delays are constant. The proposed approach fails to deal with nondeterministic observation delays and control delays. Recently, the authors in [26] calculated the state estimates of the networked DESs by taking the information of the control decision’s history into consideration. Nevertheless, the work of [26] can be further improved in two directions. First, the framework of the networked supervisory control adopted in [26] is conservative in the sense that the specified language of the closed-loop system is an over-approximation of the actual language of the closed-loop system. That is, it may include some sequences that never occur in practice (see Example 1 for more details), and the state estimate computed by [26] may contain some states that the closed system never reaches. Thus, the synthesized supervisor could be restrictive in the sense that it may disable some unnecessary events. Second, the work of [26] considers only control delays. When only control delays exist, the observation of a supervisor to a string is deterministic and the control command made after a string can be uniquely determined. In practice, however, the delays often exist in both the observation channel and the control channel. If this is the case, the observation of a supervisor to a string is nondeterministic and varies with the different observations. The supervisor may make different control decisions based on different observations, which complicates the supervisor’s synthesis problem.

In this paper, a new modeling framework for the supervisory control of DESs under control delays and observation delays is first presented. Specifically, in the newly proposed framework, we model the observation channel by a sequence of pairs of an occurred event and its observation delays (called the observation channel configuration). We also model the control channel by a sequence of pairs of an issued control command and its control delays (called the control channel configuration). We then build an automaton to model the interaction process between the plant and the supervisor over the observation channel and the control channel. In the automaton, two special types of events representing the respective receptions of observable events and the executions of control commands are introduced. Each state of the automaton dynamically tracks the plant state, the current control command, the observation channel configuration, the control channel configuration, and the supervisor state. Based on the constructed automaton, the exact language of the closed-loop system can be specified. Under the framework, we then discuss how to estimate (and predict) all the states of the current (and future) closed-loop system. Without any structural assumption on the solution space, an online algorithm is finally presented to calculate a maximal network-controlled policy based on the infinite observed sequence of events. We further compare the proposed supervisor with the supervisor proposed in [26]. The previous framework may contain some physically impossible strings. This may damage the supervisor’s synthesis because a synthesized good supervisor may be mistakenly taken as a bad supervisor. There exists the possibility that all of the illegal strings that may be generated by the closed-loop system are physically impossible. In such situations, the controlled system can never reach an illegal state as all of the illegal strings never occur in reality. Since the proposed framework excludes all physically impossible strings, the state estimation is more precise than the previous approach. Thus, the proposed supervisor is more permissive than the previous one.

To show the application of the proposed modeling and control approach, we consider the vehicle management problem in a signal intersection. When a self-driving vehicle arrives at the intersection, it needs to communicate with the intersection to determine the traffic light color. If the traffic light is yellow or red, it must stop and wait until the traffic light is switched to green. Otherwise, if the traffic light is green, it can pass through the intersection. We show that the proposed approach can be used to achieve control objectives when control delays and observation delays exist.

Finally, we briefly discuss how to extend the proposed approaches to deal with non-FIFO observations and controls. Specifically, we consider a system where the actuators and the sensors are distributed at different sites. For each actuator, the supervisor sends control commands to it over an individual control channel, and for each sensor, the detected information is sent to the supervisor over an individual observation channel. Different channels may have different upper bounds of delays. Techniques are developed to model the dynamics of the closed-loop system.

The proposed supervisor synthesis approach differs from the existing works in the following sense.

In contrast to [26], we consider both the control delays and the observation delays in this paper. That is, the observation of the supervisor to a string is nondeterministic and varies with the different observation delays. For different observations, the supervisor may make different control decisions. An event after a string may be allowed to occur after some of these control decisions but not be allowed to occur for the other control decisions. Thus, we must consider all possibilities. In addition, the closed-loop system behaviors specified in the proposed framework exclude those strings that never occur in reality and are shown to be more accurate. As a result, the supervisor can estimate the states of the closed-loop system more accurately and make control decisions more reasonable at any instant.
Compared with [22,24,37,38], the supervisor makes control decisions based on closed-loop systems. In other words, the synthesized supervisor considers controls imposed on the system when making control decisions. Thus, the control command made by the proposed supervisor is optimal with respect to the unrestricted domain of the observed event sequences.
Different from [29], the proposed model assumes that the observation delays and control delays are nondeterministic, which often happens. In this paper, the observation delays and control delays are measured by the number of events occurring in the plant. More specifically, the observation delays and control delays are upper-bounded by $N_{o}$ and $N_{c}$ events, respectively. That is, all of the events delayed at the observation channel can be communicated to the supervisor (in the same order that they are generated) before no more than $N_{o}$ event occurrences. All control commands delayed at the control channel can be executed by the actuator (in the same order that they are issued) before no more than $N_{c}$ events (since they are issued).

The rest of this paper is organized as follows. Section 2 presents some preliminary concepts and the required assumptions in this paper. Section 3 introduces a new modeling framework for supervisory control with observation delays and control delays. An online procedure for estimating the states of the closed-loop system is presented in Section 4. Section 5 synthesizes a maximal and safe networked supervisor on the fly. Section 6 discusses an application for the vehicle control in a signal intersection. Section 7 extends the proposed approaches to deal with non-FIFO observations and controls. Section 8 concludes this paper.

2. Preliminaries

We model a DES using a deterministic finite-state automaton

G = (Q, Σ, δ, q_{0})

, where Q is the finite set of states;

Σ

is the finite set of events;

δ : Q \times Σ \to Q

is the transition function;

q_{0}

is the initial state.

Σ^{*}

is the Kleene closure of

Σ

, i.e., the set of all sequences over events in

Σ

.

δ

is extended to

Q \times Σ^{*}

in the usual way [40]. The language generated by G is denoted by

L (G)

.

ε

is the empty sequence. “!” means “is defined”.

Given a

s = σ_{1} σ_{2} \dots σ_{k} \in Σ^{*}

, we write

s^{i} = σ_{1} σ_{2} \dots σ_{i}

for

i = 1, 2, \dots, k

, and

s^{0} = ε

.

| s |

is the length of s.

\bar{s} = {s^{'} | (\exists s^{″}) s^{'} s^{″} = s}

denotes the set of all prefixes of s.

s_{- i}

denotes the prefix of s, such that

| s_{- i} | = max {0, | s | - i}

. Let

Σ^{\leq N} = {s \in Σ^{*} : | s | \leq N}

. Let

s \ t

be the suffix of s after its prefix t, i.e.,

t (s \ t) = s

. If t is not a prefix of s, then

s \ t

is not defined. The prefix closure of a language

L \subseteq Σ^{*}

is denoted by

\bar{L}

. L is prefix-closed if

L = \bar{L}

. In this paper, only prefix-closed languages are considered.

N

is the set of natural numbers. Let

[0, N] = {n \in N : n \leq N}

be the set of natural numbers no larger than N. Given

G_{1}

and

G_{2}

, we say

G_{1}

is a sub-automaton of

G_{2}

, denoted by

G_{1} ⊑ G_{2}

, if

G_{1}

can be obtained from

G_{2}

by deleting some states in

G_{2}

and all transitions connect to these states.

In many applications, the original system G may not satisfy the desired specification. To make the system fulfill the specification, the supervisory control finds a supervisor to dynamically disable events that lead to some undesirable sequences. In general, not all of the events are controllable and observable. We partition

Σ = Σ_{c} \cup Σ_{u c}

into the set of controllable events

Σ_{c}

and the set of uncontrollable events

Σ_{u c}

. We also partition

Σ = Σ_{o} \cup Σ_{u o}

into the set of observable events

Σ_{o}

and the set of unobservable events

Σ_{u o}

. The natural projection

P : L (G) \to Σ_{o}^{*}

is recursively defined as:

P (ε) = ε

and, for all

s, s σ \in L (G)

,

P (s σ) = P (s) σ

, if

σ \in Σ_{o}

, and

P (s σ) = P (s)

, if

σ \in Σ_{u o}

.

We denote, in this paper, the supervisor by a pair

S = (A, χ)

, where

A = (X, Σ_{o}, ξ, x_{0})

is a deterministic automaton with

L (A) = Σ_{o}^{*}

, and

χ : X \to 2^{Σ}

is a function that specifies the set of events to be enabled. Specifically, for any

t \in Σ_{o}^{*}

, we denote

χ (ξ (x_{0}, t))

by the set of events to be enabled after observing t. With a slight abuse of notation, we write

S (t) = χ (ξ (x_{0}, t))

. More details on the definition of S are provided in Example 1. Let

Π = {π \in 2^{Σ} : Σ_{u c} \subseteq π}

be the set of all the admissible control commands. Since we cannot disable an uncontrollable event,

S (t) \in Π

for all

t \in Σ_{o}^{*}

. The control objective in this paper is given by a specification language

K \subseteq L (G)

. We assume that K can be represented by a sub-automaton

H ⊑ G

of G. The automaton representation of language

K = L (H)

with

K \subseteq L (G)

can always be changed to satisfy

H ⊑ G

.

As shown in Figure 1, in the networked DESs, communications from the plant (supervisor) to the supervisor (plant) for the observation (control) are carried out over an observation channel (control channel) subject to random delays. We assume first-in-first-out (FIFO) is satisfied in both the observation and control, i.e., the observations of events are sent to the supervisor in the same order that they are generated and the control commands are executed in the same order that they are issued. As shown in [21,22,23,24], the delays are measured by the number of event occurrences (observable or not). We assume that (1) the observation delays are upper-bounded by

N_{o}

event occurrences, i.e., when an observable event occurs, it can be communicated to the supervisor before no more than

N_{o}

additional event occurrences; (2) the control delays are upper-bounded by

N_{c}

event occurrences, i.e., after a control command is issued, it can be executed before no more than

N_{c}

event occurrences. We assume that the initial control command has been deployed in the actuator of the plant beforehand. When the plant is initialized and starts to work, the initial control command can be executed without any delays.

Given system G and a supervisor S defined over

Σ_{o}^{*}

, we consider all possible strings, which may be generated by the closed-loop system (also called the controlled system) when the observation delays and control delays are upper-bounded by

N_{o}

and

N_{c}

, respectively. Before that, let us first recall how the previous works specify the language of the closed-loop system under observation delays and control delays. As shown in [23], an upper bound on possible strings, denoted by

L_{a} (S / G)

, which may be generated by the controlled system under observation delays and control delays is defined as follows:

$ε \in L_{a} (S / G)$ ;
for any $s \in L_{a} (S / G)$ and $s σ \in L (G)$ with $σ \in Σ$ , $s σ \in L_{a} (S / G)$ if $σ$ is enabled by one of the control commands issued in the past $N_{c} + N_{o}$ steps, i.e.,

$\begin{matrix} [(\forall s \in L_{a} (S / G)) (\forall σ \in Σ) s σ \in L (G)] s σ \in L_{a} (S / G) \Leftrightarrow \\ σ \in S (P (s)) \cup S (P (s_{- 1})) \cup \dots \cup S (P (s_{- N_{o} - N_{c}})) . \end{matrix}$

In [22,24], the language

L_{a} (S / G)

is also referred to as the large language. However, as discussed in [23,26],

L_{a} (S / G)

is not the exact language that may be generated by the closed-loop system. It is essentially an over-approximation of the actual language that may be generated by the closed-loop system and may contain some sequences that never occur in reality. To make this paper self-contained, we use the following simple example to illustrate this.

Example 1.

Consider the system G depicted in Figure 2a with

Σ = {α, β, η}

,

Σ_{o} = {α, β}

, and

Σ_{c} = Σ

. Let

N_{o, 1} = N_{c, 1} = 1

, i.e., the upper bounds of control delays and observation delays are both 1. The supervisor

S = (A, χ)

is depicted in Figure 2b. The function χ is specified by the set of events associated with each state in Figure 2b. Specifically,

S (ε) = χ (x_{0}) = π_{0} = {α, η}

. When α is observed, automaton A moves to state

x_{1}

from state

x_{0}

, and

S (α) = χ (x_{1}) = π_{1} = {β}

. When

α β

is observed, automaton A moves to state

x_{2}

from state

x_{1}

, and

S (α β) = χ (x_{2}) = π_{2} = {η}

. For the other

t \in Σ_{o}^{*} \ {ε, α, α β}

,

S (t) = χ (x_{3}) = π_{3} = \emptyset

. We first show that

α β α \in L (S / G)

.

At first, we have

ε \in L_{a} (S / G)

. Since

ε \in L_{a} (S / G)

,

α \in S (ε)

, and

α \in L (G)

, by definition,

α \in L_{a} (S / G)

. Moreover, since

α \in L_{a} (S / G)

,

β \in S (P (α)) = S (α)

, and

α β \in L (G)

, by definition,

α β \in L_{a} (S / G)

. Then, since

α β \in L_{a} (S / G)

,

α \in S (P ({(α β)}_{- 2})) = S (ε)

, and

α β α \in L (G)

, by definition,

α β α \in L_{a} (S / G)

. We next show that

α β α

never occurs in practice.

Since

α \in S (ε)

,

α \notin S (α)

, and

α \notin S (α β)

, one can check that α can occur after

α β

only if

S (ε)

is taking effect when α occurs after

α β

. However, since

β \in S (α)

and

β \notin S (ε)

,

S (α)

must have been executed at the time β occurs after α. In other words,

S (ε)

must have been replaced by

S (α)

after the occurrence of

α β

. Therefore,

α β α

never occurs (under S) in reality.

To obtain the exact language of the closed-loop system, we need a new modeling framework for networked DESs, which will be discussed in the following section.

3. Modeling Framework for Networked Supervisory Control

In this section, we consider a new modeling framework for the network supervisory control of DESs. In the new framework, we model the observation channel by a sequence of observable events waiting to be communicated and their observation delays. We also model the control channel by a sequence of control commands waiting to be executed and their control delays. We then build an automaton to describe how the supervisor and the plant interact with each other over the observation channel and the control channel. It is shown that the language of the closed-loop system subject to communication delays can be simply “decoded” from sequences of the constructed automaton.

3.1. Modeling of the Communication Channels

Let us first consider the observation channel.

Definition 1.

The observation channel configuration is defined as a sequence of pairs:

θ_{o} = (σ_{1}, n_{1}) \dots (σ_{k}, n_{k}),

where

σ_{1} \dots σ_{k} \in Σ_{o}^{*}

are the observable events (in the same order that they are generated) that have occurred but are currently delayed at the observation channel, and

n_{i} \in [0, N_{o}]

,

i = 1, \dots, k

is the number of event occurrences since

σ_{i}

occurred. If the observation channel is empty,

θ_{o} = ε

.

We denote by

Θ_{o} \subseteq {(Σ_{o} \times [0, N_{o}])}^{\leq N}

the set of all the possible observation channel configurations, where

N \in N

is the maximum length of

θ_{o} \in Θ_{o}

. The observation channel configuration

θ_{o}

is evolving when a new event occurs or a new observable event is communicated. To update

θ_{o}

, we introduce the following operators.

When a new event $σ \in Σ$ occurs, to update the observation channel configuration, we define the operator ${IN}^{o b s} : Θ_{o} \times Σ \to Θ_{o}$ as: for all $θ_{o} \in Θ_{o}$ and all $σ \in Σ$ ,

$\begin{matrix} {IN}^{o b s} (θ_{o}, σ) = \{\begin{matrix} θ_{o}^{+} & if σ \in Σ_{u o} \\ θ_{o}^{+} (σ, 0) & if σ \in Σ_{o}, \end{matrix} \end{matrix}$

(1)

where if $θ_{o} = (σ_{1}, n_{1}) \dots (σ_{k}, n_{k}) \neq ε$ , $θ_{o}^{+} = (σ_{1}, n_{1} + 1) \dots (σ_{k}, n_{k} + 1)$ , and if $θ_{o} = ε$ , $θ_{o}^{+} = ε$ .
When a new observable event $σ \in Σ_{o}$ delayed at the observation channel is communicated, to update the observation channel configuration, we define the operator ${OUT}^{o b s} : Θ_{o} \times Σ_{o} \to Θ_{o}$ as: for all $θ_{o} \in Θ_{o}$ and all $σ \in Σ_{o}$ ,

$\begin{matrix} {OUT}^{o b s} (θ_{o}, σ) = \{\begin{matrix} θ_{o} \ (σ_{1}, n_{1}) & if θ_{o} = (σ_{1}, n_{1}) \dots (σ_{k}, n_{k}) \neq ε \land \\ σ = σ_{1} \\ undefined & otherwise . \end{matrix} \end{matrix}$

(2)

When an event

σ \in Σ

occurs in the plant, all the natural numbers in

θ_{o}

should be ’plus 1’ since they are used to count the observation delays. Furthermore, if

σ \in Σ_{o}

, by FIFO, we still need to add

(σ, 0)

to the end of

θ_{o}

for recording the new observable event occurrence. That is what the operator

{IN}^{o b s} (\cdot)

does in Equation (1). On the other hand, when a new observable event is communicated to the supervisor, by FIFO, it must be the first event queued at the observation channel. Therefore, we define

{OUT}^{o b s} (\cdot)

to remove the first event

σ

from

θ_{o}

. Next, let us consider the control channel.

Definition 2.

The control channel configuration is defined as a sequence of pairs:

θ_{c} = (π_{1}, m_{1}) \dots (π_{h}, m_{h}),

where

π_{1} \dots π_{h} \in Π^{*}

is a sequence of control commands (in the same order that they are issued) that have been issued but are currently delayed at the control channel, and

m_{i} \in [0, N_{c}]

,

i = 1, \dots, h

is the number of event occurrences since

π_{i}

has been issued. If the control channel is empty,

θ_{c} = ε

.

We denote by

Θ_{c} \subseteq {(Π \times [0, N_{c}])}^{\leq M}

the set of all possible control channel configurations, where

M \in N

is the maximum length of

θ_{c} \in Θ_{c}

. To update

θ_{c}

, we introduce the following operators.

When a new event $σ \in Σ$ occurs in the plant, to update the control channel configuration, we define the operator $PLUS : Θ_{c} \to Θ_{c}$ as: for all $θ_{c} \in Θ_{c}$ ,

$\begin{matrix} PLUS (θ_{c}) = θ_{c}^{+}, \end{matrix}$

(3)

where if $θ_{c} = (π_{1}, m_{1}) \dots (π_{h}, m_{h}) \neq ε$ , $θ_{c}^{+} = (π_{1}, m_{1} + 1) \dots (π_{h}, m_{h} + 1)$ , and if $θ_{c} = ε$ , $θ_{c}^{+} = ε$ .
When a new control command $π \in Π$ is issued by the supervisor, to update the control channel configuration, we define the operator ${IN}^{c t r} : Θ_{c} \times Π \to Θ_{c}$ as: for all $θ_{c} \in Θ_{c}$ and all $π \in Π$ ,

$\begin{matrix} {IN}^{c t r} (θ_{c}, π) = θ_{c} (π, 0) . \end{matrix}$

(4)
When a new control command $π \in Π$ delayed at the control channel is executed, to update the control channel configuration, we define the operator ${OUT}^{c t r} : Θ_{c} \times Π \to Θ_{c}$ as: for all $θ_{c} \in Θ_{c}$ and all $π \in Π$ ,

$\begin{matrix} {OUT}^{c t r} (θ_{c}, π) = \{\begin{matrix} θ_{c} \ (π_{1}, m_{1}) & if θ_{c} = (π_{1}, m_{1}) \dots (π_{h}, m_{h}) \neq ε \\ \land π = π_{1} \\ undefined & otherwise . \end{matrix} \end{matrix}$

(5)

When a new event occurs, for recording the control delays,

PLUS (θ_{c})

adds 1 to all of the natural numbers in

θ_{c}

. When a new control command is issued (following a new observation),

{IN}^{c t r} (θ_{c}, π)

adds the newly issued control command

π

to the end of

θ_{c}

. Moreover, when a new control command is executed,

{OUT}^{c t r} (θ_{c}, π)

removes the first control command

π

from

θ_{c}

.

3.2. Language of the Closed-Loop System

We next show how to specify the language that may be generated by the controlled system subject to observation delays and control delays. Specifying an accurate language requires the information of the control command that takes effect in the interval between two successive observable event communications. In the standard supervisory control framework without communication delays, the control command taking effect is exactly the one that was most-recently issued, which can be uniquely determined by the sequence that has been observed so far. However, in the presence of communication delays, the control commands taking effect (between two successive observable event communications) are non-deterministic.

To obtain the exact language of the closed-loop system, we construct an automaton that dynamically tracks the state of the plant, the current control command, the observation channel configuration, the control channel configuration, and the state of the supervisor. This model essentially captures the interaction process between the supervisor and the plant over the observation channel and the control channel. Before we formally construct the automaton, let us introduce two special types of events.

To keep track of what has been successfully communicated so far, we define the bijection $f : Σ_{o} \to Σ_{f}$ , such that $Σ_{f} = {f (σ) : σ \in Σ_{o}}$ is a set disjointed from $Σ$ . For all $σ \in Σ_{o}$ , we use $f (σ)$ to denote that the occurrence of $σ$ was communicated. Define $f^{- 1}$ as, for all $f (σ) \in Σ_{f}$ , $f^{- 1} (f (σ)) = σ$ . We extend f to a set of sequences, as $f (ε) = ε$ and, for all $s = σ_{1} σ_{2} \dots σ_{k} \in Σ_{o}^{*}$ , $f (s) = f (σ_{1}) f (σ_{2}) \dots f (σ_{k}) \in Σ_{f}^{*}$ . We also extend $f^{- 1}$ to a set of sequences, as $f^{- 1} (ε) = ε$ and, for all $f (s) = f (σ_{1}) f (σ_{2}) \dots f (σ_{k}) \in Σ_{f}^{*}$ , $f^{- 1} (f (s)) = σ_{1} σ_{2} \dots σ_{k} \in Σ_{o}^{*}$ .
To model which control action is taken, we define bijection $g : Π \to Σ_{g}$ , such that $Σ_{g} = {g (π) : π \in Π}$ is disjointed from $Σ \cup Σ_{f}$ . For all $π \in Π$ , we use $g (π)$ to denote that the control command $π$ was executed. Define $g^{- 1}$ as, for all $g (π) \in Σ_{g}$ , $g^{- 1} (g (π)) = π$ . We extend g to a set of sequences, as $g (ε) = ε$ and, for all $μ = π_{1} π_{2} \dots π_{k} \in Π^{*}$ , $g (μ) = g (π_{1}) g (π_{2}) \dots g (π_{k}) \in Σ_{g}^{*}$ . We also extend $g^{- 1}$ to a set of sequences, as $g^{- 1} (ε) = ε$ and, for all $g (μ) = g (π_{1}) g (π_{2}) \dots g (π_{k}) \in Σ_{g}^{*}$ , $g^{- 1} (g (μ)) = π_{1} π_{2} \dots π_{k} \in Π^{*}$ .

We show in Figure 3 how the plant interacts with the supervisor over the observation channel and the control channel. When a new observable event

σ \in Σ_{o}

occurs in the plant, it is immediately added to the end of the observation channel. Since the observation delays are upper-bounded by

N_{o}

event occurrences, the maximum observation delays after the occurrence of

σ

should be no larger than

N_{o}

, i.e.,

n_{1} + 1 \leq N_{o}

. Similarly, since the control delays are upper-bounded by

N_{c}

event occurrences, the maximum control delays after the occurrence of

σ

should be no larger than

N_{c}

, i.e.,

m_{1} + 1 \leq N_{c}

. By FIFO, the first event delayed at the observation channel, i.e.,

σ_{1}

can be communicated to the supervisor. If

σ_{1}

is communicated to the supervisor, we need to remove

(σ_{1}, n_{1})

from the head of the observation channel. Meanwhile, following the observation of

σ_{1}

, a new control command

χ (ξ (x, σ_{1}))

is made and is added to the end of the control channel. Moreover, by FIFO, the control commands are executed in the same order that they are issued. Thus,

π_{2}, \dots, π_{h}

cannot be executed until

π_{1}

is executed. If

π_{1}

is executed, we need to remove

(π_{1}, m_{1})

from the head of

θ_{c}

.

Notations: Given a

θ_{o} \in Θ_{o}

, if

θ_{o} = (σ_{1}, n_{1}) \dots (σ_{k}, n_{k}) \neq ε

, let

MAX (θ_{o}) = n_{1}

be the maximum delay occurring in the observation channel, and if

θ_{o} = ε

, let

MAX (θ_{o}) = 0

. Similarly, given a

θ_{c} \in Θ_{c}

, if

θ_{c} = (π_{1}, m_{1}) \dots (π_{h}, m_{h}) \neq ε

, let

MAX (θ_{c}) = m_{1}

be the maximum delay occurring in the control channel, and if

θ_{c} = ε

, let

MAX (θ_{c}) = 0

.

Given a supervisor

S = (A, χ)

with

A = (X, Σ_{o}, ξ, x_{0})

, we formally construct

G_{S} = (Q_{S}, Σ_{S}, δ_{S}, q_{0, S})

, where

Q_{S} \subseteq Q \times Π \times Θ_{o} \times Θ_{c} \times X

is the state space;

q_{0, S} = (q_{0}, S (ε), ε, ε, x_{0})

is the initial state, where

S (ε)

is the initial control command (since the initial control command can be immediately executed when the plant starts to work, we assume that the initial control command takes effect at first);

Σ_{S} \subseteq Σ \cup Σ_{f} \cup Σ_{g}

is the event set; the transition function

δ_{S} : Q_{S} \times Σ_{S} \to Q_{S}

is defined as:

For all $\tilde{q} = (q, π, θ_{o}, θ_{c}, x) \in Q_{S}$ and all $σ \in Σ$ ,

$\begin{matrix} δ_{S} (\tilde{q}, σ) = \{\begin{matrix} {\tilde{q}}^{'} & if δ (q, σ)! \land σ \in π \land MAX (θ_{o}^{+}) \leq N_{o} \land MAX (θ_{c}^{+}) \leq N_{c} \\ undefined & otherwise, \end{matrix} \end{matrix}$

(6)

where ${\tilde{q}}^{'} = (δ (q, σ), π, {IN}^{o b s} (θ_{o}, σ), PLUS (θ_{c}), x)$ ;
For all $\tilde{q} = (q, π, θ_{o}, θ_{c}, x) \in Q_{S}$ and all $f (σ) \in Σ_{f}$ ,

$\begin{matrix} δ_{S} (\tilde{q}, f (σ)) = \{\begin{matrix} {\tilde{q}}^{'} & if {OUT}^{o b s} (θ_{o}, σ)! \\ undefined & otherwise, \end{matrix} \end{matrix}$

(7)

where ${\tilde{q}}^{'} = (q, π, {OUT}^{o b s} (θ_{o}, σ), {IN}^{c t r} (θ_{c}, χ (ξ (x, σ))), ξ (x, σ))$ ;
For all $\tilde{q} = (q, π, θ_{o}, θ_{c}, x) \in Q_{S}$ and all $g (γ) \in Σ_{g}$ ,

$\begin{matrix} δ_{S} (\tilde{q}, g (γ)) = \{\begin{matrix} {\tilde{q}}^{'} & if {OUT}^{c t r} (θ_{c}, γ)! \\ undefined & otherwise, \end{matrix} \end{matrix}$

(8)

where ${\tilde{q}}^{'} = (q, γ, θ_{o}, {OUT}^{c t r} (θ_{c}, γ), x)$ .

Equation (6): for any

(q, π, θ_{o}, θ_{c}, x) \in Q_{S}

, an event

σ \in Σ

can occur at q only if (i)

σ

is active at q, i.e.,

δ (q, σ)!

; (ii)

σ

is allowed to occur by the control command in use, i.e.,

σ \in π

; (iii) after the occurrence of

σ

, the control delays and the observation delays are no larger than

N_{c}

and

N_{o}

, respectively, i.e.,

MAX (θ_{c}^{+}) \leq N_{c} \land MAX (θ_{o}^{+}) \leq N_{o}

. When

σ

occurs at q, to track the plant state, we set

q \leftarrow δ (q, σ)

. Meanwhile, to update the observation channel configuration and the control channel configuration, by Equations (1) and (3), we set

θ_{o} \leftarrow {IN}^{o b s} (θ_{o}, σ)

and

θ_{c} \leftarrow PLUS (θ_{c})

. Since no new control command is executed, we keep

π

unchanged.

Equation (7): for any

(q, π, θ_{o}, θ_{c}, x) \in Q_{S}

, if a new observable event

σ

is communicated (denoted by

f (σ) \in Σ_{f}

), by FIFO,

σ

must be the first event queued at the observation channel, i.e.,

{OUT}^{o b s} (θ_{o}, σ)!

. When

σ

is communicated, by Equation (2), we set

θ_{o} \leftarrow {OUT}^{o b s} (θ_{o}, σ)

. Meanwhile, upon the communication of

σ

, the supervisor moves to state

ξ (x, σ)

and sends a new control command

χ (ξ (x, σ))

to the actuator of the plant. Correspondingly, we set

x \leftarrow ξ (x, σ)

and

θ_{c} \leftarrow {IN}^{c t r} (θ_{c}, χ (ξ (x, σ)))

.

Equation (8): for any

(q, π, θ_{o}, θ_{c}, x) \in Q_{S}

, if a new control command

γ \in Π

is executed (denoted by

g (γ) \in Σ_{g}

), by FIFO, it must be the first control command queued at the control channel, i.e.,

{OUT}^{c t r} (θ_{c}, γ)!

. When

γ

is executed, by FIFO, the control command that takes effect becomes

γ

, and the control commands delayed at the control channel become

π_{2} \dots π_{h}

. Correspondingly, we have

π \leftarrow γ

and

θ_{c} \leftarrow {OUT}^{c t r} (θ_{c}, γ)

.

Remark 1.

By the construction,

G_{S}

satisfies all of the assumptions made in this paper. Specifically, by Equation (6), we know the maximum delay occurring in the observation channel is no larger than

N_{o}

, and the maximum delay occurring in the control channel is no larger than

N_{c}

. By Equations (7) and (8), the delayed observable events are communicated to the supervisor in the same order that they are generated, and the delayed control commands are delivered to the plant in the same order that they are issued, i.e., both the observation channel and the control channel satisfy the FIFO property. Moreover, by Equations (7) and (8), the observation delays and control delays are non-deterministic. That is, an observable event can be communicated in any one of the following

N_{o}

steps from the occurrence, and a control command can be executed in any one of the following

N_{c}

steps from when it is issued.

Remark 2.

In some control applications, there may exist communication losses between the plant and the supervisor. For example, some observable transitions may be lost when they are communicated to the supervisor. Let us denote the set of transitions of G by

δ_{G} = {(q, σ, q^{'}) : δ (q, σ) = q^{'}}

. We also denote the set of observable transitions of G by

δ_{G, o} = {(q, σ, q^{'}) : δ (q, σ) = q^{'} \land σ \in Σ_{o}}

. We partition

δ_{G, o}

into

δ_{G, L}

and

δ_{G, o} \ δ_{G, L}

, where

δ_{L}

is the set of transitions whose corresponding event occurrences are either observed without losses or observed with losses. To model possible observation losses, we can first refine the structure of G by adding parallel ε-transitions to the transitions that may be lost in

δ_{L}

and obtain

G^{'} = (Q, Σ \cup {ε}, δ^{'}, q_{0})

, where

δ^{'} = δ \cup {(q, ε, q^{'}) : (q, σ, q^{'}) \in δ_{L}}

. Using techniques developed in this section, we can construct

G_{S}^{'}

. Note that when constructing

G_{S}^{'}

, the supervisor does not need to make any control decisions following the communication of ε (

f (ε)

occurs). Although the occurrence of ε cannot be sensed by the supervisor, all of the natural numbers in

θ_{c}

and

θ_{o}

should be added by 1 when ε occurs (as an event occurred but was lost). In this paper, we focus on dealing with the nondeterministic observation delays and control delays existing between the supervisor and the plant. The formal approaches for implementing supervisory control under communication delays and losses are beyond the scope of this paper, yet a fruitful area for future exploration.

We use the following example to further illustrate how to construct

G_{S}

.

Example 2.

Consider the system G depicted in Figure 2a with

Σ = {α, β, η}

,

Σ_{o} = {α, β}

, and

Σ_{c} = Σ

. Let

N_{o, 1} = N_{c, 1} = 1

. The supervisor

S = (A, χ)

is depicted in Figure 2b. We now construct

G_{S}

using G and S.

The initial state of

G_{S}

is

{\tilde{q}}^{0} = (q^{0}, π^{0}, θ_{o}^{0}, θ_{c}^{0}, x^{0}) = (0, π_{0}, ε, ε, x_{0})

. By Figure 2a, we have

δ (q^{0}, α) = 1

. Moreover, since

α \in π^{0}

,

MAX ({(θ_{o}^{0})}^{+}) = 0 \leq N_{o}

, and

MAX ({(θ_{c}^{0})}^{+}) = 0 \leq N_{c}

, by Equation (6), we have

δ_{S} ({\tilde{q}}^{0}, α) = {\tilde{q}}^{1} = (q^{1}, π^{1}, θ_{o}^{1}, θ_{c}^{1}, x^{1})

, where

q^{1} = δ (q^{0}, α)

,

π^{1} = π^{0}

,

θ_{o}^{1} = {IN}^{o b s} (θ_{o}^{0}, α)

,

θ_{c}^{1} = PLUS (θ_{c}^{0})

, and

x^{1} = x^{0}

. Since

α \in Σ_{o}

, by Equation (1),

{IN}^{o b s} (θ_{o}^{0}, α) = (α, 0)

. By Equation (3),

PLUS (θ_{c}^{0}) = ε

. Therefore,

{\tilde{q}}^{1} = (q^{1}, π^{1}, θ_{o}^{1}, θ_{c}^{1}, x^{1}) = (1, π_{0}, (α, 0), ε, x_{0})

.

Next, consider state

{\tilde{q}}^{1}

. Since

θ_{o}^{1} = (α, 0)

, by Equation (2),

{OUT}^{o b s} (θ_{o}^{1}, α) = ε

. By Equation (7),

δ_{S} ({\tilde{q}}^{1}, f (σ)) = (q^{2}, π^{2}, θ_{o}^{2}, θ_{c}^{2}, x^{2}),

where

q^{2} = q^{1}

,

π^{2} = π^{1}

,

θ_{o}^{2} = {OUT}^{o b s} (θ_{o}^{1}, σ)

,

θ_{c}^{2} = {IN}^{c t r} (θ_{c}^{1}, χ (ξ (x^{1}, σ)))

, and

x^{2} = ξ (x^{1}, α)

. By Figure 2b,

ξ (x^{1}, α) = ξ (x_{0}, α) = x_{1}

and

χ (ξ (x^{1}, α)) = χ (x_{1}) = π_{1}

. By Equation (4),

{IN}^{c t r} (θ_{c}^{1}, χ (ξ (x^{1}, α))) = (π_{1}, 0)

. Therefore,

δ_{S} ({\tilde{q}}^{1}, f (σ)) = (1, π_{0}, ε, (π_{1}, 0), x_{1})

.

In this way, we can define all of the transitions. Finally, the complete

G_{S}

is constructed as shown in Figure 4.

For all

μ \in L (G_{S})

, let

ψ (μ)

and

ψ^{f} (μ)

be the sequences obtained by removing all the event occurrences in

Σ_{f} \cup Σ_{g}

and

Σ \cup Σ_{g}

, respectively, without changing the order of the remaining event occurrences in

μ

. For example, consider

μ = α f (α) g (π_{1}) β \in L (G_{S})

in Figure 4. By the definitions of

ψ (\cdot)

and

ψ^{f} (\cdot)

, we have

ψ (μ) = α β

and

ψ^{f} (μ) = f (α)

. We extend

ψ

and

ψ^{f}

to a set of sequences in the usual way. Intuitively, for all

μ \in L (G_{S})

,

ψ (μ)

tracks the sequence that occurred in the plant, and

f^{- 1} (ψ^{f} (μ))

tracks what the supervisor observed after the occurrence of

ψ (μ)

. The following proposition formally proves this.

Proposition 1.

Given an arbitrary

μ \in L (G_{S})

, we write

δ_{S} (q_{0, S}, μ) = (q, π, θ_{o}, θ_{c}, x)

. Then, (i)

q = δ (q_{0}, ψ (μ))

and (ii)

x = ξ (x_{0}, f^{- 1} (ψ^{f} (μ)))

.

Proof.

Please see Appendix A. □

By Proposition 1, the dynamics of the closed-loop system can be simply obtained by removing all of the events in

Σ_{f} \cup Σ_{g}

from sequences generated by

G_{S}

, which yields the following definition.

Definition 3.

Given a system G and a supervisor S, we construct

G_{S}

as described above. All possible strings that may be generated by the closed-loop system when the observation delays and control delays are upper-bounded by

N_{o}

and

N_{c}

, respectively, are defined as

L (S / G) = ψ (L (G_{S}))

.

Remark 3.

By Definition 3 and Figure 4,

α β α

is not included in

L (S / G)

. As we already discussed in Example 1,

α β α

never occurs in practice. This example justifies the advantage of the proposed modeling framework.

Given two supervisors

S_{1}

and

S_{2}

, we say that

S_{1}

is smaller than

S_{2}

, denoted by

S_{1} \subseteq S_{2}

, if for all

t \in Σ_{o}^{*}

,

S_{1} (t) \subseteq S_{2} (t)

, and we say that

S_{1}

is strictly smaller than

S_{2}

if

S_{1} \subseteq S_{2}

, and there exists

t \in Σ_{o}^{*}

, such that

S_{1} (t) \subset S_{2} (t)

. The following proposition states that the more events a supervisor S enables, the larger the language

L (S / G)

the closed-loop system generates.

Proposition 2.

Given two

S_{i} = (A_{i}, χ_{i})

with

A_{i} = (X_{i}, Σ_{o}, ξ_{i}, x_{0, i})

,

i \in {1, 2}

, we construct

G_{S_{i}} = (Q_{S_{i}}, Σ_{S_{i}}, δ_{S_{i}}, q_{0, S_{i}})

as described above. Then, if

S_{1} \subseteq S_{2}

, we have

L (S_{1} / G) \subseteq L (S_{2} / G)

.

Proof.

The proof is provided in Appendix B. □

By Proposition 2, to synthesize a supervisor, such that the closed-loop language is maximal and safe, we only need to synthesize a supervisor’s maximal supervisor, such that the closed-loop system behaviors are safe. We next formally formulate the optimal supervisor synthesis problem.

3.3. Problem Formulation

Before we formally present the problem to be solved, let us first introduce the following notation. Given a supervisor S, for a prefix-closed language

L \subseteq Σ_{o}^{*}

,

{S |}_{L}

means that S is restricted to a smaller domain L, defined as

{S |}_{L} (t) = S (t)

, if

t \in L

, and undefined, otherwise. Assuming that the supervisor observes

t \in Σ_{o}^{*}

, our goal is to compute a maximal and safe supervisor on the fly, for each

t^{i} \in \bar{t}

,

i = 0, 1, \dots, | t |

.

Problem 1.

Assuming that the system G executes an arbitrarily long sequence

s \in L (G)

and the current observation for s is

t \in Σ_{o}^{*}

, we find a supervisor S, such that:

S is safe, i.e., $L (S / G) \subseteq K$ ;
${S |}_{\bar{t}}$ is maximal, i.e., there is no other $S^{'}$ that satisfies (1) with $S^{'} {|_{\bar{t}} \supset S |}_{\bar{t}}$ .

Remark 4.

Since we focus on an online supervisor synthesis, we only need to ensure that the control decisions that make up the current instant are optimal. That is why S is only required to be optimal on

\bar{t}

(instead of the whole

Σ_{o}^{*}

).

Remark 5.

The solutions to Problem 1 need not be unique. Actually, there may exist several incomparable maximal solutions. In this paper, we emphasize how to online synthesize a “greedily maximal” supervisor, rather than ambitiously calculate all possible solutions.

4. State Estimation under Communication Delays

To make a control decision right after each new observation, the supervisor needs to estimate the states of the closed-loop system (subject to observation delays and control delays) on the fly. We focus on the problem of online networked state estimation in this section. Let us first introduce the definition of the networked state estimate (NSE) as follows.

Definition 4.

Given a DES G and a supervisor S, we construct

G_{S}

as described in Section 3.2. For any

t \in f^{- 1} (ψ^{f} (L (G_{S})))

, define

\begin{matrix} E_{S} (t) = & {q \in Q : (\exists μ \in L (G_{S})) q = δ (q_{0}, ψ (μ)) \land t = f^{- 1} (ψ^{f} (μ))}, \end{matrix}

(9)

as the NSE of t under S, which is the set of all the possible states that system G may be in after observing t (subject to observation delays and control delays) under S.

If S is given beforehand, we can calculate

E_{S} (t)

by constructing an observer of

G_{S}

with the set of observable events

Σ_{f}

. However, we focus on online network supervisory control in this paper. That is, we need to calculate the state estimate right after each new communication (all future controls are unknown). To this end, besides the plant state

q \in Q

, we also need to estimate the current control command

π \in Π

, the observation channel configuration

θ_{o} \in Θ_{o}

, and the control channel configuration

θ_{c} \in Θ_{c}

, because all of them can affect the behaviors of the closed-loop system. Therefore, we denote each “state” of the closed-loop system by a four-tuple

(q, π, θ_{o}, θ_{c}) \in Q \times Π \times Θ_{o} \times Θ_{c}

. We call such a state an augmented state. Let

\tilde{Q} = {(q, π, θ_{o}, θ_{c}) : q \in Q \land π \in Π \land θ_{o} \in Θ_{o} \land θ_{c} \in Θ_{c}}

be the set of all the augmented states. We next show that we can estimate all possible states that the plant may be in by estimating all possible augmented states that the controlled system may reach.

To precisely estimate the augmented states, we need the following two operators.

Let

Z \in 2^{\tilde{Q}}

be a set of augmented states calculated immediately after a new observation or the initial

Z = \emptyset

(since the plant does not work until it is initialized, we let

Z = \emptyset

before the initial control command is executed).The delayed unobservable reach of Z under an admissible control command

γ \in Π

, denoted by

DUR (Z, γ)

, is defined as follows.

If $Z = \emptyset$ , then

$\begin{matrix} (q_{0}, γ, ε, ε) \in DUR (Z, γ), \end{matrix}$

(10)

and if $Z \neq \emptyset$ , then for all $(q, π, θ_{o}, θ_{c})$ ,

$\begin{matrix} (q, π, θ_{o}, {IN}^{c t r} (θ_{c}, γ)) \in DUR (Z, γ); \end{matrix}$

(11)
Then, we repeatedly apply the following operations until convergence is achieved.
- For all $(q, π, θ_{o}, θ_{c}) \in DUR (Z, γ)$ and all $σ \in Σ$ , if $δ (q, σ)!$ and $σ \in π$ and $MAX (θ_{o}^{+}) \leq N_{o}$ and $MAX (θ_{c}^{+}) \leq N_{c}$ , then
  
  $\begin{matrix} (δ (q, σ), π, {IN}^{o b s} (θ_{o}, σ), PLUS (θ_{c})) \in DUR (Z, γ); \end{matrix}$
  
  (12)
- For all $(q, π, θ_{o}, θ_{c}) \in DUR (Z, γ)$ and all $γ^{'} \in Π$ , if ${OUT}^{c t r} (θ_{c}, γ^{'})!$ , then
  
  $\begin{matrix} (q, γ^{'}, θ_{o}, {OUT}^{c t r} (θ_{c}, γ^{'})) \in DUR (Z, γ) . \end{matrix}$
  
  (13)

If

Z = \emptyset

, no control commands have been executed. That is,

γ

is the initial control command. By assumption,

γ

can be executed without any delays. Hence, by Equation (10), we have

(q_{0}, γ, ε, ε) \in DUR (Z, γ)

. Otherwise, if

Z \neq \emptyset

,

γ

is not the initial control command. By FIFO, it will not be executed until all of the control commands that are now delayed at the control channel are executed. Thus, for all

(q, π, θ_{o}, θ_{c}) \in Z

, Equation (11) adds

γ

to the end of

θ_{c}

, i.e.,

θ_{c} \leftarrow {IN}^{c t r} (θ_{c}, γ)

. Meanwhile, Equations (12) and (13) consider the cases of “an event (observable or not) occurs” and “a control command is executed”, respectively. When there exist observation delays and control delays, only “an observable event is communicated” is observable. Therefore,

DUR (Z, γ)

consists of all the augmented states that may be reached from Z in an “unobservable” way.

Let

Z \in 2^{\tilde{Q}}

be the current set of augmented states. The delayed observable reach of Z under an observable event

σ \in Σ_{o}

, denoted by

DOR (Z, σ)

, is defined as:

\begin{matrix} DOR (Z, σ) = & {(q, π, {OUT}^{o b s} (θ_{o}, σ), θ_{c}) : (\exists (q, π, θ_{o}, θ_{c}) \in Z) {OUT}^{o b s} (θ_{o}, σ)!} . \end{matrix}

(14)

Intuitively,

DOR (Z, σ)

includes all of the augmented states that can be reached from Z upon a new communication of

σ

. By FIFO, an observable event can be communicated only if it is the first event queued at the observation channel. Hence, we consider all the

σ \in Σ_{o}

, such that there exists

(q, π, θ_{o}, θ_{c}) \in Z

with

{OUT}^{o b s} (θ_{o}, σ)!

. When

σ

is communicated, we remove

(σ_{1}, n_{1})

from

θ_{o}

. As we can see, we set

θ_{o} \leftarrow {OUT}^{o b s} (θ_{o}, σ)

. We assume that

DOR (Z, σ)

is updated right after a new observation of

σ

but before the next control command is issued. Therefore, we keep

θ_{c}

unchanged.

We next present how to online estimate augmented states.

Definition 5.

Let G be a DES and S be a supervisor. We construct

G_{S}

as described in Section 3.2. For a

t \in f^{- 1} (ψ^{f} (L (G_{S})))

, let

{\tilde{E}}_{S} (t)

be the augmented state estimate for t.

{\tilde{E}}_{S} (t)

is calculated by alternatively applying

D U R (\cdot)

and

DOR (\cdot)

as follows.

Initially, ${\tilde{E}}_{S} (ε) = D U R (\emptyset, S (ε))$ ;
For all $t^{i}, t^{i} σ \in \bar{t}$ , $i = 0, 1, \dots, | t | - 1$ ,

${\tilde{E}}_{S} (t^{i} σ) = DUR (DOR ({\tilde{E}}_{S} (t^{i}), σ), S (t^{i} σ)) .$

Remark 6.

{\tilde{E}}_{S} (t^{i} σ)

indeed online estimates the augmented states. As shown in Figure 5, the online procedure for estimating augmented states can be briefly summarized as repeatedly executing (i) an observable event occurrence

σ \in Σ_{o}

(after

t^{i}

) is communicated to the supervisor, and the set of augmented states is updated to

Z^{'} = DOR ({\tilde{E}}_{S} (t^{i}), σ)

; (ii) following the observation of σ, a new control command

π = S (t^{i} σ) \in Π

is issued by the supervisor S. Then, the corresponding augmented state estimate is updated to

{\tilde{E}}_{S} (t^{i} σ) = Z = D U R (Z^{'}, π)

.

We next show that

{\tilde{E}}_{S} (t)

indeed estimates the plant state, the current control command, the observation channel configuration, and the control channel configuration. Let us first define

\begin{matrix} T (t) = & {(q, π, θ_{o}, θ_{c}) \in \tilde{Q} : (\exists μ \in L (G_{S})) f^{- 1} (ψ^{f} (μ)) = t \land \\ δ_{S} (q_{0, S}, μ) = (p, γ, ω_{o}, ω_{c}, x) \land p = q \land γ = π \land ω_{o} = θ_{o} \land ω_{c} = θ_{c}} . \end{matrix}

The following lemma will be used later.

Lemma 1.

For any

t \in f^{- 1} (ψ^{f} (L (G_{S})))

if

z = (q, π, θ_{o}, θ_{c}) \in T (t)

and

z^{'}

is the augmented state calculated by applying Equation (12) or (13) on z, then

z^{'} \in T (t)

.

Proof.

Without a loss of generality, we write

z = (q, π, θ_{o}, θ_{c})

and

z^{'} = (q^{'}, π^{'}, θ_{o}^{'}, θ_{c}^{'})

. Since

z \in T (t)

, by the definition of

T (\cdot)

, there exists a

μ \in L (G_{S})

, such that

f^{- 1} (ψ^{f} (μ)) = t

and

δ_{S} (q_{0, S}, μ) = (p, γ, ω_{o}, ω_{c}, x)

with

p = q

,

γ = π

,

ω_{o} = θ_{o}

, and

ω_{c} = θ_{c}

. Since

z^{'}

is the augmented state obtained by applying one of the operations in Equations (12)∼(13) on z, one of the following two cases must be true.

Case 1:

z^{'} = (δ (q, σ), π, {IN}^{o b s} (θ_{o}, σ), PLUS (θ_{c}))

. By Equation (12),

δ (q, σ)!

,

σ \in π

,

MAX (θ_{o}^{+}) \leq N_{o}

, and

MAX (θ_{c}^{+}) \leq N_{c}

. Since

δ_{S} (q_{0, S}, μ) = (p, γ, ω_{o}, ω_{c}, x)

, by Equation (6),

δ_{S} (q_{0, S}, μ σ) = (δ (q, σ), π, {IN}^{o b s} (θ_{o}, σ), PLUS (θ_{c}), x)

. Thus,

z^{'} \in T (f^{- 1} (ψ^{f} (μ σ))) = T (t)

.

Case 2:

z^{'} = (δ (q, σ), γ^{'}, θ_{o}, {OUT}^{c t r} (θ_{c}, γ^{'}))

. By Equation (13),

{OUT}^{c t r} (θ_{c}, γ^{'})!

. Since

δ_{S} (q_{0, S}, μ) = (p, γ, ω_{o}, ω_{c}, x)

, by Equation (8),

δ_{S} (q_{0, S}, μ g (γ^{'})) = (q, γ^{'}, θ_{o}, {OUT}^{c t r} (θ_{c}, γ^{'}), x)

. Thus, we have that

z^{'} \in T (f^{- 1} (ψ^{f} (μ g (γ^{'}))) = T (t)

. □

Theorem 1.

Given a DES G and a supervisor S, we construct

G_{S} = (Q_{S}, Σ_{S}, δ_{S}, q_{0, S})

as described in Section 3.2. For any

t \in f^{- 1} (ψ^{f} (L (G_{S})))

, we have

\begin{matrix} {\tilde{E}}_{S} (t) = & {(q, π, θ_{o}, θ_{c}) \in \tilde{Q} : (\exists μ \in L (G_{S})) f^{- 1} (ψ^{f} (μ)) = t \land \\ δ_{S} (q_{0, S}, μ) = (p, γ, ω_{o}, ω_{c}, x) \land p = q \land γ = π \land ω_{o} = θ_{o} \land ω_{c} = θ_{c}} . \end{matrix}

Proof.

(\subseteq)

We first prove

{\tilde{E}}_{S} (t) \subseteq T (t)

by contradiction. Suppose there exists

t \in f^{- 1} (ψ^{f} (L (G_{S})))

, such that

{\tilde{E}}_{S} (t) \neg \subseteq T (t)

. Without loss of generality (w.l.o.g.), we assume that t is the shortest sequence in

f^{- 1} (ψ^{f} (L (G_{S})))

, satisfying

{\tilde{E}}_{S} (t) \neg \subseteq T (t)

. We now show

t \neq ε

. By Definition 5, for any

z \in {\tilde{E}}_{S} (ε)

, there exists a sequence of augmented states

z_{0} z_{1} \dots z_{k}

, such that

z_{0} = (q_{0}, S (ε), ε, ε)

,

z_{k} = z

, and

z_{i + 1}

is the augmented state calculated by applying Equation (12) or (13) on

z_{i}

,

i = 0, 1, \dots, k - 1

. Since

δ_{S} (q_{0, S}, ε) = (q_{0}, S (ε), ε, ε, x_{0})

,

z_{0} \in T (ε)

. By repeatedly applying Lemma 1,

z_{1}, \dots, z_{k} \in T (ε)

. Therefore,

{\tilde{E}}_{S} (ε) \subseteq T (ε)

.

Since

t \neq ε

, we write

t = t^{'} σ

for some

σ \in Σ_{o}

. Since

{\tilde{E}}_{S} (t) \neg \subseteq T (t)

,

\exists z \in {\tilde{E}}_{S} (t)

such that

z \notin T (t)

. Since

z \in {\tilde{E}}_{S} (t)

, by Definition 5, there exists a sequence of augmented states

z_{0} z_{1} \dots z_{k}

with

z_{0} = (q, π, {OUT}^{o b s} (θ_{o}, σ), {IN}^{c t r} (θ_{c}, S (t^{'} σ)))

for some

(q, π, θ_{c}, θ_{o}) \in {\tilde{E}}_{S} (t^{'})

,

z_{k} = z

, and

z_{i + 1}

is the augmented state calculated by applying Equation (12) or (13) on

z_{i}

,

i = 0, 1, \dots, k - 1

. Next, we prove

z_{0} \in T (t^{'} σ)

.

Since

(q, π, θ_{c}, θ_{o}) \in {\tilde{E}}_{S} (t^{'}) \subseteq T (t^{'})

,

\exists μ \in L (G_{S})

, such that

f^{- 1} (ψ^{f} (μ)) = t^{'}

,

δ_{S} (q_{0, S}, μ) = (p, γ, ω_{o}, ω_{c}, x)

, and

p = q \land γ = π \land ω_{o} = θ_{o} \land ω_{c} = θ_{c}

. Since

{OUT}^{o b s} (θ_{o}, σ)!

and

θ_{o} = ω_{o}

, we have

{OUT}^{o b s} (ω_{o}, σ)!

. By Equation (7),

δ_{S} (q_{0, S}, μ f (σ)) = (p, γ, {OUT}^{o b s} (ω_{o}, σ), {IN}^{c t r} (ω_{c}, χ (ξ (x, σ))), ξ (x, σ)) .

Since

f^{- 1} (ψ^{f} (μ)) = t^{'}

, we have

f^{- 1} (ψ^{f} (μ f (σ))) = t^{'} σ

. By Proposition 1, we have

ξ (x, σ) = ξ (x_{0}, f^{- 1} (ψ^{f} (μ f (σ)))) = ξ (x_{0}, t^{'} σ)

. By definition, we have

χ (ξ (x, σ)) = S (t^{'} σ)

. Thus,

δ_{S} (q_{0, S}, μ f (σ)) = (p, γ, {OUT}^{o b s} (ω_{o}, σ), {IN}^{c t r} (ω_{c}, S (t^{'} σ)), ξ (x, σ)) .

By the definition of

T (\cdot)

,

(p, γ, {OUT}^{o b s} (ω_{o}, σ), {IN}^{c t r} (ω_{c}, S (t^{'} σ))) \in T (f^{- 1} (ψ^{f} (μ f (σ))) = T (t^{'} σ) .

Since

p = q \land γ = π \land ω_{o} = θ_{o} \land ω_{c} = θ_{c}

,

(q, π, {OUT}^{o b s} (θ_{o}, σ), {IN}^{c t r} (θ_{c}, S (t^{'} σ))) \in T (t^{'} σ)

. Hence,

z_{0} \in T (t^{'} σ)

. By repeatedly applying Lemma 1,

z_{1}, \dots, z_{k} \in T (t^{'} σ)

, which contradicts

z = z_{k} \notin T (t^{'} σ)

.

(\supseteq)

We next prove

{\tilde{E}}_{S} (t) \supseteq T (t)

. To prove

{\tilde{E}}_{S} (t) \supseteq T (t)

, we only need to prove that for all

μ \in L (G_{S})

, if

δ_{S} (q_{0, S}, μ) = (q, π, θ_{o}, θ_{c}, x)

, then

(q, π, θ_{o}, θ_{c}) \in {\tilde{E}}_{S} (f^{- 1} (ψ^{f} (μ)))

. The proof is by induction on the finite length of sequences in

L (G_{S})

.

Since

δ_{S} (q_{0, S}, ε) = (q_{0}, S (ε), ε, ε, x_{0})

and

(q_{0}, S (ε), ε, ε) \in {\tilde{E}}_{S} (ε)

, the base case is true. The induction hypothesis is that for all

μ \in L (G_{S})

with

| μ | \leq k

, we write

δ_{S} (q_{0, S}, μ) = (q, π, θ_{o}, θ_{c}, x)

. Then,

(q, π, θ_{o}, θ_{c}) \in {\tilde{E}}_{S} (f^{- 1} (ψ^{f} (μ)))

.

We next prove the same is also true for

μ e \in L (G_{S})

with

| μ | = k

. We write

δ_{S} (q_{0, S}, μ e) = (p, γ, ω_{o}, ω_{c}, y)

. Then,

δ_{S} ((q, π, θ_{o}, θ_{c}, x), e) = (p, γ, ω_{o}, ω_{c}, y)

. Since

e \in Σ_{S}

,

e \in Σ

,

e \in Σ_{f}

, or

e \in Σ_{g}

. We consider each of them separately as follows.

Case 1:

e = σ \in Σ

. Since

δ_{S} ((q, π, θ_{o}, θ_{c}, x), σ) = (p, γ, ω_{o}, ω_{c}, y)

, by Equation (6), we have

$δ (q, σ)!$ , $σ \in π$ , $MAX (θ_{o}^{+}) \leq N_{o}$ , and $MAX (θ_{c}^{+}) \leq N_{c}$ ;
$p = q$ , $γ = π$ , $ω_{o} = {IN}^{o b s} (θ_{o}, σ)$ , and $ω_{c} = PLUS (θ_{c})$ .

Since

(q, π, θ_{o}, θ_{c}) \in {\tilde{E}}_{S} (f^{- 1} (ψ^{f} (μ)))

and Condition 1 in Case 1, by Equation (12),

(δ (q, σ), π, {IN}^{o b s} (θ_{o}, σ), PLUS (θ_{c})) \in {\tilde{E}}_{S} (f^{- 1} (ψ^{f} (μ))) .

Since

σ \in Σ

, we have

f^{- 1} (ψ^{f} (μ σ)) = f^{- 1} (ψ^{f} (μ))

. By Condition 2 in Case 1,

(p, γ, ω_{o}, ω_{c}) \in {\tilde{E}}_{S} (f^{- 1} (ψ^{f} (μ σ)))

.

Case 2:

e = f (σ) \in Σ_{f}

. For brevity, we write

t = f^{- 1} (ψ^{f} (μ))

. Then,

f^{- 1} (ψ^{f} (μ f (σ))) = t σ

. Since

δ_{S} ((q, π, θ_{o}, θ_{c}, x), f (σ)) = (p, γ, ω_{o}, ω_{c}, y)

, by Equation (7),

${OUT}^{o b s} (θ_{o}, σ)!$ ;
$p = q$ , $γ = π$ , $ω_{o} = {OUT}^{o b s} (θ_{o}, σ)$ , $ω_{c} = {IN}^{c t r} (θ_{c}, χ (y))$ , and $y = ξ (x, σ)$ .

Since

δ_{S} (q_{0, S}, μ f (σ)) = (p, γ, ω_{o}, ω_{c}, y)

, by Proposition 1,

y = ξ (x_{0}, t σ)

. Thus,

χ (y) = S (t σ)

. By the induction hypothesis,

(q, π, θ_{o}, θ_{c}) \in {\tilde{E}}_{S} (t)

. Since

{OUT}^{o b s} (θ_{o}, σ)!

, by Equation (14),

(q, π, {OUT}^{o b s} (θ_{o}, σ), θ_{c}) \in D O R ({\tilde{E}}_{S} (t), σ)

. Moreover, since

χ (y) = S (t σ)

, by Equation (12),

(q, π, {OUT}^{o b s} (θ_{o}, σ), {IN}^{c t r} (θ_{c}, χ (y))) \in DUR (DOR ({\tilde{E}}_{S} (t), σ), S (t σ)) .

By Definition 5, we have

(q, π, {OUT}^{o b s} (θ_{o}, σ), {IN}^{c t r} (θ_{c}, χ (y))) \in {\tilde{E}}_{S} (t σ)

. Thus, by Condition 2 in Case 2,

(p, γ, ω_{o}, ω_{c}) \in {\tilde{E}}_{S} (t σ) = {\tilde{E}}_{S} (f^{- 1} (ψ^{f} (μ f (σ)))) .

Case 3:

e = g (γ^{'}) \in Σ_{g}

. Since

g (γ^{'}) \in Σ_{g}

,

f^{- 1} (ψ^{f} (μ)) = f^{- 1} (ψ^{f} (μ g (γ^{'})))

. Since

δ_{S} ((q, π, θ_{o}, θ_{c}, x), σ) = (p, γ, ω_{o}, ω_{c}, y)

, by Equation (8), we have

${OUT}^{c t r} (θ_{c}, γ^{'})!$ ;
$p = q$ , $γ = γ^{'}$ , $ω_{o} = θ_{o}$ , and $ω_{c} = {OUT}^{c t r} (θ_{c}, γ^{'})$ .

Moreover, since

(q, π, θ_{o}, θ_{c}) \in {\tilde{E}}_{S} (f^{- 1} (ψ^{f} (μ)))

and

{OUT}^{c t r} (θ_{c}, γ^{'})!

, by Equation (13), we have that

(q, γ^{'}, θ_{o}, {OUT}^{c t r} (θ_{c}, γ^{'})) \in {\tilde{E}}_{S} (f^{- 1} (ψ^{f} (μ))) = {\tilde{E}}_{S} (f^{- 1} (ψ^{f} (μ g (γ^{'})))) .

By Condition 2 in Case 3,

(p, γ, ω_{o}, ω_{c}) \in {\tilde{E}}_{S} (f^{- 1} (ψ^{f} (μ g (γ^{'})))) .

□

Let

z = (q, π, θ_{o}, θ_{c}) \in \tilde{Q}

be a given augmented state. We denote

FC (z) = q

by the first component (the plant state) of z. (“FC” means “first component”). We extend

FC (\cdot)

to a set of augmented states

Z \in 2^{\tilde{Q}}

as follows:

FC (Z) = {FC (z) : \forall z \in Z}

. The following corollary discusses the relationship between

E_{S} (t)

and

{\tilde{E}}_{S} (t)

.

Corollary 1.

Let G be a DES and S be a supervisor. We construct

G_{S}

as described in Section 3.2. For any

t \in f^{- 1} (ψ^{f} (L (G_{S})))

,

E_{S} (t) = FC ({\tilde{E}}_{S} (t))

.

Proof.

The proof directly follows from Theorem 1 and Definition 4. □

By Corollary 1, we can estimate the plant states by taking the first component of the estimated augmented states. We use the following example to further illustrate our online state estimation procedure.

Example 3.

Consider again the system G depicted in Figure 2a and the supervisor S depicted in Figure 2b. Let

Σ_{o} = {α, β}

,

Σ_{c} = {α, β, η}

, and

N_{o} = N_{c} = 1

. We now compute

{\tilde{E}}_{S} (ε)

,

E_{S} (ε)

and

{\tilde{E}}_{S} (α)

,

E_{S} (α)

.

Initially, by Equation (10),

(0, π_{0}, ε, ε) \in {\tilde{E}}_{S} (ε)

. Since

δ (0, α) = 1

,

α \in π_{0}

, and

MAX (ε^{+}) = 0 \leq N_{c}, N_{o}

, by Equation (12),

(1, π_{0}, (α, 0), ε) \in {\tilde{E}}_{S} (ε)

. Then, since

δ (1, η) = 2

,

η \in π_{0}

,

MAX ({(α, 0)}^{+}) = 1 \leq N_{c}

, and

MAX (ε^{+}) = 0 \leq N_{c}

, also by Equation (12),

(2, π_{0}, (α, 1), ε) \in {\tilde{E}}_{S} (ε)

. Therefore,

\begin{matrix} {\tilde{E}}_{S} (ε) = & {(0, π_{0}, ε, ε), (1, π_{0}, (α, 0), ε), (2, π_{0}, (α, 1), ε)} . \end{matrix}

By Corollary 1,

E_{S} (ε) = {0, 1, 2}

. Since

{OUT}^{o b s} ((α, 0), α) = {OUT}^{o b s} ((α, 1), α) = ε

, by Equation (14),

DOR ({\tilde{E}}_{S} (ε), σ) = {(1, π_{0}, ε, ε), (2, π_{0}, ε, ε)} .

By Definition 5,

{\tilde{E}}_{S} (α) = DUR (DOR ({\tilde{E}}_{S} (ε),

σ), S (α))

. Since

S (α)

is not the initial control command, by Equation (11),

(1, π_{0}, ε, (π_{1}, 0)), (2, π_{0}, ε, (π_{1}, 0)) \in DUR (DOR ({\tilde{E}}_{S} (ε), σ), S (α)) .

Then, by Equations (12) and (13), we have

\begin{matrix} {\tilde{E}}_{S} (α) = & {(1, π_{0}, ε, (π_{1}, 0)), (2, π_{0}, ε, (π_{1}, 0)], (2, π_{0}, ε, (π_{1}, 1)) \\ (2, π_{1}, ε, ε), (1, π_{1}, ε, ε), (3, π_{1}, (β, 0), ε), (4, π_{1}, (β, 1) (β, 0), ε)} . \end{matrix}

By Corollary 1,

E_{S} (α) = {1, 2, 3, 4}

.

5. Online Network Supervisory Control

In this section, we calculate a maximal and safe control on the fly based on the state estimation techniques developed in Section 4.

5.1. State Prediction

To determine if the control decision made at the moment is safe, we need to predict all states that we cannot prevent from reaching under observation delays and control delays. To this end, for a

z = (q, π, θ_{c}, θ_{o}) \in \tilde{Q}

appeared in an augmented state estimate, we construct an automaton

G_{z}

to check what states the plant may reach from q, if we disable all controllable events in the future. The basic idea for the construction of

G_{z}

is similar to that of

G_{S}

. That is, starting from z,

G_{z}

dynamically tracks the plant state, the current control command, the observation channel configuration, and the control channel configuration, given that all future controls are

Σ_{u c}

.

Formally, we construct

G_{z} = (Q_{z}, Σ_{z}, δ_{z}, z)

, where

Q_{z} \subseteq Q \times Π \times Θ_{o} \times Θ_{c}

is the state space;

z = (q, π, θ_{c}, θ_{o})

is the initial state;

Σ_{z} \subseteq Σ \cup Σ_{f} \cup Σ_{g}

is the event set; the transition function

δ_{z} : Q_{z} \times Σ_{z} \to Q_{z}

is defined as:

For all $z = (q, π, θ_{o}, θ_{c}) \in Q_{z}$ and all $σ \in Σ$ ,

$\begin{matrix} δ_{z} (z, σ) = \{\begin{matrix} z^{'} & if δ (q, σ)! \land σ \in π \land MAX (θ_{o}^{+}) \leq N_{o} \land MAX (θ_{c}^{+}) \leq N_{c} \\ undefined & otherwise, \end{matrix} \end{matrix}$

(15)

where $z^{'} = (δ (q, σ), π, {IN}^{o b s} (θ_{o}, σ), PLUS (θ_{c}))$ ;
For all $z = (q, π, θ_{o}, θ_{c}) \in \tilde{Q}$ and all $f (σ) \in Σ_{f}$ ,

$\begin{matrix} δ_{z} (\tilde{q}, f (σ)) = \{\begin{matrix} z^{'} & if {OUT}^{o b s} (θ_{o}, σ)! \\ undefined & otherwise, \end{matrix} \end{matrix}$

(16)

where $z^{'} = (q, π, {OUT}^{o b s} (θ_{o}, σ), {IN}^{c t r} (θ_{c}, Σ_{u c}))$ ;
For all $z = (q, π, θ_{o}, θ_{c}) \in \tilde{Q}$ and all $g (γ) \in Σ_{g}$ ,

$\begin{matrix} δ_{z} (z, g (γ)) = \{\begin{matrix} z^{'} & if {OUT}^{c t r} (θ_{c}, γ)! \\ undefined & otherwise, \end{matrix} \end{matrix}$

(17)

where $z^{'} = (q, γ, θ_{o}, {OUT}^{c t r} (θ_{c}, γ))$ .

Since we assume all the controllable events are disabled in the future when a new observable event is communicated, we adopt the control command

Σ_{u c}

. As shown in Equation (16), we set

θ_{c} \leftarrow IN (θ_{c}, Σ_{u c})

after the communication of

σ

.

The following proposition states that for any

z = (q, π, θ_{c}, θ_{o}) \in \tilde{Q}

and any

ν \in L (G_{z})

, we cannot disable the occurrence of

ψ (ν)

from q even if we disable all of the controllable events in the future.

Proposition 3.

Let G be a DES and S a supervisor. For any

μ \in L (G_{S})

, we write

δ_{S} (q_{0, S}, μ) = (q, π, θ_{o}, θ_{c}, x)

. Let

z = (q, π, θ_{o}, θ_{c})

and

G_{z}

be the automaton constructed as described above. Then, if

ν \in L (G_{z})

,

ψ (μ ν) \in L (S / G)

.

Proof.

Please see Appendix C. □

Given a

z = (q, π, θ_{o}, θ_{c}) \in \tilde{Q}

, all of the plant states that we cannot prevent from reaching q via some

ψ (ν) \in ψ (L (G_{z}))

can be obtained by taking the first component of

Q_{z}

. That is,

\begin{matrix} FC (Q_{z}) = {δ (q, ψ (ν)) : \forall ν \in L (G_{z})} . \end{matrix}

(18)

We can prove Equation (18) by inducing the finite length of sequences in

L (G_{z})

, which is similar to the proof of Proposition 1, and is omitted here for brevity.

5.2. Online Algorithm

Suppose that the current observation of the system is

t \in Σ_{o}^{*}

. When a new event is observed, the supervisor S makes a new control command

π \in Π

, and the augmented state estimate will be updated to

{\tilde{E}}_{S} (t σ) = DUR (DOR ({\tilde{E}}_{S} (t), σ), π)

. As discussed in Section 5.1, for any

z = (q, π, θ_{c}, θ_{o}) \in {\tilde{E}}_{S} (t σ)

,

FC (Q_{z})

collects all of the plant states that may be reached from q no matter what control commands we adopt in the future. Therefore, we define the set of “bad” augmented states as:

\begin{matrix} T_{s p e c} = {z \in \tilde{Q} : FC (Q_{z}) \cap (Q \ Q_{H}) \neq \emptyset} . \end{matrix}

(19)

For safety, all the augmented states in

T_{s p e c}

should never be reached. To make the problem non-trivial, we assume that the controlled system is safe if we choose to disable all of the controllable events after each new observation.

With the above preparations, we are now ready to introduce our online algorithm. We first pre-compute

T_{s p e c}

offline. The networked supervisory control for G is implemented on the fly in Algorithm 1 as follows: when the supervisor receives a new observable event occurrence

σ

, Line 9 is executed with the new communication of

σ

. The set of events to be enabled following the communication of

σ

is then calculated by the for-loop on Line 3, where all the controllable events are checked one by one to see if they can be enabled while the system cannot reach some “bad” augmented states. The above processes are repeated when another observable event occurrence is communicated.

Algorithm 1:Online maximal networked control

Definition 6.

For any

t \in Σ_{o}^{*}

, the online network supervisor

S_{t}

is defined as: for all

t^{i} \in \bar{t}

,

i = 0, 1, \dots, | t |

,

S_{t} (t^{i})

is the set of events that is enabled right after the communication of

t^{i}

, and for all

t^{'} \in Σ_{o}^{*}

with

t^{'} \notin \bar{t}

,

S_{t} (t^{'}) = Σ_{u c}

.

Note that

S_{t}

can be represented as an automaton with a finite state space.

Remark 7.

The maximal networked supervisors are not unique. Given a different order

Σ_{c} = {σ_{1}, \dots, σ_{k}}

on the controllable events, Algorithm 1 may return different results. However, the order of controllable events can be changed dynamically after each new communication, if desired. However, all of the possible supervisors returned by Algorithm 1 are safe and maximal.

Remark 8.

Algorithm 1 tries to enable a maximum allowable set of controllable events at any instant to ensure the closed-loop system is within the desired specification language. However, as discussed in Remark 7, there may exist several incomparable maximum control decisions after each new observation. In many applications, enabling a controllable event could involve financial and human costs. In such situations, it is preferable to select a maximum allowable set of controllable events with the minimum enablement of costs at each instant. A simple approach is to consider all maximum control commands and select one with the minimum enablement cost. Another approach is to list all of the controllable events in ascending order according to their enablement costs:

Σ_{c} = {σ_{1}, \dots, σ_{k}}

, where

σ_{1}

is a controllable event that is the least costly to enable, and

σ_{k}

is the event that is the most costly to enable. By the for-loop on Line 3, a controllable event with a smaller enablement cost has a priority to be considered. The first approach is optimal but needs more computational resources than the second approach. The second approach may be suboptimal but is more efficient compared with the first approach.

Remark 9.

For a given

z \in \tilde{Q}

, we know the number of states in

G_{z}

is upper-bounded by

| \tilde{Q} |

. Since the number of verifiers to be constructed is

| \tilde{Q} |

, the computational complexity for calculating

T_{s p e c}

is the order of

O (| \tilde{Q} |^{2})

. By definition,

\tilde{Q} \subseteq Q \times Π \times Θ_{o} \times Θ_{c}

, Since

Π \subseteq 2^{Σ}

,

Θ_{o} \subseteq {(Σ_{o} \times [0, N_{o}])}^{\leq N}

, and

Θ_{c} \subseteq {(Π \times [0, N_{c}])}^{\leq M}

, the complexity for calculating

T_{s p e c}

is polynomial with respect to (w.r.t.)

| Q |

and exponential w.r.t.

| Σ |

.

After each new communication, for each

σ \in Σ_{c}

, we need to test whether or not σ can be enabled. In each test,

DUR (Ξ, π)

is updated by Line 5 and we need to search the state space

\tilde{Q}

once. Therefore, the computational complexity of Algorithm 1 is the stepwise order of

O (| \tilde{Q} |)

, which is also polynomial w.r.t.

| Q |

and exponential w.r.t.

| Σ |

.

Next, we show that the control commands made at each step in Algorithm 1 guarantee that the controlled system is safe.

Theorem 2.

Suppose the current observation of the system is

t \in Σ_{o}^{*}

. Let

{\tilde{E}}_{S_{t}} (t^{i})

be the augmented state estimate for

t^{i} \in \bar{t}

under

S_{t}

,

i = 1, \dots, | t |

. Then,

(\forall i = 1, \dots, | t |) {\tilde{E}}_{S_{t}} (t^{i}) \cap T_{s p e c} = \emptyset \Leftrightarrow L (S_{t} / G) \subseteq K .

Proof.

Please see Appendix D. □

The following corollary states that

S_{t}

is the solution to Problem 1.

Corollary 2.

Suppose the current observation of the system is

t \in Σ_{o}^{*}

. The online supervisor

S_{t}

derived by Algorithm 1 satisfies conditions 1 and 2 of Problem 1.

Proof.

The proof directly follows from Problem 1 and Theorem 2. □

5.3. Comparison with the Existing Work

In this section, we compare the proposed algorithm with the algorithm proposed in [26]. Similar to [26], we assume that there are only control delays with an upper bound of

N_{c}

, and there are no observation delays, i.e.,

N_{o} = 0

in this section. To make this paper self-contained, we first review the state estimation techniques proposed in [26].

A channel configuration is defined in [26] as a set of pairs in the form of

θ = {(π_{1}, n_{1}), (π_{2}, n_{2}), \dots, (π_{k}, n_{k})},

where

π_{i} \in Π

is an admissible control action that is delayed at the control channel, and

n_{i} \in [0, N_{c}]

is a nonnegative integer indicating that the control action

π_{i}

is still effective for the next

n_{i}

steps. We denote by

Γ (θ)

the union of all the control actions in

θ

, i.e.,

Γ (θ) = \cup_{i = 1, \dots, k} π_{i}

. We also denote by

Θ \subseteq 2^{Π \times [0, N_{c}]}

the set of all channel configurations. To update a

θ \in Θ

after a new event occurrence, we define the “next” operator

N X : Θ \to Θ

as follows: for any

θ \in Θ

,

NX (θ) = {(π, n - 1) \in Π \times N : (π, n) \in θ, n \geq 1} .

NX (θ)

decreases the timing index of each element of

θ

by one unit and only keeps the elements of

θ

with nonnegative natural numbers. Thus,

θ

collects all the control actions issued in the past

N_{c}

steps (including the current step).

We define an extended state as a pair of a plant state

q \in Q

and a channel configuration

θ \in Θ

. Let

\hat{Q} = Q \times Θ

be the set of all extended states. Let

Z \in 2^{\hat{Q}}

be a set of extended states and

π \in Π

be a control action. Then, the networked unobservable reach of Z under

π

, denoted by

NU R_{π} (Z)

, is defined recursively as follows:

For any $(q, θ) \in Z$ , we have

$\begin{matrix} (q, θ \cup {(π, N_{c})}) \in NU R_{π} (Z); \end{matrix}$

(20)
For any $(q, θ) \in NU R_{π} (Z)$ and any unobservable event $σ \in Σ_{u o}$ , if $σ \in Γ (θ)$ and $δ (q, σ)!$ , then

$\begin{matrix} (δ (q, σ), NX (θ) \cup {(π, N_{c})}) \in NU R_{π} (Z) . \end{matrix}$

(21)

Operation Equation (20) is used to add the latest control action

π

into the channel configuration. Operation Equation (21) computes all the extended states that can be reached from any

(q, θ) \in NU R_{π} (Z)

via an unobservable event occurrence. In Equation (17), an event

σ

can occur at an extended state

(q, θ)

if it is active at state q, i.e.,

δ (q, σ)!

, and it is allowed to occur by one of the control actions issued in the past

N_{c}

steps, i.e.,

σ \in Γ (θ)

.

Let

Z \in 2^{\hat{Q}}

be a set of extended states and

σ \in Σ_{o}

be an observable event. The networked observable reach (

NOR

) of Z upon the occurrence of

σ

, denoted by

NO R_{σ} (Z)

, is defined as:

\begin{matrix} N O R_{σ} (Z) = {(δ (q, σ), N X (θ)) \in \tilde{Q} : (q, θ) \in x, σ \in Γ (θ)} . \end{matrix}

(22)

Operation Equation (22) collects all of the extended states that can be immediately reached from elements of Z via

σ

.

Let S be a given networked supervisor. The set of extended states that the controlled system may be in after a communicated

t \in Σ_{o}^{*}

, denoted by

{\hat{E}}_{S} (t)

, can be calculated as follows:

Initially, ${\hat{E}}_{S} (ε) = N U R_{S (ε)} ({(q_{0}, \emptyset)})$ ;
For all $t^{i}, t^{i} σ_{i + 1} \in \bar{{t}}$ , $i = 0, 1, \dots, | t | - 1$ ,

${\hat{E}}_{S} (t^{i} σ_{i + 1}) = N U R_{S (t^{i} σ_{i + 1})} (N O R_{σ_{i + 1}} ({\hat{E}}_{S} (t^{i}))) .$

Then, it is shown by Corollary 1 of [26] that the set of plant states that the controlled system may be in after observing t can be simply obtained by taking the first components of

{\hat{E}}_{S} (t)

.

Let

θ = {(γ_{1}, n_{1}), (γ_{2}, n_{2}), \dots, (γ_{k}, n_{k})}

be a channel configuration and

m \in [0, N_{c}]

be a non-negative integer. We denote by

Γ_{\geq m} (θ)

the union of all control decisions that can take effect in the next m steps, i.e.,

\begin{matrix} Γ_{\geq m} (θ) = ⋃_{i \in [1, k] : n_{i} \geq m} γ_{i} . \end{matrix}

(23)

The uncontrollable language for

θ

can be defined as:

\begin{matrix} L_{u c} (θ) : = \bar{Γ_{\geq 0} (θ) Γ_{\geq 1} (θ) \dots Γ_{\geq N_{c}} (θ) .} \end{matrix}

(24)

Given an extended state

\tilde{q} = (q, θ) \in \tilde{Q}

, the uncontrollable state prediction of

\tilde{q}

, denoted by

U S P (\tilde{q})

, is defined as

\begin{matrix} U S P (\tilde{q}) = {δ (q, s) \in Q : s \in L_{u c} (θ)} . \end{matrix}

(25)

The online supervisor synthesis approaches proposed in [26] mainly consist of the following two steps.

Step 1: When an observable event sequence

t \in Σ_{o}^{*}

is communicated, calculate the extended state estimate

{\hat{E}}_{S} (t)

;

Step 2: Find a maximal control decision

γ \in Γ

, such that

U S P (N U R_{γ} ({\hat{E}}_{S} (t))) \subseteq Q_{H}

.

Next, we use two examples to show that the proposed supervisor can be more permissive than that proposed in [26].

Example 4.

Consider the uncontrolled system G and the desired system H depicted in Figure 6a and Figure 6b, respectively. Let

Σ_{c} = Σ

and

Σ_{o} = {α, β}

.

Initially, we start from

{(0, \emptyset)}

and choose a maximal control decision

S (ε) = γ_{0}

, such that

UPS (NU R_{γ_{0}} ({(0, \emptyset)}) \subseteq Q_{H}

. One can check that

γ_{0} = {α, γ}

is such a maximal control action. Then, we can compute the extended state estimate

{\hat{E}}_{S} (ε) = NU R_{γ_{0}} ({(0, \emptyset)}) = {(0, {(γ_{0}, 2)}), (3, {(γ_{0}, 1)})}

. If α is observed, we have

NO R_{α} ({\hat{E}}_{S} (ε)) = {(1, {γ_{0}, 1})}

. Then, we again find a maximal control decision

S (α) = γ_{1}

, such that

UPS (N U R_{γ_{1}} ({(1, {γ_{0}, 1})})) \subseteq Q_{H}

.

By definition,

\tilde{q} = (1, {(γ_{0}, 1), (γ_{1}, 2)}) \in NU R_{γ_{1}} ({(1, {γ_{0}, 1})})

. One can check that

β \notin γ_{1}

, because otherwise, by Equation (24),

β γ \in L_{u c} (θ)

. Then, by Equation (25), we have

5 \in UPS (\tilde{q}) \in Q \ Q_{H}

. Therefore, we have

γ_{1} = {α, γ}

. Since

β \notin γ_{0}

and

β \notin γ_{1}

, β will never occur at State 1. Therefore, all possible behaviors that may occur under the synthesized supervisor include

{ε, γ, α}

.

In the previous framework, it was assumed that all control actions issued in the past

N_{c}

steps may take effect. Thus, in Example 4, since

γ \in γ_{0}

,

γ_{0}

may take effect after

α β

. Therefore, we must disable

β

after

α

to prevent the system from reaching State 5. Since

β \notin γ_{0}, γ_{1}

, we know that

β

will never occur after

α

. However, as shown in the following example, we can actually enable

β

after observing

α

, and the controlled system can never reach State 5.

Example 5.

Continue with Example 4. We now show how to apply Algorithm 1 to compute an optimal supervisor.

Algorithm 1 starts from

Ξ^{0} = \emptyset

and iterates the for-loop on Line 3 for computing a maximal

S (ε) = E_{a}^{0}

, such that

D U R (D O R (Ξ^{0}, E_{a}^{0})) \cap T_{s p e c} = \emptyset

. One can check that

E_{a}^{0} = {α, γ}

is such an optimal control decision. The augmented state estimate is updated to

\begin{matrix} {\tilde{E}}_{S} (ε) = & {[0, E_{a}^{0}, ε, ε], [3, E_{a}^{0}, ε, ε], [1, E_{a}^{0}, (α, 0), ε]} . \end{matrix}

After that, the supervisor observes α and estimates

Ξ^{1} = DOR (\tilde{E} (ε), σ) = {[1, E_{a}^{0}, ε, ε]} .

Then, iterating the for-loop on Line 3 leads to

S (α) = E_{a}^{1} = {α, β}

. The augmented state estimate is updated to

\begin{matrix} {\tilde{E}}_{S} (α) = {[1, E_{a}^{0}, ε, (E_{a}^{1}, 0)], [1, E_{a}^{1}, ε, ε], [2, E_{a}^{1}, (β, 0), ε]} . \end{matrix}

Then, the supervisor observes β and estimates

Ξ^{2} = DOR (\tilde{E} (α), β) = {[2, E_{a}^{1}, ε, ε]} .

Iterating the for-loop on Line 3, we have

S (α β) = E_{a}^{1} = {α, β}

.

Note that we cannot enable γ after observing

α β

. Therefore, under the synthesized supervisor, we may reach States 0, 1, 2, and 3. That is, all the possible behaviors that may be generated by the closed-loop system include

{ε, γ, α, α β}

.

By Examples 4 and 5, the language of the closed-loop system under the supervisor synthesized by Algorithm 1 is larger than the language of the closed-loop system under the supervisor synthesized by [26]. Since the proposed framework excludes all physically impossible strings, the state estimate calculated is more precise than that calculated by the previous approach. Thus, the proposed supervisor is more permissive than the previous one.

6. Application in Traffic Control

We consider a signalized intersection as shown in Figure 7. When a self-driving vehicle x arrives at the intersection, it needs to communicate with the intersection to observe the signal and make a control decision accordingly. The observation and control are realized through a network. Due to network characteristics, observation delays and control delays are unavoidable. We assume in this example the observation delays are upper-bounded by 1 and the control decision are upper-bounded by 1, i.e.,

N_{o} = 1

and

N_{c} = 1

. We define seven events as shown in Table 1.

Event a denotes that Vehicle x arrives at the intersection. Event p denotes that Vehicle x passes through the intersection. Event y denotes that the traffic signal is switched to yellow. The green time in one signal cycle is

t_{g}

seconds, and we divide

t_{g}

into

g_{1}

and

g_{2}

equally:

g_{1}

denotes the first

t_{g} / 2

seconds and

g_{2}

denotes the remaining

t_{g} / 2

seconds. Similarly, the red time in one signal cycle is

t_{r}

seconds, and we divide

t_{r}

into

r_{1}

and

r_{2}

equally:

r_{1}

denotes the first

t_{r} / 2

seconds and

r_{2}

denotes the remaining

t_{r} / 2

seconds.

Events a and p are controllable since Vehicle x can choose to approach or pass through the intersection. Events

r_{1}

,

r_{2}

,

g_{1}

,

g_{2}

, and y are observable but are not controllable since Vehicle x can observe but cannot change the color of the traffic light. The system model

G = (Q, Σ, δ, q_{0})

for vehicle x is displayed in Figure 8a.

Let us interpret the construction of G in Figure 8a as follows. When Vehicle x arrives at the intersection (a occurs), the system enters State 1. If the signal in the forward direction is switched to red for no more than

t_{r} / 2

seconds after green, i.e.,

r_{1}

. (respectively, red for more than

t_{r} / 2

seconds but no more than

t_{r}

seconds after green, i.e.,

r_{2}

, green for no more than

t_{g} / 2

seconds after red, i.e.,

g_{1}

, green for more than

t_{g} / 2

seconds but no more than

t_{g}

seconds after red, i.e.,

g_{2}

, and yellow, i.e., y), the system makes a state transition to State 5 (respectively, States 2, 6, 3, and 4). If the system is uncontrolled, Vehicle x can pass through the intersection at any time. Hence, p can occur in States 2, 3, 4, 5, and 6. Let us suppose that the traffic light is

r_{1}

when Vehicle x arrives at the intersection. Thus, the system is in State 5. If Vehicle x chooses to pass through the intersection, then the system moves to State 9. Otherwise, if Vehicle x stops at the intersection, then upon the occurrence of

r_{2}

, the traffic light enters the second stage of the red cycle, and the system makes a state transition from State 5 to State 2. Then, if Vehicle x chooses to pass through the intersection, the system moves from State 2 to State 9. Otherwise, by the switching rule, the traffic light is further switched to green (

g_{1}

occurs), the system makes a state transition from State 2 to State 6, and so on.

By traffic laws, passing the intersection (enabling p) is not permitted when the traffic light is red or yellow when the vehicle approaches the intersection. Therefore, we should disable the occurrence of p at States 2, 4, and 5. On the other hand, we can enable the occurrence of p at States 3 and 6. In particular, when the system is in State 3, the traffic lights may be switched to yellow. Upon the occurrence of y, the system moves to State 7. By the traffic law, enabling p is legal if the traffic light is switched from green to yellow when Vehicle x is passing through the intersection. Thus, we can enable p at State 7. The desired system H is depicted in Figure 8b.

We now apply Algorithm 1 to calculate an optimal control command after each new observation. We denote

Ξ^{i}

by the set of augmented states returned by Line 9 after the observation of the ith event. We also denote

E_{a}^{i}

by the set of events returned by Line 7 after the observation of the ith event.

Initially, by Lines 2 and 3, we have

Ξ = \emptyset

and

E_{a} \leftarrow Σ_{u c} = {y, r_{1}, r_{2}, g_{1}, g_{2}}

. Let

Σ_{c} = {a, p}

. By the for-loop on Line 3, we first try to add a into

E_{a}

and set

π \leftarrow Σ_{u c} \cup {a}

. By the definition of

DUR (\cdot)

, one can verify that

DUR (Ξ, π) \cap T_{s p e c} \neq \emptyset

since only the occurrence of p can lead the controlled system to the “illegal” state 9, and p can never occur if we choose to disable p now and in the future. Thus, by Line 6, we have

E_{a} = Σ_{u c} \cup {a}

. By the for-loop on Line 3, we next try to add p into

E_{a}

and set

π \leftarrow Σ_{u c} \cup {a, p}

. It can be checked that

z = (5, π, (r_{1}, 0), ε) \in DUR (Ξ, π)

and

z \in T_{s p e c}

since

π

may take effect after

a r_{1}

, and p is prevented from occurring after

a r_{1}

. Thus, we have

Ξ^{0} = \emptyset

and

E_{a}^{0} = Σ_{u c} \cup {a}

. Let

π_{0} \leftarrow E_{a}^{0}

. By definition,

\begin{matrix} DUR (Ξ^{0}, E_{a}^{0}) = & {(0, π_{0}, ε, ε), (1, π_{0}, ε, ε), (2, π_{0}, (r_{2}, 0), ε), (3, π_{0}, (g_{2}, 0), ε) \\ (4, π_{0}, (y, 0), ε), (5, π_{0}, (r_{1}, 0), ε), (6, π_{0}, (g_{1}, 0), ε) \\ (6, π_{0}, (r_{2}, 1) (g_{1}, 0), ε), (7, π_{0}, (g_{2}, 1) (y, 0), ε), (5, π_{0}, (y, 1) (r_{1}, 0), ε) \\ (2, π_{0}, (r_{1}, 1) (r_{2}, 0), ε), (3, π_{0}, (g_{1}, 1) (g_{2}, 0), ε)} . \end{matrix}

Next, if

g_{2}

is communicated, by Line 9,

\begin{matrix} Ξ^{1} = DOR (DUR (Ξ^{0}, E_{a}^{0}), g_{2}) = & {(3, π_{0}, ε, ε), (7, π_{0}, (y, 0), ε)} . \end{matrix}

Then, let us go to Line 2, and we have

E_{a} \leftarrow Σ_{u c}

. By the for-loop on Line 3, we first try to add a into

E_{a}

and set

π \leftarrow Σ_{u c} \cup {a}

. One can verify that

DUR (Ξ, π) \cap T_{s p e c} \neq \emptyset

. We next try to add p into

E_{a}

and set

π \leftarrow Σ_{u c} \cup {a, p}

.

Since

(7, π_{0}, (y, 0), ε) \in Ξ^{1}

, when

π

is issued, augmented states can occur in the order as follows:

\begin{matrix} (7, π_{0}, (y, 0), (π, 0)) \overset{g (π)}{⟶} (7, π, (y, 0), ε) \overset{r_{1}}{⟶} \\ (5, π, (y, 1) (r_{1}, 0), ε) \overset{f (y)}{⟶} (5, π, (r_{1}, 0), (π^{'}, 0)) \overset{p}{⟶} (9, π, (r_{1}, 1), (π^{'}, 1)), \end{matrix}

where

π^{'}

is the control command made after the communication of

r_{1}

. As we can see, the control action

π

may take effect at the time p occurs at

(5, π, (r_{1}, 0), (π^{'}, 0))

, which violates the traffic law. Hence, we can only add a into

E_{a}

. We have

E_{a}^{1} \leftarrow Σ_{u c} \cup {a}

.

The above process is repeated until the vehicle passes through the intersection. The synthesized supervisor is depicted in Figure 9. For brevity, we only list all of the controllable events to be enabled at each state of the supervisor (all of the uncontrollable events are omitted). From Figure 9, to pass through the intersection safely, Vehicle x must stop if the traffic light is

r_{1}

,

r_{2}

, y, or

g_{1}

when the vehicle arrives at the intersection. Vehicle x can choose to pass through the intersection only when

g_{1}

is communicated, i.e., Vehicle x observes the occurrence of

g_{1}

.

7. Extension of the Proposed Framework

In this section, we briefly discuss how to model a system with non-FIFO observations and controls.

In many control applications, such as cyber–physical systems, the sensors are often distributed at different sites, and the detected information is communicated to the supervisor over different observation channels. Different observation channels may have different upper bounds of observation delays. The nondeterministic observation delays may change the order of events communicated to the supervisor. In other words, the supervisor may receive observable event occurrences in different orders as they occur. On the other hand, the enablement and disablement of controllable events are achieved by single actuators, and all actuators are distributed at different sites. The supervisor sends the control decisions for disabling or enabling events to the corresponding actuators upon each new event observation. Different control channels may have different upper bounds of control delays.

As shown in Figure 10, there are

| Σ_{o} |

sensors, each is associated with an observable event. For brevity, we write

Σ_{o} = {σ_{1}, \dots, σ_{n}}

, where

n = | Σ_{o} |

. For each

σ_{i} \in Σ_{o}

, the occurrence can be detected by sensor i and communicated to the supervisor over observation channel i. We assume that observation delays occurring in the observation channel i are upper-bounded by

N_{o, i}

event occurrences. That is, when an event

σ_{i}

occurs, it will be communicated to the supervisor before no more than

N_{o, i}

additional event occurrences. On the other hand, there are

| Σ_{c} |

actuators, and each is associated with a controllable event. We write

Σ_{c} = {e_{1}, \dots, e_{m}}

, where

m = | Σ_{c} |

. For each

e_{i} \in Σ_{c}

, the enablement and disablement are achieved by actuator i, and the control decision for enabling or disabling

e_{i}

is sent to the actuator i over the control channel i. We assume that control delays occurring in control channel i are upper-bounded by

N_{c, i}

event occurrences. That is, the control decision made for an event

e_{i}

can be executed before no more than

N_{c, i}

additional event occurrences.

For each

e_{i} \in Σ_{c}

, the supervisor sends 0 or 1 to actuator i over control channel i, where “0” means “disablement” and “1” means “enablement”. Thus, we denote

Φ = {([0, 1] \times {e_{1}}) \times \dots \times ([0, 1] \times {e_{m}})}

by the set of all the possible control decisions that the supervisor may make. Correspondingly, we denote, in this section, the supervisor by a pair

D = (W, ω)

, where

W = (Z, Σ_{o}, ζ, z_{0})

is a deterministic automaton with

L (W) = Σ_{o}^{*}

, and

ζ : Z \to Φ

is a function that specifies control decisions for disabling or enabling

e_{1}, \dots, e_{m}

. Specifically, for any

t \in Σ_{o}^{*}

, we denote

ω (ζ (z_{0}, t))

by control decisions for disabling or enabling

e_{1}, \dots, e_{m}

. With a slight abuse of notation, we write

D (t) = ω (ζ (z_{0}, t))

. For any

t \in Σ_{o}^{*}

, we have

D (t) \in Φ

. For any

e \in Σ_{c}

and any

\bar{ϕ} = [(b_{1}, e_{1}), \dots, (b_{m}, e_{m})] \in Φ

, we say that e is allowed to be enabled by

\bar{ϕ}

, denoted by

e \in \bar{ϕ}

, if

(\exists i = 1, \dots, m) e = e_{i} \land b_{i} = 1

.

Definition 7.

The observation channel i configuration is defined as a sequence of pairs:

θ_{o, i} = (σ_{i}, n_{1}) \dots (σ_{i}, n_{k}),

where

σ_{i} \dots σ_{i} \in Σ_{o}^{*}

is a sequence of observable events

σ_{i}

that have been detected by sensor i but were currently delayed at the observation channel i, and

n_{j} \in [0, N_{o, i}]

,

j = 1, \dots, k

is the number of event occurrences since

σ_{i}

occurred and was detected. If observation channel i is empty,

θ_{o, i} = ε

.

We denote by

Θ_{o, i} \subseteq {({σ_{i}} \times [0, N_{o, i}])}^{\leq N_{i}}

the set of all possible observation channel i configurations, where

N_{i}

is the maximum length of

θ_{o, i}

. Given a

θ_{o, i} = (σ_{i}, n_{1}) \dots (σ_{i}, n_{k}) \in Θ_{o, i}

, let

MAX (θ_{o, i}) = n_{1}

be the maximum observation delays occurring in the observation channel i. The overall state of the observation channels is defined as a vector

{\bar{θ}}_{o} = [θ_{o, 1}, \dots, θ_{o, n}]

, where

θ_{o, i}

is the observation channel i configuration. Let

Θ_{o} = Θ_{o, 1} \times \dots \times Θ_{o, n}

be the set of all the states of observation channel configurations.

When a new event $σ \in Σ$ occurs, to update the state of the observation channel configurations, we define the operator ${IN}^{o b s} : Θ_{o} \times Σ \to Θ_{o}$ as: for all ${\bar{θ}}_{o} = [θ_{o, 1}, \dots, θ_{o, n}] \in Θ_{o}$ and all $σ \in Σ$ ,

${IN}^{o b s} ({\bar{θ}}_{o}, σ) = [θ_{o, 1}^{'}, \dots, θ_{o, n}^{'}],$

such that

$\begin{matrix} θ_{o, i}^{'} = \{\begin{matrix} θ_{o, i}^{+} (σ, 0) & if σ = σ_{i} \land MAX (θ_{o, i}^{+}) \leq N_{o, i} \\ θ_{o, i}^{+} & if σ \neq σ_{i} \land MAX (θ_{o, i}^{+}) \leq N_{o, i} \\ undefined & otherwise, \end{matrix} \end{matrix}$

(26)

where if $θ_{o, i} = (σ_{i}, n_{1}) \dots (σ_{i}, n_{k}) \neq ε$ , $θ_{o, i}^{+} = (σ_{i}, n_{1} + 1) \dots (σ_{i}, n_{k} + 1)$ , and if $θ_{o, i} = ε$ , $θ_{o, i}^{+} = ε$ .
When a new observable event $σ \in Σ_{o}$ is communicated to the supervisor, to update the state of the observation channel configurations, define the operator ${OUT}^{o b s} : Θ_{o} \times Σ_{o} \to Θ_{o}$ as for all ${\bar{θ}}_{o} = [θ_{o, 1}, \dots, θ_{o, n}] \in Θ_{o}$ and all $σ \in Σ_{o}$ ,

${OUT}^{o b s} ({\bar{θ}}_{o}, σ) = [θ_{o, 1}^{'}, \dots, θ_{o, n}^{'}],$

such that

$\begin{matrix} θ_{o, i}^{'} = \{\begin{matrix} θ_{o, i} \ θ_{o, i}^{1} & if σ = σ_{i} \land θ_{o, i} \neq ε \\ θ_{o, i} & if σ \neq σ_{i} \\ undefined & otherwise, \end{matrix} \end{matrix}$

(27)

where $θ_{o, i}^{1}$ is the first component of $θ_{o, i}$ . That is, for all $θ_{o, i} = (σ_{i}, n_{1}) \dots (σ_{i}, n_{k}) \neq ε$ , we have $θ_{o, i}^{1} = (σ_{i}, n_{1})$ .

When an event

σ \in Σ

occurs in the plant, all natural numbers in

θ_{o, i}

should be plus 1 since they are used to counting the observation delays. Furthermore, if

σ \in Σ_{o, i}

, by FIFO, we still need to add

(σ_{i}, 0)

to the end of

θ_{o, i}

for recording the new observable event occurrence. On the other hand, when a new observable event

σ \in Σ_{o}

is communicated to the supervisor,

{OUT}^{o b s} ({\bar{θ}}_{o}, σ)

removes

σ

from the head of

θ_{o, i}

if

σ = σ_{i}

.

Definition 8.

The control channel i configuration is defined as a sequence of pairs:

θ_{c, i} = (ϕ_{1}, m_{1}) \dots (ϕ_{h}, m_{h}),

where

ϕ_{1} \dots ϕ_{h} \in {({0, 1} \times {e_{i}})}^{*}

is a sequence of control decisions made for enabling or disabling

e_{i}

but are currently delayed at the control channel i, and

m_{j} \in [0, N_{c, i}]

,

j = 1, \dots, k

is the number of event occurrences since

π_{j}

has been issued. If the control channel i is empty,

θ_{c, i} = ε

.

We denote by

Θ_{c, i} \subseteq {(({0, 1} \times {e_{i}}) \times [0, N_{c, i}])}^{\leq M_{i}}

the set of all the possible control channel i configurations, where

M_{i} \in N

is the maximum length of

θ_{c, i} \in Θ_{c, i}

. Given a

θ_{c, i} = (ϕ_{1}, m_{1}) \dots (ϕ_{h}, m_{h}) \in Θ_{c, i}

, let

MAX (θ_{c, i}) = m_{1}

be the maximum control delays occurring in the control channel i. The overall state of the control channel is defined as a vector

{\bar{θ}}_{c} = [θ_{c, 1}, \dots, θ_{c, n}]

, where

θ_{c, i}

is the control channel i configuration. Let

Θ_{c} = Θ_{c, 1} \times \dots \times Θ_{c, m}

be the set of all the states of control channel configurations. To update

{\bar{θ}}_{c} \in Θ_{c}

, we introduce the following operators.

When a new event $σ \in Σ$ occurs in the plant, to update the state of the control channel configurations, define the operator $PLUS : Θ_{c} \to Θ_{c}$ as: for all ${\bar{θ}}_{c} = [θ_{c, 1}, \dots, θ_{c, m}] \in Θ_{c}$ ,

$PLUS ({\bar{θ}}_{c}) = [θ_{c, 1}^{'}, \dots, θ_{c, m}^{'}],$

such that

$\begin{matrix} θ_{c, i}^{'} = \{\begin{matrix} θ_{c, i}^{+} & if MAX (θ_{c, i}^{+}) \leq N_{c, i} \\ undefined & otherwise, \end{matrix} \end{matrix}$

(28)

where if $θ_{c, i} = (ϕ_{1}, m_{1}) \dots (ϕ_{h}, m_{h}) \neq ε$ , $θ_{c, i}^{+} = (ϕ_{1}, m_{1} + 1) \dots (ϕ_{h}, m_{h} + 1)$ , and if $θ_{c, i} = ε$ , $θ_{c, i}^{+} = ε$ .
When a new control command $\bar{ϕ} = [ϕ_{1}, \dots, ϕ_{m}] \in Φ$ is issued by the supervisor, to update the state of the control channel configurations, we define the operator ${IN}^{c t r} : Θ_{c} \times Φ \to Θ_{c}$ as: for all ${\bar{θ}}_{c} = [θ_{c, 1}, \dots, θ_{c, m}] \in Θ_{c}$ and all $\bar{ϕ} = [ϕ_{1}, \dots, ϕ_{m}] \in Φ$ ,

${IN}^{c t r} ({\bar{θ}}_{c}, \bar{ϕ}) = [θ_{c, 1}^{'}, \dots, θ_{c, m}^{'}],$

such that $θ_{c, i}^{'} = θ_{c, i} (ϕ_{i}, 0)$ for all $i = 1, \dots, m$ .
When a new control command $ϕ \in {0, 1} \times Σ_{c}$ is executed, to update the states of the control channel configurations, define the operator ${OUT}^{c t r} : Θ_{c} \times ({0, 1} \times Σ_{c}) \to Θ_{c}$ as: for all ${\bar{θ}}_{c} = [θ_{c, 1}, \dots, θ_{c, m}] \in Θ_{c}$ and all $ϕ = (b, e) \in {0, 1} \times Σ_{c}$ ,

${OUT}^{c t r} ({\bar{θ}}_{c}, ϕ) = [θ_{c, 1}^{'}, \dots, θ_{c, m}^{'}],$

such that

$\begin{matrix} θ_{c, i}^{'} = \{\begin{matrix} θ_{c, i} \ θ_{c, i}^{1} & if e = e_{i} \land θ_{c, i} \neq ε \\ θ_{c, i} & if e \neq e_{i} \\ undefined & otherwise . \end{matrix} \end{matrix}$

(29)

When a new event occurs, for recording the control delays,

PLUS ({\bar{θ}}_{c})

adds 1 to all of the natural numbers in

θ_{c, i}

. When a new control command

\bar{ϕ} = [ϕ_{1}, \dots, ϕ_{m}] \in Φ

is issued (following a new observation),

{IN}^{c t r} ({\bar{θ}}_{c}, \bar{ϕ})

adds the newly issued control command to the end of control channel i. When a new control command is executed by actuator i,

{OUT}_{i}^{c t r} ({\bar{θ}}_{c}, ϕ)

removes the first control command

ϕ

from

θ_{c, i}

.

To keep track of what has been successfully communicated to the supervisor so far, define bijection $h : Σ_{o} \to Σ_{h}$ , such that $Σ_{h} = {h (σ) : σ \in Σ_{o}}$ is a set disjoint from $Σ$ . For all $σ \in Σ_{o}$ , we use $h (σ)$ to denote that the occurrence of $σ$ has been communicated to the supervisor.
To model which control action has been executed by one of the actuators, we define bijection $d : {0, 1} \times Σ_{c} \to Σ_{d}$ , such that $Σ_{d} = {d (ϕ) : ϕ \in {0, 1} \times Σ_{c}}$ is disjoint from $Σ \cup Σ_{h}$ . For all $ϕ \in {0, 1} \times Σ_{c}$ , we use $d (ϕ)$ to denote that the control command $ϕ$ has been executed by the corresponding actuator.

Given a supervisor

D = (W, ω)

with

W = (Z, Σ_{o}, ζ, z_{0})

, we formally construct

G_{D} = (Q_{D}, Σ_{D}, δ_{D}, q_{0, D}),

where

Q_{D} \subseteq Q \times Φ \times Θ_{o} \times Θ_{c} \times Z

is the state space;

q_{0, D} = (q_{0}, D (ε), [{\underset{︸}{ε, \dots, ε}}_{n}], [{\underset{︸}{ε, \dots, ε}}_{m}], z_{0})

is the initial state;

Σ_{D} \subseteq Σ \cup Σ_{h} \cup Σ_{d}

is the event set; the transition function

δ_{D} : Q_{D} \times Σ_{D} \to Q_{D}

is defined as:

For all $\tilde{q} = (q, \bar{ϕ}, {\bar{θ}}_{o}, {\bar{θ}}_{c}, z) \in Q_{D}$ and all $σ \in Σ$ ,

$\begin{matrix} δ_{D} (\tilde{q}, σ) = \{\begin{matrix} {\tilde{q}}^{'} & if δ (q, σ)! \land [σ \in Σ_{c} \Rightarrow σ \in \bar{ϕ}] \land \\ {IN}^{o b s} ({\bar{θ}}_{o}, σ)! \land PLUS ({\bar{θ}}_{c})! \\ undefined & otherwise, \end{matrix} \end{matrix}$

(30)

where ${\tilde{q}}^{'} = (δ (q, σ), \bar{π}, {IN}^{o b s} ({\bar{θ}}_{o}, σ), PLUS ({\bar{θ}}_{c}), z)$ ;
For all $\tilde{q} = (q, \bar{ϕ}, {\bar{θ}}_{o}, {\bar{θ}}_{c}, z) \in Q_{D}$ and all $h (σ) \in Σ_{h}$ ,

$\begin{matrix} δ_{D} (\tilde{q}, h (σ)) = \{\begin{matrix} {\tilde{q}}^{'} & if {OUT}^{o b s} ({\bar{θ}}_{o}, σ)! \\ undefined & otherwise, \end{matrix} \end{matrix}$

(31)

where ${\tilde{q}}^{'} = (q, \bar{ϕ}, {OUT}^{o b s} ({\bar{θ}}_{o}, σ), {IN}^{c t r} ({\bar{θ}}_{c}, ω (ζ (z, σ))), ζ (z, σ))$ ;
For all $\tilde{q} = (q, \bar{ϕ}, {\bar{θ}}_{o}, {\bar{θ}}_{c}, z) \in Q_{D}$ and all $d (ϕ) \in Σ_{d}$ ,

$\begin{matrix} δ_{D} (\tilde{q}, d (ϕ)) = \{\begin{matrix} {\tilde{q}}^{'} & if {OUT}^{c t r} ({\bar{θ}}_{c}, ϕ)! \\ undefined & otherwise, \end{matrix} \end{matrix}$

(32)

where ${\tilde{q}}^{'} = (q, UD (\bar{ϕ}, ϕ), {\bar{θ}}_{o}, {OUT}^{c t r} ({\bar{θ}}_{c}, ϕ), z)$ .

For all

μ \in L (G_{D})

, let

ψ^{'} (μ)

be the sequences obtained by removing all the event occurrences in

Σ_{h} \cup Σ_{d}

without changing the order of the remaining event occurrences in

μ

. We extend

ψ^{'} (\cdot)

to a set of sequences in the usual way. The dynamics of the closed-loop system can be simply obtained from

G_{S}

as follows.

Definition 9.

Given system G and supervisor D, we construct

G_{D}

as described above. All possible strings that may be generated by the closed-loop system with the observation delays

N_{o, 1}, \dots, N_{o, n}

and the control delays

N_{c, 1}, \dots, N_{c, m}

, are defined as:

L (D / G) = ψ^{'} (L (G_{D}))

.

By Definition 9, we can specify the dynamics of the closed-loop system when the sensors and the actuators are distributed at different sites. Furthermore, we can extend the proposed approaches to make the state estimation and synthesize the supervisor for the “distributed” system. Since this paper focuses on the case where there is one control channel and one observation channel, such an extension to the “distributed” system is beyond the scope of this paper.

8. Conclusions

In this paper, we considered the optimal supervisory control of DESs under communication delays. It is assumed that (i) delays do not change the order of the observations and controls; and (ii) both the observation delays and control delays have upper bounds. A modeling framework for supervisory control under communication delays was developed and evaluated. With this proposed framework, an online algorithm for the state estimation of the supervised system is proposed. The proposed algorithm can be used to solve the supervisor’s synthesis problem in networked DESs. Compared with the supervisor proposed in the existing work, (i) the synthesized supervisor can be more permissive as the proposed framework and state estimation approaches are more precise; (ii) the proposed framework considers the nondeterministic observation delays and control delays, which often happen. An application is provided to show how to implement the proposed algorithm. Finally, we extended the proposed framework to specify the dynamics of the closed-loop system when the sensors and actuators of the system are distributed, where delays may change the order of the observations and the controls.

One direction for future research can be to enhance the application scope of the proposed approach by accommodating communication losses in the system model. Researches can also look at how to estimate the states and synthesize supervisors when the sensors and actuators of the system are distributed.

Author Contributions

Conceptualization, Y.H. and W.L.; methodology, Y.H. and W.L.; software, Y.S.; validation, Y.S., Y.J., and Q.L.; formal analysis, Y.H. and Y.S.; investigation, W.L.; resources, Y.J. and W.L.; data curation, Y.S.; writing—original draft preparation, Y.H. and Y.S.; writing—review and editing, Y.H., Y.S., and W.L.; visualization, Y.S. and Y.J.; supervision, Q.L. and W.L.; project administration, Q.L. and W.L.; funding acquisition, Q.L. and W.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China under grant 92048205 and the Pujiang Talents Plan of Shanghai under grant 2019PJD035.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Proof of Proposition 1

Proof.

The proof is by induction on the finite length of sequences in

L (G_{S})

.

Since

δ_{S} (q_{0, S}, ε) = (q_{0}, S (ε), ε, ε, x_{0})

,

δ (q_{0}, ε) = q_{0}

, and

ξ (x_{0}, ε) = x_{0}

, the base case is true.

The induction hypothesis is that for all

μ \in L (G_{S})

with

| μ | \leq n

, if we write

δ_{S} (q_{0, S}, μ) = (q, π, θ_{o}, θ_{c}, x)

, then

q = δ (q_{0}, ψ (μ))

and

x = ξ (x_{0}, f^{- 1} (ψ^{f} (μ)))

. We next prove the same is also true for

μ e \in L (G_{S})

. Let us write

δ_{S} (q_{0, S}, μ e) = (q^{'}, π^{'}, θ_{o}^{'}, θ_{c}^{'}, x^{'})

. By the definition of

Σ_{S}

,

e \in Σ

,

e \in Σ_{f}

, or

e \in Σ_{g}

. We consider each of them separately as follows.

Case 1:

e = σ \in Σ

. By Equation (6),

q^{'} = δ (q, σ)

and

x^{'} = x

. Since

σ \in Σ

, by the definitions of

ψ (\cdot)

and

ψ^{f} (\cdot)

,

ψ (μ σ) = ψ (μ) σ

and

f^{- 1} (ψ^{f} (μ σ)) = f^{- 1} (ψ^{f} (μ))

. Moreover, since

q = δ (q_{0}, ψ (μ))

and

x = ξ (x_{0}, f^{- 1} (ψ^{f} (μ)))

,

δ (q_{0}, ψ (μ σ)) = δ (q, σ) = q^{'}

and

ξ (x_{0}, f^{- 1} (ψ^{f} (μ σ))) = ξ (x_{0}, f^{- 1} (ψ^{f} (μ))) = x = x^{'}

.

Case 2:

e = f (σ) \in Σ_{f}

. By Equation (7),

q^{'} = q

and

x^{'} = ξ (x, σ)

. Since

f (σ) \in Σ_{f}

, by the definitions of

ψ (\cdot)

and

ψ^{f} (\cdot)

,

ψ (μ f (σ)) = ψ (μ)

and

f^{- 1} (ψ^{f} (μ f (σ))) = f^{- 1} (ψ^{f} (μ)) σ

. Moreover, since

q = δ (q_{0}, ψ (μ))

and

x = ξ (x_{0}, f^{- 1} (ψ^{f} (μ)))

, we have

δ (q_{0}, ψ (μ f (σ))) = δ (q_{0}, ψ (μ)) = q = q^{'}

and

ξ (x_{0}, f^{- 1} (ψ^{f} (μ f (σ)))) = ξ (x_{0}, f^{- 1} (ψ^{f} (μ)) σ) = ξ (x, σ) = x^{'}

.

Case 3:

e = g (γ) \in Σ_{g}

. By Equation (8), we have

q^{'} = q

and

x^{'} = x

. Since

g (γ) \in Σ_{g}

, by the definitions of

ψ (\cdot)

and

ψ^{f} (\cdot)

,

ψ (μ g (γ)) = ψ (μ)

and

f^{- 1} (ψ^{f} (μ g (γ))) = f^{- 1} (ψ^{f} (μ))

. Moreover, since

q = δ (q_{0}, ψ (μ))

and

x = ξ (x_{0}, f^{- 1} (ψ^{f} (μ)))

, we have

δ (q_{0}, ψ (μ g (γ))) = δ (q_{0}, ψ (μ)) = q = q^{'}

and

ξ (x_{0}, f^{- 1} (ψ^{f} (μ g (γ)))) = ξ (x_{0}, f^{- 1} (ψ^{f} (μ))) = x = x^{'}

. □

Appendix B. Proof of Proposition 2

Proof.

Let us first introduce the following notation.

For any

θ_{c} = (π_{1}, n_{1}) \dots (π_{k}, n_{k}) \in Θ_{c}

and any

θ_{c}^{'} = (π_{1}^{'}, n_{1}^{'}) \dots (π_{k}^{'}, n_{k}^{'}) \in Θ_{c}

, we say

θ_{c} \leq θ_{c}^{'}

if

(\forall i = 1, \dots, k) π_{i} \subseteq π_{i}^{'} \land n_{i} = n_{i}^{'}

. Note that

ε \leq ε

holds.

We now prove that

\forall μ_{1} \in L (G_{S_{1}})

with

δ_{S_{1}} (q_{0, S_{1}}, μ_{1}) = (q_{1}, π_{1}, θ_{o, 1}, θ_{c, 1}, x_{1})

, there always exists a

μ_{2} \in L (G_{S_{2}})

such that

ψ (μ_{1}) = ψ (μ_{2})

,

f^{- 1} (ψ^{f} (μ_{1})) = f^{- 1} (ψ^{f} (μ_{2}))

, and

δ_{S_{2}} (q_{0, S_{2}}, μ_{2}) = (q_{2}, π_{2}, θ_{o, 2}, θ_{c, 2}, x_{2})

with

q_{1} = q_{2} \land π_{1} \subseteq π_{2} \land θ_{o, 1} = θ_{o, 2} \land θ_{c, 1} \leq θ_{c, 2}

. The proof is by induction on the finite length of sequences in

L (G_{S_{1}})

.

Since

δ_{S_{1}} (q_{0, S_{1}}, ε) = (q_{0}, S_{1} (ε), ε, ε, x_{0, 1})

,

δ_{S_{2}} (q_{0, S_{2}}, ε) = (q_{0}, S_{2} (ε), ε, ε, x_{0, 2})

,

ψ (ε) = ψ (ε)

,

f^{- 1} (ψ^{f} (ε)) = f^{- 1} (ψ^{f} (ε))

, and

q_{0} = q_{0} \land S_{1} (ε) \subseteq S_{2} (ε) \land ε = ε \land ε \leq ε

, the base case is true.

The induction hypothesis is that

\forall μ_{1} \in L (G_{S_{1}})

with

| μ_{1} | \leq k

, if

δ_{S_{1}} (q_{0, S_{1}}, μ_{1}) = (q_{1}, π_{1}, θ_{o, 1}, θ_{c, 1}, x_{1})

, then there exists a

μ_{2} \in L (G_{S_{2}})

such that

ψ (μ_{1}) = ψ (μ_{2})

,

f^{- 1} (ψ^{f} (μ_{1})) = f^{- 1} (ψ^{f} (μ_{2}))

, and

δ_{S_{2}} (q_{0, S_{2}}, μ_{2}) = (q_{2}, π_{2}, θ_{o, 2}, θ_{c, 2}, x_{2})

with

q_{1} = q_{2} \land π_{1} \subseteq π_{2} \land θ_{o, 1} = θ_{o, 2} \land θ_{c, 1} \leq θ_{c, 2}

. Next, we prove the same is also true for

μ_{1} e \in L (G_{S_{1}})

such that

| μ_{1} | = k

. w.l.o.g., let us write

δ_{S_{1}} (q_{0, S_{1}}, μ_{1} e) = (q_{1}^{'}, π_{1}^{'}, θ_{o, 1}^{'}, θ_{c, 1}^{'}, x_{1}^{'})

. By definition, we have (i)

e \in Σ

, (ii)

e \in Σ_{f}

, or (iii)

e \in Σ_{g}

. We consider each of them separately as follows.

Case 1:

e = σ \in Σ

. Since

δ_{S_{1}} (q_{0, S_{1}}, μ_{1}) = (q_{1}, π_{1}, θ_{o, 1}, θ_{c, 1}, x_{1})

and

δ_{S_{1}} (q_{0, S_{1}}, μ_{1} σ) = (q_{1}^{'}, π_{1}^{'}, θ_{o, 1}^{'}, θ_{c, 1}^{'}, x_{1}^{'})

, by Equation (6),

$δ (q_{1}, σ)! \land σ \in π_{1} \land MAX (θ_{o, 1}^{+}) \leq N_{o} \land MAX (θ_{c, 1}^{+}) \leq N_{c}$ ;
$q_{1}^{'} = δ (q_{1}, σ)$ , $π_{1}^{'} = π_{1}$ , $θ_{o, 1}^{'} = {IN}^{o b s} (θ_{o, 1}, σ)$ , $θ_{c, 1}^{'} = PLUS (θ_{c, 1})$ , and $x_{1}^{'} = x_{1}$ .

Moreover, since

δ_{S_{2}} (q_{0, S_{2}}, μ_{2}) = (q_{2}, π_{2}, θ_{o, 2}, θ_{c, 2}, x_{2})

with

q_{1} = q_{2} \land π_{1} \subseteq π_{2} \land θ_{o, 1} = θ_{o, 2} \land θ_{c, 1} \leq θ_{c, 2}

, we have

δ (q_{2}, σ)! \land σ \in π_{2} \land MAX (θ_{o, 2}^{+}) \leq N_{o} \land MAX (θ_{c, 2}^{+}) \leq N_{c}

. By Equation (6),

δ_{S_{2}} (q_{0, S_{2}}, μ_{2} σ) = (q_{2}^{'}, π_{2}^{'}, θ_{o, 2}^{'}, θ_{c, 2}^{'}, x_{2}^{'})

, where

q_{2}^{'} = δ (q_{2}, σ)

,

π_{2}^{'} = π_{2}

,

θ_{o, 2}^{'} = {IN}^{o b s} (θ_{o, 2}, σ)

,

θ_{c, 2}^{'} = PLUS (θ_{c, 2})

, and

x_{2}^{'} = x_{2}

. Therefore, we have

$[q_{2}^{'} = δ (q_{2}, σ) \land q_{1}^{'} = δ (q_{1}, σ) \land q_{1} = q_{2}] \Rightarrow [q_{2}^{'} = q_{1}^{'}]$ ;
$[π_{2}^{'} = π_{2} \land π_{1}^{'} = π_{1} \land π_{2} \supseteq π_{1}] \Rightarrow [π_{2}^{'} \supseteq π_{1}^{'}]$ ;
$[θ_{o, 2}^{'} = {IN}^{o b s} (θ_{o, 2}, σ) \land θ_{o, 1}^{'} = {IN}^{o b s} (θ_{o, 1}, σ) \land θ_{o, 1} = θ_{o, 2}] \Rightarrow [θ_{o, 2}^{'} = θ_{o, 1}^{'}]$ ;
$[θ_{c, 2}^{'} = PLUS (θ_{c, 2}) \land θ_{c, 1}^{'} = PLUS (θ_{c, 2}) \land θ_{c, 2} \geq θ_{c, 1}] \Rightarrow [θ_{c, 2}^{'} \geq θ_{c, 1}^{'}]$ .

Since

ψ (μ_{1}) = ψ (μ_{2})

and

f^{- 1} (ψ^{f} (μ_{1})) = f^{- 1} (ψ^{f} (μ_{2}))

, by definitions,

ψ (μ_{1} σ) = ψ (μ_{1}) σ = ψ (μ_{2}) σ = ψ (μ_{2} σ)

and

f^{- 1} (ψ^{f} (μ_{1} σ)) = f^{- 1} (ψ^{f} (μ_{2} σ))

.

Case 2:

e = f (σ) \in Σ_{f}

. Since

δ_{S_{1}} (q_{0, S_{1}}, μ_{1}) = (q_{1}, π_{1}, θ_{o, 1}, θ_{c, 1}, x_{1})

and

δ_{S_{1}} (q_{0, S_{1}}, μ_{1} f (σ)) = (q_{1}^{'}, π_{1}^{'}, θ_{o, 1}^{'}, θ_{c, 1}^{'}, x_{1}^{'})

, by Equation (7), we have

${OUT}^{o b s} (θ_{o, 1}, σ)!$ ;
$q_{1}^{'} = q_{1}$ , $π_{1}^{'} = π_{1}$ , $θ_{o, 1}^{'} = {OUT}^{o b s} (θ_{o, 1}, σ)$ , $θ_{c, 1}^{'} = {IN}^{c t r} (θ_{c, 1}, χ_{1} (ξ_{1} (x_{1}, σ)))$ , and $x_{1}^{'} = ξ_{1} (x_{1}, σ)$ .

Since

θ_{o, 1} = θ_{o, 2}

and

{OUT}^{o b s} (θ_{o, 1}, σ)!

,

{OUT}^{o b s} (θ_{o, 2}, σ)!

. Since

δ_{S_{2}} (q_{0, S_{2}}, μ_{2}) = (q_{2}, π_{2}, θ_{o, 2}, θ_{c, 2}, x_{2})

, by Equation (7),

δ_{S_{2}} (q_{0, S_{2}}, μ_{2} f (σ)) = (q_{2}^{'}, π_{2}^{'}, θ_{o, 2}^{'}, θ_{c, 2}^{'}, x_{2}^{'})

, where

q_{2}^{'} = q_{2}

,

π_{2}^{'} = π_{2}

,

θ_{o, 2}^{'} = {OUT}^{o b s} (θ_{o, 2}, σ)

,

θ_{c, 2}^{'} = {IN}^{c t r} (θ_{c, 2}, χ_{2} (ξ_{2} (x_{2}, σ)))

, and

x_{2}^{'} = ξ_{2} (x_{2}, σ)

. Therefore, we have

$[q_{2}^{'} = q_{2} \land q_{1}^{'} = q_{1} \land q_{1} = q_{2}] \Rightarrow [q_{1}^{'} = q_{2}^{'}]$ ;
$[π_{2}^{'} = π_{2} \land π_{1}^{'} = π_{1} \land π_{2} \supseteq π_{1}] \Rightarrow [π_{2}^{'} \supseteq π_{1}^{'}]$ ;
$[θ_{o, 2}^{'} = {OUT}^{o b s} (θ_{o, 2}, σ) \land θ_{o, 1}^{'} = {OUT}^{o b s} (θ_{o, 1}, σ) \land θ_{o, 1} = θ_{o, 2}] \Rightarrow [θ_{o, 2}^{'} = θ_{o, 1}^{'}]$ ;

Since

f^{- 1} (ψ^{f} (μ_{1})) = f^{- 1} (ψ^{f} (μ_{2}))

, we have

f^{- 1} (ψ^{f} (μ_{1} f (σ))) = f^{- 1} (ψ^{f} (μ_{2} f (σ)))

. We write

f^{- 1} (ψ^{f} (μ_{1} f (σ))) = t

. By Proposition 1,

x_{1}^{'} = ξ_{1} (x_{0, 1}, t)

and

x_{2}^{'} = ξ_{2} (x_{0, 2}, t)

. By the definitions of

S_{1}

and

S_{2}

,

S_{1} (t) = χ_{1} (x_{1}^{'})

and

S_{2} (t) = χ_{2} (x_{2}^{'})

. Since

S_{1} \subseteq S_{2}

,

χ_{1} (x_{1}^{'}) \subseteq χ_{2} (x_{2}^{'})

. Since

θ_{c, 1}^{'} = {IN}^{c t r} (θ_{c, 1}, χ_{1} (x_{1}^{'}))

,

θ_{c, 2}^{'} = {IN}^{c t r} (θ_{c, 2}, χ_{2} (x_{2}^{'}))

, and

θ_{c, 2} \geq θ_{c, 1}

, by the definition of

{IN}^{c t r} (\cdot)

,

θ_{c, 2}^{'} \geq θ_{c, 1}^{'}

. Moreover, since

ψ (μ_{1}) = ψ (μ_{2})

,

ψ (μ_{1} f (σ)) = ψ (μ_{1}) = ψ (μ_{2}) = ψ (μ_{2} f (σ))

.

Case 3:

e = g (γ) \in Σ

. Since

δ_{S_{1}} (q_{0, S_{1}}, μ_{1}) = (q_{1}, π_{1}, θ_{o, 1}, θ_{c, 1}, x_{1})

and

δ_{S_{1}} (q_{0, S_{1}}, μ_{1} g (γ)) = (q_{1}^{'}, π_{1}^{'}, θ_{o, 1}^{'}, θ_{c, 1}^{'}, x_{1}^{'})

, by Equation (8), we have

${OUT}^{c t r} (θ_{c, 1}, γ)!$ ;
$q_{1}^{'} = q_{1}$ , $π_{1}^{'} = γ$ , $θ_{o, 1}^{'} = θ_{o, 1}$ , and $θ_{c, 1}^{'} = {OUT}^{c t r} (θ_{c, 1}, γ_{1})$ , and $x_{1}^{'} = x_{1}$ .

Since

{OUT}^{c t r} (θ_{c, 1}, γ)!

, we know

{OUT}^{c t r} (θ_{c, 2}, γ^{'})!

, where

γ^{'}

is the first control command of

θ_{c, 2}

. Since

θ_{2} \geq θ_{1}

,

γ^{'} \supseteq γ

. Therefore, we have

$[q_{2}^{'} = q_{2} \land q_{1}^{'} = q_{1} \land q_{1} = q_{2}] \Rightarrow [q_{1}^{'} = q_{2}^{'}]$ ;
$[π_{2}^{'} = γ^{'} \land π_{1}^{'} = γ \land γ^{'} \supseteq γ] \Rightarrow [π_{2}^{'} \supseteq π_{1}^{'}]$ ;
$[θ_{o, 2}^{'} = θ_{o, 2} \land θ_{o, 1}^{'} = θ_{o, 1} \land θ_{o, 1} = θ_{o, 2}] \Rightarrow [θ_{o, 2}^{'} = θ_{o, 1}^{'}]$ ;
$[θ_{c, 2}^{'} = {OUT}^{c t r} (θ_{c, 2}, γ^{'}) \land θ_{c, 1}^{'} = {OUT}^{c t r} (θ_{c, 2}, γ) \land θ_{c, 2} \geq θ_{c, 1}] \Rightarrow [θ_{c, 2}^{'} \geq θ_{c, 1}^{'}]$ .

Moreover, since

ψ (μ_{1}) = ψ (μ_{2})

and

f^{- 1} (ψ^{f} (μ_{1})) = f^{- 1} (ψ^{f} (μ_{2}))

,

ψ (μ_{1} g (γ)) = ψ (μ_{2} g (γ^{'}))

and

f^{- 1} (ψ^{f} (μ_{1} g (γ))) = f^{- 1} (ψ^{f} (μ_{2} g (γ^{'})))

. □

Appendix C. Proof of Proposition 3

Proof.

We first introduce the following notation.

Given any

θ_{c} = (π_{1}, n_{1}) \dots (π_{k}, n_{k}) \in Θ_{c}

and any

θ_{c}^{'} = (π_{1}^{'}, n_{1}^{'}) \dots (π_{k}^{'}, n_{k}^{'}) \in Θ_{c}

, we say

θ_{c} \geq θ_{c}^{'}

if

(\forall i = 1, \dots, k) π_{i} \supseteq π_{i}^{'} \land n_{i} = n_{i}^{'}

. Note that

ε \geq ε

holds.

Since

ν \in L (G_{z})

, let us write

δ_{z} (z, ν^{i}) = (p^{i}, γ^{i}, ω_{o}^{i}, ω_{c}^{i})

for

i = 1, \dots, | ν |

. Next, we prove

δ_{S} (q_{0, S}, μ ν^{i})!

and

δ_{S} (q_{0, S}, μ ν^{i}) = (q^{i}, π^{i}, θ_{o}^{i}, θ_{c}^{i}, x^{i})

with

q^{i} = p^{i}

,

π^{i} \supseteq γ^{i}

,

θ_{o}^{i} = ω_{o}^{i}

, and

θ_{c}^{i} \geq ω_{c}^{i}

for

i = 1, \dots, | ν |

.

Since

δ_{z} (z, ε) = (q, π, θ_{o}, θ_{c})

and

δ_{S} (q_{0, S}, μ) = (q, π, θ_{o}, θ_{c}, x)

, the base case is true. The induction hypothesis is that for all

μ ν^{i}

with

i \leq k

, we have

δ_{S} (q_{0, S}, μ ν^{i}) = (q^{i}, π^{i}, θ_{o}^{i}, θ_{c}^{i}, x^{i})

with

q^{i} = p^{i}

,

π^{i} \supseteq γ^{i}

,

θ_{o}^{i} = ω_{o}^{i}

, and

θ_{c}^{i} \geq ω_{c}^{i}

. We now prove the same is also true for

μ ν^{k + 1} = μ ν^{k} e

. By definition, (i)

e = σ \in Σ

, (ii)

e = f (σ) \in Σ_{f}

, or (iii)

e = g (γ) \in Σ_{g}

.

Case 1:

e = σ \in Σ

. Since

δ_{z} (z, ν^{k}) = (p^{k}, γ^{k}, ω_{o}^{k}, ω_{c}^{k})

and

δ_{z} (z, ν^{k} σ) = (p^{k + 1}, γ^{k + 1}, ω_{o}^{k + 1}, ω_{c}^{k + 1}),

by Equation (15), we have

$δ (p^{k}, σ)! \land σ \in γ^{k} \land MAX ({(ω_{o}^{k})}^{+}) \leq N_{o} \land MAX ({(ω_{c}^{k})}^{+}) \leq N_{c}$ ;
$p^{k + 1} = δ (p^{k}, σ)$ , $γ^{k + 1} = γ^{k}$ , $ω_{o}^{k + 1} = {IN}^{o b s} (ω_{o}^{k}, σ)$ , and $ω_{c}^{k + 1} = PLUS (ω_{c}^{k})$ .

By induction hypothesis,

δ_{S} (q_{0, S}, μ ν^{k}) = (q^{k}, π^{k}, θ_{o}^{k}, θ_{c}^{k}, x^{k})

with

q^{k} = p^{k}

,

π^{k} \supseteq γ^{k}

,

θ_{o}^{k} = ω_{o}^{k}

, and

θ_{c}^{k} \geq ω_{c}^{k}

. Hence, we have

δ (q^{k}, σ)! \land σ \in π^{k} \land MAX ({(θ_{o}^{k})}^{+}) \leq N_{o} \land MAX ({(θ_{c}^{k})}^{+}) \leq N_{c} .

By Equation (6),

δ_{S} (q_{0, S}, μ ν^{k} σ) = (q^{k + 1}, π^{k + 1}, θ_{o}^{k + 1}, θ_{c}^{k + 1}, x^{k + 1})

, where

q^{k + 1} = δ (q^{k}, σ)

,

π^{k + 1} = π^{k}

,

θ_{o}^{k + 1} = {IN}^{o b s} (θ_{o}^{k}, σ)

,

θ_{c}^{k + 1} = PLUS (θ_{c}^{k})

, and

x^{k + 1} = x^{k}

. Therefore,

p^{k + 1} = q^{k + 1} \land π^{k + 1} \supseteq γ^{k + 1} \land θ_{o}^{k + 1} = ω_{o}^{k + 1} \land θ_{c}^{k + 1} \geq ω_{c}^{k + 1}

.

Case 2:

e = f (σ) \in Σ_{f}

. Since

δ_{z} (z, ν^{k}) = (p^{k}, γ^{k}, ω_{o}^{k}, ω_{c}^{k})

and

δ_{z} (z, ν^{k} f (σ)) = (p^{k + 1}, γ^{k + 1}, ω_{o}^{k + 1}, ω_{c}^{k + 1}),

by Equation (16), we have

${OUT}^{o b s} (ω_{o}^{k}, σ)!$ ;
$p^{k + 1} = p^{k}$ , $γ^{k + 1} = γ^{k}$ , $ω_{o}^{k + 1} = {OUT}^{o b s} (ω_{o}^{k}, σ)$ , and $ω_{c}^{k + 1} = {IN}^{c t r} (ω_{c}^{k}, Σ_{u c})$ .

By induction hypothesis,

δ_{S} (q_{0, S}, μ ν^{k}) = (q^{k}, π^{k}, θ_{o}^{k}, θ_{c}^{k}, x^{k})

with

q^{k} = p^{k}

,

π^{k} \supseteq γ^{k}

,

θ_{o}^{k} = ω_{o}^{k}

, and

θ_{c}^{k} \geq ω_{c}^{k}

. Hence,

{OUT}^{o b s} (θ_{o}^{k}, σ)!

By Equation (7),

δ_{S} (q_{0, S}, μ ν^{k} f (σ)) = (q^{k + 1}, π^{k + 1}, θ_{o}^{k + 1},

θ_{c}^{k + 1}, x^{k + 1})

, where

q^{k + 1} = q^{k}

,

π^{k + 1} = π^{k}

,

θ_{o}^{k + 1} = {OUT}^{o b s} (θ_{o}^{k}, σ)

,

θ_{c}^{k + 1} = {IN}^{c t r} (θ_{c}^{k}, χ (ξ (x^{k}, σ)))

, and

x^{k + 1} = ξ (x^{k}, σ)

. Since

χ (ξ (x^{k}, σ)) = Σ_{u c}

,

θ_{c}^{k + 1} = {IN}^{c t r} (ω_{c}^{k}, Σ_{u c})

. Thus,

p^{k + 1} = q^{k + 1} \land π^{k + 1} \supseteq γ^{k + 1} \land θ_{o}^{k + 1} = ω_{o}^{k + 1} \land θ_{c}^{k + 1} \geq ω_{c}^{k + 1}

.

Case 3:

e = g (γ) \in Σ_{g}

. Since

δ_{z} (z, ν^{k}) = (p^{k}, γ^{k}, ω_{o}^{k}, ω_{c}^{k})

and

δ_{z} (z, ν^{k} g (γ)) = (p^{k + 1}, γ^{k + 1}, ω_{o}^{k + 1}, ω_{c}^{k + 1}),

by Equation (17), we have

${OUT}^{c t r} (ω_{c}^{k}, γ)!$ ;
$p^{k + 1} = p^{k}$ , $γ^{k + 1} = γ$ , $ω_{o}^{k + 1} = ω_{o}^{k}$ , and $ω_{c}^{k + 1} = {OUT}^{c t r} (ω_{c}^{k}, γ)$ .

By induction hypothesis,

δ_{S} (q_{0, S}, μ ν^{k}) = (q^{k}, π^{k}, θ_{o}^{k}, θ_{c}^{k}, x^{k})

with

q^{k} = p^{k}

,

π^{k} \supseteq γ^{k}

,

θ_{o}^{k} = ω_{o}^{k}

, and

θ_{c}^{k} \geq ω_{c}^{k}

. Hence,

{OUT}^{c t r} (θ_{c}^{k}, γ)!

By Equation (8),

δ_{S} (q_{0, S}, μ ν^{k} g (γ)) = (q^{k + 1}, π^{k + 1}, θ_{o}^{k + 1}, θ_{c}^{k + 1},

x^{k + 1})

, where

q^{k + 1} = q^{k}

,

π^{k + 1} = γ

,

θ_{o}^{k + 1} = θ_{o}^{k}

,

θ_{c}^{k + 1} = {OUT}^{c t r} (θ_{c}^{k}, γ)

, and

x^{k + 1} = x^{k}

. Therefore,

p^{k + 1} = q^{k + 1} \land π^{k + 1} \supseteq γ^{k + 1} \land θ_{o}^{k + 1} = ω_{o}^{k + 1} \land θ_{c}^{k + 1} \geq ω_{c}^{k + 1}

.

Overall, we have

δ_{S} (q_{0, S}, μ ν)!

. By Definition 3,

ψ (μ ν) \in L (S / G)

. □

Appendix D. Proof of Theorem 2

Proof.

(\Leftarrow)

The proof is by contradiction. Suppose

(\exists i \in {1, \dots, | t |}) {\tilde{E}}_{S_{t}} (t^{i}) \cap T_{s p e c} \neq \emptyset

. w.l.o.g., we write

z = (q, π, θ_{o}, θ_{c}) \in {\tilde{E}}_{S_{t}} (t^{i}) \cap T_{s p e c}

. Let

S_{t} = (A, χ)

with

A = (X, Σ_{o}, ξ, x_{0})

. Let

G_{S_{t}} = (Q_{S_{t}}, Σ_{S_{t}}, δ_{S_{t}}, q_{0, S_{t}})

be the automaton constructed using procedures proposed in Section 3.2. Since

z \in {\tilde{E}}_{S_{t}} (t^{i})

, by Theorem 1,

\exists μ \in L (G_{S_{t}})

with

f^{- 1} (ψ^{f} (μ)) = t^{i}

such that

δ_{S_{t}} (q_{0, S_{t}}, μ) = (p, γ, ω_{o}, ω_{c}, x)

and

p = q \land γ = π \land ω_{o} = θ_{o} \land ω_{c} = θ_{c}

. Meanwhile, since

z \in T_{s p e c}

, by Equation (19),

FC (Q_{z}) \cap (Q \ Q_{H}) \neq \emptyset

. By Equation (18), there exists

ν \in L (G_{z})

, such that

δ (q, ψ (ν)) \in Q \ Q_{H}

. Since

ν \in L (G_{z})

, by Proposition 3,

ψ (μ ν) \in L (S_{t} / G)

. Since

δ_{S_{t}} (q_{0, S_{t}}, μ) = (q, π, θ_{o}, θ_{c}, x)

, by Proposition 1,

δ (q_{0}, ψ (μ)) = q

. Moreover, since

δ (q, ψ (ν)) \in Q \ Q_{H}

,

δ (q_{0}, ψ (μ ν)) \in Q \ Q_{H}

. Therefore,

L (S_{t} / G) \neg \subseteq K

, which contradicts

L (S_{t} / G) \subseteq K

.

Moreover,

(\Rightarrow)

is also by contradiction. Suppose there exists

s \in L (S_{t} / G)

, such that

s \notin K

. By Definition 3,

\exists μ \in L (G_{S_{t}})

with

s = ψ (μ)

. We write

μ = μ_{1} μ_{2}

such that

μ_{1} \in \bar{μ}

is the longest prefix of

μ

with

f^{- 1} (ψ^{f} (μ_{1})) \in \bar{t}

. Since

f^{- 1} (ψ^{f} (μ_{1})) \in \bar{t}

, we have

f^{- 1} (ψ^{f} (μ_{1})) = t^{j}

for some

j \in {0, 1, \dots, | t |}

. We write

δ_{S_{t}} (q_{0, S_{t}}, μ_{1}) = (q, π, θ_{o}, θ_{c}, x)

and

z = (q, π, θ_{o}, θ_{c})

. By Theorem 1,

z \in {\tilde{E}}_{S_{t}} (f^{- 1} (ψ^{f} (μ_{1}))) = {\tilde{E}}_{S_{t}} (t^{j})

. We next prove

z \in T_{s p e c}

.

Since

μ = μ_{1} μ_{2} \in L (G_{S_{t}})

, we write

δ_{S_{t}} (q_{0, S_{t}}, μ_{1} μ_{2}^{i}) = (p^{i}, γ^{i}, ω_{o}^{i}, ω_{c}^{i}, y^{i})

for

i = 0, 1, \dots, | μ_{2} |

. Clearly,

(p^{0}, γ^{0}, ω_{o}^{0}, ω_{c}^{0}) = (q, π, θ_{o}, θ_{c})

. We now prove

δ_{z} (z, μ_{2}^{i}) = (p^{i}, γ^{i}, ω_{o}^{i}, ω_{c}^{i})

by induction on

μ_{2}^{i}

for

i = 0, 1, \dots, | μ_{2} |

.

The base case is true since

δ_{z} (z, μ_{2}^{0}) = z = (q, π, θ_{o}, θ_{c})

with

(q, π, θ_{o}, θ_{c}) = (p^{0}, γ^{0}, ω_{o}^{0}, ω_{c}^{0})

. The induction hypothesis is that for all

μ_{2}^{i}

with

i \leq k

,

δ_{z} (z, μ_{2}^{i}) = (p^{i}, γ^{i}, ω_{o}^{i}, ω_{c}^{i})

. We next prove the same is also true for

μ_{2}^{k + 1} = μ_{2}^{k} e

. Since

δ_{S_{t}} (q_{0, S_{t}}, μ_{1} μ_{2}^{i}) = (p^{i}, γ^{i}, ω_{o}^{i}, ω_{c}^{i}, y^{i})

for

i \leq | μ_{2} |

,

δ_{S_{t}} (q_{0, S_{t}}, μ_{1} μ_{2}^{k}) = (p^{k}, γ^{k}, ω_{o}^{k}, ω_{c}^{k}, y^{k})

and

δ_{S_{t}} (q_{0, S_{t}}, μ_{1} μ_{2}^{k} e) = (p^{k + 1}, γ^{k + 1}, ω_{o}^{k + 1}, ω_{c}^{k + 1}, y^{k + 1}) .

By definition, (i)

e = σ \in Σ

, (ii)

e = f (σ) \in Σ_{f}

, or (iii)

e = g (γ) \in Σ_{g}

.

Case 1:

e = σ \in Σ

. By Equation (6), we have

$δ (p^{k}, σ)! \land σ \in γ^{k} \land MAX ({(ω_{o}^{k})}^{+}) \leq N_{o} \land MAX ({(ω_{c}^{k})}^{+}) \leq N_{c}$ ;
$p^{k + 1} = δ (p^{k}, σ)$ , $γ^{k + 1} = γ^{k}$ , $ω_{o}^{k + 1} = {IN}^{o b s} (ω_{o}^{k}, σ)$ , $ω_{c}^{k + 1} = PLUS (ω_{c}^{k})$ , and $y^{k + 1} = y^{k}$ .

Since

δ_{z} (z, μ_{2}^{k}) = (p^{k}, γ^{k}, ω_{o}^{k}, ω_{c}^{k})

, by Equation (15),

δ_{z} (z, μ_{2}^{k} σ) = (δ (p^{k}, σ), γ^{k}, {IN}^{o b s} (ω_{o}^{k}, σ), PLUS (ω_{c}^{k})) .

Therefore,

δ_{z} (z, μ_{2}^{k} σ) = (p^{k + 1}, γ^{k + 1}, ω_{o}^{k + 1}, ω_{c}^{k + 1})

.

Case 2:

e = f (σ) \in Σ_{f}

. By Equation (7), we have

${OUT}^{o b s} (θ_{o}^{k}, σ)!$ ;
$p^{k + 1} = p^{k}$ , $γ^{k + 1} = γ^{k}$ , $ω_{o}^{k + 1} = {OUT}^{o b s} (ω_{o}^{k}, σ)$ , $ω_{c}^{k + 1} = {IN}^{c t r} (ω_{c}^{k}, χ (ξ (y^{k}, σ)))$ , and $y^{k + 1} = ξ (y^{k}, σ)$ .

By Proposition 1,

y^{k + 1} = ξ (x_{0}, f^{- 1} (ψ^{f} (μ_{1} μ_{2}^{k} f (σ))))

. Since

f^{- 1} (ψ^{f} (μ_{1} μ_{2}^{k} f (σ))) \notin \bar{t}

,

χ (y^{k + 1}) = Σ_{u c}

. Thus,

ω_{c}^{k + 1} = {IN}^{c t r} (θ_{c}^{k}, Σ_{u c})

. Since

δ_{z} (z, μ_{2}^{k}) = (p^{k}, γ^{k}, ω_{o}^{k}, ω_{c}^{k})

, by Equation (16),

δ_{z} (z, μ_{2}^{k} σ) = (p^{k}, γ^{k}, {OUT}^{o b s} (ω_{o}^{k}, σ), {IN}^{c t r} (θ_{c}^{k}, Σ_{u c})) .

Thus,

δ_{z} (z, μ_{2}^{k} f (σ)) = (p^{k + 1}, γ^{k + 1}, ω_{o}^{k + 1}, ω_{c}^{k + 1})

.

Case 3:

e = g (γ) \in Σ

. By Equation (8), we have

${OUT}^{c t r} (ω_{c}^{k}, γ)!$ ;
$p^{k + 1} = p^{k}$ , $γ^{k + 1} = γ$ , $ω_{o}^{k + 1} = ω_{o}^{k}$ , $ω_{c}^{k + 1} = {OUT}^{c t r} (ω_{c}^{k}, γ)$ , and $y^{k + 1} = y^{k}$ .

Since

δ_{z} (z, μ_{2}^{k}) = (p^{k}, γ^{k}, ω_{o}^{k}, ω_{c}^{k})

, by Equation (17),

δ_{z} (z, μ_{2}^{k} g (γ)) = (p^{k}, γ, ω_{o}^{k}, {OUT}^{c t r} (ω_{c}^{k}, γ))

. Therefore,

δ_{z} (z, μ_{2}^{k} g (γ)) = (p^{k + 1}, γ^{k + 1}, ω_{o}^{k + 1}, ω_{c}^{k + 1})

.

Overall, we have

δ_{z} (z, μ_{2}) = (p^{| μ_{2} |}, γ^{| μ_{2} |}, ω_{o}^{| μ_{2} |}, ω_{c}^{| μ_{2} |})

. Since

δ_{S_{t}} (q_{0, S_{t}}, μ_{1} μ_{2}) = (p^{| μ_{2} |}, γ^{| μ_{2} |}, ω_{o}^{| μ_{2} |}, ω_{c}^{| μ_{2} |}, y^{| μ_{2} |})

and

ψ (μ_{1} μ_{2}) \in L (G) \ K

, by Proposition 1,

p^{| μ_{2} |} \in Q \ Q_{H}

. By Equation (18),

z \in T_{s p e c}

. Since

z \in {\tilde{E}}_{S} (t^{j})

, we have

z \in {\tilde{E}}_{S} (t^{j}) \cap T_{s p e c}

, which contradicts

(\forall i = 1, \dots, | t |) {\tilde{E}}_{S} (t^{i}) \cap T_{s p e c} = \emptyset

. □

References

Ramadge, P.J.; Wonham, W.M. Supervisory Control of a Class of Discrete Event Processes. SIAM J. Control Optim. 1987, 25, 206–230. [Google Scholar] [CrossRef]
Lin, F.; Wonham, W.M. On observability of discrete-event systems. Inf. Sci. 1988, 44, 173–198. [Google Scholar] [CrossRef]
Lin, F.; Wonham, W.M. Decentralized Supervisory Control of Discrete-event Systems. Inf. Sci. 1988, 44, 199–224. [Google Scholar] [CrossRef]
Lin, F. Robust and adaptive supervisory control of discrete event systems. IEEE Trans. Autom. Control 1993, 38, 1848–1852. [Google Scholar] [CrossRef]
Rashidinejad, A.; Reniers, M.; Fabian, M. Supervisor control of discrete-event systems in an asynchronous setting. In Proceedings of the 2019 IEEE 15th International Conference on Automation Science and Engineering (CASE), Vancouver, BC, Canada, 22–26 August 2019; pp. 6730–6735. [Google Scholar]
Ji, Y.; Yin, X.; Lafortune, S. Local Mean Payoff Supervisory Control for Discrete Event Systems. IEEE Trans. Autom. Control 2022, 67, 2282–2297. [Google Scholar] [CrossRef]
Rohloff, K. Sensor failure tolerant supervisory control. In Proceedings of the 44th IEEE Conference on Decision and Control (CDC), Seville, Spain, 15 December 2005; pp. 3493–3498. [Google Scholar]
Park, S.J.; Cho, K.H. Delay-robust supervisory control of discrete-event systems with bounded communication delays. IEEE Trans. Autom. Control 2006, 51, 2282–2297. [Google Scholar] [CrossRef]
Pruekprasert, S.; Ushio, T. Supervisory Control of Communicating Timed Discrete Event Systems for State Avoidance Problem. IEEE Control Syst. Lett. 2019, 4, 259–264. [Google Scholar] [CrossRef]
Sadid, W.H.; Ricker, L.; Hashtrudi-Zad, S. Robustness of synchronous communication protocols with delay for decentralized discrete-event control. Discret. Event Dyn. Syst. 2015, 25, 159–176. [Google Scholar] [CrossRef]
Zhang, R.; Cai, K.; Gan, Y.; Wonham, W.M. Distributed supervisory control of discrete-event systems with communication delay. Discret. Event Dyn. Syst. 2016, 26, 263–293. [Google Scholar] [CrossRef][Green Version]
Zhang, R.; Cai, K.; Gan, Y.; Wonham, W.M. Delay-Robustness in Distributed Control of Timed Discrete-Event Systems Based on Supervisor Localization. Int. J. Control 2014, 89, 2055–2072. [Google Scholar] [CrossRef]
Zgorzelski, M.; Lunze, J. A new approach to tracking control of networked discrete-event systems. IFAC-PapersOnLine 2018, 51, 448–455. [Google Scholar] [CrossRef]
Zgorzelski, M.; Lunze, J. A method for the synchronisation of networked discrete-event systems. In Proceedings of the Proceedings of the 13th International Workshop on Discrete Event Systems (WODES), Xi’an, China, 30 May 2016–1 June 2016; pp. 444–451. [Google Scholar]
Yang, S.; Hou, J.; Yin, X.; Li, S. Opacity of Networked Supervisory Control Systems over Insecure Communication Channels. IEEE Trans. Control Netw. Syst. 2021, 8, 884–896. [Google Scholar] [CrossRef]
Takai, S. A general framework for diagnosis of discrete event systems subject to sensor failures. Automatica 2021, 129, 109669. [Google Scholar] [CrossRef]
Lin, L.; Zhu, Y.; Tai, R.; Ware, S.; Su, R. Networked supervisor synthesis against lossy channels with bounded network delays as non-networked synthesis. Automatica 2022, 142, 110279. [Google Scholar] [CrossRef]
Zhu, Y.; Lin, L.; Tai, R.; Su, R. Distributed Control of Timed Networked System against Communication Delays. In Proceedings of the 2022 IEEE 17th International Conference on Control and Automation (ICCA), Naples, Italy, 27–30 June 2022; pp. 1008–1013. [Google Scholar]
Zhou, L.; Shu, S.; Lin, F. Detectability of Discrete-Event Systems Under Nondeterministic Observations. IEEE Trans. Autom. Sci. Eng. 2021, 18, 1315–1327. [Google Scholar] [CrossRef]
Tai, R.; Lin, L.; Zhu, Y.; Su, R. A new modeling framework for networked discrete-event systems. Automatica 2022, 138, 1–7. [Google Scholar] [CrossRef]
Lin, F. Control of networked discrete event systems: Dealing with communication delays and losses. SIAM J. Control Optim. 2014, 52, 1276–1298. [Google Scholar] [CrossRef]
Shu, S.; Lin, F. Supervisor synthesis for networked discrete event systems with communication delays. IEEE Trans. Autom. Control 2015, 60, 2183–2188. [Google Scholar] [CrossRef]
Shu, S.; Lin, F. Predictive networked control of discrete event systems. IEEE Trans. Autom. Control 2017, 62, 4698–4705. [Google Scholar] [CrossRef]
Shu, S.; Lin, F. Deterministic networked control of discrete event systems with nondeterministic communication delays. IEEE Trans. Autom. Control 2017, 62, 190–205. [Google Scholar] [CrossRef]
Wang, F.; Shu, S.; Lin, F. Robust networked control of discrete event systems. IEEE Trans. Autom. Sci. Eng. 2016, 13, 1258–1540. [Google Scholar] [CrossRef]
Liu, Z.; Yin, X.; Shu, S.; Lin, F.; Li, S. Online supervisory control of networked discrete-event systems with control delays. IEEE Trans. Auto. Control 2021, 2021, 1. [Google Scholar]
Zhao, B.; Lin, F.; Wang, C.; Zhang, X.; Polis, M.; Wang, L. Supervisory control of networked timed discrete event systems and its applications to power distribution networks. IEEE Trans. Control Netw. Syst. 2017, 4, 146–158. [Google Scholar] [CrossRef]
Alves, M.; Carvalho, L.; Basilio, J. Supervisory Control of Networked Discrete Event Systems with Timing Structure. IEEE Trans. Autom. Control 2021, 66, 2206–2218. [Google Scholar] [CrossRef]
Rashidinejad, A.; Reniers, M.; Feng, L. Supervisory control of timed discrete-event systems subject to communication delays and non-fifo observations. IFAC-PapersOnLine 2018, 51, 456–463. [Google Scholar] [CrossRef]
Zhu, Y.; Lin, L.; Simon, W.; Su, R. Supervisor synthesis for networked discrete event systems with communication delays and lossy channels. In Proceedings of the IEEE 58th Conference on Decision and Control (CDC), Nice, France, 11–13 December 2019; pp. 6730–6735. [Google Scholar]
Lin, F.; Heymann, M. On-line control of partially observed discrete event systems. Discret. Event Dyn. Syst. Theory Appl. 1994, 4, 221–236. [Google Scholar]
Hadj-Alouane, N.; Lafortune, S.; Lin, F. Centralized and distributed algorithms for on-Line synthesis of maximal control policies under partial observation. Discret. Event Dyn. Syst. Theory Appl. 1996, 6, 379–427. [Google Scholar] [CrossRef]
Yin, X.; Lafortune, S. Synthesis of Maximally Permissive Supervisors for Partially-Observed Discrete-Event Systems. IEEE Trans. Autom. Control 2016, 61, 1239–1254. [Google Scholar] [CrossRef]
Yin, X.; Lafortune, S. A Uniform Approach for Synthesizing Property-Enforcing Supervisors for Partially-Observed Discrete-Event Systems. IEEE Trans. Autom. Control 2016, 61, 2140–2154. [Google Scholar] [CrossRef]
Yin, X.; Lafortune, S. Synthesis of Maximally-Permissive Supervisors for the Range Control Problem. IEEE Trans. Autom. Control 2017, 62, 3914–3929. [Google Scholar] [CrossRef]
Yin, X.; Lafortune, S. Synthesis of Maximally Permissive Nonblocking Supervisors for the Lower Bound Containment Problem. IEEE Trans. Autom. Control 2018, 63, 4435–4441. [Google Scholar] [CrossRef]
Hou, Y.; Wang, W.; Zang, Y.; Lin, F.; Yu, M.; Gong, C. Relative network observability and its relation with network observability. IEEE Trans. Autom. Control 2020, 65, 3584–3735. [Google Scholar] [CrossRef]
Wang, F.; Shu, S.; Lin, F. On network observability of discrete event system. In Proceedings of the IEEE 54th Conference on Decision and Control (CDC), Osaka, Japan, 15–18 December 2015; pp. 3528–3533. [Google Scholar]
Lin, F.; Wang, W.; Han, L.; Shen, B. State estimation of multi-channel networked discrete event systems. IEEE Trans. Control Netw. Syst. 2020, 7, 53–63. [Google Scholar] [CrossRef]
Cassandras, C.G.; Lafortune, S. Introduction to Discrete Event Systems, 2nd ed.; Springer: New York, NY, USA, 2007. [Google Scholar]

Figure 1. Supervisory control of networked DESs.

Figure 2. System G and Supervisor

S = (A, χ)

.

Figure 2. System G and Supervisor

S = (A, χ)

.

Figure 3. The interaction process between the plant and the supervisor.

Figure 4. Automaton model

G_{S}

in Example 2.

Figure 4. Automaton model

G_{S}

in Example 2.

Figure 5. Online state estimation under communication delays.

Figure 6. Uncontrolled system G and desired system H.

Figure 7. A signalized intersection.

Figure 8. System G and desired system H.

Figure 9. Supervisor

S^{*}

.

Figure 9. Supervisor

S^{*}

.

Figure 10. Supervisory control of networked DESs with non-FIFO observations and controls.

Table 1. Events in the transport safety model.

Events	Description	Controllable	Observable
a	Vehicle x arrives at the intersection	Yes	No
p	Vehicle x leaves the intersection	Yes	No
y	The traffic light is switched to yellow	No	Yes
$g_{1}$	The traffic light is in the first half of the green cycle	No	Yes
$g_{2}$	The traffic light is in the second half of the green cycle	No	Yes
$r_{1}$	The traffic light is in the first half of the red cycle	No	Yes
$r_{2}$	The traffic light is in the second half of the red cycle	No	Yes

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Hou, Y.; Shen, Y.; Li, Q.; Ji, Y.; Li, W. Modeling and Optimal Supervisory Control of Networked Discrete-Event Systems and Their Application in Traffic Management. Mathematics 2023, 11, 3. https://doi.org/10.3390/math11010003

AMA Style

Hou Y, Shen Y, Li Q, Ji Y, Li W. Modeling and Optimal Supervisory Control of Networked Discrete-Event Systems and Their Application in Traffic Management. Mathematics. 2023; 11(1):3. https://doi.org/10.3390/math11010003

Chicago/Turabian Style

Hou, Yunfeng, Yanni Shen, Qingdu Li, Yunfeng Ji, and Wei Li. 2023. "Modeling and Optimal Supervisory Control of Networked Discrete-Event Systems and Their Application in Traffic Management" Mathematics 11, no. 1: 3. https://doi.org/10.3390/math11010003

APA Style

Hou, Y., Shen, Y., Li, Q., Ji, Y., & Li, W. (2023). Modeling and Optimal Supervisory Control of Networked Discrete-Event Systems and Their Application in Traffic Management. Mathematics, 11(1), 3. https://doi.org/10.3390/math11010003

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Modeling and Optimal Supervisory Control of Networked Discrete-Event Systems and Their Application in Traffic Management

Abstract

1. Introduction

2. Preliminaries

3. Modeling Framework for Networked Supervisory Control

3.1. Modeling of the Communication Channels

3.2. Language of the Closed-Loop System

3.3. Problem Formulation

4. State Estimation under Communication Delays

5. Online Network Supervisory Control

5.1. State Prediction

5.2. Online Algorithm

5.3. Comparison with the Existing Work

6. Application in Traffic Control

7. Extension of the Proposed Framework

8. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Conflicts of Interest

Appendix A. Proof of Proposition 1

Appendix B. Proof of Proposition 2

Appendix C. Proof of Proposition 3

Appendix D. Proof of Theorem 2

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI