1. Introduction
In this article, we consider a control problem of multidimensional objects. In [
1], Feldbaum suggested dual control theory. Dual control algorithms combine control and object learning processes. This theory was extensively developed by Wittenmark and Astrom [
2,
3] who suggested applying dual control algorithms in two cases: a short time horizon and rapidly changing object parameters. In the first stages of theory development, it was used for linear stochastic systems with unknown parameters [
4,
5]. Dual control algorithms were developed for the case of parametric uncertainty [
6]. Using this type of dual algorithm assumes that the structure of the true system is a priory known and the control task is to optimize its parameters. The dual control approach is widely used in the development of model predictive control (MPC) systems [
7]. Thus, in [
8,
9], an adaptive MPC strategy was suggested for linear multi-input multi-output systems. In [
10], the MPC approach for model-structure uncertainty is introduced. The authors highlight the beneficial effect of MPC with active learning under parametric or structural model uncertainty. In practice, dual control algorithms were applied in a wide variety of fields, such as diabetes investigation [
11], a semi-batch reactor equipped with a cooling jacket modeling [
12], energy hub modeling [
13] and a penicillin fermentation process control [
14].
In practice, there is not much a priori information about the control process. In [
15], the author presents an overview of adaptive control methods based on how much model information is needed. Processes in the industry (metallurgy, chemical industry, mining industry, production of electronic components, etc.) are complex and the researcher has no data on the mathematical structure of the system. It could be difficult and time-consuming to build an accurate parametric model. Thus, a parametric approach for constructing control systems might be impractical [
16]. Therefore, data-driven or modal-free methods for creating controllers have become widespread [
16]. One of these [
17] shows some results in constructing a control system on data-dependent matrices that can replace systems models. In [
18], dual control algorithms based on neural networks are used to approximate a priory unknown functions. The neural approach is applicable for multidimensional dynamic systems with unknown structures [
19]. Another method that allows modeling systems with unknown structures is the echo state networks [
20]. A dual control algorithm for multidimensional dynamic objects using nonparametric estimation of a reverse regression function was suggested in [
21].
In control systems, the value of desired output is set by an expert. It is not frequent that the knowledge of experts is precise and such imprecise knowledge of experts should be represented by interval numbers for reflecting the imprecision. To solve the dual control problem we will use interval arithmetic which is called single-level constraint interval arithmetic (SLCIA) [
22]. There are also some articles devoted to a framework of interval-based data analysis in control problems [
23,
24,
25]. In this study, we include interval values in a nonparametric dual control algorithm for multidimensional systems. Similar to the work [
24], we use SLCIA to calculate control actions as it simplifies the process of calculations and computer implementation.
In the existing methods [
26], traditional fuzzy control algorithms are used to deal with dynamic systems which can be described with differential equations. In real applications, sometimes it is impossible to take into account the dynamics of the process if the measurement interval of the output variable is more than the output time constant. So it is impossible to take into account the dynamics of the process in the control system. We study static objects with transport delay. In the presence of such delay, it is possible to make a shift in the observation matrix by the delay value to bring the one-to-one correspondence between the values of the input and output variables and not take it into account in further reasoning.
Thus, the main problem considered in the article is the construction of control algorithms under conditions of uncertainty. For this, a synthesis of the following approaches is proposed. The first is the dual control theory for control in the absence of a training sample for setting up the control device. Dual control theory is used to combine two competing goals: training and control. In [
27,
28], a dual control algorithm for stochastic systems with multiple uncertainties is suggested for crisp values. The second one is the theory of nonparametric control for objects whose mathematical description is a priori unknown up to parameters. It is a general-purpose algorithm, meaning it does not depend on the object’s mathematical description. It can be applied to a wide class of objects with known qualitative properties (dynamic or static, for example). Moreover, SLCIA is introduced for working with interval values of setpoints. In [
29], SLCIA was used for the fuzzy interval optimal control problem. We propose this approach to the dual control problems for interval variables under uncertainties. In this paper, for the first time, a nonparametric interval dual control algorithm is proposed.
The rest of the paper is organized as follows. In
Section 2, we present the formulation of the dual control problem. In
Section 3, we propose a nonparametric dual control algorithm. In
Section 4, the results of the numerical experiments of modeling multidimensional objects are described. We conclude our work in
Section 5.
2. Problem Formulation under Interval Uncertainty
Consider a control system, whose general scheme is shown in
Figure 1. The notation is as follows:
is an output variable of the process,
is a vector of set points,
is a control input vector,
is a vector random disturbances,
are the system response channels corresponding to output variables and including control tools,
is the random inaccuracy of measurements of output variables of the process with zero mathematic expectation and limited dispersion.
The input and output variables are continuous because of the nature of the process but the measurements are made at discrete times due to control tools so we investigate discrete-continuous systems. Such systems are also called hybrids as the continuous part consists of multiple-operation technological chains and the discrete part consists of digital controllers [
30]. The agreed notation is as follows:
,
—the
ith measurement of
jth component of the control variable
u;
,
,
—the
ith measurement of
jth component of the output variable
x. We have an initial sample of observations
, where
s is sample size.
The task of the control unit is to generate such a control action
u that the difference between object output value
x and the value
is minimal. In the previous paragraph, we commented on the case when all variables were considered crisp numbers. However, in practice, it is usual that decision information is uncertain. It might not be flexible or convenient for decision-makers to exactly quantify their opinions with crisp numbers. A possible solution to model and deal with such uncertainty is by means of interval values. In this regard, and following, we introduce and formulate a situation when values of reference variables are intervals. For that, we use intervals notation proposed by Stefanini and Bede in [
31] to define the set of real intervals as
where
notes the classic real interval. In multidimensional case,
that is,
is the space of nonempty compact and convex sets of
n-dimensional real numbers
.
The value of desired output is set by an expert. So, we refer to new information given by an expert and due to this information, we deal with intervals. For this purpose, we make the transition from to . Under the assumption, we are going to obtain intervals instead of crisp values for control variables. We introduce a new notation of control variable for interval values. We use different notations , , for observations, which we obtain by measuring input and output variables of the process and , , for approximation.
3. SLCIA Basic Concepts
On the topic of interval arithmetic and analysis we can find discussions and notations by Stefanini and Bede in [
31], Moore [
32,
33], and Alefeld and Herzberger [
34], among others. In [
22], it is proposed a variant of constraint interval arithmetic (CIA) that operates with a single parameter (level) in each interval operand of an expression, called single-level constraint interval arithmetic (SLCIA). This arithmetic was used in the discrete-time interval optimal control problem [
29], and in the next section, we propose its extension to the evaluation of expressions in interval-valued dual control problems.
Let us bring the basic definitions of single-level constraint interval arithmetic [
22].
Definition 1 ([
22])
. Let be any interval. Then- 1.
A continuous function such that will be called a constraint function associated with A.
- 2.
Associated with the interval A we define the decreasing convex constraint function by means - 3.
Associated with the interval A we define the increasing convex constraint function by means
For discussions and examples of SLCIA, we refer to [
22], particularly, for the evaluation of expression with intervals. In this regard, we highlight the following definitions of expression in interval arithmetic, with a role for the calculus of interval-valued expressions.
Definition 2 ([
22])
. An expression is a correct expression in interval arithmetic if is a correctly constructed expression in a formal language for arithmetic operations with real number operands and usual arithmetic operations on real numbers. Definition 3 ([
22])
. Let be the decreasing convex constraint functions associated to , . The evaluation of a correct expression is performed according to the following rule:This is the evaluation of the expression E with the given arguments provided that the min and max exist. A similar role exists for increasing convex constraint functions associated with .
Let us observe that, as usual in computational calculus, it is interesting to explore how to evaluate the interval-valued expression
E, by means of its corresponding real-valued expression given in (
1), at a discrete set of values for
. For such discretization, and given
, let us consider a classic partition of
, with
; in particular, let us use
, for
. As a result after computation on the partition, it is expected to obtain an approximation of the expression
E. To this end, in the following, we present a useful result for computational calculus in the next section under continuity of the expression
.
Proposition 1. Let us consider the decreasing convex constraint functions associated to , and an expression in interval arithmetic , in which the corresponding real-valued expression is continuous. For any , consider the partition on , with , for . Then, is a correct expression in interval arithmetic, and Proof. On one hand,
are decreasing convex constraint functions, and then, from Definition 1, they are continuous on
. Since the real-valued expression
is continuous, it derives that
is continuous on the compact set
, what implies that there exist the minimum and maximum of
on
. Therefore, following Definitions 2 and 3,
is a correct expression in interval arithmetic and can be calculated by the Equality (
1). On the other hand, from the continuity of
on the compact set
, it follows that
, and
, and then the Equality (
2) is fulfilled. □
As a consequence of the previous result, let us point out that given
p and its associated partition
, the interval given by
can be used as an approximation to interval
, which is useful in the computational calculus in the next section.
4. Nonparametric Interval Dual Control Algorithm
The mathematical description of the investigated object can be as follows:
where
A is an unknown object operator. If there exists an inverse operator
,
, then
From now on, we assume that
exists and it is a continuous function. It is an ill-posed problem [
35]. The exact solution exists for the case of output noise absence. In the presence of noise, some regularization methods can be applied to obtain an exact solution for systems that could be modeled by a linear differential equation [
36]. For the model-free case or situation of nonparametric uncertainty, it is advisable to use kernel estimations to obtain the estimation of the inverse operator [
37].
The “ideal” regulator could have the form (
4). The Formula (
4) could be used in order to obtain the desired trajectory
. In this case, we calculate the “ideal” value of the control variable
. The major problem is that in many cases it is impossible to construct such a scheme because the operator
A is unknown. The estimation of the inverse operator
is used to obtain the estimation
. The idea is to estimate it directly from input
u and output
x [
38].
Consider the dual control algorithms which were first proposed by Feldbaum [
1]. The control aim of such algorithms has dual nature: caution and probing [
3]. Feldbaum considered a situation when the structure of the model and the laws of the distribution of the random disturbances are known. In [
39], the idea of applying the nonparametric estimation of regression function in control systems was first suggested for crisp values. The method is robust to nonparametric uncertainty: the mathematical description of the object is unknown.
As a task of control unit to obtain control action
, so the inverse function
of (
3) exists:
where
is a continuous function. As
are compact sets and
are continuous functions, then we find that
are also intervals.
Previously, the nonparametric algorithm of dynamic processes dual control for crisp values was suggested in [
21]. Due to uncertain data context and the presence of interval-valued data, we propose the nonparametric interval dual control algorithm of a multidimensional object by means of SLCIA. The proposed algorithm includes the following steps:
Step 1. Under the new framework, we deal with intervals instead of points. For this purpose, taking into account SLCIA, we define each interval-valued variable of
by means of their decreasing convex constraint functions associated, and give an initial value to
p. Then, we consider the partition
, with
, for
, for the discretization for each interval variable
, what provides the following discrete subsets:
with
. Note that
,
.
Then, for each level of the interval we calculate the level of the control variable . We operate on all levels and then take the minimum and maximum of the operations in relation to for each to obtain the extremes of the new interval of the control variable . For each value of control variable we use the following control algorithm.
Step 2. We use Nadaraya–Watson nonparametric estimation of inverse regression function [
39,
40]. For this purpose, it is necessary to define bandwidth parameters
and
. Bandwidth parameters for each component of the vector of variables
u and
x are determined due to the following algorithm.
(i) Calculate the value of bandwidth parameter
:
where coefficient
,
is the closest observation to the value
of the sample
.
(ii) Determine the value of the coefficient
:
where coefficient
,
is the closest observation to the value
of the sample
. The sampling points satisfy the following conditions:
Step 3. We calculate the component
which accumulates the knowledge about the object. The first variable
can be calculated as a nonparametric estimation of the regression function for discrete observations
in the following form:
where
is a kernel function. Kernel function
and bandwidth parameter
satisfies the following convergence conditions [
39,
41]:
The main idea is that each subsequent value
depends on the value
found in the previous step. The estimation of
,
is based on a Nadaraya–Watson estimation of inverse regression function which refers to the local approximation methods [
39].
Step 4. So, for components
addend
is proposed to calculate due to the formula:
Step 5. The search step
could have the form:
where
could be found as a minimum of quadratic criterion:
The value of variable belongs to the interval .
Step 6. In this case, the nonparametric dual control algorithm can be represented as follows:
where the component
accumulates the knowledge about the object, the component
is the “learners” search step.
Step 7. For each value of reference variables
, we have calculated the value of control variable
,
, for
. Then, we choose the minimum and the maximum value of the control variable. For example, for variable
:
. So, we obtain intervals for every control variable
:
We use the Gaussian kernel function as it is continuous and universal. From Proposition 1 it follows that the minimum and the maximum value of the control variable exist. So, we obtain an approximation of the interval values
using the proposed algorithm, as was concluded in
Section 3.
5. Numerical Examples
At the initial stage of the control algorithm (
14) search step
(
12) plays a key role. This component stands for the ability of control to lead the object to the desired output. A sample of observations
of input and output variables begins to accumulate from the first measurement and grows in the process of system control. The increased sample size leads to the growing role of the component
, this term contains the knowledge about the controlled object. This is the case of active data accumulation.
The combined method of data accumulation assumes that there is an initial sample of observations , but at the following times sample is supplemented with new elements , , …. In this case, the active and passive methods of data accumulation are associated. An available sample of observations is not sufficient to construct a high-quality system, but at the beginning, such a system is trained more than in the case of active data accumulation. This case is the most consistent with practice because the development of complex adaptive systems does not start from scratch.
Let us consider the case of the combined method of data accumulation of simulation of the object which has three input
and two output
variables. Let the object be described by the following equations:
As it was said, if reference variables are intervals, control input
u is also interval variables, and expressions given in (
15) are interpreted following the arithmetic given by SLCIA. To illustrate the situation when the reference variables are intervals we conduct computational experiments under the considered interval-valued arithmetic, by means of a discretization of the parameter
. Then, in order to use the suggested algorithm for each level
of the interval
we calculate the level
of the control variable
. In the experiment, the set points are
,
.
The object control system is constructed by using a nonparametric estimation. In this case, the dual control algorithm has the following form:
Firstly, it is necessary to calculate the component
of dual control Algorithm (
16):
The component
is calculated as follows:
In numeric experiments, we use the Gaussian kernel function which is a popular and practical choice [
42]. For instance, it has the following form for the variable
x [
43]:
To assess the results of the simulation of control algorithms using a nonparametric model, the quadratic relative error was used for each
-level:
where
—the estimation of mathematic expectation of the
j-th component of output variable
x.
Then, the control error could have the following form:
The value of the relative error (
21) belongs to the interval
. A small error value (close to zero) indicates the high accuracy of control algorithms.
There is a case of the combined method of information accumulation. The sample
was passively accumulated. Since the 200 step dual control (
14) algorithm starts working on the next 500 steps. In the following experiment control, we calculate input 100 times and find the minimum and maximum values to obtain intervals
. The results are presented in
Table 1.
Let us consider the results of the experiment when the desired output
has a stepwise form and is presented as crisp values:
In the first step, the algorithm is adjusted and then causes the object to the desired output. The simulation results in the absence of interference are shown in
Figure 2. In
Figure 2 and
Figure 3, the index means the number of a sample element. The first 200 sample elements from 0 to 199 were passively accumulated, and the control process started from the 200 elements, so the first index is 200 in
Figure 2. With the new value of the reference variable,
or
algorithm tuning occurs, then the algorithm causes the object to the desired value. The graphics of control variables
,
,
are shown in
Figure 3. The control error is 0.08.