Next Article in Journal
Optimization of Leakage Risk and Maintenance Cost for a Subsea Production System Based on Uncertain Fault Tree
Previous Article in Journal
Best Approximation Results for Fuzzy-Number-Valued Continuous Functions
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Nonparametric Dual Control Algorithm of Multidimensional Objects with Interval-Valued Observations

by
Manuel Arana-Jiménez
1,*,
Alexander V. Medvedev
2 and
Ekaterina Chzhan
3
1
Department of Statistics and Operations Research, INDESS, University of Cádiz, 11405 Jerez, Spain
2
Department of Information Systems, School of Space and Information Technology, Siberian Federal University, 660041 Krasnoyarsk, Russia
3
Department of Intelligent Control Systems, School of Space and Information Technology, Siberian Federal University, 660041 Krasnoyarsk, Russia
*
Author to whom correspondence should be addressed.
Axioms 2023, 12(2), 193; https://doi.org/10.3390/axioms12020193
Submission received: 19 December 2022 / Revised: 29 January 2023 / Accepted: 8 February 2023 / Published: 11 February 2023
(This article belongs to the Topic Mathematical Modeling)

Abstract

:
We focus on the dual interval control problem of multidimensional objects with delay. We propose a new nonparametric algorithm. In such a case, it is not necessary to determine a parametric structure of the investigated object. Another difficulty lies in the complex nature of the decision-making field as it might not be flexible or convenient for decision-makers to exactly quantify their opinions with crisp numbers. Due to this fact, we introduce the interval-valued observations into the algorithm by means of the single-level constraint interval arithmetic. The results of computational experiments illustrate the effectiveness of the algorithm in the case of using intervals instead of crisp values.

1. Introduction

In this article, we consider a control problem of multidimensional objects. In [1], Feldbaum suggested dual control theory. Dual control algorithms combine control and object learning processes. This theory was extensively developed by Wittenmark and Astrom [2,3] who suggested applying dual control algorithms in two cases: a short time horizon and rapidly changing object parameters. In the first stages of theory development, it was used for linear stochastic systems with unknown parameters [4,5]. Dual control algorithms were developed for the case of parametric uncertainty [6]. Using this type of dual algorithm assumes that the structure of the true system is a priory known and the control task is to optimize its parameters. The dual control approach is widely used in the development of model predictive control (MPC) systems [7]. Thus, in [8,9], an adaptive MPC strategy was suggested for linear multi-input multi-output systems. In [10], the MPC approach for model-structure uncertainty is introduced. The authors highlight the beneficial effect of MPC with active learning under parametric or structural model uncertainty. In practice, dual control algorithms were applied in a wide variety of fields, such as diabetes investigation [11], a semi-batch reactor equipped with a cooling jacket modeling [12], energy hub modeling [13] and a penicillin fermentation process control [14].
In practice, there is not much a priori information about the control process. In [15], the author presents an overview of adaptive control methods based on how much model information is needed. Processes in the industry (metallurgy, chemical industry, mining industry, production of electronic components, etc.) are complex and the researcher has no data on the mathematical structure of the system. It could be difficult and time-consuming to build an accurate parametric model. Thus, a parametric approach for constructing control systems might be impractical [16]. Therefore, data-driven or modal-free methods for creating controllers have become widespread [16]. One of these [17] shows some results in constructing a control system on data-dependent matrices that can replace systems models. In [18], dual control algorithms based on neural networks are used to approximate a priory unknown functions. The neural approach is applicable for multidimensional dynamic systems with unknown structures [19]. Another method that allows modeling systems with unknown structures is the echo state networks [20]. A dual control algorithm for multidimensional dynamic objects using nonparametric estimation of a reverse regression function was suggested in [21].
In control systems, the value of desired output is set by an expert. It is not frequent that the knowledge of experts is precise and such imprecise knowledge of experts should be represented by interval numbers for reflecting the imprecision. To solve the dual control problem we will use interval arithmetic which is called single-level constraint interval arithmetic (SLCIA) [22]. There are also some articles devoted to a framework of interval-based data analysis in control problems [23,24,25]. In this study, we include interval values in a nonparametric dual control algorithm for multidimensional systems. Similar to the work [24], we use SLCIA to calculate control actions as it simplifies the process of calculations and computer implementation.
In the existing methods [26], traditional fuzzy control algorithms are used to deal with dynamic systems which can be described with differential equations. In real applications, sometimes it is impossible to take into account the dynamics of the process if the measurement interval of the output variable is more than the output time constant. So it is impossible to take into account the dynamics of the process in the control system. We study static objects with transport delay. In the presence of such delay, it is possible to make a shift in the observation matrix by the delay value to bring the one-to-one correspondence between the values of the input and output variables and not take it into account in further reasoning.
Thus, the main problem considered in the article is the construction of control algorithms under conditions of uncertainty. For this, a synthesis of the following approaches is proposed. The first is the dual control theory for control in the absence of a training sample for setting up the control device. Dual control theory is used to combine two competing goals: training and control. In [27,28], a dual control algorithm for stochastic systems with multiple uncertainties is suggested for crisp values. The second one is the theory of nonparametric control for objects whose mathematical description is a priori unknown up to parameters. It is a general-purpose algorithm, meaning it does not depend on the object’s mathematical description. It can be applied to a wide class of objects with known qualitative properties (dynamic or static, for example). Moreover, SLCIA is introduced for working with interval values of setpoints. In [29], SLCIA was used for the fuzzy interval optimal control problem. We propose this approach to the dual control problems for interval variables under uncertainties. In this paper, for the first time, a nonparametric interval dual control algorithm is proposed.
The rest of the paper is organized as follows. In Section 2, we present the formulation of the dual control problem. In Section 3, we propose a nonparametric dual control algorithm. In Section 4, the results of the numerical experiments of modeling multidimensional objects are described. We conclude our work in Section 5.

2. Problem Formulation under Interval Uncertainty

Consider a control system, whose general scheme is shown in Figure 1. The notation is as follows: x = ( x 1 , x 2 , , x n ) Ω ( x ) R n is an output variable of the process, x * = ( x 1 * , x 2 * , , x n * ) Ω ( x ) R n is a vector of set points, u = ( u 1 , u 2 , , u m ) Ω ( u ) R m is a control input vector, ξ is a vector random disturbances, G x 1 , G x 2 , , G x n are the system response channels corresponding to output variables and including control tools, g x = ( g x 1 , g x 2 , , g x n ) Ω ( g x ) R n is the random inaccuracy of measurements of output variables of the process with zero mathematic expectation and limited dispersion.
The input and output variables are continuous because of the nature of the process but the measurements are made at discrete times due to control tools so we investigate discrete-continuous systems. Such systems are also called hybrids as the continuous part consists of multiple-operation technological chains and the discrete part consists of digital controllers [30]. The agreed notation is as follows: u i , j , j = 1 , 2 , , m , i = 1 , 2 , , s —the ith measurement of jth component of the control variable u; x i , j , j = 1 , 2 , , n , i = 1 , 2 , , s —the ith measurement of jth component of the output variable x. We have an initial sample of observations { u i , x i , i = 1 , 2 , , s } , where s is sample size.
The task of the control unit is to generate such a control action u that the difference between object output value x and the value x * is minimal. In the previous paragraph, we commented on the case when all variables were considered crisp numbers. However, in practice, it is usual that decision information is uncertain. It might not be flexible or convenient for decision-makers to exactly quantify their opinions with crisp numbers. A possible solution to model and deal with such uncertainty is by means of interval values. In this regard, and following, we introduce and formulate a situation when values of reference variables are intervals. For that, we use intervals notation proposed by Stefanini and Bede in [31] to define the set of real intervals as
K C = { [ a ̲ , a ¯ ] : a ̲ a ¯ , a ̲ , a ¯ R } ,
where [ a ̲ , a ¯ ] notes the classic real interval. In multidimensional case,
K C n = K C × × K C n times ,
that is, K C n is the space of nonempty compact and convex sets of n-dimensional real numbers R n .
The value of desired output is set by an expert. So, we refer to new information given by an expert and due to this information, we deal with intervals. For this purpose, we make the transition from x * = ( x 1 * , x 2 * , , x n * ) Ω ( x ) R n to y * = ( y 1 * , y 2 * , , y n * ) K C n . Under the assumption, we are going to obtain intervals instead of crisp values for control variables. We introduce a new notation of control variable v = ( v 1 , , v m ) K C m for interval values. We use different notations { u i , x i , i = 1 , 2 , , s } , x ( t ) R n , u R m for observations, which we obtain by measuring input and output variables of the process and { v , y * } , y * K C n , v K C m for approximation.

3. SLCIA Basic Concepts

On the topic of interval arithmetic and analysis we can find discussions and notations by Stefanini and Bede in [31], Moore [32,33], and Alefeld and Herzberger [34], among others. In [22], it is proposed a variant of constraint interval arithmetic (CIA) that operates with a single parameter (level) in each interval operand of an expression, called single-level constraint interval arithmetic (SLCIA). This arithmetic was used in the discrete-time interval optimal control problem [29], and in the next section, we propose its extension to the evaluation of expressions in interval-valued dual control problems.
Let us bring the basic definitions of single-level constraint interval arithmetic [22].
Definition 1
([22]). Let A = { [ a ̲ , a ¯ ] } K C be any interval. Then
1. 
A continuous function A : [ 0 , 1 ] R such that
min 0 λ 0 A ( λ ) = a ̲ , max 0 λ 0 A ( λ ) = a ¯ ,
will be called a constraint function associated with A.
2. 
Associated with the interval A we define the decreasing convex constraint function A : [ 0 , 1 ] R by means
A ( λ ) = λ a ̲ + ( 1 λ ) a ¯ , 0 λ 1 ,
or equivalently
A ( λ ) = ( a ̲ a ¯ ) λ + a ¯ , 0 λ 1 .
3. 
Associated with the interval A we define the increasing convex constraint function A : [ 0 , 1 ] R by means
A ( λ ) = ( 1 λ ) a ̲ + λ a ¯ , 0 λ 1 ,
or equivalently
A ( λ ) = a ̲ + λ ( a ¯ a ̲ ) , 0 λ 1 .
For discussions and examples of SLCIA, we refer to [22], particularly, for the evaluation of expression with intervals. In this regard, we highlight the following definitions of expression in interval arithmetic, with a role for the calculus of interval-valued expressions.
Definition 2
([22]). An expression E ( A 1 , , A q ) is a correct expression in interval arithmetic if E ( x 1 , , x q ) is a correctly constructed expression in a formal language for arithmetic operations with real number operands x 1 , , x q and usual arithmetic operations on real numbers.
Definition 3
([22]). Let A 1 ( λ ) , , A q ( λ ) be the decreasing convex constraint functions associated to A 1 , , A q K C , E ( A 1 ( λ ) , , A q ( λ ) ) . The evaluation of a correct expression is performed according to the following rule:
E ( A 1 , , A q ) = min λ [ 0 , 1 ] E A 1 ( λ ) , , A q ( λ ) , max λ [ 0 , 1 ] E A 1 ( λ ) , , A q ( λ ) .
This is the evaluation of the expression E with the given arguments provided that the min and max exist. A similar role exists for increasing convex constraint functions associated with A 1 , , A q K C .
Let us observe that, as usual in computational calculus, it is interesting to explore how to evaluate the interval-valued expression E, by means of its corresponding real-valued expression given in (1), at a discrete set of values for λ . For such discretization, and given p N , let us consider a classic partition of [ 0 , 1 ] π p = { λ k : k = 1 , , p + 1 } , with 0 = λ 1 < λ 2 < < λ p + 1 = 1 ; in particular, let us use λ k = k 1 p , for k = 1 , , p + 1 . As a result after computation on the partition, it is expected to obtain an approximation of the expression E. To this end, in the following, we present a useful result for computational calculus in the next section under continuity of the expression E ( x 1 , , x q ) .
Proposition 1.
Let us consider A 1 ( λ ) , , A q ( λ ) the decreasing convex constraint functions associated to A 1 , , A q K C , and an expression in interval arithmetic E ( A 1 , , A q ) , in which the corresponding real-valued expression E ( x 1 , , x q ) is continuous. For any p N , consider the partition on [ 0 , 1 ] π p = { λ k : k = 1 , , p + 1 } , with λ k = k 1 p , for k = 1 , , p + 1 . Then, E ( A 1 , , A q ) is a correct expression in interval arithmetic, and
E ( A 1 , , A q ) = lim p min λ k π p E A 1 ( λ k ) , , A q ( λ k ) , lim p max λ k π p E A 1 ( λ k ) , , A q ( λ k ) .
Proof. 
On one hand, A 1 ( λ ) , , A q ( λ ) are decreasing convex constraint functions, and then, from Definition 1, they are continuous on [ 0 , 1 ] . Since the real-valued expression E ( x 1 , , x q ) is continuous, it derives that E A 1 ( λ ) , , A q ( λ ) is continuous on the compact set [ 0 , 1 ] , what implies that there exist the minimum and maximum of E A 1 ( λ ) , , A q ( λ ) on [ 0 , 1 ] . Therefore, following Definitions 2 and 3, E ( A 1 , , A q ) is a correct expression in interval arithmetic and can be calculated by the Equality (1). On the other hand, from the continuity of E A 1 ( λ ) , , A q ( λ ) on the compact set [ 0 , 1 ] , it follows that min λ [ 0 , 1 ] E A 1 ( λ ) , , A q ( λ ) = lim p min λ k π p E A 1 ( λ k ) , , A q ( λ k ) , and max λ [ 0 , 1 ] E A 1 ( λ ) , , A q ( λ ) = lim p max λ k π p E A 1 ( λ k ) , , A q ( λ k ) , and then the Equality (2) is fulfilled. □
As a consequence of the previous result, let us point out that given p and its associated partition π p , the interval given by
min λ k π p E A 1 ( λ k ) , , A q ( λ k ) , max λ k π p E A 1 ( λ k ) , , A q ( λ k )
can be used as an approximation to interval E ( A 1 , , A q ) , which is useful in the computational calculus in the next section.

4. Nonparametric Interval Dual Control Algorithm

The mathematical description of the investigated object can be as follows:
x = A ( u ) ,
where A is an unknown object operator. If there exists an inverse operator A 1 , A 1 A = 1 , then
x = A 1 A < x = x * > , u = A 1 < x > .
From now on, we assume that A 1 exists and it is a continuous function. It is an ill-posed problem [35]. The exact solution exists for the case of output noise absence. In the presence of noise, some regularization methods can be applied to obtain an exact solution for systems that could be modeled by a linear differential equation [36]. For the model-free case or situation of nonparametric uncertainty, it is advisable to use kernel estimations to obtain the estimation of the inverse operator [37].
The “ideal” regulator could have the form (4). The Formula (4) could be used in order to obtain the desired trajectory x = x * . In this case, we calculate the “ideal” value of the control variable u * . The major problem is that in many cases it is impossible to construct such a scheme because the operator A is unknown. The estimation of the inverse operator A 1 ^ is used to obtain the estimation u * ^ . The idea is to estimate it directly from input u and output x [38].
Consider the dual control algorithms which were first proposed by Feldbaum [1]. The control aim of such algorithms has dual nature: caution and probing [3]. Feldbaum considered a situation when the structure of the model and the laws of the distribution of the random disturbances are known. In [39], the idea of applying the nonparametric estimation of regression function in control systems was first suggested for crisp values. The method is robust to nonparametric uncertainty: the mathematical description of the object is unknown.
As a task of control unit to obtain control action u ( t ) , so the inverse function A i 1 of (3) exists:
u j = A j 1 ( x i * ) , i = 1 , 2 , , n , j = 1 , 2 , , m ,
where A j 1 is a continuous function. As y i * are compact sets and A j 1 are continuous functions, then we find that v j are also intervals.
Previously, the nonparametric algorithm of dynamic processes dual control for crisp values was suggested in [21]. Due to uncertain data context and the presence of interval-valued data, we propose the nonparametric interval dual control algorithm of a multidimensional object by means of SLCIA. The proposed algorithm includes the following steps:
Step 1. Under the new framework, we deal with intervals instead of points. For this purpose, taking into account SLCIA, we define each interval-valued variable of y * = ( y 1 * , y 2 * , , y n * ) by means of their decreasing convex constraint functions associated, and give an initial value to p. Then, we consider the partition π p = { λ k : k = 1 , , p + 1 } , with λ k = k 1 p , for k = 1 , , p + 1 , for the discretization for each interval variable y j * = [ y j * ̲ , y j * ¯ ] , j = 1 , 2 , , n , what provides the following discrete subsets:
{ y j * k } [ y j * ̲ , y j * ¯ ] , k = 1 , 2 , , p + 1 ,
with y j * k = λ k y j * ̲ + ( 1 λ k ) y j * ¯ . Note that y j * 1 = y j * ̲ , y j * p + 1 = y j * ¯ .
Then, for each level λ k , k = 1 , 2 , , p + 1 of the interval ( y 1 * ( λ k ) , y 2 * ( λ k ) , , y n * ( λ k ) ) we calculate the level λ k of the control variable v = ( v 1 ( λ k ) , v 2 ( λ k ) , , v m ( λ k ) . We operate on all levels and then take the minimum and maximum of the operations in relation to λ for each v j , j = 1 , 2 , , m to obtain the extremes of the new interval of the control variable v = ( v 1 , v 2 , , v m ) . For each value of control variable v = ( v 1 , v 2 , , v m ) we use the following control algorithm.
Step 2. We use Nadaraya–Watson nonparametric estimation of inverse regression function [39,40]. For this purpose, it is necessary to define bandwidth parameters h x and h u . Bandwidth parameters for each component of the vector of variables u and x are determined due to the following algorithm.
(i) Calculate the value of bandwidth parameter h w x , w = 1 , 2 , , n :
h w x = β | y w , s + 1 * x w 0 | ,
where coefficient β > 1 , x w 0 is the closest observation to the value y w , s + 1 * of the sample { x w , i , i = 1 , 2 , , s } , w = 1 , 2 , , n .
(ii) Determine the value of the coefficient h u :
h u = γ | u s + 1 u 0 | ,
where coefficient γ > 1 , u 0 is the closest observation to the value u s + 1 of the sample { u i , i = 1 , 2 , , s , s < s } . The sampling points satisfy the following conditions:
| y w , s + 1 * x w , i | / h w x 1 , i = 1 , , s , w = 1 , 2 , , n .
Step 3. We calculate the component v j , s * ( λ k ) , j = 1 , 2 , , m which accumulates the knowledge about the object. The first variable v 1 , s * ( λ k ) can be calculated as a nonparametric estimation of the regression function for discrete observations { u i , x i , i = 1 , 2 , , s } in the following form:
v 1 , s * ( λ k ) = i = 1 s u 1 , i w = 1 n Φ y w , s + 1 * ( λ k ) x w , i h w x i = 1 s w = 1 n Φ y w , s + 1 * ( λ k ) x w , i h w x ,
where w = 1 n Φ y w , s + 1 * ( λ k ) x w , i h w x is a kernel function. Kernel function w = 1 n Φ y w , s + 1 * ( λ k ) x w , i h w x and bandwidth parameter h w x satisfies the following convergence conditions [39,41]:
h w x > 0 ; & & 0 Φ ( y w , s + 1 * ( λ k ) x w , i ) / h w x < ; lim s h w x = 0 ; & & Ω ( x ) Φ ( y w , s + 1 * ( λ k ) x w , i ) / h w x d x w , i = 1 ; lim s s ( h w x ) n = ; & & 1 h w x lim s Φ ( y w , s + 1 * ( λ k ) x w , i ) / h w x = δ ( x w , s + 1 * ( λ k ) x w , i ) .
The main idea is that each subsequent value v i ( λ k ) , i = 2 , 3 , , m depends on the value v i ( λ k ) , i = 1 , 2 , , m 1 found in the previous step. The estimation of v j , s * ( λ k ) , j = 2 , 3 , , m is based on a Nadaraya–Watson estimation of inverse regression function which refers to the local approximation methods [39].
Step 4. So, for components v j ( λ k ) , j = 2 , 3 , , m addend v j , s * ( λ k ) , j = 2 , 3 , , m is proposed to calculate due to the formula:
v j , s * ( λ k ) = i = 1 s u j , i w = 1 j 1 Φ v j , s * ( λ k ) u w , i h u w = 1 n Φ y w , s + 1 * ( λ k ) x w , i h w x i = 1 s w = 1 j 1 Φ u w , s + 1 u w , i h u w = 1 n Φ y w , s + 1 * ( λ k ) x w , i h w x .
Step 5. The search step Δ v j , s + 1 could have the form:
Δ v j , s + 1 ( λ k ) = i = 1 n Θ i y i , s + 1 * ( λ k ) x i , s , j = 1 , 2 , , m ,
where Θ = Θ 1 , Θ 2 , , Θ n could be found as a minimum of quadratic criterion:
R Θ 1 , Θ 2 , , Θ n = ( i = 1 s 1 p = 1 n ( y p , i + 1 * ( λ k ) j = 1 s 1 x p , j + 1 × q = 1 m Φ v q , i * + w = 1 n Θ w y w , i + 1 * ( λ k ) x w , i u q , j + 1 h u × q = 1 m Φ v q , i * ( λ k ) + w = 1 n Θ w y w , i + 1 * ( λ k ) x w , i u q , j + 1 h u ( λ k ) 1 ) ) 2 min Θ 1 , Θ 2 , , Θ n
The value of variable Θ belongs to the interval [ 0 , 1 ] .
Step 6. In this case, the nonparametric dual control algorithm can be represented as follows:
v j , s + 1 ( λ k ) = v j , s * ( λ k ) + Δ v j , s + 1 ( λ k ) , j = 1 , 2 , , m ,
where the component v j , s * accumulates the knowledge about the object, the component Δ v j , s + 1 is the “learners” search step.
Step 7. For each value of reference variables ( y 1 ( λ k ) , y 2 ( λ k ) , , y n ( λ k ) ) , we have calculated the value of control variable v = ( v 1 ( λ k ) , v 2 ( λ k ) , , v m ( λ k ) ) , λ k = k 1 p , for k = 1 , , p + 1 . Then, we choose the minimum and the maximum value of the control variable. For example, for variable v 1 : v 1 ̲ = min { v 1 ( λ k ) } , v 1 ¯ = max { v 1 ( λ k ) } , k = 1 , , p + 1 . So, we obtain intervals for every control variable v = ( v 1 , v 2 , , v m ) :
v j = [ v j ̲ , v j ¯ ] , j = 1 , 2 , , m .
We use the Gaussian kernel function as it is continuous and universal. From Proposition 1 it follows that the minimum and the maximum value of the control variable exist. So, we obtain an approximation of the interval values v = ( v 1 , v 2 , , v m ) using the proposed algorithm, as was concluded in Section 3.

5. Numerical Examples

At the initial stage of the control algorithm (14) search step Δ v j , s + 1 ( λ k ) (12) plays a key role. This component stands for the ability of control to lead the object to the desired output. A sample of observations { u i , x i , i = 1 , 2 , , s } of input and output variables begins to accumulate from the first measurement and grows in the process of system control. The increased sample size leads to the growing role of the component v j , s * ( λ k ) , j = 1 , 2 , , s , this term contains the knowledge about the controlled object. This is the case of active data accumulation.
The combined method of data accumulation assumes that there is an initial sample of observations { u i , x i , i = 1 , 2 , , s } , but at the following times sample is supplemented with new elements ( u s + 1 , x s + 1 ) , ( u s + 2 , x s + 2 ) , …. In this case, the active and passive methods of data accumulation are associated. An available sample of observations is not sufficient to construct a high-quality system, but at the beginning, such a system is trained more than in the case of active data accumulation. This case is the most consistent with practice because the development of complex adaptive systems does not start from scratch.
Let us consider the case of the combined method of data accumulation of simulation of the object which has three input u = ( u 1 , u 2 ) and two output x = ( x 1 , x 2 ) variables. Let the object be described by the following equations:
x 1 = u 1 + 1.5 u 2 , x 2 = 1.5 u 1 + 2 u 2 .
As it was said, if reference variables are intervals, control input u is also interval variables, and expressions given in (15) are interpreted following the arithmetic given by SLCIA. To illustrate the situation when the reference variables are intervals we conduct computational experiments under the considered interval-valued arithmetic, by means of a discretization of the parameter λ . Then, in order to use the suggested algorithm for each level λ k , k = 1 , 2 , , p + 1 of the interval ( y 1 * ( λ k ) , y 2 * ( λ k ) , , y n * ( λ k ) ) we calculate the level λ k of the control variable v = ( v 1 ( λ k ) , v 2 ( λ k ) , , v m ( λ k ) . In the experiment, the set points are y 1 * = [ 1.8 , 2 ] , y 2 * = [ 2.8 , 3 ] .
The object control system is constructed by using a nonparametric estimation. In this case, the dual control algorithm has the following form:
v 1 , s + 1 ( λ k ) = v 1 , s * ( λ k ) + Δ v 1 , s + 1 ( λ k ) , v 2 , s + 1 ( λ k ) = v 2 , s * ( λ k ) + Δ v 2 , s + 1 ( λ k ) .
Firstly, it is necessary to calculate the component v 1 , s * of dual control Algorithm (16):
v 1 , s * ( λ k ) = i = 1 s u 1 , i w = 1 2 Φ y w , s + 1 * ( λ k ) x w , i h w x i = 1 s w = 1 2 Φ y w , s + 1 * ( λ k ) x w , i h w x .
The component v 2 , s * is calculated as follows:
v 2 , s * ( λ k ) = i = 1 s u 2 , i Φ v 1 , s * ( λ k ) u 1 , i h u w = 1 2 Φ y w , s + 1 * ( λ k ) x w , i h w x i = 1 s Φ v 1 , s * ( λ k ) u 1 , i h u w = 1 2 Φ y w , s + 1 * ( λ k ) x w , i h w x .
In numeric experiments, we use the Gaussian kernel function which is a popular and practical choice [42]. For instance, it has the following form for the variable x [43]:
Φ y w , s + 1 * ( λ k ) x w , i h w x = 1 2 π e 1 2 y w , s + 1 * ( λ k ) x w , i h w x 2 .
To assess the results of the simulation of control algorithms using a nonparametric model, the quadratic relative error was used for each λ -level:
W ( λ k ) = 1 s i = 1 s j = 1 n y j , i * ( λ k ) x j , i 2 1 s 1 i = 1 s j = 1 n x j , i m ^ x j 2 ,
where m ^ x j —the estimation of mathematic expectation of the j-th component of output variable x.
Then, the control error could have the following form:
W = 1 p + 1 i = 1 p + 1 W ( λ k ) .
The value of the relative error (21) belongs to the interval [ 0 , 1 ] . A small error value (close to zero) indicates the high accuracy of control algorithms.
There is a case of the combined method of information accumulation. The sample { u 1 , i , u 2 , i , x i , i = 1 , 2 , , s } , s = 200 was passively accumulated. Since the 200 step dual control (14) algorithm starts working on the next 500 steps. In the following experiment control, we calculate input 100 times and find the minimum and maximum values to obtain intervals v j = [ v j ̲ , v j ¯ ] , j = 1 , , 3 . The results are presented in Table 1.
Let us consider the results of the experiment when the desired output x * has a stepwise form and is presented as crisp values:
x 1 , i * = 5 , x 2 , i * = 7 , i = 200 , 299 ¯ , x 1 , i * = 2 , x 2 , i * = 3 , i = 300 , 400 ¯ .
In the first step, the algorithm is adjusted and then causes the object to the desired output. The simulation results in the absence of interference are shown in Figure 2. In Figure 2 and Figure 3, the index means the number of a sample element. The first 200 sample elements from 0 to 199 were passively accumulated, and the control process started from the 200 elements, so the first index is 200 in Figure 2. With the new value of the reference variable, x 1 * or x 2 * algorithm tuning occurs, then the algorithm causes the object to the desired value. The graphics of control variables u 1 , u 2 , u 3 are shown in Figure 3. The control error is 0.08.

6. Conclusions

In this paper, an interval dual control problem was proposed and the nonparametric algorithm was extended to the control theory using single-level constrained interval arithmetic. First, the training control algorithm is conducted at the same time as the control process. Second, the use of intervals allows for taking into account a variety of random factors, such as the inaccuracy of measurements. The proposed algorithm (Equation (14)) is effective in finding the interval solution of the control problem. A numerical example shows that the procedure to solve interval dual control problems is efficient. Moreover, a discretization method has a practical solution for a control problem decision. Future work will consider the insertion of the nonparametric dual control theory into the theory of fuzzy sets, i.e., we intend to study fuzzy control problems using single-level constrained fuzzy arithmetic. As another future research line, we will study the case when A 1 is not necessarily a function, and extend the method for the construction of the interval/fuzzy solution for such a new case.

Author Contributions

Methodology, M.A.-J., A.V.M. and E.C.; Software, E.C.; writing—original draft preparation, M.A.-J., E.C.; writing—review and editing, M.A.-J., A.V.M. and E.C. All authors contributed to the conceptualization, writing, and editing of this article. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Acknowledgments

The authors thank the editors and the anonymous reviewers for their insightful comments which improved the quality of the paper.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Feldbaum, A.A. Fundamentals of the Theory of Optimal Automatic Systems; Fizmatgiz Publishing: Moscow, Russia, 1963. [Google Scholar]
  2. Wittenmark, B. Adaptive dual control methods: An overview. IFAC Proc. Vol. 1995, 28, 67–73. [Google Scholar] [CrossRef]
  3. Astrom, K.; Wittenmark, B. Problems of identification and control. J. Math. Anal. Appl. 1971, 34, 90–113. [Google Scholar] [CrossRef]
  4. Wenk, C.J.; Bar-Shalom, Y. A multiple model adaptive dual control algorithm for stochastic systems with unknown parameters. Autom. Control. 1980, 25, 703–710. [Google Scholar] [CrossRef]
  5. Filatov, N.M.; Unbehauen, H. Survey of adaptive dual control methods. In Proceedings of the IEEE 2000 Adaptive Systems for Signal Processing, Communications, and Control Symposium, Lake Louise, AB, Canada, 4 October 2000; pp. 119–128. [Google Scholar]
  6. Umenberger, J.; Schön, T.B. Optimistic robust linear quadratic dual control. In Proceedings of the 2nd Conference on Learning for Dynamics and Control, Berkeley, CA, USA, 11–12 June 2020; pp. 550–560. [Google Scholar]
  7. Mesbah, A. Stochastic model predictive control with active uncertainty learning: A Survey on dual control. Annu. Rev. Control. 2018, 45, 107–117. [Google Scholar] [CrossRef]
  8. Heirung, T.A.N.; Foss, B.; Ydstie, B.E. MPC-based dual control with online experiment design. J. Process. Control 2015, 32, 64–76. [Google Scholar] [CrossRef]
  9. Heirung, T.A.N.; Ydstie, B.E.; Foss, B. Dual adaptive model predictive control. Automatica 2017, 80, 340–348. [Google Scholar] [CrossRef]
  10. Heirung, T.A.N.; Paulson, J.A.; Lee, S.; Mesbah, A. Model predictive control with active learning under model uncertainty: Why, when, and how. AIChE J. 2018, 64, 3071–3081. [Google Scholar] [CrossRef]
  11. Bhattacharjee, A.; Sutradhar, A. Data driven nonparametric identification and model based control of glucose-insulin process in type 1 diabetics. J. Process. Control 2016, 41, 14–25. [Google Scholar] [CrossRef]
  12. Thangavel, S.; Lucia, S.; Paulen, R.; Engell, S. Dual robust nonlinear model predictive control: A multi-stage approach. J. Process. Control 2018, 72, 39–51. [Google Scholar] [CrossRef]
  13. Sun, Q.; Zhang, N.; You, S.; Wang, J. The Dual Control With Consideration of Security Operation and Economic Efficiency for Energy Hub. IEEE Trans. Smart Grid 2019, 10, 5930–5941. [Google Scholar] [CrossRef]
  14. Byun, H.E.; Kim, B.; Lee, B. Dual Adaptive Control of a Fed-Batch Bioreactor Based on Approximate Dynamic Programming. In Proceedings of the Foundations of Process Analytics and Machine Learning (FOPAM 2019), Raleigh, NC, USA, 6–9 August 2019; pp. 5275–5280. [Google Scholar]
  15. Benosman, M. Model-based vs data-driven adaptive control: An overview. Int. J. Adapt. Control. Signal Process. 2018, 32, 753–776. [Google Scholar] [CrossRef]
  16. Hou, Z.S.; Wang, Z. From model-based control to data-driven control: Survey, classification and perspective. Inf. Sci. 2013, 235, 3–35. [Google Scholar] [CrossRef]
  17. De Persis, C.; Tesi, P. Formulas for data-driven control: Stabilization, optimality, and robustness. IEEE Trans. Autom. Control 2019, 65, 909–924. [Google Scholar] [CrossRef]
  18. Fabrit, S.; Kadirkamanathant, V. Dual Adaptive Control of Nonlinear Stochastic Systems using Neural Networks. Automatica 1998, 34, 245–253. [Google Scholar] [CrossRef]
  19. Fabri, S.G.; Bugeja, M.K. Functional adaptive dual control of a class of nonlinear MIMO systems. Trans. Inst. Meas. Control 2015, 37, 1009–1025. [Google Scholar] [CrossRef]
  20. Cao, S.; Xu, W.; Hu, X. Dual adaptive control of nonlinear stochastic systems based on echo state network. In Proceedings of the 27th Chinese Control and Decision Conference (CCDC 2015), Qingdao, China, 23–25 May 2015; pp. 4579–4584. [Google Scholar]
  21. Medvedev, A.V.; Raskina, A.V. On the Nonparametric Identification and Dual Adaptive Control of Dynamic Processes. J. Sib. Fed. Univ. Math. Phys. 2017, 10, 96–107. [Google Scholar] [CrossRef]
  22. Chalco-Cano, Y.; Lodwick, W.A.; Bede, B. Single level constraint interval arithmetic. Fuzzy Sets Syst. 2014, 257, 146–168. [Google Scholar] [CrossRef]
  23. Campos, J.R.; Assunção, E.; Silva, G.N.; Lodwick, W.A.; Teixeira, M.C. Discrete-time interval optimal control problem. Int. J. Control. 2019, 92, 1778–1784. [Google Scholar] [CrossRef]
  24. Leal, U.; Lodwick, W.; Silva, G.; Maqui-Huaman, G.G. Interval optimal control for uncertain problems. Fuzzy Sets Syst. 2021, 402, 142–154. [Google Scholar] [CrossRef]
  25. Treanţă, S. On a Dual Pair of Multiobjective Interval-Valued Variational Control Problems. Mathematics 2021, 9, 893. [Google Scholar] [CrossRef]
  26. Nguyen, A.T.; Taniguchi, T.; Eciolaza, L.; Campos, V.; Palhares, R.; Sugeno, M. Fuzzy control systems: Past, present and future. IEEE Comput. Intell. Mag. 2019, 14, 56–68. [Google Scholar] [CrossRef]
  27. Xuehui, M.; Qian, F.; Zhang, S. Dual control for stochastic systems with multiple uncertainties. In Proceedings of the Shenyang, China IEEE 2020 39th Chinese Control Conference (CCC), Shenyang, China, 27–29 July 2020; pp. 1001–1006. [Google Scholar]
  28. Ji, R.; Liang, Y.; Xu, L.; Wei, Z. Estimation of dual-mode nonlinear stochastic systems with unknown parameters. Int. J. Robust Nonlinear Control 2022, 32, 9258–9274. [Google Scholar] [CrossRef]
  29. Campos, J.R.; Assunção, E.; Silva, G.N.; Lodwick, W.A.; Teixeira, M.C.M.; Maqui-Huamán, G.G. Fuzzy interval optimal control problem. Fuzzy Sets Syst. 2020, 385, 169–181. [Google Scholar] [CrossRef]
  30. Mitroshin, V.N.; Rogachev, G.N.; Chostkovskii, B.K.; Rogachev, N.G. Fuzzy Optimization in Discrete-Continuous Control Systems for Multiple-Operation Technological Processes. Optoelectron. Instrum. Data Process. 2019, 55, 376–382. [Google Scholar] [CrossRef]
  31. Stefanini, L.; Bede, B. Generalized Hukuhara differentiability of interval-valued functions and interval differential equations. Nonlinear Anal. 2009, 71, 1311–1328. [Google Scholar] [CrossRef]
  32. Moore, R.E. Interval Analysis; Prentice Hall: Englewood Cliffs, NJ, USA, 1966. [Google Scholar]
  33. Moore, R.E. Method and Applications of Interval Analysis; SIAM: Philadelphia, PA, USA, 1979. [Google Scholar]
  34. Alefeld, G.; Herzberger, J. Introduction to Interval Computations; Academic Press: New York, NY, USA, 1983. [Google Scholar]
  35. Tikhonov, A.N.; Arsenin, V.I. Solution of Ill-Posed Problems; John Wiley & Sons: New York, NY, USA, 1977. [Google Scholar]
  36. Turetsky, V. Two Inverse Problems Solution by Feedback Tracking Control. Axioms 2021, 10, 137. [Google Scholar] [CrossRef]
  37. Blanken, L.; Oomen, T. Kernel-based identification of non-causal systems with application to inverse model control. Automatica 2020, 114, 108830. [Google Scholar] [CrossRef]
  38. Boeren, F.; Oomen, T.; Steinbuch, M. Iterative motion feedforward tuning: A data-driven approach based on instrumental variable identification. Control. Eng. Pract. 2015, 37, 11–19. [Google Scholar] [CrossRef]
  39. Nadaraya, E.A. On non-parametric estimates of density functions and regression curves. Theory Probab. Its Appl. 1965, 10, 186–190. [Google Scholar] [CrossRef]
  40. Zhu, L.X.; Fang, K.T. Asymptotics for kernel estimate of sliced inverse regression. Ann. Stat. 1996, 24, 1053–1068. [Google Scholar] [CrossRef]
  41. Härdle, W. Applied Nonparametric Regression; Cambridge University Press: Cambridge, MA, USA, 1990; p. 333. [Google Scholar]
  42. Ali, T.H. Modification of the adaptive Nadaraya-Watson kernel method for nonparametric regression (simulation study). Commun.-Stat.-Simul. Comput. 2022, 51, 391–403. [Google Scholar] [CrossRef]
  43. Silverman, B.W. Density Estimation for Statistics and Data Analysis; Chapman and Hall: London, UK, 1986. [Google Scholar]
Figure 1. The general scheme of closed loop system.
Figure 1. The general scheme of closed loop system.
Axioms 12 00193 g001
Figure 2. The control results in the absence of interference, when the task control is a stepwise impact for variables: (a) x 1 ; (b) x 2 .
Figure 2. The control results in the absence of interference, when the task control is a stepwise impact for variables: (a) x 1 ; (b) x 2 .
Axioms 12 00193 g002
Figure 3. Control variables values: (a) u 1 ; (b) u 2 .
Figure 3. Control variables values: (a) u 1 ; (b) u 2 .
Axioms 12 00193 g003
Table 1. The results of control for various values of the parameters.
Table 1. The results of control for various values of the parameters.
s v 1 v 2 y 1 * y 2 *
200[1.234, 1.690][0.164, 0.492][1.8, 2][2.8, 3]
300[1.184, 1.681][0,032, 0.191][1.8, 2][2.8, 3]
400[1.344, 1.712][0.116, 0.432][1.8, 2][2.8, 3]
500[1.688, 1.808][0.082, 0.152][1.8, 2][2.8, 3]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Arana-Jiménez, M.; Medvedev, A.V.; Chzhan, E. A Nonparametric Dual Control Algorithm of Multidimensional Objects with Interval-Valued Observations. Axioms 2023, 12, 193. https://doi.org/10.3390/axioms12020193

AMA Style

Arana-Jiménez M, Medvedev AV, Chzhan E. A Nonparametric Dual Control Algorithm of Multidimensional Objects with Interval-Valued Observations. Axioms. 2023; 12(2):193. https://doi.org/10.3390/axioms12020193

Chicago/Turabian Style

Arana-Jiménez, Manuel, Alexander V. Medvedev, and Ekaterina Chzhan. 2023. "A Nonparametric Dual Control Algorithm of Multidimensional Objects with Interval-Valued Observations" Axioms 12, no. 2: 193. https://doi.org/10.3390/axioms12020193

APA Style

Arana-Jiménez, M., Medvedev, A. V., & Chzhan, E. (2023). A Nonparametric Dual Control Algorithm of Multidimensional Objects with Interval-Valued Observations. Axioms, 12(2), 193. https://doi.org/10.3390/axioms12020193

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop