1. Introduction and Initial Considerations
1.1. State-of-the Art
Real-world systems are often very complex in nature—inherently nonlinear, highly dimensional, time-varying, and, thus, difficult to model or control within the bounds of classical control theory. The solution can often be found in advanced control, i.e., intelligent control techniques [
1], which have flourished in the last few decades, especially after breakthroughs in formal mathematical tools related to the optimality, stability, and uncertainty of these systems. These techniques found their applications in various areas such as industry [
2], energy [
3], or robotic systems [
4]. We consider the control method “intelligent” if it employs some artificial intelligence computing approach such as neural networks, fuzzy logic, machine learning, evolutionary computation, or genetic algorithms. Intelligent control may often imply combining two or more artificial intelligence tools into hybrid intelligent systems, such as neuro–fuzzy control, which merges human-like reasoning of fuzzy logic with the learning capabilities and connectionist architecture of neural networks. The most valuable approach to intelligent control proved to be the one based on the application of neural networks, due to a combination of their features such as parallel processing, adaptability and power of learning. It has been proven that neural networks with nonlinear differentiable activation functions can approximate arbitrary functions, so that recurrent networks were applied in system identification when a set of input-output data pairs was given. Consequently, neural networks were successfully employed in many control algorithms where an accurate model of the controlled system was needed. Paper [
5] provided a systematic classification of neural-network-based control. The main thesis was that all approaches could be classified into two categories: a neural network which was auxiliary to the main control logic, and a neural network which was the main controller. In [
6], different aspects of stable adaptive neural network control were discussed. Concrete interesting examples of neural network-based control can be found in [
7], where a neural network was applied in the control of the four-rotor helicopter, and [
8], in the case of the autonomous underwater vehicle.
To overcome major drawbacks in using neural networks for modelling and control of dynamical systems (such as setting the optimal neural network structure, initial values, and training method), the authors of this paper experimented intensively with structures called orthogonal polynomial neural networks [
9]: single-layer neural networks based on orthogonal functions (Gegenbauer orthogonal functions [
10] are used in this paper). These structures proved superior to classical neural networks in terms of accuracy/complexity ratio due to their natural in-built optimality [
11]. Designed orthogonal polynomial neural networks were successfully applied in modeling [
12] and control [
13] of dynamical systems.
1.2. Previous Work of the Authors
Further improvement in the architecture of polynomial neural networks can be achieved by introducing an adaptive measure of variability, which can be used to model uncertainties or time-varying behavior of a real-time operating dynamical system. As this algorithm reminded us of a biological process of glands in the human body producing hormones, it was called the endocrine mechanism. The endocrine mechanism allows the network to adapt to the change in environmental factors, sensed disturbances, measurement errors, and occurred uncertainties within dynamic systems. The authors of this paper proposed numerous solutions based on the utilization of orthogonal functions within endocrine neural networks.
Table 1 lists published research, comments regarding their utility value, and contributions to the field of control systems and artificial intelligence.
1.3. Motivation and Argumentation
The motivation for proposing a novel structure in this paper is threefold. First of all, the idea is to offer an intelligent solution capable of treating numerous environmental stimuli, noises and disturbances, and use the acquired insights to improve the control performances of dynamical systems. The second goal is to make further progress in the domain of artificial glands and their integration within neural networks. Specifically, the aim is to propose a new type of artificial gland characterized by a novel mechanism for treating input signals and generating optimal output values. The final objective is to consider and discuss potential principles of networking of proposed artificial glands with the point of providing extensive stimulation capabilities of underlying neural networks and making a proper framework for merging all environmental stimuli in a single network, enabling each of them to create influence on another stimulus (as in the nature). The concluding task is to test the aforementioned research goals in a real-world environment, by controlling a complex nonlinear dynamical system in a laboratory environment. One of the most popular benchmark systems for trying new control algorithms is the magnetic levitation system (MLS). This system aims to keep a ferromagnetic ball levitating in a place (or to follow a desired trajectory) by applying an adequate voltage to an electromagnet. MLS is a highly nonlinear system with open-loop instability and time-varying dynamics, making it ideal for testing complex control algorithms, and will also be used as a case study for testing the performance of our control algorithm in this paper. The system was not time-varying at the time of operation and individual performance recording during one experimental session. Nevertheless, the system shows a time-varying nature in our long-term work with it. In any experimental attempt, when stable levitation of a ferromagnetic ball is not established and the equilibrium state is not reached, the levitation object moves uncontrollably. It hits the electromagnet, causing the ball to fall out of the system environment and suffer potential damage. Since it is a laboratory model, it does not include protective equipment that would prevent damage to electromagnets and levitation balls. As a result, there are long-term changes in performance due to such damages to the mentioned mechanical elements. Comparing the performance of this specific ML system from a few years ago with the moment of writing this paper, the difference of when the identical experimental setup was made is significant. Described time-varying properties of the system strictly enforce this performance difference. Several exciting approaches emerged over time based on applying different neural networks in the control of MLS. In [
20], hybrid control using a recurrent neural network (RNN) was proposed. The controller itself was based on feedback linearization, while RNN functions were used for uncertainty observers. Study [
21] considered a neural adaptive method for position control and identification of MLS. The controller consisted of three parts: PID controller, radial basis function network controller and radial basis function network identifier. On the other hand, a synergistic combination of neural network and sliding mode control was proposed in [
22].
Justification for introducing and using this complex, intelligent structure can be found in the following considerations. Generally speaking, artificial neural networks possess the capability of a precise generalization and show robustness when dealing with disturbances. However, they can have difficulties coping with uncertainties, providing interpretability and identifying the importance of each input signal and environmental stimuli for a related system. The last-mentioned deficiency is one of the crucial reasons for considering the OENN + AOG structure for implementation. If an examined dynamical system is linear or simple nonlinear, it cannot be advised to apply a novel structure. Additionally, if there is a lack of data for building a data set, available information is limited or environmental stimuli are not measurable, it is also better to utilize another simpler intelligent or traditional controller. It is justified to use the proposed OENN + AOG structure when it is required to control a complex nonlinear dynamical system when traditional control logics do not provide satisfying results. Particular interest in using the architecture is when numerous measurable environmental stimuli and disturbances exist and can be sensed. The structure’s benefit is in introducing an unlimited number of such stimuli to the gland network and later to a designed OENN. It was experimentally shown in the previous section that the novel structure can provide representative results (on the level of the similar complex smart solution) but with significantly less computation time. All acquired and processed environmental stimuli provide a significant value to a connected neural network and controlled system. They can help in training free parameters if treated correctly according to the current environmental conditions.
1.4. The Structure of the Paper
The remainder of the paper is structured as follows.
Section 2 presents a novel mechanism of work of artificial glands and the way of processing two different types of input signals.
Section 3 introduces an algorithm for networking artificial glands and explains the approach of integrating such a network within one kind of artificial neural network. A universal method of combining the novel intelligent structure within a dynamical system control logic is provided in
Section 4.
Section 5 introduces a case study based on exploiting a laboratory magnetic levitation system, as the system that will be used for real-time verification of control performances of the proposed system. Finally, in
Section 6 and
Section 7 the experimental results, comparisons of control performances with other already verified control logics and related discussions are presented. In the end, all collected experimental results are used to make generalized conclusions and recommendations toward the applicability of novel intelligent structures.
2. Artificial Orthogonal Gland (AOG) Mechanism
This section will present a novel approach to developing artificial glands where a novel artificial orthogonal gland (AOG) mechanism will be proposed. Additionally, a strategy for building a network of AOGs within a single system will be provided. In previous attempts to develop artificial glands [
14,
15,
16,
23,
24,
25,
26], the main idea was to acquire environmental stimuli, transform them into appropriate input signals and introduce them to the glands. The next part calculated hormone concentration within each gland and introduced calculated values to a neural network. Each gland was mostly independent of other involved glands, covering different environmental stimuli.
The novel approach proposes a gland structure based on some previously established gland features in combination with new in-gland mechanisms. Further networking of single gland units will enable mutual interactions between different hormones and an additional level of adaptability of connected neural networks. In many previous attempts [
14,
15,
16,
23,
24,
25,
26], each environmental stimulus was assumed to be completely independent of other stimuli. However, in practice and real-world industrial applications, one environmental condition could affect other related dependencies and volatility of numerous factors. For example, a strong wind signal can significantly affect a control system’s performance, if mounted outside. Additionally, the wind can directly influence decreasing the system’s temperature, increasing mechanical parts’ vibrations and noise level, etc. From this example, it is evident that one environmental stimulus may affect the values of other stimuli. For that reason, a network of hormones can better represent similar situations, simulate mutual correlations, and put the importance and influence of all included stimuli on a system of interest in proper order. In order to propose upgraded functionalities to an artificial gland, the novel gland architecture is proposed in
Figure 1.
The gland mechanism from
Figure 1 is designed in such a manner to accept two inputs: control input (
CI) and signal input (
SI). The primary purpose of
CI is to control hormone production, and it is not related to any calculation of hormone stimulations.
CI enables optional interactions between different glands and allows hormones to connect and make a network of hormones. The second input to the gland mechanism,
SI, determines the hormone stimulation level within a gland and defines the gland’s influence on the neural network. The AOG mechanism includes four main sub mechanisms: control mechanism, signal mechanism, activation mechanism and hormonal mechanism. All these sub-mechanisms will be presented next.
2.1. Signal Mechanism
The signal mechanism within the gland cell is used to process environmental signals in the form of SIs and stimulate hormone production. Such a mechanism theoretically enables multiple inputs (environmental stimuli) to access the single gland in comparison with the previous proposals of the authors [
17,
18,
19]. The output of the signal mechanism can be calculated as:
In (1), n is the number of signal inputs introduced to the gland, and fSM function is the standard deviation function. This function shows how dispersed the data of included environmental stimuli are concerning the mean. A low standard deviation implies that the data are clustered around the mean, and a high standard deviation shows that the data are more spread out. If a high deviation occurs, the will be prominent and hormone stimulation will be significant. If dispersion between SI signals is low, will be a small integer value, and its influence on the upcoming gland mechanism will be smaller. The rounding part of Equation (1) is performed to prepare to be applicable within the activation mechanism, which requires integer numbers as inputs.
2.2. Control Mechanism
The processing unit of
CI is the control mechanism (
Figure 1).
CI input can have four different states:
- (1)
CI = 0 or CI = 1: zero-state (inhibitory state) is activated when the gland threshold is not achieved (0 ≤ CI < threshold_value), implying a gland not to be active;
- (2)
CI = 2: when the excitatory manner of work is activated;
- (3)
CI = 3: when the negative feedback logic of work is activated;
- (4)
CI = 4: when the positive feedback logic of work is activated.
The state of inhibition (a zero state) is active when the gland threshold is not achieved and results in the output of the control mechanism equal to . The threshold value can be set in accordance with control needs: if there is a requirement to build an extra sensitive system that will register even the most minor environmental disturbances, the threshold_value should be set to a value close to 0. If the requirement is to provide a robust system that will activate glands on some significant disturbances or peak occurrences, the threshold_value should be large.
The following introduced feature is the excitatory feature, active when threshold_value is achieved and can be described with the following equation:
fCM from Equation (2) can be selected as a linear function, bipolar, sigmoid, step, etc. The selection should be made according to a specific use case. The third possibility that the control mechanism can exploit is a negative feedback logic that can be described as follows:
When the negative feedback feature (Equation (3)) is active, the hormone production will be suppressed. The final option is to use the positive feedback logic to calculate
:
When the positive feedback feature (Equation (4)) is active, the hormone production intensifies. Negative (Equation (3)) and positive (Equation (4)) feedback controls should be activated once the specified hormone concentrations are achieved (low or high), and the equilibrium points are reached. Feedback controls can rapidly increase or decrease hormone stimulations, enabling quick responses of artificial glands to changes occurring in the environment. Traditional artificial glands need a significant number of iterations to adjust their parameters to sudden changes in environmental conditions and large volatilities. Negative and positive feedback features can make these adjustments significantly faster.
2.3. Activation Mechanism
The task of the activation mechanism from
Figure 1 is to determine the production of hormone based on
and
. The mechanism processes two signals in each time step, resulting in
:
where
could be implemented in the form of conventional neural network functions such as sigmoidal, linear, step, Gaussian, etc. However, keeping in mind the previous work of the authors of this paper and extensive experience in developing and implementing novel types of orthogonal functions within neural network environments, this research makes an additional effort in implementing
in the form of Gegenbauer orthogonal functions. Gegenbauer functions are orthogonal polynomials defined on the interval [−1, 1]. They generalize Legendre polynomials and Chebyshev polynomials and represent exceptional Jacobi polynomials cases. Gegenbauer polynomials are characterized by rapid convergence rates providing an excellent foundation for building an efficient activation mechanism. Each artificial gland has its row of the orthogonal function: for n glands assigned to the neural network,
n Gegenbaurer functions will be expended from the initial one and be used for each function
(j = 0, 1, 2, …,
n − 1). Gegenbauer polynomials and appropriate mathematical foundation will be introduced in the following subsection.
2.3.1. Gegenbauer (Ultra Sphere) Polynomials
The Gegenbauer polynomials represent a generalization of the classic Legendre polynomials that will be denoted with
, where r is an integer and
λ is a real number. These polynomials represent the exceptional cases of Jacobi polynomials, and the following relation defines them:
By introducing binomial expansion of the left side of Equation (6), the previous equation can be presented as:
If
and
, then
and
k can vary in the range from 0 to [
r], where:
Hence, it can be further concluded:
Finally, by comparing Equation (9) with Equation (6), it can be obtained:
In the second part of proposing a suitable mathematical apparatus, the equality
is specified (for the orthogonality interval), in addition to condition
(as a special case of Jacobi polynomials obtained by setting
). After a complex mathematical calculation from [
27,
28] the following series is obtained:
where symbol
denotes
, i.e.,
. A recurrence relation can be presented as [
28]:
In order to obtain a basis which will represent the functions of activation mechanisms of artificial glands and activation functions of future OENN, the following function is introduced:
Finally, the Gegenbauer series of the polynomials applicable for implementations within the proposed intelligent structure are:
For
n glands, the activation mechanism within each gland (1 to
n) will be calculated as follows:
where
and
are appropriate inputs to the
i-th gland.
2.4. Hormonal Mechanism
The fourth and last component of novel gland architecture is hormonal mechanism, which aims to determine the hormone concentration at each time instance,
t. The mechanism secrets hormone concertation,
Cg, in accordance with the following equations:
Equations (16) and (17) are introduced from [
17], and showed to be an effective method for calculating hormone concentrations of artificial glands. The main difference between the two approaches is in calculating the stimulation parameter
that is based on the introduced
parameter as the main feature that affects the stimulation volatility. The last two unexplained parameters from Equations (16) and (17) are
and
that represent the stimulation rate and the decay rate, respectively. Their values are always limited in the range [0, 1].
3. Implementation of AOG Network Mechanism within OENNs
3.1. Basics of OENNs
The OENN network used in this research is based on the network structure proposed in [
17]. The difference between the two approaches in calculating network output
is in the existence of hormone sensitivity parameter,
. In [
17],
is introduced to apply sensitivity features of excitatory and inhibitory glands. In this novel work, the algorithm is simplified, different gland structure (excitatory and inhibitory glands) does not exist, and only one gland unit (AOG mechanism) is introduced. This setup results in excluding parameter
from the mathematical apparatus, which is performed by making
for all proposed cases. That enables the network to calculate the output as follows:
where
X is the input value,
is the
i-th network weight coefficient,
is the concentration of hormones of a single gland,
is an orthogonal function (processing element),
t is a time instance, and
R(X,m,) represents an expansion error, which is decreasing when the number of expansion terms
m is increasing. The main difference between [
17] and the novel network is building the orthogonal part of the structure. In [
17] Chebyshev functions were used, while in this paper, previously described Gegenbauer functions will be utilized.
3.2. Networking of Glands
A complete structure of networked glands and integration of formed structure within the OENN system is presented in
Figure 2. In the continuation of this section, how the configuration from
Figure 2 is created and what components are introduced for that purpose will be explained.
Three types of glands are proposed in this research to provide optimal responses to various environmental stimuli: a leaf gland, a mediator gland and a root gland. However, the same gland structure and operational algorithm described in the previous section are used in all three cases. The difference between glands is their position in a gland network and is determined based on the layer they individually belong to. In order to propose the gland types, first of all, the classification of all possible environmental stimuli should be made into two categories: primary and secondary environmental stimuli. The primary environmental stimuli are those that can influence the system performance significantly and that are in direct correlation with system responses. Secondary stimuli are considered those whose influence on the system cannot considerably affect the system but are still present in the system’s environment.
A leaf gland directly responds to a secondary environmental stimulus (SES). Suppose an SES signal value is larger than the threshold value specified for that SES: in that case, the leaf gland is activated, the implemented gland mechanism from the previous section is triggered, and the processing signal activities are conducted. On the other hand, if the SES signal is smaller than the threshold, the leaf gland will be in the standby regime, resulting in the signal equal to 0 or 1. The maximal number of leaf glands can be equal to the number of SES. Still, that number could be less than that number, considering that multiple stimuli can be introduced to a single gland cell.
A mediator gland is used to process a primary environmental stimulus (PES) brought to the gland in the form of a
signal. The number of mediator glands is less or equal to the number of identified PESs. Following
Figure 2, additional inputs to every mediator gland are hormone concentration signals (
,
,…,
) introduced from the leaf glands. To accept these signals, a threshold of each mediator gland should be set, and mathematical apparatus from the previous section applied. Unlike leaf glands, mediator glands must always be in the active state. If the leaf glands’ outputs are zero (if SESs do not exist or are negligible), it will not be allowed for mediator glands’
CIs signals to be zero or insufficient to reach the gland threshold. In other words, the mediator glands must always be activated by setting them up to one of the three values:
CI = 2,
CI = 3 or
CI = 4. In order to provide an optimal control input (
CI) to a mediator gland, the Algorithms 1 is implemented.
Algorithms 1 Control input selection process |
Initial setup of all control inputs of the mediator glands is CI = 2 If the current system error () > acceptable error for If sum () ≤ Mediator_Glands_Threshold Then CIMG = 4 (production of hormones is intensified) Else CIMG = 3 (production of hormones is suppressed) Else CIMG = 2 |
Finally, all output signals of the mediator glands (
CIMG1, …,
CIMGe) are introduced to a single root gland (
Figure 2). Contrary to the mediator glands, when the zero state (
CI = 0) is not possible, that option is enabled here. If the sum of (
CIMG1, …,
CIMGe) is not enough to trigger the root gland, all PES and SES influences on a system are negligible, and there is no need for the neural network to receive any information from the AOG network. In that case, the entire AOG network will be in a standby work regime. The final input signal (
SIRG) introduced to the root gland represents the error signal/noise signal generated during the work of a system of interest and represents the main parameter for monitoring the system’s performances and accepting feedback information of the current system response. In addition to the PES signals, SIRG represents the main factor in generating appropriate hormone concentrations that will be used to influence the output layer weights of OENN.
3.3. Training Process
The learning algorithm of OENN is based on the algorithm proposed in [
17], and implementing modified recursive least square (RLS) algorithm for updating network weights and calculating the errors:
where
is an error calculated before the weight update process, and
n represents a sample rate defined by system dynamics. Further:
where
is calculated error after the procedure of updating weights. In accordance with [
17], the idea is that during the time, two errors converge with each other, and in the ideal case their ratio will be:
Finally, the learning rate adaptation algorithm [
17] is also exploited. Its primary purpose is to monitor the error difference sign in two consequent iterations. If the sign changes, the learning rate value decreases. On the contrary, the learning rate will increase if the sign is the same in two iterations.
4. Integration of OENN Structure Powered by AOG Network Mechanism within a Control Logic of a Laboratory Dynamical System
In order to exploit the proposed framework within a real-time laboratory environment, a presentation of the approach of integrating OENN with AOG Network (OENN + AOG) within an existing control logic of a laboratory dynamical system, is required. The implementation approach is presented in
Figure 3.
To implement the structure appropriately, all identified environmental stimuli should be categorized as SES or PES. The number of SES (l) defines the number of leaf glands, where one leaf gland is required for every unique SES. A similar procedure is required for PES, where each identified PES should be represented by one mediator gland. Another implementation option is to use one gland for multiple identified stimuli, but for accuracy and simplicity reasons, one gland per stimulus will be used for further implementation.
After identifying the number of mediator glands (m), it is necessary to create a fully connected network of glands with three layers and with number of nodes, l-m-l. One is the number of nodes in the last layer, and by default, this layer always includes just a single root gland. The output root gland signal should be introduced to the output layer weights of OENN, in accordance with Equation (18). In order to form PES and SES, the relevant sensing equipment is required to collect environmental data, process it, and introduce it to the gland network.
OENN structure should be defined following the number of input signals that should be processed and the number of output control signals that must be introduced to a plant. By default, OENN possesses two inputs introduced from the system of interest: output signal
Y(t) and delayed signal
Y(t − n). If desired performances of the system are not achieved, the number of inputs can be increased, and additional delayed signals can be introduced to the network. Once the number of network inputs and outputs is specified, the number of hidden layer nodes and required setup can be established following the procedure from
Section 2.3.1 and introducing a proper number of Gegenbauer orthogonal functions.
Finally, the generated output(s) of the OENN
X(t) is introduced to the plant input and the input of the developed plant model. The plant model should be a mathematical representation of a real-time system that will accurately describe system dynamics. The model process is in parallel signal
X(t), and represents simulation verification of work of a real plant. In the end, the real-time output of the plant with the model’s output is compared. The error is found and introduced in front of mediator glands as a trigger component for their activation (
Section 3.2).
5. Case Study: Magnetic Levitation System
Inteco magnetic levitation system (MLS) [
29] is an open-loop unstable and nonlinear laboratory system for demonstrating phenomena of magnetic levitation (
Figure 4). Levitation techniques are used in practical applications such as transportation, contactless melting, magnetic bearings, etc. One successful approach of providing optimal control of a multivariable magnetic levitation system is presented in [
30]. In general, the levitation principles are based on digital and analogue solutions to maintain a steel ball in an electromagnetic field. MLS elements are the electromagnet, the hollow steel sphere, the position sensor, and the computer interface board with appropriate connecting cables. It is a single degree of freedom control system with time-varying dynamical characteristics.
For levitation purposes of the laboratory model, a real-time controller is required. In order to keep a levitating sphere in a desirable position, the equilibrium stage of the gravitational and electromagnetic force is required. MLS operates the voltage to the electromagnet to enable the object to levitate where the levitating object’s position is determined in every time step (0.01 s) through the sensor. The measured voltage corresponds to the sphere distance. Additionally, the coil current, as another measurable variable, can help explore control strategies. All control experiments of MLS can be executed in real-time in the Matlab software package, specifically Simulink Toolbox. The software control logic can be used for online identification of processes, modelling of control systems, and all design and simulation phases. Finally, the implementation of new control algorithms can also be performed in real-time.
MLS can be mathematically described in the following format [
29]:
where,
,
,
,
. The electromagnetic force (
) depends on the distance of ball from the electromagnet and the electromagnetic coil current.
6. Experimental Verification
6.1. Control Logics Used for Experimental Comparisons
In order to verify the performance of the proposed model in a laboratory environment, four different control logics were used. The first was a default logic integrated with the MLS model, linear quadratic tracking control logic [
29]. The next was based on dynamical neural networks, the approach that showed significant performance in control of nonlinear dynamical systems [
30]. The third was the OEI controller, the previous proposal of the authors of this paper [
17], a hybrid structure based on the combination of OENN and OEANFIS that showed a significant performance in the control of nonlinear systems weighted with notable disturbances. Finally, the last was the model proposed in this new research, OENN + AOG structure. A brief explanation of all four control logics that will be used in this section, follows.
6.1.1. Linear Quadratic Tracking Control
The default tracking control of the MLS system used in Matlab is Linear Quadratic (LQ) control (
Figure 5). The control and tracking policy is built for an equilibrium point. For each value of a ball position, the current values are recalculated: velocity, coil current and the control signal. All needed information about the mathematical apparatus of LQ controller can be found in [
29].
6.1.2. DNN
DNN structure from [
31] was used for comparative purposes because of its efficient control of linear and non-linear MIMO systems. A modelled system is controlled by the proposed adaptive controller, where the algorithm uses the system outputs to tune the controller’s parameters. Another reason for using DNN intelligent controller is the capability of the mechanism to cancel disturbances utilizing an adaptive filter. Dealing with disturbances was one of the main goals of this overall research paper and the OENN + OAG system, and it would be informative to see a comparison of performances achieved.
6.1.3. OEI Controller
The following reference model is the OENN-OEANFIS structure proposed in [
17]. Instead of Gegenbauer orthogonal functions, the model from [
17] is based on the utilization of Chebyshev functions. Unlike this novel research where the OENN network is used as a basis, in [
17] the hybrid combination of OENN and OEANFIS was exploited. The next difference is the usage of artificial glands, where [
17] used a combination of excitatory and inhibitory glands, while the AOG mechanism is exploited here. The OENN algorithms’ training in both networks is similar, expanded with the learning rate adaptation algorithm. The environmental stimulus used for completing the OEI structure was pressure, measured valuable by utilizing sensing equipment.
6.2. Implementation of Control Logics
LQ control logic was implemented as presented in
Figure 5, and no further modifications were made. All three intelligent control logics were implemented to be as similar as possible in order to be comparable and in accordance with
Figure 3. The similarity was achieved by putting the same number of neurons within three neural networks and selecting the same activation functions and initial values of weights. The number of neurons within the networks was determined empirically by trying the OENN + AOG structure with hidden layer neurons from 2–200. The best performances were achieved with 40 neurons in the hidden layer. The control problem determined the number of neurons in the input layer, and it was two for the MLS problem. The number of output neurons was 1. According to the determined structure, all three intelligent control logic were based on neural networks with the structure 2-40-1.
Finally, the OENN+OEANFIS model and OENN + AOG structure required environmental stimuli for employing artificial glands. For that purpose, the following environmental stimuli were used. For OENN + OEANFIS, there was a single environmental stimulus used, a temperature. The temperature was measured by using an appropriate laboratory sensor. On the other hand, the point of OENN + AOG is to exploit numerous environmental stimuli in order to provide all relevant information to the control logic. To achieve that, the following stimuli were measured: magnetic field detection, temperature, humidity, vibration and the concentration of combustible gas in the air.
The complete process of designing the OENN + AOG structure was performed using the developed algorithm in
Section 2,
Section 3 and
Section 4 of this paper. To integrate the AOG mechanism, it was necessary to classify all determined stimuli into two categories, which was performed in the following way: the strength of magnetic field and vibration were classified as PES (two mediator glands), while temperature, humidity and the concentration of combustible gas were classified as SES (three leaf glands). Summarized, the AOG structure was in the format (3 leaf glands–2 mediator glands–1 root gland). At that moment, the classification of all introduced environmental parameters was a subjective process and was based on human evaluation of available stimuli and determination of their importance. In future work, the authors will try to establish a universal automatic approach for classifying all relevant environmental stimuli in two categories in a reliable manner.
6.3. Learning Phase
The data set exploited to train the proposed structure was created by collecting real-time data from the actual work of the MLS system. The data included a reference signal that determined the desired position of the levitation object in each time step and correlated control signal that the default LQ controller generated. The sample time of exploited signals was pre-defined by the manufacturer of the system; it was 0.01 s by default, and empirically was determined that 10,000 samples per variable (100 s of measurements) would provide optimal results. Following
Figure 3, a minimal number of inputs was two, a specified input signal was
Y(
t), and a delayed version of the specified input signal was
Y(
t −
n). Following that rule, for building the dataset for this research,
Y(
t) represented the position signal, while for the delayed signal
Y(
t −
n) used delayed signal
Y(
t) with a delay equal to 1, or
Y(
t − 1) was formed. The recorded control signal of the default LQ control logic was used as the training output of the neural network. After building the set of training data for performing training processes of the OEI controller, DNN, and OENN + AOG networks, the training set was split in the following manner: 70% of the data were used for the training, 15% for validation purposes, and 15% for test cases. The test set was used to evaluate the performance of the trained model, while the validation set was used to follow the convergence of the error signal toward zero. If the validation error was equal in six consequent iterations, the network could not improve further, and the training stopped.
6.4. Experimental Results
Laboratory verification of the proposed structure was performed on real-time testing using the described MLS model. Tracking performances of four introduced control logics were recorded and compared simultaneously. Performances were evaluated through the position signals that individual control systems achieved and their comparisons with the reference tracking signal were assumed to be optimal. In an ideal case, a tracking position achieved due to a controller should overlap entirely with the reference signal. However, in a real-world environment, this is almost an impossible task.
Figure 6 and
Figure 7 show the tracking performances of four tested control logics.
7. Discussion Section
7.1. Achieved Performances
From
Figure 6, it can be concluded that three of the four control logics successfully performed levitation of the ball for the specified trajectory. Only the DNN control algorithm could not provide tracking performances in multiple repeated experimental cases: the levitation processes stopped in every attempt, and the ball fell to the ball holder (position 0.0128 on the y axis occurred after 3.7 s).
Figure 7 presents an enlarged view of part of
Figure 6 and provides better insight into the difference between the other three tracking signals compared with the reference signal. Finally, the numerical presentation of achieved results from
Figure 6 and summarized results are presented in
Table 2. The root mean square error (RMSE) was used to calculate the overall measure of achieved performances, keeping in mind it was calculated as a difference between the measured output position MLS signal and the desired signal (the reference signal).
As can be concluded from
Table 2, OENN + AOG provided similar results to the OEI controller, with a significant reduction in computational costs expressed through the training time. In comparison to the OEI controller, OENN + AOG provided similar performances with a simpler structure and lower computational costs. Further, the OEI controller was based on the combination of online and offline learning and required special attention for setting OEANFIS. Finally, both examined structures showed smaller RMSE from the default LQ control. As final insights from conducted experiments,
Figure 8 presents control signals generated from appropriate controllers.
Figure 8 also presents unstable control from the DNN controller, resulting in uncontrolled oscillations and levitation process interruption. Additionally, it shows three successful control signals which enabled stable ball levitation.
7.2. Stability Considerations
To analyze the stability of the dynamic system and proposed OENN + AOG control logic, the MATLAB LMI Control Toolbox maybe utilized [
32]. LMI stability criteria are a powerful tool for matrix operations based on linear matrix inequality techniques. If there is a solution to the matrix inequalities, then the MLS system (and the corresponding intelligent control algorithm) can be considered stable [
33]. However, it is challenging to check the algebraic inequality method due to numerous adjustable network and gland parameters considering a lack of information on how to tune these parameters in every new training iteration. The exceedingly complex LMI-based stability results are valuable only for numerical purposes, and the theoretical meaning is deprived.
Further, the stability of OENN + AOG intelligent control without global feedback may be analyzed layer by layer. A single Lyapunov function can be used for each layer and generated in independent runs of the LMI solver. For example, for OENN with one hidden layer without global feedback, it should be determined if the layer is stable. This can be performed by running the LMI solver to solve LMI for this layer alone. If a solution is feasible, the stability of other hidden layers or the output layer can be analyzed. Due to the non-zero connection matrix between the layers and non-zero outputs of the first layer at its equilibrium, biases of the second layer should be adjusted accordingly before invoking the LMI solver for another layer. If a solution for the first layer is not feasible, then nothing can be concluded about the stability of the second layer either. Generalization to cases with more than two layers is straightforward. The LMI method can easily incorporate adjustable parameters into stability criteria and decrease conservativeness. In the sense of total performance evaluation of the desired results, LMI-based results are considered very effective.
Finally, the more complex the stability condition is, the less it will influence physical and theoretical meaning for analysis and will become useful only for numerical purposes. It can further lead to challenges in comparing different conditions among the LMI-based stability results. Therefore, the efficiency of exploited stability conditions can only be compared by means of a specific MLS example and not in a universally analytical way.
7.3. Significance of Proposed Control Logic
The contribution of the proposed algorithm can be described from two perspectives: scientific and engineering. Scientific contribution means making another step forward toward new orthogonal endocrine neural networks that have shown significant control potential in the last few years. It was shown that Gegenbauer orthogonal functions could be as equally efficient as other already utilized functions, creating a suitable environment for further considerations of different functions’ applicability. Further, artificial glands were networked within a single structure (AOG structure) and tuned in a manner to treat appropriate environmental stimuli with different levels of importance for a controlled system. Finally, a new type of artificial glands was proposed, with a specially tailored mechanism within it. It was shown that such a gland could be made as an adjustable unit, where each sub-mechanism could be tailored for a specific use case. Conversely, the engineering and practical contribution of the research is in the potential to implement a similar structure for control of real-world nonlinear dynamic systems. A significant value resides in the possibility of processing an unlimited number of environmental signals, noises and disturbances with the proposed architecture. The structure is upgradable, and every new noticed environmental factor can be easily added to the existing OENN + AOG control logic. That way, the control signal in each time step is modified based on the current system’s factors and environmental factors, enabling robust control and stable system work.
8. Conclusions
This paper presents a novel intelligent structure to control nonlinear dynamical systems. The structure is based on combining one type of orthogonal neural network (ONN) with a specially developed artificial orthogonal gland (AOG) network. The ONN used in the research is a three-layer network where the main, hidden layer is built from a series of Gegenbauer orthogonal polynomials. Each neuron has its unique polynomial. The output signals of AOG network influence output layer weights of the network. This network is a three-layer network built by combining multiple artificial glands within its structure. The gland used in this research is a new gland model, especially developed to optimize the process of collecting and processing environmental stimuli.
The AOG mechanism of a single gland includes four sub mechanisms: control, signal, activation and hormonal mechanism. This concept aims to acquire control and signal input to the gland, process it through these four sub mechanisms and generate a calculated hormone concentration. In order to enable the processing of a large number of environmental stimuli, the AOG network is made by combining single AOG mechanisms within one unit. To achieve that, the AOG network is organized into three layers: leaf layer, mediator layer and root layer. Leaf and mediator layer glands receive the input signals from the environment in the following way. First, environmental stimuli are manually divided into primary environmental stimuli (PES) and secondary environmental stimuli (SES). PES are stimuli of primary interest and importance for the network and system control. At the same time, SES includes signals that do not influence the system dynamics significantly, but still exist in the environment and could be counted as relevant.
Further, the leaf layer receives SES signals and possesses the number of leaf glands equal to the number of introduced SES stimuli. The number of mediator glands is equal to the number of PES signals. Leaf glands and mediator glands are fully connected, meaning that each leaf gland is connected to every mediator gland. Finally, all the outputs of mediator glands are introduced to the single root gland.
In order to verify the proposed intelligent structure, the laboratory model of the magnetic levitation system (MLS) was used as a complex nonlinear dynamical system. ML is characterized by significant sensitivity to disturbances and environmental conditions, and it is an excellent challenge for testing new control logics. Three additional control logics were exploited for verification purposes: default linear quadratic (LQ) tracking control logic, dynamical neural network (DNN) and the OENN + OEANFIS structure proposed in the previous research by the authors of this paper. The goal of the control systems was to track the referenced levitation signal and maintain the still ball levitation in specified positions. The OENN + AOG structure showed tracking performances similar to the more complex OENN + OEANFIS structure but with significantly less computation cost and training effort needed to achieve these results. The new structure, that relies on acquiring and processing all measurable environmental stimuli, was useful for controlling nonlinear systems and presenting the foundation for building intelligent control logic in a costless manner. Future research attempts will show its potential for the control of other linear and nonlinear systems and making an automatized approach for classifying primary and secondary environmental stimuli and selecting the proper structure of the AOG network.
Author Contributions
Conceptualization and methodology, M.M.; MATLAB software, M.M. and M.S.; formal analysis, S.N. and A.D.; resources, S.N. and A.O.; data curation, writing—original draft preparation, M.M. and A.O.; writing, A.D. and M.M.; visualization, A.D. and M.S.; supervision, S.N. and M.M.; project administration and funding acquisition, A.O. All authors have read and agreed to the published version of the manuscript.
Funding
The authors disclosed the receipt of the following financial support for the research, authorship, and/or publication of this article: This research received funding from the Shift2Rail Joint Undertaking under the European Union’s Horizon 2020 research and innovation program under Grant Agreement No. 881784.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
Not applicable.
Acknowledgments
This work has been supported by the Ministry of Education, Science and Technological Development of the Republic of Serbia. Special thanks to Serbian Railways Infrastructure, and Serbia Cargo for their support in conducting the SMART and SMART2 OD field tests.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Liu, J. Intelligent Control Design and MATLAB Simulation; Tsinghua University Press: Beijing, China; Springer Nature: Singapore, 2018. [Google Scholar] [CrossRef]
- Lu, Y.-Z. Industrial Intelligent Control: Fundamentals and Applications; John Wiley & Sons: Hoboken, NJ, USA, 1996. [Google Scholar]
- Dounis, A. Intelligent Control in Energy Systems. Energies 2019, 12, 3017. [Google Scholar] [CrossRef] [Green Version]
- Behera, L.; Kumar, S.; Patchaikani, P.K.; Nair, R.R.; Dutta, S. Intelligent Control of Robotic Systems; CRC Press: Boca Raton, FL, USA, 2020. [Google Scholar]
- Agarwal, M. A systematic classification of neural-network-based control. IEEE Contr. Syst. Mag. 1997, 17, 75–93. [Google Scholar] [CrossRef]
- Chang, S.S.G.; Tong, C.H.; Zhang, H.L.T. Stable Adaptive Neural Network Control; Springer: New York, NY, USA, 2002. [Google Scholar]
- Dunfied, J.; Tarbouchi, M.; Labonte, G. Neural network based control of a four rotor helicopter. In Proceedings of the 2004 IEEE International Conference on Industrial Technology, Hammamet, Tunisia, 8–10 December 2004; pp. 1543–1548. [Google Scholar] [CrossRef]
- Ishii, K.; Fujii, T.; Ura, T. An on-line adaptation method in a neural network based control system for AUVs. IEEE J. Ocean. Eng. 1995, 20, 221–228. [Google Scholar] [CrossRef]
- Oh, S.-K.; Pedrycz, W.; Park, B.-J. Polynomial neural networks architecture: Analysis and design. Comput. Electr. Eng. 2003, 29, 703–725. [Google Scholar] [CrossRef]
- Belmehdi, S. Generalized Gegenbauer orthogonal polynomials. J. Comput. Appl. Math. 2001, 133, 195–205. [Google Scholar] [CrossRef] [Green Version]
- Antić, D.; Danković, B.; Nikolić, S.; Milojković, M.; Jovanović, Z. Approximation based on orthogonal and almost orthogonal functions. J. Franklin I. 2012, 349, 323–336. [Google Scholar] [CrossRef]
- Milojković, M.; Milovanović, M.; Nikolić, S.; Spasić, M.; Antić, A. Designing optimal models of nonlinear MIMO systems based on orthogonal polynomial neural networks. Math. Comp. Model. Dyn. 2021, 27, 242–262. [Google Scholar] [CrossRef]
- Milojkovic, M.T.; Djordjevic, A.D.; Peric, S.L.; Milovanovic, M.B.; Peric, Z.H.; Dankovic, N.B. Model predictive control of nonlinear MIMO systems based on adaptive orthogonal polynomial networks. Electron. Elektrotech. 2021, 27, 4–10. [Google Scholar] [CrossRef]
- Milojkovic, M.; Antić, D.; Milovanovic, M.; Nikolić, S.S.; Peric, S.; Almawlawe, M. Modeling of Dynamic Systems Using Orthogonal Endocrine Adaptive Neuro-Fuzzy Inference Systems. J. Dyn. Syst. Meas. Control 2015, 137, 091013. [Google Scholar] [CrossRef] [Green Version]
- Peric, S.L.; Antic, D.S.; Milovanovic, M.B.; Mitic, D.B.; Milojkovic, M.T.; Nikolic, S.S. Quasi-Sliding Mode Control With Orthogonal Endocrine Neural Network-Based Estimator Applied in Anti-Lock Braking System. IEEE/ASME Trans. Mechatron. 2015, 21, 754–764. [Google Scholar] [CrossRef]
- Milovanović, M.B.; Antić, D.S.; Milojković, M.T.; Nikolić, S.S.; Perić, S.L.; Spasić, M.D. Adaptive PID control based on orthogonal endocrine neural networks. Neural Netw. 2016, 84, 80–90. [Google Scholar] [CrossRef] [PubMed]
- Milovanovic, M.B.; Antic, D.S.; Milojkovic, M.T.; Spasic, M.D. Adaptive Control of Nonlinear MIMO System With Orthogonal Endocrine Intelligent Controller. IEEE Trans. Cybern. 2020, 52, 1221–1232. [Google Scholar] [CrossRef]
- Milovanović, M.; Antić, D.; Nikolić, S.S.; Perić, S.; Milojković, M.; Spasić, M. Neural Network Based on Orthogonal Polynomials Applied in Magnetic Levitation System Control. Electron. Electr. Eng. 2017, 23, 24–29. [Google Scholar] [CrossRef] [Green Version]
- Milovanović, M.; Antić, D.; Milojković, M.; Nikolić, S.S.; Spasić, M.; Perić, S. Time series forecasting with orthogonal endocrine neural network based on postsynaptic potentials. J. Dyn. Syst. Meas. Control. 2017, 139, 041006. [Google Scholar] [CrossRef]
- Lin, F.-J.; Shieh, H.-J.; Teng, L.-T.; Shieh, P.-H. Hybrid controller with recurrent neural network for magnetic levitation system. IEEE T. Magn. 2005, 41, 2260–2269. [Google Scholar] [CrossRef]
- Hajimani, M.; Gholami, M.; Dashti, Z.A.S.; Jafari, M.; Shoorehdeli, M.A. Neural adaptive controller for magnetic levitation system. In Proceedings of the Iranian Conference on Intelligent Systems, Bam, Iran, 4–6 February 2014; pp. 1–6. [Google Scholar] [CrossRef]
- Phuah, J.; Lu, J.; Yasser, M.; Yahagi, T. Neuro-sliding mode control for magnetic levitation systems. In Proceedings of the IEEE International Symposium on Circuits and Systems, Kobe, Japan, 23–26 May 2005; pp. 5130–5133. [Google Scholar] [CrossRef]
- Chen, D.; Wang, J.; Zou, F.; Yuan, W.; Hou, W. Time series prediction with improved neuro-endocrine model. Neural Comput. Appl. 2014, 24, 1465–1475. [Google Scholar] [CrossRef] [Green Version]
- Sauze, C.; Neal, M. Artificial endocrine controller for power management in robotic systems. IEEE Trans. Neural Netw. Learn. Syst. 2013, 24, 1973–1985. [Google Scholar] [CrossRef]
- Timmis, J.; Neal, M.; Thorniley, J. An adaptive neuro-endocrine system for robotic systems. In Proceedings of the IEEE Workshop on Robotic Intelligence in Informationally Structured Space, Nashville, TN, USA, 30 March–2 April 2009; pp. 129–136. [Google Scholar] [CrossRef]
- Yang, S.-S.; Tseng, C.-S. An orthogonal neural network for function approximation. IEEE Trans. Syst. Man Cybern. B Cybern. 1996, 26, 779–785. [Google Scholar] [CrossRef]
- Szegö, G. Orthogonal Polynomials, American Mathematical Society; Colloquium Publications: RI, USA, 1975. [Google Scholar]
- Mastroianni, G.; Milovanović, G. Interpolation Processes-Basic Theory and Applications; Springer: Berlin/Heidelberg, Germany, 2008. [Google Scholar] [CrossRef]
- Magnetic Levitation System—User’s Manual. Available online: www.inteco.com.pl (accessed on 11 May 2021).
- Majewski, P.; Pawuś, D.; Szurpicki, K.; Hunek, W.P. Toward Optimal Control of a Multivariable Magnetic Levitation System. Appl. Sci. 2022, 12, 674. [Google Scholar] [CrossRef]
- Plett, G.L. Adaptive inverse control of linear and nonlinear systems using dynamic neural networks. IEEE Trans. Neural Netw. 2003, 14, 360–376. [Google Scholar] [CrossRef] [Green Version]
- Safonov, M.G.; Limebeer, D.J.N.; Chiang, R.Y. LMI Control Toolbox User’s Guide; Mathworks, Inc.: Natick, MA, USA, 1995. [Google Scholar]
- Zhang, H.; Wang, Z.; Liu, D. A Comprehensive Review of Stability Analysis of Continuous-Time Recurrent Neural Networks. IEEE Trans. Neural Netw. Learn. Syst. 2014, 25, 1229–1262. [Google Scholar] [CrossRef]
| Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).