Data-Driven MPC Scheme for Inertial Platform with Uncertain Systems Against External Vibrations

Zhao, Junhu; Yang, Qifan; Li, Huiping

doi:10.3390/electronics13244945

Open AccessArticle

Data-Driven MPC Scheme for Inertial Platform with Uncertain Systems Against External Vibrations

by

Junhu Zhao

^1,2,

Qifan Yang

^3,*

and

Huiping Li

³

¹

School of Instrument Science and Opto-Electronic Engineering, Beihang University, Beijing 100191, China

²

Beijing Institute of Aerospace Control Devices, Beijing 100854, China

³

School of Marine Science and Technology, Northwestern Polytechnical University, Xi’an 710072, China

^*

Author to whom correspondence should be addressed.

Electronics 2024, 13(24), 4945; https://doi.org/10.3390/electronics13244945

Submission received: 13 November 2024 / Revised: 29 November 2024 / Accepted: 9 December 2024 / Published: 16 December 2024

(This article belongs to the Special Issue Multi-UAV Systems and Mobile Robots)

Download

Browse Figures

Versions Notes

Abstract

For inertial platforms with unknown model parameters and internal information, traditional model-free controllers fail to resist external vibrations solely based on the platform gyroscope, deteriorating the performance of inertial platforms. Therefore, we apply the light gradient boosting machine (LightGBM) to identify an end-to-end platform model, followed by proposing a data-driven MPC scheme to improve the control performance. Furthermore, an expectation maximization (EM) method is designed to solve the optimization problems with non-differentiable identification models, which are challenges for the traditional gradient descent-based optimizer. In addition, an adaptive compensation strategy is designed for generalizing the data-driven control scheme to different external vibrations. Finally, experimental results demonstrate the feasibility, efficacy, and generalization ability of the proposed method.

Keywords:

model predictive control; data-driven control; uncertain systems; inertial platform

1. Introduction

The inertial platform plays an enabling role in navigation systems of various vehicles and robots [1,2], where reliability and safety are crucial indicators for evaluating its quality. However, the performance of the inertial platform is affected by many external factors, such as friction torque, cable flexibility torque, and unbalanced torque. Therefore, numerous research efforts have been made to the stabilization loop of the inertial platform for resisting external vibrations and maintaining system stability within the inertial space [3,4]. By virtue of the parameter adaptation methodology, Dey et al. proposed a fuzzy PID controller to improve the performance of a three-axis inertial stabilized platform [3]. Furthermore, Liu et al. designed a backstepping terminal sliding mode control law with an extended state observer for an inertial platform stability loop [4]. However, the aforementioned works have sufficient observable information and prior knowledge of system models. For the inertial platform considered in this paper, its internal model is unknown, and the observable data only include gyroscope outputs of the platform and base, which can be utilized to calculate the platform angles and the external vibrations. The platform will vibrate with external vibrations. The control objective of the stabilization loop is to make the platform angle as close to zero as possible. Since the inertial platform is a complex system, traditional model-free controllers fail to keep the platform stable solely based on the gyroscope output. Recently, model predictive control (MPC) has attracted increasing interest concurrently from academia because of its optimal control performance and the ability to tackle various system constraints [5,6]. Due to these distinctive features, MPC can be utilized to regulate the inertial platform against external vibrations.

In the MPC scheme, the precise dynamic model is necessary for guaranteeing optimal control performance. However, most complex systems have a difficulty in establishing mathematical models precisely, resulting in uncertain parameters or partially unknown models. As an effective tool to tackle external disturbances, the robust MPC scheme has successfully been applied to many applications for various real robots [7,8]. Sun et al. proposed two RMPC schemes, including a tube-MPC scheme and a nominal robust MPC scheme, for tracking unmanned ground vehicles with input constraints and bounded disturbances [7]. To handle state and control input constraints of USVs, a novel robust model predictive control-based controller is designed for collision-free formation navigation [8]. Considering the controller design for unknown models, some RMPC schemes, e.g., the min-max MPC and tube-based MPC, can deal with system uncertainties by regarding them as disturbances [9,10]. Moreover, sufficient conditions for ensuring the stability of the closed-loop system have been established and discussed in [11,12], which lay a promising theoretical foundation of developing RMPC. Owing to the prediction mechanism, MPC has the ability to learn the parameter uncertainty according to the input and output data [13]. Extending this idea, many learning MPC schemes are proposed to identify uncertain parameters and models online [14,15].

However, these algorithms require prior knowledge of the internal model, which fails to solve the control problem of the encapsulated inertial platform since its internal information is unobservable [16]. How to design robust control algorithms against external vibrations, parametric uncertainties, and environment disturbances when the model is unknown is still a challenge. For the complex inertial platform, it is difficult to obtain a mathematical model according to traditional identification methods. Due to their powerful learning capabilities, deep learning or gradient boosted decision trees can accurately calculate system models, e.g., LSTM network and LightGBM [17,18]. By taking advantage of low computational complexity, we propose a LightGBM-based system identification algorithm for an unobservable inertial platform. Moreover, most MPC schemes require the system update function and cost function to be differentiable and convex, in which the optimization problem is usually solved by the gradient descent algorithm. Since the identification model is non-differentiable and lacks interpretability, traditional MPC algorithms fail to solve the control problem with the identification model of the inertial platform. To handle this issue, Liang et al. proposed an economic model predictive control framework to solve the USV control problem with non-convex cost functions by virtue of the importance sampling method [19].

Therefore, this paper proposes a data-driven MPC scheme with a system identification algorithm, an expectation–maximization (EM) method and an importance sampling method for uncertain systems and external vibrations. The main contributions of this paper are as follows. (1) After identifying the system model of the inertial platform according to LightGBM, a novel data-driven MPC scheme with an EM method is designed for robust control of the high dimensional end-to-end model. Compared with traditional MPC requiring differentiable and convex models, the proposed method expands the application of MPC to more complex system models. (2) Furthermore, the Lagrange duality theory is applied to deal with the constraint satisfaction issue during optimization, which ensures the practical feasibility of the data-driven MPC scheme. (3) Finally, the adaptive compensation strategy is designed for external vibrations to improve the generalization ability of the proposed method. The experimental results show that the proposed method can better control the inertial platform against external vibrations compared to the PID controller, which verifies its effectiveness and advantage for the stabilization loop of the inertial platform.

Notations: The symbols of all real numbers and natural numbers are denoted by

R

and

N

, respectively.

N_{[a, b]} = {x \in N : a \leq x \leq b}

. Given a vector

x \in R^{n}

, its

Q

-weighted norm is given by

{∥ x ∥}_{Q} = \sqrt{x^{T} Q x}

. Let

col (a_{1}, a_{2}, \dots, a_{m}) = {[a_{1}^{T}, a_{2}^{T}, \dots, a_{m}^{T}]}^{T}

denote the column notation.

The remainder of this paper is organized as follows. Section 2 describes the system identification algorithm and corresponding control problem for the inertial platform. Since then, the data-driven MPC scheme with the EM method and adaptive compensation strategies are presented in Section 3. Finally, the efficacy of the data-driven MPC algorithm is verified via numerical examples in Section 4, followed by the conclusion in Section 5.

2. Problem Formulation

As shown in Figure 1, the inertial platform is regulated by a stabilization loop to maintain the stability of the inertial space, where two gyroscopes are installed on the inertial platform and its base, and the external vibration will drive the platform to rotate. The base gyroscope measures the relative angular velocity between the external vibration and the platform, and the platform gyroscope measures the angular velocity of the platform. Therefore, the external vibration can be obtained by summing the outputs of the two gyroscopes. The controller in the stabilization loop calculates control signals of the motor according to the gyroscope output, followed by regulating the platform angle as close to zero as possible. However, for the encapsulated inertial platform, the internal information cannot be observed, and the robust control scheme of the inertial platform is designed by virtue of the output of the platform gyroscope and base gyroscope for the PWM value. Therefore, we regard the motor and inertial platform as a combined system with unknown models. Let PWM values denote the control signals, with the system output being set as the observed platform angular velocity. Then, we assume that the dynamics of the combined system are derived as follows:

\bar{v} (k + 1) = f_{a} (\bar{v} (k), u (k)),

(1)

where

\bar{v} (k)

is the nominal angular velocity of the platform;

u (k)

is the control signal of the motor. However, it is difficult to obtain the nominal angular velocity

\bar{v} (k)

. The observed angular velocity

v (k + 1)

is given by

v (k + 1) = \bar{v} (k + 1) + ρ (k + 1),

(2)

where the observed angular velocity includes uncertain disturbances

ρ (k + 1)

from the system friction, unbalanced torque and cable flexibility torque. For the inertial platform against external vibrations, the disturbances will change with the vibration frequency and amplitude. Moreover, the system is subject to control input constraints

u (k) \in U = {u | B u \leq g}

, where

B = {[I_{n_{u}}, - I_{n_{u}}]}^{T}

,

I_{n_{u}}

is the identity matrix,

n_{u}

is the control input number,

g = {[\hat{u}, \hat{u}]}^{T}

, and

\hat{u}

is the upper bound of the control input.

Due to the complex model and joint effect of the motor and the platform, the mapping function between the control signal and the platform angular velocity is difficult to obtain by traditional identification algorithms. Therefore, we sample multiple PWM values and collect corresponding gyroscope outputs to build a dataset. Since then, a LightGBM-based identification method for the inertial platform is proposed, where the LightGBM is an efficient machine learning algorithm based on gradient boosted decision trees. By utilizing the gradient-based one-side sampling and exclusive feature bundling methods, the large number of data instances and large number of features are tackled by the LightGBM method, while reducing the computational burden and memory consumption. Therefore, the identification model-based update function is given by

v (k + 1) = f_{a, θ} (v (k), u (k))

, where

θ

is the learned complex mapping relationship between the PWM values and corresponding angular velocities of the platform.

To achieve the control objective, the optimization problem is formulated as follows:

\begin{matrix} min_{u (k)} J (x (k), x_{r} (k), u (k)) = \sum_{n = 0}^{N - 1} l (x (k + n | k), x_{r} (k + n | k), u (k + n | k)) \\ s . t . v (k + n + 1 | k) = f_{a, θ} (v (k + n | k), u (k + n | k)) \\ x (k + n + 1 | k) = x (k + n | k) + v (k + n | k) δ \\ u (k + n | k) \in U, n \in N_{[0, N - 1]}, \end{matrix}

(3)

where

x_{r} (k)

is the rotation angle of the platform due to external vibrations, and

δ

is the control interval. We have

x_{r} (k) = γ V_{r} (k)

,

γ

is the attenuation factor, and

V_{r} (k)

is the external vibration.

x (k)

is the rotation angle of the platform regulated by the motor against external vibrations.

u (k) = col (u (k | k), u (k + 1 | k), \dots, u (k + N - 1 | k))

,

x (k) = col (x (k | k), x (k + 1 | k), \dots, x (k + N - 1 | k))

, and

x_{r} (k) = col (x_{r} (k | k), x_{r} (k + 1 | k), \dots, x_{r} (k + N - 1 | k))

. N is the prediction horizon, and the stage cost is formulated as

l (x (k + n | k), x_{r} (k + n | k), u (k + n | k)) = {∥x (k + n | k) + x_{r} (x + n | k)∥}_{Q}^{2} + {∥u (k + n | k)∥}_{P}^{2}

.

Q

and

P

are positive definite matrices.

3. Data-Driven MPC Framework

In this section, the data-driven MPC for the stability of the inertial platform with uncertain systems and external vibrations is proposed to solve the problem (3). First, a Monte Carlo expectation–maximization (MC-EM) method is utilized to calculate the optimal solution. Then, a dual optimization problem is designed to handle system constraints. Finally, the implementation process and generalization strategy of the proposed method are presented.

3.1. Foundation of the EM-Based MPC

Instead of mathematical models in traditional MPC schemes, the identification model calculated by LightGBM is non-differentiable and inexplicable; thus, the gradient descent algorithm fails to solve the optimization problem (3). To tackle this issue, an EM-based MPC framework is proposed for the inertial platform. Extending to the idea in [20], a binary reward event

R = 1

is defined as the observed variable. Let

p (R | u)

denote the probability of this reward event, which is proportional to the state cost. The maximum likelihood problem is formulated as

max_{ζ} log p_{ζ} (R = 1) = log \int_{u} p (R ∣ u) p_{ζ} (u) d u,

(4)

where

ζ

is the control input distribution, such that

ζ = [μ, Σ]

.

μ

and

Σ

are the values of the mean and covariance of the control policy distribution, respectively. To solve this problem, we apply the EM algorithm, and the maximum likelihood problem is transformed to

log p_{ζ} (R = 1) = Γ_{ζ} (q (u)) + D (q (u), p_{ζ} (u ∣ R)),

(5)

where

q (u)

is a variational distribution, and

D (\cdot)

is the Kullback–Leibler (KL) divergence between

q (u)

and the reward-weighted trajectory distribution

p_{ζ} (u ∣ E)

. Since the KL divergence is larger than 0,

Γ_{ζ} (q (u))

is the lower bound of

log p_{ζ} (R = 1)

.

In the EM algorithm, the optimal control policy is calculated by alternating an expectation step and a maximization step. In the expectation step, to minimize the KL divergence, one gets

q (u) = p_{ζ} (u ∣ E) \propto p (E ∣ u) p_{ζ} (u)

. In the maximization step, policy parameters are updated by maximizing the following likelihood function:

\begin{matrix} ζ^{*} = & arg max_{ζ} \sum_{m}^{M_{s}} p (E ∣ u_{m}) log p_{ζ} (u_{m}) \\ = & arg max_{ζ} \sum_{m}^{M_{s}} c_{m} log p_{ζ} (u_{m}), \end{matrix}

(6)

where

M_{s}

is the sample number, and each u is weighted based on the optimization cost in (3). Furthermore, a weight function is designed to increase the weight of the sample with a low cost value, which is given by

c_{m} = f_{c} (S_{m}) = \frac{1}{1 + e^{ϕ S_{m}}},

(7)

S_{m} = \{\begin{matrix} Sort (l (x_{m}, x_{r}, u_{m})), & i \in E_{m} \\ inf, & i \notin E_{m} \end{matrix},

(8)

where

S_{m}

is the order value of the i-th sample in the sorted cost sequence. Furthermore, the elite set is

E_{m} = {n | l (x_{n}, x_{r}, u_{n}) \leq β, n \in N_{[1, M_{s}]}}

, and

β

is the

i_{β}

-th cost value in the sorted vector, where

i_{β} = α M_{s}

and

α \in R_{(0, 1]}

.

α

and

ϕ

are tuning parameters. In addition, the mean and covariance are updated according to the following equations:

\begin{matrix} μ = & \frac{(\sum_{m = 1}^{M_{s}} c_{m} u_{m})}{(\sum_{m = 1}^{M_{s}} c_{m})}, \\ Σ = & H \sum_{m = 1}^{M_{s}} c_{m} (u_{m} - μ) {(u_{m} - μ)}^{T}, \\ H = & \frac{\sum_{m = 1}^{M_{s}} c_{m}}{{(\sum_{m = 1}^{M_{s}} c_{m})}^{2} - \sum_{m = 1}^{M_{s}} {(c_{m})}^{2}} . \end{matrix}

(9)

We repeat the above steps until the update number reaches the upper bound or the state cost is less than a designed threshold. Since then, the control policy distributions of the next N steps are updated in turn, and the mean vector is regarded as the control input.

3.2. Dual Optimization Algorithm

Traditional EM-based algorithms use the unconstrained cost value as the weight to update the control policy, which may violate constraints with a certain probability. To handle this issue, the Lagrange duality theory is designed to deal with the system constraints, and the dual loss function is derived as follows:

\begin{matrix} l (x, x_{r}, u, v) = l (x, x_{r}, u) + max (C (u), 0) \end{matrix}

(10)

where v is the dual variable associated with the constraint

C (u) = B u - g

. Then, the cost function

l (x, x_{r}, u, v)

is applied to calculate the weight in (7), and

v^{*} = arg max_{v} min_{ζ} \sum_{m = 1}^{M_{s}} l (x_{m}, x_{r}, u_{m}, v) .

(11)

Therefore, the control input and state constraints are satisfied in the EM update process according to the dual optimization problem. The calculation process of the EM-based MPC scheme is as shown in Algorithm 1, where

M_{I}

is the maximum update number.

Algorithm 1 EM-based MPC.

Require:

x_{r}

,

x

, identification system, initial dual updated step size

{β_{l, 0}, β_{l, 1}, \dots}

, initial control policy distribution

N (μ_{0}, Σ_{0})

;

1:: for $i \leq M_{I}$ do
2:: Sample $M_{s}$ variables from $N (μ_{i - 1}, Σ_{i - 1})$ .
3:: Expectation:
4:: Calculate the stage cost of $M_{s}$ samples and sort them for $S_{m}$ .
5:: Select the top $α$ of the samples as elite samples.
6:: Calculate the weight of each elite sample.
7:: Maximization:
8:: $ζ^{*} = arg {max}_{ζ} \sum_{m}^{M_{s}} c_{m} log p_{ζ} (u_{m})$ .
9:: Update the control policy distribution $N (μ_{i}, Σ_{i})$ by partial elite samples.
10:: Update the Lagrangian dual variable $v_{i} \leftarrow v_{i} + β_{l, i} \sum_{k = 1}^{M_{s}} max (C (u (k)), 0)$ .
11:: end for

3.3. Implementation and Generalization Strategy

One of the keys to the development of the data-driven controller is its generalization ability. For the inertial platform, the change of external vibrations will lead to complex system disturbances, deteriorating the accuracy of the identification model. For generalizing the identification system to different external vibrations, we design a polynomial regression algorithm to build a mapping function between different vibration signals and compensation distributions. Let

V_{r, n} (t) = A_{n} sin (2 π w_{n} t)

denote the vibration signal, where

A_{n}

and

w_{n}

are the amplitude and frequency of the vibration signal

n \in N_{[1, N_{v}]}

, and

N_{v}

is the number of different vibration signals. The compensation value is the deviation between the real value of the platform angular velocity in the dataset and the identification value calculated by the identification model. First, we analyze the identification model and extract the compensation distributions

{N (μ_{e}^{1}, Σ_{e}^{1}), N (μ_{e}^{2}, Σ_{e}^{2}), \dots, N (μ_{e}^{N_{e}}, Σ_{e}^{N_{e}})}

of the samples in the dataset, where a series of Gaussian distributions with different means and variances are utilized to describe compensations for the output of the identification model. For example, the identification model is trained by the vibration

V_{r, 1} (t)

, and then the compensation distribution is utilized to compensate this identification model for resisting other vibrations.

N_{e}

is the distribution number that can describe the compensations of different PWM values. Assume that the amplitude and frequency of the vibration signal at the current time are

A (k)

and

w (k)

. Then, the polynomial regression algorithm is applied to find the adaptive compensation distributions that cater to vibration features, such that

\begin{matrix} μ_{e}^{n_{e}} (A (k), w (k)) = & K_{μ}^{n_{e}} \cdot {[b_{A, μ} (A (k)), b_{w, μ} (w (k))]}^{T}, \\ Σ_{e}^{n_{e}} (A (k), w (k)) = & K_{Σ}^{n_{e}} \cdot {[b_{A, Σ} (A (k)), b_{w, Σ} (w (k))]}^{T}, \end{matrix}

(12)

where

K_{μ}^{n_{e}}

and

K_{Σ}^{n_{e}}

are coefficient matrices calculated by the polynomial regression method.

b_{A, μ} (A (k))

,

b_{w, μ} (w (k))

,

b_{A, Σ} (A (k))

, and

b_{w, Σ} (w (k))

are polynomial spaces of vibration features, where the polynomial spaces include the first-order terms, second-order terms and constant terms. For more complex systems, other basis functions can be applied to supplement the polynomial space according to real conditions. Since then, the compensation distributions are converted to

N (μ_{e}^{n_{e}} (A (k), w (k))

,

Σ_{e}^{n_{e}} (A (k), w (k)))

,

n_{e} \in N_{[1, N_{e}]}

, which vary in accordance with external vibrations.

To ensure that the angle of the platform is close to zero, we design a compensation strategy. After performing spectrum analysis on the vibration signal to obtain its vibration amplitude and frequency, the corresponding compensation distribution is obtained through an adaptive compensation identification model. Then, we sample N compensation values, followed by compensating the collected gyroscope output. Finally, the specific steps are described in Algorithm 2. It is worth noting that

N_{e}

is an empirical parameter. It is obtained by comparing the variances of the Gaussian distribution sequence under different

N_{e}

, and the one with the smallest variance value is selected as the optimal distribution number.

Algorithm 2 Data-driven MPC framework.

Require: Dataset

D_{L}

including external vibrations, PWM values, and corresponding angular velocities of the inertial platform;

1:: Offline:
2:: Identify the combined system model based on the LightGBM for the main vibration.
3:: Use the K-Means algorithm to divide PWM values in the dataset into $N_{e}$ categories.
4:: Calculate the deviations between the actual values in the dataset and the outputs of the identification model.
5:: Divide the deviations into $N_{e}$ categories based on the PWM values and corresponding categories.
6:: Consider deviations as compensations between other vibrations and the main vibration in the dataset, and calculate the mean and variance of deviations for the compensation distributions ${N (μ_{e}^{j}, Σ_{e}^{j}), j \in N_{[1, N_{e}]}}$ based on the $N_{e}$ categories.
7:: Obtain the correlation function between the compensation distributions and vibration features.
8:: Online:
9:: for each $k = 0, 1, 2, 3, \dots$ do
10:: Apply spectrum analysis to extract vibration features including $A (k)$ and $w (k)$ from historical vibration signals.
11:: Calculate the control policies based on the EM method in Algorithm 1.
12:: Obtain the control input $u (k)$ from the control policy distributions and compensation distributions, followed by sending it to the motor.
13:: Select the compensation distributions for the output of the identification model in the future N steps.
14:: end for

4. Experiment

4.1. Experimental System and Identification Result

In this section, we built an inertial platform with a stability system to verify the efficacy of the data-driven MPC method. Furthermore, two gyroscopes are installed on the platform and base, where the base gyroscope measures the relative angular velocity of the external vibration to the platform, and the platform gyroscope measures the angular velocity of the platform. Therefore, the sum of the gyroscope’s outputs can describe the external vibration. The upper and lower bounds of the PWM values are given by 40 and 60, respectively. Since then, an external vibration signal

V_{r, 0} (t) = A_{0} sin (2 π w_{0} t)

is applied to the base, where the amplitude and vibration frequency are given by

A_{0} = 8^{'}

and

w_{0} = 4.8

Hz, respectively. The model and internal information of the inertial platform and brushless motor are unknown. Only the outputs of two gyroscopes are applied as feedback for the stability control of the inertial platform. To construct the dataset for data-driven MPC design, a PID-based controller in the stabilization loop regulates the motor against the external vibration in Figure 2. The attenuation factor

γ

of the external vibration to the platform is assumed to be

0.3

, and thus the platform angle is the orange line in Figure 2a when the motor stops rotating. Then, we collect PWM values and the outputs of two gyroscopes to identify the uncertain system by the LightGBM method. For the parameters in LightGBM, the estimator number is 64, the maximum depth for the tree model is 24, the learning rate is 0.05, and the maximum number of leaves in one tree is 256. The experimental environment is a PC equipped with an Intel i5 CPU, 16 GB RAM, and the 64-b Windows 11 operating system.

The PWM value of the motor, the corresponding angular velocity of the platform, and the identification results are shown in Figure 3. In addition, the mean squared error of the system identification algorithm is

3.85 \times 10^{- 5}

, which verifies the efficacy of the proposed method. Particularly, because of internal disturbances and controller limitations, PID cannot completely eliminate the impact of external vibrations on the platform. Therefore, the PID controller is converted to the proposed data-driven MPC for compensating external vibrations.

4.2. Numerical Example

In the numerical example, the control interval is given by

δ

= 0.02 s, and the prediction horizon is

N = 6

. The weight parameters are defined as

Q = 10

and

P = 0.2

. For the EM method, the maximum update number and sample number are given by 5 and 50, and the tuning parameters are

α = 0.4

and

ϕ = 0.02

. The external vibration signal is

V_{r, 1} (t) = A_{1} sin (2 π w_{1} t)

, where

A_{1} = 6^{'}

, and

w_{1} = 4.8

Hz. According to the identification system, the EM-based MPC is utilized to calculate the optimal control policy by maximizing the likelihood function.

Figure 4 shows the angular trajectory of the platform. The platform angle converges into a set close to 0, verifying the efficacy of the stabilization loop system. Figure 4 shows the optimization progress of the Gaussian policy in the EM method, which has fixed initial distributions (

μ = 50, Σ = 50

). Compared with the PID controller in Figure 2, the data-driven MPC scheme reduces the absolute average value of the inertial platform angle from

{0.6907}^{'}

to

{0.0155}^{'}

, which confirms the control efficacy of the proposed method. By setting the maximum update number and sample number as 5 and 50, the optimization time of the proposed method is

0.01

s. It is worth noting that the complexity of the proposed method will increase as the number of samples increases. Therefore, we make a trade-off between the convergence speed and algorithm complexity to satisfy the control requirement of the inertial platform. To test the impact of different parameters on search velocity, we conducted three experiments through the control variable method. The results are shown in Figure 5. For the sample number experiment, the update number, the tuning parameter

ϕ

and the elite number are set as 10,

0.020

and 20, respectively. For the elite number experiment, the sample number and the tuning parameter

ϕ

are set as 50 and

0.020

, respectively. Furthermore, for the tuning parameter

ϕ

experiment, the sample number and the elite number are set as 50 and 40, respectively. The learning is data-efficient and stable as the policy converges after five iterations. The convergence velocity of the algorithm will be affected by selecting different tuning parameters. Moreover, increasing the number of samples can improve the convergence velocity of the algorithm, and a high elite parameter indicates that more samples are weighted. Many bad samples with high cost values may reduce the convergence velocity. In addition, a large tuning parameter

ϕ

has rapid convergence because many samples with lower costs have greater weights.

4.3. Generalization Experiment

The disturbance in the inertial platform is related to the external vibration signals, which in turn affects the identification model accuracy and the control efficacy of MPC. For generalizing the proposed control scheme to different external vibrations, the mapping relationship between the compensation distributions and the vibration features is obtained by the polynomial regression method in (12). For different vibration signals, we use power spectrum analysis to obtain their amplitudes and frequencies in real time as signal features, followed by calculating the compensation distributions for different vibration signals. By virtue of the K-means algorithm, the PWM values in the dataset are divided into 20 categories such that

N_{e} = 20

. As shown in Figure 6, different PWM values have different compensation values, so the error of the identification model can be compensated correspondingly. To verify the efficacy of the proposed method, we utilize the identification model for the vibration signal

V_{r, 2} (t) = A_{2} sin (2 π w_{2} t)

, where

A_{2} = 5^{'}

and

w_{2} = 3.2

Hz. Due to internal disturbances, the model based on the identification model of the vibration signal

A_{0} sin (2 π w_{0} t)

cannot describe the mapping relationship of the new vibration signal, deteriorating the control efficacy of the proposed scheme. Since then, we use the compensation strategy to modify the identification model. In Figure 6, the platform angle of the proposed method is closer to 0. Experimental results show that the compensation strategy can enhance the ability of the platform to resist different vibrations.

Therefore, when the external vibration signal changes, the identification model is still able to describe the mapping relationship between the PWM values and platform angular velocities, guaranteeing system stability within the inertial space. It is worth noting that compensation strategies ensure the control performance under the same type of external vibrations. For vibration signals that are different from the vibration signals of the dataset utilized to train the identification model, the proposed method fails to solve the control problem due to the huge differences in the distribution of model deviations and internal disturbances. To handle this issue, a signal classification algorithm can be applied to judge the external vibration type, followed by selecting an appropriate identification model for the stabilization loop. Finally, the proposed compensation strategy is utilized to improve the control performance when the amplitude and frequency of the vibration signal change.

5. Conclusions

In this paper, we have developed a novel data-driven MPC algorithm with an EM optimization method to control the inertial platform with an unknown model against external vibrations. For system uncertainties, the LightGBM is utilized to identify the combined system model. The proposed scheme can solve non-differentiable and non-convex optimization problems, thereby expanding the applications of the MPC controller. In addition, we design the dual optimization problem for the EM method to handle system constraints when searching for the optimal control policy. Furthermore, the generalization ability is guaranteed by the Gaussian distribution-based compensation strategy. From the experiment, it can be observed that the platform angle maintains a small value, which demonstrates the higher efficiency and enhanced performance of the data-driven MPC scheme compared with the PID controller. The application objective of this paper is a single-axis inertial platform. As the control input dimension increases, the computational complexity of searching for the optimal control law increases. Our future research will explore the utilization of the EM algorithm to address the challenge of optimizing control with high-dimensional inputs while making a trade-off between optimization time and control performance.

Author Contributions

Conceptualization, methodology, and software, J.Z. and Q.Y.; validation, formal analysis, investigation, and resources, J.Z. and H.L.; writing—original draft preparation, writing—review and editing, and visualization, Q.Y.; supervision, project administration, and funding acquisition, J.Z. and H.L. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Major Basic Research Program for Equipment under Grant 909010207-302-1.

Data Availability Statement

Data are unavailable due to privacy restrictions.

Acknowledgments

The authors would like to thank the Associate Editor and anonymous reviewers for their constructive suggestions that improved this paper.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

MPC	Model predictive control
PWM	Pulse width modulation
LightGBM	Light gradient boosting machine
EM	Expectation maximization

References

Zeinali, B.; Zanddizari, H.; Chang, M.J. Imunet: Efficient regression architecture for inertial IMU navigation and positioning. IEEE Trans. Instrum. Meas. 2024, 73, 1–13. [Google Scholar] [CrossRef]
Junhui, X.; Zhicheng, Y.; Donghui, X.; Bin, S. A pre-launch integrated calibration method of platform inertial navigation system. Aerosp. Control 2022, 40, 72–78. [Google Scholar]
Dey, S.; Kumar, T.K.S.; Ashok, S.; Shome, S.K. Design of adaptive fuzzy pid control framework for 3-axis platform stabilization. In Proceedings of the 2023 International Conference on Power, Instrumentation, Energy and Control (PIECON), Aligarh, India, 10–12 February 2023; pp. 1–6. [Google Scholar] [CrossRef]
Liu, H.; Hu, S.; Cai, Q.; Zhang, Y.; Zhang, K. Backstepping terminal sliding mode control of fog inertial platform stability loop based on extended state observer. In Proceedings of the 2022 7th International Conference on Control, Robotics and Cybernetics (CRC), Zhanjiang, China, 15–17 December 2022; pp. 29–34. [Google Scholar] [CrossRef]
Zhao, M.; Li, H. Distributed model predictive contouring control of unmanned surface vessels. IEEE Trans. Ind. Electron. 2024, 71, 13012–13019. [Google Scholar] [CrossRef]
Yang, Q.; Li, H. RMPC-based visual servoing for trajectory tracking of quadrotor UAVs with visibility constraints. IEEE/CAA J. Autom. Sin. 2024, 11, 2027–2029. [Google Scholar] [CrossRef]
Sun, Z.; Dai, L.; Liu, K.; Xia, Y.; Johansson, K.H. Robust MPC for tracking constrained unicycle robots with additive disturbances. Automatica 2018, 90, 172–184. [Google Scholar] [CrossRef]
Wen, G.; Lam, J.; Fu, J.; Wang, S. Distributed MPC-based robust collision avoidance formation navigation of constrained multiple USVs. IEEE Trans. Intell. Veh. 2023, 9, 1804–1816. [Google Scholar] [CrossRef]
Fleming, J.; Kouvaritakis, B.; Cannon, M. Robust tube MPC for linear systems with multiplicative uncertainty. IEEE Trans. Autom. Control 2015, 60, 1087–1092. [Google Scholar] [CrossRef]
Zhou, J.; Tian, D.; Sheng, Z.; Duan, X.; Qu, G.; Zhao, D.; Cao, D.; Shen, X. Robust min-max model predictive vehicle platooning with causal disturbance feedback. IEEE Trans. Intell. Transp. Syst. 2022, 23, 15878–15897. [Google Scholar] [CrossRef]
Rakovic, S.V.; Kouvaritakis, B.; Cannon, M.; Panos, C.; Findeisen, R. Parameterized tube model predictive control. IEEE Trans. Autom. 2012, 57, 2746–2761. [Google Scholar] [CrossRef]
Mayne, D.Q.; Seron, M.M.; Raković, S.V. Robust model predictive control of constrained linear systems with bounded disturbances. Automatica 2005, 41, 219–224. [Google Scholar] [CrossRef]
Zhang, K.; Sun, Q.; Shi, Y. Trajectory tracking control of autonomous ground vehicles using adaptive learning MPC. IEEE Trans. Neural Netw. Learn. Syst. 2021, 32, 5554–5564. [Google Scholar] [CrossRef]
Pin, G.; Parisini, T. Networked predictive control of uncertain constrained nonlinear systems: Recursive feasibility and input-to-state stability analysis. IEEE Trans. Autom. Control 2011, 56, 72–87. [Google Scholar] [CrossRef]
Lorenzen, M.; Cannon, M.; Allgöwer, F. Robust MPC with Recursive Model Update. Automatica 2019, 103, 461–471. [Google Scholar] [CrossRef]
Wang, R.; Li, H.; Liang, B.; Shi, Y.; Xu, D. Policy learning for nonlinear model predictive control with application to USVs. IEEE Trans. Ind. 2024, 71, 4089–4097. [Google Scholar] [CrossRef]
Schubnel, B.; Carrillo, R.E.; Alet, P.-J.; Hutter, A. A hybrid learning method for system identification and optimal control. IEEE Trans. Neural Netw. Learn. Syst. 2021, 32, 4096–4110. [Google Scholar] [CrossRef] [PubMed]
Ke, G.; Meng, Q.; Finley, T.; Wang, T.; Chen, W.; Ma, W.; Ye, Q.; Liu, T.-Y. Lightgbm: A highly efficient gradient boosting decision tree. Adv. Neural Inf. Process. Syst. 2017, 30, 3149–3157. [Google Scholar]
Liang, H.; Li, H.; Shi, Y.; Constantinescu, D.; Xu, D. Energy-efficient integrated motion planning and control for unmanned surface vessels. IEEE Trans. Control Syst. Technol. 2023, 32, 250–257. [Google Scholar] [CrossRef]
Song, Y.; Scaramuzza, D. Policy search for model predictive control with application to agile drone flight. IEEE Trans. Robot. 2022, 38, 2114–2130. [Google Scholar] [CrossRef]

Figure 1. Stabilization loop of the inertial platform.

Figure 2. Inertial platform information.

Figure 3. Identification result.

Figure 4. Control performance of the proposed method.

Figure 5. Learning curves of the control policy.

Figure 6. Compensation distribution and generalization capability.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Zhao, J.; Yang, Q.; Li, H. Data-Driven MPC Scheme for Inertial Platform with Uncertain Systems Against External Vibrations. Electronics 2024, 13, 4945. https://doi.org/10.3390/electronics13244945

AMA Style

Zhao J, Yang Q, Li H. Data-Driven MPC Scheme for Inertial Platform with Uncertain Systems Against External Vibrations. Electronics. 2024; 13(24):4945. https://doi.org/10.3390/electronics13244945

Chicago/Turabian Style

Zhao, Junhu, Qifan Yang, and Huiping Li. 2024. "Data-Driven MPC Scheme for Inertial Platform with Uncertain Systems Against External Vibrations" Electronics 13, no. 24: 4945. https://doi.org/10.3390/electronics13244945

APA Style

Zhao, J., Yang, Q., & Li, H. (2024). Data-Driven MPC Scheme for Inertial Platform with Uncertain Systems Against External Vibrations. Electronics, 13(24), 4945. https://doi.org/10.3390/electronics13244945

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Data-Driven MPC Scheme for Inertial Platform with Uncertain Systems Against External Vibrations

Abstract

1. Introduction

2. Problem Formulation

3. Data-Driven MPC Framework

3.1. Foundation of the EM-Based MPC

3.2. Dual Optimization Algorithm

3.3. Implementation and Generalization Strategy

4. Experiment

4.1. Experimental System and Identification Result

4.2. Numerical Example

4.3. Generalization Experiment

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI