1. Introduction
With significant advancements in computational performance in recent years, numerous studies have reported the application of Model Predictive Control (MPC) to aircraft systems [
1,
2,
3,
4,
5,
6,
7,
8,
9,
10,
11,
12,
13,
14,
15]. MPC enables the determination of optimal control inputs based on an objective function using an ideal dynamic model of the control target. It explicitly handles constraints for both system state variables and actuators. These advantages make MPC well suited for an aircraft control system, as it can enable fuel-efficient flight and a reduction in arrival time under various constraints, such as flight speed and altitude. Additionally, applications have been reported for scenarios such as the landing [
7] of Unmanned Aerial Vehicles (UAVs) and formation flights [
8].
Monte Carlo Model Predictive Control (MCMPC) is a type of MPC that derives control inputs from the evaluation function using the Monte Carlo method [
16,
17,
18,
19,
20]. Unlike general solution methods for non-linear MPC, such as the C/GMRES method [
21], MCMPC does not require the evaluation function to be continuous. This characteristic makes MCMPC suitable for systems that involve discontinuous phenomena, such as collisions. A previous study reported that applying MCMPC to the control system of a quadrotor demonstrated superior control performance compared to conventional MPC in scenarios involving collisions with walls [
18]. However, like conventional MPC, MCMPC predicts the system’s response to control inputs based on an internal model, leading to concerns about control performance degradation in environments with disturbances, such as modeling errors or unknown forces. One approach to mitigate the influence of these unknown disturbances is to predict the system’s response by introducing random perturbations into the internal model parameters. This method allows for the determination of effective control inputs in models with disturbances, but the computational load increases dramatically, making real-time implementation challenging even with high-performance computing resources.
In response to these challenges, this study proposes a novel Robust Monte Carlo Model Predictive Control (RMCMPC) that improves the robustness of MCMPC by constraining the state variables of control objects to a hypersurface in the phase space. In Sliding Mode Control (SMC), it is well known that constraining the state variables to a hyperplane suppresses the effects of disturbances that satisfy the matching condition [
22,
23,
24,
25,
26,
27,
28]. By applying this theory to MCMPC, we aim to mitigate the degradation of controller performance under the influence of disturbances, such as unknown forces and modeling errors that satisfy the matching condition. This method introduces an evaluation function that accounts for the convergence of state variables to a hypersurface. Compared to traditional MCMPC, the increase in computational load is limited to calculating predicted values for the switching function that ensures the convergence of state variables to the hypersurface, thus enhancing robust performance without increasing theoretical complexity. In other words, this approach can be easily applied to issues such as flight control systems designed for the attitude stabilization of quadrotors during collisions, as investigated in previous studies. We expect that this approach will contribute substantially to research fields focused on these types of control challenges.
The control performance of the proposed RMCMPC under the influence of disturbances is validated through numerical simulations. In these simulations, the controller targets the non-linear rotational dynamics of a quadrotor, applying both the proposed RMCMPC and the conventional MCMPC. For each controller, input disturbances satisfy matching conditions, including constant values, periodic disturbances with trigonometric functions, and square wave disturbances. Furthermore, the performance is compared when the control target values are constant and when the target values change over time to evaluate the control performance under disturbance conditions. Control performance is assessed using three indices, Integral Square Error (ISE), Integral Time Absolute Error (ITAE), and Integral Time Absolute Error (IAE), based on the difference between the target value and the state variables.
2. Problem Statement and Disturbance Suppression Method
The proposed Robust Monte Carlo Model Predictive Control (RMCMPC) method focuses on affine systems, described by the following state equation:
where
is the input vector, and
is the state vector.
is the dimension of the input,
is the dimension of the state variables, and
denotes the set of real numbers. In practical applications, systems often face problems due to uncertainties such as modeling errors, parameter variations, and disturbances. To account for these uncertainties, the system described by Equation (1) is reformulated as follows:
In this equation,
represents external forces such as disturbances, which are assumed to vary based on the state variables and time
. It is assumed that
This assumption implies that
resides within the range space of the input matrix
, meaning that disturbances act through the same channels as the inputs. Therefore, the external force
, due to disturbances, can be expressed as follows using the input vector
:
In other words, the disturbances in the control target system are assumed to satisfy the matching condition.
It is well known in SMC theory that by constraining the system’s state variables to a hypersurface in the phase space, SMC can achieve high robustness against disturbances that satisfy the matching condition. One such type of SMC is the Integral Sliding Mode Control (ISMC) [
26], proposed by Utkin [
24,
25]. Unlike conventional SMC, ISMC determines the control input based on two types of inputs with different purposes, as shown below:
where
is the input that determines the response based on an ideal system model without disturbances, and
is the input that constrains the system’s state variables to the hypersurface.
The switching function
is defined as follows [
26]:
The first term
is the coefficient matrix of the switching hypersurface and can be designed similarly to conventional SMC to determine the motion characteristics of the system when constrained to the hypersurface. However, unlike conventional SMC, the hypersurface in ISMC determines the behavior of the error state variables, which are the differences between the state variables of the ideal model and the practical system, including disturbances. The second term
is defined by the following equation [
26]:
The initial value of
is given by
where
. Considering the system where the state variables are constrained to the hypersurface using the defined input and switching function, the switching function becomes 0 when the state is constrained to the hypersurface, and its time derivative also becomes 0. Thus, the switching function can be expressed as
Therefore, with the state variables constrained to the hypersurface, the switching function defined in Equation (6) can be rewritten as
By substituting the system with uncertainty expressed in Equation (2) and the control input defined in Equation (5) into the above equation, we obtain
Simplifying the above equation gives
This indicates that when the state variables are constrained to the hypersurface, the input
cancels out the external input
caused by disturbances. Substituting the result obtained from Equation (12) back into Equation (2) yields
This indicates that when the state variables are constrained to the hypersurface, the system’s dynamic characteristics are determined solely by the input without being affected by disturbances. This characteristic gives ISMC superior robustness against uncertainties, such as modeling errors and disturbances, which satisfy the matching condition. However, ISMC and SMC face the issue of chattering, which occurs due to the frequent switching of inputs near the hypersurface due to sign functions being used to constrain the state variables. Ideally, the switching frequency of the control input would be infinite when sign functions are employed. However, achieving such ideal behavior in practical systems is impossible due to actuator limitations. To address this, methods utilizing sigmoid or saturation functions have been proposed. Nevertheless, these approaches involve a trade-off between chattering suppression and system robustness. The RMCMPC proposed in this paper aims to improve robustness by constraining the state variables to a hypersurface, similar to ISMC. Unlike SMC or ISMC, the proposed RMCMPC does not use sign functions to calculate the input and thus has no chattering problem. ISMC also requires the use of a separate controller, such as PID control, to determine the response to an ideal motion model, whereas RMPC can determine the optimal value of the input for an ideal system based on an evaluation function.
3. Robust Monte Carlo Model Predictive Control
This section details the calculation method for control inputs in RMCMPC. MCMPC is a sample-based MPC approach in which multiple input samples are used to predict responses using an internal model and the optimal input is selected based on these predictions. In conventional MCMPC, the optimal input is determined solely based on the difference between the target trajectory and the state variables. However, in the proposed RMCMPC, in addition to computing the optimal input for the target trajectory, another input is simultaneously determined to constrain the state variables on the switching hypersurface by evaluating the switching function.
First, define
as the prediction horizon of the RMCMPC at a given time step
. Prepare
samples of two types of input
and
for each time step across
prediction steps. Let
represent a sample index; the vectors
and
representing the control inputs
and
for the
-th sample over
prediction steps are shown in the following equations.
Also, define the optimal input vectors
and
as those obtained by RMCMPC at time step
.
The control input vectors for the
-th sample,
and
, are determined based on a normal distribution with standard deviation
as follows:
where
denotes a normal distribution with mean
and variance
. The mean input vectors
and
are set using the optimal input vectors computed in the previous time step:
Since the inputs for the final step of the current prediction horizon cannot be obtained from the previous time step, the input for the final prediction step is set to be equal to the inputs of the prediction step one step before the final prediction step. Additionally, the initial values of the mean input vectors
and
are set to
represents an
zero matrix. The dynamical model in Equation (1) is in continuous time, while RMCMPC requires state variables at each time step in discrete time. Therefore, the state variables at each time step are computed by integrating Equation (1) over an interval of sampling time
, with the input as a zero-order hold. From the state variables at time step
and the input samples, the state variables at the next time step
are computed as follows:
where
is the state variable of the system when the
-th sample’s control input
is applied. Various numerical methods can be used for integration in the second term, such as the Euler method and Simpson’s rule. In this study, a fourth-order Runge–Kutta method is employed. At time
, i.e., at time step
,
is assigned the current state
of the system under consideration, which is assumed to be obtained from a sensor or an observer. That is, at each time step
, the response to the sample input is predicted from the current state variables. Next, using the predicted state variables, the evaluation function
is calculated using the following equations.
where
is the target value of the state variables, and
and
are weight matrices used to adjust the convergence of the state variables and the magnitude of the control inputs, respectively. The weight matrices
and
are diagonal matrices, and by adjusting the magnitude of their diagonal elements, it is possible to control both the speed of convergence of the state variables and the magnitude of the control input. For example, increasing the diagonal elements of
raises the sensitivity of the evaluation function to errors in state variables, thereby accelerating state convergence. Similarly, by setting larger diagonal elements for
, the sensitivity of the evaluation function to the input is increased, which enables the suppression of the controller’s input. The values of these weight matrices are fine-tuned through numerical simulations, during which the system response is monitored and adjusted accordingly. The input samples
are ranked based on their evaluation function values, and the top
samples
are selected. The optimal control input vector
is then determined using the following weighted average:
is a normalization factor for the evaluation function
, calculated as
The evaluation function for determining
is defined as
The weight matrices
and
included in the evaluation function are used to adjust the convergence properties of the switching function and the magnitude of the control inputs, respectively. The matrices
and
are also diagonal matrices. Adjusting the magnitude of their diagonal components allows for control over the speed of convergence to the hyperplane and the input magnitude in the same way as
and
. The switching function value
can be calculated using Equation (6) as
where
represents the predicted state variable when the optimal control input vector
obtained from Equation (25) and the input sample
are applied. This predicted state variable is calculated as follows:
Similarly to Equation (23), the initial sample
for the state variable at
is set to the current state variable
of the control object. In Equation (28),
indicates the predicted state variable corresponding to the optimal control input vector
obtained from Equation (25).
At the time step
, the state variable
is determined using the input
from the previous control step as
Additionally, the state variable
is set to
at the initial time, allowing the state variable to be constrained on the hypersurface from the beginning of the control. In conventional SMC, the system’s robustness decreases in the reaching mode, where the state variables move toward the sliding surface, as external disturbances cannot be effectively suppressed. However, in RMCMPC, the reaching mode can be avoided, or its duration can be shortened, by appropriately setting the initial state
. The numerical integration for Equations (29) to (31) is performed using a fourth-order Runge–Kutta method, similar to Equation (23). Using the predicted state variables and the evaluation function obtained above, the input samples
are ranked based on their evaluation function values, and the top
samples
are selected. The optimal control input vector
is then determined using the following weighted average:
Finally, the optimal control input vector
and
calculated above are used to determine the RMCMPC input at time step
:
Based on the above calculations, the control input for RMCMPC is determined.
Table 1 provides an overview of the general procedure for calculating the control input.
4. Rotational Dynamics of a Quadrotor
This section describes the rotational dynamics of the quadrotor used in this study. The coordinate definitions, state variables, and input definitions for the quadrotor are shown in
Figure 1.
The body-fixed coordinate system of the quadrotor is defined by its origin
at the center of mass of the vehicle, where the
-axis points forward and the
-axis points downward. The angular velocities of each axis of the body-fixed coordinate system are denoted as
,
, and
, and the moments generated around each axis by the four rotors are denoted as
,
, and
. In practice, to achieve the desired moments around each axis, the input moments are decoupled to determine the rotational speeds of each rotor. However, for the sake of simplification, the motion model used in this study treats the moments directly as inputs. The attitude of the quadrotor is expressed using Euler angles in the
sequence, where the roll, pitch, and yaw angles are denoted as
,
, and
, respectively. Based on the above definitions, the state vector is defined as
. The rotational dynamics of the quadrotor are given by the following equations:
where
,
, and
are the moments of inertia around the
,
, and
axes of the body-fixed coordinate system, respectively. In addition, considering the time delay in the moments generated by the thrust forces of each rotor, the moment input commands
,
,
and the corresponding input moments
,
,
are assumed to have the following time-delay relationship:
where
is the time constant of the actuator. In the numerical simulations presented in the next section, the control inputs
are determined by the controller for the system represented by Equations (34) to (42).
5. Numerical Simulation
In this section, the proposed RMCMPC and the conventional MCMPC are applied to the control system for the rotational dynamics of the quadrotor, as described in
Section 4, to evaluate their control performance. RMCMPC and MCMPC, like standard MPC, can incorporate input constraints and state variable constraints. For example, they can be implemented by including an expression in the evaluation function that penalizes states deviating from the constraints. However, since the focus of this study is on improving the robustness of MCMPC and ensuring that the effectiveness of the proposed method is clearly demonstrated, no constraints are imposed in this simulation.
Table 2 summarizes the simulation settings, including the parameters of each controller and those of the quadrotor. In the table,
and
denote the
identity matrix and the
zero matrix, respectively. The target quadrotor in this study is modeled with a mass
of 1 kg and a distance
of 0.3 m from the center of mass to each rotor, from which the moment of inertia is derived. To ensure fair comparison and avoid parameter bias between controllers, common parameters such as the weight matrices
and
, prediction horizon
, input sample count
, and selected sample count
are set to the same values for both controllers. Since these parameters need to be adjusted according to the characteristics of the vehicle, they were fine-tuned by verifying the control response through numerical simulations against the specified vehicle parameters. When the vehicle’s structure affects its inertia, the time response speed also changes. Therefore, it is essential to configure the controller’s sampling time and prediction horizon to match the desired response rate. For instance, in the case of a smaller vehicle with a small moment of inertia, a faster response is anticipated, suggesting the need for a shorter sampling time than that used in this simulation. Additionally, when the moment of inertia is small, the sensitivity to input increases, which may lead to excessively large control inputs. Therefore, it is anticipated that adjustments to increase the values in the weight matrices
and
for control inputs will be necessary to mitigate this effect.
In this study, the performance of RMCMPC and MCMPC is compared under eight different initial conditions and disturbance types, as shown in
Table 3. The disturbance types listed in
Table 3 include a constant disturbance
, a periodic disturbance
, and a square wave disturbance
, which are defined by Equations (43), (44), and (45), respectively.
where
represents the sign function;
,
, and
are the magnitudes of the disturbances along each axis; and
is the angular frequency of the disturbance. The target value
of the time-varying state variable is expressed as
,
, and
are the target Euler angles for roll, pitch, and yaw, respectively.
,
, and
are the amplitudes of the target angle variations, and
is the angular frequency of the target angles. In conditions 1 to 4, the initial Euler angles
,
, and
are set to 40°, −30°, and 20°, respectively. In conditions 5 to 8, they are all set to 0°. The initial angular velocities are set to 0°/s in all conditions. Under the simulation conditions described above, the performance of each controller is evaluated using three performance indices: ISE (Integral Squared Error), ITAE (Integral Time Absolute Error), and IAE (Integral Absolute Error). The equations for these indices are shown below.
In these indices, is the end time of the numerical simulation, and represents the error between the target Euler angle and the attitude angle of the quadrotor.
Figure 2 and
Figure 3 show the time histories of the state variables of the quadrotor for conditions 1 to 4 and conditions 5 to 8, respectively. In
Figure 2, the target attitude angles for the quadrotor are set to 0°, and the response is shown for initial Euler angles, where
is 40°,
is −30°, and
is 20°. In the figure, the black dashed line represents the target attitude angle (0°), while the solid red, green, and blue lines correspond to the roll angle
, pitch angle
, and yaw angle
of the quadrotor, respectively.
Figure 3 illustrates the response when the initial Euler angles
,
, and
are all set to 0° and the target values vary over time according to Equation (46). The solid line definitions in
Figure 3 follow those in
Figure 2, where the red, green, and blue dashed lines indicate the time-varying target attitude angles. First, in conditions without disturbances, shown in
Figure 2a,b and
Figure 3a,b (conditions 1 and 5), it can be confirmed that both the MCMPC and the proposed RMCMPC controllers converge to the target attitude angles whether the target Euler angle is fixed at 0° or varied over time. In fact, there is no significant difference when comparing the three indices shown in
Table 4. On the other hand, in the disturbed conditions where the target Euler angle is fixed at 0° (conditions 2 to 4), the time responses shown in
Figure 2c–h indicate that while the proposed RMCMPC converges to the target Euler angle, the conventional MCMPC exhibits a steady-state error and oscillations around the target Euler angle. Thus, it can be concluded that the conventional MCMPC is affected by constant disturbances, periodic disturbances, and square wave disturbances, whereas the proposed RMCMPC demonstrates robustness against such disturbances. Furthermore, in the conditions with disturbances where the target Euler angle is varied over time (conditions 5 to 8), the time responses shown in
Figure 3c–h illustrate that the proposed RMCMPC closely follows the target attitude angle. In contrast, it is evident that the traditional MCMPC does not adequately track the target value represented by the dotted line in the figures. Additionally, the indices presented in
Table 4 indicate that the RMCMPC consistently exhibits smaller errors compared to the MCMPC across all disturbance conditions.
Figure 4 and
Figure 5 show the time histories of control inputs for each controller under conditions 1 through 4 and conditions 5 through 8, respectively. In
Figure 4 and
Figure 5, the solid red, green, and blue lines represent the moments
,
, and
generated around the body axes, respectively. For each controller, the maximum control input remains below 0.2 Nm, demonstrating that the proposed RMCMPC achieves effective control without significant increases in input compared to the MCMPC.