1. Introduction
In the field of industrial control, it is essential to characterize process dynamics to properly design and tune closed-loop controllers. For this purpose, simplified mathematical models are used to capture key aspects of the system without introducing unnecessary complexity [
1]. An appropriate reduced-order model must meet two key requirements: (a) it must be easily identifiable from experimental data, and (b) it must accurately represent the process behavior near its operating point [
2].
Within this context, the proportional-integral-derivative (PID) controller remains the predominant choice in industrial settings due to its effectiveness and ease of implementation [
3]. For tuning purposes, reduced-order models are commonly used, with the first-order plus dead-time (FOPDT) model being the most widely employed in industrial settings. Model accuracy significantly impacts control loop performance, and previous studies have shown that using more representative models can improve PID tuning rules, leading to optimized system responses [
4].
In recent years, advancements in fractional-order calculus and computational methods have facilitated the development of sophisticated identification techniques for modeling processes governed by fractional differential equations [
5]. Fractional-order systems extend classical integer-order models by incorporating derivatives and integrals of non-integer orders. This generalization offers greater modeling flexibility and more accurate representations of real-world dynamic behaviors [
6]. One of the main advantages of these models is their ability to more accurately capture memory effects and hereditary dynamics, which are prevalent in many physical systems, including thermal, electrical, mechanical, and biological domains. Fractional derivatives enable the modeling of long-term memory effects and non-local behavior, providing a more accurate representation of real-world dynamics compared to integer-order models. Additionally, fractional models offer greater flexibility in parameter tuning based on experimental data, allowing for a more precise characterization of processes whose dynamic responses cannot be adequately captured by traditional models.
As a result, the use of fractional models in industrial environments has been increasing. For a comprehensive overview of fractional-order systems in automatic control, see [
7]. The FOPDT model is widely used in practice for characterizing process dynamics and designing control systems [
3]. However, given the enhanced capability of fractional models to capture complex dynamics, the fractional first-order plus dead-time (FFOPDT) model extends the traditional FOPDT framework [
8], offering a better balance between simplicity and accuracy in system identification.
An increasing number of methods have been developed for estimating fractional-order models based on the reaction curve. These techniques require only a step test to obtain the reaction curve and demand minimal system information, making them widely applicable in industrial settings.
Broadly, these methods generally fall into two categories: analytical and optimization-based approaches. Among analytical techniques, Tavakoli-Kakhki’s pioneering work in [
9] introduced analytical techniques for estimating FFOPDT model parameters directly from step response data.
Recent efforts have focused on adapting classical analytical identification methods for fractional-order processes. In particular, Ref. [
10] examines Sundaresan’s technique for modeling fractional-order systems, highlighting a limitation related to the convergence of the response derivative in the frequency domain. To address this issue, a corrective equation and a simplified formulation that avoids the inversion of the Mittag-Leffler function are proposed.
Within this context, Gude and García Bringas introduced an identification procedure for FFOPDT models based on three arbitrary points of the process reaction curve
-
-
[
11]. Building on this work, Ref. [
12] introduced a simplified approach by exploiting the symmetrical distribution of three key points on the reaction curve
-50-
. Both techniques are designed for overdamped processes evaluated in an open-loop system subjected to a step input. Furthermore, new analytical techniques have been introduced to enhance the accuracy of fractional model estimation. In [
13], a method is presented that estimates the fractional order of the model by leveraging the asymptotic behavior of the Mittag-Leffler function, while determining the parameters in the time domain using two arbitrary points,
and
, from the process reaction curve. It has also been demonstrated that the accuracy of the identified model is influenced by the placement of these points.
In fractional-order model identification, nonlinear optimization is the most commonly used approach in industrial applications. This method seeks to minimize the discrepancy between the step response of the estimated model and the process reaction curve. Several techniques have been proposed to achieve this. For instance, Ref. [
14] used curve fitting and analytical solutions in the time domain, incorporating the Mittag-Leffler function to identify fractional-order models. Similarly, Ref. [
15] applied integral equations for model identification from step test data. Additionally, Ref. [
16] utilized both the Mittag-Leffler and Grünwald–Letnikov methods to identify FFOPDT models based on step response measurements.
Another relevant approach uses the CRONE methodology, which was applied in [
17] for fractional-order model identification. This method has also been used in the design of fractional-order PI and PID controllers, as discussed in [
18].
Analytical identification methods are known for their simplicity and low computational cost, whereas optimization-based techniques are recognized for offering greater accuracy at the expense of higher computational demands. In the technical literature, there is an ongoing debate on this topic, highlighting the need to find a balance between accuracy, robustness, and computational efficiency. Expanding on this approach, the distinctiveness of the hybrid method presented in [
19] lies in its combination of single-variable optimization to estimate the fractional order
with analytical expressions for determining the remaining FFOPDT model parameters.
Within this context, reduced-order models simplify complex dynamic systems while preserving key behaviors, making them highly valuable for real-time applications and resource-constrained edge devices. Their main advantages include the following:
Computational efficiency: They simplify system representation for faster calculations.
Lower memory and processing requirements: This makes them suitable for embedded controllers and IoT devices.
Faster response times: They help ensure that control algorithms meet strict timing constraints.
Improved controller design and model interpretability: By highlighting essential dynamics, they facilitate controller development and enhance model understanding.
Reduced power consumption: This is especially crucial for battery-powered systems.
Due to these advantages, reduced-order modeling is widely used in industrial control.
Traditional analytical methods for fractional-order system identification often rely on approximations that, while computationally efficient, can lead to reduced accuracy when capturing complex system dynamics. Optimization-based approaches improve accuracy but are computationally expensive, making them unsuitable for real-time deployment in edge computing scenarios. The proposed hybrid framework leverages PSO to refine parameter estimation with high fidelity while employing AI-based inference for rapid, deterministic execution. By combining these techniques, the framework achieves a balance between precision and computational efficiency, outperforming purely analytical methods in accuracy and surpassing traditional optimization techniques in execution speed. This dual benefit enables real-time identification and control applications on resource-constrained devices, an aspect that is critical for industrial automation and embedded system implementations.
The main objective of this study is to derive a reduced-order fractional model from the dynamic response of a high-order system using an innovative combination of PSO optimization and AI-based algorithms. By leveraging information from the process reaction curve obtained through a simple step test, the proposed method enhances estimation accuracy, computational efficiency, and model generalization. Specifically designed for overdamped step responses, it integrates PSO with AI techniques to improve parameter estimation for the FFOPDT model. Its effectiveness is demonstrated through illustrative examples, highlighting advantages over analytical and optimization-based methods. Finally, the identification algorithm is implemented on microprocessor-based hardware, validating its applicability to real-world thermal process identification in experimental prototypes.
The key contributions of this work can be summarized as follows:
A hybrid identification framework that combines PSO-based optimization with AI-driven inference to derive reduced-order models from high-order fractional dynamic systems.
An innovative edge computing integration that enables real-time deployment of the identification framework, ensuring low latency and efficient resource usage.
A comprehensive experimental validation using a custom-built heating system, demonstrating the proposed method’s effectiveness in accurately capturing the dynamics of overdamped systems and enabling real-time control on edge devices.
A detailed comparative analysis of PSO and AI-based approaches, highlighting the trade-offs between accuracy and computational efficiency, and providing insights for deploying these methods in resource-constrained environments.
The remainder of this paper is organized as follows.
Section 2 presents the theoretical background, including the fundamentals of fractional calculus, the formulation of fractional-order systems, and an overview of the optimization techniques employed, with a particular focus on PSO and various AI-based approaches.
Section 3 details the materials and methods used in this study, covering data generation and balancing strategies, as well as the implementation of both PSO-based parameter identification and AI-based estimation methods.
Section 4 describes the experimental setup and presents the results, providing a comprehensive evaluation of model accuracy and computational performance across different edge devices. The Discussion section (
Section 5) examines the trade-offs between estimation accuracy and execution speed, exploring the practical implications of deploying these methods in real-time edge computing environments. Finally,
Section 6 summarizes the key findings, outlines the theoretical and practical contributions of this work, and suggests directions for future research.
2. Theoretical Background
This section provides an overview of the fundamental concepts and methodologies used in this work. It covers key aspects of fractional calculus, optimization algorithms, and AI-based techniques applicable to system identification.
The section is organized as follows.
Section 2.1 presents the fundamental concepts of fractional calculus necessary for this work.
Section 2.2 describes the PSO algorithm, used to estimate optimal FFOPDT model parameters by minimizing the error between its and the process’s response. Finally,
Section 2.3 introduces various AI-based techniques employed for the identification of dynamic systems.
2.1. Fractional Calculus
This section introduces fundamental concepts in fractional calculus, which extends classical differentiation and integration to non-integer orders. For a more in-depth discussion, refer to [
20,
21].
Fractional calculus generalizes the conventional derivative
to the non-integer-order operator
, where
denotes the fractional order, and
a and
t represent the lower and upper limits of the differ-integral operator. This operator is defined as follows:
Although is typically a real number, it can also be complex. In this study, we focus exclusively on real-valued fractional orders.
The literature presents various definitions of fractional-order operators. Below is a brief overview of those used in this work.
Definition 1. Riemann–Liouville:
The Riemann–Liouville fractional operator is one of the most commonly used formulations in fractional calculus. It is important to note that this is the definition used for the mathematical development in this work. The fractional derivative of a function is defined as follows:where m is the smallest integer greater than α (i.e., , with ), is the Gamma function [20], , and . Definition 2. Grünwald–Letnikov:
The Grünwald–Letnikov operator provides a discrete approximation to fractional differentiation and integration, making it well-suited for numerical computations. It is defined as follows:where corresponds to differentiation, to integration, represents the binomial coefficient, and h is the step size. Unlike the Riemann–Liouville formulation, the Grünwald–Letnikov approach aligns more closely with numerical methods, making it useful for computational implementations of fractional operators. This definition is the one used for the implementation of the fractional operators in the results of this work.
Definition 3. Mittag-Leffler Function:
The Mittag-Leffler function plays a fundamental role in fractional calculus, serving as a generalization of the exponential function. The one-parameter Mittag-Leffler function is defined as follows [22]:where is the Gamma function, , and . Because it generalizes exponential behavior, the Mittag-Leffler function is fundamental for modeling fractional-order systems, including the FFOPDT model [23]. Definition 4. FFOPDT model and step response:
The FFOPDT model is one of the most widely used reduced-order fractional models. Its transfer function is given as follows:where K is the process gain, is the time constant, is the apparent time delay, and α is the fractional order of the model. Note that the FFOPDT model reduces to the standard FOPDT model when . The time-domain expression for the response of the FFOPDT model to a step signal of magnitude is the following:where the output signal change is and represents the one-parameter Mittag-Leffler function [20]. 2.2. PSO Optimization
PSO is a population-based stochastic optimization technique inspired by the social behavior of bird flocking and fish schooling [
24]. In PSO, a swarm of candidate solutions (particles) is initialized in the parameter space, and each particle iteratively updates its position and velocity based on its own best-found solution and the global best solution discovered by the swarm. This process allows PSO to efficiently explore complex, multidimensional search spaces, making it particularly well-suited for challenging optimization problems in system identification [
25].
In our work on fractional-order system identification, PSO is employed to identify the optimal parameters that define the reduced-order fractional model. The optimization objective is to minimize the discrepancy between the step response of the high-order system and that of the reduced model, quantified by error metrics such as the mean squared error (MSE).
The PSO Algorithm 1 used in this study can be summarized as follows:
Algorithm 1: PSO algorithm for reduced-order parameter identification. |
![Mathematics 13 01308 i001]() |
The use of PSO in our framework is justified by its ability to handle non-convex, high-dimensional optimization problems without requiring gradient information, making it robust against local minima. This attribute, coupled with its proven success in previous system identification studies [
19], underscores its suitability for our approach. By leveraging PSO, we ensure that the reduced-order model closely approximates the dynamic behavior of the high-order system, providing a reliable foundation for subsequent control and real-time implementation on edge devices.
2.3. AI-Based Algorithms for System Identification
AI is increasingly used to identify reduced-order models that approximate complex dynamical systems while preserving essential behavior at a lower computational cost. AI-driven techniques facilitate the estimation of these models from experimental or simulated data, improving efficiency in control and optimization tasks.
The use of AI in the modeling and control of fractional systems has gained significant attention in recent years. Fractional systems, characterized by the presence of non-integer-order derivatives, have proven to be more suitable for describing physical systems with memory and hysteresis. Identifying key parameters such as K, T, L, and in these systems is essential for their control and optimization. In this context, various machine learning techniques have been employed to improve the accuracy of parameter estimation.
Raubitzek et al. (2023) [
26] present a comprehensive review on the combination of fractional derivatives with AI. They classify the approaches into three main categories: data preprocessing, where fractional operators are used to enhance input features in learning algorithms; machine learning-based fractional modeling, approaches that integrate machine learning with fractional differential equations to improve the accuracy of system dynamics prediction; and hyperparameter optimization, where machine learning is used to fine-tune fractional models based on observational data.
2.3.1. Comparison with Traditional Optimization Techniques
Supervised learning has been a key tool for parameter estimation in fractional control models. Cheng et al. (2022) [
27] apply fractional derivatives along with supervised learning techniques such as SVR and random forest to improve the estimation of the content of photosynthetic pigment in apple leaves. This work demonstrates that using fractional operators in combination with machine learning improves the accuracy of estimating relevant physical variables. Moreover, it has also been applied in the biomedical field, as Annadurai et al. (2024) [
28] introduce an approach that integrates transfer learning techniques with fractional operators to enhance medical image quality. Their method, called ETLFOD, combines pre-trained deep neural network models (DenseNet121, ResNet50, etc.) with fractional operators for noise reduction in MRI, CT, and X-ray images. The application of fractional operators in this case helps preserve crucial image features, demonstrating their applicability in signal processing tasks.
2.3.2. AI-Based Parameter Estimation vs. Traditional Methods
While traditional optimization techniques have been widely used in system identification, AI-based approaches offer notable advantages:
Robustness to nonlinearities: AI techniques handle nonlinearity better than traditional mathematical models.
Flexibility and adaptability: Machine learning models can generalize across different datasets, improving the estimation of parameters such as K, T, L, and .
Higher efficiency in large datasets: AI-based optimization can process large volumes of data faster, whereas conventional methods may struggle with scalability.
Given these advantages, the integration of AI-based methodologies in system identification and parameter estimation represents a promising direction to enhance traditional optimization frameworks. This complements the PSO approach [
24] by refining the parameter search process and improving convergence to optimal solutions.
2.3.3. Machine Learning Techniques
Machine learning has become an essential tool for modern data-driven applications, providing diverse methodologies for predictive modeling and optimization. Different algorithms have been used to address complex system identification problems, balancing interpretability, computational efficiency, and predictive performance. In this subsection, we describe the machine learning algorithms used in this experiment, highlighting their advantages and common applications in parameter estimation and predictive modeling.
Linear Regression
Linear regression is one of the most fundamental statistical methods used for predictive modeling. It assumes a linear relationship between the dependent variable and one or more independent variables, making it efficient and interpretable. However, its performance is often limited in cases where nonlinear relationships exist or when high multicollinearity is present in the data [
29].
Random Forest
Random forest is an ensemble learning method based on decision trees, introduced by Breiman (2001). It builds multiple decision trees and combines their predictions to improve accuracy and reduce overfitting. This algorithm is widely used for classification and regression tasks due to its robustness to noise and ability to capture nonlinear relationships in data [
30].
Neural Networks
Artificial neural networks (ANNs) are inspired by the structure of the human brain and have become a crucial technique in deep learning. They consist of multiple interconnected layers of artificial neurons that extract complex patterns from data. Although ANNs require large datasets and significant computational resources, they have demonstrated superior performance in predictive modeling and classification across various domains [
31].
Deep Learning
Deep learning is an extension of ANNs with multiple hidden layers, often referred to as deep neural networks (DNNs). These models excel in recognizing complex patterns and modeling sequential or structured data. Methods such as convolutional neural networks (CNNs) and recurrent neural networks (RNNs) have been highly effective in image processing and time-series forecasting [
32].
Extreme Gradient Boosting—XGBoost
XGBoost is an optimized version of the boosting technique, designed for high-performance classification and regression tasks. It improves accuracy by combining weak models iteratively, minimizing errors and reducing overfitting. XGBoost has gained popularity in data science competitions due to its efficiency and effectiveness in handling structured data [
33].
Support Vector Regression—SVR
SVR is an extension of support vector machines (SVMs) for regression tasks. It aims to find an optimal hyperplane that minimizes prediction errors while maintaining robustness against outliers. SVR is particularly effective for high-dimensional data and applications requiring strong generalization capabilities [
34].
Gaussian Process Regression—GPR
GPR is a probabilistic approach that models distributions over functions, making it highly suitable for uncertainty quantification. Unlike traditional regression techniques, GPR provides confidence intervals for predictions, making it particularly useful in applications where understanding the model’s uncertainty is essential [
35].
5. Discussion
5.1. Comparative Analysis of PSO and AI-Based Parameter Estimation Methods
Our experimental evaluation comprised two real-world experiments conducted on our custom-built heating system. Both experiments aimed to assess the performance of the hybrid identification framework, where PSO-based optimization is used to generate high-fidelity parameter estimates that subsequently train neural network models under actual operating conditions. The parameters of the reduced-order model and the corresponding performance metrics are summarized in
Table 7 and
Table 8 for Experiment 1, and in
Table 9 and
Table 10 for Experiment 2. In addition,
Figure 4 and
Figure 5 visually compare the real system output with the estimates provided by both the PSO and neural network approaches.
In both experiments, the PSO-based method consistently achieved very high accuracy by directly minimizing the mean squared error between the high-order system output and the reduced model output. Although the absolute error metrics varied slightly between experiments—reflecting minor differences in environmental conditions and system noise—the PSO approach maintained an excellent fit overall. However, its inherent iterative optimization process and the adaptive stopping criteria, which are set loosely to ensure broad applicability, led to variable and generally higher execution times.
In contrast, the neural network component of our hybrid approach, trained on the PSO-generated data, exhibited slightly higher error metrics compared to PSO but offered significant advantages in terms of speed and determinism. The neural network’s fixed computational steps ensured consistent, low-latency inference, making it particularly suitable for real-time applications on edge devices. Across both experiments, the performance trends of the neural network remained robust, with error metrics that were closely aligned despite the slightly elevated values relative to PSO.
Moreover, the reduced-order parameters—namely,
K,
T,
L, and
—estimated by both methods were highly comparable between the two experiments (see
Table 7 and
Table 9). This consistency confirms the robustness and generalizability of the hybrid identification framework, as the neural network was able to replicate the high-fidelity estimates generated by PSO across varying real-world conditions.
5.2. Model Accuracy and Trade-Offs
Our analysis of the reduced-order models derived via PSO and AI-based identification methods indicates that they effectively capture the key dynamic behaviors of the original high-order system. However, several trade-offs and potential sources of error emerge when simplifying complex dynamics into a lower order representation.
5.2.1. Accuracy Assessment
The error metrics presented in
Table 8 and
Table 10 and the visual comparison in
Figure 4 and
Figure 5 demonstrate that both the PSO and neural network approaches yield models that closely approximate the real system output. High
values, along with low MSE, RMSE, and MAE, indicate that the reduced-order models accurately reproduce both the transient and steady-state dynamics. This confirms that the simplified models are capable of representing the essential characteristics required for control and prediction tasks.
5.2.2. Potential Sources of Error
Despite the promising accuracy, several factors may contribute to discrepancies between the reduced-order models and the original high-order system:
Model simplification: Reducing a high-order system to a lower order model necessitates certain approximations, which may omit higher frequency dynamics or nonlinear effects inherent in the full-order system. With our approach, the PSO estimates, which capture the intricate behavior of the full-order system, serve as a robust training dataset for the neural network. As a result, even though the reduced-order model inherently simplifies the system dynamics, the neural network is trained to generalize from the detailed PSO output, thereby compensating for potential oversimplifications.
Parameter estimation: Both PSO and AI-based algorithms introduce estimation errors. For instance, the PSO algorithm employs a relatively loose stopping criterion to ensure generality across various scenarios, which can result in suboptimal parameter convergence. Similarly, while neural networks offer rapid inference, their performance is dependent on the quality and representativeness of the simulated training data. Our experimental results provide clear evidence that the hybrid approach effectively addresses challenges in parameter estimation. These consistently low errors across experiments affirm that the hybrid approach successfully mitigates parameter estimation challenges.
Noise and uncertainty: Measurement noise, sensor inaccuracies, and environmental variations in the experimental setup can lead to errors in the recorded output. These inaccuracies directly affect the fidelity of the parameter estimation process. In our experiments, the raw acquired signal—complete with inherent measurement noise and environmental uncertainty—was directly fed into both the PSO and the hybrid approaches. Remarkably, despite the presence of such noise in the raw data, both methods yielded low error metrics. The PSO-generated estimates maintained excellent accuracy, while the neural network trained on these high-fidelity outputs. These results demonstrate that our hybrid approach is robust against noise and uncertainty, effectively extracting the true system dynamics from real-world measurements.
Computational trade-offs: There is a clear trade-off between computational efficiency and model accuracy. The PSO approach, while delivering high-fidelity estimates, incurs longer and more variable execution times, which may limit its applicability in real-time scenarios. In contrast, the neural network-based component of our hybrid approach offers a dramatic reduction in computational time, enabling rapid, deterministic inference. Importantly, the slight increase in error metrics observed with the neural network is minimal and remains within acceptable bounds. Thus, the significant decrease in compute time more than compensates for the marginal increase in error, making the hybrid approach highly suitable for real-time applications where determinism and speed are critical.
5.3. Novel Contributions
Our approach introduces several novel contributions that significantly advance the field of fractional-order system identification. First, we propose a hybrid framework that integrates PSO-based optimization with AI-driven inference. The novelty of our approach does not stem from the use of PSO alone but rather from its integration within a hybrid system identification framework. This integration enables the accurate estimation of reduced-order model parameters from high-order systems by leveraging the high precision of PSO and the deterministic, rapid execution of AI methods. As a result, our methodology achieves a favorable balance between estimation accuracy and real-time performance, which is crucial for edge computing applications.
Second, our comprehensive evaluation encompasses both simulated data and experiments on a real heating system. By validating our methods in a practical setting, we demonstrate that the proposed framework is robust and applicable in real-world scenarios. The detailed error metric analysis and execution time measurements further confirm that our approach can reliably capture the dynamics of complex fractional-order systems while remaining computationally efficient.
Third, we provide a systematic comparison between PSO and AI-based techniques, highlighting the trade-offs associated with each. Our results indicate that while PSO achieves marginally better accuracy, its variable and higher execution times may limit its use in time-sensitive applications. In contrast, the AI-based method, particularly using neural networks, offers consistent and low-latency performance, making it a more attractive option for deployment in resource-constrained environments.
Finally, our work advances the current state-of-the-art by introducing a dynamic sampling strategy for parameter estimation and by rigorously analyzing both the accuracy and computational costs involved in the identification process. Collectively, these contributions pave the way for more efficient and reliable fractional-order system identification, particularly in applications that require real-time operation on edge devices.
5.4. Implications for Edge Device Deployment
Deploying the proposed hybrid identification models on edge devices presents both significant benefits and notable challenges. One of the primary advantages is the capability for low-latency, real-time processing. The deterministic execution times of the AI-based approach, particularly the neural network implementation, enable rapid inference that is crucial for real-time control and monitoring tasks. This has been evidenced by the performance metrics on platforms such as the NVIDIA Jetson Xavier NX, Jetson Nano, and Jetson Orin Nano (see
Table 5 and
Table 6). The consistent, low-latency performance makes these models well-suited for edge computing scenarios where prompt decision-making is critical.
In summary, the combination of high-speed AI-based inference with the localized processing capabilities of edge devices presents a powerful solution for real-time dynamic system identification. Future improvements in algorithm efficiency and hardware compatibility will further enhance these speed-ups, making the deployment of sophisticated identification models on resource-constrained edge devices increasingly feasible and effective.