Four-Wheeled Vehicle Sideslip Angle Estimation: A Machine Learning-Based Technique for Real-Time Virtual Sensor Development

Napolitano Dell’Annunziata, Guido; Ruffini, Marco; Stefanelli, Raffaele; Adiletta, Giovanni; Fichera, Gabriele; Timpone, Francesco

doi:10.3390/app14031036

Open AccessArticle

Four-Wheeled Vehicle Sideslip Angle Estimation: A Machine Learning-Based Technique for Real-Time Virtual Sensor Development

by

Guido Napolitano Dell’Annunziata

^1,*

,

Marco Ruffini

¹

,

Raffaele Stefanelli

¹

,

Giovanni Adiletta

¹

,

Gabriele Fichera

²

and

Francesco Timpone

^1,*

¹

Department of Industrial Engineering, University of Naples Federico II, 80125 Naples, Italy

²

Department of Civil Engineering and Architecture, University of Catania, 95123 Catania, Italy

^*

Authors to whom correspondence should be addressed.

Appl. Sci. 2024, 14(3), 1036; https://doi.org/10.3390/app14031036

Submission received: 23 December 2023 / Revised: 18 January 2024 / Accepted: 23 January 2024 / Published: 25 January 2024

(This article belongs to the Special Issue Vehicle Technology and Its Applications)

Download

Browse Figures

Versions Notes

Abstract

:

In the last few decades, the role of vehicle dynamics control systems has become crucial. In this complex scenario, the correct real-time estimation of the vehicle’s sideslip angle is decisive. Indeed, this quantity is deeply linked to several aspects, such as traction and stability optimization, and its correct understanding leads to the possibility of reaching greater road safety, increased efficiency, and a better driving experience for both autonomous and human-controlled vehicles. This paper aims to estimate accurately the sideslip angle of the vehicle using different neural network configurations. Then, the proposed approach involves using two separate neural networks in a dual-network architecture. The first network is dedicated to estimating the longitudinal velocity, while the second network predicts the sideslip angle and takes the longitudinal velocity estimate from the first network as input. This enables the creation of a virtual sensor to replace the real one. To obtain a reliable training dataset, several test sessions were conducted on different tracks with various layouts and characteristics, using the same reference instrumented vehicle. Starting from the acquired channels, such as lateral and longitudinal acceleration, steering angle, yaw rate, and angular wheel speeds, it has been possible to estimate the sideslip angle through different neural network architectures and training strategies. The goodness of the approach was assessed by comparing the estimations with the measurements obtained from an optical sensor able to provide accurate values of the target variable. The obtained results show a robust alignment with the reference values in a great number of tested conditions. This confirms that the adoption of artificial neural networks represents a reliable strategy to develop real-time virtual sensors for onboard solutions, expanding the information available for controls.

Keywords:

velocity estimation; virtual sensor; sideslip angle estimation; vehicle dynamics; machine learning; artificial neural networks

1. Introduction

Over the years, car safety tech and crash prevention systems have improved a lot, and this has made active safety tech even better at stopping car accidents [1]. Efforts to reduce traffic jams and boost overall traffic efficiency have also increased [2]. Through complete digitization, the vehicles and all the elements related to the transportation system are evolving into sophisticated, interconnected complex-networked mobile computers. In particular, this aspect includes various networked microprocessor-based electronic control units (ECU) that enhance their functions and serve as a foundation for continuous control [3,4]. Notably, the real-time estimation of data regarding the changes in vehicle’s velocity components plays a vital role in three key areas:

Driving assistance systems;
Autonomous driving;
The motorsport field.

For what concerns the driving assistance systems, the increasing level of vehicle dependence on software has made complex control systems essential for modern vehicle development.

Advanced safety systems, such as the anti-lock braking system (ABS), traction control system (TCS), and stability control systems, have become common features in modern vehicles [5,6]. To guarantee the optimal performance of these vehicle control systems, it is crucial that all the data collected from on-board instruments, combined with insights from predictive models, maintain a high level of quality [7,8,9,10,11,12,13].

An accurate estimation of the vehicle longitudinal speed [14], combined with the estimation of the sideslip angle, is crucial for the design and implementation of safety-related control integrated systems. Indeed, the knowledge of the mentioned quantities can be useful for emergency braking, by enabling systems to detect potential collisions and respond promptly; for traction control, by controlling wheel spin, maintaining grip on the road, especially in adverse conditions; and for stability control, for preventing skidding and maintaining control during aggressive maneuvers.

The estimation of a vehicle’s longitudinal velocity and sideslip angle is a crucial topic also in the autonomous driving field [15]. Indeed, it allows autonomous vehicles to continuously monitor their velocity and direction to adapt both of them appropriately to road conditions. Moreover, thanks to these quantities, the vehicle is able to predict the trajectories, maintaining stability during turns and thereby enhancing overall safety.

Another field of interest for the estimation of a vehicle’s longitudinal velocity and sideslip angle is motorsport. Indeed, accurately estimating tire dynamics requires reliable knowledge of the longitudinal velocity and sideslip angle to determine slip indices [16]. In addition, these quantities are really useful for selecting the best vehicle setup to ensure the proper use of tires during races. For this aspect, it is also essential to characterize the tires from both a mechanical and thermal point of view [17,18,19], and this is related to the evaluation of the forces acting on the tires and the optimal thermal window of the same. In this way, it is possible to understand how to maximize the best overall vehicle performance, taking into account the tire/road interaction phenomena.

The sideslip angle estimation can be also useful to predict when the vehicle is going to saturate the tires and lose directional stability [20], allowing for easy identification of understeer and oversteer conditions [21]. In particular, the estimation of the sideslip angle can be combined with sideslip control, particularly in high-performance and sports driving scenarios. This approach is designed to maintain the vehicle’s sideslip angle within a specific range by employing various controls [22,23,24]. Given the significant relationship between the sideslip angle and the tire’s operating conditions, it plays a critical role as an input for estimating the maximum friction coefficient between the tire’s contact patch and the road surface [25,26,27]. Further, this can also be used to assess tire wear [28,29].

Unfortunately, the measurement of the longitudinal velocity and the sideslip angle requires the use of complex and relatively expensive devices. This equipment includes optical sensors, the Global Positioning System, and the inertial platform. Furthermore, these instruments may encounter challenges, such as reflection issues with optical sensors when pointed at reflective surfaces like snow-covered roads, or general failures in the acquisition system.

In addition, in some contexts like motorsport, it is not even possible to use these sensors during the races; then it becomes essential the virtual estimation of both longitudinal velocity and sideslip angle.

According to the methods present in the literature, two main categories of vehicle sideslip angle estimation have been identified: observer-based and neural network-based [30,31]. For what concerns the first one, the most famous is the Kalman filter [32,33]. In the automotive domain, a multitude of nonlinear Kalman filters have been developed, and they find significant applications, mainly due to the nonlinearity of vehicle dynamics models [34]. On the other hand, the main obstacle with this approach lies in finding a balance between the complexity of the vehicle model, the quality of results, accuracy, and the computational burden required to achieve real-time sideslip angle estimation. The other category refers to the estimation of the sideslip angle through the use of neural networks (NN), a paradigm of the broader machine learning techniques [35,36,37,38,39]. An artificial neural network consists of interconnected neurons communicating through layers, adjusting their structure based on training information [40]. In recent years, neural networks have gained significance in the automotive industry for estimating variables without specific measuring procedures. This approach offers cost-effective computations, easy algorithm implementation in vehicles, and accurate results, especially in complex real-time control environments. It is important to note that the output of the neural network relies heavily on the data used for training. To achieve accurate generalization, the neural network requires a large amount of reliable data. However, if the network is appropriately trained on a robust and large dataset, it can accurately predict vehicle behavior, even in strongly nonlinear conditions. Moreover, the neural network does not require physical models. On the other hand, as stated, the Kalman filter requires a dependable vehicle dynamic or kinematic model to work. Regarding the sideslip angle estimation, the first type relies on tire models, which can become inaccurate in strongly nonlinear conditions, while the latter type in the Kalman filter tends to become unobservable for low yaw rate values and provide uncertain estimates [24,41].

For this reason, starting from previous work of the authors, in which is presented a novel methodology for estimating longitudinal vehicle velocity, this paper extends the approach to evaluate the sideslip angle and integrates the estimates of these two quantities [42]. In more detail, the proposed methodology introduces a dual-network architecture, that is, two distinct neural networks working in sequence: one is a dedicated neural network for estimating the longitudinal velocity, while the other is a separate neural network for predicting the sideslip angle, which also takes as input the longitudinal velocity estimate made by the previous network. In this way, after training the two networks appropriately, it is possible to create a virtual sensor that can replace the optical one. This virtual sensor estimates the longitudinal velocity of a vehicle using commonly found sensors. Then, a second neural network can be fed with this velocity and data from the same sensors to obtain an estimated sideslip angle at any given time. By implementing the trained networks on the vehicle control unit, these estimates can provide real-time information that improves the operation of driver assistance systems, without requiring expensive sensors. It can also help improve the vehicle’s dynamic behavior and performance where necessary.

The paper presents, in Section 2, a brief overview of the sensors installed on the vehicle, of the acquired channels, and of the tests performed with the aim of deepening the capability of the neural network; Section 3 presents the types of neural networks used and how they were designed. In Section 4, the data preparation procedure is presented; this is a critical aspect for a machine learning model to work properly. Finally, in the last section, a detailed comparison of the results related to the different types of trained networks is reported, with particular attention to the key performance indicators (KPIs), which is useful to define the best solution for the presented case study.

2. Sensors and Test Methods

In the context of this research activity based on an artificial neural network, where inputs rely on specific values acquired through sensor data, it is necessary to present a comprehensive overview of the sensor technologies used to collect the necessary data for feeding the artificial neural network (ANN). This paper focuses on various analyses for sideslip angle estimation using a high-performance reference vehicle equipped with multiple sensors placed in various locations to gather data for the ANN. Additionally, key characteristics of the reference vehicle, such as the center of gravity (CoG) position, tire specifications, and engine and suspension features, were also known. Acquisitions were made on different circuits that had varying characteristics and layouts. One of them was identified by tight, slow turns that had a high aerodynamic load. The other one had the opposite characteristics, with long straights and a low aerodynamic load. The third circuit had intermediate characteristics, with several fast corners. On all these circuits, the vehicle was sensorized in the same way. In addition, with the same tires and on each circuit, long runs and short runs were performed with the same vehicle. In the former, the driver was more focused on tire management and had more fuel load; in the latter, the driver was more focused on performance. In addition, limit situations such as tire locking under braking were also tested so that the NN could also be trained to recognize particular phenomena with different slip values.

The data acquired from the sensors are often subject to noise, electromagnetic disturbances, and high-frequency vibrations. Therefore, a preliminary operation involved filtering the channels of interest to remove the noise from the acquired signals. Considering unavoidable misalignments between the sensors and the vehicle’s reference system, it was essential to detect and eliminate these offsets. Additionally, to compare the outputs from various instruments mounted in different parts of the vehicle, all of the data were reported to the center of gravity (CoG).

2.1. Input Channels

This research activity, as previously mentioned, aims to estimate the sideslip (

β

) angle through the use of neural networks (NN). The input channels for the NN, which are useful to achieve the mentioned evaluation, are presented in Table 1.

A block diagram that summarizes the structure of the neural network inputs is depicted in Figure 1.

These channels are essential in the field of vehicle dynamics, as they are useful to describe the vehicle’s behavior during both cornering and straight-line driving. For this reason, these variables were chosen as inputs for the neural network to estimate the sideslip angle.

In addition, to acquire data in different conditions, several analyses were performed by varying both the road and the tire conditions. Indeed, in this way, it was possible to understand the influence of these variables on the sideslip angle estimation.

2.2. Equipped Sensors

The input channels for the neural network are obtained from various sensors properly mounted on the reference vehicle. For what concerns the steering angle and the wheels’ speed of the four corners, these are acquired from the vehicle CAN-bus [43]. A controller area network (CAN-bus) is a standardized vehicle communication network, developed to facilitate communication between microcontrollers and a variety of sensors, all without the dependency on a central host computer [44,45]. It employs a message-based protocol consisting of four frame types: Data, Remote, Error, and Overload. The Data frame is the only one for actual data transmission. Each device transmits data serially, and if multiple devices attempt to transmit simultaneously, the highest priority device continues while others yield. The CAN-bus is then essential because it enables communication among the various instruments mounted on the vehicle for data acquisition and provides crucial data for this research activity.

Differently, the acquisition of the yaw rate and both lateral and longitudinal accelerations is achieved with an inertial platform (depicted in Figure 2a). A sensor module comprising three accelerometers and three gyroscope units, functioning as angular rate sensors, is employed to calculate all resultant outputs. It is based on mathematical algorithms used to make precise real-time measurements of motion. Utilizing the Global Navigation Satellite System (GNSS), the instrument has the capacity to rectify all recorded parameters, thereby ensuring the accurate assessment of metrics such as accelerations, roll, pitch, and yaw. All of the data are processed in real time, time-stamped, and synchronized with GPS time. This information is transmitted over the CAN-bus and can be viewed in real time on a laptop using specialized software.

To determine the longitudinal velocity and the reference sideslip angle for training the artificial neural network, an optical sensor is employed (depicted in Figure 2b). This instrument, utilizing Correvit technology, is valuable for directly measuring vehicle velocity and its components through non-contact measurements. A built-in GPS receiver enables the acquisition of position and speed data, while integrated accelerometers allow for the detection of the vehicle’s longitudinal and transverse accelerations. Additionally, integrated angular rate sensors measure pitch and roll angles, as well as the rotation around the vehicle’s vertical axis. Furthermore, the sensor also calculates additional signals such as leveled acceleration and curve radius.

A schematic representation that summarizes the associated instruments for the various neural network inputs is presented in Table 2.

The described sensors are all linked through an external data acquisition system with the ability to collect a range of signal types and ensure their proper synchronization.

In the motorsport field, the use of optical sensors on vehicles during races and official test sessions is constrained by specific regulations. This restriction leads to less information that is useful to improve the overall vehicle performance. Additionally, it is essential to note that the sensors presented are both expensive and delicate, making them impractical for installation on all vehicles.

Hence, the proposed methodology, which relies on neural networks for estimating the sideslip angle, offers an effective solution for overcoming the challenges associated with sensor equipment. In particular, with regard to the input related to longitudinal velocity, this paper presents two cases: the first case uses the value from the optical sensor as input, while the second case employs the estimation of longitudinal velocity based on the procedure presented in [42]. The novelty of the proposed methodology lies precisely in this second approach, that is, the introduction of a dual-network architecture, consisting of a dedicated neural network for longitudinal velocity estimation and a separate neural network designed for sideslip angle prediction, which is also fed with the estimated longitudinal velocity. So, this innovative approach involves the collaboration of two distinct neural networks operating in cascade, each addressing specific aspects of the system dynamics. In this way, it is possible to design a virtual sensor that replaces the optical one.

3. Artificial Neural Networks

Deep learning is a groundbreaking technology in artificial intelligence. It uses artificial neural networks to recognize patterns and make intelligent decisions, revolutionizing how we tackle complex problems and interpret large datasets.

An artificial neural network, a construct of interconnected neurons organized into layers, operates as an adaptable system. This adaptability allows it to modify its internal structure by assimilating information during the training phase, fostering its ability to process and analyze data effectively [46]. This network can simulate complex relationships between inputs and outputs, which can be challenging for traditional analytic functions. Moreover, neural networks can generalize their learning, accurately generating outputs for inputs not in their training dataset through multiple learning cycles [47]. It is precisely this remarkable ability to adapt and generalize that piques our interest in this work [48].

3.1. Artificial Neural Network Structure

To properly understand how the developed model to evaluate the sideslip angle works, it is necessary to deep into the most important features of an artificial net. Every neuron in a neural network has multiple inputs that can be either the input signals of the problem or signals coming from previous neurons (Figure 3). These inputs are added up for each neuron based on the weight of their connections. During the training phase, the weights are adjusted to enhance the overall learning of the network. Additionally, every neuron in the network has a bias that helps in fine-tuning the optimal working point of the neuron. Another crucial aspect of the neuron is the activation function that determines its output based on the inputs, weights of the connections, and the bias. In brief, the neuron’s ability to learn and adapt is governed by these key features.

Hence, the output of a neuron is returned by Equation (1).

{o u t p u t}_{i} = f (\sum_{j = 1}^{p} θ_{j i} i n p u t_{j} + θ_{0 i})

(1)

where

$θ_{j i}$ are the weights of the i-th neuron;
$i n p u t_{j}$ are the inputs of the i-th neuron;
$b i a s$ is the bias of the i-th neuron;
$o u t p u t_{i}$ is the output of the of the i-th neuron;
f is the activation function.

As mentioned, multiple neurons aggregate to create neural networks, complex structures organized into layers. These layers include an input layer, an output layer, and one or more intermediate layers, which are called hidden layers. The number of hidden layers and neurons in each layer varies based on specific application needs, with no fixed rules dictating their selection.

3.2. Artificial Neural Network Training

Training a neural network involves optimizing the weights and biases associated with its neurons, specifically for a subset of the dataset known as the training set. The goal is to progressively minimize prediction errors. To achieve this, an error or loss function, along with an algorithm, is introduced. The backpropagation algorithm, which is based on gradient descent methods, is commonly used to reduce the error between the network’s output and the desired target.

In essence, after a forward pass, backpropagation adjusts parameters through a backward pass, fine-tuning the model [49]. It starts with random weight and bias initialization, comparing results to target outcomes, leading to the loss error function. The gradient descent method then calculates partial derivatives to identify influential nodes in minimizing loss, as shown in Equation (2).

θ_{i}^{(t + 1)} = θ_{i}^{(t)} - η \frac{\partial J (θ_{0}, θ_{1}, θ_{2}, \dots, θ_{p})}{\partial θ_{i}}

(2)

where

$θ_{i}^{(t + 1)}$ represents the updated weight (or bias) at time $t + 1$ .
$θ_{i}^{(t)}$ represents the current weight (or bias) at time t.
$η$ is the learning rate, which controls the step size for weight updates.

Those nodes receive smaller weights to reduce the overall loss of the model. In other words, it measures how sensitive the loss is to changes in each weight. This is carried out using calculus and gradients to determine the direction and magnitude of the steepest increase in the loss. The learning rate controls the update magnitude, with smaller rates leading to more stable convergence, but more iterations may be required. Larger rates can speed up convergence but risk overshooting the minimum. In summary, during each training cycle (epoch) of the neural network, the backpropagation algorithm adjusts weights by subtracting derivatives scaled by a “learning rate” (which avoids abrupt variations of the weights). This iterative optimization process continues until the error function reaches a desirable threshold for the application.

To perform the explained training process of the neural network, the authors decided to use the Levenberg–Marquardt algorithm. From the foundational principles of standard backpropagation with gradient descent, the Levenberg–Marquardt algorithm emerged as an innovative optimization method for neural network training. This algorithm introduces a novel approach by incorporating the Hessian matrix, capturing the second-order partial derivatives of the cost function with respect to network weights. It dynamically adjusts a damping parameter to modulate the step size in weight updates, adapting to the local curvature of the cost function’s surface. The resulting weight updates become a function of the gradient, Jacobian matrix, error vector, and Hessian matrix, enabling faster convergence and improved stability during training. Its functioning is summarized in Equation (3).

θ^{(t + 1)} = θ^{(t)} - {[J^{T} \cdot J + λ \cdot diag (J^{T} \cdot J)]}^{- 1} \cdot J^{T} \cdot e

(3)

where

$θ_{i}^{(t + 1)}$ represents the updated weight (or bias) at time $t + 1$ .
$θ_{i}^{(t)}$ represents the current weight (or bias) at time t.
J is the Jacobian matrix, representing the first-order partial derivatives of the cost function with respect to the weights.
e is the error vector, representing the difference between the network’s predictions and the target values.
$d i a g (J^{T} * J)$ represents the diagonal elements of the Hessian matrix.

This method is particularly valuable when training neural networks with relatively few weights and is recognized for its ability to expedite the optimization process in various scientific and engineering domains.

3.3. Overfitting Phenomenon

A recurring problem when using supervised learning models is the occurrence of overfitting. Overfitting is a common challenge in machine learning where a model is trained to learn from a specific dataset to the point that it becomes too specialized to that dataset. This can lead to the model becoming less accurate, or even ineffective, when presented with new, unseen data. Essentially, overfitting occurs when the model has learned to recognize the training dataset so well that it becomes unable to generalize and make accurate predictions on new data. For this reason, a validation procedure is necessary to prevent the occurrence of the overfitting phenomenon. There are several ways of performing the validation process that can be grouped into two categories:

Dataset split or data partitioning: This allows for the evaluation of the model’s performance on a separate set of data that the model has never seen before;
Cross-validation: This involves dividing the dataset into multiple subsets (folds) and systematically training and testing the model on different combinations of these folds to assess its generalization performance and to tune hyperparameters.

In this work, it was decided to use the data-partitioning technique to avoid overfitting. This decision was made because a relatively large dataset was available, so dividing it did not pose any risk to net training. With this technique, the overall dataset is divided into two subsets: a training subset and a validation subset. The former is used to train the model, while the latter is used to verify that the model generalizes correctly. Specifically, if the model’s performance on the validation set starts to degrade while the training error continues to decrease, it is possible to stop the training to prevent overfitting.

Furthermore, in order to utilize the extensive dataset effectively, a test subset was included for independent evaluation at the end of training. This subset helps to assess the model’s performance on unseen data, providing an unbiased estimate.

Regarding this division of the dataset, it was performed on Matlab with the appropriate “Divideblock” function, with which data are split into three subsets using three contiguous blocks of the original dataset. The fraction of the original data that goes into each subset is as follows:

$70 %$ of the dataset for the training phase;
$20 %$ of the dataset for the validation phase;
$10 %$ of the dataset for the test phase.

The dataset was divided in this way strategically to strike a balance between preventing underfitting, ensuring an adequate number of training samples, and guarding against overfitting by providing a sufficiently sized validation set for assessing model generalization.

3.4. Neural Network Architectures

This section describes the neural network architecture typologies used in this study. Indeed, there are different types of neural networks whose choice depends on the application and complexity of the problem to be solved. As the research deals with discrete-time sequences, two types of networks were considered for the study:

Feedforward neural networks (FNNs) follow a structure where connections exclusively link neurons from one level to the next. These networks lack backward connections or links between neurons at the same level. This architecture means that FNNs do not retain a memory of past events, and their output solely depends on the current inputs [50];
Recurrent neural networks (RNNs) are a class of neural networks characterized by the presence of loops: the output values of a higher-level neuron (closer to the output) can be used as an input for a lower-level neuron (closer to the entrance). Thanks to the looping mechanism, RNNs have an internal memory [51].

More in detail, as for recurrent neural networks, two different variants of them have been tried and compared:

Standard recurrent neural network: the prediction of the current output is typically based only on past outputs and internal hidden states;
Nonlinear autoregressive with exogenous inputs (NARX): this kind of RNN explicitly accounts for both past outputs and past exogenous inputs when predicting future outputs [52]. This makes it suitable for modeling systems where the current output depends not only on past outputs but also on external factors [53].

3.5. Neural Network Hyperparameters

This section presents the neural network hyperparameters chosen for the study, including the number of neurons, hidden layers, epochs, and learning rate value. It is crucial to set these parameters accurately to avoid overfitting, underfitting, and lack of prediction accuracy.

The selection of the number and dimensions of hidden layers is motivated by the problem’s intricacy. Since the very complex trend of the sideslip angle time series, a neural network architecture with two hidden layers is employed in this work. Indeed, networks with two hidden layers are capable of representing functions of any shape and complexity, while, based on current understanding, there is no theoretical justification for using neural networks with more than two hidden layers [54]. The use of two hidden layers offers increased modeling capacity while managing computational resources effectively. This configuration strikes a balance between complexity and training efficiency.

There are many rule-of-thumb methods for determining the correct number of neurons to use in the first hidden layers. Choosing too few neurons can result in underfitting and high statistical bias. On the other hand, selecting too many neurons can lead to overfitting, high variance, and a longer training time. To achieve a balance between accuracy, generalization, and processing time, the following guidelines were followed:

The number of hidden neurons should be less than twice the size of the input layer;
The number of hidden neurons should be between the size of the input layer and the size of the output layer.

Starting from these observations, the number of neurons in the first hidden layer was set to

2 / 3

the size of the input layer plus one for the output layer size. Regarding the number of neurons in the second hidden layer, a looping mechanism was introduced to systematically explore various combinations of neurons in both hidden layers. The goal was to optimize the network’s performance through multiple training sessions driven by the looping mechanism. Subsequently, the network configuration with the pair of neurons that minimized a specific error function was selected. The neural network with the optimal pair of neurons was subjected to additional training to assess if further performance improvement was achievable. If the performance showed positive enhancement during this additional training, the last trained network was retained. Otherwise, the previous configuration was reinstated. Ultimately, the best-performing neural network configuration, as determined by this process, was used for making predictions.

Neuron initial weights are not defined because the correlation between input and output is not preventively known, so they were defined randomly by software.

The number of epochs, that is, the number of presentations to the network of all patterns in the training set, was chosen equal to 200 to find the best trade-off between the quality of results and the network processing time. Once the epochs were chosen, the number of validation checks for early stopping was set too. Early stopping is a regularization technique where training is halted when the validation performance no longer improves, preventing overfitting. This value was set equal to the

20 %

of the number of epochs.

The choice of activation functions influences the network’s ability to capture nonlinearities in the data. For the type of data linked to this application, the hyperbolic tangent (tanh) activation function was chosen. This function maps input values to a range

(- 1, 1)

, and is commonly used in neural networks for its ability to model nonlinear relationships.

Finally, it is important to underline that the networks have been trained with different dataset configurations to evaluate the effect on the estimation accuracy and the possible occurrence of overfitting. In more detail, the nets have been trained using both a dataset with all circuits and one without the circuit on which the prediction was then made. In addition, the prediction was made considering both new and worn tires.

These settings were used for all designed neural networks, and they are summarized in Table 3 and shown in Figure 4, Figure 5 and Figure 6, where the schemes of these nets are reported.

3.6. Model Evaluation

To evaluate the performance of the neural network in terms of estimation error, the root mean square error (RMSE) is employed. This is a widely used metric for assessing the accuracy of a prediction model, usually in regression analysis. It considers the magnitude of the error between the predicted values and the actual values. Together with the RMSE, with the same aim, it is used also the coefficient of determination. It is a statistical metric used to evaluate the goodness of fit of a regression model. It measures the proportion of the variance in the dependent variable (the target) that is explained by the independent variables (predictors) in the model. It results in a value between 0 and 1, where a higher value indicates a better fit of the model to the data.

3.7. Analyzed Approaches

In this part, an explanation of the approaches, which are aimed at enhancing the accuracy of the angle prediction performed by the network and the ability to generalize correctly of the network itself, is presented. For what concerns the ability of the network to work with circuits having different layouts—and therefore, characteristics—the following approaches have been tried:

All Tracks: For training and validating the neural network, data from all the available tracks are used. The system randomly selects data for the training set and uses the remainder for the validation and test subsets;
Exclude Track: The neural network is trained on a dataset from which all data related to a specific track have been removed. These data are then used for the prediction and performance evaluation of the net.

Additionally, the model has been verified to predict the sideslip angle on the same track, using tires with varying levels of wear, thus demonstrating its adaptability to different tire wear levels [20].

4. Data Preparation for Neural Networks

In the machine learning world, data preparation is an important first step in transforming raw data into well-structured data, optimized for successful use in machine learning models; indeed, the overall performance of these models is highly dependent on the quality of data preparation performed. Great care and attention to this important process are therefore necessary to achieve the best results. Data preparation includes methods such as data cleaning and filtering, data normalization, feature engineering and selection, handling of missing values, dimensionality reduction, and so on. The quality of data preparation plays a significant role in the accuracy and efficiency of a machine learning model. The main goals pursued in this step for the model under consideration were as follows:

Improved model accuracy: Data preparation enhances the quality of the dataset, leading to more accurate and reliable machine learning models;
Enhanced feature relevance: data preparation allows for effective feature selection. By selecting the most informative variables, the model can make accurate predictions while reducing the risk of overfitting. This, in turn, ensures that the model can generalize well and avoid loss of accuracy;
A faster and more efficient training phase: Well-prepared data speed up model training, reducing computational burden and processing time.

4.1. Data Cleaning

The first steps taken to prepare the dataset were the following basic data-cleaning operations are as follows:

Identification and resolution of duplicate rows;
Identification and resolution of gaps in columns and rows.

These steps are essential to prepare the dataset for the model. They ensure that the data are in a format that is compatible with the model, and that it does not contain any useless information that may impair its operation.

4.2. Feature Selection

The process of feature selection involves reducing the number of input variables used in a model to only the most relevant ones. In this context, relevance refers to the link between the input quantities and the target used for training. This procedure has three main aims:

By retaining key features and eliminating irrelevant ones, it boosts a model’s predictive performance;
It guards against overfitting by reducing data complexity and focusing on crucial attributes;
Models with fewer features are quicker to train and less resource-intensive, which ia ideal for large datasets and real-time applications, reducing computational costs.

In order to select the features for the neural network responsible for predicting sideslip angle, a methodology was employed that combined concepts and equations of vehicle dynamics [16] with the channels that can be obtained through sensors that are easily installed, purchased, and used on various types of vehicles, as explained in Section 2. Using this information, the most relevant acquired channels were selected.

4.3. Data Filtering

As stated in Section 2, acquired data need a preliminary filtering operation. Signals acquired from various sensors in a vehicle are subjected to different types of noise, such as electromagnetic interference, sensor inaccuracies, and environmental factors, so the information they provide to the tool could be not directly linked with the phenomena involved in the vehicle dynamics field. Filtering becomes crucial in reducing this noise to ensure that the data obtained are more reliable and accurate. For this purpose, a Butterworth filter is used, which is a type of signal-processing filter designed to have a frequency response that is as flat as possible in the passband. In particular, a third-order, 5 Hz low-pass filter is used, with a Nyquist normalizing frequency depending on the sampling rate, commonly equal to 50 or 100 Hz. The results of the filtering operation can be seen for all the channels of interest in Figure 7.

4.4. Static and Dynamic Offset Detection

On-board instrumentation in vehicles is often installed with misalignments between its own reference system and that of the vehicle. This can lead to miscalculations during certain procedures. To prevent this, it is important to detect and correct these misalignments or offsets. There are two types of offsets that can be identified: static and dynamic. To fix these offsets, the authors followed the subsequent procedures:

In order to detect the static offset, it is required to select a range of data acquisitions with the vehicle in stationary conditions. Once the range is selected, the average values of longitudinal and lateral accelerations, as well as longitudinal velocity channels are calculated within the selected range. These average values are then subtracted from the original signals;
To detect the dynamic offset, a specific range of acquisitions is chosen in which the vehicle is moving on a straight line with a constant speed, usually in the pit lane. Once the range is selected, the average steering angle, yaw rate, and sideslip angle values are calculated within that range. These average values are then subtracted from the original signals to obtain the dynamic offset.

The non-contact optical sensor, which was used to acquire longitudinal velocity and sideslip angle, can have also an inclination in the

x y

plane. This inclination can be calculated with Equation (4).

β_{0} = arctan (\frac{V_{y_{0}}}{V_{x_{0}}})

(4)

where

V_{y_{0}}

and

V_{x_{0}}

are, respectively, the mean of the longitudinal and lateral velocity measured in straight stationary maneuvers. This angle enables the correction of optical sensor measurements.

4.5. Outliers Removal

A technique based on median absolute deviation (MAD) was used for identifying and dealing with outliers in the dataset. MAD is a robust statistical measure, based on calculating the distance between the median of each point in the univariate distribution (single channel in the dataset) and the median of the distribution itself, as described in Equation (5).

M A D = m e d (| x_{i} - m e d (X) |)

(5)

Instead of using the mean and standard deviation (standard z-score method for outliers imputation), this method uses the median as the measure of central tendency and the median absolute deviation to quantify the spread or dispersion of the data, as can be seen in Equation (6). The median is robust to outliers and represents the center of the data without being heavily influenced by extreme values. The MAD is also robust to outliers and characterizes the variability in the data without being distorted by extreme values [55].

Z_{M A D} = \frac{x_{i} - m e d (X)}{M A D}

(6)

where if

Z_{M A D} > T h r e s h o l d

, then

x_{i}

is considered an outlier. The threshold value, as in the standard z-score method, is chosen according to the application.

In summary, the MAD-based z-score method was preferred over the standard z-score for identifying and handling outliers because it is less influenced by extreme values, interpretable, and robust in various data scenarios.

4.6. Data Standardization and Transformation

Input and output variables involved in the data acquisition process are, generally, physical quantities of different types. For this reason, they may have very different magnitudes, which could affect the artificial neural network negatively. When the data used for training a network have significantly different magnitudes, the convergence and learning process can be slowed down significantly. In some cases, this can cause the network to not effectively perform the learning process [56]. This is why the quantities of interest for the model should be preprocessed in order to obtain magnitudes all within the same range. Normalization and standardization are two widely used preprocessing methods to achieve that. In this paper, a process of standardization has been performed. This process, for each predictor individually, involves scaling data so that the mean results in zero (centering), and the standard deviation results in unity and retains the shape properties of the original dataset, that is, the same skewness and kurtosis [57].

The standardization related to a generic observation

x_{i}

is defined as in Equation (7).

z_{i} = \frac{x_{i} - μ}{σ} .

(7)

where

μ

is the mean, and can be defined as in Equation (8), and

σ

is the standard deviation, and is defined in Equation (9).

μ = \frac{1}{n} \cdot \sum_{i = 1}^{n} x_{i}

(8)

σ = \sqrt{\frac{1}{n - 1} \cdot \sum_{i = 1}^{n} {(x_{i} - \bar{x})}^{2}} .

(9)

It is crucial to emphasize that to prevent data leakage, the process of standardization should only be performed on the training subset. Afterward, the same values of mean and standard deviation should be used to standardize the channels associated with the validation and test subsets.

5. Results

Experimentation Results

A set of tests were performed to validate the developed model and to look at the achieved estimations of the sideslip angle. The following section presents the results for each neural network type and for the explained approaches.

In Figure 8 and Figure 9 are shown the feedforward neural network predictions of the angle of interest for two circuits with very different layouts using the “All Tracks” approach. It is clear from the red curve that the estimated angle closely matches the angle acquired by the optical sensor represented by the black curve for both tracks. Therefore, after training, the feedforward network can quickly and accurately estimate the vehicle’s sideslip angle.

Figure 10 displays the training, validation, and test performance over epochs during the training operation. Initially, the error rapidly decreases but then stabilizes, reflecting the network’s understanding of the input–target relationship learned from the training dataset. The optimal network performance is determined based on the epoch with the lowest validation error. Beyond this point, the validation error starts to rise, indicating the onset of overfitting. The network then reverts to the epoch preceding this phenomenon. In Figure 11, regression plots illustrate the relationship between predicted values and target values. The y-label in these plots represents the equation linking the predicted output (dependent variable) to the target (independent variable). The target coefficient, close to unity, indicates the proximity of the training output to the target from the dataset. The second term, ideally close to zero, represents the error or residue required to align the scaled target with the training output. The coefficient of regression in the title, ideally equal to unity, signifies the strong alignment between the target and output. These results affirm the successful execution of the training phase, achieving optimal working conditions.

In summary, these plots highlight that the training phase was effectively executed, leading to a well-performing neural network model.

Figure 12 and Figure 13, on the other hand, show the results of angle predictions made with NARX and RNN neural networks on Track “A”.

To analyze and draw conclusions in more detail, as mentioned, two statistical parameters have been calculated for each network forecast: the coefficient of determination and the RMSE. For the precision on Track “A”, the prevision errors for each network architecture are summarized in Table 4.

In order to verify the effective work efficiency of the networks, they were tested on a second run related to the same circuit (Table 5), but this time, the vehicle was equipped with worn tires, while in the first run, a new set of tires was used. In this way, it is possible to check that the neural network has not adapted to the data of the first run, and that it is therefore able to work with different datasets, always giving equally valid results.

With the same aim, Table 6 presents the statistical indicator values for the estimation related to a different circuit (“Track B”) that has a different layout.

The findings reported in the graphs and tables presented shed light on the forecast accuracy of the considered neural networks. In particular, the results obtained for the vehicle sideslip angle estimate by means of three different typologies of artificial neural networks indicate that the feedforward neural network is the most accurate, followed closely by NARX. On the other hand, it is fair to say that the RNN is the least precise among all the models. As can be seen graphically, it falls short in comparison to the others. Based on the analysis of the plots, it is not surprising that the statistical values for both feedforward and NARX are quite similar. The slightly higher statistical indicator values of the first with respect to the second are linked to the fact that the time series in feedback in the NARX is characterized by imperfections that, when passed into input, can lead to the presence of anomalies and inaccuracies.

As far as processing time is concerned, the best net is again the feedforward, followed by the NARX and the RNN.

Since these considerations can also be repeated for simulations on other circuits and other runs on the same circuits, this work seems to show that the best neural network for performing vehicle sideslip angle estimation is the network without loops.

Referring to this type of network, the results for the network trained with the “Exclude Track” approach are shown in Figure 14. Figure 15 illustrates the comparison between the predictions made by the model using two different approaches. The results presented clearly show that the model provides consistent predictions over time. In fact, the inclusion or exclusion of the track used for prediction in the training dataset does not affect the model’s robust predictive capabilities. This indicates that the model can provide similar results regardless of the inclusion or exclusion of the circuit.

So far, the input to the network has been the longitudinal velocity from the optical sensor. However, in the development of a virtual sensor, it is crucial to ensure that the neural network for the sideslip angle can function with a longitudinal velocity estimated using another neural network as input. This approach enables the creation of a virtual sensor that can operate with two cascading neural networks and replace the optical sensor that was previously utilized. The results of this approach are illustrated in Figure 16. Additionally, Figure 17 displays the estimated longitudinal velocity using a proper feedforward neural network, which was used to feed the sideslip angle neural network.

It is evident from these graphs that since the estimated longitudinal velocity is quite similar to that acquired by the optical sensor, the results of sideslip angle estimation remain excellent.

6. Conclusions

A novel method for estimating vehicle sideslip angles using a neural network-based approach has been introduced. The outcomes of this research consist of a machine learning model where the inputs are typical vehicle dynamic signals: steering angle, lateral and longitudinal acceleration, yaw rate, and wheel speeds. These signals are accessible from the CAN-bus, as they are commonly measured in both motorsport and passenger vehicles.

The analysis was conducted using various neural network architectures. This phase aimed to understand which net design, between feedforward, NARX, and recurrent neural networks, was more suitable for the prediction of the quantity of interest. The results showed that feedforward and NARX give very similar results but feedforward, having no feedback, is much faster. The worst was the recurrent neural network, both for accuracy and training time. These results arise because the feedback signal cannot exhibit significant delays to maintain real-time operational conditions and because the inaccuracies present in some parts of the estimate negatively interfere when going as input by means of the looping mechanism.

The network’s generalization capability was thoroughly tested across various scenarios, including multiple runs on different tracks while taking into account both new and worn tires. This testing approach aimed to assess the model’s adaptability and robustness under various real-world conditions. By examining the network results on various circuits, each with unique characteristics, its ability to make accurate predictions across different track configurations was evaluated. The network demonstrated its ability to consistently estimate the angle of interest across different scenarios, showcasing its adaptability and suitability for various applications. Furthermore, the network was tested on both new and worn tires to account for variations in tire conditions, evaluating the model’s solidity to changing tire properties throughout its life cycle. The results highlighted the network’s remarkable capability to maintain accuracy and reliability, regardless of the tire’s structural and viscoelastic properties, further underlining its real-world applicability. The neural network’s great generalization performance across various circuits and tire conditions reinforces its potential for onboard applications, where it can provide real-time estimations of vehicle dynamics quantities with consistently high precision and reliability. For this purpose, the accurate results obtained by it when cascaded by another neural network that estimates longitudinal velocity are crucial.

The methodology utilized in this study presents several important advantages, with one of the most prominent being its ability to provide accurate, near-instantaneous, real-time estimations of this fundamental physical quantity. The strong agreement between the estimated sideslip angle and the acquired one supports the suitability of these procedures for onboard implementation, rendering the need for bulky and expensive instrumentation obsolete. In summary, the machine learning model utilized in this study has low computational requirements and provides reliable outcomes, making it ideal for use in onboard control systems, such as ABS and TCS.

In order to further enhance the benefits of the presented model, our next steps involve utilizing its outputs to feed vehicle dynamics models that can accurately calculate the slip and forces applied to the tire. These models will integrate with the driver assistance systems to further improve their precision so as to increase safety and performance. However, before implementing this, we plan to thoroughly test the model’s functionality on various types of vehicles, in order to expand its use.

Author Contributions

Conceptualization, M.R. and R.S.; methodology, M.R. and F.T.; software, G.N.D.; validation, G.F. and G.A.; formal analysis, G.A.; investigation, F.T.; resources, G.N.D.; data curation, M.R. and G.F.; writing—original draft preparation, M.R., G.N.D., and R.S.; writing—review and editing, G.F. and R.S.; visualization, G.A.; supervision, F.T.; project administration, G.N.D. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The data presented in this study are not available for confidentiality reason.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Perumal, P.S.; Sujasree, M.; Chavhan, S.; Gupta, D.; Mukthineni, V.; Shimgekar, S.R.; Khanna, A.; Fortino, G. An insight into crash avoidance and overtaking advice systems for Autonomous Vehicles: A review, challenges and solutions. Eng. Appl. Artif. Intell. 2021, 104, 104406. [Google Scholar] [CrossRef]
Yang, S.; Du, M.; Chen, Q. Impact of connected and autonomous vehicles on traffic efficiency and safety of an on-ramp. Simul. Model. Pract. Theory 2021, 113, 102374. [Google Scholar] [CrossRef]
Ibañez-Guzmán, J.; Laugier, C.; Yoder, J.D.; Thrun, S. Autonomous Driving: Context and State-of-the-Art. In Handbook of Intelligent Vehicles; Springer London: London, UK, 2012; pp. 1271–1310. [Google Scholar]
Jin, Y.; Ahn, H.; Kim, K.; Choi, S.; Mueck, M.; Frascolla, V.; Haustein, T. Adaptive automotive communications solutions of 10 years lifetime enabled by ETSI RRS software reconfiguration technology. In Proceedings of the 2017 25th European Signal Processing Conference (EUSIPCO), Kos, Greece, 28 August–2 September 2017; pp. 903–906. [Google Scholar]
Rajamani, R. Vehicle Dynamics and Control, 2nd ed.; Mechanical Engineering Series; Springer: Berlin/Heidelberg, Germany, 2012; p. 498. [Google Scholar]
Santini, S.; Albarella, N.; Arricale, V.M.; Brancati, R.; Sakhnevych, A. On-board road friction estimation technique for autonomous driving vehicle-following maneuvers. Appl. Sci. 2021, 11, 2197. [Google Scholar] [CrossRef]
Lefevre, S.; Laugier, C.; Ibanez-Guzman, J. Risk assessment at road intersections: Comparing intention and expectation. In Proceedings of the 2012 IEEE Intelligent Vehicles Symposium, Alcalá de Henares, Madrid, Spain, 3–7 June 2012; pp. 165–171. [Google Scholar]
Short, M.; Pont, M. Hardware in the loop simulation of embedded automotive control system. In Proceedings of the 2005 IEEE Intelligent Transportation Systems, Vienna, Austria, 16 September 2005; pp. 426–431. [Google Scholar]
Sarhadi, P.; Yousefpour, S. State of the art: Hardware in the loop modeling and simulation with its applications in design, development and implementation of system and control software. Int. J. Dyn. Control 2015, 3, 470–479. [Google Scholar] [CrossRef]
Zhao, P.; Chen, J.; Mei, T.; Liang, H. Dynamic motion planning for autonomous vehicle in unknown environments. In Proceedings of the 2011 IEEE Intelligent Vehicles Symposium (IV), Baden-Baden, Germany, 5–9 June 2011; pp. 284–289. [Google Scholar]
Ono, E.; Hattori, Y.; Muragishi, Y.; Koibuchi, K. Vehicle dynamics integrated control for four-wheel-distributed steering and four-wheel-distributed traction/braking systems. Veh. Syst. Dyn. 2006, 44, 139–151. [Google Scholar] [CrossRef]
Halbach, S.; Sharer, P.; Pagerit, S.; Rousseau, A.P.; Folkerts, C. Model Architecture, Methods, and Interfaces for Efficient Math-Based Design and Simulation of Automotive Control Systems. In Proceedings of the SAE 2010 World Congress & Exhibition, Detroit, MI, USA, 13 April 2010. [Google Scholar]
Sakhnevych, A.; Arricale, V.M.; Bruschetta, M.; Censi, A.; Mion, E.; Picotti, E.; Frazzoli, E. Investigation on the model-based control performance in vehicle safety critical scenarios with varying tyre limits. Sensors 2021, 21, 5372. [Google Scholar] [CrossRef] [PubMed]
Srinivasan, S.; Sa, I.; Zyner, A.; Reijgwart, V.; Valls, M.; Siegwart, R. End-to-End Velocity Estimation For Autonomous Racing. IEEE Robot. Autom. Lett. 2020, 5, 6869–6875. [Google Scholar] [CrossRef]
Wei, S.; Zou, Y.; Zhang, X.; Zhang, T.; Li, X. An integrated longitudinal and lateral vehicle following control system with radar and vehicle-to-vehicle communication. IEEE Trans. Veh. Technol. 2019, 68, 1116–1127. [Google Scholar] [CrossRef]
Guiggiani, M. The Science of Vehicle Dynamics; Springer: Cham, Switzerland, 2023. [Google Scholar]
Genovese, A.; Pastore, S.R. Development of a portable instrument for non-destructive characterization of the polymers viscoelastic properties. Mech. Syst. Signal Process. 2021, 150, 107259. [Google Scholar] [CrossRef]
Genovese, A.; Farroni, F.; Sakhnevych, A. Fractional calculus approach to reproduce material viscoelastic behavior, including the time–temperature superposition phenomenon. Polymers 2022, 14, 4412. [Google Scholar] [CrossRef]
Genovese, A.; Maiorano, A.; Russo, R. A novel methodology for non-destructive characterization of polymers’ viscoelastic properties. Int. J. Appl. Mech. 2022, 14, 2250017. [Google Scholar] [CrossRef]
Giuliacci, T.A.; Ballesio, S.; Fainello, M.; Mair, U.; King, J. Recurrent Neural Network Model for On-Board Estimation of the Side-Slip Angle in a Four-Wheel Drive and Steering Vehicle. SAE Int. J. Passeng. Veh. Syst. 2023, 17, 15-17-01-0003. [Google Scholar] [CrossRef]
Ishak, M.I.; Ogino, H.; Yamamoto, Y. Numerical simulation analysis of an oversteer in-wheel small electric vehicle integrated with four-wheel drive and independent steering. Int. J. Veh. Technol. 2016, 2016, 7235471. [Google Scholar] [CrossRef]
Fu, C.; Hoseinnezhad, R.; Bab-Hadiashar, A.; Jazar, R.N. Electric vehicle side-slip control via electronic differential. Int. J. Veh. Auton. Syst. 2015, 13, 1–26. [Google Scholar] [CrossRef]
Hac, A.; Bedner, E. Robustness of side slip estimation and control algorithms for vehicle chassis control. In Proceedings of the Proceedings of ESV Conference, Lyon, France, 18–21 June 2007. [Google Scholar]
Mosconi, L.; Farroni, F.; Sakhnevych, A.; Timpone, F.; Gerbino, F.S. Adaptive vehicle dynamics state estimator for onboard automotive applications and performance analysis. Veh. Syst. Dyn. 2022, 61, 3244–3268. [Google Scholar] [CrossRef]
Bian, M.; Chen, L.; Luo, Y.; Li, K. A Dynamic Model for Tire/Road Friction Estimation under Combined Longitudinal/Lateral Slip Situation; Technical Report, SAE Technical Paper; SAE: Warrendale, PA, USA, 2014. [Google Scholar]
Shao, L.; Jin, C.; Lex, C.; Eichberger, A. Nonlinear adaptive observer for side slip angle and road friction estimation. In Proceedings of the 2016 IEEE 55th Conference on Decision and Control (CDC), Las Vegas, NV, USA, 12–14 December 2016; IEEE: New York, NY, USA, 2016; pp. 6258–6265. [Google Scholar]
Ping, X.; Cheng, S.; Yue, W.; Du, Y.; Wang, X.; Li, L. Adaptive estimations of tyre–road friction coefficient and body’s sideslip angle based on strong tracking and interactive multiple model theories. Proc. Inst. Mech. Eng. Part D J. Automob. Eng. 2020, 234, 3224–3238. [Google Scholar] [CrossRef]
Ivanov, R. Tire wear modeling. Transp. Probl. 2016, 11, 111–120. [Google Scholar] [CrossRef]
Ziaukas, Z.; Busch, A.; Wielitzka, M. Estimation of Vehicle Side-Slip Angle at Varying Road Friction Coefficients Using a Recurrent Artificial Neural Network. In Proceedings of the 2021 IEEE Conference on Control Technology and Applications (CCTA), San Diego, CA, USA, 9–11 August 2021; IEEE: New York, NY, USA, 2021; pp. 986–991. [Google Scholar]
Chindamo, D.; Lenzo, B.; Gadola, M. On the vehicle sideslip angle estimation: A literature review of methods, models, and innovations. Appl. Sci. 2018, 8, 355. [Google Scholar] [CrossRef]
Zhang, C.; Feng, Y.; Wang, J.; Gao, P.; Qin, P. Vehicle Sideslip Angle Estimation Based on Radial Basis Neural Network and Unscented Kalman Filter Algorithm. Actuators 2023, 12, 371. [Google Scholar] [CrossRef]
Best, M.C.; Gordon, T.; Dixon, P. An extended adaptive Kalman filter for real-time state estimation of vehicle handling dynamics. Veh. Syst. Dyn. 2000, 34, 57–75. [Google Scholar]
Chen, B.C.; Hsieh, F.C. Sideslip angle estimation using extended Kalman filter. Veh. Syst. Dyn. 2008, 46, 353–364. [Google Scholar] [CrossRef]
Xia, X.; Hashemi, E.; Xiong, L.; Khajepour, A. Autonomous vehicle kinematics and dynamics synthesis for sideslip angle estimation based on consensus kalman filter. IEEE Trans. Control Syst. Technol. 2022, 31, 179–192. [Google Scholar] [CrossRef]
Melzi, S.; Sabbioni, E. On the vehicle sideslip angle estimation through neural networks: Numerical and experimental results. Mech. Syst. Signal Process. 2011, 25, 2005–2019. [Google Scholar] [CrossRef]
Lee, J. Measurement of machine performance degradation using a neural network model. Comput. Ind. 1996, 30, 193–209. [Google Scholar] [CrossRef]
Zhai, Y.J.; Yu, D.L. Neural network model-based automotive engine air/fuel ratio control and robustness evaluation. Eng. Appl. Artif. Intell. 2009, 22, 171–180. [Google Scholar] [CrossRef]
Meijer, G.C. Smart Sensor Systems; John Wiley & Sons, Ltd.: Chichester, UK, 2008. [Google Scholar]
Song, R.; Fang, Y. Estimation of Vehicle Sideslip Angle based on Modified Sliding Mode Observer and Recurrent Neural Network. In Proceedings of the 2022 7th Asia-Pacific Conference on Intelligent Robot Systems (ACIRS), Tianjin, China, 1–3 July 2022; IEEE: New York, NY, USA, 2022; pp. 135–139. [Google Scholar]
Demuth, H.B.; Beale, M.H.; De Jess, O.; Hagan, M.T. Neural Network Design; Martin Hagan: Stillwater, OK, USA, 2014. [Google Scholar]
Wielitzka, M.; Busch, A.; Dagen, M.; Ortmaier, T.; Serra, G. Unscented kalman filter for state and parameter estimation in vehicle dynamics. In Kalman Filters-Theory for Advanced Applications; InTech: Rijeka, Croatia, 2018; pp. 56–75. [Google Scholar]
Napolitano Dell’Annunziata, G.; Arricale, V.M.; Farroni, F.; Genovese, A.; Pasquino, N.; Tranquillo, G. Estimation of Vehicle Longitudinal Velocity with Artificial Neural Network. Sensors 2022, 22, 9516. [Google Scholar] [CrossRef] [PubMed]
Li, Z.; Wang, D.; Kang, Q. The Development of Data Acquisition System of Formula SAE Race Car Based on CAN Bus Communication Interface and Closed-Loop Design of Racing Car. Wirel. Commun. Mob. Comput. 2021, 2021, 4211010. [Google Scholar] [CrossRef]
Johansson, K.H.; Törngren, M.; Nielsen, L. Vehicle applications of controller area network. In Handbook of Networked and Embedded Control Systems; Springer: Berlin/Heidelberg, Germany, 2005; pp. 741–765. [Google Scholar]
HPL, S.C. Introduction to the controller area network (CAN). In Application Report SLOA101; Texas Instruments: Dallas, TX, USA, 2002; pp. 1–17. [Google Scholar]
Walczak, S.; Cerpa, N. Artificial Neural Networks. In Encyclopedia of Physical Science and Technology, 3rd ed.; Meyers, R.A., Ed.; Academic Press: Cambridge, MA, USA, 2003; pp. 631–645. [Google Scholar]
Mandic, D.; Chambers, J. Recurrent Neural Networks for Prediction: Learning Algorithms, Architectures and Stability; John Wiley & Sons Inc.: Hoboken, NJ, USA, 2001. [Google Scholar]
Korstanje, J. Advanced Forecasting with Python; Apress: Berkeley, CA, USA, 2021; p. 296. [Google Scholar]
Hecht-Nielsen, R. Theory of the backpropagation neural network. In Neural Networks for Perception; Elsevier: Amsterdam, The Netherlands, 1992; pp. 65–93. [Google Scholar]
McClarren, R.G. Machine Learning for Engineers; Springer: Cham, Switzerland, 2021; p. 247. [Google Scholar]
Yu, Y.; Si, X.; Hu, C.; Zhang, J. A review of recurrent neural networks: LSTM cells and network architectures. Neural Comput. 2019, 31, 1235–1270. [Google Scholar] [CrossRef]
Liu, J.; Wang, Z.; Zhang, L.; Walker, P. Sideslip Angle Estimation of Ground Vehicles: A Comparative Study. IET Control Theory Appl. 2021, 14, 3490–3505. [Google Scholar] [CrossRef]
Diaconescu, E. The use of NARX neural networks to predict chaotic time series. Wseas Trans. Comput. Res. 2008, 3, 182–191. [Google Scholar]
Heaton, J. Introduction to Neural Networks with Java; Heaton Research: St. Louis, MO, USA, 2005. [Google Scholar]
Yaro, A.; Maly, F.; Pražák, P. Outlier Detection in Time-Series Receive Signal Strength Observation Using Z-Score Method with Sn Scale Estimator for Indoor Localization. Appl. Sci. 2023, 13, 3900. [Google Scholar] [CrossRef]
Sola, J.; Sevilla, J. Importance of input data normalization for the application of neural networks to complex industrial problems. IEEE Trans. Nucl. Sci. 1997, 44, 1464–1468. [Google Scholar] [CrossRef]
James, G.; Witten, D.; Hastie, T.; Tibshirani, R. An Introduction to Statistical Learning, 1st ed.; Springer Texts in Statistics; Springer: New York, NY, USA, 2013; p. 426. [Google Scholar]

Figure 1. Neural network input channels.

Figure 2. Equipped sensors. (a) Inertial platform. (b) Optical sensor.

Figure 3. Neural network neuron scheme.

Figure 4. Feedforward NN layout. The numbers in the figure identify the amount of inputs, outputs or neurons in the layers, W represents the weights, and b the bias.

Figure 5. NARX NN layout. The numbers in the figure identify the amount of inputs, outputs or neurons in the layers, W represents the weights, b the bias. The 1 in the clock represents the number of past outputs considered.

Figure 6. Recurrent NN layout. The numbers in the figure identify the amount of inputs, outputs or neurons in the layers, W represents the weights, b the bias. The 1 in the clock represents the number of past outputs considered.

Figure 7. Filtered vs. no-filtered channels.

Figure 8. FF estimated

β

for track “A”—all tracks approach.

Figure 8. FF estimated

β

for track “A”—all tracks approach.

Figure 9. FF estimated

β

for track “B”—all tracks approach.

Figure 9. FF estimated

β

for track “B”—all tracks approach.

Figure 10. FF training performance results—all tracks approach.

Figure 11. FF training regression results—all tracks approach.

Figure 12. NARX estimated

β

for track “A”—all tracks approach.

Figure 12. NARX estimated

β

for track “A”—all tracks approach.

Figure 13. RNN estimated

β

for track “A”—all tracks approach.

Figure 13. RNN estimated

β

for track “A”—all tracks approach.

Figure 14. FF estimated

β

for track “A”—exclude track approach.

Figure 14. FF estimated

β

for track “A”—exclude track approach.

Figure 15. All tracks vs. exclude track comparison.

Figure 16. FF estimated

β

with FF estimated

V_{x}

.

Figure 16. FF estimated

β

with FF estimated

V_{x}

.

Figure 17. FF estimated

V_{x}

for track “A”.

Figure 17. FF estimated

V_{x}

for track “A”.

Table 1. Input channels for the neural network.

Input Channels
Input Channels	Unit of Measure	Nomenclature
Longitudinal acceleration	${m / s}^{2}$	$a_{x}$
Lateral acceleration	${m / s}^{2}$	$a_{y}$
Steering angle	deg	$δ$
Yaw rate	deg/s	r
Wheels speed for each corner	rpm	$ω_{i j}$
Vehicle longitudinal velocity	m/s	$v_{x}$

Table 2. NN input channels and their acquisition devices.

Sensors for Input Channels
Input Channels	Acquisition Devices
Longitudinal acceleration	Inertial platform
Lateral acceleration	Inertial platform
Steering angle	CAN-bus
Yaw rate	Inertial platform
Wheels speed for each corner	CAN-bus
Vehicle longitudinal velocity	Optical sensor

Table 3. Neural network settings.

Neural Network Settings
Info	Setting
Training Algorithms	Levenberg–Marquadrat
Number of Hidden Layers	2
Number of Epochs	200
Number of Validation Checks	20
Dataset Division Technique	Divideblock (70/20/10)
Activation Function	Hyperbolic Tangent

Table 4. Results of

β

estimations on track “A”—first run.

Table 4. Results of

β

estimations on track “A”—first run.

$β$ Estimations on Track “A”—First Run
Neural Network	$R^{2}$	RMSE
Feedforward	0.8694	0.4212
Narx	0.8213	0.4870
Recurrent	0.7434	0.5836

Table 5. Results of

β

estimations on track “A”—second run.

Table 5. Results of

β

estimations on track “A”—second run.

$β$ Estimations on Track “A”—Second Run
Neural Network	$R^{2}$	RMSE
Feedforward	0.8543	0.4434
Narx	0.8519	0.4470
Recurrent	0.8056	0.5010

Table 6. Results of

β

estimations on track “B”—first run.

Table 6. Results of

β

estimations on track “B”—first run.

$β$ Estimations on Track “B”—First Run
Neural Network	$R^{2}$	RMSE
Feedforward	0.9063	0.4948
Narx	0.8635	0.5168
Recurrent	0.8298	0.6023

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Napolitano Dell’Annunziata, G.; Ruffini, M.; Stefanelli, R.; Adiletta, G.; Fichera, G.; Timpone, F. Four-Wheeled Vehicle Sideslip Angle Estimation: A Machine Learning-Based Technique for Real-Time Virtual Sensor Development. Appl. Sci. 2024, 14, 1036. https://doi.org/10.3390/app14031036

AMA Style

Napolitano Dell’Annunziata G, Ruffini M, Stefanelli R, Adiletta G, Fichera G, Timpone F. Four-Wheeled Vehicle Sideslip Angle Estimation: A Machine Learning-Based Technique for Real-Time Virtual Sensor Development. Applied Sciences. 2024; 14(3):1036. https://doi.org/10.3390/app14031036

Chicago/Turabian Style

Napolitano Dell’Annunziata, Guido, Marco Ruffini, Raffaele Stefanelli, Giovanni Adiletta, Gabriele Fichera, and Francesco Timpone. 2024. "Four-Wheeled Vehicle Sideslip Angle Estimation: A Machine Learning-Based Technique for Real-Time Virtual Sensor Development" Applied Sciences 14, no. 3: 1036. https://doi.org/10.3390/app14031036

APA Style

Napolitano Dell’Annunziata, G., Ruffini, M., Stefanelli, R., Adiletta, G., Fichera, G., & Timpone, F. (2024). Four-Wheeled Vehicle Sideslip Angle Estimation: A Machine Learning-Based Technique for Real-Time Virtual Sensor Development. Applied Sciences, 14(3), 1036. https://doi.org/10.3390/app14031036

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Four-Wheeled Vehicle Sideslip Angle Estimation: A Machine Learning-Based Technique for Real-Time Virtual Sensor Development

Abstract

1. Introduction

2. Sensors and Test Methods

2.1. Input Channels

2.2. Equipped Sensors

3. Artificial Neural Networks

3.1. Artificial Neural Network Structure

3.2. Artificial Neural Network Training

3.3. Overfitting Phenomenon

3.4. Neural Network Architectures

3.5. Neural Network Hyperparameters

3.6. Model Evaluation

3.7. Analyzed Approaches

4. Data Preparation for Neural Networks

4.1. Data Cleaning

4.2. Feature Selection

4.3. Data Filtering

4.4. Static and Dynamic Offset Detection

4.5. Outliers Removal

4.6. Data Standardization and Transformation

5. Results

Experimentation Results

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI