Braking Friction Coefficient Prediction Using PSO–GRU Algorithm Based on Braking Dynamometer Testing

Wang, Shuwen; Yu, Yang; Liu, Shuangxia; Barton, David

doi:10.3390/lubricants12060195

Open AccessArticle

Braking Friction Coefficient Prediction Using PSO–GRU Algorithm Based on Braking Dynamometer Testing

by

Shuwen Wang

^1,*

,

Yang Yu

¹,

Shuangxia Liu

¹ and

David Barton

²

¹

College of Mechanical Engineering, University of Shanghai for Science and Technology, Shanghai 200093, China

²

School of Mechanical Engineering, University of Leeds, Leeds LS2 9JT, UK

^*

Author to whom correspondence should be addressed.

Lubricants 2024, 12(6), 195; https://doi.org/10.3390/lubricants12060195

Submission received: 21 April 2024 / Revised: 21 May 2024 / Accepted: 27 May 2024 / Published: 29 May 2024

(This article belongs to the Special Issue Tribology in Vehicles)

Download

Browse Figures

Versions Notes

Abstract

:

The coefficients of friction (COFs) is one of the most important parameters used to evaluate the braking performance of a friction brake. Many indicators that affect the safety and comfort of automobiles are associated with brake COFs. The manufacturers of friction brakes and their components are required to spend huge amounts of time and money to carry out experimental tests to ensure the COFs of a newly developed braking system meet the required standards. In order to save time and costs for the development of new friction brake applications, the GRU (Gate Recurrent Unit) algorithm optimized by the improved PSO (particle swarm optimization) global optimization method is employed in this work to predict brake COFs based on existing experimental data obtained from friction braking dynamometer tests. Compared with the LSTM (Long Short-Term Memory) method, the GRU algorithm optimized by PSO avoids the accuracy reduction problem caused by gradient descent in the training process and hence reduces the prediction error and computational cost. The combined PSO–GRU algorithm increases the coefficient of determination (R²) of the prediction by 4.7%, reduces the MAE (mean absolute error) by 14.3%, and increases the prediction speed by 40.1% compared with the standalone GRU method. The prediction method based on machine learning proposed in this study can not only be applied to the prediction of automobile braking COFs but also for other frictional system problems, such as the prediction of braking noise and the friction of various bearing transmission components.

Keywords:

friction; machine learning; RNN; optimization; PSO; disk brake

1. Introduction

Friction braking systems are critical to the safety, handling stability, comfort, and performance of all automobiles, including electric vehicles, which, despite the use of regenerative braking, are still required to carry friction brakes. The coefficients of friction (COFs) at the brake sliding interface determine the quality of the braking performance of a vehicle and play an important role in the generation of braking noise and vibrations. Since the requirements of braking COFs imposed by an automobile’s original equipment manufacturers (OEMs) are very strict, braking system manufacturers need to spend a lot of human resources, testing, and financial resources to ensure that the braking COFs meet the requirements of the OEMs. If a vehicle is running at a high speed and the braking COFs are too small, the braking forces could be insufficient to decelerate the vehicle safely, which may cause serious traffic accidents. On the other hand, if the braking COFs are too large, the road wheels of the vehicle could become locked during braking operation, causing the vehicle to flick its tail, slip sideways, or even roll over. High COFs are also associated with increased noise and vibrations from a friction brake, which can lead to high warranty costs for the vehicle manufacturer. Therefore, if the braking COFs can be quickly and accurately predicted and the braking noise can be assessed according to the predicted COFs, the time spent on the development of new braking systems can be shortened, which can significantly improve the economic efficiency and profitability of brake manufacturers.

Although the generation mechanism and control techniques of frictional braking noise and vibrations have been studied for nearly one hundred years, the accurate prediction and control of friction-induced vibrations and noise have always been a challenge in the design of braking systems [1,2] due to the lack of the knowledge of real-time dynamic friction between the brake disk and pads during braking. Therefore, the prediction of friction in braking has attracted many researchers since the 1930s. Khairnar et al. [3] computed the COFs for symmetric and asymmetric drum brake shoes. The extracted COFs were used in an antilock braking system (ABS) algorithm to calculate the brake torque. Riva et al. [4] used a finite element analysis (FEA) approach combined with a coefficient of friction (COF) p-v map to compute the global COFs of a disk brake system. The local COFs were determined from a p-v map for each local sliding velocity and the contact pressure was determined by the FEA. Based on the local COFs, the braking force of the entire brake system and the global COFs can be evaluated. Meng et al. [5] reviewed a large number of peer-reviewed papers and concluded that COFs were critical to many areas, such as lubrication, wear, surface engineering, etc. In addition, COFs play a very important role in the theories or mechanisms of the generation of frictional vibrations and noise, e.g., in the stick–slip theory [6] and self-lock–slip theory [7]. A large number of studies have proved that the COFs have a significant effect on the generation of frictional vibrations and noise [8,9,10]. Jarvis et al. [9] believed that the COFs can be used as a good indicator of the tendency of braking friction materials to produce noise. Nishiwaki et al. [10] developed a theoretical model to study drum and disk brake noise and concluded that the braking noise was caused by the dynamic instability of the braking system and the transient change of COFs.

In recent years, due to the rapid development of machine learning (ML) theory and artificial intelligence (AI) technology, more and more ML algorithms have been applied in regression and classification prediction applications. Zhang et al. [11] employed the LSTM (Long Short-term Memory) algorithm to predict operation conditions of industrial IoT equipment and Jiang et al. [12] applied it to predict the health evolution trends of an aero-engine. Zhang et al. [13] combined CNN (Convolutional Neural Networks) and SVM (Support Vector Machine) for fault diagnosis of braking friction of mechanical equipment. Zhang et al. [14] used the CNN algorithm for feature extraction and combined it with the GRU algorithm to predict the uneven wear state of the friction block. Yang et al. [15] applied deep RNN (recurrent neural network) to the dynamic state estimation of the advanced brake system of electric vehicles. Sabanvoic et al. [16] used the lightweight squeeze net deep neural network model for the identification and classification of road-surface types so as to identify road pavement surfaces with different COFs. Stender et al. [17] used Convolutional Neural Networks (CNN) based on machine learning to detect vibrations combined with the RNN algorithm to predict braking noise. Wang et al. [18] used the LSTM algorithm and the optimized XGBoost algorithm to predict the braking COFs and braking noise, and the prediction results were in good agreement with the experimental results. Alexsendric and Barton [19] used artificial neural networks (ANN) to predict the COFs of a disk brake system for different operating conditions taking account of the composition of the friction material. The Bayesian regulation learning algorithm was found to give the best fit to the experimental data. Alexsendric et al. [20] also used ANN techniques coupled with the Bayesian regulation learning algorithm to investigate brake fade and recovery following high-temperature brake operation as a function of the composition and manufacturing process of the brake pads.

The main research goal of this present work is to use the GRU algorithm combined with particle swarm optimization (PSO) to enable the rapid and more accurate prediction of brake COFs so as to realize the fast design of brakes, shorten the experimental period, reduce the design cost, effectively control the quality, and improve the efficiency of braking system development. The overall concept is that if an experimental data set is available over a limited range of braking conditions, the model can be trained to use this data to allow for the prediction of COFs for different conditions, albeit these conditions must be the same rotor/pad combination. Thus, if a manufacturer wants to modify the test conditions for a new application, there is no need to repeat the experimental test program, which is expensive and time-consuming, because reliable predictions of COFs can be obtained from the model.

The overall methodology proposed can be divided into the following steps: first, collecting experimental data via the Link3900 brake dynamometer testing; second, data feature engineering that includes dealing with the missing values and outliers of the experimental data, investigating the correlation between one feature and another and between features and targets, dividing all data sets into training sets and test sets; and finally, using the optimized GRU algorithm to predict the braking COFs. Figure 1 illustrates the research procedure adopted for this study, which also reflects the structure of the paper.

2. Braking Dynamometer Testing and Typical Results

The Link3900 brake performance test bench produced by LINK company (Shanghai, China) was employed for the experimental tests. The test bench is a standard testing machine for testing the performance of various vehicle brakes, as shown in Figure 2. Tests were carried out according to the SAE-J2521 standard [21], which defines a set of experimental procedures recognized by the industry to simulate the braking process of a vehicle. The SAE-J2521 standard braking test procedure consists mainly of the following basic brake conditions: Snub Brake, Brake, Deceleration Brake, Cold Brake, and Fade Brake, which includes a total of 2321 braking stops in 31 different test modules with various braking conditions.

In this present work, nearly 1000 separate items of COF data were obtained from braking tests of one particular friction pair consisting of a grey cast iron brake disk with laser-machined 96 M-shaped grooves on each of its frictional surfaces and standard NAO (non-asbestos organic) friction pads. The COFs obtained under various braking conditions were used for the training and validation of the prediction of braking COFs. More detailed information about the brake materials and the setup of the braking dynamometer testing can be found in Ref. [18]. Table 1 demonstrates typical experimental data obtained from just 11 out of the approximately 1000 braking dynamometer tests. In these 11 tests, the main parameter varied was the average deceleration (Avg Decel) of the simulated braking event, which was from the shown initial speed to a brake release speed of 30 kph.

3. Feature Engineering

3.1. Data Cleaning

To enable the accurate prediction of braking COFs, two data cleaning methods are used in this study, namely Min–Max scaling [22] and Z-Score standardization [23], as the input and output ranges of the nonlinear activation functions used in the GRU neural network model need to be defined when predicting the experimental data. The two kinds of data cleaning processing are described below.

3.1.1. Min–Max Scaling

Min–Max scaling, also known as deviation normalization, is a linear transformation of the original data. The original value x is normalized by Min–Max to give a standardized value in the interval [0,1], and its calculation formula is shown in Equation (1):

X' = \frac{x - x_{\min}}{x_{\max} - x_{\min}}

(1)

where

x_{\max}

is the maximum value of all the data samples,

x_{\min}

is the minimum value of all samples, and

X^{'}

is the standardized value.

3.1.2. Z-Score Normalization

The Z-Score normalization aims to reduce the amount of calculation and improve the efficiency of the model. Z-Score normalization, also known as standard score, is the difference between a data sample and the average value of all data μ divided by the standard deviation σ of the data set. In most cases, by centralizing and standardizing the data, we will obtain the data in the form of the standard normal distribution (where the mean value is 0 and the standard deviation is 1). The calculation formula is shown in Equation (2):

x' = \frac{x - μ}{σ}

(2)

where μ refers to the mean of all samples, σ refers to the standard deviation of all samples, and

x^{'}

is the normalized result.

3.1.3. Outlier and Missing Value Treatment

In many cases, the original data obtained have outliers and/or missing values; if the abnormal and missing data are not processed, the machine learning algorithm will not be able to work. In this study, the Fillan method programmed in the Python (3.9.16) language is used to automatically detect and fill in the missing values in the data set.

Figure 3 presents the box plot of the experimental data, where the input features of BSpe, RSpe, ADec, ATor, MTor, APre, MPre, ACof, ITem, and FTem represent the initial braking speed, release speed, average deceleration, average torque, max torque, average pressure, max pressure, average COF, initial braking temperature, and final braking temperature, respectively. The main function of the box plots is to check whether there are outliers in the data. If the data appear outside the box, it means that the data are outliers; otherwise, they are normal data.

It can be seen from Figure 3 that there are several abnormal data (circles out of the boxes in Figure 3) in the chosen features of release speed, average pressure, and max pressure, which were eliminated from the data sets because they are clearly outliers. No abnormal data were found for other features. The averaged COF data or circles out of the box were due to the various braking conditions, e.g., the various initial braking temperatures used in the dynamometer testing, and all these ACof data were used in the modeling training.

3.2. Investigating the Relationships between Features

In statistics, there are many ways to measure the degree of correlation between two variables. In this study, the correlation between the features chosen to represent the bench experimental data is evaluated by two methods, namely Pearson product moment correlation coefficient (PPMCC) method [24] and the Maximal Information Coefficient (MIC) method [25].

3.2.1. Pearson Correlation Coefficient

The Pearson product moment correlation coefficient (PPMCC) is widely used in the field of data science. PPMCC is defined as the quotient of covariance and standard deviation between two data sets, which can be calculated using Equation (3):

r = \frac{\sum_{i = 1}^{n} (x_{i} - x) (y_{i} - y)}{\sqrt{\sum_{i = 1}^{n} {(x_{i} - \bar{x})}^{2}} \sqrt{\sum_{i = 1}^{n} {(y_{i} - \bar{y})}^{2}}}

(3)

where

\bar{x}, \bar{y}

is the average value of x and y for n experimental data, respectively. The PPMCC can range from −1 to +1, with values closer to +1 indicating a stronger positive linear correlation. Conversely, the closer the value is to −1, the stronger the negative linear correlation.

The PPMCC values between the ten input features and the output feature of ACof were calculated in turn, and the correlation values are shown in Figure 4. For example, the value of −0.48 in this figure indicates that there is a negative correlation between the initial temperature and the average COF, that is, the higher the initial temperature, the lower the average COF. This is consistent with the known inverse relationship between COF and rotor temperature.

3.2.2. Maximal Information Coefficient

The Maximal Information Coefficient (MIC) is used to measure the linear and nonlinear relationships between variables, as well as the non-functional dependence for effective measurement. Let the data set D be a finite ordered data set, which is divided into a grid G, and let

D |_{G}

denote the probability distribution of the data set D on the grid G, then the MIC between the variables X and Y may be calculated by Equation (4):

M I C (D) = \max_{x y \leq B (n)} \frac{I^{*} (D, x, y)}{\log_{2} \min {x, y}}

(4)

where

I^{*} (D, x, y) = \max I (D |_{G})

is the MIC under different grid division; i.e., for variables on the set D, the X-axis is divided into X grids, and the Y-axis is divided into Y grids of all possible grid G;

I (D |_{G})

is the mutual information given the probability distribution

D |_{G}

; B is a monotone increasing function and satisfies

B (n) \leq O (n^{1 - ε})

and

0 < ε < 1

. In this work,

B (n) = n^{0.6}

is chosen according to reference.

The MIC can range from 0 to 1, with larger values indicating stronger correlations. Conversely, the smaller the absolute value, the weaker the correlation. The MIC values between the ten input features and the output feature of ACof were calculated in turn, and the correlation values are shown in Figure 5. It can be seen from Figure 5 that the maximum MIC value is 0.52 between features ITem and ACof, indicating that the initial temperature of the disk is the most important factor affecting the braking COFs.

3.3. Selection of Features

Experimental data feature engineering is used to mine the bench experimental data set and investigate internal relationships between the data parameters. Here, it is used to more deeply understand the features affecting braking COFs and remove unnecessary features in order to provide concise and effective model characteristic parameters for the development of a braking COF prediction model. Based on the above PPMCC and MIC analysis results, as shown in Figure 4 and Figure 5, four input features, namely the initial braking speed (BSpe), the brake release speed (RSpe), the average hydraulic pressure (APre), and the initial rotor temperature prior to braking (ITem) were selected to be the most important in order to develop the model for predicting the output feature of the average measured COF (ACof).

4. Prediction Algorithms

The RNN (recurrent neural network) is a kind of recurrent neural network that takes sequence data as the input, re-curses all nodes (re-circulating unit) in the evolution direction of the sequence, and connects them in chains. In deep learning, the RNN is particularly suitable for processing and predicting a class of neural network models related to serial data. Figure 6 demonstrates the structure of a typical RNN cell:

The LSTM (Long Short-Term Memory) and GRU (Gate Recurrent Unit) are two good variants of RNN, which can selectively add or reduce information and can effectively alleviate the problem of RNN gradient disappearance or gradient explosion [26]. In RNN, the gradient is calculated by a multiplication of time steps, which means that the gradient is obtained by the continuous multiplication of multiple identical or similar matrices, such as the cyclic weight matrix. If the values of these matrices are less than 1, the multiplication causes the gradient to decrease exponentially, causing the gradient to disappear. The gradient explosion is similar except, if the values of the matrices are greater than 1, the gradient increases exponentially to very large values.

4.1. LSTM Method

The LSTM model adds input gates, output gates, and forget gates to neurons in each layer corresponding to time points to realize the selective memory of neurons. Its structural principle is shown in Figure 7 (the red X represents multiplication and the blue + represents addition, indicating that the two functions are multiplied and added, respectively), which consists of forget gates, input gates, and output gates. The functions of these three gates are as follows:

Forget gates are used to decide what information and structure to discard from the “cell”. This layer reads the current input

x_{t}

and the pre-neuron information

h_{t - 1}

, and it is up to

f_{t}

to decide the discarded information.

Input gates are used to determine the new information stored in the cell state to determine the value

i_{t}

to be updated; the tanh layer is used to create a new vector of candidate values

{\tilde{C}}_{t}

to add to the state.

Output gates are used to determine the value of the next hidden state, which contains the information of the previous input.

4.2. GRU Method

GRU is a new generation of recurrent neural networks. Similar to LSTM, GRU removes cell states and uses hidden states for information transfer. It contains only two gates: the reset gate and the update gate, which are explained as follows:

(1): The reset gate determines how much previous information is forgotten and how new input information is combined with the previous memory, and it uses the current input information to make the hidden state forget any information that is found to be irrelevant to the prediction in the future. It also allows for the construction of more interdependent features. Essentially, the reset gate determines how much of the past data should be forgotten.
(2): The update gate acts similarly to the forget gate and input gate in LSTM. It decides what information to forget and what new information needs to be added. Controlling how much information from previous hidden states gets passed to the current hidden state is very similar to the memory cells in LSTM networks. It helps RNNs to remember long-term information and decide whether to copy all information from the past to reduce the risk of vanishing gradients.

GRU is a very popular network because its structure is simpler and its characteristics are better than the LSTM. Therefore, this study utilizes the GRU algorithm to predict braking COFs.

Gated Computation for GRU

The structure of a GRU neural network is shown in Figure 8, where the red X represents multiplication and the blue + represents addition, indicating that the two functions are multiplied and added, respectively. Compared with the traditional recurrent neural networks, the advantage of the GRU neural network is that it only contains two gates, namely the reset gate and update gate, and the detailed operation of both of which are described below.

(1): Update gate.

At time step t, we first need to compute the update gate

Z_{t}

using the following formula:

Z_{t} = σ (ω_{Z} \cdot [h_{t}, x_{t}])

(5)

where

x_{t}

is the input vector at the t-th time step, that is, the t-th component of the input sequence x, which undergoes a linear transformation when multiplied by the weight matrix

ω_{Z}

. The

h_{t}

vector holds the information from the previous time step t, which also undergoes the linear transformation. The update gate adds these two pieces of information and feeds them into the Sigmoid activation function, thus compressing the activation result to be between 0 and 1.

(2): Reset gate.

Essentially, the reset gate determines how much of the past information needs to be forgotten, which can be calculated using the following expression:

r_{t} = σ (ω_{Z} \cdot [h_{t - 1}, x_{t}])

(6)

This expression is the same as the expression for the update gate, but the parameters of the linear transformation and its use are different. The

h_{t - 1}

holds the information from the previous time step t − 1, which also undergoes the linear transformation.

(3): Current memory content.

Now let us discuss in detail how exactly these gates affect the final output. In the use of reset gates, the new memory content will use the reset gate to store relevant information from the past to calculate the new data:

h_{t} = \tanh (ω \cdot [r_{t} \times h_{t - 1}, x_{t}])

(7)

The input

x_{t}

and the previous time information are first subjected to a linear transformation by the right multiplication matrix

w

. Then, the Hadamard product of the reset gate

r_{t}

with

h_{t - 1}

is computed. Because the reset gate computed earlier is a vector of 0 to 1 values, it will measure the magnitude of the gate opening. For example, if the gating value for an element is 0, it means that the information for that element has been completely forgotten. The Hadamard product will determine the previous information to retain and forget. The results of these two calculations are added together and then put into the hyperbolic tangent activation function.

(4): The final memory of the current time step.

In the final step, the network needs to compute the

h_{t}

vector, retain the information from the current cell and pass it on to the next cell. In this process, it needs to use the update gate, which determines what information needs to be collected in the current memory content

h_{t}

and the previous time step

h_{t - 1}

. This process can be expressed as:

h_{t} = (1 - z_{t}) \times h_{t - 1} + z_{t} \times \tilde{h_{t}}

(8)

where

z_{t}

is the activation result of the update gate, which also controls the inflow of information in the form of gate control. The Hadamard product of

z_{t}

and

h_{t - 1}

represents the information retained in the final memory at the previous time, which, together with the information retained by the current memory in the final memory, equals the output of the final gated loop unit.

4.3. Model Evaluation Index

4.3.1. Correlation Index R²

R^{2}

is the determining coefficient, also known as the correlation index, which represents the degree of fit of the regression equation to the data set as a means of measuring the reliability of the prediction of the regression equation. The closer the value is to 1, the closer the actual prediction point is to the goal line, the better the degree of fit of the model to the data, and therefore, the more accurate the prediction of the model. Equation (9) can be used to calculate the

R^{2}

value ranges from 0 to 1:

R^{2} = 1 - \frac{R S S}{T S S} = 1 - \frac{{\sum_{i = 1}^{n} (y_{i} - \hat{y_{i}})}^{2}}{\sum_{i = 1}^{n} {(y_{i} - \bar{y_{i}})}^{2}}

(9)

where RSS is the sum of residual squares, namely:

R S S = {\sum_{i = 1}^{n} (y_{i} - \hat{y_{i}})}^{2}

(10)

And TSS is the sum of the squares of the total deviation, i.e.:

T S S = \sum_{i = 1}^{n} {(y_{i} - \bar{y_{i}})}^{2}

(11)

4.3.2. MAE Index

The mean absolute error (MAE), also known as mean absolute deviation, measures the absolute difference between the real value and the predicted value. The smaller it is, the more accurate the prediction is. MAE is the deviation between all the observed values and the arithmetic mean value, which is obtained by averaging the absolute values. The calculation formula is as follows:

M A E = \frac{1}{n} \sum_{i = 1}^{n} |h (x_{i}) - y_{i}|

(12)

where

h (x_{i})

is the predicted value and

y_{i}

is the measured value.

The MAE avoids the positive and negative errors canceling each other, so it can more accurately reflect the magnitude of the prediction error than the correlation index.

5. Prediction Results

5.1. Model Comparison

The train_test_split function of the sklearn routine within the Python library is used to split the whole dataset into a training set, a validation set, and a test set, accounting for 60%, 20%, and 20% of the available dynamometer data, respectively. The training set is used for model training, the validation set is used for preliminary evaluation of whether the model training results meet the requirements, and the test set is used for model prediction. The final evaluation index of the model was evaluated by comparing the final prediction results with the values of the test set. Then, the pytorch, numpy, and other routines within the Python library are used to define the LSTM and GRU algorithm models, respectively. The model is trained by inputting data representing the independent input features and target output features from the training data set. Predictions are then made using the validation data set composed of a new set of test variables in order to evaluate the model performance and adjust the hyperparameters. The test set data is retained as unique data for the final evaluation of the optimized model performance.

Figure 9 represents the MAE and the R² evaluations for the GRU and LSTM predictions for the validation data set, respectively. It is observed from Figure 9 that the MAE of GRU is smaller, while the R² value of GRU is larger. The closer the MAE is to 0 and the closer the R² value is to 1, the more accurate the prediction. Therefore, the GRU model was selected in preference to the LSTM to further develop the prediction model for the braking COF.

5.2. Predictions of the GRU Model

Table 2 shows the names and default values of the parameters required for the GRU model. The number of input features has already been set to 4 (BSpe, RSpe, APre, ITem), and the number of output features is 1 (ACof). Based on these parameters, the COF predicted by the GRU algorithm are compared with the validation set of measured data from the bench tests, as shown in Figure 10. It can be seen that, although the general trend of the measured data is well predicted, there are some large discrepancies approaching a maximum of 10% between the predicted and measured COF values. Hence, it was decided to implement the PSO method in order to further optimize the GRU algorithm.

5.3. Predictions of the GRU Model Optimized with the PSO Algorithm

The particle swarm optimization (PSO) algorithm is a global optimization algorithm based on the bionic study of bird foraging behavior in nature. It considers each possible solution in the global variable as a particle that has its own direction and speed so that all particles move towards the local optimal position. By constantly updating the local optimal position

x b e s t

and the global optimal position

P b e s t

of the particle, an optimal solution of the objective function can be obtained.

The hyperparameters of the GRU neural network are classified as the attributes of the particle, and the fitness of the particle is evaluated. Through continuous iteration, the optimal particle is updated. Finally, the optimal particle with the best fitness is obtained to give the required optimal hyperparameters. Figure 11 illustrates the optimization process for the braking COF prediction model using the combined PSO–GRU algorithm.

In summary, the PSO is used to optimize the hyperparameters of the GRU algorithm to find the best hyperparameters, and the best hyperparameters are then employed in the GRU algorithm to predict the braking COFs. If the acceptance criterion of R² > 0.9 is not achieved in the predicted values, the PSO optimization is repeated and the GRU algorithm rerun until the required level of accuracy has been achieved.

The value of the loss function (loss) is a measure of the discrepancy between the predicted value and the measured value. Figure 12 shows that the loss curve of the PSO–GRU model converges faster to a zero value than that of the GRU model used in isolation.

Figure 13 presents the measured COFs and the predicted results from the combined PSO–GRU model for the validation data set. It is obvious from the comparison of Figure 10 and Figure 13 that the PSO–GRU algorithm exhibits superior prediction performance compared to the GRU-only algorithm.

5.4. Model Comparison

The same experimental data were input into the GRU algorithm and PSO–GRU algorithm, and the model evaluation indicators (R², MAE, training time) obtained from the GRU algorithm for the validation data set are compared with those from the combined PSO–GRU algorithm in Table 3. Compared with the GRU algorithm before optimization, the R² of the combined PSO–GRU algorithm is increased by 4.7%, the MAE is reduced by 14.3%, and the prediction speed is increased by 40.1%.

6. Conclusions

The rapid and effective prediction of COFs is of great significance for the study of braking performance and frictional noise, and also to further understand the factors controlling friction. One of the biggest challenges to this goal is the nonlinearity of the COF, which is affected by a large number of factors.

In this study, a GRU neural network combined with an improved PSO algorithm has been successfully used to solve the complex nonlinear problem of predicting braking COFs from a limited set of experimental data. The PSO algorithm has been used to improve the convergence rate and prediction accuracy of the original GRU neural network. The PSO–GRU algorithm increased the R² of the prediction by 4.7%, reduced the MAE by 14.3%, and increased the prediction speed by 40.1%. It is clear from the results that, after applying the PSO parameter optimization algorithm, the GRU prediction model has a shorter training time and better prediction accuracy for an unseen test data set than the standalone GRU model. Therefore, the combined PSO–GRU algorithm is a better choice for the accurate prediction of COFs from a limited set of experimental data.

The prediction of braking COFs with the PSO–GRU algorithm has significance not only for the fast development and evaluation of an automotive braking system, but also provides meaningful reference for solving complicated tribological problems in other applications.

Author Contributions

Conceptualization, S.W.; Methodology, S.W.; Validation, S.W. and Y.Y.; Formal analysis, Y.Y. and S.W.; Investigation, Y.Y. and S.L.; Data curation, S.W. and Y.Y.; Writing—original draft preparation, Y.Y.; Writing—review and editing, S.W. and D.B.; Supervision, S.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was financially supported by the Science and Technology Committee of Shanghai Municipality (Grant No. 18060502400) and Natural Science Foundation of Shanghai (Grant No. 21ZR1445000).

Data Availability Statement

The data presented in this study are available upon request from the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

Nomenclature

Symbol	Definition	Unit
ACof	average COF	/
ADec	average deceleration	g
APre	average pressure	bar
ATor	average torque	Nm
B	monotone increasing function	/
BSpe	initial braking speed	kph
COF	coefficient of friction	/
COFs	coefficients of friction	/
FTem	final braking temperature	°C
GRU	gated recurrent unit	/
h_t	hidden state at time step t	/
h(x_i)	predicted value	/
I(D\|_G)	probability distribution	/
I^*(D, x, y)	probability distribution under different grid division
ITem	initial braking temperature	°C
LSTM	long short-term memory	/
MAE	mean absolute error	/
MIC	maximal information coefficient	/
MIC(D)	value of MIC	/
MPre	max pressure	bar
MTor	max torque	Nm
PPMCC	Pearson product moment correlation coefficient	/
PSO	particle swarm optimization	/
r	value of PPMCC	/
r_t	activation vector of reset gate	/
R²	evaluation of the model	/
RSpe	release speed	kph
RSS	the sum of residual squares	/
TSS	the sum of the squares of the total deviation	/
vparticle	particle velocity	/
$x_{\max}$	maximum value of all samples	/
$x_{\min}$	minimum value of all samples	/
x_i	measured value of x	/
$x^{'}$	normalized result	/
$\bar{x}$	average value of x	/
$X^{'}$	standardized value	/
$\bar{y}$	average value of y	/
y_i	measured value of y	/
Z_t	activation vector of update gate	/
$μ$	mean of all samples	/
$σ$	standard deviation of all samples	/
$ω_{z}$	weight matrix of the update gate	/
$ω_{r}$	weight matrix of the reset gate	/

References

Zhu, D.; Yu, X.; Sai, Q.; Wang, S.; Barton, D.; Fieldhouse, J.; Kosarieh, S. Noise and vibration performance of automotive disk brakes with laser-machined M-shaped grooves. Proc. Inst. Mech. Eng. Part D J. Automob. Eng. 2022, 237, 978–990. [Google Scholar] [CrossRef]
Wang, S.; Guo, W.; Zeng, K.; Zhang, X. Characterization of automotive brake discs with laser-machined surfaces. Automot. Innov. 2019, 2, 190–200. [Google Scholar] [CrossRef]
Khairnar, H.P.; Phalle, V.M.; Mantha, S.S. Estimation of automotive brake drum-shoe interface friction coefficient under varying conditions of longitudinal forces using Simulink. Friction 2015, 3, 214–227. [Google Scholar] [CrossRef]
Riva, G.; Varriale, F.; Wahlström, J. A finite element analysis (FEA) approach to simulate the coefficient of friction of a brake system starting from material friction characterization. Friction 2021, 9, 191–200. [Google Scholar] [CrossRef]
Meng, Y.; Xu, J.; Ma, L.; Jin, Z.; Prakash, B.; Ma, T.; Wang, W. A review of advances in tribology in 2020–2021. Friction 2022, 10, 1443–1595. [Google Scholar]
Balaji, V.; Lenin, N.; Anand, P.; Rajesh, D.; Raja, V.B.; Palanikumar, K. Brake squeal analysis of disc brake. Mater. Today Proc. 2021, 46, 3824–3827. [Google Scholar] [CrossRef]
Crolla, D.A.; Lang, A.M. Brake noise and vibration: The state of the art. Veh. Tribol. 1991, 18, 165–174. [Google Scholar]
Oberst, S.; Lai, J. Chaos in brake squeal noise. J. Sound Vib. 2011, 330, 955–975. [Google Scholar] [CrossRef]
Jarvis, R.P.; Member, B.M. Vibrations induced by dry friction. Proc. Inst. Mech. Eng. 1963, 178, 847–857. [Google Scholar] [CrossRef]
Nishiwaki, M. Generalized theory of brake noise. Proc. Inst. Mech. Eng. Part D J. Automob. Eng. 1993, 207, 195–202. [Google Scholar] [CrossRef]
Zhang, W.; Guo, W.; Liu, X.; Liu, Y.; Zhou, J.; Li, B.; Lu, Q.; Yang, S. LSTM-based analysis of industrial IoT equipment. IEEE Access 2018, 6, 23551–23560. [Google Scholar] [CrossRef]
Jiang, W.; Zhang, N.; Xue, X.; Xu, Y.; Zhou, J.; Wang, X. Intelligent deep learning method for forecasting the health evolution trend of aero-engine with dispersion entropy-based multi-scale series aggregation and LSTM neural network. IEEE Access 2020, 8, 34350–34361. [Google Scholar] [CrossRef]
Zhang, X.; Zhang, M.; Xiang, Z.; Mo, J. Research on diagnosis algorithm of mechanical equipment brake friction fault based on MCNN-SVM. Measurement 2021, 186, 110065. [Google Scholar] [CrossRef]
Zhang, M.; Zhang, X.; Mo, J.; Xiang, Z.; Zheng, P. Brake uneven wear of high-speed train intelligent monitoring using an ensemble model based on multi-sensor feature fusion and deep learning. Eng. Fail. Anal. 2022, 137, 106219. [Google Scholar] [CrossRef]
Yang, X.; Chen, L. Dynamic state estimation for the advanced brake system of electric vehicles by using deep recurrent neural networks. IEEE Trans. Ind. Electron. 2019, 99, 9536–9547. [Google Scholar]
Šabanovič, E.; Žuraulis, V.; Prentkovskis, O.; Skrickij, V. Identification of road-surface type using deep neural networks for friction coefficient estimation. Sensors 2020, 20, 612. [Google Scholar] [CrossRef] [PubMed]
Stender, M.; Tiedemann, M.; Spieler, D.; Schoepflin, D.; Hoffmann, N.; Oberst, S. Deep learning for brake squeal: Brake noise detection, characterization and prediction. Mech. Syst. Signal Process. 2021, 149, 107181. [Google Scholar] [CrossRef]
Wang, S.; Zhong, L.; Niu, Y.; Liu, S.; Wang, S.; Li, K.; Wang, L.; Barton, D. Prediction of frictional braking noise based on brake dynamometer test and artificial intelligent algorithms. Proc. Inst. Mech. Eng. Part D J. Automob. Eng. 2021, 236, 2681–2695. [Google Scholar] [CrossRef]
Alexsendric, D.; Barton, D.C. Neural network prediction of disc brake performance. Tribol. Int. 2009, 42, 1074–1080. [Google Scholar] [CrossRef]
Alexsendric, D.; Barton, D.C.; Vasic, B. Prediction of brake friction materials recovery performance using artificial neural networks. Tribol. Int. 2010, 43, 2092–2099. [Google Scholar] [CrossRef]
J2521_201304; Disc and Drum Brake Dynamometer Squeal Noise Test Procedure. SAE International: Warrendale, PA, USA, 2013.
Shalabi, L.A.; Shaaban, Z.; Kasasbeh, B. Data mining: A preprocessing engine. J. Comput. Sci. 2006, 2, 735–739. [Google Scholar] [CrossRef]
Curtis, A.E.; Smith, T.A.; Ziganshin, B.A.; Elefteriades, J.A. The mystery of the Z-score. Aorta 2016, 4, 124–130. [Google Scholar] [CrossRef] [PubMed]
Benesty, J.; Chen, J.D.; Huang, Y. On the importance of the Pearson correlation coefficient in noise reduction. IEEE Trans. Audio Speech Lang. Process. 2008, 16, 757–765. [Google Scholar] [CrossRef]
Reshef, D.N.; Reshef, Y.A.; Finucane, H.K.; Grossman, S.R.; McVean, G.; Turnbaugh, P.J.; Lander, E.S.; Mitzenmacher, M.; Sabeti, P.C. Detecting novel associations in large data sets. Science 2011, 334, 1518–1524. [Google Scholar] [CrossRef]
Zargar, S. Introduction to Sequence Learning Models: RNN, LSTM, GRU; Department of Mechanical and Aerospace Engineering: Colorado Springs, CO, USA, 2021. [Google Scholar]

Figure 1. Research procedure and the structure of the paper.

Figure 2. LINK3900 brake performance test rig (partial).

Figure 3. BP box plot.

Figure 4. PPMCC matrix heatmap.

Figure 5. MIC matrix heatmap.

Figure 6. RNN cell expansion structure diagram.

Figure 7. Gated structure in LSTM.

Figure 8. Gated structure in GRU.

Figure 9. Evaluation metrics.

Figure 10. Comparison of predicted and measured COF for test set.

Figure 11. Flow chart of PSO–GRU prediction model for brake COF.

Figure 12. Loss curves of GRU model and PSO–GRU model for test set.

Figure 13. Comparison of the predicted and measured COF for test set after model optimization.

Table 1. Screenshot of typical experimental data obtained from braking tests.

Stop	Brake Speed (kph)	Release Speed (kph)	Stop Time (s)	Avg Decel (g)	Avg Torq (Nm)	Max Torq (Nm)	Avg Press (bar)	Max Press (bar)	Avg μ Level	Initial Temp Rotor C	Final Temp Rotor C	Peak Level (dBA)	Frequency of Peak (Hz)	Above Threshold 70 dBA
1	79.7	30.0	8.377	0.18	565	617	15.0	15.8	0.44	100	158	73.5	2800	YES
2	80.0	30.0	4.68	0.35	1087	1145	29.9	30.6	0.42	100	163	69.9	5900
3	80.1	30.0	8.32	0.18	569	620	15.0	15.7	0.44	100	158	59.2	5950
4	79.8	30.0	7.01	0.22	674	730	18.0	18.6	0.44	100	157	69.0	5975
5	80.0	30.0	5.99	0.26	819	877	21.9	22.5	0.43	100	158	70.9	2800	YES
6	79.8	30.0	3.83	0.44	1357	1404	37.8	38.6	0.42	100	164	71.5	2800	YES
7	79.7	30.0	8.35	0.18	563	615	15.0	15.9	0.44	100	156	69.1	6200
8	79.7	30.0	5.25	0.31	948	1010	25.9	26.5	0.43	100	160	69.9	5975
9	80.1	30.0	7.06	0.22	670	726	18.0	18.5	0.43	100	157	67.6	6325
10	80.0	30.0	4.19	0.40	1232	1288	33.9	34.6	0.42	100	160	71.4	2925	YES
11	80.0	30.0	8.36	0.18	564	621	15.0	15.8	0.44	100	155	68.6	6350

Table 2. Description of the parameters required by the GRU.

Parameter	Parameter Interpretation	Default
input_size	Number of input features	4
output_size	Number of output features	1
rnn_unit	Hidden layers	64
lr	Learning rate	0.001
epoch	Iterations	100

Table 3. Performance comparison of GRU and LSTM algorithms.

Model	R²	MAE	Training Time (ms)
GRU	0.893	0.016	220
PSO–GRU	0.935	0.014	157

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Wang, S.; Yu, Y.; Liu, S.; Barton, D. Braking Friction Coefficient Prediction Using PSO–GRU Algorithm Based on Braking Dynamometer Testing. Lubricants 2024, 12, 195. https://doi.org/10.3390/lubricants12060195

AMA Style

Wang S, Yu Y, Liu S, Barton D. Braking Friction Coefficient Prediction Using PSO–GRU Algorithm Based on Braking Dynamometer Testing. Lubricants. 2024; 12(6):195. https://doi.org/10.3390/lubricants12060195

Chicago/Turabian Style

Wang, Shuwen, Yang Yu, Shuangxia Liu, and David Barton. 2024. "Braking Friction Coefficient Prediction Using PSO–GRU Algorithm Based on Braking Dynamometer Testing" Lubricants 12, no. 6: 195. https://doi.org/10.3390/lubricants12060195

APA Style

Wang, S., Yu, Y., Liu, S., & Barton, D. (2024). Braking Friction Coefficient Prediction Using PSO–GRU Algorithm Based on Braking Dynamometer Testing. Lubricants, 12(6), 195. https://doi.org/10.3390/lubricants12060195

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Braking Friction Coefficient Prediction Using PSO–GRU Algorithm Based on Braking Dynamometer Testing

Abstract

1. Introduction

2. Braking Dynamometer Testing and Typical Results

3. Feature Engineering

3.1. Data Cleaning

3.1.1. Min–Max Scaling

3.1.2. Z-Score Normalization

3.1.3. Outlier and Missing Value Treatment

3.2. Investigating the Relationships between Features

3.2.1. Pearson Correlation Coefficient

3.2.2. Maximal Information Coefficient

3.3. Selection of Features

4. Prediction Algorithms

4.1. LSTM Method

4.2. GRU Method

Gated Computation for GRU

4.3. Model Evaluation Index

4.3.1. Correlation Index R²

4.3.2. MAE Index

5. Prediction Results

5.1. Model Comparison

5.2. Predictions of the GRU Model

5.3. Predictions of the GRU Model Optimized with the PSO Algorithm

5.4. Model Comparison

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Nomenclature

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Braking Friction Coefficient Prediction Using PSO–GRU Algorithm Based on Braking Dynamometer Testing

Abstract

1. Introduction

2. Braking Dynamometer Testing and Typical Results

3. Feature Engineering

3.1. Data Cleaning

3.1.1. Min–Max Scaling

3.1.2. Z-Score Normalization

3.1.3. Outlier and Missing Value Treatment

3.2. Investigating the Relationships between Features

3.2.1. Pearson Correlation Coefficient

3.2.2. Maximal Information Coefficient

3.3. Selection of Features

4. Prediction Algorithms

4.1. LSTM Method

4.2. GRU Method

Gated Computation for GRU

4.3. Model Evaluation Index

4.3.1. Correlation Index R2

4.3.2. MAE Index

5. Prediction Results

5.1. Model Comparison

5.2. Predictions of the GRU Model

5.3. Predictions of the GRU Model Optimized with the PSO Algorithm

5.4. Model Comparison

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Nomenclature

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

4.3.1. Correlation Index R²