Next Article in Journal
Support Vector Regression for the Modeling and Synthesis of Near-Field Focused Antenna Arrays
Previous Article in Journal
Effects of the Body Wearable Sensor Position on the UWB Localization Accuracy
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Design and Analysis for Early Warning of Rotor UAV Based on Data-Driven DBN

1
School of Optical-Electrical and Computer Engineering, University of Shanghai for Science and Technology, Shanghai 200093, China
2
O’Neill School of Public and Environmental Affairs, Indiana University, Bloomington, IN 47405, USA
3
Department of Mathematics and Computer Science, Northeastern State University, Tahlequah, OK 74464, USA
*
Author to whom correspondence should be addressed.
Electronics 2019, 8(11), 1350; https://doi.org/10.3390/electronics8111350
Submission received: 14 October 2019 / Revised: 4 November 2019 / Accepted: 11 November 2019 / Published: 14 November 2019
(This article belongs to the Section Computer Science & Engineering)

Abstract

:
The unmanned aerial vehicle (UAV), which is a typical multi-sensor closed-loop flight control system, has the properties of multivariable, time-varying, strong coupling, and nonlinearity. Therefore, it is very difficult to obtain an accurate mathematical diagnostic model based on the traditional model-based method; this paper proposes a UAV sensor diagnostic method based on data-driven methods, which greatly improves the reliability of the rotor UAV nonlinear flight control system and achieves early warning. In order to realize the rapid on-line fault detection of the rotor UAV flight system and solve the problems of over-fitting, limited generalization, and long training time in the traditional shallow neural network for sensor fault diagnosis, a comprehensive fault diagnosis method based on deep belief network (DBN) is proposed. Using the DBN to replace the shallow neural network, a large amount of off-line historical sample data obtained from the rotor UAV are trained to obtain the optimal DBN network parameters and complete the on-line intelligent diagnosis to achieve the goal of early warning as possible as quickly. In the end, the two common faults of the UAV sensor, namely the stuck fault and the constant deviation fault, are simulated and compared with the back propagation (BP) neural network model represented by the shallow neural network to verify the effectiveness of the proposed method in the paper.

1. Introduction

The rotor UAV [1] is an aircraft that does not carry a pilot. It has been widely used in military and civilian fields for its unique advantages, so it is indispensable to ensure the safety and reliability of the rotor UAV flight control system. As an important device for information acquisition, the sensors [2] provide guarantees for the reliable safety of systems. Once faults occur, the flight safety of the rotor UAV will be affected, which will inevitably bring about system control performance degradation. Therefore, rapid detection of sensor failure [3] is a prerequisite for ensuring flight safety.
At present, for rotor UAV flight system sensors, the fault diagnosis method [4] is mostly model based [5], which [6] relies on the accurate model of the system [7]. However, owing to the improvement of computer capabilities, the advancement of artificial intelligence, and ultra-precision technology recently, the rotor UAV flight system has emerged an increasingly complex development trend, which is very difficult to obtain accurate mathematical models [8,9]. In contrast, the data-driven fault diagnosis method [10] has been proposed due to having no need for obtaining an accurate model of the system. The fault diagnosis can be completed only by using the input and output data of the system. Among them, the neural network method, which has self-association, self-adaptation, and no need to establish accurate mathematical models, are widely used in data-driven fault diagnosis methods. So far, researchers have used neural network methods to conduct extensive research on fault detection technology. The paper [11] presents a method of classifying impact noises obtained from a washer machine by obtaining the time frequency image of the sound signals, which is employed as the input signal to an artificial neural network classifier. A convolutional neural network, which is used to extract the residual signal from different sensor faults into the corresponding time-frequency map and fault characteristics to realize the diagnosis of the UAV sensor, is proposed in the literature [12]. From the visualization, the sensor failure information can be successfully constructed by the convolutional neural network (CNN) extracting the fault diagnosis logic between the residual and the health state. Reference [13] proposed wavelet packet threshold denoising and BP neural network methods for fault diagnosis of rolling bearings. In this method, the Levenberg–Maquardt algorithm was used to improve the traditional BP neural network, which greatly improves the diagnostic level. In reference [14], a novel data-driven adaptive neuron fuzzy inference system (ANFIS)-based approach was proposed to detect on-board navigation sensor faults in UAVs. The main advantages of this algorithm are that it allows the Kalman filter to estimate real-time model-free residual and ANFIS to build a reliable fault detection system. According to the experimental results, it was demonstrated that the method can not only detect fault quickly, but also can be used in real-time applications.
Nevertheless, the traditional shallow neural network method [15] has the disadvantages of over-fitting, local minimum, gradient attenuation, and poor generalization ability, which makes the effect of fault detection unsatisfactory [16].
Therefore, this paper proposes a deep learning method, deep belief network (DBN), instead of shallow neural network. As one of the classic algorithms for deep learning, DBN [17] solves problems such as dimension reduction, information retrieval, and fault classification successfully because of an excellent training algorithm and feature extraction. Therefore, it is applied to the field of fault diagnosis and has certain practicability.
In view of the above discussion, the fault diagnosis of rotor UAV flight control system sensor has been taken as an example and a fault diagnosis method for DBN is presented by this paper. By training a large number of offline historical sample data, the optimal network parameters obtained perform the feature extraction of fault and analyze more essential data features to make it easy to detect faults.

2. Mathematical Model of the Rotor UAV Flight Control System

2.1. Four-Rotor UAV Model

The Quadrotor AUV [18] is a system controlled by six degrees of freedom with strong coupling, nonlinearity, and interference sensitivity. The four rotors are symmetrically distributed in an “X” shape or a “十” shape, and the center of gravity of the rotor UAV is at geometric center. The power of the UAV is generated by four rotors [19], and the rotation of the rotor produces an upward lift, the magnitude of which is proportional to the square of the angular velocity of the rotor rotation   w , that is:
F i = K w i 2 i = 1 , 2 , 3 , 4
The Quadrotor AUV controls the attitude and position of the flight through four rotors. The two sets of rotors rotate in the opposite direction to counteract the anti-torsion moment to maintain the attitude stability. The total lift in the vertical direction is generated by four rotors [20], and the rotational speed difference of all the rotors produces the torque of horizontal direction to cause a yawing motion; the difference in rotational speed between the front and rear rotors controls the pitching motion; the left and right rotors controls the rolling motion [21]. The lift force are expressed as follows:
U z = F 1 + F 2 + F 3 + F 4 , U θ = b l ( F 2 + F 4 ) , U ϕ = b l ( F 1 F 3 ) , U φ = d ( F 1 + F 2 F 3 + F 4 ) ,
where b , d , l are respectively the rotor lift coefficient, the drag index, and the distance from the center of gravity to the axis of the quadrotor UAV ;   U z , U θ , U ϕ , U φ   are respectively the total lift, rolling moment, pitching moment, and yawing moment of the rotor UAV [22]. Through the Newton–Eulerian formula, and assuming that the UAV is in slow flight or hover transition, the kinematics model is obtained. The results are as follows:
X ¨ = ( c o s ϕ s i n θ c o s φ + s i n ϕ s i n φ ) U z / m , Y ¨ = ( c o s ϕ s i n θ c o s φ + s i n ϕ s i n φ ) U z / m , Z ¨ = g ( c o s ϕ c o s θ ) U z / m , ϕ ¨ = ( θ φ ¨ ( J Y J Z ) + U θ ) / J X , θ ¨ = ( ϕ φ ¨ ( J Z J X ) + U ϕ ) / J Y , φ ¨ = ( θ ϕ ¨ ( J X J Y ) ) / J Z ,
where X ¨ , Y ¨ , Z ¨ are the accelerations of the rotor UAV in the ground coordinates and θ , ϕ , φ   are respectively the roll, pitch, and yaw of the four-rotor UAV. m is the mass of the four-rotor UAV and g is the acceleration of the UAV. J X , J Y , J Z are the moment of inertia of the   X   shaft, Y   shaft, and Z   shaft.

2.2. Flight Coordinate System Model

2.2.1. North East Coast Coordinate System

The Northeast coordinate system [23] is the geodetic coordinate system used by the DJI aircraft. Origin O g   is the take-off point. The three axes of the coordinate system are marked as the right north direction   O g X g , the right east direction   O g Y g , and the vertical ground direction   O g Z g . In the attitude data packet, the north, east, and downward speed curves can be found. In these curves, the value is positive indicating that the speed is north, east, or down.

2.2.2. Aircraft Local Coordinate System

The aircraft center point O b is regarded as the coordinate origin in the aircraft local coordinate system. The three axes correspond to the front and back   O b X b , left and right   O b Y b , and up and down   O b Z b of the aircraft, respectively; positive and negative apply to the right hand screw rule, as shown in Figure 1.

2.2.3. Speed Coordinate Systems

The origin is taken at the center of gravity of the aircraft, and the axis O a X a is in the same direction as the flight speed   V ; the axis   O a Z a   is located on the vertical axis O a X a of the plane of symmetry of the aircraft, pointing to the belly; O a Y a is perpendicular to the plane   X a O a Z a , pointing to the right [24]. In the attitude data packet, the north, east, and downward speed curves can be found. In these curves, the value is positive indicating that the speed is north, east, or down.

2.2.4. Kinematic Equations for Angular Velocity

In order to describe the movement of the rotor UAV relative to the ground, the geometric relationship between the triaxial attitude angle [25] change rate and the three angular velocity components of the UAV is established as follows:
θ ˙ = p + q s i n θ t a n ϕ + c o s θ t a n ϕ , ϕ ˙ = q c o s θ r s i n θ , φ ˙ = q s i n θ / c o s ϕ + r c o s θ / c o s ϕ ,
where p represents the rolling rate, q represents the pitching rate, and r   represents the yaw rate.

2.3. Deep Confidence Network Model

The deep learning idea [26] is inspired by the biological nervous system. It is made up of an input layer, multiple hidden layers, and an output layer. Each layer is connected to each other through nodes or neurons. The output of the previous layer is regarded as the input of each hidden layer. DBN, one of the classical algorithms for deep learning, can automatically extract low-level to high-level, concrete-to-abstract features [27] from raw data through a series of nonlinear transformations and is composed of a number of restricted Boltzmann machines (RBM) [28], which are commonly used to initially set the parameters of the feedforward neural network in order to improve the generalization ability of the model. The RBM network consists of   n   neurons and   m   hidden layer neurons. The connection between nodes exists only between layers. RBM comes from the classical thermal theory. The smaller the energy function is, the more stable the system is. The minimum energy of the network is trained to obtain the optimal parameters of the network. The energy function is expressed as follow:
E ( v , h ) = i = 1 n a i v i j = 1 m b j h j i , j v i h j w i j ,
where v i ,   h j   are respectively the random state of the   i   unit of the visible layer and the   j unit of the hidden layer; a i and   b j are the corresponding bias; w i j are the weights between the two units. The purpose of training the network is to derive the optimal parameters ( w i j , a i ,   b j ) . The core is the process of using the layer-by-layer greedy learning algorithm to optimize the connection weight of the deep neural network, which firstly use the unsupervised layer-by-layer training method to effectively mine the fault features in the device to be diagnosed, and then add the corresponding classifier based through the way that reverse supervised fine-tuning to optimize the fault diagnosis capability of DBN. Some nonlinear complex functions can be learned from unsupervised layer-by-layer training by directly mapping data from input to output, which is the key to powerful feature extraction capabilities. Typical network structure [29] is shown in Figure 2.

3. Fault Diagnosis Method Based on Deep Confidence Network

3.1. Off-Line Training Based on the DBN Model

3.1.1. Deep Confidence Network Feature Extraction

Deep belief network, a self-learning feature extraction algorithm [30], has been widely used in many application fields with its powerful feature extraction capability [31] and participation without requiring a large amount of tag data. The process of DBN extracting fault features is shown in Figure 3.
The data layer is the visible layer and the initial input data. First, the data vector Data and the first layer hidden layer are used as the first RBM to train the weight w and bias a of the RBM, and then the parameters of the RBM are fixed;   h 1 is regarded as the visible vector, and   h 2 is treated as the hidden vector to train the second RBM to get its parameters, namely weight   w and bias   b , and then fix these parameters; finally, to train RBM, which is composed of   h 1 and   h 2 , the specific training algorithm is shown in Algorithm 1.
Algorithm 1. Description of RBM update algorithm.
This is the RBM update procedure for binomial units. It can easily adapted to other types of units.
x 1 is a sample from the training distribution for the RBM
λ is a learning rate for the stochastic gradient descent in Contrastive Divergence
w is the RBM weight matrix, of dimension(number of hidden units, number of inputs)
a is the RBM offset vector for input units
b is the RBM offset vector for hidden units
Notation: Q ( h 2 = 1 | x 2 ) is the vector with elements
Q ( h 2 i = 1 | x 2 )
 for all hidden units i do
  compute   Q ( h 1 i = 1 | x 1 ) (for binomial units, sigm( b i + j w i j x 1 j ))
  sample h 1 i ϵ { 0 , 1 } from Q ( h 1 i | x 1 )
 end for
 for all visible units   j do
  compute P ( x 2 j = 1 | h 1 ) (for binomial units, sigm ( a j + i w i j h 1 i ) )
  sample x 2 j ϵ { 0 , 1 } from P ( x 2 j = 1 | h 1 )
 end for
 for all hidden units i do
  compute Q ( h 2 i = 1 | x 2 ) (for binomial units, sigm ( b i + j w i j x 2 j ) )
 end for
w w + λ ( h 1 x 1 Q ( h 2 = 1 | x 2 ) x 2 )
a a + λ ( x 1 x 2 )
b = b + λ ( h 1 Q ( h 2 = 1 | x 2 ) )

3.1.2. Deep Confidence Network Training

In fact, the training of RBM is to find the probability distribution that produces the training samples well. Therefore, in order to eliminate the error caused by the data difference between different latitude and longitude, the original data need to be normalized.
X = X . min (   ) , X + = X . max (   ) .
The DBN learning and training process is mainly divided into two parts:
1. Unsupervised pre-training based on restricted Boltzmann machines from the bottom to the top.
Since the deep belief network is a neural network based on probability model, the decisive factor of its probability distribution depends on the weight w , so the goal of the training is to find the best weight. The contrastive divergence (CD) algorithm that finds the best weight is to randomly initialize the parameter set of RBM [32] w i j , a i , b j . Among them, a i is the bias of the   i   node of the visible layer;   b j is the bias of the   j node of the hidden layer;   w i j is the connection weight of the   i node of the visible layer; and the   j node of the hidden layer.   r e c o n is a reconstructed sample obtained by sampling Gibbs to the sample to estimate the expectation. The learning algorithm is as follows, in Equation (7). Simultaneously, the description of the CD-k algorithm and train unsupervised DBN algorithm are expressed in Algorithms 2 and 3, respectively.
Δ w i j = λ ( v i h j d a t a v i h j r e c o n ) , Δ a i = λ ( v i d a t a v i r e c o n ) , Δ b j = λ ( h j d a t a h j r e c o n ) ,
Algorithm 2. CD-k algorithm description.
Input:   RBM ( V 1 , , V n , H 1 , , H m ), training batch S
Output: gradient approximation Δ w i j , Δ a i   and   Δ b j for i = 1 , , n ; j = 1 , , m .
1 
initialize Δ w i j = Δ a i = Δ b j = 0 for i = 1 , , n ; j = 1 , , m
2 
  for all the v ϵ S do
3 
   v ( 0 ) v
4 
  for t = 0 , , k 1 do
5 
   for i = 1 , , n do sample h i ( t ) p ( h i | v ( t ) )
6 
   for j = 1 , , m do sample   v j ( t + 1 ) p ( v j | h ( t ) )
7 
  for i = 1 , , n ; j = 1 , , m do
8 
   Δ w i j Δ w i j + p ( H i = 1 | v ( 0 ) ) · v j 0 p ( H i = 1 | v ( k ) ) · v j ( k )
9 
   Δ a i Δ a i + v j ( 0 ) v j ( k )
10 
   Δ b j Δ b j + p ( H i = 1 | v ( 0 ) ) p ( H i = 1 | v ( k ) )
Algorithm 3. Description of the train unsupervised DBN algorithm.
Train a DBN in a purely unsupervised way, with the greedy layer-wise procedure in which each added layer is trained as an RBM.
P ^ is the input training distribution for the network
λ is a learning rate for the RBM training
η is the number of layers to train
w k is the weight matrix for k , for   k from 1 to η
a k is the visible units offset vector for RBM at level k , for   k from 1 to η
b k is the hidden units offset vector for RBM at level   k , for   k from 1 to η
Mean_field_computation is Boolean that is true if training data at each additional level is obtained by a mean-field approximation instead of stochastic sampling
for k = 1 to η do
  initialize w k = 0 , a k = 0 , b k = 0
  while not stopping criterion do
    sample h 0 = x from P ^
    for i = 1 to k 1 do
     if mean_field_computation then
      assign h j i to Q ( h j i = 1 | h i 1 ) , for all elements j of h i
     else
      sample h j i from Q ( h j i   |   h i 1 ) , for all elements   j of h i
     end if
    end for
     RBMupdate ( h k 1 , λ , W k , a k , b k ) {thus providing Q ( h k | h k 1 ) for future use}
end while
end for
The CD algorithm is used to train layer by layer for DBN, obtaining the parameters of each layer and initializing the DBN, and then fine-tuning the parameters with the supervised learning algorithm.
2. Supervised tuning training from the top to the bottom.
For supervised tuning training, the forward propagation algorithm is used to obtain a certain output value from the input firstly, and then the backward propagation algorithm is used to update the weights and bias values of the network.
• Forward Propagation Algorithm
1. Pre-trained w , b with the CD algorithm to determine the opening and closing of the corresponding hidden elements. Calculating the stimulus values for each hidden element are as follows:
h ( l ) = w ( l ) · v + b ( l )
where l is the layer index of the neural network. The values of w and b are as follows:
w = [ w 1 , 1 w 2 , 1 w 1 , 2 w 2 , 2       w n , 1 w n , 2 w 1 , m w 2 , m w n , m ] , b = [ b 1 b 2 b m ] ,
where w i , j represents the weight from the   i explicit element to the   j   hidden element.
2. Spread out layer by layer, calculate the excitation value of each hidden element in the hidden layer layer by layer, and standardize it with sigmoid function, as shown below:
σ ( h j ) ( l ) = 1 1 + e h j .
3. Finally, the excitation value and output of the output layer are calculated as follows:
h ( l ) = w ( l ) · h ( l 1 ) + b ( l ) , X ^ = f ( h ( l ) ) ,
where f ( · ) represents the activation function of the output layer and the output value of the output layer is X ^ .
• Back Propagation Algorithm
1. The error back propagation algorithm of the reconstruction error criterion is used to update the parameters of the whole network and evaluate whether the RBM is trained in the paper. The reconstruction error is the difference between the training data and the original data after the Gibbs sampling by RBM, as shown below:
J = k = 1 n     v v ( k )   .
The reconstruction error is continuously reduced by iteration times until all RBM training is completed. Finally, the global fine-tuning is performed. Since the last layer of the deep confidence network is used for parameter fitting, the activation function of the last layer selects the hyperbolic tangent function, namely:
f ( x ) = 1 1 / ( 1 + e 2 x ) .
The output value of the network between −1 and 1 is made. The process of fine-tuning the deep belief network parameters is the process of tuning using the back propagation algorithm. Given the input and output samples, the gradient descent algorithm is used to update the network weights and bias parameters as follows:
( w l , b l ) ( w l , b l ) λ · E ( w l , b l ) ,
where λ is the learning rate whose numerical value represents the step size of each parameter adjustment. It is generally between 0.005 and 0.200, where λ = 0.10 . In order to overcome the problem that the training process easily falls into the local minimum value, the impulse term is introduced, and the parameter update direction is inconsistent with the gradient direction. The method is as follows:
w i j t + 1 = m w i j t + λ θ w i j ,
where   m is the momentum term, where m = 0.5 ; t is the number of iterations for the sample.

3.2. Objective Function Establishment of DBN Fault Diagnosis Model

The deep belief network model is essentially a mapping relationship between input data and output data, that is,
θ x = f ( H x 1 , H ˙ x 1 , W x , p x , q x , ϕ x , θ x 1 ) , ϕ x = f ( H x 1 , H ˙ x 1 , W x , q x , r x , θ x , ϕ x 1 ) , φ x = f ( H x 1 , H ˙ x 1 , W x , q x , r x , θ x , φ x 1 ) ,
where H stands for the flight height of the aircraft; H ˙ expresses the rate of change of altitude;   W expresses the wind speed.

3.3. Online Diagnosis Based on the DBN Model

After the offline training of the deep confidence network is completed, the online estimation can be performed, and the residual between the estimated value and the real value is used to judge whether the fault is faulty. The residual at a certain moment is specifically described as follows:
e ( t ) = o ˜ ( t ) o ( t ) .
The detection threshold   K t is set for each parameter sensor. By comparing the residual and the threshold, we can determine if there is a fault. When | e ( t ) | < K t , it is judged to be faultless; when | e ( t ) | K t , it is determined to be faulty. The value of K t depends on the specific parameters as the case may be. The sensor output value can intuitively determine the stuck fault within a certain period of time, but other sensor fault types such as the constant deviation fault needs to satisfy the mathematical expression of the unknown fault type, as follows:
Y ( t ) = k y ( t ) + a , t T
where k is the failure factor, a is the deviation, and t is the time at which the failure occurred.
Different fault types correspond to different parameters k and a in Equation (18). In the case of a fault within a certain period of time, the DBN estimated value y ˜ ( t ) is used instead of the true value y ( t ) in Equation (18), that is:
Y ( t ) = k y ˜ ( t ) + a , t T .
As long as the values of the parameters k and a are known and the estimated values k ˜ and a ˜ are obtained by a function fitting, the category of the fault can be distinguished. The specific fault type and parameter are as follows in Table 1.
When the sensor fault is detected by the deep confidence network, the signal reconstruction should be taken in time; that is, the output signal of the sensor is disconnected, and the output of the sensor is replaced by the output estimated by the DBN to ensure that the aircraft continues to fly safely.

4. Experiment and Analysis

4.1. Experimental Platform

The experiment is run on the Windows Operating System, which is configured as Intel Core i7, 16G memory. The encoding is done on the PyCharm platform that include the TensorFlow Framework. By comparing the model of the traditional BP neural network and the DBN network in the paper, the results will be analyzed accordingly.
The data used in this experiment are derived from the DJI four-rotor UAV. The data in the rotor UAV flight control data record mainly consist of eight parts, namely attitude data, on-screen display (OSD) data, controller data, remote control data, motor data, motor governor data, battery data, and obstacle avoidance data. The data are collected mainly from the attitude data as the source of experimental data in the paper. The attitude data mainly include information on sensors such as position, velocity, angular velocity, accelerometer, gyroscope, magnetometer, barometer, and so on.
The flight data are obtained from the flight attitude data of the rotor UAV. Via the process of normalization, the training samples and the test samples are established. First, the training samples are used for model training, and the weight and bias are continuously adjusted to make it converge quickly, and then the test samples is tested for the fault diagnosis effect to obtain the optimal DBN model. The fault diagnosis process is shown in Figure 4. When collecting data [33], data obtained in various flight state as much as possible, including altitude, altitude change rate, wind speed, pitch, yaw, roll pitch rate, yaw rate, and roll rate. When collecting training data, the flight altitude is 200 m, the flight speed is 40 m/s, the sampling time is 600 s, and the sampling period is 0.1 s.

4.2. Model Evaluation Index Determination

In order to better analyze the model in the paper, root mean square error (RMSE) and coefficient of determination (R2) are used for the index of regression evaluation. The description is defined as follows.

4.2.1. The Description of the RMSE

RMSE is a measure that reflects the degree of difference between predicted value and actual value. The larger the value is, the larger the difference is. The formula is as follows:
RMSE ( y i , y i ^ ) = 1 n i = 1 n ( y i y i ^ ) 2 ,
where n is the dimension of the sequence,   y i ^ represents the prediction value of sensor angular rate, and y i represents the actual value of sensor angular rate.

4.2.2. The Description of the R2

R2 is also called the goodness of fit. The larger the goodness of fit, the denser the observation point is near the regression line. R2 is in the range between 0 and 1. The larger the value, the better the prediction effect. The expression formula is as follows:
R 2 ( y i , y i ^ ) = 1 i = 1 m ( y i y i ^ ) 2 i = 1 m ( y i y i ¯ ) 2 ,
where y i ¯ is expressed as the average value of sensor angular rate.

4.3. Model Structure Selection and Training Results

In the testing, the accuracy of fault diagnosis has a certain relationship with the training samples and the number of RBM layers in the network. When the training samples is different, the RBM layers of the network will also change accordingly. The relationship between the three variables is shown in Figure 5. Firstly, the underlying RBM is constructed with a network model of 3 to 10 layers. The number of neurons in the middle layer is initialized by the random number between 10 and 100. After an overwhelming number of training tests, the iteration times and the reconstruction error convergence curve are as shown in Figure 6.
As can be seen from Figure 6, in the initial stage, as the number of iteration times increases, the reconstruction error decreases rapidly. When the number of iteration time is greater than 200 times, the error reduction gradually stabilizes. Therefore, the number of RBM iteration time per layer is set to 200. On the premise that the training samples is fixed and the number of RBM iterations per layer is set to 200, the diagnostic accuracy of different RBM layers is tested. Change the number of RBM layers, starting at level 0 and modeling only the top level classifier until level 10. The correct relationship diagram is shown in Figure 7.
Figure 7 shows that with the increase of RBM layers, the accuracy of fault dignosis is on the rise, and the trend gradually becomes slow. When the RBM is 0 layer, the classification result has the lowest correct rate, only 30%. As the number of layers increases, the discriminative performance of the model increases continuously. However, when the effect of seven layers is reached, the correct rate curve almost no longer rises. The number of bottom RBM layers is six layers and it has reached 95% or more, so the number of RBM layers of the selected model is six layers. Lastly, using the roll sensor, the yaw sensor, and the pitch sensor under normal working conditions, the shallow neural network BP [34] is compared with the deep neural network DBN proposed in the paper, as shown in Figure 8, Figure 9 and Figure 10.
In Figure 8, it can be seen that at 22 s of flight time (x = 22), the flying hand hits about 2% of the crossbar (y = −2.1°) to the left. Then, it leaned to the right again. Corresponding to the actual flight situation, the flying hand went to the left and rolled the bar, but it was released again. According to this situation, the DBN model proposed in this paper can respond well to the stroke situation and has a strong generalization ability.
As can be seen in the trend of the curve in Figure 9, the aircraft yaw is −68° (y = −68°) at 9 s of flight time (x = 9). Nevertheless, at about 10 s, the yaw changes and the value increases from small to large, indicating that the aircraft has rotated clockwise.
In Figure 10, the aircraft pitch is stable in the beginning, but between 6 s and 13 s, the aircaft pitch fluctuates violently, which shows that the flying hand went to the left and right continuously. The method DBN proposed in the paper can fit the measured value more accurately.
Table 2 presents the RMSE value and the R2 value between the predicted value and the actual value calculated by the BP neural network and the DBN network of Figure 7, Figure 8 and Figure 9. As can be seen from Table 2, the RMSE is lower and the R2 is more accurate based on the DBN network. It can be shown that DBN proposed in the paper can better fit the real value of the system compared with the traditional shallow BP neural network, so as to quickly diagnose faults for providing a good foundation.

4.4. Experimental Results

After the training of the DBN model is completed, it can be used for online diagnosis of angle sensor faults. The following is a simulation verification of the pitch, roll, and yaw three sensor in the injection faults.

4.4.1. Sensor Stuck Fault Diagnosis

1. Pitch injection failure
In Figure 11, after the 10 s injection fault, the measurement value does not change any more. The method DBN proposed in the paper can fit the measured value more accurately. When the sensor fault is detected, the output value estimated by the DBN is used to replace the measurement value of the sensor, which provides a guarantee for the safe flight of the UAV.
2. Roll injection failure
In Figure 12, at 10 s of flight time (x = 10), the flying hand hits about 2% of the crossbar (y = 2.2°) to the right, and the duration of the entire crossbar is about 8 s to 10 s. Then, it leaned to the left again. Corresponding to the actual flight situation, the flying hand went to the right and rolled the bar, but it was released again. Therefore, the embodiment of the posture is to tilt to the right and immediately reverse the brake to slow down. According to this situation, the DBN model proposed in this paper can respond well to the stroke situation. When the sensor fault is detected, the output value estimated by the DBN is used to replace the measurement value of the sensor.
3. Yaw injection Failure
As can be seen in the trend of the curve in Figure 13, the aircraft yaw is −68° (y = −68°) at 9 s of flight time (x = 9). Nevertheless, at about 10 s, the yaw changes and the value increases from small to large, indicating that the aircraft has rotated clockwise. When the sensor fault is detected, the output value estimated by the DBN is used to replace the measurement value of the sensor.

4.4.2. Sensor Constant Deviation Fault Diagnosis

1. Pitch injection failure
Taking the pitch sensor fault as an example, 2°/s constant deviation fault is injected at 10 s, and other simulation conditions are unchanged. The results are as follows.
As can be seen from Figure 15, the DBN network has a smaller and more accurate estimation error than the BP network. In order to further judge the type of fault, linearity fitting is used to obtain k ˜ and a ˜ . The fault fitting result shown in Figure 14 is k ˜ = 0.957, a ˜ = 2.135, and the fault corresponding to the deviation is about 2. It can be obtained from Figure 15 that the K t is set to 1.8. By fitting the parameter a ˜ , it can be quickly inferred that the fault type is the sensor constant deviation fault. After the constant deviation fault is identified, the deviation of a ˜ is corrected based on the sensor fault signal to achieve reconstruction of the fault signal.
2. Roll injection failure
Taking the roll sensor fault as an example, 0.3°/s constant deviation fault is injected at 18 s, and other simulation conditions are unchanged. The results are as follows.
From Figure 16, the fault fitting result is k ˜ = 1.023, a ˜ = 0.295, and the corresponding deviation is about 0.3. It can be obtained from Figure 17 that the K t is set to 0.3. By fitting the parameter a ˜ , it can be quickly inferred that the fault type is the sensor constant deviation fault. After the constant deviation fault is identified, the deviation of a ˜ is corrected based on the sensor fault signal to achieve reconstruction of the fault signal.
3. Yaw injection failure
Taking the yaw sensor fault as an example, 0.08°/s constant deviation fault is injected at 21 s, and other simulation conditions are unchanged. The results are as follows.
The fault fitting result obtained by Figure 18 is k ˜ = 0.975, a ˜ = 0.078, and the corresponding deviation is about 0.08. It can be obtained from Figure 19 that the K t is set to 0.07. By fitting the parameter a ˜ , it can be quickly inferred that the fault type is the sensor constant deviation fault. After the constant deviation fault is identified, the deviation of a ˜ is corrected based on the sensor fault signal to achieve reconstruction of the fault signal.
From all the above simulation figures, it can be concluded that compared with the traditional BP network model, the DBN network model proposed in the paper can more accurately estimate the UAV’s pitch, yaw, roll, and actively respond to the UAV’s stroke. Whether the sensor is faulty or not depends on whether the measured value is a fixed value at a certain time. When it comes to faults, fault isolation is immediately performed to ensure safe operation of the rotor UAV flight system.

5. Conclusions and Future Works

In the paper, the DBN method based on data-driven methods is applied to the fault diagnosis of the rotor UAV flight system sensors, which effectively solves the problems of the shallow neural networks, such as over-fitting, local minimum, generalization ability, complex functions, insufficient representation ability, and so on, by establishing a deep network structure. The simulation results show that compared with the BP model, the fault diagnosis model has higher convergence speed and diagnostic accuracy and can be extended to sensor diagnosis of other systems. The method currently realizes the real-time fault diagnosis and is applicable to complex the rotor UAV nonlinear systems. On the basis of this, the memory characteristics of long- short-term memory networks (LSTM) can be used to explore the short-term fault prediction of sensors, and to curb the working state of the sensor anytime and anywhere and enable the fault to be effectively prevented before it occurs, which is a necessary means to ensure safe and efficient operation of the rotor UAV. Therefore, the current state of the rotor UAV flight system sensor is estimated in terms of big data mining to realize the system’s conditional maintenance and avoid major safety accidents.
In a word, based on the research of the paper, it is indispensable to analyze the different models of different models of UAVs to verify the applicability of the model. In addition, in terms of improving system reliability, it is necessary to carry out deep excavation of the rotor UAV flight data in order to realize the conditional maintenance and life prediction of the equipment.

Author Contributions

C.-X.W. and X.-M.C. principally conceived the idea for the study and was responsible for project administration. X.-M.C. and R.H. were responsible for preprocessing all the data and setting up experiments. X.-M.C., Y.W., N.-x.X., B.-B.J., and S.Z. wrote the initial draft of the manuscript and were responsible for revising and improving of the manuscript according to reviewers’ comments.

Funding

This research was supported by the National Key Research and Development Program of China (No. 2018YFC0810204, 2018YFB17026), National Natural Science Foundation of China (No. 61872242, 61502220), Shanghai Science and Technology Innovation Action Plan Project (17511107203, 16111107502) and Shanghai key lab of modern optical system.

Acknowledgments

The authors would like to appreciate all anonymous reviewers for their insightful comments and constructive suggestions to polish this paper in high quality.

Conflicts of Interest

The authors declare no conflict of interest. The founding sponsors had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, and in the decision to publish the results.

References

  1. Huo, Y.; Dong, X.; Lu, T.; Xu, W.; Yuen, M. Distributed and Multilayer UAV Networks for Next-Generation Wireless Communication and Power Transfer: A Feasibility Study. IEEE Internet Things J. 2019, 6, 7103–7115. [Google Scholar] [CrossRef]
  2. Gao, Y.H.; Zhao, D.; Li, Y.B. UAV Sensor Fault Diagnosis Technology: A Survey. Appl. Mech. Mater. 2012, 220, 1833–1837. [Google Scholar] [CrossRef]
  3. Ducard, G.; Rudin, K.; Omari, S.; Siegwart, R. Strategies for Sensor-fault Compensation on UAVs: Review, Discussions & Additions. In Proceedings of the 2014 European Control Conference (ECC), Strasbourg, France, 24–27 June 2014. [Google Scholar]
  4. Hansen, S.; Blanke, M. Diagnosis of Airspeed Measurement Faults for Unmanned Aerial Vehicles. IEEE Trans. Aerosp. Electron. Syst. 2014, 50, 224–239. [Google Scholar] [CrossRef]
  5. López-Estrada, F.R.; Ponsart, J.C.; Theilliol, D.; Astorga-Zaragoza, C.M.; Zhang, Y.M. Robust Sensor Fault Diagnosis and Tracking Controller for a UAV Modelled as LPV System. In Proceedings of the International Conference on Unmanned Aircraft Systems (ICUAS), Orlando, FL, USA, 27–30 May 2014. [Google Scholar]
  6. Ding, S.X. Model-Based Fault Diagnosis Techniques. IFAC Pap. 2016, 49, 50–56. [Google Scholar]
  7. Yoon, S.; Kim, S.; Bae, J.; Kim, Y.; Kim, E. Experimental evaluation of fault diagnosis in a skew-configured UAV sensor system. Control Eng. Pract. 2011, 19, 158–173. [Google Scholar] [CrossRef]
  8. Freeman, P.; Seiler, P.; Balas, G.J. Air data system fault modeling and detection. Control Eng. Pract. 2013, 21, 1290–1301. [Google Scholar] [CrossRef]
  9. Remus, A. Fault Diagnosis and Fault-Tolerant Control of Quadrotor UAVs. Ph.D. Thesis, Wright State University, Dayton, OH, USA, 2016. [Google Scholar]
  10. Yu, P.; Liu, D. Data-driven prognostics and health management: A review of recent advances. Chin. J. Sci. Instrum. 2014, 35, 481–495. [Google Scholar]
  11. Kim, J.H. Time Frequency Image and Artificial Neural Network Based Classification of Impact Noise for Machine Fault Diagnosis. Int. J. Precis. Eng. Manuf. 2018, 19, 821–827. [Google Scholar] [CrossRef]
  12. Guo, D.; Zhong, M.; Ji, H.; Liu, Y.; Yang, R. A hybrid feature model and deep learning based fault diagnosis for unmanned aerial vehicle sensors. Neurocomputing 2018, 319, 155–163. [Google Scholar] [CrossRef]
  13. Younes, Y.A.; Rabhi, A.; Noura, H.; Hajjaji, A.E. Sensor fault diagnosis and fault tolerant control using intelligent-output-estimator applied on quadrotor UAV. In Proceedings of the International Conference on Unmanned Aircraft Systems (ICUAS), Arlington, VA, USA, 7–10 June 2016. [Google Scholar]
  14. Sun, R.; Cheng, Q.; Wang, G.; Ochieng, W. A Novel Online Data-Driven Algorithm for Detecting UAV Navigation Sensor Faults. Sensors 2017, 17, 2243. [Google Scholar] [CrossRef] [PubMed]
  15. Qi, J.; Zhao, X.; Jiang, Z.; Han, J. An Adaptive Threshold Neural-Network Scheme for Rotorcraft UAV Sensor Failure Diagnosis; Springer: Berlin/Heidelberg, Germany, 2007. [Google Scholar]
  16. Shen, Y.; Wu, C.; Liu, C.; Wu, Y.; Xiong, N. Oriented Feature Selection SVM Applied to Cancer Prediction in Precision Medicine. IEEE Access 2018, 6, 48510–48521. [Google Scholar] [CrossRef]
  17. Guo, Y.; Shuang, W.; Gao, C.; Shi, D.; Zhang, D.; Hou, B. Wishart RBM based DBN for polarimetric synthetic radar data classification. In Proceedings of the 2015 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Milan, Italy, 26–31 July 2015. [Google Scholar]
  18. Senkul, F.; Altug, E. Adaptive control of a tilt—Roll rotor quadrotor UAV. In Proceedings of the 2014 International Conference on Unmanned Aircraft Systems (ICUAS), Orlando, FL, USA, 27–30 May 2014. [Google Scholar]
  19. Zeng, Y.; Xu, J.; Zhang, R. Energy Minimization for Wireless Communication With Rotary-Wing UAV. IEEE Trans. Wirel. Commun. 2019, 18, 2329–2345. [Google Scholar] [CrossRef]
  20. Mateo, G.; Luka, J. Gimbal Influence on the Stability of Exterior Orientation Parameters of UAV Acquired Images. Sensors 2017, 17, 401. [Google Scholar]
  21. Warsi, F.A.; Hazry, D.; Ahmed, S.F.; Joyo, M.K.; Tanveer, M.H.; Kamarudin, H.; Razlan, Z.M. Yaw, Pitch and Roll controller design for fixed-wing UAV under uncertainty and perturbed condition. In Proceedings of the 2014 IEEE 10th International Colloquium on Signal Processing and its Applications, Kuala Lumpur, Malaysia, 7–9 March 2014. [Google Scholar]
  22. Chao, Y.; Wu, J.; Wang, X. Roll and yaw control of unmanned helicopter based on adaptive neural networks. In Proceedings of the 2008 27th Chinese Control Conference, Kunming, China, 16–18 July 2008. [Google Scholar]
  23. Altan, A.; Hacioglu, R. Modeling of three-axis gimbal system on unmanned air vehicle (UAV) under external disturbances. In Proceedings of the 2017 25th Signal Processing and Communications Applications Conference (SIU), Antalya, Turkey, 15–18 May 2017. [Google Scholar]
  24. Rajesh, R.J.; Ananda, C.M. PSO tuned PID controller for controlling camera position in UAV using 2-axis gimbal. In Proceedings of the 2015 International Conference on Power and Advanced Control Engineering (ICPACE), Bangalore, India, 12–14 August 2015. [Google Scholar]
  25. Jie, K.; Li, Z.; Zhong, W. Design of Robust Roll Angle Control System of Unmanned Aerial Vehicles Based on Atmospheric Turbulence Attenuation. In Proceedings of the 2010 8th World Congress on Intelligent Control and Automation, Jinan, China, 7–9 July 2010. [Google Scholar]
  26. Wu, C.; Luo, C.; Xiong, N.; Zhang, W.; Kim, T. A Greedy Deep Learning Method for Medical Disease Analysis. IEEE Access 2018, 6, 20021–20030. [Google Scholar] [CrossRef]
  27. Nasersharif, B. Noise Adaptive Deep Belief Network For Robust Speech Features Extraction. In Proceedings of the 2017 Iranian Conference on Electrical Engineering (ICEE), Tehran, Iran, 2–4 May 2017. [Google Scholar]
  28. Chen, D.; Jiang, D.; Ravyse, I.; Sahli, H. Audio-Visual Emotion Recognition Based on a DBN Model with Constrained Asynchrony. In Proceedings of the 2009 Fifth International Conference on Image and Graphics, Xi’an, China, 20–23 September 2009. [Google Scholar]
  29. Alkhateeb, J.H.; Alseid, M. DBN—Based learning for Arabic handwritten digit recognition using DCT features. In Proceedings of the 2014 6th International Conference on Computer Science and Information Technology (CSIT), Amman, Jordan, 26–27 March 2014. [Google Scholar]
  30. Ke, W.; Chunxue, W.; Wu, Y.; Xiong, N. A New Filter Feature Selection Based on Criteria Fusion for Gene Microarray Data. IEEE Access 2018, 6, 61065–61076. [Google Scholar] [CrossRef]
  31. Hajinoroozi, M.; Jung, T.P.; Lin, C.T.; Huang, Y. Feature extraction with deep belief networks for driver’s cognitive states prediction from eeg data. In Proceedings of the IEEE China Summit & International Conference on Signal & Information Processing (ChinaSIP), Chengdu, China, 12–15 July 2015. [Google Scholar]
  32. Lei, Y.; Feng, J.; Jing, L.; Xing, S.; Ding, S. An intelligent fault diagnosis method using unsupervised feature learning towards mechanical big data. IEEE Trans. Ind. Electron. 2016, 63, 3137–3147. [Google Scholar]
  33. Lin, B.; Guo, W.; Xiong, N.; Chen, G.; Vasilakos, A.V.; Zhang, H. A Pretreatment Workflow Scheduling Approach for Big Data Applications in Multicloud Environments. IEEE Trans. Netw. Serv. Manag. 2016, 13, 581–594. [Google Scholar] [CrossRef]
  34. Liu, X.; Hu, Y.; Xu, Z.; Ren, Y.; Gao, T. Fault diagnosis for hydraulic system of naval gun based on BP-Adaboost model. In Proceedings of the 2017 Second International Conference on Reliability Systems Engineering (ICRSE), Beijing, China, 10–12 July 2017; pp. 1–6. [Google Scholar]
Figure 1. Rotor unmanned aerial vehicle (UAV) physical axis in (a) the axis of X, Y, (b) the axis of Z, and (c) the definition of Axis direction.
Figure 1. Rotor unmanned aerial vehicle (UAV) physical axis in (a) the axis of X, Y, (b) the axis of Z, and (c) the definition of Axis direction.
Electronics 08 01350 g001aElectronics 08 01350 g001b
Figure 2. Deep belief network (DBN) basic network structure.
Figure 2. Deep belief network (DBN) basic network structure.
Electronics 08 01350 g002
Figure 3. DBN feature extraction process in (a)the restricted Boltzmann machines (RBM) of the first layer, (b) the RBM of the second layer, and (c) the RBM of the third layer.
Figure 3. DBN feature extraction process in (a)the restricted Boltzmann machines (RBM) of the first layer, (b) the RBM of the second layer, and (c) the RBM of the third layer.
Electronics 08 01350 g003
Figure 4. Fault diagnosis algorithm flow.
Figure 4. Fault diagnosis algorithm flow.
Electronics 08 01350 g004
Figure 5. Accuracy of different RBM layers and training samples.
Figure 5. Accuracy of different RBM layers and training samples.
Electronics 08 01350 g005
Figure 6. The relationship between the number of iteration times and the error.
Figure 6. The relationship between the number of iteration times and the error.
Electronics 08 01350 g006
Figure 7. Effect of RBM layer number on classification accuracy.
Figure 7. Effect of RBM layer number on classification accuracy.
Electronics 08 01350 g007
Figure 8. The test of roll.
Figure 8. The test of roll.
Electronics 08 01350 g008
Figure 9. The test of yaw.
Figure 9. The test of yaw.
Electronics 08 01350 g009
Figure 10. The test of pitch.
Figure 10. The test of pitch.
Electronics 08 01350 g010
Figure 11. Pitch 10 s and −6.7° injection failure.
Figure 11. Pitch 10 s and −6.7° injection failure.
Electronics 08 01350 g011
Figure 12. Roll 18 s and −1.8° injection failure.
Figure 12. Roll 18 s and −1.8° injection failure.
Electronics 08 01350 g012
Figure 13. Yaw 21 s and −66.6° injection failure.
Figure 13. Yaw 21 s and −66.6° injection failure.
Electronics 08 01350 g013
Figure 14. Sensor deviation 2°/s pitch response curve.
Figure 14. Sensor deviation 2°/s pitch response curve.
Electronics 08 01350 g014
Figure 15. 2°/s error curve.
Figure 15. 2°/s error curve.
Electronics 08 01350 g015
Figure 16. Sensor deviation 0.3°/s roll response curve.
Figure 16. Sensor deviation 0.3°/s roll response curve.
Electronics 08 01350 g016
Figure 17. 0.3°/s error curve.
Figure 17. 0.3°/s error curve.
Electronics 08 01350 g017
Figure 18. Sensor deviation 0.08°/s yaw response curve.
Figure 18. Sensor deviation 0.08°/s yaw response curve.
Electronics 08 01350 g018
Figure 19. 0.08°/s error curve.
Figure 19. 0.08°/s error curve.
Electronics 08 01350 g019
Table 1. Corresponding of fault type and parameter.
Table 1. Corresponding of fault type and parameter.
Fault TypeParameter
Constant deviation fault k = 1 , | a | K t
Constant gain fault k 1 , | a | < K t
Table 2. Comparison of root mean square error (RMSE) (°)/s and R2 of the two models.
Table 2. Comparison of root mean square error (RMSE) (°)/s and R2 of the two models.
ModelsRollYawPitchR2
BP10.34156.3474.750.82
DBN1.6589.425.640.94

Share and Cite

MDPI and ACS Style

Chen, X.-M.; Wu, C.-X.; Wu, Y.; Xiong, N.-x.; Han, R.; Ju, B.-B.; Zhang, S. Design and Analysis for Early Warning of Rotor UAV Based on Data-Driven DBN. Electronics 2019, 8, 1350. https://doi.org/10.3390/electronics8111350

AMA Style

Chen X-M, Wu C-X, Wu Y, Xiong N-x, Han R, Ju B-B, Zhang S. Design and Analysis for Early Warning of Rotor UAV Based on Data-Driven DBN. Electronics. 2019; 8(11):1350. https://doi.org/10.3390/electronics8111350

Chicago/Turabian Style

Chen, Xue-Mei, Chun-Xue Wu, Yan Wu, Nai-xue Xiong, Ren Han, Bo-Bo Ju, and Sheng Zhang. 2019. "Design and Analysis for Early Warning of Rotor UAV Based on Data-Driven DBN" Electronics 8, no. 11: 1350. https://doi.org/10.3390/electronics8111350

APA Style

Chen, X.-M., Wu, C.-X., Wu, Y., Xiong, N.-x., Han, R., Ju, B.-B., & Zhang, S. (2019). Design and Analysis for Early Warning of Rotor UAV Based on Data-Driven DBN. Electronics, 8(11), 1350. https://doi.org/10.3390/electronics8111350

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop