1. Introduction
Globally, in response to the technological advancement of drone micro-electromechanical systems, drone costs have fallen sharply. Applications of optoelectronic payloads and stabilizers become a standard component. Optoelectronic payloads are detection devices installed on servo-stabilized systems on moving carriers such as drones and track ground or air targets based on boresight stabilization technology. With the varying attitude angle of the carrier, the gyroscope’s photoelectric payload can be moved without interference from the carrier, stabilizing the picture and ensuring the detection equipment on the photoelectric payload. It can be found that optoelectronic payloads for drones have the advantage of key components in the drone market. Therefore, from 2016 to 2018, many solutions were obtained due to the microchip processing speed and data amount increased. It causes many different applications of optoelectronic payloads used in various military and civilian fields. As the speed of the processor increases and the amount of data processing is larger. The ability to perform calculations on images and signals simultaneously in one processor emerges is required. The traditional advantages of mechanical processing are gradually reduced, and instead, the image stabilization processing is performed by the image and the mechanical gimbal frame at the same time. Image processing research is the basis of this method. It can greatly increase the processing efficiency and reduce the use of the hardware gimbal frame. It further reduces the weight and resource consumption of image stabilization and can further utilize one processor to complete multiple tasks in the traditional method. The performance evaluation index of the photoelectric payload is the reliability of the data captured by the optical sensor. Therefore, the reliability of the data captured by the sensor is a key issue. Ref. [
1] proposed an adaptive unscented Kalman filter for target tracking with time-varying noise covariance based on multi-sensor information fusion. An innovative optimal information fusion methodology is proposed based on adaptive and robust unscented Kalman filter (UKF) for multi-sensor nonlinear stochastic systems. Based on the linear minimum variance criterion, this multi-sensor information fusion method has a two-layer architecture: at the first layer, a new adaptive UKF scheme for the time-varying noise covariance was developed and served as a local filter to improve the adaptability together with the estimated measurement noise covariance by applying the redundant measurement noise covariance estimation, which is isolated from the state estimation; the second layer is the fusion structure to calculate the optimal matrix weights and gives the final optimal state estimations. Based on the hypothesis testing theory with the Mahalanobis distance, the new adaptive UKF scheme utilizes both the innovation and the residual sequences to adapt the process noise covariance timely. The results of the target tracking simulations indicate that the proposed method is effective under the condition of time-varying process-error and measurement noise covariance. Ref. [
2] proposed the design of a stability control system for pan, tilt, and roll. The traditional gyro stabilization control method uses the angle information obtained by the gyro to extend the Kalman Filter and then corrects the position of the three-degree-of-freedom velocity integration in the original space [
3]. The value is provided to calculate the parameters of the controller, and the angle information can be converted into a unit pulse (Pulse Width Modulation, PWM) signal to the MOSFET to generate a higher voltage to the motor. Ref. [
4] uses a fixed-wing UAV to implement an automatic tracking system design. Reference [
5] analyzed the various error sources of the three-axis gimbal, used the accuracy of the gyro error model to calibrate the accuracy, and established the photoelectric payload and the gyro model error equation. However, the actual internal error of the controller includes: gyro accuracy error, acceleration caused by gyro angle error, motor response time delay, payload vibration, and image scanning frequency resonance [
6,
7]; the above four items cause the final imaging image to be unclear or fuzzy. Key factors proposed a fuzzy PID control algorithm to control the operational stability of the photoelectric payload. However, updating the accuracy of the gyroscope causes a serious burden on the private sector. Acceleration caused by the gyro angle error can be corrected in real-time by setting the spatial position of one axis in the middle of the three-axis degrees of freedom as a constant. References [
3,
8] reposed a PTZ controller design method for UAV’s own motion compensation. By analyzing the effects of translation and rotation movement of UAV on the imaging of the target in the image plane, the relationship between the angular velocity of the body and the angular velocity of the PTZ attitude. In order to compensate for the change in the camera’s boresight caused by the UAV’s own motion, the position of the target in the image plane is used to correct the camera tracking error. In this paper, applied to the electro-optical infrared payload system’s vision offset function, the multiple layer Kalman Filter characteristic [
9] was used to analyze the filter inertial measurement unit and design a trigger to set up the layer number of Kalman Filter [
10]. This function can decrease the electro-optical infrared payload system’s number of stabilized motors, the weight of the device, central processing unit resource, and power. In this research, an illustrated example of a three-axis, three-gimbal system was analyzed by mathematical and simulation inertial measurement unit vibration. Three-axis degrees of freedom vector and gimbal motion data obtained by Kalman filter were provided to the graphics processing unit for offsetting each frame [
10,
11]. However, none of the studies mentioned above can be optimized for the efficiency of image stabilization. Additionally, the constraints of the main research are based on the design of the existing hardware. The fundamental problem is not substantially solved, and the response of the motor of the ring frame is lower due to the physical limit. Less response performance means that a powerful motor and lower energy consumption are required. The stabilization of the image only assists the system. Therefore, based on the literature [
9,
10,
11], this study designs the input of the adaptive Kalman filter and combines the image tracking technology of [
7] to improve the image quality. The goal was to use the proposed method to reduce the dependence of the optoelectronic payload driven by the motor. Ref. [
12] proposed a dual-optimized adaptive Kalman filtering algorithm based on bp neural network and variance compensation for laser absorption spectroscopy. The analysis and design of a dual-optimized adaptive Kalman filtering (DO-AKF) algorithm based on back propagation (BP) neural network and variance compensation was developed for high-sensitivity trace gas detection in laser spectroscopy [
13,
14]. The BP neural network was used to optimize the Kalman filter (KF) parameters. Variance compensation was introduced to track the state of the system and to eliminate the variations in the parameters of dynamic systems. The proposed DO-AKF algorithm showed the best performance compared with the traditional multi-signal average, extended KF, unscented KF, KF optimized by BP neural network (BP-KF), and KF optimized by variance compensation (VC-KF) [
15]. The optimized DO-AKF algorithm was applied to a QCL-based gas sensor system for an exhaled CO analysis [
16]. The experimental results revealed a sensitivity enhancement factor of 23. The proposed algorithm can be widely used in the fields of environmental pollutant monitoring, industrial process control, and breath gas diagnosis.
The above mainly describes the design improvement and analysis of the EKF. The improvement of this algorithm was also used on UAVs, such as in reference [
17]. A UAV swarm navigation using dynamic adaptive Kalman filter and network navigation was proposed. Ref. [
18] also proposed a multi-sensor fusion based on adaptive Kalman filtering. The optimal performance of the conventional Kalman filters is not guaranteed when there is uncertainty in the process and measurement noise covariances. In this paper, in order to reduce the effect of noise covariance uncertainty, the Fuzzy Adaptive Iterated Extended Kalman Filter (FAIEKF) and Fuzzy Adaptive Unscented Kalman Filter (FAUKF) are proposed to overcome this drawback. The Proposed FAIEKF and FAUKF were applied to fuse signals from Global Positioning System (GPS) and Inertial Navigation Systems (INS) for the autonomous vehicles’ navigation. In order to validate the accuracy and convergence of the proposed approaches, results obtained by FAUKF and FAIEKF were compared to the Fuzzy Adaptive Extended Kalman Filter (FAEKF), Extended Kalman Filter (EKF), Unscented Kalman Filter (UKF), and Iterated Extended Kalman Filter (IEKF). The simulation results illustrate superior performance. It can be seen that the design and improvement of EKF are very common on UAVs. This research mainly focuses on the integration and optimization of image stabilization and UAV photoelectric stabilization systems. Ref. [
10] showed a modified unscented Kalman filter using modified filter gain and variance scale factor for highly maneuvering target tracking. It improved the low tracking precision caused by lagged filter gain or imprecise state noise when the target highly maneuvers; therefore, a modified unscented Kalman filter algorithm based on the improved filter gain and adaptive scale factor of state noise was presented. In every filter process, the estimated scale factor is used to update the state noise covariance, and the improved filter gain is obtained in the filter process of unscented Kalman filter (UKF) via predicted variance, which is similar to the standard Kalman filter. Simulation results show that the proposed algorithm provides better accuracy and ability to adapt to the highly maneuvering target compared with the standard UKF. Ref. [
8] proposed a dynamic Siamese network with an adaptive Kalman filter for object tracking in complex scenes. It solved the problem that the tracking accuracy of a fully-convolutional Siamese network (SiamFC) in complex scenes such as similar object interference, fast-moving, and appearance change is not good. A new object tracker based on a dynamic template updating strategy and the re-location mechanism based on the adaptive Kalman is proposed. In order to suppress the object interference and overcome the instability of fast-moving object tracking, an adaptive Kalman filter method was designed to change the selection method of search region and select the bounding box of the object closest to the predicted position. For the adaptation of appearance change, the high-confidence tracking results are fused with the initial template to dynamic update the template. Compared with the traditional Kalman filter, the expectation of residual error for the adaptive Kalman filter method can be controlled in a low range by the adjustment of the gain online. The introduction of the adaptive Kalman-based re-location mechanism improves the discriminative ability of SiamFC in the interference scene. With the dynamic template updating strategy, the tracker obtains strong generalization capability to adapt to the appearance change of the tracking target. It was demonstrated that the proposed method performs real-time object tracking at the speed of 43 fps and achieves competitive performance on OTB, VOT, and TC128 datasets compared with other state-of-the-art trackers.
The above research shows that the trend of the current technology is gradually changing from separate motor and image stabilization to pure gyro image stabilization. However, the above literature also shows that it is very unwise to use the gyroscope for image stabilization directly, and the noise on the gyroscope still has a large deviation even after traditional filtering. Therefore, how to use the gyroscope on the drone to perform image stabilization more effectively is a key issue.
This study shows the completion of the coordinate analysis of photoelectric gimbal payload and the preliminary description of coordinates, and the analysis was carried out under the assumption that variables such as material deformation are not considered. Mainly, the Minimum Output Sum of Squared Error (MOSSE) and Fully-Convolutional Siamese Networks (SiamFC) extraction methods of images were firstly established for theoretical establishment. The tracked target was then extracted relative to the error variable in the image. The theory of adaptive extended Kalman filter was established and described. Then substitute the variables were extracted by MOSSE and SiamFC. The actual hardware was verified first and then verified on different devices. The image error should be lower than one pixel, and the steady-state error of the gyroscopes in the experimental should be lower than 100 LSB/°/s. The key experimental data related to this study were extracted and shown in the following sections of the result and conclusion.
Table 1 shows the relevant abbreviations and symbols used in the following.
3. Experimental Results and Analysis
In this research, the inertial measurement units are used in the experiment, where the angular scale range is ±2000°/s, the angular of sensitivity is 16.4 LSB/°/s, the angular noise rate is 0.005 mdps/rtHz, the acceleration of scale range is ±16 gravity, and the acceleration of sensitivity is 2048 LSB/g. Two electric optical payloads are on the same platform and use the same inertial measurement units. The inertial measurement units used in the electric optical payload are with angular scale range ±1000°/s, angular of sensitivity 32.8 LSB/°/s, angular noise rate 0.005 mdps/rtHz, acceleration of scale range ±8 gravity, and acceleration of sensitivity 4096 LSB/g.
In this study, a constant with a radix of 4 was given as an input, plus a variable of random
intervals to simulate the noise, and a single-layer extended Kalman filter was performed for the experiment. As shown in
Figure 5, the input source is marked blue line. The value under extended Kalman filtering is given as an orange line.
Figure 5b shows the simulation noise with random
interval variables under extended Kalman filtering.
As shown in
Figure 5a, with a single layer of extended Kalman filtering, the original base number constant 4 is shown in
Figure 5b. The output result is different from the original base number 4, so it still causes errors from the original signal that must be compensated in the controller design. The gyro stabilization system designed by this institute is equipped with a three-gyro system on a drone and sacrifices one axis as a calibration axis for each unit of inertial measurement. The three inertial measurement outputs are complementary to the other two. Furthermore, the stability accuracy of drones and optoelectronic payloads are improved. The angular acceleration and acceleration output by the inertial measurement unit is obtained through an integrator, and the speed directly enters the adaptive extended Kalman filter loop. The angular velocity then outputs the angle data through the integrator and into the adaptive extended Kalman filter loop. The angular velocity and velocity output is integrated through the angular acceleration and acceleration output from the inertial measurement unit, and the angular velocity is output as an angle once integrated. The system designs a low-pass filter at the angular acceleration to filter and output to the trigger unit, where the unit decides to extend the Kalman filter filtering times. Through the final outputs of angle and speed to compare whether its Kalman gain value is much low, and raise a correction warning.
In the system design layout, the low-pass filter is generally limited to the average value of the input signal. The current average value of the input signal is obtained through the average filter, and upper and lower limits are given for the average value of the input signal. The process of Kalman filtering mainly avoids triggering more layers of extended Kalman filtering, so the limits for this low-pass filtering are shown in
Table 2.
When the difference is larger, a multi-layer Kalman filter is used. Due to the infinite increase in the simulation, the experimental hardware is physically limited in the actual environment. There must be an upper limit and a warning to provide a gyroscope for recalibration and a warning that the Kalman filter gain is too low and needs to be recalibrated. Among them, judgment is given in the experiment.
In the actual flight process of the strong wind environment or high-speed flight, the acceleration has a chance to exceed 6 m/s2, so only the alarm reminder is provided; when the square difference exceeds 8 m/s2, the damage to the aircraft is lost. Therefore, during the process of warning and calibration of the gyroscope, it is detected synchronously whether the inertial gauge is malfunctioning or other sensing element is malfunctioning.
Figure 6 shows the result with the analog input of the sine wave and with one layer EKF. The input signal source is blue, the output signal after filtering through EKF is orange, and the output image at the same time is 54 fps. The average power consumption of the body system is 5.233 W.
Figure 7 shows the result with the analog input of the sine wave and with the EKF of three layers. The input signal source is marked by a blue line, and the output signal with filtering through EKF is as an orange line. The frame rate of the output image is 41 fps. The average power consumption of the body system is 7.006 W.
Figure 8 shows the result with the analog input of a sine wave and with an EKF of four layers. The input signal source is a blue line, and the output signal is an orange line. The output frame rate is 39 fps. The average power consumption of the body system is 7.853 W.
Figure 9 shows the result with the analog input of a straight-up and EKF of one layer. The input signal source is a blue line, and the output signal with EKF is an orange line. The output image is 54 fps per frame, while the average power consumption of the body system is 4.321 W.
Figure 10 shows the result with three layers of EKF. The input signal source is a blue line, and the output signal is an orange line. The output image at the same time is 43 fps, and the average power consumption of the body system is 6.337 W.
Figure 11 shows the result with the analog input of a straight-up and EKF of four layers. The input signal source is a marked blue line, and the output signal is a marked orange line. The output frame rate is 40 fps, and the average power consumption of the body system can be calculated as 7.100 W.
After adaptive multi-layer extended Kalman filtering, the output information can be provided to the image compensation for the stabilization effect that cannot be achieved by mechanical stabilization. As shown in
Figure 12, the image vibration compensation is provided after filtering through the gyroscope to fit the image stabilization requirements of the drone or photoelectric payload
The angle of view of the roll effect when the drone is placed horizontally is shown in
Figure 13a, and the angle of view of the payload tilted by 45 degrees is shown in
Figure 13b. The image can be compensated by the gyroscope to achieve the effect of non-mechanical stabilization.
Figure 14 shows the result of a single-layer EKF actually swinging within
degrees. The input signal is blue, and the output signal is orange. The output image is 56 fps, airborne. The average power consumption of the ROS hardware system is 4.511 W.
Figure 15 shows the result of two layers of EKF flying in the range of
degrees. The input signal is blue, the output signal is orange, and the output image is 49 fps. The average power consumption of the ROS hardware system is 5.277 W.
Figure 16 shows the result of three layers EKF under actually swinging within
degrees. The input signal is blue, the output signal is orange, and the output image is 43 fps. The average power consumption is 6.866 W.
Figure 17 shows the result of four layers of EKF with actually swinging at
degrees. The input signal is blue, and the output signal is orange. The output image is 38 fps. The average power consumption of the ROS hardware system is 7.122 W.
Figure 18 shows the result of a single-layer EKF actually swinging within
degrees. The input signal is blue, and the output signal is orange. The output image is 51 fps. The average power consumption of the ROS hardware system is 3.998 W.
Figure 19 shows the result of a single-layer EKF actually swinging within
degrees. The input signal is blue, and the output signal is orange. The output image at the same time is 38 fps. The average power consumption of the ROS hardware system is 5.512 W.
In the above verification data, it can be found that the method designed in this study is feasible. Despite the constraints of the hardware, the simultaneous processing of image and gyro information by a single processor results in a reduction in frames per second and cannot maintain the original image output of 60 fps.
Table 3 shows the comparison between the proposed method and the traditional method that separates frame stabilization and image stabilization. Note that the gimbal holder is locked and can be measured during the study. It can be seen that using the traditional method, because the camera is separated from the gimbal frame, the frame rate of the output image is corrected and stabilized by the fixed global image of each frame, so it is kept at 50 fps. However, the proposed method in this study uses a single GPU for calculation, provides partial sampling and performs adaptive Kalman filtering for image stabilization, and cannot provide stable output frames per second, which is between 31 and 56 fps. In the power consumption comparison of GPU, the average power consumption of the traditional method is 9.6 watts, and the power consumption of the method designed in this study is 6.3 watts. In the comparison of the horizontal and vertical angles with the maximum allowable and no steady-state error, because the traditional method uses global image sampling, the cropping area is small, and the sampling cannot exceed the existing steady-state error, so the horizontal angle cannot be shifted by more than
degrees, and the vertical cannot exceed
degrees. The method used in this study is local sampling, so as long as it is within the camera’s viewing angle, it can be sampled and corrected at the same time with the gyroscope. Therefore, the horizontal angle cannot be offset by more than
degrees, and the vertical angle cannot exceed
degrees.
However, in terms of practical application scenarios, the ring frame design of the traditional optoelectronic payload will activate the motor for hardware stabilization, thus increasing the maximum horizontal and vertical allowable angles, as shown in
Table 4. Horizontal and vertical angles can be rotated without restriction. However, in this way, the power consumption is greatly increased to 32.1 watts.