1. Introduction
A tropical cyclone (TC) is an inherent atmospheric feature of tropical and subtropical regions. TCs generate strong winds and waves, and also often include heavy rain and storm surges that result in significant damage to coastal communities. The frequency of TCs in the Western North Pacific (WNP) is very high compared to other regions [
1]. As a result, coastal communities around the WNP suffer serious casualties and property losses annually [
2,
3,
4]. The area covered by the Northwest Pacific Basin is to the north of the equator and to the west of 180° E, and includes the South China Sea. The forecast skill of TC intensity in this region has displayed no signs of significant improvement by using numerical simulation methods in recent years [
5]. Thus, this study focuses on improving the forecast skill of TC intensity in the WNP.
The characteristics that affect TC intensity are nonlinear and thus difficult to predict [
2]. In recent years, many researchers have studied TC intensity prediction primarily using numerical forecasting and statistical methods [
6,
7,
8,
9,
10]. Numerical forecasting is the main tool used to forecast TCs around the world, and systems such as the European Centre for Medium-Range Weather Forecasts-Integrated Forecasting System (ECMWF-IFS) [
11], the Japan Meteorological Agency’s global spectral model (JMA-GSM) [
12], and the National Centers for Environmental Prediction-Global Forecast System (NCEP-GFS) [
13] have been developed as operational techniques. Statistical intensity forecast methods such as Climatology and Persistence (CLIPER) [
14] and the Statistical Hurricane Intensity Prediction Scheme (SHIPS) [
15] have also been developed as operational techniques. However, advancements in TC intensity prediction have been relatively slow, although notable work has been done in the past 20 years on predicting TC paths [
16]. One of the reasons for this is that the internal structure of storms is not yet sufficiently understood. TCs have an asymmetric structure, which may be caused by thermal and dynamic factors such as uneven distribution of sea surface temperature and humidity, horizontal or vertical shear, and asymmetric distribution of convection [
2,
17]. Changes in TC intensity are controlled by many environmental and oceanic variables, including oceanic heat, vertical wind shears, and underlying surfaces changes [
9]. Because these factors exhibit nonlinear characteristics, TC intensity is difficult to predict accurately using traditional statistical and numerical forecasting methods [
2]. Thus, conventional linear statistical methods have difficulties when applied to nonlinear systems (e.g., forecasting TC intensity). At present, artificial neural network (ANN) methods can be used to effectively solve complex nonlinear issues, and numerous studies have concentrated on novel modeling techniques to improve the accuracy of TC intensity predictions [
18,
19,
20].
In recent years, studies have clearly showed the potential of machine learning systems such as back propagation neural networks (BPNNs) to forecast TC intensity [
18]. However, the traditional BPNN is unstable and less accurate than a probabilistic neural network (PNN) [
2], which is a radial basis function neural (RBFN) network that is strongly fault-tolerant, and has adaptive capabilities. The k-nearest-neighbor (K-NN) machine learning algorithm has also been applied to predict TC intensity based on microwave imager data [
21]. A comparison of TC intensity prediction accuracy using multilayer perceptron (MLP), multiple linear regression, RBFN, and ordinary linear regression indicated that MLP had the smallest prediction error [
22]. The MLP model may thus also be considered an alternative to the conventional operational forecast models for predicting TC intensity [
23]. Other research predicted cyclone wind intensity in the South Pacific using Elman recurrent neural networks [
24] and proved that the accuracy of TC intensity forecasts using a double hidden layer neural network is higher than that found by using a single layer neural network [
25]. These previous models for predicting TC intensity have therefore been popular with researchers.
Although these models have been used widely by many researchers to forecast TC intensity, each method has unique shortcomings in various areas, including predictive accuracy, model interpretability, and computational efficiency. For example, BPNN performs well for most of these standards; however, the accuracy of BPNN model predictions does not satisfy the needs of operational applications. Therefore, more accurate TC intensity prediction models are needed. Several researchers have stated that the extreme gradient boosting (XGBOOST) machine learning method, which is based on the gradient boosting decision tree (GBDT), is a promising classification model [
26]. GBDT is an iterative decision tree algorithm comprising multiple decision trees, with the sum of the results of all decision trees constituting the final result. It is frequently applied to solve classification and regression problems [
27]. In the process of gradient boosting machine model operation, the objective of every iteration is to reduce the residuals of the previous iteration. In order to eliminate the residual, the gradient boosting machine model builds a new model in the direction of the gradient of residual reduction. However, the objective function of the GBDT has no penalty term and may therefore be overfitted. The error function of the GBDT is the first derivative, which has a slow convergence speed. Based on the GBDT model, the XGBOOST model has the following changes: (1) Penalty terms are incorporated into the objective function of XGBOOST to prevent overfitting; and (2) the error function of XGBOOST is the second derivative, which can converge faster on the training set. These advantages and the fact that it has a very strong predictive ability and has recently been successful in many machine learning competitions led us to choose the XGBOOST model to forecast TC intensity in the WNP. Sheridan et al. used the XGBOOST model to quantify structure-activity relationships, with fast convergence speed being exhibited [
28]. The XGBOOST model has also been used to predict agricultural crop yields [
29]. In addition, it provides the following advantages: (1) It utilizes parallelism, is easy to use, and has impressive prediction accuracy; and (2) it has an intrinsic capability to manage the highly diverse and complex features of predictors. The structure of a TC and the factors affecting its development are very complicated; the XGBOOST model is highly suited for such conditions.
In this study, XGBOOST-based frameworks for TC intensity prediction in the WNP were established based on China Meteorological Administration-Shanghai Typhoon Institute (CMA-STI) data for the period 1979–2017. These data were used because they include significant amounts of coastal sounding data, which are useful for analyzing TCs in the WNP. Analyses of the dynamics of the development of TC intensity were conducted by comparing the differences between the predicted and real data. Predicting TC intensity using the XGBOOST model not only results in a series of predictors, but also improves the accuracy of the predictions. The primary objectives of this study were to (1) extract various features for predicting TC intensity from CMA-STI data; (2) establish a TC dataset (1979–2017) based on machine learning and explore XGBOOST frameworks for predicting TC intensity with lead times of 6, 12, 18, and 24 h; and (3) demonstrate that this method outperforms several baseline techniques. The results of this study provide a novel and feasible method with which to improve TC intensity prediction accuracy.
4. Discussion
In order to determine the impact of different factors on the forecast results, we designed six models: A1 (PF, IC, BF), A2 (A1 + CF), B1 (A1 + MON), B2 (A2 + MON), C1 (B1 + EF), and C2 (B2 + EF). The research results indicate that when climatology and persistence factors, brainstorm and predictive features, intensity category, and TC month are used as model inputs, the best TC intensity forecast accuracy with 6, 12, 18, and 24 h lead times occurs in the six models. We theorize that there are three main reasons for this.
Firstly, we found that the XGBOOST model with the TC month feature is better than one without it because of differing relationships between the predictors and TC intensity in each month, which made it necessary for monthly experiments to be performed. Huang et al. divided all the samples into five parts and built a monthly ANN model (June–October). They found that the results of the monthly ANN model were better than those of the CLIPER method for TC intensity forecasts [
41]. Several researchers consider the months from July to November as the period in the year with the most TC activity [
42]. In this study, for the first group, the B2 model with the predictor TC month with a 24 h lead time had good MAE, CC, and NRMSE results: 4.06 m/s, 0.91, and 7.84%, respectively. The A2 model without the TC month predictor and 24 h lead time had the following results: MAE of 4.35 m/s, CC of 0.90, and NRMSE of 8.31%. For the second group, when we trained the PF, IC, BF, and MON datasets (B1 model), the MAE of the 24 h forecast for the test samples was 4.09 m/s; when we trained the PF, IC, and BF datasets (A1 model), the MAE of the 24 h forecast for the test samples was 4.41 m/s. Thus, the month feature is very important to process TC predictions. Secondly, our next finding was that an XGBOOST model with environmental factors is better than one without those features. For the first group, when it was trained with the PF, IC, BF, MON, and EF datasets, the MAE of the 24 h forecast for the test samples was 3.91 m/s; when trained with the PF, IC, BF, and MON datasets, the same MAE was 4.09 m/s. For the second group, the MAE of the 24 h forecast in model B2 for the test samples was 4.06 m/s; the MAE of 24 h forecast in model C2 for the same samples was 3.70 m/s. Finally, climatology predictors also affect the forecast results from the XGBOOST model. The climatology predictors developed by Neumann [
43] are feasible and have reasonable bases in meteorology [
44]. For example, such predictors include: latitude and longitudes at present and 12, 24, 36 and 48 h prior, pressures at present and 12, 24, 36 and 48 h prior, directions of the storm motions at present and 12, 24, 36 and 48 h prior. In addition to the above information, when we established the climatology predictors, we added the difference between the latitude at the current time and the latitude 6, 12, 18, and 24 h prior, the difference between the longitude at the current time and the latitude 6, 12, 18, and 24 h ago, the difference between the 2-min mean maximum sustained wind near the center of the TC at the current time and the latitude 6, 12, 18, and 24 h prior, etc. (
Table 3). The MAE for the 24 h forecast in model A1 for the test samples was 4.41 m/s, while that value in model A2 was 4.35 m/s. The MAE of 24 h forecast in model B1 for the test samples was 4.09 m/s, while the MAE of the 24 h forecast in model B2 for the test samples was 4.06 m/s. The MAE of the 24 h forecast in model C1 for the test samples was 3.91 m/s, while that value was 3.70 m/s in model C2.
As the prediction time increases, the accuracy of the TC intensity predicted using the XGBOOST model gradually decreases. The result indicates that the MAE and NRMSE values gradually increase with prediction time, whereas the value of CC gradually decreases. For example, the CCs of the 6, 12, 18, and 24 h forecasts for test samples in model C1 were 0.99, 0.97, 0.95, and 0.93, respectively. The NRMSEs of these forecasts were 3.15, 4.86, 6.25, and 7.59, respectively, and the MAEs were 1.64, 2.50, 3.24, and 3.91 m/s, respectively. However, the reduction/growth amplitude of each model in the CC/NRMSE differs as the prediction time increases. The relationship between the CC and NRMSE for TC intensity forecast with a 6, 12, 18, and 24 h lead time is shown in
Figure 8. The results show that the distance between the C models and the A models in the graph increase with the prediction time. On combining
Figure 8a–d, it can be seen that the C models approach the lower right-hand corner with increase in lead time. For example, the CCs of the 6, 12, 18, and 24 h forecasts for the test samples in model C2 were 0.99, 0.97, 0.95, and 0.93, respectively. The NRMSEs of these forecasts were 3.09, 4.72, 6.00, and 7.18, respectively. This means that the CCs of the C models approach one and the NRMSEs approach zero with increasing lead time. When all parts of
Figure 8 are combined, the A models approach the top left-hand corner with increase in lead time. For example, the CCs of the 6, 12, 18, and 24 h forecasts for the test samples in model A2 were 0.98, 0.96, 0.93, and 0.90, respectively, whereas their NRMSEs were 3.39, 5.28, 6.87, and 8.31, respectively. This means that the CCs of the A models have lower correlations, with values much less than one, whereas the NRMSEs are higher.
The prediction results show that the XGBOOST method is appropriate for predicting TC intensities.
Table 8 shows the MAEs of the maximum wind speed forecasts with a 24 h lead time using each method. The prediction error of the XGBOOST model is comparable to that of a previously used model for 24 h predictions. The MAEs for a 24 h lead time in seven machine learning methods—k-nearest neighbor [
21], neural network [
45], fuzzy neural network [
46], artificial neural network [
5], multilayer feed forward neural nets [
25], Elman recurrent network [
24], and probabilistic neural network [
2]—were determined for test samples and found to be 8.22, 3.44, 3.52, 4.74, 2.98, 3.58, and 2.93 m/s, respectively. Because the predictors used in the previous studies are not exactly the same as the predictors in this paper, we used a BPNN to predict the TC intensity under the same sample input parameter. The MAEs for a 24 h lead time in six BPNN methods—A3 (input parameters the same as model A1), A4 (input parameters the same as model A2), B3 (input parameters the same as model B1), B4 (input parameters the same as model B2), C3 (input parameters the same as model C1), and C4 (input parameters the same as model C2)—were determined for test samples and found to be 5.48, 5.25, 5.41, 5.23, 5.05, and 4.57 m/s, respectively. The MAEs of 24 h lead time in six models (A1, A2, B1, B2, C1, and C2) for the test samples were 4.41, 4.35, 4.09, 4.06, 3.91, and 3.70 m/s, respectively. The prediction results of the XGBOOST models were better than those made by the BPNN model with the same sample requirements. The XGBOOST model has many advantages, including a simple training process, low computer-processing costs, and fast convergence, compared to ANNs [
47]. Therefore, using the XGBOOST model is very advantageous for predicting TC intensity. This significant finding thus supports performing predictions within 24 h using the XGBOOST model as a new method for TC intensity prediction.
5. Conclusions
In this study, we established a series of predictors using the Best Track TC dataset to predict the intensity of TCs in the Western North Pacific (WNP) and conducted the following experiments to improve intensity prediction accuracy. First, we designed a feature set using the brainstorming and CLIPER methods. Then, we used the CMA of the WNP near China as a data source and predicted the 6, 12, 18, and 24 h intensities of TCs for the period 1979–2017 under six scenarios using the XGBOOST model. Finally, we tested the performance of the XGBOOST model using the strongest recent TCs—specifically, Hato, Rammasum, Mujiage, and Hagupit. The following results were obtained:
(1) The prediction accuracy of the XGBOOST model improved by climatology and persistence factors, environmental factors, brainstorm features, intensity category, and TC month. We analyzed the prediction of TC intensity using the XGBOOST model under six scenarios; all of them produced a mean absolute error (MAE) < 4.50 m/s, a correlation coefficient (CC) > 0.89, and a normalized root mean square error (NRMSE) < 10.00%. Among models A (A1 and A2), B (B1 and B2), and C (C1 and C2), we determined that model C2 was the most accurate predictor of TC intensity in the six scenarios.
(2) The NRMSE, MAE, and CC parameters were used to evaluate the performance of the XGBOOST model in the WNP. The MAEs of the 6, 12, 18, and 24 h lead times for the test sample forecasts were 1.61, 2.44, 3.10, and 3.70 m/s, respectively; the CCs were 0.99, 0.97, 0.95, and 0.93, respectively; and the NRMSEs were 3.09, 4.72, 6.00, and 7.18%, respectively. The MAE and NRMSE values gradually increased with lead time, whereas the CC value gradually decreased. The prediction accuracy of our XGBOOST model was found to be higher than that of traditional BPNN models for the same predictors and independent prediction samples.