1. Introduction
Tropical cyclones are low-pressure cyclones that occur in tropical or subtropical oceans and have a great impact on the surrounding environmental systems. Tropical cyclones can grow into tropical depressions (TDs), tropical storms (TSs), strong tropical storms (STSs), typhoons (TYs), strong typhoons (STYs), and super typhoons (super TYs) (defined by the China Meteorological Administration). When the maximum wind speed near the center of a tropical cyclone reaches Level 12 (32.7 m/s or more), it is called a typhoon. A typhoon is a kind of disastrous weather system with strong destructive force. When a typhoon arrives, it causes a storm surge, heavy rainfall, a tide change, and other disasters, which bring huge losses to the lives and property of people. Based on the data recorded by the China Meteorological Administration (CMA), on average, no less than seven typhoons hit the southeast coastal region of China every year, which is the most highly developed and populated region in China. Therefore, it is of great practical significance to forecast typhoons in an accurate and timely manner.
Several methods for typhoon forecasting exist. At present, most researchers use numerical prediction methods based on dynamics theory [
1,
2,
3,
4]. However, these methods are expensive and complex [
5], and it is difficult to gain internal skills using them [
6]. In recent years, with the rise of artificial neural networks (ANNs), especially feed-forward neural network (FNNs), some researchers have tried to forecast typhoons based on an FNN. This method, based on statistical theory and big data, does not require knowledge of the detailed physical processes during the development of typhoons, which shows the positive meaning for typhoon forecasting.
However, for typhoon forecasting, there are too many prediction factors that influence the prediction results. Sooyoul Kom et al. [
7] applied some forecasting models based on FNNs with different prediction factors to predict the storm surges caused by typhoons. Ying Huang et al. [
8] combined a neural network with the locally linear embedding (LLE) algorithm to predict the precipitation caused by typhoons. Shao Limin et al. [
9] applied a prediction model based on an FNN using the prediction factors selected by multivariate stepwise regression analysis to predict typhoon tracks. The above prediction results show the effective and suitable prediction factors for making the prediction results better for typhoon forecasting.
In recent years, Ruyun Wang et al. [
6] studied typhoon intensity forecasting using a prediction model based on an FNN and applied the model for typhoons Nakri (2014) and Molave (2015), which achieved better results than the National Meteorological Center (NMC), the Joint Typhoon Warning Center (JTWC), the Japan Meteorological Agency (JMA), and the Korea Meteorological Agency (KMA) in 48 h using the minimum pressure and the maximum wind speed near the typhoon’s center, as well as the position data of the typhoon center for the current time and for every 6 h in the previous 18 h. However, they did not obtain ideal results for more than 48 h. On the contrary, for the regional forecasting of wind speed during a typhoon landfall, the results could be worse with more uncertainty [
10]. These results show the limit of the abilities of prediction models based on an FNN to predict a typhoon. Therefore, it is very necessary to find a practical method to forecast typhoon intensity in a more timely and accurate manner, especially with a longer lead time of more than 48 h.
In general, typhoon intensity is measured by the minimum pressure and the maximum wind speed near the typhoon’s center, so typhoon intensity forecasting is actually pressure and wind speed forecasting. For wind speed forecasting, some researchers have tried to reduce prediction errors by using a hybrid of the population particle algorithm and a multi-quantile robust extreme learning machine [
11], wavelet decomposition and wavelet neural networks optimized by the Cuckoo search algorithm [
12], and an ANN combined with the discrete wavelet transform method [
13]. In terms of the results, all of the hybrid methods improved the prediction results of wind speed forecasting with few prediction steps (no more than five steps) and a short lead time generally. Moreover, the prediction factor of these studies was only wind speed. Thus, for typhoon intensity forecasting, it is still necessary to find a practical method with more prediction steps and a long lead time.
With the development of a neural network, especially a deep neural network (DNN), some researchers have tried to study the problem of typhoons based on a DNN, such as a convolution neural network (CNN) [
2,
14] and long short-term memory (LSTM) [
15,
16], but not a simple ANN. Wei Yuzhou and Xu Xining [
17] applied LSTM to forecast wind speed for the next 10 min and achieved a high prediction accuracy rate of more than 98%. Yin Hao et al. [
18] designed a wind speed prediction model based on the fuzzy information granulation method and LSTM optimized by the Adam algorithm, for which the prediction results of wind speed for the next 1 h were better than the models based on a support vector machine (SVM) and an ANN using the back-propagation algorithm (BPNN). Gholamreza Memarzadeh and Farshid Keynia [
19] proposed a wind speed prediction model based on LSTM using the wavelet transform, feature selection, and crow search algorithms, which showed the best performance for the wind speed in the future 1 h compared to other models based on LSTM not using these algorithms. Xuechao Liao et al. [
20] proposed a wind speed prediction model based on LSTM using the attention mechanism, wavelet decomposition, and variational mode decomposition methods. Compared to models based on the autoregressive moving average and support vector regression, the proposed prediction model was stable and had cumulative errors within the next 5 h every 1 h. These studies aimed to predict short-term wind speed and to show that prediction models based on LSTM are better for wind speed prediction problems, but they all only used wind speed data for the experiments and were not in the field of typhoon intensity forecasting. Because of the complexity of typhoon forecasting and the recent improvements of LSTM, only a few papers have used prediction models based on LSTM to predict typhoon intensity. Additionally, the study of typhoon intensity forecasting based on LSTM is only applicable to the next 24 h [
21], which shows that there is an important problem with forecasting typhoon intensity for a longer period of time.
Based on the above problems regarding typhoon intensity forecasting and prediction models based on neural networks, typhoon intensity forecasting models based on LSTM are proposed. In addition, considering the development processes of typhoon cases, in this experiment, typhoon intensity is forecast as a time series problem. During the training and test phases, optimal prediction factors are selected for typhoon intensity forecasting. After determining the optimal prediction factors for typhoon intensity forecasting, prediction models based on LSTM using the optimal prediction factors are applied for typhoons Chan-hom and Soudelor in 2015. Finally, these models are validated against a model based on an FNN.
The remainder of this paper is organized as follows:
Section 2 introduces the study area and processed data of the experiment.
Section 3 describes the methods to develop the typhoon intensity forecasting models based on LSTM and FNN and the evaluation indicators to validate these models. The forecasting models based on LSTM are presented in
Section 4.
Section 5 shows the method for selecting the optimal prediction factors and the analysis of the results of the experiment.
Section 5 concludes this paper.
2. Study Area and Data
In the experiment, typhoons that occurred in the Northwest Pacific were studied. Typhoons that occurred in other areas were outside the scope of this study. The data for the experiment were from the best tropical cyclone track data set every 6 h [
22] from the Tropical Cyclone Data Center of the CMA (tcdata.typhoon.org.cn, accessed on 14 January 2021). These typhoon data include the intensity and position information of typhoons occurring in the Northwest Pacific every 6 h from 1949. Data of the samples of each typhoon case can be seen in
Table 1. In
Table 1, the latitude (a positive value indicates north latitude) and longitude (a value smaller than 1800 indicates east longitude, a value bigger than 1800 west longitude) are 10 times that of the real values. For example, if the latitude is 95, the real value is 9.5.
Because the number of typhoons in 2015 (
Figure 1) was higher than in other years, as well as the number of typhoons that reached the strong typhoon and super typhoon levels, the data from 2000 to 2014 were used to train and test the prediction models; for the former, seventy percent of the data were used to train the models, while the remainder were used to test them, and typhoons Chan-hom and Soudelor (2015) were used to validate the models. Additionally, the typhoon cases that did not reach the typhoon level were not used in the experiment so as to avoid the negative effects for typhoon intensity forecasting and to achieve better prediction results.
The maximum wind speed (WS) and the minimum pressure (P) near the typhoon’s center, as well as the latitude (Lat) and the longitude (Lon) of the typhoon’s center were included in the data source. Additionally, the moving speed (MS) of the typhoon was also considered in the experiment. According to the theory of the cosine function and triangle function, the moving speed of a typhoon can be calculated as follows:
where
and
are the latitude and longitude in the previous 6 h, respectively;
and
are the latitude and longitude at the current time, respectively;
t is the interval time between two adjacent samples (the value of
t was six in the experiments). In addition, an MS of zero indicates a typhoon case that is motionless during the previous 6 h. The descriptive statistics of these variables of the processed typhoon data are shown in
Table 2.
The variables and prediction factors in this experiment can be seen in
Table 3. In
Table 3, P(t), P(t-1), P(t-2), P(t-3), and P(t-4) refer to the data of minimum pressure at the current time, in the previous 6 h, in the previous 12 h, and in the previous 18 and 24 h, respectively, which can be seen as the prediction factors based on the minimum pressure. Similarly, the following four rows of
Table 3 display the data of the maximum wind speed, the latitude, the longitude, and the moving speed, respectively, which can be seen as the prediction factors based on the variables WS, Lat, Lon, and MS, respectively.
6. Conclusions
When a typhoon arrives, it can bring with it huge losses to the lives and property of people. Improving the prediction results of typhoon intensity, such as accurate and timely typhoon intensity forecasting for as long as 120 h, remains a challenge. As shown by existing studies, suitable prediction factors can improve the prediction results, while more or fewer prediction factors could lead to worse prediction results. Moreover, the prediction model based on an FNN shows a limited ability for typhoon forecasting with a longer lead time of more than 48 h. Therefore, some prediction models based on LSTM are designed to determine the optimal prediction factors. Then, with the optimal prediction factors, one of the models based on LSTM is validated using the typhoon cases Chan-hom and Soudelor in 2015, compared to the FNN model using the same prediction factors. During all of the experiments, the typhoon intensity forecasting was studied as a time series problem based on historical typhoon data and LSTM.
As per the results, for typhoon intensity forecasting, the suitable and effective prediction factors can easily obtain better prediction results. The prediction factors containing the maximum wind speed and the minimum pressure near the typhoon’s center, as well as the latitude and longitude of the typhoon’s center at the current time and in the previous 6 h are enough to forecast typhoon intensity with lead times of 6, 12, 24, 48, 72, 96, and 120 h. Compared to the FNN model, the LSTM model shows better prediction ability, has lower uncertainties and smaller prediction errors, and is practical and more stable even for newer typhoon cases. It is very meaningful and practical to study typhoon intensity forecasting as a time series problem, as well as with a long lead time of more than 48 h. The application and findings of this study may be helpful and meaningful for engineering modeling, typhoon forecasting, and research of numerical models. In future work, more experiments and methods will be considered to make the future experiments of typhoon intensity forecasting better in terms of being more timely and accurate.