Next Article in Journal
A New Method Proposed for the Estimation of Exposure to Atmospheric Pollution through the Analysis of Black Pigments on the Lung Surface
Next Article in Special Issue
Stochastic Parameterization of Moist Physics Using Probabilistic Diffusion Model
Previous Article in Journal
Application of an Empirical Model to Improve Maximum Value Predictions in CFD-RANS: Insights from Four Scientific Domains
Previous Article in Special Issue
Improving Air Quality Prediction via Self-Supervision Masked Air Modeling
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Calibration of Typhoon Track Forecasts Based on Deep Learning Methods

1
College of Oceanography and Ecological Science, Shanghai Ocean University, Shanghai 201306, China
2
Research Center for Monitoring and Environmental Sciences, Taihu Basin and East China Sea Ecological Environment Supervision and Administration Authority, Ministry of Ecology and Environment, Shanghai 200125, China
3
State Key Laboratory of Tropical Oceanography, South China Sea Institute of Oceanology, Chinese Academy of Sciences, Guangzhou 510301, China
*
Author to whom correspondence should be addressed.
Atmosphere 2024, 15(9), 1125; https://doi.org/10.3390/atmos15091125
Submission received: 22 August 2024 / Revised: 5 September 2024 / Accepted: 11 September 2024 / Published: 17 September 2024
(This article belongs to the Special Issue Applications of Artificial Intelligence in Atmospheric Sciences)

Abstract

:
An accurate forecast of typhoon tracks is crucial for disaster warning and mitigation. However, existing numerical weather prediction models, such as the Weather Research and Forecasting (WRF) model, still exhibit significant errors in track forecasts. This study aims to improve forecast accuracy by correcting WRF-forecasted tracks using deep learning models, including Bidirectional Long Short-Term Memory (BiLSTM) + Convolutional Long Short-Term Memory (ConvLSTM) + Wide and Deep Learning (WDL), BiLSTM + Convolutional Gated Recurrent Unit (ConvGRU) + WDL, and BiLSTM + ConvLSTM + Extreme Deep Factorization Machine (xDeepFM), with a comparison to the Kalman Filter. The results demonstrate that the BiLSTM + ConvLSTM + WDL model reduces the 72 h track prediction error (TPE) from 255.18 km to 159.23 km, representing a 37.6% improvement over the original WRF model, and exhibits significant advantages across all evaluation metrics, particularly in key indicators such as Bias2, Mean Squared Error (MSE), and Sequence. The decomposition of MSE further validates the importance of the BiLSTM, ConvLSTM, WDL, and Temporal Normalization (TN) layers in enhancing the model’s spatio-temporal feature-capturing ability.

1. Introduction

The northwestern Pacific, a highly active region for tropical cyclones, including typhoons, presents significant challenges for accurate track prediction. This predominantly tropical area is influenced by the East Asian Monsoon and the Western North Pacific Monsoon, which drive seasonal variability in typhoon activity, with most storms forming between June and October. The southwest monsoon brings hot, humid summers, while the northeast monsoon results in cooler, drier winters. Warm sea surface temperatures [1] fuel typhoon development, while prevailing wind patterns dictate their movement and eventual dissipation.
Several atmospheric mechanisms further complicate typhoon dynamics in this region. The Western Pacific Subtropical High (WPSH) often steers typhoons westward and northward, particularly as it shifts northward during the summer. The Intertropical Convergence Zone (ITCZ) enhances convection and cyclogenesis, while the Madden–Julian Oscillation (MJO) [2] modulates typhoon activity on intraseasonal timescales by influencing convection and the likelihood of formation.
Typhoons, as highly destructive natural disasters, cause significant economic losses and casualties worldwide each year [3]. Accurate prediction of typhoon tracks is crucial for mitigating disaster impacts and enhancing emergency response efficiency. Numerical Weather Prediction (NWP) models, such as the WRF model, have become primary tools for typhoon track forecasting. However, due to the high complexity of atmospheric systems and sensitivity to initial conditions [4], traditional NWP models still exhibit certain limitations in track prediction, especially under rapidly changing meteorological conditions.
In the field of typhoon track prediction, traditional methods remain significant. The Climatology and Persistence (CLIPER) model, serving as a benchmark approach, simulates the two-dimensional characteristics of typhoons by combining historical climatological data and persistence information [5]. Although its prediction accuracy is limited, the CLIPER model provides a simple yet effective reference in the absence of more complex numerical models. Additionally, the Kalman Filter [6] technique, known for its state estimation capabilities in dynamic systems, is frequently employed to correct the prediction errors of NWP models, thereby enhancing typhoon track forecast accuracy [7].
With the rise of machine learning, numerous data-driven approaches have been introduced to typhoon track prediction. Support Vector Machine (SVM), commonly used for classification and regression, has been applied to various meteorological forecasting tasks by effectively fitting nonlinear data through optimal hyperplane search [8]. Ensemble learning methods based on decision trees, such as Extreme Gradient Boosting (XGBoost) [9] and Light Gradient Boosting Machine (LightGBM) [10], have demonstrated strong performance in handling high-dimensional, large-scale data, excelling in both prediction accuracy and computational efficiency. The application of these machine learning methods in typhoon track prediction has expanded the range of predictive models and approaches, offering new possibilities for improving forecast accuracy and reliability.
Meanwhile, deep learning models, particularly Convolutional Neural Network (CNN) [11,12,13,14,15], Recurrent Neural Network (RNN) [16], and their variants such as Long Short-Term Memory (LSTM) networks [17] and ConvLSTM networks, have shown significant advantages in processing complex spatiotemporal data [18]. These models, by automatically learning intricate patterns from large-scale data, are capable of capturing the spatiotemporal evolution characteristics of typhoon tracks, thereby further enhancing prediction accuracy and model robustness [19].
Despite the considerable potential demonstrated by deep learning methods in typhoon track prediction, several unresolved challenges remain. Firstly, the reliance of deep learning models on large datasets means that their performance may be constrained by the quality and quantity of the available data. Secondly, effectively integrating deep learning models with traditional NWP models remains a critical challenge. Although some progress has been made in combining these approaches, fully leveraging the physical foundations of NWP models alongside the nonlinear fitting capabilities of deep learning models is an area that has not yet been thoroughly explored.
This study aims to enhance the accuracy of the WRF model in forecasting and correcting typhoon tracks. Specifically, by constructing a dataset based on the CLIPER method and a three-dimensional construction technique, this research compares the effectiveness of traditional methods, machine learning approaches [20], and deep learning techniques [21] in track correction. The study particularly focuses on the role of BiLSTM [22] and TN [23] in track correction, and it analyzes the performance of different network architectures, including WDL [24], Neural Factorization Machine (NFM) [25], and xDeepFM [26]. All these network architectures incorporate ConvLSTM modules to capture the spatiotemporal evolution characteristics of typhoon tracks. The primary objective of this study is to evaluate the performance of these different network structures in track correction and to identify the optimal approach.
First, the WRF model is used to forecast pressure and wind fields, from which typhoon tracks are extracted. These tracks are then used to generate high-quality datasets through the CLIPER method and three-dimensional construction techniques. Subsequently, traditional methods, machine learning approaches, and deep learning techniques are employed to correct the WRF-forecasted tracks to match the actual tracks. The study places special emphasis on the critical role of BiLSTM and TN in the correction process, as well as the performance of different network architectures incorporating ConvLSTM modules (including WDL, NFM, and xDeepFM). Additionally, to investigate the sources of error and improve model interpretability, this research adopts the error decomposition method proposed by Hodson et al. [27], decomposing the MSE [28] into Bias2, distribution, and sequence components. Subsequently, error diagnosis and analysis are conducted for each correction scheme, providing insights for further model optimization.
The contributions of this study are as follows:
  • A systematic comparison of traditional methods, machine learning approaches, and deep learning techniques in WRF track correction is conducted, clarifying the strengths and weaknesses of each method.
  • The study emphasizes the importance of BiLSTM and TN in track correction and analyzes the performance of different deep learning frameworks after the integration of ConvLSTM modules.
  • This research delves into the performance of WDL, NFM, and xDeepFM in typhoon track correction, significantly enhancing the ability to process complex meteorological data and improving the accuracy and efficiency of predictions. These optimized network architectures demonstrate the immense potential of deep learning techniques in improving typhoon track prediction accuracy.
  • By introducing the error decomposition method, error diagnosis and analysis are performed for each correction scheme, enhancing model interpretability and providing valuable insights for further optimization.

2. Data

2.1. Best-Track Data

The Best-Track data used in this study are sourced from the China Meteorological Administration (CMA) [29,30]. This dataset has recorded the tracks and intensity information of tropical cyclones in the Western North Pacific (WNP) basin since 1949. The CMA dataset provides updates on typhoon center locations and intensity every six hours and serves as a crucial foundational data source for typhoon track prediction research.

2.2. Reanalysis Data

This study utilizes two reanalysis datasets: ERA-Interim and NCEP FNL, each applied at different stages.
The ERA-Interim dataset, provided by the European Centre for Medium-Range Weather Forecasts (ECMWF), serves as a crucial data source for constructing the three-dimensional structure of typhoons. This dataset has been continuously recording 14 reanalysis variables of the global atmosphere since 1979, with data available four times daily (00:00, 06:00, 12:00, and 18:00) across 37 pressure levels. In this study, the geopotential height data with a 1° × 1° grid resolution is used to construct the three-dimensional structure of typhoons. Geopotential height refers to the gravitational potential energy per unit mass relative to mean sea level.
The NCEP FNL dataset is an operational global analysis dataset provided by the National Centers for Environmental Prediction (NCEP) in the U.S., primarily used for forecasting typhoon tracks with the WRF model. This dataset covers the globe (from 0° E to 359° E, and from 90° N to 90° S) with a grid resolution of 1° × 1°, including 26 standard vertical levels ranging from 1000 millibars to 10 millibars. The NCEP FNL data include various atmospheric parameters such as temperature, surface winds, upper-level winds, vertical wind speed and velocity, and potential temperature.

3. Methods

3.1. Numerical Model Forecast

In this study, the WRF (version 4.2.1) [31] model was utilized to forecast the tracks of tropical cyclones [32], providing part of the input data for the deep learning models. Given the substantial computational resources and extended integration time required for WRF forecasts, typhoons from June to November in the Western North Pacific (WNP) from 2000 to 2022 were selected [33]. Only typhoons reaching or exceeding tropical storm strength (≥34 kts) and lasting at least three days are included in the forecasts. A total of 452 typhoon cases (Figure 1). The tracks were forecast with integration times of 24 h, 48 h, and 72 h. The forecasts were conducted with a horizontal resolution of 27 km, and the computational domain was defined by a rectangular area with an additional 100 km added to both the length and width to ensure complete coverage.
Initial and boundary conditions were provided by NCEP-FNL data, with a resolution of 1° × 1°. The WRF parameterization settings were based on previous studies of typhoons in the WNP [35], including the WRF Single-Moment 6-class microphysics scheme (Hong and Lim 2006) [36], the Rapid Radiative Transfer Model (RRTM) longwave radiation scheme (Mlawer et al., 1997) [37], the Dudhia shortwave radiation scheme (Dudhia 1989) [38], the Kain–Fritsch cumulus parameterization scheme (Kain 2004) [39], the Yonsei University planetary boundary layer scheme (Hong et al., 2006) [40], and the thermal diffusion scheme (Dudhia 1996) [41]. Typhoon tracks were recorded every six hours, with the typhoon center determined by the minimum sea-level pressure [42].

3.2. Dataset and Preprocessing Method

The dataset used in this study comprises both two-dimensional data (Table 1) and three-dimensional data (Figure 2). The two-dimensional data extend the dataset used in a previous study [34], incorporating additional latitude and longitude information for tropical cyclone centers forecasted by the WRF model at 24, 48, and 72 h. These data help correct the forecasted tracks, aligning them with the actual tracks in the best track dataset, thereby enhancing the accuracy of forecasts.
The three-dimensional data capture the atmospheric environment. It is generated for each time instance in the two-dimensional dataset, with geopotential height [43] being the primary feature. These data are instrumental in capturing the three-dimensional evolution of tropical cyclones, further enhancing the model’s capability for track correction.
Typhoons from 2000–2017 were used for training, and those from 2018–2022 as the test set, with the final 10% of the training set designated for validation. All data underwent preprocessing before being input into the model. The two-dimensional data were normalized, while the three-dimensional data were standardized to ensure consistency and stability during model training. This dataset supported the training and validation of the deep learning model, ultimately enhancing prediction accuracy.
Figure 3 illustrates the methodological workflow of this study, covering data acquisition, preprocessing, model training, and evaluation.

3.3. Temporal Normalization and BiLSTM

TN is a preprocessing technique designed to standardize data across different time periods, eliminating inconsistencies. Time series data often contain nonlinear trends, seasonal fluctuations, and noise, which can negatively impact model performance. By applying TN, the input data achieve improved generalization capability. TN helps the model predict future trends by balancing information across different periods.
BiLSTM improves accuracy by processing time series bidirectionally. Traditional LSTM networks process data in a forward direction, whereas BiLSTM uses two parallel LSTM layers to capture both forward and backward information. This allows BiLSTM to strengthen the model’s understanding of sequences. In tasks like typhoon track prediction, BiLSTM boosts accuracy by integrating information from both the past and future. BiLSTM is also more robust in handling long-term dependencies and complex patterns.
In summary, TN ensures input data consistency, while BiLSTM optimizes predictions through bidirectional processing. This combination offers significant advantages in time series modeling.

3.4. Network Architecture

3.4.1. WDL

The WDL model combines a wide linear model with a deep neural network. The wide component captures interactions among high-dimensional sparse features, while the deep component learns complex nonlinear interactions. By jointly training both, the WDL model generalizes to unseen feature combinations while retaining high memorization capacity.
In this study, the WDL architecture (Figure 4) consists of feature extraction via a linear layer, fused with the deep component’s output. The final latitude and longitude predictions are made through a fully connected layer. The deep component includes ConvLSTM layers and a residual channel attention mechanism, processed with TimeDistributed layers to capture spatiotemporal characteristics of typhoon tracks.
Batch normalization and dropout are used to prevent overfitting. The output layer predicts typhoon track latitude and longitude using linear regression. By training both components, the WDL model efficiently captures historical path patterns and applies broader feature combinations, leading to accurate typhoon track prediction.

3.4.2. NFM

The NFM model combines the strengths of Factorization Machine (FM) and Deep Neural Network (DNN) to capture both low-order and high-order feature interactions. The FM component models second-order interactions, while the DNN component extracts higher-order, more complex interactions through neural network layers.
In this study, the NFM architecture (Figure 5) divides input features into Wide Features and Deep Features. Deep Features are processed through ConvLSTM layers with channel attention and pooling operations, then mapped into an embedding space and processed by a Bi-Interaction Pooling layer, which explicitly models second-order interactions. Wide Features are processed through a linear layer and fused with the Bi-Interaction Pooled Deep Features. The final output, predicting typhoon track latitude and longitude, is generated through a linear layer.
NFM’s Bi-Interaction Pooling layer captures nonlinear interactions without additional feature engineering, while deep learning layers enhance high-order interaction capture. Compared to traditional models, NFM is flexible in feature extraction and efficient with sparse data, providing a reliable solution for modeling complex spatiotemporal features in typhoon track prediction.

3.4.3. xDeepFM

In this study, the xDeepFM model was employed to capture complex typhoon track features. xDeepFM integrates both explicit and implicit high-order feature interactions. The explicit component includes the Compressed Interaction Network (CIN), which captures multi-level feature relationships through layer-by-layer interactions. CIN models high-order interactions with increasing network depth.
As shown in Figure 6, the CIN component processes embedded input features through compressed interaction layers. The CIN output is fused with other interaction outputs, and final predictions are generated through a fully connected layer. xDeepFM combines Factorization Machines (FM) with high-order interaction learning via CIN, enhancing the model’s expressiveness and prediction accuracy.
This architecture effectively captures spatiotemporal interactions, supporting accurate typhoon track prediction.

3.5. Performance Evaluation

3.5.1. Evaluation Metrics

To comprehensively evaluate the performance of the model in typhoon track correction, this study employs various statistical metrics that help quantify the predictive accuracy of the model and reveal its performance under different conditions. The primary evaluation metrics include Mean Bias Error (MBE), Root Mean Square Error (RMSE), correlation coefficient R 2 , and Track Position Error (TPE).
MBE is used to measure the systematic bias between the model’s predictions and actual observations. The formula for calculating MBE is as follows:
MBE = 1 n i = 1 n ( y i y ^ i ) ,
where y i represents the ith predicted value, y ^ i represents the ith observed value, and n is the number of samples. MBE can reveal whether there is an overall bias in the model, making it one of the key indicators for evaluating the quality of model predictions.
Root Mean Square Error (RMSE) is another commonly used accuracy metric, which assesses the average difference between predicted and actual values. The formula for RMSE is:
RMSE = 1 n i = 1 n ( y i y ^ i ) 2 ,
A smaller RMSE indicates that the model’s predictions are more accurate and better reflect the actual data.
The correlation coefficient is used to evaluate the linear correlation between the model’s predictions and actual observations. Its formula is:
R 2 = 1 i = 1 n ( y i y ^ i ) 2 i = 1 n ( y i y ¯ ) 2 ,
where y ¯ is the mean of the predicted values. The closer the R 2 value is to 1, the stronger the model’s explanatory power and the higher the agreement between the predicted and actual values.
TPE, calculated following the method outlined in [44], is a metric specifically designed to assess the error in typhoon track predictions, representing the geographical deviation between the predicted and actual tracks. It serves as a crucial metric for evaluating typhoon track prediction performance, effectively reflecting the model’s overall error at different time steps and is one of the key evaluation metrics in this study. In addition to TPE, the combined use of these evaluation metrics provides a multidimensional framework for assessing the model’s performance, ensuring the reliability and accuracy of the prediction results.
Based on the literature [6,45,46,47,48,49,50,51,52], other commonly used evaluation metrics for assessing differences between predicted and actual paths after model processing include Mean Absolute Error (MAE), RMSE, MBE, Mean Squared Error (MSE), R2, Mean Distance Error (MDE), Mean Spherical Distance (MSD), and Spherical Distance (SD). These metrics are widely applied due to their effectiveness in measuring spatial errors and are complementary to TPE, which specifically assesses spatial discrepancies between predicted and actual paths. Generally, the smaller the error, the better the model’s performance, which is the primary reason for selecting these metrics.
Among various evaluation methods, this study specifically chose MBE, RMSE, R2, and TPE based on their frequent use in the literature and their clarity. MBE and RMSE reflect the model’s bias and overall error magnitude, while R2 measures goodness-of-fit, indicating the linear relationship between predicted and actual values. TPE, as a spatial error metric, directly reflects the accuracy of the predicted path. By combining these metrics, this study comprehensively evaluates the model’s performance from multiple perspectives.
Additionally, to address the limitations of relying on a single evaluation method, this study employs MSE decomposition. By breaking down MSE into Bias2, Distribution, and Sequence components, the study provides a detailed analysis of different sources of error, offering a more thorough model evaluation. This approach not only enhances the depth of evaluation but also strengthens the understanding of model performance, ensuring the reliability and academic value of the findings. Thus, the combined use of traditional and decomposed metrics offers a comprehensive framework to robustly assess model performance across different scenarios.

3.5.2. Error Decomposition

Conventional metrics like Mean Squared Error (MSE) quantify overall error but may not explain specific aspects of model performance. To enhance interpretability, this study follows Hodson et al.’s approach and decomposes MSE into a Bias Term [53], Distribution Term, and Sequence Term. This method enables detailed error diagnosis and guides future optimization.
MSE is calculated using the following formula:
MSE = 1 n i = 1 n ( f i o i ) 2 ,
where f i represents the forecasted value and o i represents the observed value on the ith time point. According to the decomposition method proposed by Geman et al. [27], MSE can be decomposed into a bias term and a variance term:
MSE ( e ) = ( E ( e 2 ) E ( e ) 2 ) + E ( e ) 2 = Var ( e ) + Bias ( e ) 2 ,
where e represents the model’s prediction error, Var ( e ) is the variance component of the error, and Bias ( e ) 2 is the bias component. The variance term evaluates the model’s ability to reproduce the variability of the observed values, while the bias term assesses the model’s ability to reproduce the mean characteristics of the observations.
Next, the variance term is further decomposed into a sequence term and a distribution term. To calculate these error components, the forecasted and observed values need to be sorted, and the new error is calculated:
w = sort ( f ) sort ( o ) ,
where sort ( f ) denotes the sorted forecasted values and sort ( o ) denotes the sorted observed values, with w being the error after sorting. Consequently, the new MSE can be expressed as:
MSE ( w ) = Bias ( w ) 2 + Var ( w ) ,
where Var ( w ) represents the difference in data distribution, i.e., the distribution term. According to this formula, the distribution term can be expressed as:
Var ( w ) = Distribution ( e ) ,
The sequence term is then expressed as:
Sequence ( e ) = MSE ( e ) MSE ( w ) ,
Thus, the final decomposition of MSE can be expressed as:
MSE ( e ) = Bias ( e ) 2 + Sequence ( e ) + Distribution ( e ) ,
In this formula, Bias ( e ) 2 quantifies the model’s ability to reproduce the mean characteristics of the observations, Sequence ( e ) represents the errors induced by temporal sequence variations, and Distribution ( e ) reflects the errors caused by differences in the distribution between forecasted and observed values. By using this decomposition method, a more comprehensive understanding of error sources can be obtained, allowing for more targeted model optimization.

4. Result

4.1. Analysis of WRF Model Track Forecast Results

We used the WRF model to forecast typhoon tracks at different time scales and evaluated its performance using a series of metrics. Figure 7 compares the forecasted latitude and longitude results from the WRF model at 24-h, 48-h, and 72-h time scales with the actual tracks. To comprehensively analyze the model’s performance, we employed metrics such as TPE, RMSE, and MBE.
Figure 7 illustrates the bar charts of track error and RMSE at each time scale. As shown, both TPE and RMSE increase as the forecast time extends. This trend indicates that the error in WRF-forecasted typhoon tracks gradually increases over time, with the maximum error at the 72-h mark.
Specifically, the TPE was 223.57 km at 72 h, 183.46 km at 48 h, and 104.57 km at 24 h. This indicates that the WRF model performs better on shorter time scales (24 h) than longer time scales (48 and 72 h). Similarly, the RMSE for both latitude and longitude increased at 72 h, with RMSE values of 2.0882° for latitude and a notably high 3.2427° for longitude. At 48 and 24 h, RMSE values decreased. For the 24-h forecast, the RMSE values for latitude and longitude were 1.4783° and 1.8422°, respectively, indicating more accurate predictions over shorter periods.
Notably, at all time scales, the RMSE for longitude consistently exceeded that for latitude, likely due to more pronounced lateral shifts in typhoon tracks. The longitudinal path is influenced by more complex weather systems and topography, increasing prediction difficulty and leading to higher errors. Optimizing longitudinal direction forecast errors could be key to improving overall accuracy.
Figure 8 illustrates year-by-year TPE variations from 2000 to 2022. TPE volatility increases at longer time scales (72 h), consistent with earlier analysis. In years like 2007 and 2015, the 72-h TPE was significantly higher, indicating particularly pronounced errors at longer time scales.
Finally, Figure 9 compares WRF-forecasted latitude and longitude with actual tracks at different time scales. In the 24-h forecast, the R2 value was 0.9884, indicating high model fit and minimal error over short time scales. At 72 h, the R2 value dropped to 0.9328 (latitude) and 0.9641 (longitude), with corresponding increases in RMSE and MBE, indicating larger errors at longer time scales.
In summary, the WRF model performs best in forecasting typhoon tracks over short time scales (24 h), with errors increasing significantly as the time scale extends. Thus, greater emphasis should be placed on leveraging the model’s short-term prediction capabilities, while optimizing long-term forecasts to enhance overall accuracy.

4.2. Analysis of Deep Learning-Based Track Correction Results

We conducted a comparative analysis of the performance of different deep learning models and traditional methods in typhoon track correction, focusing on their advantages over directly using WRF model forecast results.
Table 2 shows that the TPE values for the WRF model are 255.18 km, 236.41 km, and 94.80 km at 72 h, 48 h, and 24 h, respectively. These values differ somewhat from the TPE obtained by directly using the WRF model forecasts, mainly due to the reprocessing and filtering of data during dataset construction, which affects the observed errors.
From Table 2, it is evident that the BiLSTM model performs better across all time scales, especially in track correction at 72 h, 48 h, and 24 h, with TPE values of 207.67 km, 150.01 km, and 112.86 km, respectively. This is mainly due to BiLSTM’s ability to better capture temporal dependencies in typhoon tracks, maintaining high accuracy even over longer time scales. Additionally, the performance of the BiLSTM model improves when the TN layer is enabled. Without the TN layer, the TPE values for the BiLSTM model are 428.72 km, 156.49 km, and 120.10 km at 72 h, 48 h, and 24 h, respectively—significantly higher than what the TN layer enabled. This indicates that the TN layer plays a crucial role in mitigating seasonal variations and noise, significantly enhancing the model’s generalization capability.
In Table 3, the BiLSTM + ConvLSTM + WDL model shows higher accuracy in typhoon track correction. Specifically, its TPE values are 159.23 km, 148.25 km, and 75.31 km at 72 h, 48 h, and 24 h, respectively, outperforming the standalone BiLSTM model. This suggests that the inclusion of ConvLSTM and WDL enhances the model’s ability to process three-dimensional input data and significantly improves track correction accuracy.
Further analysis reveals that the BiLSTM + ConvLSTM + WDL model outperforms other deep learning methods, machine learning methods, and the traditional Kalman Filter across all time scales. The BiLSTM + ConvGRU + NFM model ranks just below the BiLSTM + ConvGRU + WDL and BiLSTM + ConvLSTM + WDL models, showing some advantages in feature interaction and track correction. However, the BiLSTM + ConvLSTM + NFM model performs worse, especially at longer time scales (72 h and 48 h), with significantly higher TPE values. This suggests that NFM works better with ConvGRU but underperforms when paired with ConvLSTM.
The BiLSTM + ConvLSTM + WDL model consistently yields lower TPE values across all time scales compared to the BiLSTM + ConvLSTM + NFM and BiLSTM + ConvLSTM + xDeepFM models, validating WDL’s superiority in feature combination and nonlinear processing. WDL’s efficient handling of wide and deep feature combinations gives it an edge, whereas the complex feature interactions in NFM and xDeepFM do not yield significant performance improvements.
Table 4 and Table 5 summarize the prediction performance of different models in the longitude and latitude directions, evaluated using MBE, RMSE, and R2 across three time scales (72, 48, and 24 h). Overall, the BiLSTM + ConvLSTM + WDL model excels, particularly in R2 and RMSE. For example, in the 24-h longitude prediction, it achieves an R2 of 0.9965, while in the 72-h latitude prediction, it shows an RMSE of 1.2574°, demonstrating high accuracy.
As the prediction time shortens (from 72 to 24 h), most models show improved accuracy, with decreasing RMSE and increasing R2, indicating smaller errors for shorter-term forecasts. Notably, the evaluation metrics for longitude are generally higher than for latitude, suggesting that longitude predictions may be less stable. Overall, the BiLSTM + ConvLSTM + WDL model excels in both stability and accuracy across all time scales.
In conclusion, the BiLSTM + ConvLSTM + WDL model performs best in typhoon track correction, particularly with three-dimensional input data and long-term track predictions. WDL outperforms NFM and xDeepFM in combining wide and deep features, suggesting that a simple yet efficient feature combination strategy is more crucial in track correction. This analysis provides valuable insights for optimizing typhoon track prediction models.

4.3. Evaluations of Error Decomposition

Based on the overall performance illustrated in Figure 10, we observe the trends in MSE, Bias2, Distribution, and Sequence across different time scales (72 h, 48 h, 24 h).
For MSE, both the line charts and scatter plots (Figure 10 and Figure 11) show a clear upward trend in MSE values as the forecast time increases from 24 to 72 h, with higher MSE values in the longitudinal direction compared to the latitudinal direction. This indicates that models tend to incur greater errors when forecasting longitude tracks. Specifically, the LSTM + ConvLSTM + WDL model (B) consistently shows the lowest MSE values across all forecast times, especially at the 72-h mark, where its errors are significantly lower than those of other models. In contrast, the Kalman Filter (E) and BiLSTM + ConvLSTM + xDeepFM (D) models exhibit higher MSE values across all forecast times, with this trend being particularly pronounced at 72 h, indicating larger errors in long-term forecasts. In the MSE-Latitude and MSE-Longitude scatter plots (Figure 11), the BiLSTM + ConvLSTM + WDL model shows relatively low MSE in both latitude and longitude, especially in the 24-h prediction, where regions such as the southeast coast and southern China exhibit smaller MSE values, indicated by green areas in the plots. In comparison, the BiLSTM + ConvGRU + WDL model’s MSE performance is slightly inferior but still surpasses that of the BiLSTM + ConvLSTM + xDeepFM model. Notably, the BiLSTM + ConvLSTM + xDeepFM model exhibits significantly higher MSE values in longitude, especially along the eastern coast of China and the Taiwan region, where large red areas indicate substantial prediction errors. This could be due to the complex topography and diverse track changes in the longitudinal direction, leading to greater errors as the model processes these variations.
Regarding Bias2, both the line charts and scatter plots (Figure 10 and Figure 12) indicate that Bias2 values are generally higher in the longitudinal direction than in the latitudinal direction, reflecting greater systematic errors in longitude tracks. The BiLSTM + ConvGRU + WDL model (A) consistently shows Bias2 values close to zero across all forecast times, especially in the 24-h latitude forecast, demonstrating stable performance with well-controlled bias. In contrast, the Kalman Filter (E) and BiLSTM + ConvLSTM + xDeepFM (D) models exhibit peak Bias2 values in the 72-h forecast, indicating significant systematic bias in long-term forecasts. In the Bias2-Latitude and Bias2-Longitude scatter plots (Figure 12), the BiLSTM + ConvLSTM + WDL model shows balanced performance in both latitude and longitude, with consistently low Bias2 values across all time scales. In contrast, the Bias2 values of the BiLSTM + ConvGRU + WDL model are slightly higher but still reflect relatively low systematic bias. However, the Bias2 values of the BiLSTM + ConvLSTM + xDeepFM model are noticeably higher in longitude, especially in the 72-h prediction, where extensive red areas indicate larger errors in longitude.
For Distribution, the line charts and scatter plots (Figure 10 and Figure 13) show that the WRF model (C) consistently exhibits the lowest Distribution values, especially in the 72-h forecast, showcasing excellent data distribution consistency. This suggests that the WRF model can accurately replicate the distribution characteristics of the observed data when forecasting latitude paths, with the predicted path closely aligning with the actual observations. Similarly, the WRF model performs well in the longitudinal direction, maintaining relatively low Distribution values despite slight fluctuations across different forecast times. In contrast, the Kalman Filter (E) and BiLSTM + ConvLSTM + xDeepFM (D) models show significantly higher Distribution values, particularly in the 72-h forecast and in the longitudinal direction. This indicates that these models struggle to maintain consistent data distribution between predictions and observations, leading to a poor match between the forecasted and actual paths, especially in long-term forecasts.
For Sequence, both the line charts and scatter plots (Figure 10 and Figure 14) reflect that Sequence values increase with forecast time, with a more pronounced increase in the longitudinal direction, suggesting poorer temporal consistency when forecasting longitude paths. The LSTM + ConvGRU + WDL model (A) consistently demonstrates the lowest Sequence values across all forecast times, particularly in the 24-h latitude forecast, indicating optimal temporal sequence consistency. In contrast, the Sequence values for the Kalman Filter (E) and BiLSTM + ConvLSTM + xDeepFM (D) models peak in the 72-h longitude forecast, indicating significant shortcomings in temporal sequence consistency for long-term path forecasts. In the Sequence-Latitude and Sequence-Longitude scatter plots (Figure 14), the BiLSTM + ConvLSTM + WDL model shows low temporal sequence errors in both latitude and longitude, particularly in the 24-h prediction, where regions like the southeast coast and southern China exhibit the lowest Sequence errors. This suggests that the model maintains good temporal sequence consistency, with more noticeable correction effects. However, the BiLSTM + ConvLSTM + xDeepFM model exhibits significantly higher Sequence errors in longitude, especially in complex path areas like the South China Sea and Taiwan Strait.
Overall, the models exhibit commonalities and differences in error patterns across latitudinal and longitudinal directions. MSE, Bias2, Sequence, and Distribution values are generally higher in the longitudinal direction, indicating that models are more prone to greater errors, systematic bias, poor data distribution consistency, and more pronounced lead or lag issues when forecasting longitude paths. Conversely, the models demonstrate more stable performance in the latitudinal direction, with smaller errors, closer data distribution to observations, and better temporal sequence consistency, especially in short-term forecasts. These differences highlight the need for future path forecast efforts to focus more on optimizing longitudinal processing to enhance overall forecast accuracy and improve model performance in long-term forecasts. Through the above comparative analysis, it is evident that the BiLSTM + ConvLSTM + WDL model outperforms the other models across various metrics, particularly excelling in key indicators such as Bias2, MSE, and Sequence. This model demonstrates balanced performance in both longitude and latitude, providing stable predictions, especially along the southeast coast and southern regions of China. As forecast time decreases, all statistical metrics show a downward trend, further confirming the BiLSTM + ConvLSTM + WDL model’s superiority in short timescales, significantly enhancing overall forecast outcomes.
From Figure 15, it is evident that the BiLSTM + ConvLSTM + WDL model demonstrates a significant advantage over the BiLSTM + ConvGRU + WDL model in correcting WRF forecast results. The scatter plots show that the BiLSTM + ConvLSTM + WDL model achieves R2 values of 0.9726 and 0.9836 for longitude and latitude, respectively, which are both higher than the BiLSTM + ConvGRU + WDL model’s R2 values of 0.9242 and 0.9848. This indicates a stronger correlation between the forecast results and actual observation data for the BiLSTM + ConvLSTM + WDL model.
In terms of RMSE, the BiLSTM + ConvLSTM + WDL model records RMSE values of 0.6252° for longitude and 0.5748° for latitude, significantly lower than the BiLSTM + ConvGRU + WDL model’s values of 1.0412° and 0.5534°. This demonstrates that the BiLSTM + ConvLSTM + WDL model more effectively reduces errors when correcting WRF forecast paths, providing more accurate path corrections. Additionally, the MBE for the BiLSTM + ConvLSTM + WDL model is also lower in both longitude and latitude, further emphasizing its superiority in controlling systematic errors.
Figure 16 and Figure 17 further validate these conclusions. The BiLSTM + ConvLSTM + WDL model (green path) shows a higher degree of alignment with historical typhoon paths (black path), especially in complex and variable regions. In contrast, although the BiLSTM + ConvGRU + WDL model (blue path) also exhibits good correction effects, it shows larger deviations in certain segments, particularly in the second path diagram, where the BiLSTM + ConvGRU + WDL model’s predicted path significantly diverges from the historical path.
In summary, the BiLSTM + ConvLSTM + WDL model exhibits higher accuracy and stability when correcting WRF forecast paths, establishing itself as one of the most effective models for typhoon path correction. While the BiLSTM + ConvGRU + WDL model generally performs well, its effectiveness is somewhat lacking in complex paths. Overall, the BiLSTM + ConvLSTM + WDL model more accurately captures subtle changes in typhoon paths, significantly improving prediction accuracy and demonstrating superior performance in the field of typhoon path correction.

5. Discussion

This study systematically compares the performance of various deep learning models, machine learning methods, and traditional approaches in typhoon track correction, highlighting the strengths and weaknesses of the BiLSTM + ConvLSTM + WDL model, BiLSTM + ConvGRU + WDL model, BiLSTM + ConvLSTM + xDeepFM model, and the Kalman Filter. The experimental results demonstrate that the BiLSTM + ConvLSTM + WDL model excels across all evaluation metrics, especially in key indicators such as Bias2, MSE, and Sequence. This model exhibits well-balanced performance in both longitude and latitude predictions, particularly in the southeastern coastal and southern regions of China, achieving higher predictive accuracy and stability. Moreover, the BiLSTM + ConvLSTM + WDL model effectively reduces systematic bias and distribution bias in track forecasts, significantly enhancing the accuracy of typhoon track corrections.
In this study, the BiLSTM plays a crucial role in capturing the spatiotemporal features of typhoon tracks. Its bidirectional structure enables the model to consider both past and future information simultaneously, thereby improving the accuracy of track forecasts. The ConvLSTM, by integrating convolutional operations with LSTM’s memory capabilities, better captures local spatial features of typhoon tracks, leading to significantly improved performance in complex track forecasts. The TN layer is essential in eliminating nonlinear trends and noise in time series data, ensuring consistent data processing across different time periods, thus enhancing the model’s generalization ability.
While effective in many cases, the BiLSTM + ConvGRU + WDL model shows some deficiencies in correcting complex tracks. The BiLSTM + ConvLSTM + xDeepFM model exhibits considerable bias in long-term track forecasts, particularly in the longitude direction, where it shows higher errors and distribution bias. Moreover, the Kalman Filter’s performance is markedly inferior to the deep learning models, especially in handling complex tracks.
In recent years, research on correcting tracks predicted by the WRF model has been limited. A notable study conducted by Kyoungmin Kim [54] et al. in 2023 employed machine learning methods for track correction in the Western North Pacific (WNP) region. The study used data from the typhoon active season between June and November from 2006 to 2018. In their 72-h forecast, the track prediction error (TPE) was reduced from 179.02 km to 171.25 km, an improvement of approximately 4.34%. However, their testing dataset included data only from 2016 to 2018, while the current study uses data from 2018 to 2022. These differences in datasets limit direct comparability, making their results a reference point rather than a direct comparison.
In addition to recent advancements, a related study by Chengfei He [6] et al. in 2015 used the Kalman filter to improve multi-model ensemble track forecasts over the WNP, reducing the TPE to 143.9 km in the 72-h forecast.
Recently, mainstream studies have leaned towards using deep learning methods for direct typhoon track prediction. For example, Jinkai Tan [52] et al. utilized the Gradient Boosting Decision Tree (GBDT) model in WNP tropical cyclone track forecasts, achieving errors of 138 km, 264 km, and 363.5 km for 24-h, 48-h, and 72-h forecasts, respectively. These values are significantly higher than the corrected TPEs obtained in this study, which are 75.31 km, 148.25 km, and 159.23 km after WRF forecast correction. Moreover, Pingping Wang [51] et al. used a 3D Convolutional Neural Network (3D CNN) to predict typhoon tracks in the WNP, achieving a TPE of 112.05 km for the 24-h forecast, which is also higher than the results obtained in this study.
Zili Liu [46] et al. used a Dual-Branched Spatio-Temporal Fusion Network for 24-h predictions in the WNP region, achieving a mean distance error (MDE) of 119.05 km. Similarly, TAO SONG [55] et al. developed a novel deep learning model based on a Bidirectional Gated Recurrent Unit (BiGRU) with an attention mechanism, achieving TPEs of 147.37 km, 164.82 km, and 171.19 km for 24-h, 48-h, and 72-h forecasts, respectively.
Overall, although some studies have explored typhoon track correction, the variability in datasets, methods, and models suggests that these results serve primarily as references. This study significantly reduces track error by leveraging the latest datasets and deep learning techniques for post-processing WRF outputs.
Although this study has achieved significant results in typhoon track correction in the Western North Pacific region, its applicability to other global regions remains unverified. The method’s effectiveness may be limited by differing climatic conditions, ocean dynamics, and atmospheric mechanisms in other regions. Furthermore, this study relies heavily on specific meteorological datasets, which may not fully represent conditions in other regions, potentially limiting the model’s generalization ability. Despite employing deep learning and machine learning methods for track correction, the performance of these methods may still be constrained by the quality and quantity of the input data, particularly when dealing with complex tracks.

6. Conclusions

This study underscores the significant superiority of the BiLSTM + ConvLSTM + WDL model in typhoon track correction and validates the critical role of BiLSTM, ConvLSTM, and the TN layer in enhancing the model’s ability to capture spatiotemporal features and improve track forecast accuracy. As the forecast time shortens, all statistical metrics show a decreasing trend, further confirming the effectiveness of the BiLSTM + ConvLSTM + WDL model in correcting WRF-forecasted tracks over short time scales.
In future research, we plan to validate the current methods across different regions by testing them on global or regional datasets to further assess the generalizability of our findings. Sensitivity analysis will also be a key focus; we intend to adjust the input data and examine the resulting outputs to evaluate the models’ stability and robustness, thereby optimizing their structure. Given the time-intensive nature of this task, it was not completed in the current study but will be prioritized in future research.
To enhance the accuracy of track predictions, we will explore more complex neural network architectures, particularly for long-term forecasts, to improve overall accuracy and stability. Different neural network architectures may be needed for specific geographical regions, as suggested by relevant literature [56]. Future studies could involve clustering paths into types and designing specialized networks for each type or developing distinct networks to correct longitude and latitude separately. In this study, the three-dimensional data primarily used geopotential height; however, future research could consider incorporating additional variables such as relative vorticity, temperature, and cloud cover into the datasets to construct more detailed three-dimensional models, further enhancing the accuracy and reliability of track forecasts.

Author Contributions

C.T. and Z.W. contributed to the conception and design of the study. Y.T., Y.H. and K.W. contributed to the analysis. Q.L. and J.Z. organized the database. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by “Research on the mechanisms and simulation of sea-land resources flow by using big data” (Grant No. 42130402); “The influence of various El Niño types on interannual sea-level variation in the South China Sea” (Grant No. 42176012); the National Key Research and Development Program of China (Grant No. 2021YFC3101702); and the Open Research Fund of the Key Laboratory of East China Sea Environmental Monitoring, State Oceanic Administration (Grant No. MEMRT2022ZX01).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The Best-Track Data and Reanalysis Data in this paper are publicly available. The Best-Track Data is available at the following URL: https://tcdata.typhoon.org.cn/zjljsjj.html, accessed on 19 June 2024. The ERA-Interim dataset can be accessed at the following URL: https://cds.climate.copernicus.eu/cdsapp#!/dataset/reanalysis-era5-pressure-levels?tab=form, accessed on 19 June 2024. Detailed NCEP FNL parameters and data processing methods can be found at the following URL: https://rda.ucar.edu/datasets/ds083.2/, accessed on 19 June 2024.

Acknowledgments

The authors are grateful to ECMWF, NCEP, and CMA for providing their datasets.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Mei, W.; Xie, S.-P.; Primeau, F.; McWilliams, J.C.; Pasquero, C. Northwestern Pacific typhoon intensity controlled by changes in ocean temperatures. Sci. Adv. 2015, 1, e1500014. [Google Scholar] [CrossRef]
  2. Zhang, C. Madden-julian oscillation. Rev. Geophys. 2005, 43, RG2003. [Google Scholar] [CrossRef]
  3. Kawai, H.; Hiraishi, T.; Kim, D.-S.; Kang, Y.-K.; Tomita, T. Hindcasting of storm surges in Korea by Typhoon 0314 (Maemi). In Proceedings of the ISOPE International Ocean and Polar Engineering Conference, Seoul, Republic of Korea, 19–24 June 2005; p. ISOPE–I-05-304. [Google Scholar]
  4. Chan, K.T.; Chan, J.C. Sensitivity of the simulation of tropical cyclone size to microphysics schemes. Adv. Atmos. Sci. 2016, 33, 1024–1035. [Google Scholar] [CrossRef]
  5. Aberson, S.D. Five-day tropical cyclone track forecasts in the North Atlantic basin. Weather Forecast. 1998, 13, 1005–1015. [Google Scholar] [CrossRef]
  6. He, C.; Zhi, X.; You, Q.; Song, B.; Fraedrich, K. Multi-model ensemble forecasts of tropical cyclones in 2010 and 2011 based on the Kalman Filter method. Meteorol. Atmos. Phys. 2015, 127, 467–479. [Google Scholar] [CrossRef]
  7. Kieu, C.; Minh, P.T.; Mai, H.T. An application of the multi-physics ensemble Kalman filter to typhoon forecast. Pure Appl. Geophys. 2014, 171, 1473–1497. [Google Scholar] [CrossRef]
  8. Song, H.-J.; Huh, S.-H.; Kim, J.-H.; Ho, C.-H.; Park, S.-K. Typhoon track prediction by a support vector machine using data reduction methods. In Proceedings of the Computational Intelligence and Security: International Conference, CIS 2005, Xi’an, China, 15–19 December 2005; Proceedings Part I. pp. 503–511. [Google Scholar]
  9. Jin, Q.; Fan, X.; Liu, J.; Xue, Z.; Jian, H. Estimating tropical cyclone intensity in the South China Sea using the XGBoost Model and FengYun Satellite images. Atmosphere 2020, 11, 423. [Google Scholar] [CrossRef]
  10. Xu, K.; Han, Z.; Xu, H.; Bin, L. Rapid prediction model for urban floods based on a light gradient boosting machine approach and hydrological–hydraulic model. Int. J. Disaster Risk Sci. 2023, 14, 79–97. [Google Scholar] [CrossRef]
  11. He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
  12. Szegedy, C.; Liu, W.; Jia, Y.; Sermanet, P.; Reed, S.; Anguelov, D.; Erhan, D.; Vanhoucke, V.; Rabinovich, A. Going deeper with convolutions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, 7–12 June 2015; pp. 1–9. [Google Scholar]
  13. LeCun, Y.; Bottou, L.; Bengio, Y.; Haffner, P. Gradient-based learning applied to document recognition. Proc. IEEE 1998, 86, 2278–2324. [Google Scholar] [CrossRef]
  14. Krizhevsky, A.; Sutskever, I.; Hinton, G.E. ImageNet classification with deep convolutional neural networks. Commun. ACM 2017, 60, 84–90. [Google Scholar] [CrossRef]
  15. Simonyan, K.; Zisserman, A. Very deep convolutional networks for large-scale image recognition. arXiv 2014, arXiv:1409.1556. [Google Scholar]
  16. Chen, R.; Wang, X.; Zhang, W.; Zhu, X.; Li, A.; Yang, C. A hybrid CNN-LSTM model for typhoon formation forecasting. GeoInformatica 2019, 23, 375–396. [Google Scholar] [CrossRef]
  17. Yuan, S.; Wang, C.; Mu, B.; Zhou, F.; Duan, W. Typhoon intensity forecasting based on LSTM using the rolling forecast method. Algorithms 2021, 14, 83. [Google Scholar] [CrossRef]
  18. Kim, S.; Kim, H.; Lee, J.; Yoon, S.; Kahou, S.E.; Kashinath, K.; Prabhat, M. Deep-hurricane-tracker: Tracking and forecasting extreme climate events. In Proceedings of the 2019 IEEE Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI, USA, 7–11 January 2019; pp. 1761–1769. [Google Scholar]
  19. Shi, X.; Chen, Z.; Wang, H.; Yeung, D.-Y.; Wong, W.-K.; Woo, W.-C. Convolutional LSTM network: A machine learning approach for precipitation nowcasting. Adv. Neural Inf. Process. Syst. 2015, 28. [Google Scholar]
  20. Peng, T.; Zhi, X.; Ji, Y.; Ji, L.; Tian, Y. Prediction skill of extended range 2-m maximum air temperature probabilistic forecasts using machine learning post-processing methods. Atmosphere 2020, 11, 823. [Google Scholar] [CrossRef]
  21. Ji, Y.; Zhi, X.; Ji, L.; Zhang, Y.; Hao, C.; Peng, T. Deep-learning-based post-processing for probabilistic precipitation forecasting. Front. Earth Sci. 2022, 10, 978041. [Google Scholar] [CrossRef]
  22. Siami-Namini, S.; Tavakoli, N.; Namin, A.S. The performance of LSTM and BiLSTM in forecasting time series. In Proceedings of the 2019 IEEE International Conference on Big Data (Big Data), Los Angeles, CA, USA, 9–12 December 2019; pp. 3285–3292. [Google Scholar]
  23. Deng, J.; Chen, X.; Jiang, R.; Song, X.; Tsang, I.W. St-norm: Spatial and temporal normalization for multi-variate time series forecasting. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining, Singapore, 14–18 August 2021; pp. 269–278. [Google Scholar]
  24. Cheng, H.-T.; Koc, L.; Harmsen, J.; Shaked, T.; Chandra, T.; Aradhye, H.; Anderson, G.; Corrado, G.; Chai, W.; Ispir, M. Wide & deep learning for recommender systems. In Proceedings of the 1st Workshop on Deep Learning for Recommender Systems, Boston, MA, USA, 15 September 2016; pp. 7–10. [Google Scholar]
  25. He, X.; Chua, T.-S. Neural factorization machines for sparse predictive analytics. In Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, Tokyo, Japan, 7–11 August 2017; pp. 355–364. [Google Scholar]
  26. Lian, J.; Zhou, X.; Zhang, F.; Chen, Z.; Xie, X.; Sun, G. xdeepfm: Combining explicit and implicit feature interactions for recommender systems. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, London, UK, 19–23 August 2018; pp. 1754–1763. [Google Scholar]
  27. Hodson, T.O.; Over, T.M.; Foks, S.S. Mean squared error, deconstructed. J. Adv. Model. Earth Syst. 2021, 13, e2021MS002681. [Google Scholar] [CrossRef]
  28. Lyu, Y.; Zhi, X.; Wu, H.; Zhou, H.; Kong, D.; Zhu, S.; Zhang, Y.; Hao, C. Analyses on the multimodel wind forecasts and error decompositions over North China. Atmosphere 2022, 13, 1652. [Google Scholar] [CrossRef]
  29. Ying, M.; Zhang, W.; Yu, H.; Lu, X.; Feng, J.; Fan, Y.; Zhu, Y.; Chen, D. An overview of the China Meteorological Administration tropical cyclone database. J. Atmos. Ocean. Technol. 2014, 31, 287–301. [Google Scholar] [CrossRef]
  30. Lu, X.; Yu, H.; Ying, M.; Zhao, B.; Zhang, S.; Lin, L.; Bai, L.; Wan, R. Western North Pacific tropical cyclone database created by the China Meteorological Administration. Adv. Atmos. Sci. 2021, 38, 690–699. [Google Scholar] [CrossRef]
  31. Srivastava, A.K.; Ullrich, P.A.; Rastogi, D.; Vahmani, P.; Jones, A.; Grotjahn, R. Assessment of WRF (v 4.2.1) dynamically downscaled precipitation on subdaily and daily timescales over CONUS. Geosci. Model Dev. 2023, 16, 3699–3722. [Google Scholar] [CrossRef]
  32. Zhang, H.-b.; Zhi, X.-f.; Chen, J.; Wang, Y.-n.; Wang, Y. Study of the modification of multi-model ensemble schemes for tropical cyclone forecasts. J. Trop. Meteorol. 2015, 21, 389. [Google Scholar]
  33. Khatri, W.D.; Xiefei, Z.; Ling, Z. Interannual and Interdecadal Variations in Tropical Cyclone Activity over the Arabian Sea and the Impacts over Pakistan. In High-Impact Weather Events over the SAARC Region; Springer: Berlin/Heidelberg, Germany, 2014; pp. 129–145. [Google Scholar]
  34. Xu, G.; Xian, D.; Fournier-Viger, P.; Li, X.; Ye, Y.; Hu, X. AM-ConvGRU: A spatio-temporal model for typhoon path prediction. Neural Comput. Appl. 2022, 34, 5905–5921. [Google Scholar] [CrossRef]
  35. Moon, J.; Park, J.; Cha, D.-H.; Moon, Y. Five-day track forecast skills of WRF model for the western North Pacific tropical cyclones. Weather Forecast. 2021, 36, 1491–1503. [Google Scholar] [CrossRef]
  36. Hong, S.-Y.; Lim, J.-O.J. The WRF single-moment 6-class microphysics scheme (WSM6). Asia-Pac. J. Atmos. Sci. 2006, 42, 129–151. [Google Scholar]
  37. Mlawer, E.J.; Taubman, S.J.; Brown, P.D.; Iacono, M.J.; Clough, S.A. Radiative transfer for inhomogeneous atmospheres: RRTM, a validated correlated-k model for the longwave. J. Geophys. Res. Atmos. 1997, 102, 16663–16682. [Google Scholar] [CrossRef]
  38. Dudhia, J. Numerical study of convection observed during the winter monsoon experiment using a mesoscale two-dimensional model. J. Atmos. Sci. 1989, 46, 3077–3107. [Google Scholar] [CrossRef]
  39. Kain, J.S. The Kain–Fritsch convective parameterization: An update. J. Appl. Meteorol. 2004, 43, 170–181. [Google Scholar] [CrossRef]
  40. Hong, S.-Y.; Noh, Y.; Dudhia, J. A new vertical diffusion package with an explicit treatment of entrainment processes. Mon. Weather Rev. 2006, 134, 2318–2341. [Google Scholar] [CrossRef]
  41. Dudhia, J. A multi-layer soil temperature model for MM5. In Proceedings of the Preprints, The Sixth PSU/NCAR Mesoscale Model Users’ Workshop, Boulder, CO, USA, 22–24 July 1996; pp. 22–24. [Google Scholar]
  42. Feser, F.; von Storch, H. A dynamical downscaling case study for typhoons in Southeast Asia using a regional climate model. Mon. Weather Rev. 2008, 136, 1806–1815. [Google Scholar] [CrossRef]
  43. Ji, L.; Zhi, X.; Zhu, S. Probabilistic Forecasts of 1–15-day 500 hPa Geopotential Height over Northern Hemisphere based on Bayesian Model Averaging. In Proceedings of the EGU General Assembly Conference Abstracts, Vienna, Austria, 8–13 April 2018; p. 4263. [Google Scholar]
  44. Karney, C.F. Algorithms for geodesics. J. Geod. 2013, 87, 43–55. [Google Scholar] [CrossRef]
  45. Qin, W.; Tang, J.; Lao, S. DeepFR: A trajectory prediction model based on deep feature representation. Inf. Sci. 2022, 604, 226–248. [Google Scholar] [CrossRef]
  46. Liu, Z.; Hao, K.; Geng, X.; Zou, Z.; Shi, Z. Dual-branched spatio-temporal fusion network for multihorizon tropical cyclone track forecast. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2022, 15, 3842–3852. [Google Scholar] [CrossRef]
  47. Rajini Selvaraj, A.; Gurusamy, T. An optimal model using single-dimensional CAE-IRNN based SPOA for cyclone track prediction. Expert Syst. Appl. 2023, 230, 120437. [Google Scholar] [CrossRef]
  48. Qiao, B.; Wu, J.; Wang, R.; Hao, Y.; Wang, P.; Han, D.; Wu, G. A parallel feature selection method based on NMI-XGBoost and distance correlation for typhoon trajectory prediction. J. Supercomput. 2024, 80, 11293–11321. [Google Scholar] [CrossRef]
  49. Rüttgers, M.; Jeon, S.; Lee, S.; You, D. Prediction of typhoon track and intensity using a generative adversarial network with observational and meteorological data. IEEE Access 2022, 10, 48434–48446. [Google Scholar] [CrossRef]
  50. Lu, P.; Xu, M.; Sun, A.; Wang, Z.; Zheng, Z. Typhoon tracks prediction with convlstm fused reanalysis data. Electronics 2022, 11, 3279. [Google Scholar] [CrossRef]
  51. Wang, P.; Wang, P.; Wang, C.; Xue, B.; Wang, D. Using a 3D convolutional neural network and gated recurrent unit for tropical cyclone track forecasting. Atmos. Res. 2022, 269, 106053. [Google Scholar] [CrossRef]
  52. Tan, J.; Chen, S.; Wang, J. Western North Pacific tropical cyclone track forecasts by a machine learning model. Stoch. Environ. Res. Risk Assess. 2021, 35, 1113–1126. [Google Scholar] [CrossRef]
  53. Xia, Y.; Chen, J.; Zhi, X.; Chen, L.; Zhao, Y.; Liu, X. Impact of model bias correction on a hybrid data assimilation system. J. Meteorol. Res. 2020, 34, 400–412. [Google Scholar] [CrossRef]
  54. Kim, K.; Yoon, D.; Cha, D.-H.; Im, J. Improved Tropical Cyclone Track Simulation over the Western North Pacific using the WRF Model and a Machine Learning Method. Asia-Pac. J. Atmos. Sci. 2023, 59, 283–296. [Google Scholar] [CrossRef]
  55. Song, T.; Li, Y.; Meng, F.; Xie, P.; Xu, D. A novel deep learning model by Bigru with attention mechanism for tropical cyclone track prediction in the Northwest Pacific. J. Appl. Meteorol. Climatol. 2022, 61, 3–12. [Google Scholar] [CrossRef]
  56. Dueben, P.D.; Bauer, P. Challenges and design choices for global weather and climate models based on machine learning. Geosci. Model Dev. 2018, 11, 3999–4009. [Google Scholar] [CrossRef]
Figure 1. A visualization of all forecasted typhoons. Adapted from the figure by Xu [34] et al.
Figure 1. A visualization of all forecasted typhoons. Adapted from the figure by Xu [34] et al.
Atmosphere 15 01125 g001
Figure 2. The three-dimensional time-series structure of a typhoon.
Figure 2. The three-dimensional time-series structure of a typhoon.
Atmosphere 15 01125 g002
Figure 3. Methodological Workflow Diagram.
Figure 3. Methodological Workflow Diagram.
Atmosphere 15 01125 g003
Figure 4. Typhoon Track Prediction Model Based on the Wide and Deep Framework.
Figure 4. Typhoon Track Prediction Model Based on the Wide and Deep Framework.
Atmosphere 15 01125 g004
Figure 5. Typhoon Track Prediction Model Based on the Neural Factorization Machine Framework.
Figure 5. Typhoon Track Prediction Model Based on the Neural Factorization Machine Framework.
Atmosphere 15 01125 g005
Figure 6. Typhoon Track Prediction Model Based on the Extreme Deep Factorization Machine Framework.
Figure 6. Typhoon Track Prediction Model Based on the Extreme Deep Factorization Machine Framework.
Atmosphere 15 01125 g006
Figure 7. Comparative Analysis of Typhoon TPE and Latitude–Longitude RMSE at Different Integration Times in WRF Forecast Results.
Figure 7. Comparative Analysis of Typhoon TPE and Latitude–Longitude RMSE at Different Integration Times in WRF Forecast Results.
Atmosphere 15 01125 g007
Figure 8. Temporal Trends of WRF-Forecasted Typhoon TPE from 2000 to 2022.
Figure 8. Temporal Trends of WRF-Forecasted Typhoon TPE from 2000 to 2022.
Atmosphere 15 01125 g008
Figure 9. Scatter Plot Comparing Latitude and Longitude of WRF Forecasts with Best Track at Different Integration Times; the red line indicates the regression line, reflecting the linear relationship between observed and predicted values. (ac) represent scatter plots for 72 h, 48 h, and 24 h Integration Times along the longitude, and (a1c1) represent the same along the latitude.
Figure 9. Scatter Plot Comparing Latitude and Longitude of WRF Forecasts with Best Track at Different Integration Times; the red line indicates the regression line, reflecting the linear relationship between observed and predicted values. (ac) represent scatter plots for 72 h, 48 h, and 24 h Integration Times along the longitude, and (a1c1) represent the same along the latitude.
Atmosphere 15 01125 g009
Figure 10. Comparative Analysis of Overall MSE ((a,a1), Unit: °2), Bias2 ((b,b1), Unit: °2), Distribution ((c,c1), Unit: °2), and Sequence ((d,d1), Unit: °2) for Latitude and Longitude Directions Across Different Models on the 2018–2022 Test Set. The top panels show metrics for the latitude direction, while the bottom panels show metrics for the longitude direction. The models labeled in the figure are BiLSTM + ConvGRU + WDL (A), BiLSTM + ConvLSTM + WDL (B), WRF (C), BiLSTM + ConvLSTM + xDeepFM (D), and Kalman Filter (E). In each subplot, the curves of different colors correspond to 72-h, 48-h, and 24-h integration times. (ad) represent latitudinal variables, and (a1d1) represent longitudinal variables.
Figure 10. Comparative Analysis of Overall MSE ((a,a1), Unit: °2), Bias2 ((b,b1), Unit: °2), Distribution ((c,c1), Unit: °2), and Sequence ((d,d1), Unit: °2) for Latitude and Longitude Directions Across Different Models on the 2018–2022 Test Set. The top panels show metrics for the latitude direction, while the bottom panels show metrics for the longitude direction. The models labeled in the figure are BiLSTM + ConvGRU + WDL (A), BiLSTM + ConvLSTM + WDL (B), WRF (C), BiLSTM + ConvLSTM + xDeepFM (D), and Kalman Filter (E). In each subplot, the curves of different colors correspond to 72-h, 48-h, and 24-h integration times. (ad) represent latitudinal variables, and (a1d1) represent longitudinal variables.
Atmosphere 15 01125 g010
Figure 11. Spatial Distribution Comparison of MSE (Unit: °2) for Latitude and Longitude Directions Across BiLSTM + ConvGRU + WDL, BiLSTM + ConvLSTM + WDL, BiLSTM + ConvLSTM + DeepFM, WRF, and Kalman Filter on the 2018–2022 Typhoon Test Set. The (top panels) show MSE distribution for latitude, while the (bottom panels) show MSE distribution for longitude.
Figure 11. Spatial Distribution Comparison of MSE (Unit: °2) for Latitude and Longitude Directions Across BiLSTM + ConvGRU + WDL, BiLSTM + ConvLSTM + WDL, BiLSTM + ConvLSTM + DeepFM, WRF, and Kalman Filter on the 2018–2022 Typhoon Test Set. The (top panels) show MSE distribution for latitude, while the (bottom panels) show MSE distribution for longitude.
Atmosphere 15 01125 g011
Figure 12. Spatial Distribution Comparison of Bias2 (Unit: °2) for Latitude and Longitude Directions Across BiLSTM + ConvGRU + WDL, BiLSTM + ConvLSTM + WDL, BiLSTM + ConvLSTM + DeepFM, WRF, and Kalman Filter on the 2018–2022 Typhoon Test Set. The (top panels) show Bias2 distribution for latitude, while the (bottom panels) show Bias2 distribution for longitude.
Figure 12. Spatial Distribution Comparison of Bias2 (Unit: °2) for Latitude and Longitude Directions Across BiLSTM + ConvGRU + WDL, BiLSTM + ConvLSTM + WDL, BiLSTM + ConvLSTM + DeepFM, WRF, and Kalman Filter on the 2018–2022 Typhoon Test Set. The (top panels) show Bias2 distribution for latitude, while the (bottom panels) show Bias2 distribution for longitude.
Atmosphere 15 01125 g012
Figure 13. Spatial Distribution Comparison of Distribution (Unit: °2) for Latitude and Longitude Directions Across BiLSTM + ConvGRU + WDL, BiLSTM + ConvLSTM + WDL, BiLSTM + ConvLSTM + DeepFM, WRF, and Kalman Filter on the 2018–2022 Typhoon Test Set. The (top panels) show the distribution metric for latitude, while the (bottom panels) show the distribution metric for longitude.
Figure 13. Spatial Distribution Comparison of Distribution (Unit: °2) for Latitude and Longitude Directions Across BiLSTM + ConvGRU + WDL, BiLSTM + ConvLSTM + WDL, BiLSTM + ConvLSTM + DeepFM, WRF, and Kalman Filter on the 2018–2022 Typhoon Test Set. The (top panels) show the distribution metric for latitude, while the (bottom panels) show the distribution metric for longitude.
Atmosphere 15 01125 g013
Figure 14. Spatial Distribution Comparison of Sequence (Unit: °2) for Latitude and Longitude Directions Across BiLSTM + ConvGRU + WDL, BiLSTM + ConvLSTM + WDL, BiLSTM + ConvLSTM + DeepFM, WRF, and Kalman Filter on the 2018–2022 Typhoon Test Set. The (top panels) show the sequence metric for latitude, while the (bottom panels) show the sequence metric for longitude.
Figure 14. Spatial Distribution Comparison of Sequence (Unit: °2) for Latitude and Longitude Directions Across BiLSTM + ConvGRU + WDL, BiLSTM + ConvLSTM + WDL, BiLSTM + ConvLSTM + DeepFM, WRF, and Kalman Filter on the 2018–2022 Typhoon Test Set. The (top panels) show the sequence metric for latitude, while the (bottom panels) show the sequence metric for longitude.
Atmosphere 15 01125 g014
Figure 15. Comparison of Track Prediction Performance in Latitude and Longitude Directions Between the WRF Model and Correction Models (BiLSTM + ConvLSTM + WDL and BiLSTM + ConvGRU + WDL) on the 2018–2022 Typhoon Test Set. The red line indicates the regression line, reflecting the linear relationship between observed and predicted values; the red shaded area represents the 95% confidence interval of the regression line.
Figure 15. Comparison of Track Prediction Performance in Latitude and Longitude Directions Between the WRF Model and Correction Models (BiLSTM + ConvLSTM + WDL and BiLSTM + ConvGRU + WDL) on the 2018–2022 Typhoon Test Set. The red line indicates the regression line, reflecting the linear relationship between observed and predicted values; the red shaded area represents the 95% confidence interval of the regression line.
Atmosphere 15 01125 g015
Figure 16. Path Comparison of Typhoon In-fa (2021), Showing the Differences Between the WRF, BiLSTM + ConvLSTM + WDL, and BiLSTM + ConvGRU + WDL Methods and Historical Typhoon Tracks. Adapted from the figure by Xu [34] et al.
Figure 16. Path Comparison of Typhoon In-fa (2021), Showing the Differences Between the WRF, BiLSTM + ConvLSTM + WDL, and BiLSTM + ConvGRU + WDL Methods and Historical Typhoon Tracks. Adapted from the figure by Xu [34] et al.
Atmosphere 15 01125 g016
Figure 17. Path Comparison of Typhoon Chanthu (2021), Showing the Differences Between the WRF, BiLSTM + ConvLSTM + WDL, and BiLSTM + ConvGRU + WDL Methods and Historical Typhoon Tracks. Adapted from the figure by Xu [34] et al.
Figure 17. Path Comparison of Typhoon Chanthu (2021), Showing the Differences Between the WRF, BiLSTM + ConvLSTM + WDL, and BiLSTM + ConvGRU + WDL Methods and Historical Typhoon Tracks. Adapted from the figure by Xu [34] et al.
Atmosphere 15 01125 g017
Table 1. Characteristics of the two-dimensional dataset, using the WRF model with a 72-h integration time as an example.
Table 1. Characteristics of the two-dimensional dataset, using the WRF model with a 72-h integration time as an example.
IDFeature NameDescription
1–5 LONG 0 , LONG 6 , LONG 12 , LONG 18 , LONG 24 Longitude in the last 24 h
6–10 LONG 0 , LONG 6 , LONG 12 , LONG 18 , LONG 24 Longitude in the last 24 h
11–15 WND 0 , WND 6 , WND 12 , WND 18 , WND 24 Wind speed in the last 24 h
16 MONTH Current month
17–20 LAT 0 LAT 6 , LAT 6 LAT 12 , LAT 12 LAT 18 , LAT 18 LAT 24 1st-order difference in historical latitude
25–28 WND 0 WND 6 , WND 6 WND 12 , WND 12 WND 18 , WND 18 WND 24 1st-order difference in historical wind speed
29 i = 0 3 ( LAT 6 i LAT 6 ( i + 1 ) ) 2 Sum of squares of 1st-order latitude difference
30 i = 0 3 ( LONG 6 i LONG 6 ( i + 1 ) ) 2 Sum of squares of 1st-order longitude difference
31 i = 0 3 ( LAT 6 i LAT 6 ( i + 1 ) ) 2 Square root of feature 29
32 i = 0 3 ( LONG 6 i LONG 6 ( i + 1 ) ) 2 Square root of feature 30
33–34 LAT 0 , LONG 0 Square root of current latitude and longitude
35–38 Acc ( LOC 6 , LOC 0 ) , Acc ( LOC 12 , LOC 6 ) , Acc ( LOC 18 , LOC 12 ) , Acc ( LOC 24 , LOC 18 ) Physical acceleration of historical location
39–42 Angle ( LOC 0 , 0 °   N ) , Angle ( LOC 6 , 0 °   N ) , Angle ( LOC 12 , 0 °   N ) , Angle ( LOC 18 , 0 °   N ) Zonal angle
43–46 Angle ( LOC 0 , 0 °   E ) , Angle ( LOC 6 , 0 °   E ) , Angle ( LOC 12 , 0 °   E ) , Angle ( LOC 18 , 0 °   E ) Meridional angle
47–50 Angle ( LOC 0 , LOC 6 ) , Angle ( LOC 6 , LOC 12 ) , Angle ( LOC 12 , LOC 18 ) , Angle ( LOC 18 , LOC 24 ) Angle of historical location
51–53 Angle ( PATH 0 , 6 , PATH 6 , 12 ) , Angle ( PATH 6 , 12 , PATH 12 , 18 ) Angle of historical path
54–56 LAT ( WRF ) 24 , LAT ( WRF ) 48 , LAT ( WRF ) 72 TC center latitude forecasted by WRF (Integration time of 72 h) after 24 h, 48 h, 72 h
57–59 LON ( WRF ) 24 , LON ( WRF ) 48 , LON ( WRF ) 72 TC center longitude forecasted by WRF (Integration time of 72 h) after 24 h, 48 h, 72 h
Table 2. Comparison of TPE (km) Across Different Models at 72 h, 48 h, and 24 h on the 2018–2022 Test Set.
Table 2. Comparison of TPE (km) Across Different Models at 72 h, 48 h, and 24 h on the 2018–2022 Test Set.
Method72 h48 h24 h
WRF255.18236.4194.80
BiLSTM207.67150.01112.86
BiLSTM (TN = False)428.72156.49126.42
Linear305.13276.10145.79
Linear (TN = False)328.61423.81305.58
GRU244.15212.43182.26
GRU (TN = False)362.08347.98230.93
Transformer394.74331.51186.14
Transformer (TN = False)425.48398.28239.78
Bold values represent the best result.
Table 3. Comparison of TPE (km) Across Different Models at 72 h, 48 h, and 24 h Integration Times on the 2018–2022 Test Set.
Table 3. Comparison of TPE (km) Across Different Models at 72 h, 48 h, and 24 h Integration Times on the 2018–2022 Test Set.
Method72 h48 h24 h
WRF255.18236.4194.80
BiLSTM + ConvGRU + WDL186.87174.5981.02
BiLSTM + ConvGRU + NFM198.22186.3888.45
BiLSTM + ConvGRU + xDeepFM905.10700.69691.30
BiLSTM + ConvLSTM + WDL159.23148.2575.31
BiLSTM + ConvLSTM + NFM1143.481004.39911.28
BiLSTM + ConvLSTM + xDeepFM822.03693.49648.87
SVM667.09331.50323.73
XGBoost258.86252.44243.87
LightGBM226.75212.09204.81
Kalman Filter603.09568.95543.38
Bold values represent the best result.
Table 4. Performance Comparison of Models in Longitude Direction Based on MBE (°), RMSE (°), and R2 Across Multiple Time Scales on the 2018–2022 Test Set.
Table 4. Performance Comparison of Models in Longitude Direction Based on MBE (°), RMSE (°), and R2 Across Multiple Time Scales on the 2018–2022 Test Set.
Evaluation MetricsMBERMSER2
Integration Times72 h48 h24 h72 h48 h24 h72 h48 h24 h
WRF−0.8580−0.8234−0.27916.59236.12432.23500.88500.89690.9857
BiLSTM−0.1757−0.2150−0.06413.56862.61312.24500.96630.98130.9856
BiLSTM (BN = False)0.4024−0.1068−0.47946.08292.65331.81890.90210.98070.9905
Linear−0.91870.17250.11343.72123.51131.66340.96330.96620.9921
Linear (BN = False)−0.1158−0.2076−0.33264.82704.85973.59600.93830.93530.9631
GRU−0.2162−0.3655−0.35483.37372.98782.07650.96990.97550.9877
GRU (BN = False)0.2917−0.25310.04164.81374.32122.89440.93870.94880.9761
Transformer−0.9189−0.9964−0.39405.04854.64802.80760.93260.94080.9775
Transformer (BN = False)−0.53680.0071−0.40095.40694.95313.55010.92270.93280.9641
BiLSTM + ConvGRU + WDL−0.3510−0.25970.06432.96872.94771.15590.97660.97620.9961
BiLSTM + ConvGRU + NFM0.03870.0028−0.19313.39893.14121.66460.96940.97290.9921
BiLSTM + ConvGRU + xDeepFM−6.46612.06341.386911.41319.04808.92760.65550.77580.7731
BiLSTM + ConvLSTM + WDL−0.1654−0.26610.04542.70002.75491.10390.98070.97920.9965
BiLSTM + ConvLSTM + NFM−10.2760−2.5333−5.438512.817110.26719.75580.56560.71130.7290
BiLSTM + ConvLSTM + xDeepFM4.97561.86171.911410.92628.60667.67740.68430.79710.8322
SVM−0.1267−0.3982−0.34857.45544.30494.30010.85710.94940.9472
XGBoost−0.5818−0.4998−0.38434.00854.01673.95380.95870.95600.9554
LightGBM−0.4862−0.4172−0.35553.70293.52013.53420.96470.96620.9643
Kalman Filter−0.2304−0.1806−0.16168.27217.76757.32440.82410.83550.8470
Bold values represent the best result.
Table 5. Performance Comparison of Models in Latitude Direction Based on MBE (°), RMSE (°), and R2 Across Multiple Time Scales on the 2018–2022 Test Set.
Table 5. Performance Comparison of Models in Latitude Direction Based on MBE (°), RMSE (°), and R2 Across Multiple Time Scales on the 2018–2022 Test Set.
Evaluation MetricsMBERMSER2
Integration Times72 h48 h24 h72 h48 h24 h72 h48 h24 h
WRF−0.5869−0.05470.02042.19272.35861.41540.95710.94830.9804
BiLSTM−0.4274−0.1121−0.10451.88030.99561.03370.96840.99080.9895
BiLSTM (BN = False)−0.8436−0.0706−0.04353.45791.08080.92700.89330.98910.9916
Linear−0.4207−0.2008−0.20432.04601.75981.32990.96260.97130.9827
Linear (BN = False)−0.6643−0.4310−0.09422.44242.63941.79880.94680.93550.9684
GRU−0.6968−0.3578−0.45741.97961.45121.53080.96500.98050.9771
GRU (BN = False)−0.6137−0.8096−0.50251.97772.40241.48170.96510.94660.9786
Transformer−1.3834−0.4780−0.34523.30382.65581.78010.90260.93470.9691
Transformer (BN = False)−0.95240.0587−0.26393.11703.30441.76110.91330.89890.9698
BiLSTM + ConvGRU + WDL−0.4735−0.0644−0.12871.44861.25870.90610.98120.98530.9920
BiLSTM + ConvGRU + NFM−0.2782−0.1518−0.11821.55191.38001.01890.97850.98230.9898
BiLSTM + ConvGRU + xDeepFM−1.14321.36220.41674.25742.95653.44870.83830.91910.8841
BiLSTM + ConvLSTM + WDL−0.3333−0.1077−0.05851.25741.08090.86620.98590.98910.9926
BiLSTM + ConvLSTM + NFM−2.2929−3.01050.33504.53786.90314.89510.81640.55910.7666
BiLSTM + ConvLSTM + xDeepFM1.94330.67720.39623.94123.05282.97180.86150.91370.9140
SVM−1.3585−0.5819−0.53074.91842.85272.84420.79140.92480.9213
XGBoost−0.7504−0.6587−0.55672.83112.67302.59290.93090.93400.9346
LightGBM−0.6070−0.5169−0.41212.60812.35942.30570.94130.94850.9483
Kalman Filter−0.0386−0.0301−0.01554.54484.36184.23700.82190.82420.8255
Bold values represent the best result.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Tao, C.; Wang, Z.; Tian, Y.; Han, Y.; Wang, K.; Li, Q.; Zuo, J. Calibration of Typhoon Track Forecasts Based on Deep Learning Methods. Atmosphere 2024, 15, 1125. https://doi.org/10.3390/atmos15091125

AMA Style

Tao C, Wang Z, Tian Y, Han Y, Wang K, Li Q, Zuo J. Calibration of Typhoon Track Forecasts Based on Deep Learning Methods. Atmosphere. 2024; 15(9):1125. https://doi.org/10.3390/atmos15091125

Chicago/Turabian Style

Tao, Chengchen, Zhizu Wang, Yilun Tian, Yaoyao Han, Keke Wang, Qiang Li, and Juncheng Zuo. 2024. "Calibration of Typhoon Track Forecasts Based on Deep Learning Methods" Atmosphere 15, no. 9: 1125. https://doi.org/10.3390/atmos15091125

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop