Next Article in Journal
An Integrated Approach for the Simulation Modeling and Risk Assessment of Coastal Flooding
Previous Article in Journal
Environmental Fate and Effects of Foaming Agents Containing Sodium Lauryl Ether Sulphate in Soil Debris from Mechanized Tunneling
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Probabilistic Prediction of Significant Wave Height Using Dynamic Bayesian Network and Information Flow

College of Meteorology and Oceanography, National University of Defense Technology, Nanjing 211101, China
*
Author to whom correspondence should be addressed.
Water 2020, 12(8), 2075; https://doi.org/10.3390/w12082075
Submission received: 22 June 2020 / Revised: 15 July 2020 / Accepted: 16 July 2020 / Published: 22 July 2020
(This article belongs to the Section Oceans and Coastal Zones)

Abstract

:
Short-term prediction of wave height is paramount in oceanic operation-related activities. Statistical models have advantages in short-term wave prediction as complex physical process is substantially simplified. However, previous statistical models have no consideration in selection of predictive variables and dealing with prediction uncertainty. This paper develops a machine learning model by combining the dynamic Bayesian network (DBN) with the information flow (IF) designated as DBN-IF. IF is focused on selecting the best predictive variables for DBN by causal analysis instead of correlation analysis. DBN for probabilistic prediction is constructed by structure learning and parameter learning with data mining. Based on causal theory, graph theory, and probability theory, the proposed DBN-IF model could deal with the uncertainty and shows great performance in significant wave height prediction compared with the artificial neural network (ANN), random forest (RF) and support vector machine (SVM) for all lead times. The interpretable DBN-IF is proven as a promising tool for nonlinear and uncertain wave height prediction.

1. Introduction

The intense increase in various ocean engineering has spurred an interest in accurate prediction of wave characteristics, especially significant wave height. Wave height prediction is important for planning offshore engineering, such as exploitation of marine renewable energy, harbor constructions, and marine operations. Besides studies on the long-term wave height prediction [1,2], real-time and fast forecasting also plays a crucial role in offshore operations, which has received considerable critical attention.
Waves are more formidable to characterize than other ocean variables like tides, owing to their stochastic nature. The physical process of wave generation is basically uncertain, nonlinear, and non-stationary, which makes wave height prediction difficult. In the last few decades, many studies have been devoted to this issue, developing a number of forecasting approaches. All of those fall into two major types: energy balance equation-based numerical models and statistical models.
Numerical models for wave height prediction are based on energy balance equations. According to various components of the source functions in equations, wave numerical models are classified into three generations [3]. Familiar models, including Wave Analysis Model (WAM), Simulating WAves Nearshore (SWAN), and WAVEWATCH, have been widely applied in actual wave height forecasting [4,5]. Numerical models are generally employed to predict waves over a large spatial and temporal domain, beneficial to natural disaster predicting, maritime activity planning, etc. Due to consideration of complex dynamic process of waves, numerical predictions are mostly accurate; nevertheless, they are costly and time-consuming, which is the greatest challenge for real-time and fast wave prediction when carrying out maritime operations, especially for feedback control application problems. Furthermore, numerical models have low generalization ability and it is necessary to reset the boundary conditions and re-run the ocean wave models when implemented in different regions. In follow-up studies, despite considerable advances in computational techniques, wave numerical models cannot easily generalize over various sites and times efficiently because of the complex parameters and high computational complexity.
In order to deal with the above problems, an alternative data-driven method has been developed by many scholars to predict wave height, that is the time series-based statistical model. The greatest advantage of statistical models in competition with numerical models is less required computational costs. A number of studies have been published on this topic.
Early research employed classical time series models for wave height prediction, including Auto Regressive model (AR), Auto-Regressive Moving Average model (ARMA), Auto-Regressive Integrated Moving Average model (ARIMA), and Kalman Filter model (KF), etc. The above models have been widely used to predict time series of wave height in short-term periods [6,7,8]. Classical time series models run fast and are easy to understand, but have limited ability to capture nonlinearity and non-stationarity of wave height time series because of their linear and stationary assumptions [9]. In addition, only wave height is used as input variable in those models, without consideration of effects from other relevant variables such as wind, pressure, and temperature, which conflicts with the physical process of waves.
Later, with increasing the number of real observations over the past dozen years, machine learning (ML)-based approaches such as artificial neural network (ANN), support vector machine (SVM), and random forest (RF) became popular for wave height prediction. ANN was first used by Deo [10] to obtain a direct prediction of wave height with a simple network. Then all kinds of ANNs with different input parameters (predictive variables) and the number of hidden neurons have been used for forecasting wave height [11]. Hybridization of ANNs with various techniques, such as wavelet decomposition (WD), fuzzy logic and genetic programming, have also been tried by Ozger [12] and Shahabi [13]. Meanwhile, SVM and RF have been adopted to predict nonlinear time series of wave height [14,15,16]. Deep learning algorithms, such as convolutional neural network (CNN) and recurrent neural network (RNN), have been initially applied to wave prediction [17,18]. Besides simplifying the tedious and intricate calculations, ML-based approaches are capable of performing nonlinear modeling without a priori knowledge about the input and output variables. More importantly, those approaches take into full account the relationships between wave height and other meteorological and oceanographic variables. Evidence suggests that ML-based approaches are superior to classical time series models and perform better than numerical models for shorter interval prediction. ML algorithms are more appropriate for short-term and fast wave height prediction.
In spite of those huge studies on application of ML algorithms, there are little, to our knowledge, previous studies focused on selection of the best predictive variables for an accurate prediction [19]. As we all know, predictors play a vital role in prediction model. This problem is usually known in the ML community as Feature Selection [20]. Avrim [21] and Salcedo [22] pointed out that irrelevant variables, used as part of a training procedure in a regression machine, can unnecessarily increase the cost and running time of a prediction system, as well as degrade its generalization performance. At present, correlation analysis is still the primary tool for predictor selection in meteorology and oceanography [23,24]. This is unfortunate, as there has been strong argument in causal theory against using correlation analysis for this purpose. Liang [25] explained that two variables with a strong correlation did not necessarily have a strong causality. In other words, strong causality definitely leads to strong correlation, but strong correlation may not lead to strong causality. It is necessary to develop a more reasonable method for predictor selection. Causal analysis may be more suitable in prediction modeling [26].
For another, though the relationships between wave height and relevant variables are taken into consideration in ML-based models, it is difficult to interpret those relationships using an ANN, RF, or SVM model. This problem is also known as “Black Box” in neural network [27]. ML-based models learn straightforward mappings between input and output variables; however, the mappings are invisible, and it is unknown how the input affects the output. The interpretability of ML-based models is still under exploration [19]. Additionally, predictions of both numerical models and statistical models are certain, that is obtaining a certain predictand at one future time. Nevertheless, the wave height is influenced by many meteorological and oceanographic factors, which is fluctuant and uncertain. Certain prediction cannot express the credibility, which causes difficulty for decision-making in marine operations.
To improve the mentioned defects in forecasting approaches, we introduce the information flow (IF) theory and dynamic Bayesian network (DBN) to propose a novel intelligent prediction model (DBN-IF) for accurate significant wave height prediction. IF theory, put forward by Liang [28], is a novel causal analysis method. Additionally, he has applied causal IF to select the best predictors of tropical cyclone forecasting and compared with correlation analysis. The results show that predictors based on IF led to more accurate prediction. The emerging DBN is an improvement of Bayesian network (BN), a representative of the uncertain artificial intelligence. Based on graph theory and probability theory, DBN can not only visualize the relationships among network nodes but also quantitatively express the interactions with probability distributions. Consequently, DBN is capable of dealing with “Black Box” and uncertainty in wave height prediction. In recent years, DBN was initially applied in engineering problems [29,30,31]. As far as we can see, there is no research about DBN for wave height prediction. We will combine DBN with IF for wave height forecasting, proposing the DBN-IF model. Comparative experiments are conducted by using measured data from buoys maintained by the National Data Buoy Center (NDBC). The results reveal the effects of different predictors on prediction accuracy and consistently show the superiority of DBN-IF to other models in uncertain, nonlinear, and non-stationary wave prediction.
The rest of the paper is organized as follows: Section 2 presents the theoretical formulations and implementation schemes of the DBN-IF model. Performance of the proposed model in significant wave height prediction and results analysis are elaborated in Section 3. Section 4 concludes the present studies.

2. Theoretical Explanation

In this section, brief instructions of dynamic Bayesian network (DBN) and information flow (IF) are presented along with the formulation of the proposed prediction technique (DBN-IF).

2.1. Dynamic Bayesian Network

Bayesian network (BN) was first proposed by Judea Pearl [32], including the classical Bayesian network (CBN) and the dynamic Bayesian network (DBN), whose theoretical basis is graph theory and probability theory. DBN is an improvement of CBN, which integrates the time dimension into CBN to explain the temporal causality. Therefore, DBN is a dynamic reasoning model with an ability of probabilistic analysis and prediction of temporal information.
According to Bayesian theory, BN is a directed acyclic graph expressing the causal relationship among variables, which is composed of nodes, directed arcs, and conditional probability distribution tables (CPTs). The nodes represent the variables; the arcs represent the causal relationships (cause-to-effect); CPTs express the strength of the causality quantitatively. DBN is an extension of CBN in the time dimension, and could be explained by a bigram < B 0 , B > :
  • B 0 denotes the initial network, that is the CBN in each time slice. It contains the network structure and CPTs of nodes at the same time;
  • B denotes the transition network, which contains the structural arcs and the transition probability distribution of nodes in contiguous time slices.
Define a variable set X = ( X 1 , X 2 , , X n ) and a finite time segment ( 0 , 1 , , T ) , then the joint probability distribution of X 0 , , X T is:
P ( X 0 , , X T ) = P ( X 0 ) · t = 1 T i = 1 N P [ X i t | π ( X i t ) ]
where X i t denotes the node i in the time slice t ; π ( X i t ) denotes the parent of X i t . The probabilistic reasoning with different time slices and different node states is realized by Equation (1).
The construction of DBN includes structure learning and parameter learning: the former requires to construct B 0 and B ; the latter requires to determine the conditional probability P ( ( X i t | π ( X i t ) ) and the transition probability P ( X t | X t 1 ) . Based on the network structure and probability distribution, posterior probability of each nodes in different time slices can be obtained by reasoning, achieving probabilistic prediction of network nodes. Previous studies have summarized two common learning approaches for DBN: manual construction based on professional knowledge and automatic learning based on intelligent algorithms [33]. In this paper, we adopt a combination of subjective knowledge and measured data for DBN learning.

2.2. Information Flow

Information Flow (IF) is a real physical notion recently rigorized by Liang [24] to express causality between two variables (or events) in a quantitative way, where causality is measured by the information transfer rate from one variable’s time series to another. IF can realize the formalization and quantification in causal analysis.
Given two time series X 1 and X 2 , the maximum likelihood estimator of the IF from X 2 to X 1 is:
T 2 1 = C 11 C 12 C 2 , d 1 C 12 2 C 1 , d 1 C 11 2 C 22 C 11 C 12 2
where: C i j denotes the covariance between X i and X j ; C i , d j is determined as follows. Let X ˙ j be the finite-difference approximation of d X j / d t using the Euler forward scheme:
X ˙ j , n = X j , n + k X j , n k Δ t
with k = 1 or k = 2 (the details about how to determine k are referred to [24] and Δ t being the time step. C i , d j in Equation (2) is the covariance between X i and X ˙ j .
In order to quantify the relative importance of a detected causality, Liang [27] developed an approach to normalizing the IF:
{ Z 2 1 | T 2 1 | + | d H 1 * d t | + | d H 1 n o i s e d t | τ 2 1 = T 2 1 Z 2 1
where: H 1 * represents the phase space expansion along the X 1 direction; H 1 n o i s e represents the random effect.
Later, Bai and Liang [25] modified it to be more inclusive to earth science:
N I F 2 1 = a b s ( T 2 1 ) a b s ( T 2 1 ) + a b s ( d H 1 n o i s e d t )
where a b s represents absolute value function. N I F 2 1 , whose range is (0,1), measures the importance of the IF transmitting from X 2 to X 1 .
The larger the value, the more significant the causal relationship between X 2 and X 1 . In particular, when the significance level is 0.1, N I F 2 1 > 0.1 indicates that the causal relationship is significant. The interactions in atmospheric and oceanic system are extremely complicated. In this paper, we calculate the IF based on Equation (5) to identify the causal relationships and select the best predictors.

2.3. Model Formulation

The main objective of our research is to predict significant wave height by implementing IF theory and DBN. IF theory is focused on selecting the best set of predictive variables for DBN by causal analysis. DBN is constructed by structure learning and parameter learning for probabilistic prediction. Next we will show how IF is able to screen the best predictors, and how DBN is able to deal with uncertainty to obtain excellent prediction from network learning.
DBN and IF are combined to construct a probabilistic prediction model (DBN-IF). The model comprises three modules: predictor selection, DBN training, and probabilistic reasoning. First, the causal analysis between predictors and wave height is conducted by calculating IF to select best predictors. Then, DBN is constructed by structure learning and parameter learning on the basis of large historical data. Finally, the real-time data of predictors are input, and posterior probability distributions of wave height are obtained by probabilistic reasoning. Figure 1 summarizes the technique process and each module is elaborated as follows.
  • Predictor Selection: calculate IF between predictors and wave height to identify their causal relationships, and select the variables having significant causality with wave height as the best predictors.
  • Network Training: discretize the data of variables (predictors and wave height); mine causal relationships among variables based on historical data and adjust arcs according to professional knowledge, establishing the initial network and transition network; learn the conditional probability and the transition probability using intelligent algorithms.
  • Probabilistic Prediction: discretize the real-time data of predictors and input them as prior evidence; calculate the posterior probability distributions of wave height in different time slices for probabilistic prediction.
  • More technical details and implementation processes are explained in the next section.

3. Experiment and Analysis

In order to investigate the performance of the DBN-IF model in forecasting significant wave height, we carried out a number of experiments, in which measured data from moored ocean buoys were used. In the following prediction experiments, we present the data description, implementation details, and main results obtained with DBN-IF. Experiments were carried out with FULL-BNT v.1.0.4 Tool-Box (https://download.csdn.net/download/b08514/6942975).

3.1. Description of Data

The data obtained from buoys are a reliable data source due to less measurement errors and priori calibration. After combing the known studies, we preliminarily selected wind direction, wind speed, gust speed, dominant wave period, average wave period, direction of wave at dominant period, sea level pressure, air temperature, and sea surface temperature (a total of nine variables) as predictors. For the analysis and experiments, measured data of predictors and significant wave height (from 1 January to 31 December 2014) collected from buoys 51101 were used for model training and data (from 1 January to 31 December 2015) collected from buoys 41002 and 22103 were used for model testing (data source files from the web: https://www.ndbc.noaa.gov/).
Take buoy 51101 as an example to analyze the data features of nine variables. Table 1 summarizes the buoy information and details of collected variables from buoy 51101 in 2014. It can be seen that there are relatively obvious differences about the statistical parameters of different variables. Some of the hourly times series records of significant wave height (WVHT) are presented in Figure 2. The nonlinearity and non-stationarity of WVHT time series are notable, which are barriers of accurate prediction.

3.2. Predictor Selection

To investigate the validity of IF, correlation analysis was also conducted for predictor selection. Based on training data, we first calculated the correlation coefficient (CC) between different variables as shown in Figure 3. It can be summarized that (1) the correlations between predictors and WVHT vary greatly. It is necessary to reject poorly relative predictors. (2) The correlations between different predictors are also remarkable. Taking strongly relative predictors as input variables simultaneously could result in information redundancy, which increases the cost and running time of a prediction system, as well as degrades its generalization performance. Therefore, it is indispensable to select effective predictors.
Correlation analysis shows the degree of relevance between predictors and WVHT; however, predictors closely related to WVHT are not necessarily the cause of WVHT on the basis of IF theory. Then we calculated the IF to compare with CC in Table 2, and Figure 4 presents the importance ranking of predictors in terms of both measures.
For IF, the predictors passing the significance test include GST, WSPD, APD, and ATMP. It is the biggest with IF of “GST→WVHT” and the next is “WSPD→WVHT”, which is in accord with the wind-wave generation mechanism. By contrast, for CC, the predictors passing the significance test include APD, GST, ATMP, WSPD, and DPD. It is the biggest with CC between APD and WVHT. Although APD is related to the growth of wave, it cannot be the cause of WVHT [34]. IF can identify the causal relationships. The best predictors selected by IF have better interpretability.
To further verify the reliability of IF, we respectively took the best predictors screened by IF and CC as input variables for nonlinear regression. The results are presented in Table 3.
Compared with “APD + GST + ATMP + WSPD + DPD”, the results obtained by “GST + WSPD +APD + ATMP” have bigger R2 and F, indicating its regression equation is more significant. In this paper, we take “GST + WSPD + APD + ATMP” as best predictors.

3.3. DBN Training

The four predictors and WVHT were taken as DBN nodes. Then structure and parameter learning were carried out based on training data.

3.3.1. Data Discretization

As DBN is better at processing discrete data, the continuous data is required to be discretized to determine the number of states taken by nodes. We analyzed time series records over a period and selected reasonable interval division steps for variables. Then, variable states were denoted with consecutive numbers. Consequently, discrete data of each node were obtained with the equal interval division method. The discretization standard is shown in Table 4 and Table 5 presents a part of the discrete training data.

3.3.2. Structure Learning

We adopted the advanced greedy search (AGS) method proposed in our published paper [35] to learn network structure, including the initial network structure and transition network structure. AGS comprises two steps: global causal analysis with IF and search for optimal structure with Greedy Search algorithm (GS). Table 6 shows the details of the AGS method.
Step 1: Calculate IF between different variables and make a significant analysis of causal relationships. Figure 5 shows the result of causal analysis: as an example, N I F WSPD GST = 0.8541 , N I F GST WSPD = 0.6145 , both pass the significance test and N I F WSPD GST > N I F GST WSPD , so we could judge WSPD is the cause for GST. That is, there may be an arc “WSPD→GST” in the DBN. N I F WSPD ATMP = 0.0337 , N I F ATMP WSPD = 0.0021 , both do not pass the significance test so it is questionable to determine the arc between WSPD and ATMP. A preliminary analysis of the causal relationships is conducted to get the primitive structure. The adjacency matrix describing the relationship of WSPD, GST, APD, ATMP, and WVHT is shown in Figure 6a, and the corresponding primitive network structure is shown in Figure 6b.
Step 2: Based on the discrete training data and the primitive structure, we adopt the GS algorithm to learn the initial network structure. Then connect the node WVHT of two adjacent time slices to build the transition network. Figure 7 visualizes the DBN structure, where the interactions of variables are expressed clearly. The initial network describes the causal relationship of WSPD, GST, APD, ATMP, and WVHT at time T. The transition network expresses the causal relationship of adjacent time (T to T + 1). DBN is better in interpretability, rather than a “Black Box”. Note that the time interval between two adjacent time slices is 1 h.

3.3.3. Parameter Learning

We adopted the expectation maximization (EM) algorithm for parameter learning. First, the probability distribution of each node was initialized, including prior probability, conditional probability, and transition probability. Then, based on the inference mechanism and training data, EM algorithm was used to modify the initial probability distribution, getting the probability distribution that matches the objective data.
Figure 8 shows that the result converges after 16 iterations. As an example, the transition probability distribution is shown in Table 7. So far, the DBN-IF model for forecasting wave height is completed.

3.4. Results and Discussion

After training the model, WVHT time series were predicted using a part of testing data (from 1–16 February) collected from buoy 41002 (3920 m; 31.892° N, 74.930° W). Discrete data of predictors at current time were entered into DBN for probabilistic reasoning and predict WVHT in different lead times (1, 3, 6, 12, and 24 h). Table A1 presents the detailed posterior probability distributions of WVHT. The probabilistic predictions are able to express the probability of each state of WVHT in the next moment completely, dealing with the uncertainty of prediction.
According to posterior probability distribution, we took the median in the most probable interval as the predictand of WVHT. In order to validate the performance of the DBN-IF, comparative investigations among ANN, RF, and SVM were conducted. Figure 9 presents the measured and predicted WVHT computed by different methods during the analysis period (from 1–16 February 2015).
As shown in Figure 9, all models show lower accuracy with increasing lead time. The peaks and troughs are well predicted by four models when the lead time is short (1, 3, and 6 h). By increasing the lead time (12 and 24 h), the differences, obtained from ANN, RF, and SVM, between the measured and predicted WVHT are obvious. However, the general patterns of the variations of WVHT are still reasonably captured by the DBN-IF model.
In addition, when the lead time reaches 12 and 24 h, shifts between the measured and predicted WVHT time series by ANN, RF, and SVM can be easily noted. It is easily found that the shifts obviously increase as the lead time grows. However, the shifts are overcome with DBN-IF because the transition network in DBN could achieve the real-time correction of errors to guarantee the accumulation of effective information. Predictands for the nonlinear and non-stationary WVHT are improved by combining IF with DBN.
To investigate the performances for the above prediction models quantitatively, different error measures including the CC, root mean square error (RMSE), scatter index (SI), and Nash–Sutcliffe (NSE) were employed as evaluation criteria. We do not repeat CC and RMSE in consideration of space. SI and NSE as shown in Equations (6) and (7) are explained as follows.
SI = RMSE h m
NSE = 1 i = 1 n ( h i ^ h i ) 2 i = 1 n ( h i h m ) 2
where h i ^ is the predicted results, h i is the measured WVHT, h m is the mean value of h i , and n represents the length of test time series. The RMSE is a good measure for evaluating the performance of a model because RMSE is proportional to the observed mean. Therefore, the SI forms a good non-dimensional error measure. NSE compares the goodness of fit between the observed and predicted data. A high value of NSE (up to one) indicates high efficiency of the model [15].
Details of the CC, RMSE, SI, and NSE of WVHT predictions at the studied buoy 41002 are summarized in Table 8, Table 9, Table 10, Table 11 and Table 12. Details of the error measures are plotted in Figure 10 to show the relations between their magnitudes and the prediction lead times.
When lead time = 1 h, for all models, the CC is greater than 0.95, RMSE is lower than 0.4, and SI is lower than 0.1. The performances of four models are similar, and DBN-IF is slightly better than the other three models. When lead time reaches 3 h, error measures of DBN-IF change little, but the other three models change a great deal. The NSE of ANN, RF, and SVM decline to about 0.8. By increasing lead time (6 and 12 h), the forecasting accuracy of DBN-IF remains high (CC > 0.9, RMSE < 0.3, SI < 0.1 and NSE > 0.9). The performances of the other three models are significantly lower than DBN-IF. Note that, when lead time reaches 12 h, the prediction accuracy of DBN-IF declines dramatically, although it is still the best.
Previous studies used different input predictors to forecast WVHT. We re-achieved these models based on the same testing data (from 1 to 16 February buoy 41002). Table 13 summarizes the studies that applied different ML-based approaches and different input predictors to forecast WVHT in different lead times.
In comparison to Kamranzad [5] and Somayeh [16], it is easy to see that the predictor selection process affects the performance of the prediction model, improving its accuracy. When using ANN for prediction, the results with input predictors selected by IF are more accuracy markedly. When using RF, different input predictors have similar predictions with short lead times (3 and 6 h); however, when lead time reaches 12 and 24 h, the results with input predictors selected by IF are better. Furthermore, Figure 11 visualizes the comparison between different methods where the current study outperformed other studies and methods.
In order to complete this experiment section, we selected another buoy 22103 (130 m; (34.012° N, 127.503° W) and longer analysis period (from 1 January to 31 December 2015) for prediction experiments to test the generalization ability of the DBN-IF. Predictions of WVHT in different locations show consistent conclusions. Seen from Table 14 and Table 15, DBN-IF outperforms ANN, RF, and SVM. In addition, compared with long-term prediction, the proposed model is better at short-term prediction.

4. Conclusion

In this paper, we introduced causal IF and DBN to propose a novel probabilistic prediction model (DBN-IF) for significant wave height. In this ML-based model, we first introduced IF theory to select the best predictors by analyzing the causal relationships between wave height and other meteorological and oceanographic variables. Then, we extracted and quantitatively expressed the interactions among variables by structure and parameter learning to construct the DBN. Finally, predictions of wave height were achieved by probabilistic reasoning. Experimental results show that the performance of the DBN-IF model in predicting the hourly wave height is superior to those of primary ML-based models (ANN, SVM, and RF). The high accuracy of the DBN-IF model is attributed to the following two prominent advantages:
  • Emphasis on screening of predictors. Different from the previous prediction models, the first step of our proposed model is to analyze and screen predictors. Use state-of-the-art IF theory instead of correlation coefficient or time-delay correlation coefficient to perform causal analysis between predictors and wave height to select the best predictors;
  • Good interpretability of prediction model and ability to deal with uncertainty. Based on graph theory and probability theory, DBN can not only visualize the relationships among predictive variables but also quantitatively express the interactions with probability distributions. On the one hand, it handles the “Black Box” problem that ML algorithms such as ANN, SVM, and RF are difficult to explain. On the other hand, it deals with the uncertainty of nonlinear wave height time series through probability theory.
However, the DBN-IF model requires a large amount of data for training, so the application is limited in locations without wave buoys. Additionally, it does not perform well in long-term prediction. The future work scope involves information redundancy in predictor selection, long-term prediction of wave height, and also DBN-based prediction models with no need for much data. This work shall be extended to other major offshore activity regions across the world to enable support in offshore operation and marine control applications.

Author Contributions

M.L. and K.L. conceived and designed the experiments; M.L. performed the experiments; M.L. and K.L. analyzed the data; M.L. wrote the paper. All authors have read and agreed to the published version of the manuscript.

Funding

National Natural Science Foundation of China: 41875061. National Natural Science Foundation of China: 41775165. Natural Science Foundation of Jiangsu Province: BK20161464.

Acknowledgments

This work was supported by the National Natural Science Foundation of China (No.41875061; No.41775165) and the Natural Science Foundation of Jiangsu Province (BK20161464).

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

This section contains Table A1 supplemental to the main text.
Table A1. The posterior probability distributions of WVHT. The boldface font indicates the maximal probability.
Table A1. The posterior probability distributions of WVHT. The boldface font indicates the maximal probability.
TimeTime Slices
State 12345678910
State of Node WVHT10000000000
20002.27 × 10−256.89 × 10−212.00 × 10−2101.83 × 10−218.49 × 10−220
3002.10 × 10−152.84 × 10−137.23 × 10−134.16 × 10−119.81 × 10−133.89 × 10−132.52 × 10−127.36 × 10−11
403.24 × 10−109.62 × 10−083.54 × 10−082.43 × 10−085.68 × 10−085.77 × 10−087.12 × 10−098.05 × 10−081.41 × 10−07
504.59 × 10−052.75 × 10−052.66 × 10−051.06 × 10−052.70 × 10−063.17 × 10−067.63 × 10−066.06 × 10−067.43 × 10−06
60.0330.005820.0018880.0006990.0004690.0003050.0003210.0004590.0003550.000522
70.1800590.1572320.0096490.00490.0052050.0059880.009460.0117990.0110330.01301
80.4028090.5126110.2316840.1713050.2081380.4850440.32640.1317210.1760650.656672
90.3795150.2983650.6176470.5433210.6116940.1876130.3897110.7588890.6950460.133887
100.0046160.0259270.1365260.2729120.1744830.2995680.2220230.0971240.1174950.195903
11000.0025780.00683700.0214790.052082000
120000000000
130000000000

References

  1. Zheng, C.W.; Li, C.Y. Variation of the wave energy and significant wave height in the China sea and adjacent waters. Renew. Sustain. Energy Rev. 2015, 43, 381–387. [Google Scholar] [CrossRef]
  2. Zheng, C.W.; Wu, G.X.; Chen, X. CMIP5-based wave energy projection: Case studies of the South China Sea and the East China Sea. IEEE Access 2019, 31, 329–337. [Google Scholar] [CrossRef]
  3. Janssen, P. Progress in ocean wave forecasting. J. Comput. Phys. 2008, 227, 3572–3594. [Google Scholar] [CrossRef]
  4. Goda, Y. Revisiting Wilson’s formulas for simplified wind-wave prediction. J. Waterw. Port Coast. Ocean Eng. 2003, 129, 93–95. [Google Scholar] [CrossRef]
  5. Kamranzad, B.; Etemad-Shahidi, A.; Kazeminezhad, M.H. Wave height forecasting in Dayyer, the Persian Gulf. Ocean Eng. 2011, 38, 248–255. [Google Scholar] [CrossRef] [Green Version]
  6. Spanos, P. ARMA algorithms for ocean wave modeling. J. Energy Resour. Technol. 1983, 105, 300–309. [Google Scholar] [CrossRef]
  7. Altunkaynak, A.; Özger, M. Temporal significant wave height estimation from wind speed by perceptron Kalman filtering. Ocean Eng. 2004, 31, 1245–1255. [Google Scholar] [CrossRef]
  8. Valipour, M.; Banihabib, M.E.; Behbahani, S.M.R. Comparison of the ARMA, ARIMA, and the autoregressive artificial neural network models in forecasting the monthly inflow of Dez dam reservoir. J. Hydrol. 2013, 476, 433–441. [Google Scholar] [CrossRef]
  9. Stefanakos, C. Fuzzy time series forecasting of nonstationary wind and wave data. Ocean Eng. 2016, 121, 1–12. [Google Scholar] [CrossRef] [Green Version]
  10. Deo, M.C.; Naidu, C.S. Real time forecasting using neural networks. Ocean Eng. 1998, 26, 191–203. [Google Scholar] [CrossRef]
  11. Lee, T.L. Neural network prediction of a storm surge. Ocean Eng. 2006, 33, 483–494. [Google Scholar] [CrossRef]
  12. Ozger, M. Significant wave height forecasting using wavelet fuzzy logic approach. Ocean Eng. 2010, 37, 1443–1451. [Google Scholar] [CrossRef]
  13. Shahabi, S.; Khanjani, M.J.; Kermani, M.R.H. Significant wave height modelling using a hybrid wavelet-genetic programming approach. KSCE J. Civil Eng. 2017, 21, 1–10. [Google Scholar] [CrossRef]
  14. Mahjoobi, J.; Mosabbeb, E.A. Prediction of significant wave height using regressive support vector machines. Ocean Eng. 2009, 36, 339–347. [Google Scholar] [CrossRef]
  15. Duan, W.Y.; Han, Y.; Huang, L.M.; Zhao, B.B.; Wang, M.H. A hybrid EMD-SVR model for the short-term prediction of significant wave height. Ocean Eng. 2016, 124, 54–73. [Google Scholar] [CrossRef]
  16. Mafi, S.; Amirinia, G. Forecasting hurricane wave height in Gulf of Mexico using soft computing methods. Ocean Eng. 2017, 146, 352–362. [Google Scholar] [CrossRef]
  17. Hatalis, K. Multi-step forecasting of wave power using a nonlinear recurrent neural network. In Proceedings of the IEEE Pes General Meeting, National Harbor, MD, USA, 27–31 July 2014. [Google Scholar]
  18. Gasse, M.; Millioz, F.; Roux, E.; Garcia, D.; Liebgott, H.; Friboulet, D. High-quality plane wave compounding using convolutional neural networks. IEEE Trans. Ultrason. Ferroelectr. Freq. Control 2017, 64, 1637–1639. [Google Scholar] [CrossRef] [Green Version]
  19. Cornejo-Bueno, L.; Nieto-Borge, J.C.; García-Díaz, P.; Rodríguez, G.; Salcedo-Sanz, S. Significant wave height and energy flux prediction for marine energy applications: A grouping genetic algorithm—Extreme learning machine approach. Renew. Energy 2016, 97, 380–389. [Google Scholar] [CrossRef]
  20. Weston, J.; Mukherjee, S.; Chapelle, O.; Pontil, M.; Vapnik, V. Feature selection for SVMs. Adv. Neural Inf. Process. Syst. 2000, 11, 526–532. [Google Scholar]
  21. Blum, A.L.; Langrey, P. Selection of relevant features and examples in machine learning. Artif. Intell. 1997, 97, 245–271. [Google Scholar] [CrossRef] [Green Version]
  22. Salcedo-Sanz, S.; Prado-Cumplido, M.D.; Pérez-Cruz, F.; Bousoño-Calzón, C. Feature selection via genetic optimization. In International Conference on Artificial Neural Networks; Springer: Berlin/Heidelberg, Germany, 2002; pp. 547–552. [Google Scholar]
  23. Landman, W.A.; Mason, S.J. Forecasts of near-global sea surface temperatures using canonical correlation analysis. J. Clim. 2001, 14, 3819–3833. [Google Scholar] [CrossRef]
  24. Juneng, L.; Tangang, F.T. Level and source of predictability of seasonal rainfall anomalies in Malaysia using canonical correlation analysis. Int. J. Climatol. 2008, 28, 1255–1267. [Google Scholar] [CrossRef]
  25. San Liang, X. Unraveling the cause-effect relation between time series. Phys. Rev. E 2014, 90, 052150. [Google Scholar] [CrossRef] [Green Version]
  26. Li, M.; Liu, K.F. Causality-based attribute weighting via information flow and genetic algorithm for naive bayes classifier. IEEE Access 2019, 7, 150630–150641. [Google Scholar] [CrossRef]
  27. Schaefer, H. Entering the black box of neural networks. Methods Inf. Med. 2003, 42, 287–296. [Google Scholar]
  28. Liang, S.X. Normalizing the causality between time series. Phys. Rev. E 2015, 92, 022126. [Google Scholar] [CrossRef] [Green Version]
  29. Sun, J.; Sun, J. A dynamic Bayesian network model for real-time crash prediction using traffic speed conditions data. Transp. Res. Part C Emerging Technol. 2015, 54, 176–186. [Google Scholar] [CrossRef]
  30. Xiao, Q.; Chaoqin, C.; Li, Z. Time series prediction using dynamic Bayesian network. Optik 2017, 135, 98–103. [Google Scholar] [CrossRef]
  31. Li, M.; Liu, K.F. Application of intelligent dynamic Bayesian network with wavelet analysis for probabilistic prediction of storm track intensity index. Atmosphere 2018, 9, 224. [Google Scholar] [CrossRef] [Green Version]
  32. Pearl, J. From Bayesian networks to causal networks. In Mathematical Models for Handling Partial Knowledge in Artificial Intelligence; Springer: Berlin, Germany, 1995; pp. 157–182. [Google Scholar]
  33. Cussens, J. Bayesian network learning with cutting planes. arXiv 2012, arXiv:1202.3713v1. Available online: https://arxiv.org/abs/1202.3713 (accessed on 3 June 2012).
  34. Mao, J.Y.; Xie, H.R. Review of Wind-Wave Generation Mechanisms. Adv. Mar. Sci. 2019, 37, 533–542. [Google Scholar]
  35. Li, M.; Zhang, R.; Hong, M.; Bai, C. Improved structural learning algorithm of Bayesian network based on information flow. Syst. Eng. Electron. 2018, 465, 202–207. [Google Scholar]
Figure 1. Technical flowchart of dynamic Bayesian network and information flow (DBN-IF model).
Figure 1. Technical flowchart of dynamic Bayesian network and information flow (DBN-IF model).
Water 12 02075 g001
Figure 2. Time series records of WVHT in buoy 51101.
Figure 2. Time series records of WVHT in buoy 51101.
Water 12 02075 g002
Figure 3. Correlation coefficients (CCs) between different variables.
Figure 3. Correlation coefficients (CCs) between different variables.
Water 12 02075 g003
Figure 4. Importance ranking of predictors.
Figure 4. Importance ranking of predictors.
Water 12 02075 g004
Figure 5. IF between different variables.
Figure 5. IF between different variables.
Water 12 02075 g005
Figure 6. Results of causal analysis; (a) adjacency matrix, (b) primitive structure.
Figure 6. Results of causal analysis; (a) adjacency matrix, (b) primitive structure.
Water 12 02075 g006
Figure 7. DBN structure of WVHT prediction.
Figure 7. DBN structure of WVHT prediction.
Water 12 02075 g007
Figure 8. Iterative curve of expectation maximization (EM) algorithm.
Figure 8. Iterative curve of expectation maximization (EM) algorithm.
Water 12 02075 g008
Figure 9. Observed and predicted WVHT during the study period with four methods and different lead times: (a) 1 h, (b) 3 h, (c) 6 h, (d) 12 h, (e) 24 h.
Figure 9. Observed and predicted WVHT during the study period with four methods and different lead times: (a) 1 h, (b) 3 h, (c) 6 h, (d) 12 h, (e) 24 h.
Water 12 02075 g009aWater 12 02075 g009b
Figure 10. Error measures with four methods and different lead time: (a) CC, (b) RMSE, (c) SI, (d) NSE.
Figure 10. Error measures with four methods and different lead time: (a) CC, (b) RMSE, (c) SI, (d) NSE.
Water 12 02075 g010aWater 12 02075 g010b
Figure 11. The CC and SI of different methods: (a) CC, (b) SI
Figure 11. The CC and SI of different methods: (a) CC, (b) SI
Water 12 02075 g011
Table 1. Information and summary of measured data from buoy 51101.
Table 1. Information and summary of measured data from buoy 51101.
Buoy: 51101Depth: 4849 m (24.361° N, 162.075° W)
VariablesMaximumAverageMinimum
Wind direction (WDIR, °)360117.9471
Wind speed (WSPD, m/s)15.46.9080
Gust speed (GST, m/s)20.48.5060.2
Dominant wave period (DPD, s)21.0510.9524.76
Average wave period (APD, s)13.927.0694.96
Direction of wave at dominant period (MVD, °)360173.4081
Sea level pressure (PRES, Pa)10261016.106998.8
Air temperature (ATMP, ℃)29.223.99816.6
Sea surface temperature (WTMP, ℃)30.225.08220.8
Significant wave height (WVHT, m)6.962.3590.83
Table 2. Comparison of IF and CC. Statistically significant values at the 90% level are highlighted.
Table 2. Comparison of IF and CC. Statistically significant values at the 90% level are highlighted.
Variable WDIRWSPDGSTDPDAPDMVDPRESATMPWTMP
WVHTIF0.03230.24880.30250.09770.17070.07940.01460.11170.0622
CC0.22630.34720.41060.32870.62090.2928−0.0726−0.3543−0.2677
Table 3. Results of the nonlinear regression model.
Table 3. Results of the nonlinear regression model.
The Best PredictorsDeterminate Coefficient R2F
IF: GST + WSPD + APD + ATMP0.79344671.1564
CC: APD + GST + ATMP + WSPD + DPD0.70612989.4381
Table 4. Discretization standard of variables.
Table 4. Discretization standard of variables.
VariablesWSPDGSTAPDATMPWVHT
Interval step0.5 m/s0.5 m/s0.5 s0.5 ℃0.5 m
State number1–321–411–191–261–13
Table 5. Discrete training data.
Table 5. Discrete training data.
Sample IDWSPDGSTAPDATMPWVHT
17182177
28202387
38222786
48192397
..................
872134478
872247886
Table 6. Algorithm scheme of AGS.
Table 6. Algorithm scheme of AGS.
Input:Training Data of Predictors and WVHT
Output:Optimal DBN structure
Initialization:Preprocess training data and set significant level
Causal analysis:Calculate the IF between each two variables and analyze the causal relationships
Primitive structure:Determine the arcs based on IF to obtain the primitive structure
Structure search:Adopt GS algorithm to search for the optimal structure
Table 7. The transition probability distribution of node WVHT.
Table 7. The transition probability distribution of node WVHT.
(T + 1)State of Node WVHT
(T) 12345678910111213
State of node WVHT10.7740.22600000000000
20.0120.8730.1150000000000
300.0560.8550.089000000000
4000.1010.7910.1080.0010000000
50000.2030.6630.1320.002000000
60000.0030.2410.5440.1970.01600000
7000000.2640.5360.1720.0240.004000
8000000.0270.3290.4520.1440.0410.00700
90000000.0360.2980.4400.2020.02400
1000000000.1190.2880.3730.1860.0340
11000000000.0970.2900.3870.1290.097
120000000000.3640.3640.1820.091
1300000000000.1670.5000.333
Table 8. The CC, root mean square error (RMSE), scatter index (SI), Nash–Sutcliffe (NSE) of WVHT predictions with lead time = 1 h.
Table 8. The CC, root mean square error (RMSE), scatter index (SI), Nash–Sutcliffe (NSE) of WVHT predictions with lead time = 1 h.
Criteria DBN-IFANNRFSVM
CC0.99270.95690.95850.9587
RMSE0.13180.29480.29740.3641
SI0.03410.07620.07690.0941
NSE0.98290.91490.91340.8703
Table 9. The CC, RMSE, SI, NSE of WVHT predictions with lead time = 3 h.
Table 9. The CC, RMSE, SI, NSE of WVHT predictions with lead time = 3 h.
Criteria DBN-IFANNRFSVM
CC0.98390.92230.92510.9102
RMSE0.18950.39870.42030.4759
SI0.04910.10310.10870.1231
NSE0.96480.84440.82720.7784
Table 10. The CC, RMSE, SI, NSE of WVHT predictions with lead time = 6 h.
Table 10. The CC, RMSE, SI, NSE of WVHT predictions with lead time = 6 h.
CriteriaDBN-IFANNRFSVM
CC0.93340.88150.90240.8562
RMSE0.24280.52720.60240.5935
SI0.06280.13630.15570.1534
NSE0.94230.72810.75240.6553
Table 11. The CC, RMSE, SI, NSE of WVHT predictions with lead time = 12 h.
Table 11. The CC, RMSE, SI, NSE of WVHT predictions with lead time = 12 h.
CriteriaDBN-IFANNRFSVM
CC0.91420.84230.89810.8284
RMSE0.28140.63560.71320.7346
SI0.07270.16430.18440.1899
NSE0.92250.60470.50230.4721
Table 12. The CC, RMSE, SI, NSE of WVHT predictions with lead time = 24 h.
Table 12. The CC, RMSE, SI, NSE of WVHT predictions with lead time = 24 h.
CriteriaDBN-IFANNRFSVM
CC0.75230.66010.69320.6392
RMSE0.68560.74270.85410.7965
SI0.16760.19130.22080.2059
NSE0.52480.46410.28620.3792
Table 13. The predictions of different methods and different input predictors.
Table 13. The predictions of different methods and different input predictors.
ReferenceLead TimeCCSIModelPredictors
Ozger
[12]
3
6
12
0.960
0.899
0.800
0.135
0.211
0.289
Fuzzy-LogicWind speed
Significant wave height
Ozger
[12]
3
6
12
0.925
0.842
0.690
0.184
0.260
0.349
ARMAXWind speed
Significant wave height
Kamranzad
[5]
3
6
12
24
0.907
0.820
0.663
0.379
0.378
0.511
0.663
0.845
ANNFriction velocity
Wind direction
Significant wave height
Wave direction
Duan
[15]
1
3
6
0.986
0.954
0.855
0.014
0.044
0.086
WD-SVMSignificant wave height
Somayeh
[16]
3
6
12
24
0.981
0.948
0.880
0.635
0.270
0.418
0.619
0.945
RFWind speed
Significant wave height
Wave period
Pressure
Air temperature
Water temperature
Dew point
Table 14. The CC, RMSE, SI, NSE of WVHT predictions with lead time = 6 h.
Table 14. The CC, RMSE, SI, NSE of WVHT predictions with lead time = 6 h.
CriteriaDBN-IFANNRFSVM
CC0.91140.86720.88010.8542
RMSE0.21650.39470.38510.4131
SI0.14360.17580.17210.2164
NSE0.91420.75610.78730.6983
Table 15. The CC, RMSE, SI, NSE of WVHT predictions with lead time = 12 h.
Table 15. The CC, RMSE, SI, NSE of WVHT predictions with lead time = 12 h.
CriteriaDBN-IFANNRFSVM
CC0.87920.82110.85360.8147
RMSE0.34890.43610.42530.5326
SI0.17350.20830.19690.2945
NSE0.88030.72060.73670.6714

Share and Cite

MDPI and ACS Style

Li, M.; Liu, K. Probabilistic Prediction of Significant Wave Height Using Dynamic Bayesian Network and Information Flow. Water 2020, 12, 2075. https://doi.org/10.3390/w12082075

AMA Style

Li M, Liu K. Probabilistic Prediction of Significant Wave Height Using Dynamic Bayesian Network and Information Flow. Water. 2020; 12(8):2075. https://doi.org/10.3390/w12082075

Chicago/Turabian Style

Li, Ming, and Kefeng Liu. 2020. "Probabilistic Prediction of Significant Wave Height Using Dynamic Bayesian Network and Information Flow" Water 12, no. 8: 2075. https://doi.org/10.3390/w12082075

APA Style

Li, M., & Liu, K. (2020). Probabilistic Prediction of Significant Wave Height Using Dynamic Bayesian Network and Information Flow. Water, 12(8), 2075. https://doi.org/10.3390/w12082075

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop