Next Article in Journal
Ice-Water-Gas Interaction during Icebreaking by an Airgun Bubble
Previous Article in Journal
Submarine Small-Scale Features of Cyclic Steps in the Penghu Canyon: Implications for the Migration of Canyon
Previous Article in Special Issue
Towards the Evaluation of Date Time Features in a Ship Route Prediction Model
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

On the Non-Gaussianity of Sea Surface Elevations

by
Alicia Nieto-Reyes
Department of Mathematics, Statistics and Computer Science, Universidad de Cantabria, 39005 Santander, Spain
Current address: Faculty of Science, Universidad de Cantabria, Avd. Los Castros s/n, 39005 Santander, Spain.
J. Mar. Sci. Eng. 2022, 10(9), 1303; https://doi.org/10.3390/jmse10091303
Submission received: 10 August 2022 / Revised: 6 September 2022 / Accepted: 9 September 2022 / Published: 15 September 2022
(This article belongs to the Special Issue Challenges and Perspectives for Marine Data Science)

Abstract

:
The sea surface elevations are generally stated as non-Gaussian processes in the current literature, being considered Gaussian for short periods of relatively low wave heights. The objective here is to study the evolution of the distribution of the sea surface elevation from Gaussian to non-Gaussian as the period of time in which the associated time series is recorded increases. To do this, an empirical study based on the measurements of the buoys in the US coast downloaded at a casual day is performed. This study results in rejecting the null hypothesis of Gaussianity in below 25% of the cases for short periods of time and in over 95% of the cases for long periods of time. The analysis pursued relates to a recent one by the author in which the heights of sea waves are proved to be non-Gaussian. It is similar in that the Gaussianity of the process is studied as a whole and not just of its one-dimensional marginal, as it is common in the literature. It differs, however, in that the analysis of the sea surface elevations is harder from a statistical point of view, as the one-dimensional marginals can be Gaussian, which is observed throughout the study and in that a longitudinal study is performed here.

1. Introduction

Much attention in the literature is dedicated to the study of the sea surface height [1,2,3], a function of the sea surface elevation which is generally obtained by making use of the zero-up or down crossing methodology. The sea surface height is relevant because of design and analysis of off-shore structures [4] and ships [5] and, therefore, the literature is large in terms of studying its distribution [6,7,8,9,10]. The sea surface height has been modeled, for instance, as
  • a Rayleigh distribution [11,12],
  • a, more general, Weibull distribution [13],
  • a Forristall distribution [1],
  • a Naess distribution [14],
  • a Boccotti distribution [15],
  • a Klopman distribution [16],
  • a van Vledder distribution [17],
  • a Battjes–Groenendijk distribution [18],
  • a Mendez distribution [19], or
  • a LoWiSh II distribution [20].
In [21] it is experimentally proved that the sea heights do not follow a Gaussian distribution.
This study is dedicated, however, to the analysis of the sea surface elevation, as opposed to the sea heights. The measurements of sea surface elevation are obtained by buoys throughout the sea, which are later preprocessed to obtain the sea heights. Sea surface elevations have been studied from a statistical point of view, studying its distribution [22], the skewness of the distribution [23], and the modellization of the process [24,25]. Consideration has also being given to how to measure [26] and record the data [27]. From an applied perspective, the literature contains works on sea surface elevations to, for instance, ship motion forecasting [28] and the development of sea surface elevation maps [29]. Furthermore, ref. [30] studies that for certain wave groups the periods of stationarity are short and [31] that the sea surface elevation is only Gaussian in short periods of relatively low wave heights. For many sea states the process is nonlinear due to its second order structure [32], and even to much higher order structures for high waves [33].
This work goes beyond the existing literature and it is dedicated to empirically study how the Gaussianity of the distribution of the sea surface elevation evolves along an increase of the time period of the associated time series. For that, the measurements of 59 buoys along the US coast are studied. It is obtained that over 50 % of the cases are non-Gaussian with a length time period corresponding to 2 × 10 4 observations. This results increases to over 95 % of the cases if the length time period increases to correspond to 10 5 observations. From a statistical point of view, the importance of studying the sea surface elevation is high and lies in that it is a raw measurement. While experimental studies show that the distribution of sea heights are clearly non-Gaussian, having a non-Gaussian one dimensional marginal, the non-Gaussianity of the sea surface elevation is not so obvious; which makes the problem more interesting.
In fact, in proving the non-Gaussianity, it is here demonstrated that some cases that common hypothesis tests consider as Gaussian correspond to non-Gaussian processes with Gaussian one-dimensional marginals. It is worth saying that out of a 22.03 % rejection rate for a length time period corresponding to 10 3 observations, a 6.78 % correspond to non-Gaussian processes with Gaussian one-dimensional marginals, the Guassianity of which would not have been rejected but for the use of the methodology applied here. This methodology is based on the random projection test [34], a goodness of fit test that checks the Gaussianity of the process as a whole and not just of a finite order marginal, as other established test in the literature do; see, for instance [35,36]. The obtained findings are important due to the cases that the literature considered as Gaussian are numerous. These cases include very large waves and, in fact, according to [37], very large waves might be much more frequent than commonly assumed.
The rest of the manuscript includes: The description of the studied dataset in Section 2 and of the applied methodology in Section 3. The results of the analysis are described in Section 4. The analysis makes use of the nortsTest package of the R software. Section 5 contains the obtained conclusions.

2. Datasets

The Coastal Data Information Program (https://cdip.ucsd.edu (accessed on 20 August 2022)) contains surface elevations measured by buoys that are along the cost of the US. For the present study, these measurement where downloaded on the 20 August 2022 from the web page https://thredds.cdip.ucsd.edu/thredds/catalog/cdip/realtime/catalog.html (accessed on 20 August 2022). In particular, the variable downloaded is that named xyzZDisplacement. The set of data used here differs from that in [21] and it has not being used in the literature before.
There are a total of 59 datasets, each corresponding to the collected time series of a station (buoy). Each buoy has an identification number, which is displayed in the first and fourth columns of Table 1. The latitude and longitude coordinates of these buoys are displayed in Figure 1 and, rounded to 2 decimal values, also included in the table. The top plot of Figure 1 contains a world map where the coordinates have been drawn. The bottom plot is a zoom of the top one that contains only the coordinate of the buoys that are close to the US mainland coast.
In Table A1, in Appendix A, it can be observed the time period of the time series associated to each of the 59 buoys, under the columns designated with the names Start Time (columns 2 and 3) and End Time (columns 4 and 5). The shortest time period is depicted in bold, which is that of station 244. This is the buoy with the shortest time period because the length of the associated time series is the smallest among that of the 59 time series. This information is presented in Table A2, also in Appendix A, under the columns designated with the name length.
Each of the 59 datasets under study is restricted to a time series of length 5 × 10 5 . The first 5 × 10 5 are the ones selected. The datasets consist of raw data, which contains unknown, unobserved, values. After taking out those from the first 5 × 10 5 selected, the length of the time series associated to each of the 59 buoys can also be observed from Table A2, in Appendix A. It is designated by the name studied. It can be observed from Table A2, in Appendix A, that there is one buoy for which the whole 5 × 10 5 first elements have been observed: buoy 249. There, it can also be observed that there are two buoys, 188 and 202, that have the minimum value under the label studied.
In the left panel of Figure 2, it is displayed part of the recorded data for buoy 433. The displayed data results from restricting the 1,622,186 observations stored for buoy 433 and taking the ones corresponding to the first 10 5 time points. As it is obvious from the plot, the first 10 5 time points contain unobserved elements. Indeed, 410,144 observations have been made out of the 5 × 10 5 selected for the study (see Table A2, in Appendix A). From the left panel of the figure, it is also observable that the unobserved data splits the time series in ten parts. The right panel of Figure 2 is a zoom of the left panel containing solely the part to the left of the time series.
Greenwich mean time (GMT) in the format year-month-day and hours:minutes:seconds is used for the x-axis of the plots in the figure.

3. Methodology

Given X t a real valued random variable for each t Z ,
X : = { X t } t Z
is a stochastic process [38]. Most common hypotheses on stochastic processes are those of stationarity [39] and Gaussianity [40]. X is stationary if
  • E [ X t ] = E [ X t + k ] for all k , t Z , where E denotes the expectation function,
  • Cov ( X t , X k ) = Cov ( X t k , X 0 ) for all k , t Z , where Cov denotes the covariance function and
  • Var [ X t ] < for all t Z , where Var denotes the variance.
X is Gaussian if
( X t 1 , , X t n )   is a Gaussian random vector for all   n N .
It occurs that a stationary Gaussian process is strictly stationarity. X is strictly stationary if
( X t 1 , , X t n )   and   ( X t 1 + k , , X t n + k )
are equally distributed for all n N and k , t 1 , , t n Z . Consequently, given a stationary process X , it is Gaussian if
( X t , , X t )   is a Gaussian random vector for all   t N .

3.1. Tests for Stationarity

This manuscript is about testing the Guassianity of stocastic processes. Typically, those tests assume that the process is stationary. Thus, this assumption has to be previously checked. For that, the most common tests in the literature are
1.
Ljung-Box test [41],
2.
Augmented Dickey-Fuller test [42],
3.
Phillips-Perron test [43] and
4.
kpps test [44].
For the first three tests, the tests can be simplified as contrasting the null hypothesis
H 0 , 1 : X   is non stationary
against the alternative
H a , 1 : X   is stationary
while the kpps test results in the null hypothesis
H 0 , 2 : X   is stationary
against the alternative
H a , 2 : X   is non stationary .
The hypotheses are tested in different ways. For instance, Ljung-Box test makes use of the autocorrelation function, which, at lag k for a stationary process is
Cov ( X t , X t + k ) Var ( X t ) .
This is observable from its statistic:
n ( n + 2 ) k = 1 h ρ ^ k 2 n k ,
where ρ ^ k denotes the sample autocorrelation at lag k and n the sample size. Note that it depends on a constant h .

3.2. Tests for Gaussianity

Most tests for Gaussianity of stochastic processes assume the process is stationary and test whether a finite marginal distribution of the process is Gaussian, generally, the one-dimensional marginal. That is, instead of testing whether (1) is satisfied, these tests contrast the null hypothesis
H 0 , 3 : X t   is a Gaussian random variable
against the alternative
H a , 3 : X t   is not a Gaussian random variable
by checking whether X t is a Gaussian random variable. Let us reflect that, because of the stationarity, the distribution of X t is the same for all t Z ; that is, it is independent of t .
Common tests to check the Gaussianity of a real valued random variable require a sample of independent and identically distributed random variables [45]. As this work deals with stochastic processes, the independence assumption is not verified. However, there are also many tests for this situation. Here, it is made use of the Epps test [35], which checks that the characteristic function of the one-dimensional distribution of the process is that of a Gaussian distribution, and of the Lobato and Velasco test [36], which checks that the third and fourth order moments of the one-dimensional distribution of the process are those of a Gaussian distribution.
If the null hypothesis H 0 , 3 is rejected, with the above mentioned tests, the null hypothesis
H 0 , 4 : X   is a Gaussian process
is rejected against the alternative
H a , 4 : X   is not a Gaussian process .
However, it may occur that
X t   is a Gaussian random variable
while
X   is not a Gaussian process .
The above mentioned tests are at nominal level again this type of alternatives. For this, it is used here the random projection test [34], which test the Gaussianity of the whole distribution of the process and not just of a finite dimensional marginal. For elaboration on it, see Section 3.2.

Random Projection Test

The random projection test was introduced in [34] as a tool to test the Gaussianity of stationary processes that is able to reject the null hypothesis of Gaussianity (5) against alternatives with Gaussian finite-dimensional marginals. The procedure is based on a result in [46] that implies that if
{ X j } j t , d ,
with d drawn from a Dirichlet distribution [47], is Gaussian, then
{ X j } j t
is Gaussian. Note that due to the stationarity assumption, the Gaussianity of { X j } j t is equivalent to (1). In what follows, the procedure is explained in detail.
Let
λ 1 , λ 2 > 0
be two parameters. Making use of the following stick-breaking procedure, a Dirichlet distribution is considered:
1.
Let
β ( λ 1 , λ 2 )
denote a beta distribution with parameters λ 1 , λ 2 .
2.
Let d 0 be drawn from the distribution β ( λ 1 , λ 2 ) . Note that
d 0 [ 0 , 1 ] .
3.
For any k N , the natural numbers, let d k be the result of multiplying
1 i = 0 k 1 d i
and an element drawn independently from he distribution β ( λ 1 , λ 2 ) . Note that
d k [ 0 , 1 i = 0 k 1 d i ] .
Let X be a stationary process. The associated projected process based on { d k } k N is
Y : = { Y t } t Z
with
Y t : = i = 0 d i X t i .
Then, making use of this randomly projected process, it suffices to apply to it a test for the null hypothesis of Gaussianity (4).
The selection of the parameters λ 1 , λ 2 is important. It is explained in [34] that values such as
λ 1 = 100   and   λ 2 = 1
result in an projected process Y similar to X . However, values such as
λ 1 = 2   and   λ 2 = 7
result in projected processes different from X while providing an effective method.

3.3. False Discovery Rate

When multiple tests are performed, the multiplicity has to be taken into account. For that it is used here the false discovery rate [48]. The false discovery rate aims at controlling the expected proportion of falsely rejected hypothesis. It was first introduced in [49] to take into account the multiplicity of independent tests. In [48] it was established that the definition in [49] remains valid for certain types of dependency. However, for general dependent cases [48] has to be applied.

4. Results of the Analysis

This section analyzes whether each of the 59 datasets provided in Section 2, one per buoy, is drawn from a Gaussian process as the length of the time series increases. For that, the tests described in Section 3.2 are used here. As commented in Section 2, our datasets have been restricted to 5 × 10 5 observations that include missing data; with the amount of non-missing observations being recorded in Table A2 of Appendix A under the label studied. The length of the time series along which this longitudinal study is performed, makes use of the following values
L e n g t h { 10 3 , 10 4 , 2 × 10 4 , 4 × 10 4 , 6 × 10 4 , 8 × 10 4 , 10 5 }
and is computed by selecting that amount of non-missing observations from the buoys that have them.
To relate the above length quantities to time, it is important to take into account that the time in seconds UTC (Coordinated universal time) associated to an observation t N is computed as
T t : = T 0 + t 1 r d
where T 0 is the time at which the recording starts, r is the sample rate and d is the filter delay. r takes value 1.28 and d 133.3 but for buoys in
B : = { 132 , 142 , 171 , 194 , 204   and   244 }
that r takes value 2.56 and d value 130. Thus, the amount of recorded time, T t T 0 , for t = 10 3 observations corresponds to 647.1688 s in UTC time in general and to 260.2344 for the buoys in B . This results in GMT time in 10.79 min and 4.34 min, respectively. Table 2 displays these results, rounded to two decimal values, for each t equal to the length in (6). The table also includes the translation of seconds UTC into GMT time. Thus, it is observable from the table that the analysis we pursue here is of time series that have been recorded for a period that varies between 4 min and 21 h. In fact, in general, for a length of 10 3 the periods are close to 11 min while the are just abobe 2 h for a length of 10 4 and of more than 4 h for a length of 2 × 10 4 .
Let us select, for instance, a time period of 2.13 h in general and of 1.05 h for the buoys in B . Then, for each of the buoys (but for buoys 188, 189, 202 and 204), the first non-missing observations corresponding to those time periods would be selected. Buoys 188, 189, 202 and 204 are not studied for these time periods because, as reported in Table A2 under the label studied, their non-missing observations are less than 10 4 . Thus, 55 out of the 59 buoys are studied in this scenario. This is not the case, however, when we study buoys recorded for 4.34 min as the time series duration associated to all the buoys is larger than that time period.
First, despite the time series associated to each buoy consists of non iid (independent identically distributed) drawn observations, in Figure 3 it is plotted the histogram and kernel density estimated curve associated to each of the 55 time series for a time period of 2.13 h in general and of 1.05 h for the buoys in B . For each histogram 70 cells have been used; but for buoy 241 that is plotted based on 40 cells. A Gaussian kernel is used for the density estimations with the bandwidth resulting of applying least squares cross validation [50,51]. The obtained bandwidths are reported in Table A3, in Appendix A. These plots are done to display descriptive information on the datasets. Note that the null hypothesis of Gaussianity in (5) cannot be rejected by just inspecting the plots. This is due to the fact that when testing the null hypothesis of Gaussianity it is assumed X is a Gaussian process and the objective is to reject such assumption by the evidence provided by the data. This is done at certain significance level. Here, it is used 0.05. By inspection of the plots in Figure 3, one cannot guarantee whether there is enough evidence for such a rejection as it cannot be quantified.
As the tests for testing the null hypothesis of Gaussianity reported in Section 3.2 require the stationarity assumption for the process, making use of the tests provided in Section 3.1, it is first checked whether each of the datasets, when making used of each of the time periods in Table 2, is drawn from a stationary process. The results obtained from checking the stationarity are displayed in Table 3. Only one result is provided by test and time period because the maximum of the obtained p-values is reported for the Augmented Dickey-Fuller, Phillips-Perron and Ljung-Box tests while the minimum is reported for the kpps test. Note that in the first three tests the null hypothesis of non-stationarity is tested, as in (2), and in the fourth it is the null hypothesis of stationarity, as in (3). Thus, p-values smaller than 0.01 are obtained for the Augmented Dickey-Fuller test and the Phillips-Perron test and larger than 0.1 for the kpps test.
p-Values that are close to zero are obtained for the Ljung-Box test, but for time periods of 17.32 h and 21.66 h in general, and of 8.64 h and 10.81 h for buoys in B . For a time period of 17.32 h in general, and of 8.64 h for buoys in B , the minimum p-value is 0.18 in the Ljung-Box test and is obtained for buoy 241. The other p-values of that test for that time period are technically zero. It is also for buoy 241 that the Ljung-Box test gives a p-value of 0.36 when the time period is of 21.66 h in general, and of 10.81 h for buoys in B . For that time period and test, the p-values associated to the rest of buoys are smaller than 0.01. When considering low time periods, the ultimate case being 4.34 min, all the tests provides the aimed result. Thus, it can be assumed (with the possible exception of buoy 241 for time periods of 17.32 h and 21.66 h in general and of 8.64 h and 10.81 h for buoys in B) that the studied datasets are drawn from stationary processes, and check their Gaussianity under the mentioned assumption.
In order to study the Gaussianity of the datasets under study, it is first analyzed the one-dimensional marginal distribution of the process. This is because a rejection of the null hypothesis (4) implies the sought rejection of the whole distribution of the process, in (5). For analyzing the one-dimensional marginal distribution, it is made use of the Epps and Lobato and Velasco tests commented in Section 3. The results are displayed in Table 4 for the particular case of a time period of 2.13 h in general and of 1.05 h for the buoys in B. There, for each dataset, associated to a buoy (columns 1 and 5), it can be observed the p-values resulting from applying the Epps test (columns 2 and 6) and the Lobato and Velasco test (columns 3 and 7). As multiplicity has to be taken into account, columns 4 and 8 display the FDR values. It can be observed from the table that 20 of the 55 FDR values are smaller than 0.05. They have been highlighted in bold. If the less conservative FDR introduced in [49] had been used, the number of rejections would have increased. If multiplicity had not been taken into account at all and the null hypothesis (4) were rejected when the minimum of the two p-values was smaller than 0.05, the number of rejections would have increased to 22.
To better illustrate the findings, the results of Table 4 are summarized in the left plot of Figure 4. The x-axis represents the buoy’s identification number while the y-axis displays the obtained FDR for dependent tests. A grey line at y = 0.05 is drawn to show what buoys have a FDR above or below that value, which result in a rejection of the null hypothesis (4). It can observe that there are three FDR that are just above 0.05. They correspond to buoys 028, 215 and 230.
In what follows it is pursued a further study in the 35 buoys for which there is yet no evidence to reject the null hypothesis of Gaussianity, displayed in (5). This further study consists in applying the random projection test based on the Epps and Lobato and Velasco tests with parameters ( 100 , 1 ) and ( 2 , 7 ) . The results of applying the random projection test are reported in Table 5. There it can be observed that the random projection test is able to reject the null hypothesis of Gaussianity in 2 out of the 35 buoys, which results in a total of 22 rejections out of 55 (the 40%). The FDR values that result in a rejection are highlighted in bold.
The results in Table 5 have been summarized in the right plot of Figure 4. There, the FDR values larger and smaller than 0.05 can be clearly observed; and that there is a p-value just above 0.05, the one corresponding to buoy 230.
This procedure that we have exemplified through the use of a time period of 2.13 h in general and of 1.05 h for the buoys in B , has been performed for all the time periods displayed in Table 2. This has resulted in a rejection rate displayed in Table 6 with the label withprojection. Thus, if it is used a time period of 4.34 min for the buoys in B and of 10.79 min in general, the Gaussianity of 22.03% of the processes is rejected. This rate increases to 58.49 % and 82.35 % for time periods of 2.13 h and 4.30 h in general and of 1.05 h and 2.13 h for buoys in B , respectively. It reaches 96.08 % for a time period of 21.66 h in general and 10.81 h for buoys in B .
If the random projection was not applied, that is, if only the Epps and Lovato and Velasco tests were applied, the rejection rates would have been just below for time periods of 10.79 min, 30.32 min, 2.13 h, 12.98 h and 17.32 h in general and of 4.34 min, 14.10 min, 1.05 h, 6.47 h and 8.64 h for buoys in B . The corresponding rejection rates are in Table 6 with the label no projection. Meanwhile, if the multiplicity would have not been taken into account altogether and the minimum p-value would have been selected, the rejection rates would also have been similar, just a bit over as the time period decreases, separately in general and in B . These values are displayed in Table 6 with the label minimum. To better illustrate all these rejections, we have plotted them in Figure 5. There, the rejections rates of the proposed procedure are plotted with a back plus symbol, those obtained without using the random procedure are plotted with green triangles and the ones where the minimum of the p-values is computed are plotted with blue circles. It can observed from the figure that the three values get closer as the time period increases.

5. Conclusions

For the analysis, the sea surface elevations provided by the measurements of the buoys along the US coast have been analyzed along different time periods. It has been obtained in the analysis section that the rejection rates increase when the time periods increase. Thus, while less than 25 % of the studied processes can be considered as non-Gaussian when the time period consists of less than 11 min, it reaches over the 95 % of the cases when the time period is larger than 21 h in general and of 10 h for the buoys in B . Thus, it is clear the non-Gaussianity of the sea surface elevation and that only enough information (in terms of the time period) is need for its rejection. If we had only used non-randomized tests, as it is common in the literature, the obtained rejection rates would have been a bit lower, with the larger differences being obtained as the time period decreases. For the obtained results, a more complex and new goodness of fit test has been required. This is that known as random projection test and requires of the selection of a distribution used to draw a vector in which to project the data and a goodness of fit test for the one-dimensional marginal distribution of the process to apply on the projected data. This leads to the conclusion that the sea surface elevations are generally non-Gaussian despite that their one-dimensional distribution is Gaussian in a few of the cases. The obtained results are significant as they indicate two important aspects:
  • The sea surface elevations are generally not Gaussian as the time period increases, from time series recorded in just 4 to 10 min to time series recorded in 10 to 21 h.
  • In a few of the cases the one-dimensional marginal is Gaussian for small and moderate time periods. For instance, for time series recorded in just 4 to 10 min, in 14 to 30 min and those recorded in 1 to 2 h.
These two facts result in that sea surface elevations cannot be modeled as Gaussian processes but in a few of the cases can be modeled as non-Gaussian processes with one-dimensional Gaussian marginals. Note that this is not a contradiction due to, as explained in Section 3, there a non-Gaussian processes with one-dimensional Gaussian marginals.
In testing the non-Gaussianity, multiple tests are performed and consequently multiplicity has to be taken into account. However, if it was not to be taken into account, the same results would have been obtained when all the buoys are studied for time periods of 6.47 h and over; with the rejection rates getting larger than those based on the FDR rate when the length period decreases.

Funding

A.N.-R. was supported by grant MTM2017-86061-C2-2-P funded by MCIN/AEI/10.13039/501100011033 and “ERDF A way of making Europe”.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Coastal Data Information Program at https://thredds.cdip.ucsd.edu/thredds/catalog/cdip/realtime/catalog.html (accessed on 20 August 2022).

Conflicts of Interest

The author declares no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
FDRFalse discovery rate
UTCCoordinated universal time
iidindependent identically distributed

Appendix A

Table A1. Start and end times for which the measurements have been recorded by each of the 59 buoys. These are GMT times which are reported in the format year-month-day and hours-minutes-seconds. In bold, the buoy whose measurements have been recorded for a smaller period of time.
Table A1. Start and end times for which the measurements have been recorded by each of the 59 buoys. These are GMT times which are reported in the format year-month-day and hours-minutes-seconds. In bold, the buoy whose measurements have been recorded for a smaller period of time.
BuoyStart Time (GMT)End Time (GMT)
Date (yyyy-mm-dd)TimeDate (yyyy-mm-dd)Time
0282021-04-2919:00:002022-08-2008:59:57
0292022-01-2621:00:002022-08-2008:59:58
0362022-05-2523:00:002022-08-2008:59:58
0452022-03-0120:00:002022-08-2008:59:58
0672020-12-0221:00:002022-08-2008:59:57
0712022-08-0219:32:132022-08-2008:59:58
0762022-06-0218:00:002022-08-2008:59:58
0922021-08-0421:00:002022-08-2008:59:57
0942020-10-0821:00:002022-08-2008:59:57
0982022-02-0421:00:002022-08-2008:59:58
1002021-02-1918:00:002022-08-2008:59:57
1062022-01-2023:00:002022-08-2008:59:58
1212021-06-2113:32:132022-04-1022:59:58
1322020-09-1522:30:002022-03-0101:22:48
1342022-01-1515:00:002022-08-2008:59:58
1392021-11-1420:00:002022-08-2008:59:58
1422022-03-0215:00:002022-05-2008:22:49
1432022-08-1623:00:002022-08-2008:59:58
1442021-11-1117:00:002022-08-2009:29:58
1472022-03-2115:00:002022-08-2008:59:58
1502021-11-1815:00:002022-08-2008:59:58
1542021-01-1217:00:002022-06-2721:59:57
1572020-06-2020:00:002022-08-2008:59:57
1582022-07-0602:02:132022-08-2008:59:58
1602022-05-0221:02:132022-08-2008:59:58
1622021-11-2018:00:002022-08-2008:59:58
1662022-05-1621:00:002022-05-1803:29:58
1682022-03-2520:00:002022-08-2008:59:58
1712022-01-1416:00:002022-05-1207:52:49
1812022-08-1121:02:132022-08-2008:59:58
1852022-08-0602:00:002022-08-2009:29:58
1882021-11-0310:23:102022-08-2009:20:55
1892021-10-2020:02:132022-08-2008:59:58
1912021-12-1320:00:002022-08-2008:59:58
1922022-08-0417:00:002022-08-2008:59:58
1942021-11-0515:00:002022-02-0623:52:49
1962022-07-1904:02:132022-08-2009:29:58
1972022-07-1904:02:132022-08-2008:59:58
1982021-06-1621:00:002022-08-2008:59:57
2012021-11-1021:00:002022-08-2008:59:58
2022021-07-2108:02:132022-05-0423:59:58
2032021-05-1219:00:002022-08-2008:59:57
2042021-08-2023:00:002022-05-1002:52:49
2092022-06-1018:32:132022-08-2008:59:58
2132022-03-1517:00:002022-08-2008:59:58
2142022-06-2804:02:132022-08-2008:59:58
2152022-04-1419:00:002022-08-2008:59:58
2172022-08-0823:02:132022-08-2008:59:58
2202022-04-2622:00:002022-08-2008:59:58
2222021-11-1515:00:002022-08-2008:59:58
2242020-07-1615:00:002022-08-2008:59:57
2302022-05-1917:00:002022-08-2008:59:58
2392022-07-1719:05:102022-08-2009:03:56
2402022-07-2219:02:132022-08-2008:59:58
2412022-08-0200:00:002022-08-2008:29:58
2432021-09-1320:00:002022-08-2008:59:57
2442022-03-3104:30:002022-03-3110:12:49
4302022-07-1518:02:132022-08-2008:59:58
4332022-08-0517:00:002022-08-2008:59:58
Table A2. The 59 available buoys are labelled by an identification number in descending order. The length of the associated time series is also reported, the smallest being highlighted in bold. The studied label represents the length of the studied time series after selecting the first 5 × 10 5 time points and eliminating the unobserved, missing, values.
Table A2. The 59 available buoys are labelled by an identification number in descending order. The length of the associated time series is also reported, the smallest being highlighted in bold. The studied label represents the length of the studied time series after selecting the first 5 × 10 5 time points and eliminating the unobserved, missing, values.
BuoyLengthStudiedBuoyLengthStudied
02852,817,065412,2781851,583,018216,438
02922,722,218423,96818832,062,4632304
0369,557,162458,52818933,569,2796912
04518,966,698488,48019127,592,874419,360
06769,170,857334,1121921,732,778451,754
0711,942,272394,01619420,652,288162,047
0768,695,466317,9841963,564,288396,288
09242,080,425500,0001973,561,984463,136
09475,258,025412,44819847,499,433306,294
09821,726,890444,70420131,237,802476,960
10060,447,913453,92020231,813,6312304
10623,376,554145,32220351,379,369403,232
12132,447,23129,95220457,986,3036912
132117,484,79714,5922097,808,256426,272
13423,966,378474,65621317,432,234446,838
13930,800,042347,7662145,884,416301,856
14217,403,64810,75221514,109,866481,855
143378,026336,5542171,262,592403,232
14431,147,946447,00822012,768,938449,142
14716,777,898476,96022230,712,490430,880
15030,380,714412,44822484,575,400449,312
15458,742,953479,26423010,248,362481,568
15787,427,752467,5742393,714,125352,544
1585,008,896479,2642403,161,088375,584
16012,109,824467,7442412,029,994456,224
16230,145,706435,48824337,661,353467,744
166140,714126,89024452,99229,952
16816,312,490453,9204303,939,840426,272
17126,015,999205,0884331,622,186410,144
181940,032493,088
Table A3. Bandwidth, per buoy, rounded to 2 decimal values used for the kernel density estimated curves in Figure 3.
Table A3. Bandwidth, per buoy, rounded to 2 decimal values used for the kernel density estimated curves in Figure 3.
buoy028029036045067071076092094098100
bandwidtd0.040.120.050.040.060.080.070.040.060.080.04  
buoy106121132134139142143144147150154
bandwidtd0.10.140.080.030.090.130.020.030.010.030.04  
buoy157158160162166168171181185191192
bandwidtd0.040.010.030.050.070.050.110.020.060.040.03  
buoy194196197198201203209213214215217
bandwidtd0.130.040.040.050.040.040.040.030.020.040.04  
buoy220222224230239240241243244430433
bandwidtd0.080.070.040.010.130.010.010.020.130.030.02

References

  1. Forristall, G.Z. On the statistical distribution of wave heights in a storm. J. Geophys. Res. Ocean. 1978, 83, 2353–2358. [Google Scholar] [CrossRef]
  2. Azaïs, J.M.; León, J.R.; Ortega, J. Geometrical characteristics of Gaussian sea waves. J. Appl. Probab. 2005, 42, 407–425. [Google Scholar] [CrossRef]
  3. Karmpadakis, I.; Swan, C.; Christou, M. Assessment of wave height distributions using an extensive field database. Coast. Eng. 2020, 157, 103630. [Google Scholar] [CrossRef]
  4. Haver, S. On the joint distribution of heights and periods of sea waves. Ocean. Eng. 1987, 14, 359–376. [Google Scholar] [CrossRef]
  5. Mendes, S.; Scotti, A. The Rayleigh-Haring-Tayfun distribution of wave heights in deep water. Appl. Ocean. Res. 2021, 113, 102739. [Google Scholar] [CrossRef]
  6. Tayfun, M.A. Distribution of Large Wave Heights. J. Waterw. Port Coastal Ocean. Eng. 1990, 116, 686–707. [Google Scholar] [CrossRef]
  7. Mori, N.; Liu, P.C.; Yasuda, T. Analysis of freak wave measurements in the Sea of Japan. Ocean. Eng. 2002, 29, 1399–1414. [Google Scholar] [CrossRef]
  8. Stansell, P. Distributions of freak wave heights measured in the North Sea. Appl. Ocean. Res. 2004, 26, 35–48. [Google Scholar] [CrossRef]
  9. Stansell, P. Distributions of extreme wave, crest and trough heights measured in the North Sea. Ocean. Eng. 2005, 32, 1015–1036. [Google Scholar] [CrossRef]
  10. Casas-Prat, M.; Holthuijsen, L. Short-term statistics of waves observed in deep water. J. Geophys. Res. Ocean. 2010, 115. [Google Scholar] [CrossRef] [Green Version]
  11. Longuet-Higgins, M.S. On the distribution of the heights of sea waves: Some effects of nonlinearity and finite band width. J. Geophys. Res. Ocean. 1980, 85, 1519–1523. [Google Scholar] [CrossRef]
  12. Jishad, M.; Yadhunath, E.; Seelam, J.K. Wave height distribution in unsaturated surf zones. Reg. Stud. Mar. Sci. 2021, 44, 101708. [Google Scholar] [CrossRef]
  13. Muraleedharan, G.; Rao, A.; Kurup, P.; Nair, N.U.; Sinha, M. Modified Weibull distribution for maximum and significant wave height simulation and prediction. Coast. Eng. 2007, 54, 630–638. [Google Scholar] [CrossRef]
  14. Naess, A. On the distribution of crest to trough wave heights. Ocean. Eng. 1985, 12, 221–234. [Google Scholar] [CrossRef]
  15. Boccotti, P. On Mechanics of Irregular Gravity Waves. Atti Accad. Naz. Lincei Mem. 1989, 19, 110–170. [Google Scholar]
  16. Klopman, G. Extreme Wave Heights in Shallow Water; Report H2486; WL|Delft Hydraulics: Delft, The Netherlands, 1996. [Google Scholar]
  17. van Vledder, G.P. Modification of the Glukhovskiy Distribution; Technical Report H1203; Delft Hydraulics: Delft, The Netherlands, 1991. [Google Scholar]
  18. Battjes, J.A.; Groenendijk, H.W. Wave height distributions on shallow foreshores. Coast. Eng. 2000, 40, 161–182. [Google Scholar] [CrossRef]
  19. Mendez, F.J.; Losada, I.J.; Medina, R. Transformation model of wave height distribution on planar beaches. Coast. Eng. 2004, 50, 97–115. [Google Scholar] [CrossRef]
  20. Wu, Y.; Randell, D.; Christou, M.; Ewans, K.; Jonathan, P. On the distribution of wave height in shallow water. Coast. Eng. 2016, 111, 39–49. [Google Scholar] [CrossRef]
  21. Nieto-Reyes, A. On the Non-Gaussianity of the Height of Sea Waves. J. Mar. Sci. Eng. 2021, 9, 1446. [Google Scholar] [CrossRef]
  22. Srokosz, M. On the joint distribution of surface elevation and slopes for a nonlinear random sea, with an application to radar altimetry. J. Geophys. Res. Ocean. 1986, 91, 995–1006. [Google Scholar] [CrossRef]
  23. Srokosz, M.; Longuet-Higgins, M. On the skewness of sea-surface elevation. J. Fluid Mech. 1986, 164, 487–497. [Google Scholar] [CrossRef]
  24. Hokimoto, T.; Shimizu, K. A non-homogeneous hidden Markov model for predicting the distribution of sea surface elevation. J. Appl. Stat. 2014, 41, 294–319. [Google Scholar] [CrossRef]
  25. Pena-Sanchez, Y.; Mérigaud, A.; Ringwood, J.V. Short-term forecasting of sea surface elevation for wave energy applications: The autoregressive model revisited. IEEE J. Ocean. Eng. 2018, 45, 462–471. [Google Scholar] [CrossRef]
  26. Schulz-Stellenfleth, J.; Lehner, S. Measurement of 2-D sea surface elevation fields using complex synthetic aperture radar data. IEEE Trans. Geosci. Remote Sens. 2004, 42, 1149–1160. [Google Scholar] [CrossRef]
  27. Collins, C.O.; Lund, B.; Waseda, T.; Graber, H.C. On recording sea surface elevation with accelerometer buoys: Lessons from ITOP (2010). Ocean. Dyn. 2014, 64, 895–904. [Google Scholar] [CrossRef]
  28. Reichert, K.; Dannenberg, J.; van den Boom, H. X-Band radar derived sea surface elevation maps as input to ship motion forecasting. In Proceedings of the IEEE OCEANS’10, Sydney, NSW, Australia, 24–27 May 2010; pp. 1–7. [Google Scholar]
  29. Hessner, K.; Reichert, K.; Hutt, B.L. Sea surface elevation maps obtained with a nautical X-Band radar–Examples from WaMoS II stations. In Proceedings of the 10th International Workshop on Wave Hindcasting and Forecasting and Coastal Hazard Symposium, North Shore, Oahu, HI, USA, 11–16 November 2007; pp. 11–16. [Google Scholar]
  30. Cherneva, Z.; Guedes Soares, C. Non-linearity and non-stationarity of the New Year abnormal wave. Appl. Ocean. Res. 2008, 30, 215–220. [Google Scholar] [CrossRef]
  31. Rodriguez, G.; Soares, C.G.; Pacheco, M.; Pe´rez-Martell, E. Wave Height Distribution in Mixed Sea States. J. Offshore Mech. Arct. Eng. 2001, 124, 34–40. [Google Scholar] [CrossRef]
  32. Tayfun, M.A. Narrow-band nonlinear sea waves. J. Geophys. Res. Ocean. 1980, 85, 1548–1552. [Google Scholar] [CrossRef]
  33. Petrova, P.G.; Aziz Tayfun, M.; Guedes Soares, C. The Effect of Third-Order Nonlinearities on the Statistical Distributions of Wave Heights, Crests and Troughs in Bimodal Crossing Seas. J. Offshore Mech. Arct. Eng. 2013, 135, 021801. [Google Scholar] [CrossRef]
  34. Nieto-Reyes, A.; Cuesta-Albertos, J.A.; Gamboa, F. A Random-Projection Based Test of Gaussianity for Stationary Processes. Comput. Stat. Data Anal. 2014, 75, 124–141. [Google Scholar] [CrossRef]
  35. Epps, T.W. Testing That a Stationary Time Series is Gaussian. Ann. Stat. 1987, 15, 1683–1698. [Google Scholar] [CrossRef]
  36. Lobato, I.; Velasco, C. A simple Test of Normality for Time Series. Econom. Theory 2004, 20, 671–689. [Google Scholar] [CrossRef]
  37. Benetazzo, A.; Barbariol, F.; Bergamasco, F.; Torsello, A.; Carniel, S.; Sclavo, M. Observation of Extreme Sea Waves in a Space–Time Ensemble. J. Phys. Oceanogr. 2015, 45, 2261–2275. [Google Scholar] [CrossRef]
  38. Coleman, R. What is a Stochastic Process? In Stochastic Processes; Springer: Dordrecht, The Netherlands, 1974; pp. 1–5. [Google Scholar] [CrossRef]
  39. Rozanov, Y.A. Stationary Random Processes; Holden-Day: San Francisco, CA, USA, 1967. [Google Scholar]
  40. Kozachenko, Y.; Pogorilyak, O.; Rozora, I.; Tegza, A. 2-Simulation of Stochastic Processes Presented in the Form of Series. In Simulation of Stochastic Processes with Given Accuracy and Reliability; Kozachenko, Y., Pogorilyak, O., Rozora, I., Tegza, A., Eds.; Elsevier: Amsterdam, The Netherlands, 2016; pp. 71–104. [Google Scholar] [CrossRef]
  41. Box, G.; Pierce, D.A. Distribution of Residual Autocorrelations in Autoregressive-Integrated Moving Average Time Series Models. J. Am. Stat. Assoc. 1970, 65, 1509–1526. [Google Scholar] [CrossRef]
  42. Said, S.E.; Dickey, D.A. Testing for Unit Roots in Autoregressive-Moving Average Models of Unknown Order. Biometrika 1984, 71, 599–607. [Google Scholar] [CrossRef]
  43. Perron, P. Trends and Random Walks in Macroeconomic Time Series: Further Evidence From a New Approach. J. Econ. Dyn. Control. 1988, 12, 297–332. [Google Scholar] [CrossRef]
  44. Kwiatkowski, D.; Phillips, P.C.; Schmidt, P.; Shin, Y. Testing the Null Hypothesis of Stationarity Against the Alternative of a Unit Root: How sure Are We that Economic Time Series Have a Unit Root? J. Econom. 1992, 54, 159–178. [Google Scholar] [CrossRef]
  45. D’Agostino, R.B.; Stephens, M.A. Goodness-of-fit techniques. Qual. Reliab. Eng. Int. 1986, 3, 71. [Google Scholar] [CrossRef]
  46. Cuesta-Albertos, J.; del Barrio, E.; Fraiman, R.; Matrán, C. The Random Projection Method in Goodness of Fit for Functional Data. Comput. Stat. Data Anal. 2007, 51, 4814–4831. [Google Scholar] [CrossRef]
  47. Pitman, J. Combinatorial Stochastic Processes. In Lectures from the 32nd Summer School on Probability Theory Held in Saint-Flour; Springer: Berlin/Heidelberg, Germany, 2006. [Google Scholar]
  48. Benjamini, Y.; Yekutieli, D. The Control of the False Discovery Rate in Multiple Testing under Dependency. Ann. Stat. 2001, 29, 1165–1188. [Google Scholar] [CrossRef]
  49. Benjamini, Y.; Hochberg, Y. Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. J. R. Stat. Soc. Ser. Methodol. 1995, 57, 289–300. [Google Scholar] [CrossRef]
  50. Rudemo, M. Empirical Choice of Histograms and Kernel Density Estimators. Scand. J. Stat. 1982, 9, 65–78. [Google Scholar]
  51. Bowman, A.W. An Alternative Method of Cross-Validation for the Smoothing of Density Estimates. Biometrika 1984, 71, 353–360. [Google Scholar] [CrossRef]
Figure 1. Top panel: World map with the coordinates of the 59 buoys whose measurements constitute the datasets analyzed in this paper. Bottom panel: Zoom of the top panel that shows the US mainland with the coordinates of the buoys close to it. The identification number is that in Table 1.
Figure 1. Top panel: World map with the coordinates of the 59 buoys whose measurements constitute the datasets analyzed in this paper. Bottom panel: Zoom of the top panel that shows the US mainland with the coordinates of the buoys close to it. The identification number is that in Table 1.
Jmse 10 01303 g001
Figure 2. Left panel: representation of the 10 5 first recordings, including unobserved (missing) data, of the time series associated to buoy 433. The nine observed voids represent unobserved data. Right panel: representation of the first segment of the time series in the left panel. Sea surface elevation in meters and time in seconds GMT with the format year-month-day and hours:minutes:seconds.
Figure 2. Left panel: representation of the 10 5 first recordings, including unobserved (missing) data, of the time series associated to buoy 433. The nine observed voids represent unobserved data. Right panel: representation of the first segment of the time series in the left panel. Sea surface elevation in meters and time in seconds GMT with the format year-month-day and hours:minutes:seconds.
Jmse 10 01303 g002
Figure 3. Histogram, based on 70 cells (but for buoy 241 on 40 cells), and kernel density estimated curve associated to each of the 55 buoys (time series) analyzed for a time period of 2.13 h in general and of 1.05 h for the buoys in B .
Figure 3. Histogram, based on 70 cells (but for buoy 241 on 40 cells), and kernel density estimated curve associated to each of the 55 buoys (time series) analyzed for a time period of 2.13 h in general and of 1.05 h for the buoys in B .
Jmse 10 01303 g003aJmse 10 01303 g003b
Figure 4. Left panel: FDR values corresponding to the studied buoys in Table 4. Right panel: FDR values corresponding to the buoys studied in Table 5. The line y = 0.05 is displayed in both panels in color grey.
Figure 4. Left panel: FDR values corresponding to the studied buoys in Table 4. Right panel: FDR values corresponding to the buoys studied in Table 5. The line y = 0.05 is displayed in both panels in color grey.
Jmse 10 01303 g004
Figure 5. Display of the rejection rates in Table 6: the values labelled by no projection are in green triangles, those by with projection in back plus symbols and those by minimum with blue circles. The time period in the x-axis corresponds to the buoys in general. For the buoys in B , it is 4.34 min, 14.10 min, 1.05 h, 2.13 h, 4.30 h, 6.47 h, 8.64 h and 10.81 h.
Figure 5. Display of the rejection rates in Table 6: the values labelled by no projection are in green triangles, those by with projection in back plus symbols and those by minimum with blue circles. The time period in the x-axis corresponds to the buoys in general. For the buoys in B , it is 4.34 min, 14.10 min, 1.05 h, 2.13 h, 4.30 h, 6.47 h, 8.64 h and 10.81 h.
Jmse 10 01303 g005
Table 1. Buoys identification number, with associated longitud and latitude coordinates, whose surface elevation measurements constitute the datasets analyzed in this paper.
Table 1. Buoys identification number, with associated longitud and latitude coordinates, whose surface elevation measurements constitute the datasets analyzed in this paper.
BuoyLatitudeLongitudeBuoyLatitudeLongitude
02833.86°−118.64°18536.7°−122.34°
02937.94°−123.46°18819.78°−154.97°
03646.86°−124.24°189−14.27°−170.5°
04533.18°−117.47°19132.52°−117.43°
06733.22°−119.87°19235.75°−75.33°
07134.45°−120.78°19430°−81.08°
07635.2°−120.86°19613.68°144.81°
09233.62°−118.32°19715.27°145.66°
09440.29°−124.73°19821.48°−157.75°
09821.41°−157.68°20132.87°−117.27°
10032.93°−117.39°20222.28°−159.57°
10621.67°−158.12°20333.77°−119.56°
12113.35°144.79°20459.6°−151.83°
13230.71°−81.29°20939.77°−73.77°
13427.55°−80.22°21333.58°−118.18°
13943.77°−124.55°21427.59°−82.93°
14237.79°−122.63°21533.7°−118.2°
14328.4°−80.53°21734.21°−76.95°
14427.34°−84.27°22032.75°−117.5°
14736.92°−75.72°22234.77°−121.5°
15034.14°−77.72°22437.75°−75.33°
15440.97°−71.13°23048.03°−87.73°
15736.33°−122.1°23920.75°−157°
15836.63°−121.91°24037.02°−76.15°
16042.8°−70.17°24164.47°−165.48°
16246.22°−124.13°24336°−75.42°
16650.03°−145.2°24424.41°−81.97°
16840.9°−124.36°43036.26°−75.59°
17136.61°−74.84°43336.2°−75.71°
18118.38°−67.28°
Table 2. Time period in UTC and GMT associated to each length in (6). The times are divided in those for the 6 buoys in B and the rest of 53 buoys, which are labelled by general.
Table 2. Time period in UTC and GMT associated to each length in (6). The times are divided in those for the 6 buoys in B and the rest of 53 buoys, which are labelled by general.
LengthUTCGMT
GeneralBGeneralB
10 3 647.17 s260.23 s10.79 min4.34 min
2.5 × 10 3 1819.04 s846.1719 s30.32 min14.10 min
10 4 7678.42 s3775.86 s2.13 h1.05 h
2 × 10 4 15,490.92 s7682.11 s4.30 h2.13 h
4 × 10 4 31,115.92 s15,494.61 s8.64 h4.30 h
6 × 10 4 46,740.92 s23,307.11 s12.98 h6.47 h
8 × 10 4 62,365.92 s31,119.61 s17.32 h8.64 h
10 5 77,990.92 s38,932.11 s21.66 h10.81 h
Table 3. Summary of the obtained p-values, for different time periods, for each of the 59 studied datasets under four different stationarity tests: Augmented Dickey-Fuller (first column), Phillips-Perron (second column), Ljung-Box test (third column) and kpps (fourth column). The null hypothesis is of stationarity for the kpps test and of non-stationarity for the other three. For each of the lengths, the minimum p-value value over the studied buoys is displayed for the kpps test and the maximum in the other three cases.
Table 3. Summary of the obtained p-values, for different time periods, for each of the 59 studied datasets under four different stationarity tests: Augmented Dickey-Fuller (first column), Phillips-Perron (second column), Ljung-Box test (third column) and kpps (fourth column). The null hypothesis is of stationarity for the kpps test and of non-stationarity for the other three. For each of the lengths, the minimum p-value value over the studied buoys is displayed for the kpps test and the maximum in the other three cases.
Time PeriodTests
GeneralBAugmented
Dickey-Fuller
Phillips-PerronLjung-Boxkpps
10.79 min4.34 min<0.01<0.018.66 × 10 8 >0.1
30.32 min14.10 min<0.01<0.016.09 × 10 10 >0.1
2.13 h1.05 h<0.01<0.011.48 × 10 6 >0.1
4.30 h2.13 h<0.01<0.010>0.1
8.64 h4.30 h<0.01<0.012.60 × 10 7 >0.1
12.98 h6.47 h<0.01<0.010>0.1
17.32 h8.64 h<0.01<0.010.18>0.1
21.66 h10.81 h<0.01<0.010.36>0.1
Table 4. p-Values resulting from applying the Epps test (columns 2 and 6) and the Lobato and Velasco test (columns 3 and 7) per dataset associated to each of the 55 buoys (columns 1 and 5) studied under a time period of 2.13 h in general and of 1.05 h for the buoys in B. FDR (columns 4 and 8) combination, for dependent p-values, of the two p-values per buoy, with the ones smaller than 0.05 highlighted in bold.
Table 4. p-Values resulting from applying the Epps test (columns 2 and 6) and the Lobato and Velasco test (columns 3 and 7) per dataset associated to each of the 55 buoys (columns 1 and 5) studied under a time period of 2.13 h in general and of 1.05 h for the buoys in B. FDR (columns 4 and 8) combination, for dependent p-values, of the two p-values per buoy, with the ones smaller than 0.05 highlighted in bold.
BuoyEppsL.-V.FDRBuoyEppsL.-V.FDR
0280.910.050.091710.667.14 × 10 11 1.43 × 10−10
0290.420.610.611811.61 × 10 3 1.54 × 10 4 3.07 × 10−4
0360.150.140.151850.910.60.91
0450.970.280.561910.520.310.52
0670.280.150.281920.880.070.14
0710.470.610.611948.07 × 10 3 2.76 × 10 9 5.53 × 10−9
0760.200.240.241960.070.350.15
0920.090.010.031970.810.160.33
0940.870.270.541980.491.58 × 10 4 3.17 × 10−4
0980.390.790.772010.010.310.02
1000.580.990.992030.930.740.93
1063.29 × 10 4 0.026.58 × 10−42090.730.010.01
1210.411.43 × 10 6 2.86 × 10−62130.430.420.43
1320.083.08 × 10 11 6.16 × 10−112140.046.35 × 10 6 1.27 × 10−5
1340.760.760.762150.810.040.09
1390.820.220.432170.030.000.01
1423.56 × 10 4 1.59 × 10 8 3.18 × 10−82200.560.770.77
1430.021.49 × 10 4 2.98 × 10−42220.860.600.86
1440.015.51 × 10 4 1.10 × 10−32240.013.13 × 10 9 6.27 × 10−9
1470.480.360.482300.660.040.09
1500.330.470.472390.220.770.44
1540.630.060.112400.740.640.74
1570.160.450.332410.100.200.19
1580.600.670.672430.060.010.01
1600.180.020.042440.510.020.04
1620.920.200.404300.170.320.32
1660.686.87 × 10 4 1.37 × 10−34330.190.080.15
1680.720.960.96
Table 5. FDR values (column 6) resulting of applying the random projection test for each buoy (column 1) with a FDR adjusted p-value larger than 0.05 in Table 4 (time period of 2.13 h in general and of 1.05 h for the buoys in B). The parameters and the one-dimensional tests used in performing the random projection test are included for each buoy.
Table 5. FDR values (column 6) resulting of applying the random projection test for each buoy (column 1) with a FDR adjusted p-value larger than 0.05 in Table 4 (time period of 2.13 h in general and of 1.05 h for the buoys in B). The parameters and the one-dimensional tests used in performing the random projection test are included for each buoy.
BuoyEpps
(100,1)
Epps
(2,7)
L.-V.
(100,1)
L.-V.
(2,7)
FDR
0280.890.620.040.010.04
0290.420.460.610.650.65
0360.200.260.160.260.26
0450.980.980.270.260.81
0670.340.260.150.160.34
0710.460.530.620.780.78
0760.210.730.240.620.73
0940.870.810.270.760.87
0980.390.560.790.850.85
1000.620.841.001.001.00
1340.720.240.760.320.76
1390.800.560.240.390.80
1470.540.750.360.550.75
1500.190.530.480.610.61
1540.600.500.070.290.30
1570.190.160.430.520.52
1580.590.510.690.760.76
1620.910.980.190.240.72
1680.730.760.960.870.96
1850.910.770.600.600.91
1910.500.470.330.630.63
1920.690.090.080.080.18
1960.100.320.360.450.39
1970.580.070.140.010.04
2030.930.950.740.720.95
2130.430.650.420.870.87
2150.670.500.050.110.19
2200.600.880.780.350.88
2220.840.620.590.640.84
2300.790.830.020.080.08
2390.220.290.770.790.79
2400.780.930.760.960.96
2410.110.180.260.110.26
4300.210.410.340.630.63
4330.310.720.140.650.58
Table 6. Rejection rates along different length periods when no projections are used (first row), when the proposed projection procedure is used (second row) and when no multiplicity is taken into account (third row), i.e., the minimum of the p-values is used for the rejection. m stands for minutes and h for hours.
Table 6. Rejection rates along different length periods when no projections are used (first row), when the proposed projection procedure is used (second row) and when no multiplicity is taken into account (third row), i.e., the minimum of the p-values is used for the rejection. m stands for minutes and h for hours.
General10.79 m30.32 m2.13 h4.30 h8.64 h12.98 h17.32 h21.66 h
Time PeriodB4.34 m14.10 m1.05 h2.13 h4.30 h6.47 h8.64 h10.81 h
no projection 15.2519.3036.3658.4978.4380.3990.2096.08
witd projection 22.0322.8140.0058.4978.4382.3592.1696.08
minimum 30.5124.5643.6466.0480.3982.3592.1696.08
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Nieto-Reyes, A. On the Non-Gaussianity of Sea Surface Elevations. J. Mar. Sci. Eng. 2022, 10, 1303. https://doi.org/10.3390/jmse10091303

AMA Style

Nieto-Reyes A. On the Non-Gaussianity of Sea Surface Elevations. Journal of Marine Science and Engineering. 2022; 10(9):1303. https://doi.org/10.3390/jmse10091303

Chicago/Turabian Style

Nieto-Reyes, Alicia. 2022. "On the Non-Gaussianity of Sea Surface Elevations" Journal of Marine Science and Engineering 10, no. 9: 1303. https://doi.org/10.3390/jmse10091303

APA Style

Nieto-Reyes, A. (2022). On the Non-Gaussianity of Sea Surface Elevations. Journal of Marine Science and Engineering, 10(9), 1303. https://doi.org/10.3390/jmse10091303

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop