Assessing Inhomogeneities in Extreme Annual Rainfall Data Series by Multifractal Approach

Amanda P. García-Marín; Javier Estévez; Renato Morbidelli; Carla Saltalippi; José Luis Ayuso-Muñoz; Alessia Flammini

doi:10.3390/w12041030

Abstract

Testing the homogeneity in extreme rainfall data series is an important step to be performed before applying the frequency analysis method to obtain quantile values. In this work, six homogeneity tests were applied in order to check the existence of break points in extreme annual 24-h rainfall data at eight stations located in the Umbria region (Central Italy). Two are parametric tests (the standard normal homogeneity test and Buishand test) whereas the other four are non-parametric (the Pettitt, Sequential Mann–Kendal, Mann–Whitney U, and Cumulative Sum tests). No break points were detected at four of the stations analyzed. Where inhomogeneities were found, the multifractal approach was applied in order to check if they were real or not by comparing the split and whole data series. The generalized fractal dimension functions D_q and the multifractal spectra f(α) were obtained, and their main parameters were used to decide whether or not a break point existed.

Keywords:

extreme rainfall data series; inhomogeneities; multifractality

1. Introduction

Frequency analysis of extreme rainfall data (defined as yearly maxima of a certain duration) is a common application in hydrologic engineering. It should be applied to recorded series that follow the characteristics of independency, stationarity and homogeneity [1]. Inhomogeneities in station data records can be due to observational routines (station relocations or changes in measuring techniques) and in that case, statistical methods and metadata information are effective for identifying these [2]. However, some shifts in climate dynamics are a consequence of human activities [3,4,5], or are related to El-Niño Southern Oscillation (ENSO) events such as those that occurred in 1976–77 [6], or to volcanic eruptions and changes in atmospheric composition and circulation [7].

Regardless of the reason, it is important to interpret past variability and recognize the shifts in climatic data in order to make adequate future projections. Therefore, it is crucial to investigate the effects of climate change on hydrologic time series, and especially on extreme rainfall data in order to accurately estimate or even update the intensity-duration-frequency (IDF) curves in a certain place [8]. The IDF curves are important tools for hydrologic and hydraulic design, which nowadays faces the problems of increased urbanization and population and of climate change [9], among others.

Trends in hourly, monthly, seasonal or annual rainfall series have been studied in several works with different results [10,11,12,13,14,15,16,17].

Numerous statistical methods such as the standard normal homogeneity (SNH) test, Buishand test, Pettitt test or Sequential Mann–Kendall test, among others, can be applied to evaluate the homogeneity of climate time series and have been widely used [13,18,19]. Some of the shifts detected under these classical tests turn out to be the regular behavior of time series under the scaling assumption [20,21]. Therefore, authors such as [22] recommend the consideration of scaling behavior of annual maximum rainfall when analyzing the trends in a time series, especially considering that the scaling hypothesis can increase the probability of rejection of correct null hypothesis [23]. Thus, accounting for scaling may help to avoid contradictions in trend analysis studies [23].

Authors such as [24] or [25] have analyzed changes in temporal data series during scale invariance, considering that several works have demonstrated the multifractal properties of meteorological systems [26,27,28]. Those systems exhibit the same properties for different measuring scales, thus certain behavioral information can be transferred from one scale to another, which is very useful for modeling purposes.

Sometimes, different results for the year when a shift or break point is detected in a data series are obtained by different statistical homogeneity tests, and metadata or other local and climate information cannot help to determine whether a break point really exists or not. In this work, we propose a novel use of the scale invariance properties of data sets as a useful tool to confirm whether inhomogeneities previously found in extreme rainfall data series are real or not. Moreover, the methodology can also be used to confirm the existence of a shift when only one or a few statistical tests have detected it. If a break point exists in an extreme rainfall series, the two data series derived by splitting the whole one before and after that break point should have different characteristics and can be considered as different. Therefore, the length decreases and can influence rainfall data estimation when applying frequency analysis to the two data series; this element is of great importance, especially for low return period quantile estimation, and so for IDF estimation.

For these purposes, extreme annual 24-h rainfall data from eight rain gauge stations with sufficiently long series were selected in the Umbria region (Central Italy). Six statistical homogeneity tests were applied to the selected extreme rainfall series in order to highlight inhomegeneities. The multifractal properties of the data series were then used to check the existence or not of break points detected by one or a few or all tests.

2. Materials and Methods

2.1. Data Source

Rainfall data from the Umbria region (central Italy) are used in this work. The Umbria region has an area of 8456 Km², and is located in the Tiber river basin, which crosses the region from the north to the south-west. Its landscape is partially mountainous due to the presence of the Apennine Mountains that reach up to 2000 m a.s.l., and partially hilly in the central and western areas with altitudes ranging from 100 to 800 m a.s.l.

The mean annual rainfall for the last century is about 900 mm, with values varying in space from 650 mm to 1450 mm. The highest monthly rainfall values generally occur during the autumn–winter period, together with floods caused by widespread rainfall. The highest and lowest rainfall depths typically take place in November and July, respectively.

Five significant droughts have occurred in recent years in the region (2001 to 2003, then in 2007, 2012, 2015 and 2017) as well as six dangerous flood events (in 2005, 2008, 2010, 2012 and in 2013), with very significant economic impacts [29].

The study area is currently monitored through a dense rain gauge network (about 1 rain gauge every 90 Km²). For this study, only the stations with at least 50 continuous years of extreme 24hour duration rainfall data have been considered (Table 1): Bastia Umbra, Bevagna, Nocera Umbra, Petrelle, Ponte Nuovo di Torgiano, Spoleto, Terni and Todi. Figure 1 shows the location of the eight sites considered in the Umbria region.

Table 1. Stations selected for the work and the period of data considered.

Figure 1. Rain gauges in the Umbria region used in this work.

2.2. Methodology

2.2.1. Homogeneity Tests

There are many statistical tests to check the homogeneity of a data series y₁, y₂, …, y_n, with n the number of data, with specific characteristics. Some tests are known as parametric because they assume that the analyzed variable is normally distributed. Those that do not make this assumption are called non-parametric tests.

Under the null hypothesis, the annual values y_i (i = 1, …, n) of the testing variable y are independent and identically distributed and the series is considered as homogeneous. Under the alternative hypothesis, the series shows a break in the mean and is considered as inhomogeneous. Moreover, there are tests that are able to give information about the year the break occurred, whereas those that assume the series is not randomly distributed under alternative hypothesis cannot provide this information.

In the present work, six homogeneity tests were applied. All of them are able to detect the year of the break if it exists. Two parametric tests were selected: the Buishand range (BR) test and the standard normal homogeneity (SNH) test. The non-parametric tests were: the Pettitt (PT) test, Tequential Mann–Kendall test (SQMK) test, Mann–Whitney U (MWU) test and Cumulative Sum (CUSUM) test.

(1): Buishand Range test
This test is based on the adjusted partial sums or cumulative derivations from the mean of the n data points, $\bar{y}$ :

$S_{0}^{*} = 0; S_{k}^{*} = \sum_{i = 1}^{k} (y_{i} - \bar{y}), k = 1, \dots, n$

(1)

For a homogeneous record, $S_{k}^{*}$ values fluctuate around zero. The values of $S_{k}^{*}$ can be rescaled by the sample standard deviation, $D_{y}$ :

$S_{k}^{* *} = \frac{S_{k}^{*}}{D_{y}}, k = 0, \dots, n$

(2)

with $D_{y}^{2} = \sum_{i = 1}^{n} \frac{{(y_{i} - \bar{y})}^{2}}{n}$
The Q, which is sensitive to departures from homogeneity, can then be obtained as:

$Q = m a x_{0 \leq k \leq n} | S_{k}^{* *} |$

(3)

For different significance levels, the critical values for the test statistics (Qc) depend on the number of data and can be found in [30].
(2): Standard Normal Homogeneity test
This test [31] compares the mean of the first k years of data with the last (n − k) years by using the T_k of the statistics:

$T_{k} = k \bar{z_{1}} + (n - k) \bar{z_{2}}, k = 1, 2, \dots n$

(4)

where

$\bar{z_{1}} = \frac{1}{k} \sum_{i = 1}^{k} \frac{y_{i} - \bar{y}}{s}, \bar{z_{2}} = \frac{1}{n - k} \sum_{i = k + 1}^{n} \frac{y_{i} - \bar{y}}{s}, a n d s = \frac{1}{n} \sum_{i = 1}^{n} {(y_{i} - \bar{y})}^{2}$

(5)

The year k shows a break if the value of T_k is the maximum. If the $T_{0} = \max_{1 \leq k \leq n} T_{k}$ is greater than the critical values [32], the null hypothesis is rejected.
(3): Pettitt test
The Pettitt test [33] is based on the ranking r_i of the y_i values. The ranks r_i are obtained by ordering the data in crescent order, so that the smallest one gets ranked 1 and the highest gets the n-rank. The U_k is then obtained as:

$U_{k} = 2 \sum_{i = 1}^{k} r_{i} - k (n + 1) k = 1, 2, \dots, n$

(6)

If a break occurs in the year K, then the U_k is:

$U_{K} = m a x_{1 \leq k \leq n} | U_{k} |$

(7)

The statistical significance of the break point is checked by comparing the value of $U_{K}$ (Equation (7)) to its theoretical value:

$U_{K}_{α} = {(- \frac{L n (α (n^{3} + n^{2}))}{6})}^{1 / 2}$

(8)

with α the significance level.
(4): Sequential Mann–Kendall test
This test is usually applied to evaluate trends in data series and also to check the moment when they start being significant [34,35,36]. It calculates two series of statistic values: one (U(t)) for the progressive temporal data set (y₁, y₂, …, y_n) and the other (U’(t)) for the corresponding regressive data set (y_n, y_n−1, …, y₁), both with an average value of zero and a standard deviation of one. Usually, the ranks of the y_i values are preferred.
The t_k has to be obtained by:

$t_{k} = \sum_{i = 1}^{k} n_{i}, (2 \leq k \leq n)$

(9)

with n_i being the number of cases for which $y_{i} > y_{j} (j = 1, 2, \dots, i - 1) .$
Under the null hypothesis, it is assumed that no trend exists, and t_k follows a normal distribution with average and variance values given by:

$E [t_{k}] = \frac{k (k - 1)}{4} V a r [t_{k}] = \frac{k (k - 1) (2 k + 5)}{72} (2 \leq k \leq n)$

(10)

The sequential values of U(t_k) are finally obtained as:

$U (t_{k}) = \frac{t_{k} - E [t_{k}]}{\sqrt{V a r (t_{k})}}, (k = 2, 3, \dots, n)$

(11)

A positive value of U(t_k) shows a positive trend, whereas a negative trend appears if the U(t_k) value is negative.
The null hypothesis is rejected if $| U (t_{k}) | > Z_{(1 - \frac{α}{2})}$ where $Z_{(1 - \frac{α}{2})}$ is the critical value of the typified normal distribution with a probability higher than $\frac{α}{2}$ . For a significance level α = 5%, the critical value of $Z_{(1 - \frac{α}{2})}$ is 1.9604.
The values of U’( $t_{k}$ ) are also obtained from regressive temporal data series, t_k’, E[t_k’], Var[t_k’] by using Equations (9)–(11), and finally:

$U^{'} (t_{k}) = - U (t_{k^{'}}) w i t h k^{'} = n + 1 - j (j = 1, 2, \dots, n)$

(12)

By plotting the curves of the U(t) and U’(t) as a function of the year, a break point is detected where the curves cross and diverge.
(5): Mann–Whitney U test
This test can detect a step change in a time series by assessing the difference in the means of two sub-series arising from splitting the complete data series. The original time series (y_i, i = 1, 2, …, n) is broken into two subseries, one from y₁ to y_n1 and the other from y_n1+1 to y_n, of sizes n₁ and n₂ (n − n1 + 1), respectively. A new data series, z_t (t = 1, …, n) is obtained by rearranging the original time series y_i in increasing order of magnitude. The Mann–Whitney test statistic is obtained by:

$Z_{c} = \frac{\sum_{i = 1}^{n_{1}} R (y_{i}) - n_{1} (n_{1} + n_{2} + 1) / 2}{{[n_{1} n_{2} (n_{1} + n_{2} + 1 / 12]}^{0.5}}$

(13)

where R(y_i) is the rank of observation y_i in the series z_t. If $| Z_{c} | > Z_{(1 - \frac{α}{2})}$ the hypothesis of equal means in both subseries is rejected, being $Z_{(1 - \frac{α}{2})}$ the (1 − α/2) quantile of the normal distribution.
(6): Cumulative Sum test
The distribution-free cumulative sum technique was proposed by [37] to detect if a significant step change occurred in a given time series at a significance level α = 0.05 [38]. The null hypothesis considers that no step change exists. For the given data series y_i (i = 1, …, n), the test statistics V_k is obtained as:

$V_{k} = \sum_{i = 1}^{k} s i g n (y_{i} - y_{m e d i a n})$

(14)

where k = 1, 2, …, n; y_median is the median of the time series, and the sign function is:

$sign (y_{i} - y_{m e d i a n}) = {\begin{matrix} 1 i f y_{i} > y_{m e d i a n} \\ 0 i f y_{i} = y_{m e d i a n} \\ - 1 i f y_{i} < y_{m e d i a n} \end{matrix}$

(15)

A step-change point consists of any year when the maximal or minimal V_k falls outside the 95% confidence limit of ±1.36 $\sqrt{n}$ [38].

2.2.2. Multifractal Characterization

Processes that look the same regardless of the scale at which they are observed are considered to be fractal type. Natural fractal processes, such as rainfall [26] have a statistical nature and their scaling properties can be expressed by statistical relationships [39,40]. Usually, the fractal analysis of data sets is performed by means of fixed-size algorithms that rely on the subdivision of the system into smaller parts with equal size, changing that size iteratively [41].

The scaling exponents of the qth moments of the process under study can be analyzed thorough the Generalized Fractal Dimension (GFD) function, D_q, also known as the Rénji spectrum [42] by:

D_{q} = \lim_{r \to 0} \log \frac{Z (q, r)}{(q - 1) l o g r} \forall q \neq 1

(16)

where

Z (q, r)

is the partition function, depending on the values of the mass distribution (μ) of the points that represent a set data composed of n data. Considering a box of length r to fill the set, the number of ni points of data included can be obtained and also the value of

μ_{i} = \frac{n_{i}}{n}

. The partition function is then given as

Z (q, r) = \sum_{i = 1}^{n} μ_{i}^{q}

. Considering the limit of Equation (16) when q→1, the value of D1 is obtained as:

D_{1} = \lim_{r \to 0} \frac{\sum_{i = 1}^{n} μ_{i} l o g μ_{i}}{\log r}

(17)

The value of D_q for q = 0 is known as the box-counting dimension or the fractal dimension of the system. For q = 1 the information entropy is obtained. The so-called correlation dimension is obtained for q = 2.

The log-log plot of

Z (q, r)

versus r gives the value of the linear segments of the mass exponent τ_q. The relation between the Rénji spectrum (D_q) and τ_q is obtained as:

τ_{q} = (q - 1) D_{q}

(18)

The Lipschitz–Hölder exponent,

α_{q} = - \frac{{d τ}_{q}}{d q},

obtains the multifractal spectrum

f (α)

as:

f (α_{q}) = q α_{q} + τ_{q}

(19)

The multifractal spectrum shows an inverted parabolic shape and its parameters can be related to interesting characteristics of the data set [25]. The smoothest and most extreme events in the process are related to the

α_{m a x}

and

α_{m i n}

values, respectively. The structure of the process can be explained through the value of

α_{0}

. Processes with low correlation and fine structure show high values of

α_{0}

. Low values of

α_{0}

are expected for regular processes. The asymmetry parameter is related to the role that extreme events play in the temporal structure of the data set. The width of the spectrum measures the degree of multifractality of the processes.

3. Results

3.1. Break Point Detection

The BR and SNH tests were applied to all the extreme annual daily rainfall data series for each station selected in the Umbria region. The results obtained for both tests are shown in Table 2, where the statistics and their critical values are given for a significance level of 0.05.

Table 2. Results obtained for the parametric Buishand range (BR) and Standard Normal Homogeneity (SNH) tests; the statistics Q and T₀ and their critical values Q_c and T_0c are also shown.

The results of the BR test show that only at the Spoleto station, the value of Q is higher than the critical one Q_c, and an inhomogeneity is detected in 1977. The SNH test results show that no inhomogeneity is present in any of the data series analyzed, considering that all the statistics T₀ are below their critical values T_0c.

The four non-parametric tests described in the Methodology section were also applied to the data series. The results obtained for the PT test are shown in Table 3. For a significance level of

α = 0.05,

the value of K is higher than the critical value K_c only for the Spoleto extreme data series. A break point is then detected in Spoleto in 1969. For the rest of the data series analyzed in the Umbria region, no inhomogeneities were detected by the PT test.

Table 3. Results obtained after applying the Pettitt test.

The SQMK test was applied to all the extreme annual data series. Inhomogeneities were found for the Spoleto and Terni stations, and the results are shown in Figure 2. For Spoleto, a decreasing trend is found after 1965 according to the behavior of the U(t_k) curve. The progressive U(t_k) and regressive U’(t_k) curves cross and diverge in 1980 where a break point is detected; the average extreme annual rainfall for the 1959 to 1980 period is 70.65 mm while for the period from 1981 to 2017 it is 60.21 mm. After 1981, the trend is decreasing and is significant starting from 2003 when the U(t_k) values become lower than the confidence level limit considered (

α = 0.05

).

Figure 2. Results obtained from the sequential Mann–Kendall test in the Spoleto and Terni stations.

For the Terni station, a break point is detected in 1957 according to the behavior of progressive and regressive SQMK test curves. A decreasing trend in the extreme annual rainfall data values starts in hat year, and is statistically significant for short periods in the early seventies (from 1969 to 1974) and for the years 1994, 1995 and 1996.

The step changes detected by the MWU test for the analyzed series are shown in Figure 3. Considering the differences in the means, the years with break points are 1968, 1994, 1969 and 1962, for the Nocera Umbra, Petrelle, Spoleto and Terni stations, respectively. The average values before the break point are higher than those after it for all sites except for Petrelle, where the average value of extreme rainfall data is higher after the inhomogeneity date.

Figure 3. Step changes detected by the Mann–Whitney U test at the Nocera Umbra, Spoleto, Petrelle and Terni stations.

Finally, the application of CUSUM technique to the available data series detected no step-changes in any years. As an example, the results for the Bastia Umbra and Terni stations are shown in Figure 4. The V_k values obtained for both stations data series are always below the confidence limit considered, being close for year 1969 in Bastia Umbra and for 1960 and 1962 in the Terni station.

Figure 4. Results obtained at the Bastia Umbra and Terni stations after applying the CUSUM test.

In order to summarize the results obtained with all of the applied tests, Table 4 shows the year of the break points detected at each station. For Bastia Umbra, Bevagna, Ponte Nuovo and Todi, no inhomogeneities were detected by any of the tests applied. For the Nocera and Petrelle stations, only the MW test was able to detect a break point in 1968 and 1994, respectively. Two tests indicated inhomogeneities for the rainfall data at Terni station: the SQMK test in 1957 and MWU test in 1962. Three different years were detected as break points in the Spoleto rainfall data: 1969 by the PT test, 1977 by BR test, and 1980 by the SQMK test.

Table 4. Summary of the years in which break points were detected by the applied tests.

According to the results shown above, the multifractal behavior of the available data series will be used to analyze whether a break point detected can be considered as real or not, when only one test is able to detect it or when different years with inhomogenities arise.

3.2. Multifractal Behavior

For all stations at which a break point was detected (Nocera Umbra, Petrelle, Spoleto and Terni), multifractal characterization of extreme annual 24-h rainfall was performed for the complete data sets, and also for the series obtained from splitting the original data sets for the year of the inhomogeneity. For all of the data series, the GFD function was obtained and the main parameters are shown in Table 5. For all situations, the value of the fractal dimension D₀ is equal to 1. This value is related to the number of boxes needed to cover the fractal object or system under analysis. For all the sites and series, the value of D₀ = 1 fills the entire 1D domain.

Table 5. Main results obtained for the Rénji spectrum of the extreme rainfall data series for which break points were detected by the statistical tests.

The value of the fractal dimension for q = 1, D₁, describes the degree of heterogeneity of the measure and characterizes the distribution and intensity of singularities with respect to the mean [43,44]. For the complete extreme rainfall data sets, the most heterogeneous is the one at the Spoleto station, followed by the Petrelle, Nocera Umbra and Terni stations. This behavior can also be analyzed through the value of (D₀-D₁). The greater the value, the less uniform is the data set. The most uniform data series are those at Nocera Umbra and Terni stations, where fewer differences in the data than those in Petrelle and Spoleto are expected.

The correlation dimension values, D2, are also shown in Table 5. They can be used to analyze how often recurrence can be expected in a system, so thus high values are characteristic of less recurrent and predictable data series [45]. The highest value is obtained for Spoleto, followed by Petrelle, Terni and Nocera Umbra, respectively. On the contrary, high values of (D0-D2) (Table 5) are expected for predictable data series.

Even though the fractal dimension values are the main important to be considered in order to analyze the fractal behavior of a system, the trend of D_q for several q values (Rénji spectrum) is shown in Figure 5. The multifractal spectra f(α) were also obtained for all the available data series (Figure 6) and several multifractal parameters are shown in Table 6.

Figure 5. Multifractal dimensions functions D_q for the complete data series and also for the rainfall series before and after the break points detected in the Nocera Umbra, Petrelle, Spoleto and Terni stations.

Figure 6. Multifractal spectra f(α) for the complete rainfall data series and also for the rainfall data series before and after the break points detected in the Nocera Umbra, Petrelle, Spoleto and Terni stations.

Table 6. Main results obtained for the multifractal spectrum of the extreme rainfall data series for which break points were detected by the statistical tests.

4. Discussion

The same information can be analyzed by focusing on the similarities and differences between the fractal dimension values for complete data series and the series before and after the break points. For the Nocera Umbra station, the values obtained for the data set before the break point are similar to those obtained for the complete data set. More differences appear for the data set after the inhomogeneity. According to (D₀-D₁), more uniform data are found before the break point than after it. Moreover, the former data set is also more predictable than the latter, according to the (D₀-D₂) values.

The similarities found in Nocera Umbra between the complete and one of the split data sets are also found for the Terni station, where the break point year detected is 1957. Almost equal values of D₁, D₂, (D₀-D₁) and (D₀-D₂) are found, showing similar uniformity and degree of predictability in the data sets.

Different values are obtained for the Petrelle data sets, with the complete data series being less uniform than the one before the break point according to the (D₀-D₁) values, and more predictable than the one after the break point according to the (D₀-D₂) values.

Similar values for the fractal dimension values D₁, D₂, and their differences with D_0, are found for the complete data set of Spoleto, and those obtained with the extreme rainfall data set after 1980. Completely different values are found for both data series obtained before and after the break point in 1969, and those obtained for the complete data set.

For the Nocera Umbra station, the GFD function D_q for the complete data series is almost coincident with the one obtained for the rainfall data series before the break point. Different values were obtained for the rainfall data series after the break point. The similarity in the behavior of the complete data series and the one before the potential break point means that no real break point exists since the addition of the data after it does not change the general pattern.

The Rénji spectra obtained for the Petrelle station data series are different. In this case, the three data series should be considered as different and thus the break point is real.

No similarities arise between the data series before or after the break points with respect to the complete data series at Spoleto station if all the D_q values are considered. Nevertheless, for negative q values, similar D_q functions are obtained for the data sets before and after the break points in 1969 and 1980. These similitudes are maintained for positive q values for D_q functions of data sets before both break points. Nevertheless, D_q functions are different for positive q values for data sets after the break points, with the one after the inhomogeneity at 1980 being the closest to the D_q function for the complete data series. This behavior means that the break point took place in 1969.

For the Terni station, similar trends of D_q function are found for the complete data series and those before and after the break point year of 1957, respectively. For the former series, similar values are found for q positive values whereas for the latter the similarities arise for negative q values. Different D_q functions are obtained for those series derived from the break point in1962. This behavior implies that the inhomogeneity is placed at 1962 in the Terni extreme rainfall data series.

Regarding the shape of the multifractal spectra, similar f(α) are obtained for the Nocera Umbra complete data set and for the one before the break point year. Even though left-side skewed multifractal spectra are found for all the data series, the one for the data set after the break point differs from those previously mentioned.

Different multifractal spectra are also obtained for the Petrelle data sets, where not even the symmetry is similar; the complete data set spectrum is slightly right-skewed, whereas those obtained before and after the break point are skewed in the opposite direction.

Multifractal spectra before the break points appear similar at Spoleto, where the one for the complete data set also differs from those obtained before and after the break points, showing some similarities only for the left side of f(α) after the inhomogeneity in 1980.

Similarities in the multifractal spectra arise for the Terni station for both the complete data set and those before and after the break point in 1957, whereas different results are found for f(α) before and after the break point in 1962 and the whole rainfall data set.

For Nocera Umbra station, the complete extreme rainfall data series shows a negative value for the asymmetry parameter, which means that extreme events are important in the data series, and also there is a high complexity according to the value of parameter w. Even though the asymmetry parameter values are also negative for data sets before and after the break point, the last data series seems to be less influenced by extreme events than the previous one (or even than the complete series). The richness of the process is also lower for the data set after the break point than for both complete and before break point data sets. Almost similar values for all the parameters (α_min, α_max, a_s and w) were obtained when analyzing both the complete data set and the one before the break point.

For the Petrelle station, different values are obtained for all the parameters and data sets. The influence of extreme events is present in both data sets before and after the break point according to the negative value of the asymmetry parameter (a_s), whereas the complete data series shows a greater influence of less extreme events, with a positive value of a_s. The width of the spectra (w) is also different with a loss of richness (lower multifractality) in the complete data set compared to the one before the break point, maybe due to the influence of the data after the break point, where the richness has the lowest value.

For the Terni station, all the spectra exhibit negative values for the asymmetry parameter (a_s), which implies the dominance of extreme events in all the data series. The values closer to the αmin and αmax parameters are found for the complete data series and for the series before and after 1957, and thus a similar influence by extreme and smooth events can be expected for all the series. Similar complex processes are found according to the values of w. This behavior is not present for data series before and after the break point of 1962, where a loss of multifractality is found after the break point compared to the data series before it. This last data series also shows the strong influence of extreme events according to both the values of a_s and αmin.

The multifractal spectrum for the Spoleto complete data series shows a negative value for the asymmetry parameter, which is influenced by the presence of extreme events in the data set. The shape changes to a positive value of the asymmetry parameter for the data sets before the break points. The most similar values of all the parameters are found for the complete data set and the one after the break point of 1980.

Authors such us [25] used the variability in the values of α₀ and w from the multifractal spectrum to assess changes in climate data series. For all the data sets studied in this work, the values of α₀ are always equal to 1 and thus no possible changes can be explained based on these values. Nevertheless, there are differences in the values obtained for the parameter that describes the degree of multifractality of the data series, w. Thus, the differences in the values of this multifractal parameter will be of great importance in deciding if a break point can really be identified in a data series.

According to all the results described above, it can be stated that no inhomogeneity exists in the Nocera extreme annual rainfall data series, whereas the break point detected in the Petrelle data set by the statistical test can be confirmed. For the Spoleto station the behavior is not as clear as in the previous stations if all the parameters are considered, but the closest similarity between complete and split data series is found in 1980, specially for the width of the multifractal spectrum (w) and the fractal dimensions D₁ and D₂. Thus, the most important inhomogeneity is the one detected by the statistical tests in 1969. For the Terni station, the greatest differences in multifractal properties are found between the complete data series and those obtained considering the inhomogeneity in 1962, especially for the value of w, D₁ and D₂. Therefore, no break point should be considered in 1957.

5. Conclusions

The multifractal character of rainfall data has been widely studied for extreme annual data of different durations and it is useful when dealing with rainfall predictive modeling or IDF estimation. Thus, the possibility of confirming the existence of a break point previously detected by statistical methods, based on the multifractal behavior of rainfall can help to enhance the results of methods and models that use rainfall as a direct or indirect input, especially, those that depend on the length of the rainfall data series.

The results of this work show that the multifractal characterization of extreme rainfall annual data series can be a suitable tool to decide whether or not an inhomogeneity exists in the data set, especially when there is no agreement in the results from statistical tests. Parametric and non-parametric statistical tests commonly applied to detect shifts in time data series were used to check if any break points were present in extreme annual 24-h rainfall data series in the Umbria region (Italy). For some of the stations considered, no inhomogeneities were found with the statistical tests.

For those stations where only one of the tests detected a break point, or where different years for the break points were identified by the different tests, the maintenance of multifractal parameters’ values was used as the criterion to reject the potential break point. Because multifractal analysis results are sensitive to climate shifts if the spectra parameters of time series divided into subsets differ, the values of the heterogeneity fractal dimension D₁, the correlation fractal dimension D₂, and the complexity of the data set given by w provide the bases for the decision criteria. If the complete data series maintained the values of D₁, D₂ or w with respect to any of the split series before or after the break point, it was considered that no inhomogeneity existed. The multifractal behavior of the split series was also found in the complete series. On the contrary, if the values of the multifractal parameters were different among the data series, the break point under consideration was accepted.

According to the results obtained in this work, the multifractal behavior of extreme rainfall data can be considered as an indicator of the changes in the dynamics of the atmospheric processes before and after observed shifts. This is of great importance in models based on extreme rainfall frequency analysis where the existence of a break point in a data series changes the length of it and can influence the estimated values. To make it easier for practitioners and engineers, a software that combine homogeneity tests and multifractal algorithms is being developed.

Author Contributions

Conceptualization, formal analysis, writing original-draft, funding acquisition, supervision—A.P.G.-M.; Funding acquisition, validation—J.E.; resources, data curation, supervision—R.M.; Resources, data curation—C.S.; Methodology, software—J.L.A.-M.; Writing, reviewing and editing—A.F. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Spanish Ministry of Science, Innovation and Universities, grant number AGL2017-87658-R. And the APC was funded by the same funder.

Acknowledgments

A.P.G.-M. acknowledges the collaboration and hosting of the University of Perugia and specifically, of the Hydrology team of the Department of Civil and Environmental Engineering.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

References

Haktanir, T.; Citakoglu, H. Trend, Independence, Stationarity, and Homogeneity Tests on Maximum Rainfall Series of Standard Durations Recorded in Turkey. J. Hydrol. Eng. 2014, 19, P05014009. [Google Scholar] [CrossRef]
Wijngaard, J.B.; Klein Tank, A.M.G.; Können, G.P. Homogeneity of 20th century European daily temperature and precipitation series. Int. J. Climatol. 2003, 23, 679–692. [Google Scholar] [CrossRef]
Hoerling, M.; Kumar, A.; Eischeid, J.; Jha, B. What is causing the variability in global mean land temperature? Geophys. Res. Lett. 2008, 35, L23712. [Google Scholar] [CrossRef]
Martínez, M.D.; Serra, C.; Burgueño, A.; Lana, X. Time trends of daily maximum and minimum temperatures in Catalonia (NE Spain) for the period 1975–2004. Int. J. Climatol. 2010, 30, 267–290. [Google Scholar]
Geng, Q.; Wu, P.; Zhao, X. Spatial and temporal trends in climatic variables in arid areas of northwest China. Int. J. Climatol. 2016, 36, 4118–4129. [Google Scholar] [CrossRef]
Swanson, K.L.; Tsonis, A.A. Has the climate recently shifted? Geophys. Res. Lett. 2009, 36, L06711. [Google Scholar] [CrossRef]
Morozova, A.L.; Valente, M.A. Homogenization of Portuguese long-term temperature data series: Lisbon, Coimbra and Porto. Earth Syst. Sci. Data 2012, 4, 187–213. [Google Scholar] [CrossRef]
Guo, Y. Updating Rainfall IDF Relationships to Maintain Urban Drainage Design Standards. J. Hydrol. Eng. 2006, 11, 506–509. [Google Scholar] [CrossRef]
Hassan, H.W.; Nile, B.K.; Al-Masody, B.A. Study the climate change effect on storm drainage networks by storm water management model [SWMM]. Environ. Eng. Res. 2017, 22, 393–400. [Google Scholar] [CrossRef]
Adamowski, K.; Bougadis, J. Detection of trends in annual extreme rainfall. Hydrol. Process. 2003, 17, 3547–3560. [Google Scholar] [CrossRef]
Fujibe, F.; Yamazaki, N.; Katsuyama, M.; Kobayashi, K. The increasing trend of intense precipitation in Japan based on four-hourly data for a hundred years. Sci. Online Lett. Atmos. SOLA 2005, 1, 41–44. [Google Scholar] [CrossRef]
Wang, B.; Ding, Q.; Jhun, J.G. Trends in Seoul (1778–2004) summer precipitation. Geophys. Res. Lett. 2006, 33, L15803. [Google Scholar] [CrossRef]
Burn, D.H.; Mansour, R.; Zhang, K.; Whitfield, P.H. Trends and variability in extreme rainfall events in British Columbia. Can. Water Resour. J. 2011, 36, 67–82. [Google Scholar] [CrossRef]
Douglas, E.M.; Fairbank, C.A. Is precipitation in northern New England becoming more extreme? Statistical analysis of extreme rainfall in Massachusetts, New Hampshire, and Maine and updated es- timates of the 100-year storm. J. Hydrol. Eng. 2011, 16, 203–217. [Google Scholar] [CrossRef]
Nandargi, S.; Dhar, O.N. Extreme rainfall events over the Himalayas between 1871 and 2007. Hydrol. Sci. J. 2011, 56, 930–945. [Google Scholar] [CrossRef]
Yavuz, H.; Erdogan, S. Spatial analysis of monthly and annual precipitation trends in Turkey. Water Resour. Manag. 2012, 26, 609–621. [Google Scholar] [CrossRef]
Xu, Z.X.; Takeuchi, K.; Ishidaira, H. Monotonic trend and step changes in Japanese precipitation. J. Hydrol. 2003, 279, 144–150. [Google Scholar] [CrossRef]
Shadmani, M.; Marofiand, S.; Roknian, M. Trend analysis in reference evapotranspiration using Mann–Kendall and Spear- man’s Rho tests in Arid Regions of Iran. Water Resour. Manag. 2012, 26, 211–224. [Google Scholar] [CrossRef]
Piccarreta, M.; Lazzari, M.; Pasini, A. Trends in daily temperature extremes over the Basilicata region (southern Italy) from 1951 to 2010 in a Mediterranean climatic context. Int. J. Climatol. 2015, 35, 1964–1975. [Google Scholar] [CrossRef]
Koutsoyiannis, D. Climatic change, the hurst phenomenon, and hydrological statistics. Hydrol. Sci. J. 2003, 48, 3–24. [Google Scholar] [CrossRef]
Koutsoyiannis, D. Nonstationarity versus scaling in hydrology. J. Hydrol. 2006, 324, 239–254. [Google Scholar] [CrossRef]
Haddad, O.B.; Moravej, M. Discussion of “Trend, Independence, Stationarity, and Homogeneity Tests on Maximum Rainfall Series of Standard Durations Recorded in Turkey” by Tefaruk Haktanir and Hatice Citakoglu. J. Hydrol. Eng. 2015, 20, 07015016. [Google Scholar] [CrossRef]
Hamed, K.H. Trend detection in hydrologic data: The Mann–Kendall trend test under the scaling hypothesis. J. Hydrol. 2008, 349, 350–363. [Google Scholar] [CrossRef]
Baranowski, P.; Krzyszczak, J.; Slawinski., C.; Hoffmann, H.; Kozyra, J.; Nierόbca, A.; Siwek, K.; Gluza, A. Multifractal analysis of me- teorological time series to assess climate impacts. Clim. Res. 2015, 65, 39–52. [Google Scholar] [CrossRef]
Krzyszczak, J.; Baranowski, P.; Zubik, M.; Kazandjiev, V.; Georgieva, V.; Sławiński, C.; Siwek, K.; Kozyra, J.; Nierόbca, A. Multifractal characterization and comparison of meteorological time series from two climatic zones. Theor. Appl. Climatol. 2018, 137, 1811–1824. [Google Scholar] [CrossRef]
García-Marín, A.P.; Jiménez-Hornero, F.J.; Ayuso, J.L. Applying multifractality and the self-organized criticality theory to describe the temporal rainfall regimes in Andalusia (southern Spain). Hydrol. Process. 2008, 22, 295–308. [Google Scholar] [CrossRef]
García-Marín, A.P.; Estévez, J.; Jiménez-Hornero, F.J.; Ayuso-Muñoz, J.L. Multifractal analysis of validated wind speed time series. Chaos 2013, 23, 13133–21505. [Google Scholar] [CrossRef]
Jiménez-Hornero, F.J.; Jiménez-Hornero, J.E.; de Ravé, E.G.; Pavon-Dominguez, P. Exploring the relationship between nitrogen dioxide and ground-level ozone by applying the joint multifractal analysis. Environ. Monit. Assess. 2010, 167, 675–684. [Google Scholar] [CrossRef]
Morbidelli, R.; Saltalippi, C.; Flammini, A.; Corradini, C.; Wilkinson, S.M.; Fowler, H.J. Influence of temporal data aggregation on trend estimation for intense rainfall. Adv. Water Resour. 2018, 122, 304–316. [Google Scholar] [CrossRef]
Buishand, T.A. Some methods for testing the homogeneity of rainfall records. J. Hydrol. 1982, 58, 11–27. [Google Scholar] [CrossRef]
Alexandersson, H. A homogeneity test applied to precipitation data. Int. J. Climatol. 1986, 6, 661–675. [Google Scholar] [CrossRef]
Khaliq, M.N.; Ouarda, T.B.M.J. On the critical values of the standard normal homogeneity test (SNHT). Int. J. Climatol. 2007, 27, 681–687. [Google Scholar] [CrossRef]
Pettitt, A.N. A non-parametric approach to the change-point problem. Appl. Stat. 1979, 28, 126–135. [Google Scholar] [CrossRef]
Brunetti, M.; Maugeri, M.; Nanni, T. Variations of temperature and precipitation in Italy from 1866 to 1995. Theor. Appl. Climatol. 2000, 65, 165–174. [Google Scholar] [CrossRef]
Partal, T.; Kahya, E. Trend Analysis in Turish Precipitation Data. Hydrol. Process. 2006, 20, 2011–2026. [Google Scholar] [CrossRef]
Karpouzos, D.K.; Kavalieratou, S.; Babajimopoulos, C. Trend Analysis of Precipitation Data in Pieria Region (Greece). Eur. Water 2010, 30, 31–40. [Google Scholar]
McGilchrist, C.C.; Woodyer, K.D. Note on a Distribution-Free CUSUM Technique. Technometrics 1975, 17, 321–325. [Google Scholar] [CrossRef]
Wang, X.; Yang, X.; Liu, T.; Li, F.; Gao, R.; Duan, L.; Luo, Y. Trend and extreme occurrence of precipitation in a midlatitude Eurasian steppe watershed at various time scales. Hydrol. Process. 2014, 28, 5547–5560. [Google Scholar] [CrossRef]
Casas-Castillo, M.C.; Rodríguez-Solá, R.; Navarro, X.; Russo, B.; Lastra, A.; González, P.; Redaño, A. On the consideration of scaling properties of extreme rainfall in Madrid (Spain) for developing a generalized intensity-duration-frequency equation and assessing probable maximum precipitation estimates. Theor. Appl. Climatol. 2018, 131, 573–580. [Google Scholar] [CrossRef]
García-Marín, A.P.; Morbidelli, R.; Saltalippi, C.; Cifrodelli, M.; Estévez, J.; Flammini, A. On the choice of the optimal frequency analysis of annual extreme rainfall by multifractal approach. J. Hydrol. 2019, 575, 1267–1279. [Google Scholar] [CrossRef]
Carmona-Cabezas, R.; Ariza-Villaverde, A.B.; Gutiérrez de Ravé, E.; Jiménez-Hornero, F.J. Visibility graphs of ground-level ozone time series: A multifractal analysis. Sci. Total Environ. 2019, 661, 138–147. [Google Scholar] [CrossRef] [PubMed]
Feder, J. Fractals; Plenum: New York, NY, USA, 1988. [Google Scholar]
Davis, A.; Marshak, A.; Wiscombe, W.; Cahalan, R. Multifractal characterization of non stationarity and intermittency in geophysical fields: Observed, retrieved or simulated. J. Geophys. Resour. 1994, 99, 8055–8072. [Google Scholar] [CrossRef]
Herrera-Grimaldi, P.; García-Marín, A.P.; Estévez, J. Multifractal analysis of diurnal temperature range over Southern Spain using validated datasets. Chaos 2019, 29, 063105. [Google Scholar] [CrossRef] [PubMed]
Ding, M.; Grebogi, C.; Ott, E.; Sauer, T.; Yorke, J.A. Estimating correlation dimension from a chaotic time series: When does plateau onset occur? Physica D 1993, 69, 404–424. [Google Scholar] [CrossRef]

Figure 1. Rain gauges in the Umbria region used in this work.

Figure 2. Results obtained from the sequential Mann–Kendall test in the Spoleto and Terni stations.

Figure 3. Step changes detected by the Mann–Whitney U test at the Nocera Umbra, Spoleto, Petrelle and Terni stations.

Figure 4. Results obtained at the Bastia Umbra and Terni stations after applying the CUSUM test.

Figure 5. Multifractal dimensions functions D_q for the complete data series and also for the rainfall series before and after the break points detected in the Nocera Umbra, Petrelle, Spoleto and Terni stations.

Figure 6. Multifractal spectra f(α) for the complete rainfall data series and also for the rainfall data series before and after the break points detected in the Nocera Umbra, Petrelle, Spoleto and Terni stations.

Table 1. Stations selected for the work and the period of data considered.

Station.	Period (Years) Considered
Bastia Umbra	from 1966 to 2017
Bevagna	from 1967 to 2017
Nocera Umbra	from 1961 to 2017
Petrelle	from 1951 to 2017
Ponte Nuovo di Torgiano	from 1954 to 2017
Spoleto	from 1949 to 2017
Terni	from 1952 to 2017
Todi	from 1948 to 2017

Table 2. Results obtained for the parametric Buishand range (BR) and Standard Normal Homogeneity (SNH) tests; the statistics Q and T₀ and their critical values Q_c and T_0c are also shown.

Station	BRT		SNHT
Station	Q	Q_c	T₀	T_0c
Bastia Umbra	0.862	1.271	6.689	8.480
Bevagna	0.746	1.270	2.626	8.456
Nocera	0.650	1.273	3.094	8.586
Petrelle	0.978	1.277	3.970	8.768
Ponte	0.615	1.276	2.182	8.717
Spoleto	1.295	1.277	7.546	8.784
Terni	1.088	1.276	7.505	8.735
Todi	0.739	1.278	3.794	8.814

Table 3. Results obtained after applying the Pettitt test.

Station	Pettitt Test
Station	K	K_0.05
Bastia Umbra	213.0	267.5
Bevagna	170.0	259.9
Nocera	194.0	306.7
Petrelle	322.0	390.4
Ponte	152.0	356.1
Spoleto	435.0	399.1
Terni	276.0	373.1
Todi	237.0	461.8

Table 4. Summary of the years in which break points were detected by the applied tests.

Station	Parametric Tests		Non-Parametric Tests
Station	BR	SNH	PT	SQMK	MWU	CUSUM
Bastia Umbra	-	-	-	-	-	-
Bevagna	-	-	-	-	-	-
Nocera Umbra	-	-	-	-	1968	-
Petrelle	-	-	-	-	1994	-
Ponte	-	-	-	-	-	-
Spoleto	1977	-	1969	1980	1969	-
Terni	-	-	-	1957	1962	-
Todi	-	-	-	-	-	-

Table 5. Main results obtained for the Rénji spectrum of the extreme rainfall data series for which break points were detected by the statistical tests.

Station		Complete Data Series				Before Break Point Series				After Break Point Series
Station		D₁	D₂	D₀-D₁	D₀-D₂	D₁	D₂	D₀-D₁	D₀-D₂	D₁	D₂	D₀-D₁	D₀-D₂
Nocera Umbra	1968	0.980	0.958	0.020	0.042	0.977	0.953	0.023	0.047	0.984	0.967	0.016	0.033
Petrelle	1994	0.988	0.975	0.012	0.025	0.985	0.971	0.015	0.029	0.988	0.977	0.012	0.023
Terni	1957	0.980	0.960	0.020	0.040	0.980	0.959	0.020	0.041	0.979	0.955	0.021	0.045
Terni	1962	0.980	0.960	0.020	0.040	0.965	0.930	0.035	0.070	0.986	0.972	0.014	0.028
Spoleto	1969	0.991	0.981	0.009	0.019	0.989	0.977	0.011	0.023	0.989	0.978	0.011	0.022
Spoleto	1980	0.991	0.981	0.009	0.019	0.989	0.977	0.011	0.023	0.990	0.981	0.010	0.019

Table 6. Main results obtained for the multifractal spectrum of the extreme rainfall data series for which break points were detected by the statistical tests.

Station		Complete Data Series				Before Break Point Series				After Break Point Series
Station		α_min	α_max	a_s	w	α_min	α_max	a_s	w	α_min	α_max	a_s	w
Nocera Umbra	1968	0.74731	1.16829	−0.43561	0.42098	0.74697	1.14838	−0.56416	0.40142	0.81372	1.14083	−0.29407	0.32712
Petrelle	1994	0.85847	1.18429	0.09650	0.32581	0.82446	1.18982	−0.08529	0.36536	0.88417	1.11864	−0.09868	0.23448
Terni	1957	0.80952	1.15864	−0.17081	0.34913	0.79642	1.17564	−0.30561	0.37922	0.77523	1.15035	−0.30485	0.37513
Terni	1962	0.80952	1.15864	−0.17081	0.34913	0.71493	1.26671	−0.16686	0.55178	0.79981	1.13520	−0.38638	0.33539
Spoleto	1969	0.87427	1.11828	−0.13999	0.24401	0.84608	1.16869	0.02276	0.32261	0.84776	1.14035	−0.17251	0.29258
Spoleto	1980	0.87427	1.11828	−0.13999	0.24401	0.84608	1.16869	0.02276	0.32261	0.88881	1.13704	−0.00022	0.24823

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Article Metrics

Citations

Article Access Statistics

Journal Statistics

Multiple requests from the same IP address are counted as one view.