Next Article in Journal
Acknowledgement to Reviewers of Hydrology in 2017
Previous Article in Journal
Effect of Observation Errors on the Timing of the Most Informative Isotope Samples for Event-Based Model Calibration
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Generation of Spatially Heterogeneous Flood Events in an Alpine Region—Adaptation and Application of a Multivariate Modelling Procedure

by
Klaus Schneeberger
1,* and
Thomas Steinberger
2
1
alpS—Centre for Climate Change Adaptation, Grabenweg 68, 6020 Innsbruck, Austria
2
PPE Research Centre, Hochschulstrasse 1, 6850 Dornbirn, Austria
*
Author to whom correspondence should be addressed.
Hydrology 2018, 5(1), 5; https://doi.org/10.3390/hydrology5010005
Submission received: 15 November 2017 / Revised: 21 December 2017 / Accepted: 21 December 2017 / Published: 4 January 2018

Abstract

:
Flooding often has a negative impact on society. In particular, widespread flood events can cause a lot of damage. These events are often spatially and temporally heterogeneous and should be duly considered for an appropriate analysis of flooding. Therefore, a conditional multivariate approach is adapted and applied in order to (i) contribute to a better understanding of the spatial characteristics of fluvial floods and (ii) to deliver sets of synthetically generated flood events. The present paper focuses on a simulation procedure consisting of careful data preparation and selection and the application of a conditional multivariate approach. The conditional approach is adapted to account for the seasonality of runoff data. Model checks attuned to the model are presented to ensure the consistence of simulated and observed data. The Austrian Province Vorarlberg was chosen as the study area. A thorough data analysis of runoff time series showed that the hydrological behaviour is characterized by a strong seasonality that was considered within the applied modelling procedure. The analysis of the spatial dependence of high river flows identified regions where floods likely occur simultaneously and regions with low spatial dependence. The main result of the modelling procedure, a large set of widespread flood events, was successfully generated.

1. Introduction

Fluvial floods are a natural phenomenon that regularly cause significant losses to property and human life. Through efficient flood risk management, the adverse consequences of flooding can be mitigated or even avoided. Therefore, a thorough flood risk analysis provides the basis for decision making of involved parties, such as flood management agencies, public authorities, and insurance industries.
In this context, flood risk is usually understood as a combination of the probability of events and the potential adverse consequences [1,2,3]. A traditional procedure to derive the probability of flooding is to apply extreme value statistics to observed runoff series at river gauging stations. By means of this flood frequency analysis, the return period (T) of flood events in a year is determined for a single point i.e., gauging station) and being assigned to an entire catchment. In the course of flood risk analyses, these homogenous flood scenarios, assuming constant return periods of discharges within a river reach or an entire region, are frequently used.
As flood events in most cases are heterogeneous in space and time [4,5], the use of homogenous flood scenarios may lead to inaccurate results [6,7]. Therefore, heterogeneous flood scenarios, which take into account the spatial variability of flood events, should be considered in flood risk analysis [6,8,9]. As the number of past observed flood events leading to flood damages is usually low, the corresponding number of heterogeneous flood scenarios is limited and a statistical evaluation of the flood risk is difficult to achieve.
A promising approach to overcome this shortcoming is the simulation of synthetic flood events considering a vast range of possible flooding situations. The flood characteristics within a region of interest is typically represented by several gauging stations, and therefore a multivariate method, which is also capable of reproducing the spatial dependence structure in the investigated region, has to be applied.
In the field of hydrology, a popular approach to combine different marginal distributions and dependence structures is the copula approach. Although, in recent years, copula-based models have been applied in hydrology and related fields, such as multivariate frequency analysis, risk assessment, and multivariate extreme value analysis [8,10,11,12,13,14,15,16,17,18,19], most applications are limited to bivariate and trivariate cases. Jongman et al. [8] used a stepwise conditional copula approach to account for a higher dimensional case in flood risk analysis. Apart from this study, the semi-parametric conditional exceedance model introduced by Heffernan and Tawn [20] (henceforth referred to as the HT model) is able to capture the multivariate nature of widespread flooding and can be used to simulate flood events.
The HT model can be interpreted as a multisite peak-over-threshold approach [21] and is able to model the joint probability of large sets of variables (e.g., river flow or sea-level data). It is an appropriate model to describe the probability that one or multiple variables are extreme. Because of these properties, the HT model is well-suited (i) to show the dependence structure of runoff in a region and (ii) to describe and reproduce the joint occurrence of high river flows at different sites. The HT model, thus, delivers high and extreme runoff peaks at one or multiple sites representing heterogeneous flood situations. A key feature of the HT model is its flexibility in modeling different dependence structures of extreme events. It can be used for both asymptotically dependent and asymptotically independent data. None of these two dependence structures should be excluded by the applied model (as it is done e.g., by a copula model) because it is likely that, for example, two gauges along the same river are asymptotically dependent, whereas it is possible that not-nested sites are asymptotically independent. Furthermore, the HT model is able to describe the changing dependence structure when runoffs become extreme.
The general idea and methodology of this conditional model was introduced by Heffernan and Tawn in 2004. Since that time, the HT model has been applied successfully particularly in hydrology and related fields; for example, in order to analyse the spatial dependence of runoff and precipitation in Great Britain (GB) [22], to simulate various ocean variables for extreme conditions [23,24,25], and in the context of flood risk analysis [21,26,27,28,29,30,31]. These flood risk studies show that the HT model is an appropriate approach to generate widespread flood events and that it is well-suited to consider different physical sources of flooding such as high river flows and sea levels. To the authors’ knowledge, except for one study [27], all previous flood risk studies in which the HT model was applied focused on flooding in GB. The HT model makes the assumption that flooding in different seasons of the year have the same spatial dependence [22]. The seasonal features of the processes were not explicitly considered in the above-mentioned flood related studies, but are taken into account in the present work.
This study introduces a modelling procedure that comprises suitable methods for data review and preprocessing, the application of the HT model with properly preprocessed and deseasonalized input data, and model checks attuned to the conditional model. This approach is therefore suitable to be applied in any setting in which the spatial dependence between seasons differs. The modelling procedure was applied and tested in a mountainous study area in which the seasonal characteristics of runoff in summer are different from the characteristics in winter. The main results were synthetically generated flood events on point scale, or, in other words, a data set of simulated runoff at the spatial extent of river gauges. A comparison of observed and simulated stream flow data shows the plausibility of the simulation.
The Austrian Federal Province of Vorarlberg has been chosen as the study area. In the past decades, Vorarlberg has been affected by serious flood events (e.g., May 1999, August 2002, August 2005 and June 2013) where each event led to severe economic losses [32]. Thus, a probabilistic flood risk analysis model was developed to analyse the risk of flooding. The model consists of three major modules: (i) a hazard module where a data set of synthetic flood scenarios is generated which serves as surrogate to determine the probability of widespread flooding; (ii) an impact module with results being loss-probability-relations on a community level representing the potential flood damages; and (iii) a risk assessment module in which the probability of flood risk in the study area is evaluated statistically [33]. The present paper focuses on the generation of spatially heterogeneous flood events.
The present study is organized as follows. Section 2 consists of a short description of the study area and the data used in this study. The modelling procedure begins with Section 3, where the focus lies on the data selection and preprocessing, which is inevitable to account for the seasonality of runoff. As the second part of the modelling procedure, Section 4 provides a review of the HT model. In Section 5, results about the seasonal characteristics of runoff, the spatial dependence measure, the application of the HT model including model checks, and the simulation results are presented. Finally, in Section 6, conclusions are drawn and future steps are discussed.

2. Study Area and Data Used

The Austrian Federal Province of Vorarlberg has been chosen as the study area (Figure 1). Vorarlberg is located in the eastern Alps and is characterized by its complex mountainous topography where elevation ranges from 395 m (Lake Constance) to 3312 m above sea level (Piz Buin). The region under investigation covers 2601 km2, with 91% of the area belonging to the river Rhine basin and 9% draining into the river Danube.
Vorarlberg is located north of the Alpine divide and experiences a relatively high amount of mean annual precipitation; for example, the region Bregenzerwald (mostly drained by the river Bregenzerach) receives approximately 2000 mm year−1 [34]. A high percentage of the precipitation in winter falls as snow. Therefore, the seasonality of river flow is characterized by low runoff in winter and high runoff in spring and summer, especially in the high mountain regions in the southern part of the study area [35].
Runoff time series from river gauging stations, which are a direct measure for river flooding, were used as the primary data source for driving the HT model. For the present study, 17 representative gauges were chosen where time series of daily maximum runoff from 1 January 1976 to 31 December 2013 with no missing data were available (Table 1).
A preliminary data analysis helps to understand the hydrological characteristics of the investigated region. Firstly, daily maximum and mean runoff data were compared. The advantage of using daily maximum runoff data is that information about peak discharge induced by short rainfall events (e.g., synoptic weather patterns) is included, which can be relevant in small catchments. A comparison of annual maximum series ( A M S ) derived from daily mean data ( A M S d m ) and from daily maximum data ( A M S d x ) for coinciding time period shows that the use of daily mean data would lead to an underestimation of the flood situation in Vorarlberg. On average, the medians of A M S d x are 87% higher than the medians of A M S d m . The use of daily maximum data is inevitable in the presented study, although, in most previous flooding related studies [21,26,28], daily mean data were used to run the HT model.
Secondly, the flood process typology was reviewed that also gives some indication of the length of flood events. Merz and Blöschl [36] analysed runoff conditions in Austria and classified annual maximum peaks (time series 1971–1997) in categories such as long-rain floods, short-rain floods, flash floods, rain-on-snow floods, and snowmelt floods. According to this classification, the flood events in the 17 watersheds considered in this study were triggered mostly by long-rainfall events (45%), by short-rainfall events (32%), rain-on-snow events (14%), and others (9%) [36].

3. Data Review and Preparation

Severe flood events usually have a low probability of occurrence and thus their investigation is based on a small amount of data. Hawkes et al. [37] emphasized that the data selection and preparation is probably the most important component when analysing extremes. This section introduces and reviews some appropriate approaches and aspects of data preparation and selection attuned to the HT model.

3.1. Event Definition

A key aspect of this study is to analyse widespread flooding i.e., flood events where multiple stations are affected). A flood event is defined as the maximum runoff of each site that occurs in a certain time interval of length τ simultaneously—in other words, the maximum value within regularly spaced blocks. This block maxima are used for further analyses. The choice of length of the time interval τ rests on the following statistical and hydrological considerations and finally relies on expert judgment.
  • An analysis of runoff time series, where the consecutive days exceeding a specific threshold (defined by the pth quantile q p ) are counted, provides a first estimation of τ . Denote the daily maximum runoff at gauge i [ 1 , , n ] and time t by D i , t , and then the probability of peaks of length L is defined as
    R ( p , L ) = 1 n i = 1 n P r ( D i , ( t , , t + L ) > q p | D i , t > q p D i , t 1 q p )
  • The travel- or concentration time provides additional information about the hydrological response of a watershed and therefore contribute to a thorough event definition. In this context, the concentration time T c (in hours) is a possible measure to indicate the time interval in which a receptor (e.g., flood-prone area) can be affected by simultaneous flooding from two or multiple tributaries. Grimaldi et al. [38] summarized and presented selected formulas for estimating the concentration time, such as, the formulas of (a) ‘Department of Public Works’, (b) ‘Giandotti’, (c) ‘Kirpich’, and (d) ‘Viparelli’, where only few input parameters are necessary. Full details concerning these formulas of the concentration time and their specified restrictions (e.g., regarding the catchment size) can be found in Grimaldi et al. [38]. Although the empirical formulas to estimate the concentration time are associated with large uncertainties [38], they still provide a valuable instrument to calculate the travel time in investigated watersheds.
  • The event definition (i.e., selection of τ ) may also depend on the intended application of the model, such as reconstruction of flood situations in the past lasting a certain number of days or application in a distinct (re)insurance context [21].

3.2. Event Categorization

Besides the definition of flood events, the categorization of their severity can be conducted as follows: a severe flood event (e.g., that causes damage) is defined as the simultaneous occurrence of high runoff at different sites where at least one site exceeds a certain threshold. The severity of events can be determined using the proxy called unit of flood hazard (UoFH) [33,39]. The UoFH is defined as the number of sites that experience runoff that equals or exceeds a certain threshold i.e., a runoff that corresponds to aspecific T). In this study, a return period T = 30 -year was selected as a threshold to define the UoFH.

3.3. Seasonality of Runoff

The HT model relies on the assumption that fluvial flooding has the same spatial characteristics in winter months and in summer months [22]. Thus, the knowledge about the runoff seasonality may help to analyse if this assumption is fulfilled. Directional statistics [40] are a particularly useful approach to anaylse the seasonality of high flows. Therefore, the occurrence dates of flood peaks (exceeding a certain threshold) are translated into a location on a unit circle, where the start of the year is plotted on the easternmost point and further months are plotted in a counterclockwise sense. Generally, the starting point depends on the hydrological characteristics of the study area i.e., splitting of high flow month has to be avoided). The monthly frequency of flood peaks (above a threshold) can be represented by rose diagrams, which helps to identify high and low flow months [41]. A second measure for analysing the seasonality is the ‘Burn vector’ [42,43], which consists of the pair θ ¯ and r ¯ . θ i ¯ represents the mean occurrence date of flood peaks of a specific site i and r i ¯ indicates the variability of the occurrence date (details can be found in [42]).

3.4. Interpretation of Extremes

When analysing several sites with different properties (e.g., catchment size, mean annual runoff), the use of the probability scale is convenient. In the field of hydrology, the interpretation of probabilities is usually done using return periods of events instead of quantiles of the variables. For a given quantile p, the return period (T) is defined as
T = 1 n p y ( 1 p )
where n p y is the average number of events per year. It is calculated as n p y = 365 r s τ , where r s is the ratio that is required in order to take the type of the data set into account (for an all-year ( A Y ) series r s = 1 , a half-year ( H Y ) series r s = 0.5 , and a seasonal series r s = 0.25 ) and τ is the length (in days) of the time interval, which is used for event definition (cf. Section 3.1).

4. Heffernan and Tawn Model

This section reviews the measures to characterize the spatial dependence of river runoff, the statistical model (HT model) used to generate simulated events, and the estimation and simulation strategy including some details about the simulation procedure for the nonparametric part of the HT model.
For a set of gauges I, let Q i be the variable representing maximum flow within a time interval τ at gauge i I (henceforth real scale). Denote the number of gauges by n and an observation at time t by Q i , t . For either a set of observations or a random variable X, denote its pth quantile by q p ( X ) , meaning that q p ( X ) has a probability p of not being exceeded.

4.1. Spatial Dependence Measures

The first measure of spatial dependence can be interpreted as an exploratory summary measure of bivariate dependence and is defined as [22]
P i , j ( p ) = P r Q i > q p ( Q i ) | Q j > q p ( Q j )
so P i , j ( p ) is the probability that gauge i exceeds q p ( Q i ) given gauge j is extreme i.e., exceeds q p ( Q j ) ). The second dependence measure describes the proportion of gauges that are extreme given that gauge j is extreme and is defined as
N j ( p ) = i j P r Q i > q p ( Q i ) | Q j > q p ( Q j ) n 1
An efficient method for analysing the spatial dependence has to be able to qualitatively describe the behaviour of measured data for moderate and high p-values. The calculation of spatial dependence measures P i , j ( p ) and N j ( p ) is based on approx. Forty-five and five values in the present study for moderate i.e., p = 0.99 ) and high i.e., p = 0.999 ) p-values, respectively. The information about P i , j ( p ) and N j ( p ) over a range of p helps to characterize the dependence structure of different sites for various levels of extremeness.

4.2. Statistical Model

The statistical model is built up of a model for the marginal distribution for the runoff series Q i and a conditional dependence model for the joint distribution of these transformed variables given one of them is extreme. The model for the marginal distribution is used to transform all data to the same scale and ensures a relatively simple semiparametric form of the conditional dependence model.

4.2.1. Marginal Model

For the standardisation of the original data, we used a transfomation to Laplace marginal distribution [44] instead of a Gumbel marginal distribution as it is recommended in the primary HT model publication [20].
We use F i to denote the marginal cumulative distribution function (CDF) of the runoff series Q i of gauge i. Then, the transformation is defined by
X i = log ( 2 F i ( Q i ) ) for   Q i F 1 ( 1 / 2 ) log ( 2 ( 1 F i ( Q i ) ) ) for   Q i > F 1 ( 1 / 2 )
The transformed random variable X i (henceforth transformed scale) has a Laplace distribution with both the parameters of location and scale equal to 1. Before applying this marginal model, an inference on the distribution of Q i has to be done (see Section 4.3.1).

4.2.2. Dependence Model

In order to present the key idea of the conditional dependence model [20], we split the vector of transformed river runoff series ( X i ) i I to a one-dimensional conditioning component X = X j and the vector of dependent components Y = ( X i ) i I { j } .
In the following, vector algebra is to be interpreted as componentwise and (in)equalities involving vectors should hold for all components. The HT model relies on the assumption that there are normalising functions a : R + R n 1 and b : R + R + n 1 such that
P r X u > x , Y a ( X ) b ( X ) > z | X > u exp ( x ) G ( z ) for   u
for some non-degenerate distribution function G (with no mass at + ).
On the one hand, the assumption of the existence of normalising functions is quite common in extreme value modelling, for example when fitting a Generalized Pareto distribution to one- dimensional data. On the other hand, this assumption means that, conditional on X > u , the random variables X u and Z = Y a ( X ) b ( X ) converge to independent random variables for u .
For a wide range of copula dependence models, the normalising functions a and b belong to a simple class of functions. When using the transformation to Laplace marginal distribution, this class can be represented as [44]
a ( x ) = α x   and   b ( x ) = x β
for α [ 1 , 1 ] n 1 and β ( , 1 ) n 1 .
When applying the HT model, the asymptotic assumptions are supposed to hold exactly for X > q p s i m , for a high threshold q p s i m (which is defined as the empirical quantile of the data for some high non-exceedance probability p s i m ). Then, the dependence of the components in Y on the conditioning variable X is given by the parametric model
Y = α X + X β Z
where X and Z are independent. This represents n 1 univariate regression type models Y i = α i X + X β i Z i , where the dependence of Y i on X is modelled parametrically, but the dependence structure of the random vector Y is given non-parametrically by the distribution G of Z .

4.3. Estimation and Simulation

The aim of the simulation is to generate a set of synthetic events S such that at least one of the river flows is extreme—in other words, has an exceedance probability higher than a given p s i m . Several steps are involved in fitting the HT model to the data and subsequently to use the fitted model to generate simulated extreme events. These steps are described in some detail below and examples i.e., of sites Kennelbach ( j = 2 ) and Thal ( i = 5 )) are outlined in Figure 2a–d.

4.3.1. Estimation of Marginal Distribution

As suggested in [20], estimation of the marginal distribution F ^ i of the runoff variable Q i for 1 i n is done by using the empirical distribution E C D F i of the data below some threshold q p G P , where q p G P is the empirical quantile of the data for some high non-exceedance probability p G P . The Generalized Pareto distribution is fitted to the data above the threshold q p G P using a maximum likelihood method. The CDF of this distribution is denoted by G P σ ^ i , ξ ^ i . Here, σ i and ξ i are the scale and shape parameter of the Generalized Pareto distribution [45]. Hence,
F ^ i ( q ) = 1 ( 1 p G P ) ( 1 G P σ ^ i , ξ ^ i ( q ) ) for   q > q p G P E C D F i ( q ) for   q q p G P

4.3.2. Marginal Transformation

Transformation to Laplace margins is done using the estimated CDF F ^ i instead of F i for the marginal transformation using Equation (5). When simulation results X i are available on a transformed scale, and backwards transformation to real scale is done by
Q i = F ^ i 1 ( exp ( X i ) / 2 ) for   X i < 0 F ^ i 1 ( 1 exp ( X i ) / 2 ) for   X i 0
where F ^ i 1 ( p ) is the inverse of the empirical distribution for p p G P and the inverse of the Generalized Pareto distribution G P σ ^ i , ξ ^ i evaluated at ( p p G P ) / ( 1 p G P ) for p > p G P .

4.3.3. Parameter Estimation

To estimate the model parameters α i and β i in Y i = α i X + X β i Z i , we use a quasi maximum likelihood approach [20]. Let μ i be the mean of Z i and σ i its standard deviation, and then E ( Y i ) = α i X + X β i μ i and V a r ( Y i ) = ( X β i σ i ) 2 holds. For a pair of transformed observations ( x t , y i , t ) t = 1 , , N t , where x t is extreme ( x t > q p s i m for all t), we set
L ( α i , β i , μ i , σ i ) = t = 1 N t log ( x t β i σ i ) + 1 2 y i , t ( α i x t + x t β i μ i ) x t β i σ i 2
and estimates of the parameters are given by
( α ^ i , β ^ i , μ ^ i , σ ^ i ) = arg min ( α i , β i , μ i , σ i ) L ( α i , β i , μ i , σ i )
The distribution of Z i is not estimated parametrically. Instead, simulation will be based directly on the estimates
Z ^ i , t = y i , t α ^ i x t x t β ^ i
This estimation procedure is performed for all possible pairs of data ( x t , y i , t ) , where x t is extreme.

4.3.4. Simulation of the Conditioning River Flow X

The HT model provides a method to describe the joint conditional distribution of all runoff variables conditional on one being extreme. Before the simulation of the conditional is done, the data to condition on has to be provided, as follows (similar to [28]):
  • Number of simulated extreme events per simulation period: The number of extreme events i.e., where at least one site exceeds q p s i m ) per simulation period (e.g., all-year-, half-year series) is approximated by a negative binominal distribution.
  • Selection of conditioning gauge: The set of extreme events S is partitioned to S = i = 1 n S i , where S i is the set of extreme events where gauge i has the highest non-exceedance probability i.e., is the most extreme one). Then, for an event E S , the probability P r ( E S i | E S ) that gauge i is the most extreme if at least one gauge is extreme has to be estimated using given data. The gauge that is selected as conditioning variable X is drawn from a multinomial distribution with these probabilities. Denote the index of the selected gauge by i .
  • Drawing the conditioning x i : According to the transformation to standard Laplace margins and since p s i m is close to one and hence q p s i m > 0 set x i = q p s i m + e , where e is drawn from an exponential distribution with mean one.

4.3.5. Simulation of Z

Since no parametric model for the distribution of Z is known, we apply the data based random vector generation method of Taylor and Thompson [46] to the set of vector valued estimates { Z ^ t } t = 1 , , N t = { ( Z ^ i , t ) i I { i } } t = 1 , , N t . This method uses m nearest neighbours of a randomly chosen element in { Z ^ t } t = 1 , , N t , where m is a smoothing parameter. A simulated vector Z is generated by the following steps:
  • Randomly choose one Z t 0 from { Z ^ t } t = 1 , , N t .
  • Find the m nearest neighbours of Z t 0 denoted by Z t 1 , , Z t m and compute the vector of componentwise means Z ¯ of Z t 0 , , Z t m .
  • Draw a random sample w 0 , , w m from a uniform distribution with lower bound 1 m + 1 3 m ( m + 1 ) 2 and upper bound 1 m + 1 + 3 m ( m + 1 ) 2 .
  • Deliver Z = Z ¯ + k = 0 m w k ( Z t k Z ¯ ) .
The smoothing parameter m is chosen and the limits of the uniform distribution from which the weights w i are drawn from are calculated such that the variance and covariance structure of the data { Z ^ t } t = 1 , , N t is unchanged, at least approximately.

4.3.6. Simulation of the Dependent Event

When the conditioning value x i and a sample Z of { Z ^ t } t = 1 , , N t are drawn, it is straightforward to compute a sample of the dependent gauges Y by
Y = α ^ x + x β ^ Z
If all components of Y are smaller than x , a valid synthetic event is generated. Otherwise, the simulation of Z and computing Y has to be repeated.

4.4. HT Model and Seasonal Characteristics of Runoff

The HT model relies on the assumption that river flooding in the winter half-year has the same spatial characteristics as flooding in the summer half-year (cf. Section 3.3 and [22]). The spatial dependence measure N j ( p ) is an appropriate tool to investigate whether the spatial characteristics of different series (all-year ( A Y ), May to October ( M 2 O ), and November to April ( N 2 A ) series) differs significantly. If the assumption does not hold, then the data has to be split into periods that reveal the same spatial characteristics and the HT model has to be applied to both data sets independently.

5. Results and Discussion

Runoff data from selected river gauging stations of the study area Vorarlberg were used (i) for event definition and seasonality analysis, (ii) to conduct a spatial dependence analysis of river flows, (iii) to apply the present multivariate conditional model, and thereby (iv) to generate synthetic (extreme) flood events. The results are based on daily maximum runoff data from 17 representative river gauges in the study area.
As a preliminary remark, note that the number of years n y simulated with the HT model can be arbitrarily high. This property is essential for the generation of synthetic flood events and for the subsequent evaluation of flood risk. Nevertheless, in this paper, a relatively small simulated data set i.e., n y = 38 year) was used to make the comparison with the observed data (38-year series) reasonable.

5.1. Event Definition and Seasonality of Runoff

Daily maximum runoff data ( D i ) were analysed using the methods introduced in Section 3.1 in order to identify the most appropriate τ . Firstly, the results of the measure R ( p , L ) are summarized in Table 2. For all investigated quantile values p, the probability that an event lasts longer than three days is less than 5%. Secondly, the concentration time (cf. Section 3.1 (ii)) of the selected catchments is displayed in Table 3. Apart from the Rhine catchment at Lustenau (E) (ca. 6000 km2), all sites react relatively quickly ( T c less than half a day). Thirdly, the flood process typology of annual peaks [36] (cf. Section 2) reveals different relevant flood controls, which hints that the time interval with length τ of three days is a good compromise. Fourthly, (cf. Section 3.1 (iii)), it should be noted that the presented work is embedded in a flood risk project with a (re)insurance background. In this industry, a time span of 72 h (‘hours clause’) is often used for event definition [47,48]. Based on the three criteria, a time interval with length τ = 3 days was chosen. This time interval guarantees in most cases that consecutive independent flood peaks induced by short rainfall events were considered. On the other hand, τ = 3 days is long enough to ensure that flood peaks in neighboring catchments induced by a single weather pattern will be interpreted as one event. Besides the aspects mentioned above, the idea of the spatial dependence measure P i , j ( p ) can be applied to investigate the temporal dependence in individual series. Therefore, Equation (3) has to be adapted to account for daily maximum data D i , t . The dependence measure is then given by P j , j ( D , L ) ( p ) = P r ( D j , t + L > q p ( D i ) | D j , t > q p ( D j ) ) [22]. For analysing the temporal dependence of all individual series, P ( D , L ) ( p ) = 1 n j = 1 n P j , j ( D , L ) ( p ) was calculated for different p-values. Thus, P ( D , L ) ( p ) shows averaged values of the dependence measure over all sites. For a lag of L = 2 days P ( D , L ) ( 0.99 ) = 0.11 , P ( D , L ) ( 0.997 ) = 0.03 , and P ( D , L ) ( 0.999 ) = 0.01 . For high thresholds (corresponding to p = 0.997 or higher), the temporal dependence is less than 5%. Only for low thresholds (corresponding to p = 0.99 ) is the temporal independence is not fulfilled; however, this small level of extremeness is not relevant when applying the HT model. Generally, a time interval with a length τ = 3 days ensures temporal independence of events for high thresholds (corresponding to pth quantile). As regularly spaced blocks (i.e., block maxima) were used in this step, it might happen that a block limit falls within a runoff peak at an individual site that can cause the same event to be selected twice. However, as extreme peaks that are relevant in terms of model estimation (above u) are very short (cf. Table 2), a splitting of high peak is extremely rare and thus these cases are negligible.
So far, daily maximum runoff data ( D i ) were used for data analyses and event definition. The following results are based on maximum runoff in a time interval τ = 3 days i.e., Q i values). Note a quantile value p = 0.997 applied to daily data ( D i ) equals T 0.82-year and therefore roughly corresponds to a quantile value p = 0.99 applied to Q i values ( T 0.91-year).
The runoff in the investigated catchment is characterised by strong seasonality, meaning that high flow conditions mainly occur in summer. The mean occurrence date (represented by θ i ¯ ) of A M S of all sites is from July to mid of August (Table 3). Following Merz and Blöschl [36], 94% of the catchments experience medium to strong and only 6% weak seasonality, which indicates that in most catchments annual peaks are concentrated around the mean occurrence date. Figure 3 shows monthly frequency of flood peaks above certain threshold values q p ( Q i ) . The results of all sites were combined and plotted in the rose diagram (Figure 3). The figure indicates that flood peaks mainly occur between June and August. Furthermore, Figure 3 shows that from a hydrological perspective a separation of the year into a summer half-year ranging from May to October ( M 2 O ), which is dominated by high flow, and in a winter half-year ( N 2 A : November to April), which is characterized by low flow, is reasonable. High flow events above p = 0.999 do not usually occur in the winter half-year (events above T ≈ 16-lyear solely occur from May to September, not shown in Figure 3), which reveals that only the summer half-year is relevant for flood risk considerations.
The pronounced seasonality suggests that there is a different spatial characteristic in M 2 O - and N 2 A - as well as in the all-year series. Figure 4 shows scatter plots of the spatial dependence measure N j ( p ) of 17 sites and for three different p-values for half-year and all-year series. The threshold q p used in Equation (4) to calculate N j ( p ) was derived from the all-year series. To make the results comparable i.e., data from half-year- and all-year series), the pth quantile of half-year series p H Y was calculated using p H Y = 1 1 p A Y 0.5 , where p A Y is the pth quantile of all-year series. In Figure 4a, the dependence measure of the all-year series N j A Y ( p A Y ) versus the N j M 2 O ( p H Y ) (summer month) is plotted, whereby a strong correlation can be especially observed for high p-values. For p A Y = 0.95 , 0.99 and 0.997, the dependence measure for A Y series is systematically lower than for M 2 O series. Figure 4b shows that the spatial dependence between A Y - and N 2 A series differs and that the dependence measure for all-year series is systematically higher than for the winter half-year. The distribution-free Wilcoxon test [49] was applied to assess whether N j N 2 A ( p H Y ) and N j M 2 O ( p H Y ) differ significantly. The null hypothesis that the spatial dependence of winter and summer half-year do not differ was rejected for a significance level of 1%. However, the HT model relies on the assumption that flooding in the winter half-year has the same spatial characteristics as flooding in summer half-year [22]. Thus, half-year series were used for further analyses.

5.2. Spatial Dependence in River Flows

The analysis of the spatial dependence of river flows can help to understand the characteristics of widespread flooding. The spatial dependence measures P i , j ( p ) and N j ( p ) were calculated for M 2 O and N 2 A series, respectively.
P i , j ( p ) explains the dependence of a pair of variables for a certain threshold q p , where P i , j ( p ) = 0.4 means that there is a 40% probability of Q i being large i.e., above q p ) if Q j is above the pth quantile. The spatial dependence measures P i , j ( p ) of four selected stations (Kennelbach, Thal, Gisingen, Schruns) over a range of p are depicted in Figure 5, where P i , j M 2 O ( p H Y ) (summer half-year) is shown in the upper triangle and P i , j N 2 A ( p H Y ) (winter half-year) is shown in the lower triangle. P i , j ( p ) of observed data (solid cyan lines) shows the range of dependence between the stations. There is a relatively strong spatial dependence for low p-values between nested catchments where the downward propagation of flood wave plays an important role (e.g., Kennelbach and Thal as well as Gisingen and Schruns), whereas P i , j ( p ) varies markedly for high p-values. The dependence of summer half-years between the stations Kennelbach and Gisingen is fairly equally spread over the entire range of p. These watersheds are relatively large and therefore more vulnerable to long-rainfall events triggered by advective storm systems that affect the entire study area. In Figure 5, the lowest dependence can be found between the stations Thal and Schruns. This is because the watersheds are relatively small and their hydrological characteristics differ significantly [35]. Generally, the spatial dependence between stations in the summer half-year is stronger than in the winter half-year; in 65% of all possible combinations, i.e., 272 cases), P i , j M 2 O ( p H Y ) is higher than P i , j N 2 A ( p H Y ) (averaged over p). The most obvious differences between P i , j N 2 A ( p H Y ) and P i , j M 2 O ( p H Y ) appear between station Kennelbach and Gisingen as well as Thal and Gisingen, where P i , j N 2 A ( p H Y ) is high for large p.
The second spatial dependence measure N j ( p ) can be interpreted, as synopsis of P i , j ( p ) over a range of i. In contrast to Keef et al. [22], who investigated the spatial dependence of river flow and precipitation across GB by analysing N j ( p ) of stations within a defined radius (30 and 60 km), we analysed the spatial dependence over the entire study area i.e., consideration of all stations). This is reasonable since the entire study area is relatively small and widespread flooding across the study area is relevant for the Province of Vorarlberg.
Figure 6 shows the spatial dependence measure N j ( p ) for the summer half-year. Similar to the first dependence measure, a comparison of N j M 2 O ( p H Y ) and N j N 2 A ( p H Y ) reveals that the dependence in summer is higher than in winter i.e., in 77% of possible combinations, N j M 2 O ( p H Y ) is higher than N j N 2 A ( p H Y ) ). Figure 6a illustrates N j M 2 O ( p H Y ) over a range of p H Y . Overall, the plot indicates that the estimated dependence between the sites decreases with increasing p H Y . In general, the stations located at the same river (nested catchments) show high dependence (e.g., Kennelbach, Mellau, Hopfreben). Data from small headwater catchments and catchments that are situated on the edge of the study area often show low dependence (e.g., Thal, Unterhochsteg, Lochau). In particular, for these stations, N j M 2 O ( p H Y ) increases with higher p H Y i.e., fewer data involved in the analysis), because these small catchments seem to be particularly vulnerable to synoptic weather patterns, which in turn are often responsible for the most extreme events in small catchments. Furthermore, the above-mentioned stations with low dependence lie within a radius of 5 km, which indicates that spatial proximity does not guarantee similar behaviour.
As the entire study has ,a distinct flood risk background, spatial dependence of rare events was studied in detail. Therefore, we illustrate N j M 2 O ( p H Y ) for p H Y equal to 0.99, 0.998, and 0.999, which correspond to T 1.6, 8.2, and 16.4-year in maps of Vorarlberg (Figure 6b–d). The gauges are coloured according to N j M 2 O ( p H Y ) . Figure 6c is particularly suitable for detailed discussion as N j M 2 O ( p H Y ) is based on a reasonable number of events and p H Y = 0.998 is high enough to analyse the dependence of severe flood events. Overall, the stations can be divided into three groups according to their spatial dependence: (i) low dependence N j M 2 O ( 0.998 ) < 0.3 (stations: Thal, Hoher Steg, Unterhochsteg, Schruns, Lustenau (E), Lochau); (ii) moderate dependence 0.3 N j M 2 O ( 0.998 ) < 0.4 (stations: Lingenau, Enz, Lustenau (H), Garsella, Gisingen, Lech); and (iii) high dependence N j M 2 O ( 0.998 ) > 0.4 (stations: Mellau, Kennelbach, Hopfreben, Schönenbach, Laterns). The sites with high dependence are all located in the region Bregenzerwald, which experiences a high amount of annual precipitation.

5.3. Estimation Results and Model Checks

The estimation and simulation procedures outlined in Section 4.3, which were implemented in Matlab (The Mathworks, Natick, MA, USA), were carried out using the data from summer and winter half-year. The interim results of the bivariate case are illustrated in Figure 2 for data of stations Kennelbach ( i = 2 ) and Thal ( i = 5 ) for data of M 2 O . Figure 2a shows the observed data ( Q 2 , Q 5 ) on a real scale. The same data are depicted in Figure 2b in a transformed scale, whereas the G P distribution was applied for data above the threshold q p G P H Y = 0.9 . Figure 2c illustrates X 2 and Y 5 | 2 in a transformed scale above the threshold q p s i m . This figure includes data where the marginal G P distribution was applied above the threshold of q p G P H Y = 0.9 (red) and above q p G P H Y = 0.98 (grey), respectively. Furthermore, the fitted model ( Y 5 | 2 = α ^ 2 X 2 + X β ^ 2 μ ^ 2 ) was applied for values with an exceedance probability higher than p s i m H Y = 0.97 , 0.98, and 0.99. Figure 2c indicates the influence of the choice of the model parameter on the regression model. The red bold line illustrates the fitted model for the selected model parameters p G P H Y = 0.9 and p s i m H Y = 0.98 . The corresponding quantile values of an all-year series would be p G P A Y = 0.95 and p s i m A Y = 0.99 . Figure 7 shows data and fitted regression models for periods M 2 O and N 2 A for exemplarily chosen stations. Thus, the difference of the models for M 2 O and N 2 A periods are apparent, which confirms the necessity of using half-year series.
After parameter estimation was done, the simulation procedure starts with the selection of the conditioning river flow. Therefore, the number of extreme events per year was simulated with a negative binomial distribution. Figure 8a shows that the distribution of simulated data is compatible with the empirical distribution of the observed data. The next step was the selection of the conditioning station i i.e., the station with the highest non-exceedance probability. The distribution of the conditioning station is shown in Figure 8b. The cyan crosses represent the probability that one of the 17 gauges is the most extreme based on observed data, and the percentile ranges result from drawing from the multinomial distribution (100 replications). The distribution of the sites with the largest probability of being extreme (Figure 8b) may appear counter-intuitive, as one could expect a uniform distribution. However, the results are in accordance with the findings of Keef et al. [28], who find similar properties of investigated runoff data in GB. Sites that lie on the same river exhibiting high dependences have low probability of being used as conditioning gauge (e.g., sites i = 1 and 2). Sites with low dependence are expected to have higher probability of being used as conditioning gauges; the four sites with the highest probability ( i = 11 , 12 , 15 and 17) are categorized as stations with low spatial dependence (see Section 5.2). After the conditioning site i was selected, its value x i was drawn.
Before simulating the dependent component (i.e., Y ), Z has to be drawn as outlined in Section 4.3.5. The smoothing parameter was chosen with m = 5 . Finally, a set of spatial dependent extreme events S was simulated. Figure 9d shows a scatterplot (Kennelbach and Thal) in real scale, where green triangles represent simulated events representing n y = 38 years of the simulation period. Note that, although the simulated events can be assigned to a selected time stamp (i.e., year), no continuous time series of river flow was made available.
As no general Goodness-of-Fit test of the HT model exists, several aspects were tested to ensure that the model assumptions were not violated. The first assumption that needs to be checked is that the tails of the marginals are Generalized Pareto distributed. Asymptotically, this assumption is supported by the Pickands–Balkema–de Haan theorem [45]. To assess whether the threshold q p G P was high enough, we used the Goodness-of-Fit test for G P distribution [50]. For thresholds corresponding to p G P 0.9 , the Generalized Pareto distribution of the tails was never rejected for the 17 investigated river gauges on the 5% significance level; on the 10% significance level, very few rejections appeared. Furthermore, stability of the estimated parameters (see e.g., chap. 4.3.4 in Coles [51]) was checked by visual inspection of parameter estimate plots. This shows that a value of p G P of at least 0.9 is reasonable for all gauges.
Secondly, the assumption of independence of the conditioning variable X and the vector of normalized dependent variables Z = Y α X X β , given that X is above some high threshold, is checked by a series of independence tests for all possible combinations (17 × 16 = 272 combinations) of data ( x t , y i , t ) with x t > q p s i m . A statistical test based on Spearman’s ϱ was used since no assumptions on the distribution of Z were made. For p s i m 0.95 , less than 5% of these tests rejected independence on a significance level of 5%. Hence, independence can be considered to hold for p s i m values of at least 0.95 .
The third assumption that the distribution G of Z is non-degenerated with no mass at + is of theoretical interest and automatically fulfilled when G is inferred from data.
A key aspect of all model applications is the compatibility of observed and simulated data. This was checked by comparing several components such as scatterplots of selected gauges, the spatial dependence measures P i , j ( p ) and N j ( p ) , and the medians of yearly maximum runoff of the half-year series derived from observed and simulated data. These model checks were used to determine the most suitable model parameters to be p G P H Y = 0.9 and p s i m H Y = 0.98 . In the context of model checks, it is important to emphasize that the HT model is valid above a threshold q p s i m . This implies that a comprehensive set of simulated data was only available for high runoff values. As a consequence, the comparison of observed and simulated data has to be done on the basis of relative ‘extreme’ values.
Figure 9 illustrates scatterplots of four representative gauges (Kennelbach, Thal, Gisingen, Schruns) of observed and simulated data (simulation period of 38 years). A visual check shows the range of dependence in the data and indicates that the HT model is flexible enough to reproduce the structure of the data. When the correlation between observed data of two stations is strong, simulated data also reveal strong correlations (e.g., Kennelbach and Gisingen). On the other hand, simulated data are almost arbitrarily distributed when observed data show hardly any correlation (e.g., Thal and Schruns). Figure 9 also illustrates the difference between runoff data of M 2 O (upper triangle) and N 2 A (lower triangle). The runoff intensity of the former is considerably higher than the runoff in N 2 A , which again indicates that a separation of the data into winter and summer half-year is necessary.
Furthermore, the spatial dependence measure P i , j ( p ) of observed and simulated data was compared and depicted in Figure 5. Because of the validity of the HT model above a certain threshold u i.e., p s i m H Y = 0.98 ), P i , j ( p ) of simulated data is illustrated for p = 0.98 , 0.99, 0.994, 0.998, and 0.999. As illustrated in Figure 5, the dependence measure P i , j ( p ) of observed and simulated data is in the same range. Table 4 shows the systematic comparison for all possible combinations i.e., 272 combinations). P i , j M 2 O ( p H Y ) = 0.95 (column 3) means that 95% of the observed P i , j ( p ) lies within the specified range (5–95% of simulated data). The results for P i , j N 2 A ( p H Y ) are slightly higher and show that the model is even better suited to reproduce the spatial dependence of river flow for the winter half-year than for the summer half-year. The systematic analysis of the spatial dependence measure N j ( P ) (Table 4) confirms the accordance of simulated and observed data.
The most important model check is probably a systematic comparison of observed and simulated runoff series of each station. Therefore, the use of yearly maximum runoff ( M R ) of the half-year series seems to be appropriate because this type of series contains only relatively high values. Consequently, M R of observed data ( M R o b s M 2 O and M R o b s N 2 A ) and simulated data ( M R s i m M 2 O and M R s i m N 2 A ) were derived and the medians of each series were compared (Figure 10). Instead of runoff (m3 s−1), the use of specific discharge q (l s−1 km−2) allows for easy comparisons of stations with different catchment size. For the comparison of simulated and observed data, 100 replications with a simulation period of 38 years were generated and percentile ranges were depicted for all stations (Figure 10). The required number of replications has been determined by applying a procedure to obtain a specified precision of confidence intervals [52]. For 100 replications, the 90% confidence interval of the medians of M R s i m M 2 O has a relative length of less than 5% for all 17 sites. Figure 10 indicates that observed and simulated values are in accordance for most sites. The systematic comparison of the maximum runoff (cf. the last two lines of Table 4) shows that the HT model is able to reproduce the runoff adequately and that the selected parameters are appropriate.

5.4. Simulated Extreme Events

The main output of the HT model is a set of spatially distributed flood events S on defined locations i.e., river gauges). As the runoff intensity of the summer half-year (see also Figure 3) is considerably higher than the intensity of the winter half-year, only events from M 2 O were considered here. In a first step, the information about the intensity of these events was made available in real scale i.e., m3 s−1) for each individual event and every station. To enable a further usage of the synthetic events, it was necessary to define the so-called level of extremeness [21] for all events and every single station on a probability scale. This probability scale corresponds to the concept of return period of hydraulic loads.
The link between physical scale (m3 s−1) and ‘return period’ scale, which corresponds to ‘probability’ scale, was constructed by applying the Generalized Extreme Value ( G E V ) distribution. The method of L-moments [53] was used to fit the G E V distribution to the A M S of observed data of all stations and a χ 2 Goodness-of-Fit test was carried out to check the suitability of this distribution. The G E V distribution was never rejected at a 5% significance level for the 17 investigated stations. Furthermore, previous studies pointed out that the G E V distribution is an appropriate distribution function within the region [35,54,55].
Figure 11 shows five observed and five simulated flood events where the color of the triangles represents the return period that occurs on each station. The synthetic events reveal a reasonable spatial dependence structure that can also be found in observed flood events. For further analyses, only severe flood events are of interest, where one or multiple stations exceed a T ≥ 30-year. The severity of the illustrated flood events was categorized by the measure unit of flood hazard (see Section 3.2), which is indicated on the bottom of each map. For comparison, following the categorization of the UoFH, nine events were identified in the period 1976–2013, and the most severe observed flood event (August 2005) was categorized as a 12 UoFH-event i.e., the runoff was equal to or exceeded a threshold that corresponded to a return period T = 30-year at 12 river gauges).

6. Conclusions

In this paper, we have demonstrated that the HT model [20] can be successfully applied in a complex mountain topography with high runoff seasonality. However, before applying the conditional dependence model, a priori data review and preparation is essential in order to avoid violation of the model assumptions. A thorough data analysis of daily maximum data of 17 river gauging stations in the study area Vorarlberg shows that a time interval of three days is most suitable for event definition. Because of the pronounced seasonality of runoff and the different spatial characteristics in the half-year series, a separation of the all-year series was necessary. Thus, the conditional model was driven with time series from M 2 O and N 2 A .
The spatial characteristics of widespread and often spatially heterogeneous flood events were analysed. Therefore, the spatial dependence measures P i , j ( p ) and N j ( p ) were calculated in order to improve the understanding of the spatial characteristics of widespread river flood events. River gauging stations were identified where high runoff likely occurs simultaneously with other stations and, on the other hand, stations were found that show hardly any spatial dependence with other stations.
By applying the HT model, a set of synthetic flood events were generated. These extreme events represent a large range of possible flooding situations as they could occur in the investigated region. The set of synthetic flood events serves as a surrogate for probabilistic flood risk analysis. The severity of individual flood events was categorized by a simplistic hydrological proxy (units of flood hazard). The present modeling procedure can be applied wherever a dense network of river gauging measurements exist that capture the spatial characteristics of flood events in a specific region.
The analysis that is shown here does not include considerations about the potential adverse consequences of flooding. However, for a comprehensive analysis of flood risk, the following aspects are indispensable. Besides the generation of synthetic flood events, the adverse consequences of flooding have to be quantified (e.g., the monetary damage of buildings) and the overall flood risk within a region has to be assessed. Schneeberger et al. [33] introduces a probabilistic framework for risk analysis, which takes into account all relevant aspects and requires a set of spatially distributed flood events.

Acknowledgments

This work results from the research project InsuRE funded by the Austrian Research Promotion Agency (FFG) within the scope of the programme COMET, the Tyrolean Insurance Company (Tiroler Versicherung), and the Vorarlberger Insurance Company (Vorarlberger Landesversicherung). We would like to thank all the institutions that provided data, particularly the Hydrographic Service of Vorarlberg. K.S. acknowledges the financial support provided through the scholarship Nachwuchsförderung der Universität Innsbruck. We are grateful to all reviewers for the insightful and helpful comments to previous versions of the manuscript.

Author Contributions

K.S. and T.S. conceived and designed the experiments; K.S. and T.S. analyzed the data; T.S. and K.S. developed and implemented the applied method; K.S. and T.S. wrote the paper.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
HT modelMultivariate semi-parametric conditional model introduced by Heffernan and Tawn (2004) [20]
TReturn period in year
A M S Annual maximum series
A M S d m AMS derived from daily mean data
A M S d x AMS derived from daily maximum data
UoFHUnit of flood hazard
A Y All-year (series)
G E V General extreme value (distribution)
M R Yearly maximum runoff (of half-year series)
M 2 O May to October (series)
N 2 A November to April (series)

References

  1. National Research Council (NRC). Flood Risk Management and the American River Basin: An Evaluation; National Academies Press: Washington, DC, USA, 1995. [Google Scholar]
  2. European Union. On the assessment and management of flood risks: Directive 2007/60/EC of the European Parliament and the Council. Off. J. Eur. Community 2007, 288, 27–34. [Google Scholar]
  3. FLOODsite. Lanugauge of Risk: Project Definitions, 2nd ed.; HR Wallingford: Wallingford, UK, 2009. [Google Scholar]
  4. Uhlemann, S.; Thieken, A.H.; Merz, B. A consistent set of trans-basin floods in Germany between 1952–2002. Hydrol. Earth Syst. Sci. 2010, 14, 1277–1295. [Google Scholar] [CrossRef]
  5. Leonard, M.; Westra, S.; Phatak, A.; Lambert, M.; van den Hurk, B.; McInnes, K.; Risbey, J.; Schuster, S.; Jakob, D.; Stafford-Smith, M. A compound event framework for understanding extreme impacts. Wiley Interdiscip. Rev. Clim. Chang. 2014, 5, 113–128. [Google Scholar] [CrossRef]
  6. Thieken, A.H.; Apel, H.; Merz, B. Assessing the probability of large-scale flood loss events: A case study for the river Rhine, Germany. J. Flood Risk Manag. 2015, 8, 247–262. [Google Scholar] [CrossRef]
  7. Falter, D.; Dung, N.V.; Vorogushyn, S.; Schröter, K.; Hundecha, Y.; Kreibich, H.; Apel, H.; Theisselmann, F.; Merz, B. Continuous, large-scale simulation model for flood risk assessments: Proof-of-concept. J. Flood Risk Manag. 2016, 9, 9–21. [Google Scholar] [CrossRef]
  8. Jongman, B.; Hochrainer-Stigler, S.; Feyen, L.; Aerts, J.C.J.H.; Mechler, R.; Botzen, W.J.W.; Bouwer, L.M.; Pflug, G.; Rojas, R.; Ward, P.J. Increasing stress on disaster-risk finance due to large floods. Nat. Clim. Chang. 2014, 4, 264–268. [Google Scholar] [CrossRef]
  9. Falter, D.; Schröter, K.; Nguyen, V.D.; Vorogushyn, S.; Kreibich, H.; Hundecha, Y.; Apel, H.; Merz, B. Spatially coherent flood risk assessment based on long-term continuous simulation with a coupled model chain. J. Hydrol. 2015, 524, 182–193. [Google Scholar] [CrossRef]
  10. Renard, B.; Lang, M. Use of a Gaussian copula for multivariate extreme value analysis: Some case studies in hydrology. Adv. Water Res. 2007, 30, 897–912. [Google Scholar] [CrossRef]
  11. Genest, C.; Favre, A.C.; Béliveau, J.; Jacques, C. Metaelliptical copulas and their use in frequency analysis of multivariate hydrological data. Water Resour. Res. 2007, 43, 1–12. [Google Scholar] [CrossRef]
  12. Schölzel, C.; Friederichs, P. Multivariate non-normally distributed random variables in climate research— Introduction to the copula approach. Nonlinear Process. Geophys. 2008, 15, 761–772. [Google Scholar] [CrossRef]
  13. Klein, B.; Pahlow, M.; Hundecha, Y.; Schumann, A. Probability Analysis of Hydrological Loads for the Design of Flood Control Systems Using Copulas. J. Hydrol. Eng. 2010, 15, 360–369. [Google Scholar] [CrossRef]
  14. Kilgore, R.; Thompson, D.B.; Ford, D.T. Estimating the Joint Probability of Coincident Flows at Stream Confluences. In World Environmental and Water Resources Congress 2011; Beighley, R.E., Killgore, M.W., Eds.; American Society of Civil Engineers: Reston, VA, USA, 2011. [Google Scholar]
  15. Chen, L.; Singh, V.P.; Shenglian, G.; Hao, Z.; Li, T. Flood Coincidence Risk Analysis Using Multivariate Copula Functions. J. Hydrol. Eng. 2012, 17, 742–755. [Google Scholar] [CrossRef]
  16. Requena, A.I.; Mediero, L.; Garrote, L. A bivariate return period based on copulas for hydrologic dam design: accounting for reservoir routing in risk estimation. Hydrol. Earth Syst. Sci. 2013, 17, 3023–3038. [Google Scholar] [CrossRef]
  17. Sraj, M.; Bezak, N.; Brilly, M. Bivariate flood frequency analysis using the copula function: A case study of the Litija station on the Sava River. Hydrol. Process. 2015, 29, 225–238. [Google Scholar] [CrossRef]
  18. Durocher, M.; Chebana, F.; Ouarda, T.B. On the prediction of extreme flood quantiles at ungauged locations with spatial copula. J. Hydrol. 2016, 533, 523–532. [Google Scholar] [CrossRef]
  19. Requena, A.I.; Chebana, F.; Mediero, L. A complete procedure for multivariate index-flood model application. J. Hydrol. 2016, 535, 559–580. [Google Scholar] [CrossRef]
  20. Heffernan, J.E.; Tawn, J.A. A conditional approach for multivariate extreme values (with discussion). J. R. Stat. Soc. B 2004, 66, 497–546. [Google Scholar] [CrossRef]
  21. Lamb, R.; Keef, C.; Tawn, J.A.; Laeger, S.; Meadowcroft, I.; Surendran, S.; Dunning, P.; Batstone, C. A new method to assess the risk of local and widespread flooding on rivers and coasts. J. Flood Risk Manag. 2010, 3, 323–336. [Google Scholar] [CrossRef]
  22. Keef, C.; Svensson, C.; Tawn, J.A. Spatial dependence in extreme river flows and precipitation for Great Britain. J. Hydrol. 2009, 378, 240–252. [Google Scholar] [Green Version]
  23. Jonathan, P.; Flynn, J.; Ewans, K. Joint modelling of wave spectral parameters for extreme sea states. Ocean Eng. 2010, 37, 1070–1080. [Google Scholar] [CrossRef]
  24. Ewans, K.; Jonathan, P. Evaluating environmental joint extremes for the offshore industry using the conditional extremes model. J. Mar. Syst. 2014, 130, 124–130. [Google Scholar] [CrossRef]
  25. Gouldby, B.; Mendez, F.; Guanche, Y.; Rueda, A.; Minguez, R. A methodology for deriving extreme nearshore sea conditions for structural design and flood risk analysis. Coast. Eng. 2014, 88, 15–26. [Google Scholar] [CrossRef]
  26. Keef, C.; Tawn, J.A.; Svensson, C. Spatial risk assessment for extreme river flows. Appl. Stat. 2009, 58, 601–618. [Google Scholar] [CrossRef]
  27. Mendes, B.V.D.M.; Pericchi, L.R. Assessing conditional extremal risk of flooding in Puerto Rico. Stoch. Environ. Res. Risk Assess. 2009, 23, 399–410. [Google Scholar] [CrossRef]
  28. Keef, C.; Tawn, J.A.; Lamb, R. Estimating the probability of widespread flood events. Environmetrics 2013, 24, 13–21. [Google Scholar] [CrossRef]
  29. Neal, J.; Keef, C.; Bates, P.D.; Beven, K.; Leedal, D. Probabilistic flood risk mapping including spatial dependence. Hydrol. Process. 2013, 27, 1349–1363. [Google Scholar] [CrossRef]
  30. Wyncoll, D.; Gouldby, B. Integrating a multivariate extreme value method within a system flood risk analysis model. J. Flood Risk Manag. 2015, 8, 145–160. [Google Scholar] [CrossRef]
  31. Speight, L.; Hall, J.; Kilsby, C. A multi-scale framework for flood risk analysis at spatially distributed locations. J. Flood Risk Manag. 2017, 10, 124–137. [Google Scholar] [CrossRef]
  32. Habersack, H.; Krapesch, G. Hochwasser 2005—Ereignisdokumentation: Der Bundeswasserbauverwaltung, des Forsttechnischen Dienstes für Wildbach- und Lawinenverbauung und des Hydrographischen Dienstes; Bundesministerium für Land- und Forstwirtschaft, Umwelt- und Wasserwirtschaft: Vienna, Austria, 2006. [Google Scholar]
  33. Schneeberger, K.; Huttenlau, M.; Winter, B.; Steinberger, T.; Achleitner, S.; Stötter, J. A probabilistic framework for risk analysis of widespread flood events: A proof-of-concept study. Risk Anal. 2017. [Google Scholar] [CrossRef] [PubMed]
  34. Gaál, L.; Szolgay, J.; Kohnová, S.; Parajka, J.; Merz, R.; Viglione, A.; Blöschl, G. Flood timescales: Understanding the interplay of climate and catchment processes through comparative hydrology. Water Resour. Res. 2012, 48. [Google Scholar] [CrossRef]
  35. Schneeberger, K.; Huttenlau, M.; Stötter, J. A Seasonality Analysis of Flood Peaks as Basis for Flood Risk Analyses in two Alpine Regions. In Proceedings of the 3rd STAHY International Workshop on Statistical Methods for Hydrology and Water Resources Management, Tunis, Tunisia, 1–2 October 2012. [Google Scholar]
  36. Merz, R.; Blöschl, G. A process typology of regional floods. Water Resour. Res. 2003, 39, 1340–1359. [Google Scholar] [CrossRef]
  37. Hawkes, P.J.; Gonzalez-Marco, D.; Sánchez-Arcilla, A.; Prinos, P. Best practice for the estimation of extremes: A review. J. Hydraul. Res. 2008, 46, 324–332. [Google Scholar] [CrossRef]
  38. Grimaldi, S.; Petroselli, A.; Tauro, F.; Porfiri, M. Time of concentration: a paradox in modern hydrology. Hydrol. Sci. J. 2012, 57, 217–228. [Google Scholar] [CrossRef]
  39. Keef, C.; Lamb, R.; Tawn, J.A.; Laeger, S. A multivariate model for the broad scale spatial assessment of flood risk. In Proceedings of the BHS Third International Symposium, Managing Consequences of a Changing Global Environment, Newcastle, UK, 19–23 July 2010. [Google Scholar]
  40. Mardia, K.V.; Jupp, P.E. Statistics of Directional Data, 2nd ed.; Wiley Series in Probability and Statistics; Wiley: New York, NY, USA, 1999. [Google Scholar]
  41. Schneeberger, K.; Dobler, C.; Huttenlau, M.; Stötter, J. Assessing potential climate change impacts on the seasonality of runoff in an Alpine watershed. J. Water Clim. Chang. 2015. [Google Scholar] [CrossRef]
  42. Burn, D.H. Catchment similarity for regional flood frequency analysis using seasonality measures. J. Hydrol. 1997, 202, 212–230. [Google Scholar] [CrossRef]
  43. De Michele, C.; Rosso, R. A multi-level approach to flood frequency regionalisation. Hydrol. Earth Syst. Sci. 2002, 6, 185–194. [Google Scholar] [CrossRef]
  44. Keef, C.; Papastathopoulos, I.; Tawn, J.A. Estimation of the conditional distribution of a multivariate variable given that one of its components is large: Additional constraints for the Heffernan and Tawn model. J. Multivar. Anal. 2013, 115, 396–404. [Google Scholar] [CrossRef]
  45. Embrechts, P.; Klüppelberg, C.; Mikosch, T. Modelling Extremal Events: For Insurance and Finance; Stochastic Modelling and Applied Probability; Springer: Berlin/Heidelberg, Germany, 1997; Volume 33. [Google Scholar]
  46. Taylor, M.S.; Thompson, J.R. A data based algorithm for the generation of random vectors. Comput. Stat. Data Anal. 1986, 4, 93–101. [Google Scholar] [CrossRef]
  47. Holub, M.; Gruber, H.; Fuchs, S. Risks from natural hazards in the insurance business. In Studienreise 2010—Risiko im Bereich Schutz vor Naturgefahren; Skolaut, C., Ed.; Wildbach- und Lawinenverbau; Verein der Diplomingenieure der Wildbach-und Lawinenverbauung Österreichs: Villach, Austria, 2011; Volume 167. [Google Scholar]
  48. Serinaldi, F.; Kilsby, C.G. A Blueprint for Full Collective Flood Risk Estimation: Demonstration for European River Flooding. Risk Anal. 2017, 37, 1958–1976. [Google Scholar] [CrossRef] [PubMed]
  49. Bortz, J.; Schuster, C. Statistik für Human- und Sozialwissenschaftler, 7th ed.; Springer-Lehrbuch; Springer: Berlin, Germany, 2010. [Google Scholar]
  50. Choulakian, V.; Stephens, M.A. Goodness-of-Fit Tests for the Generalized Pareto Distribution. Technometrics 2001, 43, 478–484. [Google Scholar] [CrossRef]
  51. Coles, S. An Introduction to Statistical Modeling of Extreme Values; Springer Series in Statistics; Springer: London, UK; New York, NY, USA, 2001. [Google Scholar]
  52. Law, A.; Kelton, W. Simulation Modelling and Analysis; McGraw-Hill Series in Industrial Engineering and Management Science; McGraw-Hill: New York, NY, USA, 2000. [Google Scholar]
  53. Hosking, J.R.M.; Wallis, J.R. Regional Frequency Analysis: An Approach Based on L-Moments; Cambridge University Press: Cambridge, UK; New York, NY, USA, 1997. [Google Scholar]
  54. Merz, R.; Blöschl, G. Flood frequency regionalisation—Spatial proximity vs. catchment attributes. J. Hydrol. 2005, 302, 283–306. [Google Scholar] [CrossRef]
  55. Petrow, T.; Merz, B.; Lindenschmidt, K.E.; Thieken, A.H. Aspects of seasonality and flood generating circulation patterns in a mountainous catchment in south-eastern Germany. Hydrol. Earth Syst. Sci. 2007, 11, 1455–1468. [Google Scholar] [CrossRef]
Figure 1. Study area Vorarlberg and the location of river runoff gauges.
Figure 1. Study area Vorarlberg and the location of river runoff gauges.
Hydrology 05 00005 g001
Figure 2. Scatterplot of observed and simulated data from half-year series (May to October) of gauges Kennelbach ( i = 2 ) and Thal ( i = 5 ), statistical model, and model thresholds: (a) observed data in real scale; (b) observed data (threshold for marginal distribution q p G P = 0.9 ) in transformed scale, incl. q p s i m = 0.98 ; (c) observed data in transformed scale (values above q p s i m ), incl. fitted model (red color indicates that the G P distribution was applied above the threshold q p G P = 0.9 ; grey color indicates q p G P = 0.98 ); and (d) observed and simulated data ( n y = 38 year) on a real scale.
Figure 2. Scatterplot of observed and simulated data from half-year series (May to October) of gauges Kennelbach ( i = 2 ) and Thal ( i = 5 ), statistical model, and model thresholds: (a) observed data in real scale; (b) observed data (threshold for marginal distribution q p G P = 0.9 ) in transformed scale, incl. q p s i m = 0.98 ; (c) observed data in transformed scale (values above q p s i m ), incl. fitted model (red color indicates that the G P distribution was applied above the threshold q p G P = 0.9 ; grey color indicates q p G P = 0.98 ); and (d) observed and simulated data ( n y = 38 year) on a real scale.
Hydrology 05 00005 g002
Figure 3. Rose diagram of runoff (for all Q i ) above threshold values ( q p ( Q i ) ).
Figure 3. Rose diagram of runoff (for all Q i ) above threshold values ( q p ( Q i ) ).
Hydrology 05 00005 g003
Figure 4. Comparison of spatial dependence measure N j ( p ) derived from all-year and half-year series: (a) N j A Y ( p ) versus N j M 2 O ( p ) and (b) N j A Y ( p ) versus N j N 2 A ( p ) .
Figure 4. Comparison of spatial dependence measure N j ( p ) derived from all-year and half-year series: (a) N j A Y ( p ) versus N j M 2 O ( p ) and (b) N j A Y ( p ) versus N j N 2 A ( p ) .
Hydrology 05 00005 g004
Figure 5. Spatial dependence measure P i , j ( p H Y ) against non-exceedance probability p H Y of observed and simulated (100 × 38 year) runoff data of station Kennelbach ( i = 2 ), Thal ( i = 5 ), Gisingen ( i = 14 ), and Schruns ( i = 12 ). Upper triangle: M 2 O . Lower triangle: N 2 A .
Figure 5. Spatial dependence measure P i , j ( p H Y ) against non-exceedance probability p H Y of observed and simulated (100 × 38 year) runoff data of station Kennelbach ( i = 2 ), Thal ( i = 5 ), Gisingen ( i = 14 ), and Schruns ( i = 12 ). Upper triangle: M 2 O . Lower triangle: N 2 A .
Hydrology 05 00005 g005
Figure 6. (a) spatial dependence measure N j M 2 O ( p H Y ) against non-exceedance probability p H Y ; (bd) dependence measure depicted in a map of Vorarlberg with colored gauging stations representing N j M 2 O ( p H Y ) , corresponding to (b) p H Y = 0.99 ( T 1.6-year); (c) p H Y = 0.998 ( T 8.2-year), and (d) p H Y = 0.999 ( T 16.4-year).
Figure 6. (a) spatial dependence measure N j M 2 O ( p H Y ) against non-exceedance probability p H Y ; (bd) dependence measure depicted in a map of Vorarlberg with colored gauging stations representing N j M 2 O ( p H Y ) , corresponding to (b) p H Y = 0.99 ( T 1.6-year); (c) p H Y = 0.998 ( T 8.2-year), and (d) p H Y = 0.999 ( T 16.4-year).
Hydrology 05 00005 g006
Figure 7. Observed data from M 2 O and N 2 A in transformed scale (values above q p s i m H Y = 0.98 ) incl. fitted model conditioned on j = 14 (Gisingen) and dependend: (a) i = 2 (Kennelbach); (b) i = 5 (Thal); and (c) i = 12 (Schruns).
Figure 7. Observed data from M 2 O and N 2 A in transformed scale (values above q p s i m H Y = 0.98 ) incl. fitted model conditioned on j = 14 (Gisingen) and dependend: (a) i = 2 (Kennelbach); (b) i = 5 (Thal); and (c) i = 12 (Schruns).
Hydrology 05 00005 g007
Figure 8. Model application: (a) empirical CDF of number of extreme events i.e., at least one site exceeds q p s i m ) per year and (b) probability that gauge i is most extreme.
Figure 8. Model application: (a) empirical CDF of number of extreme events i.e., at least one site exceeds q p s i m ) per year and (b) probability that gauge i is most extreme.
Hydrology 05 00005 g008
Figure 9. Pairwise comparison of observed and simulated ( n y = 38 -year) data of station Kennelbach ( i = 2 ), Thal ( i = 5 ), Gisingen ( i = 14 ), and Schruns ( i = 12 ). Upper triangle: M 2 O . Lower triangle: N 2 A .
Figure 9. Pairwise comparison of observed and simulated ( n y = 38 -year) data of station Kennelbach ( i = 2 ), Thal ( i = 5 ), Gisingen ( i = 14 ), and Schruns ( i = 12 ). Upper triangle: M 2 O . Lower triangle: N 2 A .
Hydrology 05 00005 g009
Figure 10. Comparison of observed and simulated data of period M 2 O for selected 17 sites. Medians of observed time series are depicted by blue crosses and medians of simulated data (100 × 38 year) are shown by selected percentiles.
Figure 10. Comparison of observed and simulated data of period M 2 O for selected 17 sites. Medians of observed time series are depicted by blue crosses and medians of simulated data (100 × 38 year) are shown by selected percentiles.
Hydrology 05 00005 g010
Figure 11. Observed and examples of simulated flood events (T in year) including a categorization of the events severity by means of unit of flood hazard (UoFH).
Figure 11. Observed and examples of simulated flood events (T in year) including a categorization of the events severity by means of unit of flood hazard (UoFH).
Hydrology 05 00005 g011
Table 1. Selected River gauging stations, including the ID (station ID) according to the Austrian Hydrographic Service, and the catchment area (km2).
Table 1. Selected River gauging stations, including the ID (station ID) according to the Austrian Hydrographic Service, and the catchment area (km2).
iName of GaugesRiverIDArea
1MellauBregenzerach200261228.6
2KennelbachBregenzerach200329826.3
3HopfrebenBregenzerach20024641.7
4SchönenbachSubersach20028731.1
5ThalRotach20031190.1
6LingenauSubersach200295111.6
7Hoher StegDornbirnerach200212112.9
8EnzDornbirnerach20020451.1
9Lustenau (H)Rheintalbinne20022077.5
10LaternsFrutz20015433.4
11UnterhochstegLeiblach200394102.4
12SchrunsLitz200048102
13GarsellaLutz20010595.5
14GisingenIll2001471281
15Lustenau (E)Rhein2001966110
16LechLech20037884.3
17LochauRuggbach2003457.2
Table 2. Probability R ( p , L ) that peaks (exceeding a threshold q p ) last L days.
Table 2. Probability R ( p , L ) that peaks (exceeding a threshold q p ) last L days.
pT (Year)L (Days)
01234
R (p,L)0.990.310.320.210.030
0.9970.910.390.250.020
0.9992.710.340.320.010
0.99979.110.440.2800
Table 3. Concentration time ( T c ) and components θ i ¯ and r i ¯ of ‘Burn vector’ based on A M S for all sites.
Table 3. Concentration time ( T c ) and components θ i ¯ and r i ¯ of ‘Burn vector’ based on A M S for all sites.
iName of Gauges T c (h) θ i ¯ (°) r i ¯
1Mellau1.9–4.82040.56
2Kennelbach3.3–8.82160.40
3Hopfreben0.7–2.32050.52
4Schönenbach0.5–2.02080.61
5Thal1.3–4.62240.29
6Lingenau1.4–3.82070.57
7Hoher Steg1.4–4.31960.70
8Enz0.8–2.82090.64
9Lustenau (H)1.4–8.71900.73
10Laterns0.7–2.12080.68
11Unterhochsteg1.0–6.31880.45
12Schruns1.5–3.51880.80
13Garsella1.2–3.02000.69
14Gisingen5.3–121920.76
15Lustenau (E)10–301990.77
16Lech1.2–3.41910.63
17Lochau0.4–1.32050.58
Table 4. Comparison of observed and simulated data for selected criteria ( P i , j ( p ) and N j ( p ) , and medians of yearly maximum runoff for summer and winter half-year). The results exhibit the proportion of possible combinations ( P i , j ( p ) , N j ( p ) ) and sites (median( M R )), respectively, where the observed data lies within the percentile range of simulated data.
Table 4. Comparison of observed and simulated data for selected criteria ( P i , j ( p ) and N j ( p ) , and medians of yearly maximum runoff for summer and winter half-year). The results exhibit the proportion of possible combinations ( P i , j ( p ) , N j ( p ) ) and sites (median( M R )), respectively, where the observed data lies within the percentile range of simulated data.
Test CriteriaPercentiles (%)
12.5–87.55–952.5–97.5
P i , j M 2 O ( p H Y ) 0.870.950.98
P i , j N 2 A ( p H Y ) 0.910.970.99
N j M 2 O ( p H Y ) 0.830.960.98
N j N 2 A ( p H Y ) 0.870.980.99
median ( M R M 2 O )0.710.940.94
median ( M R N 2 A )0.650.820.94

Share and Cite

MDPI and ACS Style

Schneeberger, K.; Steinberger, T. Generation of Spatially Heterogeneous Flood Events in an Alpine Region—Adaptation and Application of a Multivariate Modelling Procedure. Hydrology 2018, 5, 5. https://doi.org/10.3390/hydrology5010005

AMA Style

Schneeberger K, Steinberger T. Generation of Spatially Heterogeneous Flood Events in an Alpine Region—Adaptation and Application of a Multivariate Modelling Procedure. Hydrology. 2018; 5(1):5. https://doi.org/10.3390/hydrology5010005

Chicago/Turabian Style

Schneeberger, Klaus, and Thomas Steinberger. 2018. "Generation of Spatially Heterogeneous Flood Events in an Alpine Region—Adaptation and Application of a Multivariate Modelling Procedure" Hydrology 5, no. 1: 5. https://doi.org/10.3390/hydrology5010005

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop