Next Article in Journal
Effect of Regulated Deficit Irrigation (RDI) on the Growth and Development of Pear Fruit (Pyrus communis L.), var. Triunfo de Viena
Previous Article in Journal
Exploring the Impact of Group Tourists’ Citizenship Behavior on Engagement: The Intimacy as a Mediating Variable
Previous Article in Special Issue
Parameterization of a National Groundwater Model for New Zealand
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Metamodelling of Naturalised Groundwater Levels at a Regional Level in New Zealand

1
National Institute of Water and Atmospheric Research (NIWA), Christchurch 8011, New Zealand
2
National Institute of Water and Atmospheric Research (NIWA), Wellington 6021, New Zealand
3
Greater Wellington, Wellington 6011, New Zealand
*
Author to whom correspondence should be addressed.
Sustainability 2023, 15(18), 13393; https://doi.org/10.3390/su151813393
Submission received: 5 July 2023 / Revised: 14 August 2023 / Accepted: 30 August 2023 / Published: 7 September 2023

Abstract

:
Groundwater is under pressure from increasing demands for agriculture, industry, domestic uses and support of ecosystems. Understanding the natural state of a groundwater system helps policy makers manage groundwater sustainably. Here we developed a metamodelling approach based on stepwise linear regression that emulates the functionality of physically-based models in the three primary aquifers of the Greater Wellington region of New Zealand. The inputs for the metamodels included local weather data, and nearby river flow data. The metamodels were calibrated and validated against the available simulations of naturalised groundwater level time series from physically-based models for 47 selected wells. For 36 of these wells, the metamodels had Nash-Sutcliffe Efficiency and coefficient of determination over 0.5, showing that they could adequately mimic naturalised groundwater level dynamics as simulated by the physically-based groundwater models. The remaining 11 wells had unsatisfactory performance and were typically located far away from rivers or along the coast. The results also showed that modelled groundwater levels in the aquifer’s recharge zone were more sensitive to short-term (less than 2 weeks lag) than long-term river flow (above 4 weeks to 1 year lag), whereas the converse pattern was observed for the aquifer’s discharge zone. Although some special considerations are needed, this metamodelling framework can be generally applied to other aquifers to support groundwater resource management at a lower cost than updating physically-based models.

1. Introduction

Groundwater is a very important natural resource and its total volume represents 96% of all earth’s unfrozen fresh water [1]. Groundwater is not only the primary source of drinking water for half of the world’s population [2] and the source for over 40% of global consumptive water use in irrigation [3], but it also sustains ecosystems by providing water, nutrients and a relatively stable temperature in streams and lakes [4]. However, with combined effects of increasing groundwater abstraction and climate change, there is a global depletion in groundwater resources [5,6]. Therefore, understanding of the natural state and prediction of groundwater resource availability, e.g., water level, is a crucial step to manage groundwater resources for water allocation purposes and increase the resilience under climate variability and change.
To predict groundwater resource dynamics, traditionally three different approaches are used: conceptual modelling [7,8], statistical modelling [9,10] and physically-based numerical modelling (e.g., MODFLOW [11]; FEFLOW [12]; HydroGeoSphere [13]). Conceptual modelling is usually used to model the general water balance (e.g., input to and output from the groundwater system) and normally provides a basis for physically-based numerical modelling. Statistical modelling usually applies statistical analysis on existing data (e.g., trend analysis). Numerical modelling can give physically sound predictions if the model structure is robust in representing the physical processes accurately and is calibrated satisfactorily; however, it needs great effort in data collection (e.g., geological data), model calibration and model simulation.
This study focuses on the Greater Wellington region, which lies in the southern part of the North Island of New Zealand (Figure 1). Groundwater is particularly important in the region as a supply of water for irrigation, providing 60% of the region’s irrigation allocation. Groundwater also provides 30% of the region’s public and community water demands [14]. However, over the past two decades, groundwater levels at some wells have shown a declining trend, suggesting a decreasing availability of groundwater over time [15]. Greater Wellington is responsible for managing water resources in the region. Greater Wellington requires an understanding of naturalised groundwater levels to assist with sustainable groundwater management (e.g., status and trend) and to fulfil national regulatory requirements such as state of the environment reporting and limit setting for resource use. Here, naturalised groundwater levels refer to the conditions that would exist in the absence of any human abstractions. Note that the available measured groundwater level data do not represent natural groundwater levels because the groundwater systems have been significantly altered by humans through water extraction.
In the Greater Wellington region, there are three physically-based models for estimating naturalised groundwater levels, corresponding to the three principal aquifer systems that exist within the region [16,17,18,19,20]. Updating and upgrading these physically-based models with recent data such as climate and water abstraction data would be time-intensive and expensive. Despite several recent advances in model-data assimilation approaches, the process of updating numerical groundwater models with new input datasets (e.g., land surface recharge, river flow) is expensive and generally cost-prohibitive to continuously use in decision-making. Accordingly, in recent years, Greater Wellington has considered alternative and innovative modelling approaches to support rapid decision-making.
The purpose of this study is to evaluate whether a metamodelling approach can mimic the dynamics of the groundwater system in the Greater Wellington region. In so doing, we assess how metamodelling performance varies in terms of location and depth, and what drives the groundwater system spatially. The metamodels used in this study are based on a conceptual model which generally describes the relationships between the groundwater system and the weather and river systems. Metamodels (i.e., a “model to a model”) are developed and trained based on existing physically-based models, and can overcome the key disadvantages of the three traditional approaches listed above [21,22]. The metamodelling approach uses existing numerical groundwater models to develop significantly lower-cost estimates of groundwater level for continuous, and potentially for near real-time, decision-making [23,24,25]. These metamodels, such as Linear Regression and machine learning technology (e.g., Artificial Neural Network—ANN), have been widely used for water resources management including groundwater modelling [25,26,27,28,29,30]. Despite computational efficiency of metamodelling, there are limitations and challenges in its application, including high-dimensional problems, uncertainties, validation and overfitting, etc. These have been reviewed and addressed in the literature (e.g., see [31] for a detailed review on metamodelling in water resources and [32] for an general introduction on machine learning for hydrologic sciences). In this study, we developed a metamodelling framework by combining a conceptual groundwater model and the multivariable regression method to simulate “naturalised” groundwater levels across the Greater Wellington region to assist with groundwater resource management. Compared with machine learning methods (e.g., ANN) which are often regarded as “black box” and involving “big data”, the linear regression method is easy to apply and maintains the physical interpretation of the relationship between inputs and outputs [26,27]; thus, it is suggested to be applied even before using machine learning methods [27].

2. Materials and Methods

2.1. Methodology

2.1.1. Conceptual Model

Natural groundwater level at a location is driven by various factors. These factors include the amount and timing of local land surface recharge (LSR) and/or recharge from river seepage (if relevant), the groundwater flow dynamics of the regional groundwater system, and soil and aquifer hydraulic properties.
The amount and timing of LSR into the aquifer system is driven by weather conditions (e.g., precipitation (P), temperature (T), and potential evapotranspiration (PET)), plant water uptake, soil water holding capacity and topography (land slope). Statistically, if there is no change in land cover or land use, LSR can be simulated as a function of weather variables, i.e.,
L S R = f ( P ; T ; P E T ; )
River flow, within a groundwater zone, can interact with the groundwater (e.g., through hydraulic connection) as a losing or gaining stream. Therefore, rivers can also impact groundwater levels along the river course where a hydraulic connection exists.
River flow is also driven by the upstream weather conditions. Thus, the river flow can also be simulated as a function of weather variables:
Q = f ( P u ;   T u ;   P E T u ;   )
where   Q , P u , T u , and P E T u are river flow and upstream precipitation, temperature, and PET. “Upstream” here means the catchment area in which water accumulates and then flows to the given flow recorder site.
Inflow and outflow of groundwater at a specific location are controlled by the regional groundwater system, which is also a function of the LSR.
To sum up, the groundwater level (L) at a location can be simulated as a function of river flow and local weather conditions:
L = f Q ; L S R = f Q ;   P ; T ; P E T ;
or in a time series form:
L t = f Q t ,   , Q t i ; P t , , P t j ,   ; T t , , T t k ; P E T t , , P E T t l ;
where t is time, and i, j, k and l are the time lags between the change in water level and the change in other variables (Q, P, T and PET). Variables with time lags have longer-term effects on groundwater level, reflecting surface storage (river), soil storage and groundwater storage. In this study, a time step length of one week was used. It is worth noting that the impact of upstream weather factors ( P u , T u , and P E T u ) are represented by river flow Q.

2.1.2. Metamodel

Physically-based models (e.g., MODFLOW [11]) and conceptual process models (e.g., TopModel [33]) are often used in hydrology to solve the above equations. One disadvantage of these models, as described above, is that they normally require additional data or assumptions (e.g., hydraulic conductivity) and model calibration.
In this study, the metamodelling approach was applied using a linear regression model to simulate the relationship between groundwater level, river flow and weather conditions (i.e., Equation (3)), for each groundwater well site:
L t = i = 1 N 0 a 0 , i X i , t + + i = 1 N k a k , i X i , t k + b
where X i , t and X i , t k are the weather or flow variable at time step t and at time step tk (i.e., with a lag time of k time steps), respectively, and a k , i and b are corresponding regression coefficients.
Generally, not all variables are important in the regression for a given site. Thus, we employed stepwise regression, using the Akaike Information Criterion (AIC) and the Bayesian Information Criterion (BIC), to determine which independent variables needed to be retained in the final model at each site [34].

2.2. Case Study and Data

2.2.1. Groundwater Systems in Greater Wellington Region

There are three principal groundwater areas in Greater Wellington region (Figure 1): Kapiti Coast, Hutt Valley and Wairarapa Valley. Groundwater in Kapiti Coast is mainly used for irrigation. Hutt Valley groundwater is a major drinking water source to the Greater Wellington metropolitan area. Water use in the Wairarapa Valley is mainly for irrigation. In total, groundwater use in the region accounts for approximately one-third of the total annual water allocation. Groundwater is particularly important as a supply of water for irrigation, providing 60% of the region’s irrigation allocation. Groundwater also provides 30% of the region’s public and community water supply allocation [14].
Previously, for each principal groundwater area, a physically-based groundwater model has been developed for groundwater management purposes. The Wairarapa Valley groundwater model (covering a total area of 1073 km2) consists of three sub-regional numerical groundwater flow models: upper valley [17], middle valley [18], and lower valley [19], implemented with the FEFLOW finite element code [25]. The Hutt Valley groundwater model [16] was developed using MODFLOW code [11], covering an area of 29 km2. The Kapiti Coast groundwater model [20] was developing using MODFLOW code, covering an area of 172 km2.

2.2.2. Naturalised Groundwater Levels

The naturalised groundwater levels at each well were simulated by the corresponding physically-based groundwater model described in Section 2.2.1. These previous estimates of naturalised groundwater levels were obtained by running the physically-based groundwater models with all abstractions set to zero. Time periods of available naturalised flow predictions from the physical models are 1992~2007 for the Wairarapa Valley, 1992~2019 for the Hutt Valley, and 1992~2007 for the Kapiti Coast.
The modelled naturalised groundwater levels have been used to assess the potential risk to aquifer systems in the region. Greater Wellington identified wells that are potentially at high risk from groundwater level depletion, based on an analysis of groundwater level trends across the three main aquifer systems. The wells were categorised into four broad categories: Very high risk, High risk, Moderate risk, and Low risk. More details can be found in [15]. For this study Greater Wellington selected 47 wells (as listed in Table 1 and Figure 1) which cover all four risk categories (as in column “Risk Category” in Table 1). These 47 well sites were selected to represent a good spatial and depth coverage of the main aquifers in the three primary groundwater zones (Kapiti Coast, Wairarapa and Hutt Valley). They comprise around one third of the total state of environment monitoring network operated by Greater Wellington and are considered to provide a good representation of the variability in overall groundwater conditions in the region as well as aquifer depletion risk from abstraction (very high, high, medium, and low).

2.2.3. Weather Data and Flow Data

Weather data were obtained from NIWA’s Virtual Climate Station Network (VCSN [35]). The VCSN data are estimates of daily climate variables (such as rainfall, PET, air and vapour pressure, maximum and minimum air temperature, soil temperature, relative humidity, solar radiation, and wind speed) on a regular (~5 km spatial resolution) grid covering the whole of New Zealand. These estimates are produced every day, based on the spatial interpolation of actual data observations at climate stations located around New Zealand (https://niwa.co.nz/climate/our-services/virtual-climate-stations; accessed on 29 August 2023).
River flow data were supplied by Greater Wellington at each flow monitoring site (triangle in Figure 1) which is either close to or upstream of each well along the groundwater flow path. Most flow sites have long-term records and are positioned to represent natural upstream surface runoff.

2.3. Application Procedure

Figure 2 illustrates the metamodelling procedure, including data preparation, processing, and metamodelling and assessment, which are elaborated in the following.

2.3.1. Initial Assessment of the Groundwater Level Data

The naturalised groundwater level data used for this study are from scenario simulations from the three physically-based groundwater models described above. Notably, the initial setup of the numerical groundwater models had typically influenced the first year of the simulated groundwater levels, which is normally deemed as the “warm-up” period when running numerical models. Therefore, groundwater level data from this warm-up period were not used as input for the metamodelling.

2.3.2. Compilation of Weather and Flow Data

Daily P, T, and PET data were extracted for each well site from then VCSN dataset. These daily data were then aggregated to a weekly time step to match the time step of the groundwater level data.
River flow sites and daily flow data assigned to each well site were provided by Greater Wellington. These daily flow data were also aggregated to weekly timesteps.
The weather data were derived from a site close to the given groundwater well, in order to reflect the local weather driver for LSR (Equation (1)). The river flow data were sourced from sites upstream and/or downstream of the given groundwater wells, reflecting the upstream driver and/or downstream hydraulic connection (Equation (2)).

2.3.3. Setup of Model Predictors

The following factors were considered as potential model predictors:
-
weekly P, T, and PET (referred to as P_1, T_1 and PET_1), lagged weekly P, T, and PET with lag times of 2, 4, 9, 13, 17, 21, 26, 30, 40 and 50 weeks (e.g., lagged P is referred to as P_2, P_4 and so on), shifted weekly P, T, and PET with 2, 4, 9, 13, 17, 21, 26, 30, 40 and 50 weeks (e.g., shifted weekly P is referred to as P_s2, P_s4, and so on). Other weather variables (e.g., humidity) were excluded from a preliminary selection based on correlation analysis to reduce the number of model predictors.
-
weekly ratio of P and PET (PoP) with lag times of 2, 4, 9, 13, 17, 21, 26, 30, 40 and 50 weeks (as PoP_2, PoP_4, and so on). PoP is also referred to “dryness” of the weather condition (for a short period) or of climate condition (over a long period).
-
weekly flow data (referred to as Flow_1), lagged weekly flow at 2, 4, 9, 13, 17, 21, 26, 30, 40 and 50 weeks (referred to as Flow_2 and so on), and shifted weekly flow at 2, 4, 9, 13, 17, 21, 26, 30, 40 and 50 weeks (referred to as Flow_s2 as above).
Here, ‘lagged weekly data at n weeks’ means the weekly data averaged over the past n weeks which represent the impact of physical storages (e.g., soil column and river), and ‘shifted weekly data at n weeks’ means the weekly data n weeks ago which gives the direct impact of the variables as compared to the impact of physical storages. The lag times of increasing intervals (i.e., from 2, to 4, 9, 13, 17, 21, 26, 30, 40 and 50) are to distinguish the short and long term effects (the longer the lag times, the smaller difference between corresponding variables). We chose a maximum lag time of 50 weeks (equivalent to rounded 1 year lag time), reflecting lagged impact of weather and flow on the groundwater within 1 year. However longer than 1 year lag time can also be selected if it reflects the reality, and there is a high correlation in the long term lagged weather and flow variables (e.g., we didn’t choose variables with lag time over than 50 which has a correlation higher than 0.8 with the one with lag time 50).

2.3.4. Model Calibration, Validation and Assessment

In hydrological modelling, the selection of the calibration period is very important. Ideally, the calibration period should cover a wide range of weather conditions (e.g., wet and dry years). However, one purpose of this study is to extend naturalised groundwater levels to recent years (i.e., beyond the periods covered by the previously developed numerical models). Therefore, we simply used the 75%:25% data split for calibration and validation periods, i.e., the first 75% of the data were used for the calibration period and the rest for the validation period. The final model was selected through stepwise regression analysis based on Akaike Information Criterion (AIC) and the Bayesian Information Criterion (BIC).
Past literature shows that criteria for model performance assessment are case dependent and there are few generally accepted standards in hydrology, to the best of our knowledge. In accordance with standard procedures in surface water hydrology, the coefficient of determination (R2) and the Nash-Sutcliffe Efficiency (NSE) were used in this study to assess model performance. We didn’t use Kling–Gupta efficiency [36], as “unlike NSE, KGE does not have an inherent benchmark against which flows are compared”, “Modellers using KGE must be specific about the benchmark against which they compare their model performance” [37], and traditional accumulated knowledge on NSE cannot be directly converted KGE. Criteria listed in Table 2 developed by for river flow were used to assess the goodness-of-fit for the performance of the metamodel, in terms of its ability to match Greater Wellington’s previously simulated values for the naturalised groundwater level at each site. These criteria were used in both the calibration and validation periods.

3. Results

3.1. Model Performance

Figure 3 lists metamodel performance indices (NSE) for the calibration and validation periods for the tested sites in all three aquifers. Generally, performance metrics (i.e., R2 not shown) were very similar for the calibration and validation periods, and therefore we opted to classify the performance of the model based on the validation period.
In the Kapiti Coast area, simulations for 7 out of 12 well sites are classified as from “Satisfactory” to “Very Good”, indicating the general effectiveness of the metamodelling approach. Most sites with metamodel fits classified as “Satisfactory” to “Very Good” are located close to the main rivers, indicating the influence of both river flow and weather on the groundwater level dynamics. By contrast, there are five well sites with metamodel fits classified as “Not Satisfactory”. Two plots in the top left of Figure 4 compare the time series of naturalised groundwater levels (estimated by the physically-based models) versus the metamodels for two sites in Kapiti Coast (S25/5528 and S25/5332). Simulation at S25/5228 (classified as “Good”) has good matches to the naturalised groundwater levels, including the dynamics and peaks. At S25/5332 (classified as “Unsatisfactory”) the metamodel fails to reproduce the groundwater level in the validation period (especially the magnitude), despite a good match in the calibration period and good matches to the groundwater dynamics in both the calibration and validation periods.
In the Hutt Valley area, all performance metrics (i.e., R2 and NSE) for all wells are classified as “Very Good”. This indicates the general effectiveness of the metamodelling approach for matching the simulated naturalised groundwater levels derived from the physically-based model. The bottom-left plot in Figure 4 gives the simulation at R27/1115, corroborating the model performance.
In the Wairarapa Valley area, simulations for 23 out of 29 well sites are classified as from “Satisfactory” to “Very Good”, indicating the general effectiveness of the metamodelling approach. Well sites with metamodel fits classified as “Satisfactory” to “Very Good” are located either in the upper valley, in the lower valley, or close to the flow sites. The right three plots in Figure 4 show simulations at three representative sites. These three sites are S26/0033 in the upper valley, S27/0099 in the middle valley, and S27/0434 in the lower valley, which are classified as “Very good”, “Unsatisfactory”, and “Very good”, respectively. Simulations at S26/0033 and S26/0434 have a good match in groundwater level dynamics and peaks (both R2 and NSE are above 0.9 for both calibration period and validation periods). Although the performance at middle valley (“S27/0099”) was classified as “Not satisfactory”, the metamodels match the basic dynamics (peak, recession, and valley) of the naturalised groundwater levels, but the magnitudes differ, especially in the peaks. The metamodel estimates are smoother and the peaks are lagged relative to the simulations from the physically-based models. This difference suggests that the naturalised groundwater levels, as simulated by the physically-based models, are affected by one or more drivers or site-specific complexities that are not adequately captured by the formulation of the metamodels. All these “Not satisfactory” sites are located in the middle valley.

3.2. Influential Factors

Figure 5 shows the t statistic of the regression analysis for the 6 selected sites discussed in Section 3.1.
In the Kapiti coast (S25/5228 and S25/5352), the influential factors to groundwater level vary spatially. At S25/5228, the most two influential factors are ‘PoP_26’ and ‘Flow_1’, where ‘PoP_26’ represents the dryness in the past half-year and ‘Flow_1’ the river flow condition in first week. This indicates that half-year dryness (potentially influencing LSR) significantly impacts the groundwater level, and the interaction with the river is rapid. The impact of precipitation, temperature, and dryness has an over-month lag effect, and the impact of flow is fast (within a week) and can last for up to approximately 1 year (as indicated by Flow_50). At S25/5352, the most influential factor is ‘T_50’ (i.e., the average temperature in the past year), indicating the importance of yearly average warmness. Compared to S25/5228, the lesser fluctuation in groundwater level time series at S25/5332 (Figure 4) reflects the longer-term impact of weather variables over a period of one month or longer.
At R27/1115 (the selected site in Hutt Valley), the most influential factors are “Flow_1”, “Flow_4”, “PoP_9 and “PoP_21”, indicating that the interaction with river flow is also fast (in the order of a week) and lasts for about 4 weeks, and the strong response to dryness can be over 2 months. All weather variables play an important role in the fluctuation of groundwater level, among which the response to rainfall is fast and short-lived (within 2 weeks) while the response to other variables can last 1 year.
In the Wairarapa Valley, different factors affect groundwater levels at different sites. At S26/033 (upper valley), the most two influential factors are ‘PoP_40’ and ‘Flow_1’. This indicates that the impact of dryness (a potential function of LSR) has a relatively long lag, i.e., 40 weeks, to groundwater, and the interaction with river flow is very fast (within a week). The impact of precipitation is also rapid and lasts more than half a year, while other climate variables also have an impact. The impact of flow is not only fast (within 1 week) but also lasts up to 1 year, indicating that surface water leakage contributes to slow moving regional groundwater. In the middle valley (S27/0099), the most influential factor is PoP_40, and there is no relation with river flow. Other climate variables have a lag effect over 1 month. In the lower valley (S27/0434), the most influential factor is “Flow_50”, indicating that the yearly flow condition is highly related to groundwater level, although the interaction with flow starts within a week (i.e., Flow_1). The weather variables have a lagged effect on groundwater level over 1 month. The importance of flow on groundwater level is fast (a week) and continues over 1 year, indicating there is a strong interaction with the river.

4. Discussion

4.1. Model Performance

The metamodel approach provides acceptable fit (from “Satisfactory” to “Very good”) at 36 out of 47 well sites covering the three primary aquifers in the Greater Wellington region in New Zealand. The metamodelling performance is not sensitive to well depth (as in Figure 6). Overall, the results demonstrate that, generally, the metamodels can provide good surrogates of naturalised groundwater levels derived from the three physically-based groundwater models previously developed by Greater Wellington.
However, there are 11 well sites (out of 47) with metamodel fits classified “Not Satisfactory”. These sites are located in the middle Wairarapa Valley and the Kapiti Coast. For the six sites in the middle Wairarapa valley, the metamodel results are mainly sensitive to the weather variables (site S27/0099 in Figure 5), with river flow having little influence on model results, as discussed in Section 3. These weather variables mainly include long term weather variables (e.g., rainfall—“P40”; “PoP40”) which contribute to the main variation of the groundwater level. “Not Satisfactory” results might arise from missing or incorrect interpretation of the geologic features (e.g., folds and faults in the area as illustrated in [18]) in the metamodel. In the Kapiti Coast area, the five sites classified as “Not Satisfactory” are located far away from big rivers (although some sites far away from river are classified as either “Very Good”, “Good”, or “Satisfactory” (e.g., R26/6503 and R26/6520). For these sites, the metamodels are also largely insensitive to river flow. The lack of sensitivity of the metamodels to river flow may arise because the Kapiti Coast is characterised by a long, narrow aquifer in which groundwater levels may be driven less by river flows and LSR and more by ocean and tidal dynamics. Thus, inclusion of tidal dynamics/sea level as a predictor in the model should be investigated to improve metamodel performance. “Not Satisfactory” results in the Kapiti Coast might also arise from missing or incorrect interpretation of the fault in the area (as illustrated in [20]) or other geologic features in the metamodel, similar to these in the Wairarapa Valley.
For both the Wairarapa Valley and Kapiti Coast, metamodel performance may be improved not just by inclusion of additional relevant input variables (e.g., tidal dynamics), but also by allowing for non-linear relationships in the model, although the non-linear relationship has been partially accounted for with different lag times and compound variables (i.e., PoP).
It is also worth noting that the naturalised groundwater level data (used for calibration of the metamodels) were not observed data but simulations from physically-based groundwater models. Therefore, these naturalised data may have biases or inaccuracies, compared to the true natural groundwater levels.

4.2. Importance of Factors

Figure 5 shows the different behaviours of groundwater levels reacting to weather variables and flow at 6 selected sites.
In the Kapiti Coast, sites close to the river (e.g., S25/5228) react rapidly to river flow while there is no reaction for sites away from river (e.g., S25/5332). The corresponding time series of groundwater in Figure 4 also indicates that groundwater level is more dynamic (more fluctuating) at S25/5228 than S25/5332, reflecting the longer-time impact from weather variables.
In the Wairarapa Valley, sites in the middle valley (S27/0099) are similar to S25/5332 in the Kapiti Coast, i.e., they are more influenced by long-term weather variables (instead of river flow and recent (i.e., 2 or 4 week) weather variables), as also observed in the time series plot in Figure 4. In the upper and lower valley, although groundwater levels at both sites (S26/0033 and S27/0434) show similar patterns of sensitivity between the river flow and groundwater fluctuations (Figure 4), the mechanism seems different. In the upper valley, river leakage provides the water sources for the groundwater system, whereas it is the discharge zone to river from groundwater in the lower valley, i.e., Flow_1 is more sensitive than Flow_50 at S26/0033 and the converse is true at S27/0434.

5. Conclusions

We developed a metamodelling approach based on stepwise linear regression to emulate the naturalised groundwater levels that had been previously simulated using physically-based models in the Greater Wellington region of New Zealand.
The metamodels can adequately mimic the naturalised groundwater level dynamics at most sites as simulated by the three physically-based groundwater models. This is shown by good simulation results: model performances for 36 out of 47 (77%) wells were classified from “Satisfactory” to “Very Good”.
The metalmodelling approach can handle the nonlinear groundwater system (regardless well depth), except for some hydrogeologic features (e.g., fold and fault) and the potential impact of sea level in the middle Wairarapa valley and along the Kapiti Coast. This means that the complexity associated with interactions of the aquifer and weather system needs special attention, and some other variables (e.g., sea level) could usefully be included in the metamodelling in the future.
The metamodelling approach can reflect the distinctive relationship between groundwater level with input variables in recharge and discharge zones. For example, in the recharge zone of the Wairarapa Valley (upper valley), groundwater level is more sensitive to short-term flow than long-term flow, whereas the converse is true for the discharge zone (lower valley).
Although some special attention is needed for some sites, this metamodelling framework can be generally applied to other aquifers by preparing the regional and site- specific weather, river flow and groundwater level data. Thus, we suggest that the metamodelling approach demonstrated in this study can be used generally and transferrably to support groundwater resources management.

Author Contributions

Conceptualization, all authors; methodology, all authors; software, J.Y.; validation, J.Y. and C.R.; formal analysis, all authors; investigation, J.Y.; resources, R.M. and M.T.; data curation, J.Y.; writing—original draft preparation, J.Y. and C.R.; writing—review and editing, C.J.D., D.B., R.M. and M.T.; visualization, J.Y.; supervision, C.J.D.; project administration, R.M.; funding acquisition, R.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Greater Wellington, New Zealand, grant number PO268319, and the Strategic Science Investment Fund (SSIF) Programmes of Ministry of Business, Innovation and Employment (MBIE) through NIWA, grant number FWWA2308.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Flow data and groundwater level data can be downloaded from https://github.com/yangjniwa/model_groundwaterlevel/tree/main (accessed on 29 August 2023). For weather data, please contact NIWA for authorisation (https://niwa.co.nz/climate/our-services/virtual-climate-stations; accessed on 29 August 2023).

Acknowledgments

This work was funded by the Greater Wellington, New Zealand, and the Strategic Science Investment Fund (SSIF) Programmes of Ministry of Business, Innovation and Employment (MBIE) through NIWA.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Vrba, J.; van der Gun, J. The World’s Groundwater Resources: Contribution to Chapter 4 of WWDR-2. 2004. Available online: https://www.ircwash.org/sites/default/files/212.0-04WO-18539.pdf (accessed on 29 August 2023).
  2. Velis, M.; Conti, K.I.; Biermann, F. Groundwater and human development: Synergies and trade-offs within the context of the sustainable development goals. Sustain. Sci. 2017, 12, 1007–1017. [Google Scholar] [CrossRef]
  3. Siebert, S.; Burke, J.; Faures, J.M.; Frenken, K.; Hoogeveen, J.; Döll, P.; Portmann, F.T. Groundwater use for irrigation–A global inventory. Hydrol. Earth Syst. Sci. 2010, 14, 1863–1880. [Google Scholar] [CrossRef]
  4. Kløve, B.; Ala-Aho, P.; Bertrand, G.; Boukalova, Z.; Ertürk, A.; Goldscheider, N.; Ilmonen, J.; Karakaya, N.; Kupfersberger, H.; Kvœrner, J.; et al. Groundwater dependent ecosystems. Part I: Hydroecological status and trends. Environ. Sci. Policy 2011, 14, 770–781. [Google Scholar] [CrossRef]
  5. Wada, Y.; Van Beek, L.P.; Van Kempen, C.M.; Reckman, J.W.; Vasak, S.; Bierkens, M.F. Global depletion of groundwater resources. Geophys. Res. Lett. 2010, 37, L20402. [Google Scholar] [CrossRef]
  6. Mays, L.W. Groundwater resources sustainability: Past, present, and future. Water Resour. Manag. 2013, 27, 4409–4424. [Google Scholar] [CrossRef]
  7. Meyer, P.D.; Gee, G.W.; Nicholson, T.J. Information on Hydrologic Conceptual Models, Parameters, Uncertainty Analysis, and Data Sources for Dose Assessments at Decommissioning Sites (No. NUREG/CR-6656; PNNL-13091); Pacific Northwest National Lab. (PNNL): Richland, WA, USA, 2000.
  8. Enemark, T.; Peeters, L.J.; Mallants, D.; Batelaan, O. Hydrogeological conceptual model building and testing: A review. J. Hydrol. 2019, 569, 310–329. [Google Scholar] [CrossRef]
  9. Ferdowsian, R.; Pannell, D.J.; McCarron, C.; Ryder, A.; Crossing, L. Explaining groundwater hydrographs: Separating atypical rainfall events from time trends. Soil Res. 2001, 39, 861–876. [Google Scholar] [CrossRef]
  10. Yan, S.F.; Yu, S.E.; Wu, Y.B.; Pan, D.F.; Dong, J.G. Understanding groundwater table using a statistical model. Water Sci. Eng. 2018, 11, 1–7. [Google Scholar] [CrossRef]
  11. Harbaugh, A.W. MODFLOW-2005: The U.S. Geological Survey Modular Ground-Water Model—The Ground-Water Flow Process, U.S. Geological Survey Techniques and Methods 6-A16; US Department of the Interior, US Geological Survey: Reston, VA, USA, 2005.
  12. Diersch, H.J.G. FEFLOW—Finite Element Modelling of Flow, Mass and Heat Transport in Porous and Fractured Media; Springer: Berlin/Heidelberg, Germany, 2014. [Google Scholar] [CrossRef]
  13. Brunner, P.; Simmons, C.T. HydroGeoSphere: A fully integrated, physically based hydrological model. Ground Water 2012, 50, 170–176. [Google Scholar] [CrossRef]
  14. Keenan, L.; Thompson, M.; Mzila, D. Freshwater Allocation and Availability in the Wellington Region: State and Trends; Publication No. GW/EMI-T-12/141; Greater Wellington Regional Council: Wellington, New Zealand, 2012.
  15. Mzila, D. Groundwater Level Trend Analysis for the Wellington Region: Internal Technical Paper Prepared in Support of State of Environment Reporting; Greater Wellington Region Council: Wellington, New Zealand, 2012.
  16. Gyopari, M. Lower Hutt Aquifer Model Revision (HAM3): Sustainable Management of the Waiwhetu Aquifer; Greater Wellington Regional Council: Wellington, New Zealand, 2014.
  17. Gyopari, M.; McAlister, D. Wairarapa Valley Groundwater Resource Investigation: Upper Valley Catchment Hydrogeology and Modelling; Technical Publication No. GW/EMI-T-10/74; Greater Wellington Regional Council: Wellington, New Zealand, 2010.
  18. Gyopari, M.; McAlister, D. Wairarapa Valley Groundwater Resource Investigation: Middle Valley Catchment Hydrogeology and Modelling; Technical Publication No. GW/EMI-T-10/73; Greater Wellington Regional Council: Wellington, New Zealand, 2010.
  19. Gyopari, M.; McAlister, D. Wairarapa Valley Groundwater Resource Investigation: Lower Valley Catchment Hydrogeology and Modelling; Technical Publication No. GW/EMI-T-10/75; Greater Wellington Regional Council: Wellington, New Zealand, 2010.
  20. Gyopari, M.; Mzila, D.; Hughes, B. Kapiti Coast Groundwater Resource Investigation: Catchment Hydrogeology and Modelling Report; Publication No. GW/ESCI-T-14/92; Greater Wellington Regional Council: Wellington, New Zealand, 2014.
  21. Christelis, V.; Hughes, A.G. Metamodel-assisted analysis of an integrated model composition: An example using linked surface water–groundwater models. Environ. Model. Softw. 2018, 107, 298–306. [Google Scholar] [CrossRef]
  22. Fienen, M.N.; Nolan, B.T.; Feinstein, D.T. Evaluating the sources of water to wells: Three techniques for metamodeling of a groundwater flow model. Environ. Model. Softw. 2016, 77, 95–107. [Google Scholar] [CrossRef]
  23. Asher, M.J.; Croke, B.F.; Jakeman, A.J.; Peeters, L.J. A review of surrogate models and their application to groundwater modeling. Water Resour. Res. 2015, 51, 5957–5973. [Google Scholar] [CrossRef]
  24. Di Salvo, C. Improving Results of Existing Groundwater Numerical Models Using Machine Learning Techniques: A Review. Water 2022, 14, 2307. [Google Scholar] [CrossRef]
  25. Kumar, V.; Holzkaemper, A.; Lerner, D.N. Integrated meta-modelling for decision support in integrated catchment management. In Proceedings of the 5th International Congress on Environmental Modelling and Software, Ottawa, ON, Canada, 5–8 July 2010; Available online: https://scholarsarchive.byu.edu/iemssconference/2010/all/218/ (accessed on 4 September 2023).
  26. Yan, Y.; Feng, C.C.; Wan, M.P.H.; Chang, K.T.T. Multiple Regression and Artificial Neural Network for the Prediction of Crop Pest Risks. In Information Systems for Crisis Response and Management in Mediterranean Countries; ISCRAM-med 2015, Lecture Notes in Business Information Processing; Bellamine, B.S.N., Adam, C., Hanachi, C., Eds.; Springer: Cham, Switzerland, 2015; Volume 233. [Google Scholar]
  27. Tsakiri, K.; Marsellos, A.; Kapetanakis, S. Artificial neural network and multiple linear regression for flood prediction in Mohawk River, New York. Water 2018, 10, 1158. [Google Scholar] [CrossRef]
  28. Osman, A.I.A.; Ahmed, A.N.; Huang, Y.F.; Kumar, P.; Birima, A.H.; Sherif, M.; Sefelnasr, A.; Ebraheemand, A.A.; El-Shafie, A. Past, Present and Perspective Methodology for Groundwater Modeling-Based Machine Learning Approaches. Arch. Comput. Methods Eng. 2022, 29, 3843–3859. [Google Scholar] [CrossRef]
  29. Afrifa, S.; Zhang, T.; Appiahene, P.; Varadarajan, V. Mathematical and Machine Learning Models for Groundwater Level Changes: A Systematic Review and Bibliographic Analysis. Future Internet 2022, 14, 259. [Google Scholar] [CrossRef]
  30. Payne, K.; Chami, P.; Odle, I.; Yawson, D.O.; Paul, J.; Maharaj-Jagdip, A.; Cashman, A. Machine Learning for Surrogate Groundwater Modelling of a Small Carbonate Island. Hydrology 2023, 10, 2. [Google Scholar] [CrossRef]
  31. Razavi, S.; Tolson, B.A.; Burn, D.H. Review of surrogate modeling in water resources. Water Resour. Res. 2012, 48, W07401. [Google Scholar] [CrossRef]
  32. Xu, T.; Liang, F. Machine learning for hydrologic sciences: An introductory overview. Wiley Interdiscip. Rev. Water 2021, 8, e1533. [Google Scholar] [CrossRef]
  33. Beven, K. TOPMODEL: A critique. Hydrol. Process. 1997, 11, 1069–1085. [Google Scholar] [CrossRef]
  34. Aho, K.; Derryberry, D.; Peterson, T. Model selection for ecologists: The worldviews of AIC and BIC. Ecology 2014, 95, 631–636. [Google Scholar] [CrossRef]
  35. Tait, A.; Henderson, R.; Turner, R.; Zheng, X.G. Thin plate smoothing spline interpolation of daily rainfall for New Zealand using a climatological rainfall surface. Int. J. Climatol. 2006, 26, 2097–2115. [Google Scholar] [CrossRef]
  36. Gupta, H.V.; Kling, H.; Yilmaz, K.K.; Martinez, G.F. Decomposition of the mean squared error and NSE performance criteria: Implications for improving hydrological modelling. J. Hydrol. 2009, 377, 80–91. [Google Scholar] [CrossRef]
  37. Knoben, W.J.; Freer, J.E.; Woods, R.A. Inherent benchmark or not? Comparing Nash–Sutcliffe and Kling–Gupta efficiency scores. Hydrol. Earth Syst. Sci. 2019, 23, 4323–4331. [Google Scholar] [CrossRef]
  38. Moriasi, D.N.; Gitau, M.W.; Pai, N.; Daggupati, P. Hydrologic and water quality models: Performance measures and evaluation criteria. Trans. ASABE 2015, 58, 1763–1785. [Google Scholar]
Figure 1. Locations of river flow sites (triangles and alphabets) and 47 wells (black dots and numbers) in the Greater Wellington region. Numbers correspond to well number in the first column of Table 1, and only 7 flow sites were selected for metamodelling. Shaded areas are the aquifers for GWRC’s existing physically-based models.
Figure 1. Locations of river flow sites (triangles and alphabets) and 47 wells (black dots and numbers) in the Greater Wellington region. Numbers correspond to well number in the first column of Table 1, and only 7 flow sites were selected for metamodelling. Shaded areas are the aquifers for GWRC’s existing physically-based models.
Sustainability 15 13393 g001
Figure 2. Metamodelling application procedure.
Figure 2. Metamodelling application procedure.
Sustainability 15 13393 g002
Figure 3. Spatial distribution of metamodel performance for the calibration and validation periods. Number “A/B” shows NSE values with A for calibration period and B for validation period. Black site names indicate metamodel fits classified as from “Satisfactory” to “Very Good”, while purple site names indicate metamodel fits classified as “Not Satisfactory” for the validation period.
Figure 3. Spatial distribution of metamodel performance for the calibration and validation periods. Number “A/B” shows NSE values with A for calibration period and B for validation period. Black site names indicate metamodel fits classified as from “Satisfactory” to “Very Good”, while purple site names indicate metamodel fits classified as “Not Satisfactory” for the validation period.
Sustainability 15 13393 g003
Figure 4. Comparison of naturalised groundwater levels estimated from physical-based models versus metamodels at 6 selected sites.
Figure 4. Comparison of naturalised groundwater levels estimated from physical-based models versus metamodels at 6 selected sites.
Sustainability 15 13393 g004
Figure 5. t statistic in linear regression analysis of each independent variables at selected sites.
Figure 5. t statistic in linear regression analysis of each independent variables at selected sites.
Sustainability 15 13393 g005
Figure 6. Relation of metamodelling performance (in validation) with well depth.
Figure 6. Relation of metamodelling performance (in validation) with well depth.
Sustainability 15 13393 g006
Table 1. Site information of 47 wells selected for groundwater level modelling.
Table 1. Site information of 47 wells selected for groundwater level modelling.
NoSite IDWell Depth (m) 1Risk Category 2AquiferFlow Site
1S25/53329.1LowKapiti coast
2S25/532925.3LowKapiti coast
3R25/522831.7LowKapiti coastA: Otaki at Pukehinau
4S25/5228UnknownLowKapiti coastA: Otaki at Pukehinau
5S25/52586.0LowKapiti coastA: Otaki at Pukehinau
6S25/5208192.0HighKapiti coast
7R25/512313.0LowKapiti coast
8R26/659474.0LowKapiti coastB: Waikanae at WTP
9R26/662615.8HighKapiti coast
10R26/691621.0LowKapiti coastB: Waikanae at WTP
11R26/650314.8LowKapiti coast
12R26/65206.0LowKapiti coast
13R27/111714.4NALower HuttC: Hutt at Birchville
14R27/6386115NALower HuttC: Hutt at Birchville
15R27/111523.47NALower HuttC: Hutt at Birchville
16R27/012029.6LowLower HuttC: Hutt at Birchville
17R27/012226.2LowLower HuttC: Hutt at Birchville
18R27/0320114.6LowLower HuttC: Hutt at Birchville
19T26/024347.5HighUpper Wairarapa ValleyD: Ruamahanga Mt Bruce
20T26/05015.1ModerateUpper Wairarapa ValleyD: Ruamahanga Mt Bruce
21S26/003312.0LowUpper Wairarapa ValleyE: Waingawa at Kaituna
22T26/04299.9LowUpper Wairarapa ValleyE: Waingawa at Kaituna
23S26/003038.0HighUpper Wairarapa ValleyE: Waingawa at Kaituna
24S26/07385.4ModerateMiddle Wairarapa Valley
25S26/074333.0Very highMiddle Wairarapa Valley
26S26/056845.0LowMiddle Wairarapa Valley
27S26/067531.5HighMiddle Wairarapa Valley
28S26/022923.8LowMiddle Wairarapa ValleyE:Waingawa at Kaituna
29S26/023641.4HighMiddle Wairarapa ValleyE: Waingawa at Kaituna
30S26/02427.5LowMiddle Wairarapa ValleyE:Waingawa at Kaituna
31S27/02487.9LowMiddle Wairarapa Valley
32S26/054518.0HighMiddle Wairarapa ValleyF: Waiohine Gorge
33S26/04905.0ModerateMiddle Wairarapa ValleyF: Waiohine Gorge
34T26/032610.0LowMiddle Wairarapa Valley
35S27/02024.8HighLower Wairarapa Valley
36S27/009916.8LowLower Wairarapa Valley
37S27/001266.5Very highLower Wairarapa Valley
38S27/01488.8LowLower Wairarapa Valley
39S27/03469.5LowLower Wairarapa ValleyG: Ruamahanga Waihenga
40S27/038120.9LowLower Wairarapa ValleyG: Ruamahanga Waihenga
41S27/042843.6Very highLower Wairarapa ValleyG: Ruamahanga Waihenga
42S27/043445.2Very highLower Wairarapa ValleyG: Ruamahanga Waihenga
43S27/054219.0ModerateLower Wairarapa ValleyG: Ruamahanga Waihenga
44S27/059444.0LowLower Wairarapa ValleyG: Ruamahanga Waihenga
45S27/057655.5ModerateLower Wairarapa ValleyG: Ruamahanga Waihenga
46R28/000217.0NALower Wairarapa ValleyG: Ruamahanga Waihenga
47S27/057132.0LowLower Wairarapa ValleyG: Ruamahanga Waihenga
1 Well depth means the drilled depth of the well; 2 “NA” in the column means “Not Assessed” in the risk study [15].
Table 2. Performance criteria from Moriasi et al. (2015) [38].
Table 2. Performance criteria from Moriasi et al. (2015) [38].
Performance MetricVery Good Good Satisfactory Not Satisfactory
R2R2 > 0.850.75 < R2 ≤ 0.850.60 < R2 ≤ 0.75R2 ≤ 0.60
NSENSE > 0.800.70 < NSE ≤ 0.800.50 < NSE ≤ 0.70NSE ≤ 0.50
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Yang, J.; Rajanayaka, C.; Daughney, C.J.; Booker, D.; Morris, R.; Thompson, M. Metamodelling of Naturalised Groundwater Levels at a Regional Level in New Zealand. Sustainability 2023, 15, 13393. https://doi.org/10.3390/su151813393

AMA Style

Yang J, Rajanayaka C, Daughney CJ, Booker D, Morris R, Thompson M. Metamodelling of Naturalised Groundwater Levels at a Regional Level in New Zealand. Sustainability. 2023; 15(18):13393. https://doi.org/10.3390/su151813393

Chicago/Turabian Style

Yang, Jing, Channa Rajanayaka, Christopher J. Daughney, Doug Booker, Rebecca Morris, and Mike Thompson. 2023. "Metamodelling of Naturalised Groundwater Levels at a Regional Level in New Zealand" Sustainability 15, no. 18: 13393. https://doi.org/10.3390/su151813393

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop