1. Introduction
Rainfall is an important climate phenomenon that affects social and economic activities in Tanzania. The economy of Tanzania is mainly dependent on rain-fed agriculture, which is highly vulnerable to the amount and distribution of rainfall [
1]. On the other hand, excessive rainfall also has a negative influence on socioeconomic activities in Tanzania, as it can lead to flooding, loss of life, and damage to properties. In Tanzania, rainfall has large temporal and spatial variability [
2]. Therefore, accurate forecasting is necessary. The Tanzania Meteorological Agency (TMA) is a government agency that acts as the country’s authoritative voice for weather and climate-related issues. Among the major responsibilities of the TMA is to provide weather forecasts at all scales to the public.
In order to issue forecasts, Tanzania uses numerical weather prediction (NWP) products provided by global centers obtained through the WMO’s Severe Weather Forecasting Demonstration Project (SWFDP), namely GFS (Global Forecasting System, USA), ECMWF (European Centre for Medium-Range Weather Forecasts, European countries), and UKGM (United Kingdom Global Model, United Kingdom). The TMA also has access to NWP products from other global centers, including the Unified Model (UM) of Korea and ARPEGE of Meteo-France. There are also regional centers that provide model products to other countries, including Tanzania. These include the UM from the RSMC in Pretoria, South Africa; Aire Limitée Adaptation dynamique Développement InterNational (ALADIN) from the Regional Specialized Meteorological Centre (RSMC) for tropical cyclones in the Southwest Indian Ocean in Réunion, France; and WRF from the RSMC in Nairobi, Kenya. At the national level, the TMA also runs a number of regional models, including WRF, Consortium for Small-scale Modeling, and WaveWatch III. As the country is located in a tropical region, the main type of precipitation that affects Tanzania is convective precipitation. This includes showers, thundershowers, hailstones, and sometimes tornadoes or waterspouts. The convective precipitation in the tropics is sometimes difficult to resolve by global models partly because it occurs at small spatial and temporal scales. In order to capture this type of precipitation, an adequate mesoscale model with an appropriate combination of settings is needed. However, most of the rainfall events were not well forecasted by the current global model and regional model. This implies that more suitable numerical model settings are needed to improve the accuracy of the forecasts produced and issued to users and the public.
In this study, the skill of a numerical mesoscale WRF model in forecasting precipitation over Tanzania and the sensitivity of the results of selected physical parameterization schemes and horizontal grid spacing (resolution) during the wet and dry seasons were assessed. Currently, the available settings of the WRF NWP model do not provide accurate forecasts to meet customer satisfaction for all seasons; thus, this research was conducted to improve the quality of the forecast. In addition, following the development and availability of new versions and settings of the WRF system worldwide, this research was conducted to test the sensitivities of the new options for physical parameterization of clouds in the model. This study provided WRF model settings in terms of resolution and physical parameterization schemes (microphysics and cumulus physics) that best resolved weather systems during the wet and dry seasons over Tanzania. The main objective of this study was to simulate and model the rainfall distribution over Tanzania using the WRF model. To achieve this main objective, a number of tasks (specific objectives) were conducted. These included the following: (i) to conduct numerical experiments using the WRF model at different horizontal resolutions and to determine the accuracy of the model, and (ii) to conduct experiments with different physical parameterization schemes (convective schemes and cloud microphysics schemes) to determine the sensitivity of the model.
This paper comprises four sections.
Section 1 provides the background of the study area, rationale of the study, and its objectives.
Section 2 discusses the research methodology of the study. A description of the domain size experiment and evaluation of different physical schemes are indicated in this section.
Section 3 describes the results of the study following the sensitivity tests of the model performance of different physical schemes in forecasting rainfall events.
Section 4 includes the summary and remarks and provides recommendations for further studies.
2. Materials and Methods
2.1. Study Area
Tanzania is an East African nation covering 947,303 km
2 that is located between 1° S to 12° S and 28° E to 42° E in
Figure 1a. Because Tanzania is located in a tropical area, the weather is more controlled by precipitation than by temperature and surface wind, which do not vary much throughout the year. As Tanzania is located in an equatorial climatic region, it often receives a large amount of precipitation during the wet season (October to May) and during the dry season (June to September) each year. Tanzania’s rainfall pattern is divided into two types, namely bimodal rainfall pattern areas (areas receiving two rainfall seasons in one year) and unimodal rainfall pattern areas (areas receiving one rainfall season in one year) in
Figure 1b. In the bimodal areas, there is a short rainfall season from October to December and a long rainfall season from March until May each year, while in unimodal areas there is one long rainfall season from November to May each year.
For the wet season, the cases selected for this study included the heavy rain season of 20–22 December 2011. These were very heavy rainfall events due to strengthening of the inter tropical convergence zone (ITCZ), which is the main rainfall mechanism in the country. It is difficult to predict the onset and location of heavy rainfall events caused by the ITCZ, and most of the available global models were unable to correctly resolve the rainfall. For the dry season, the events of 22–25 July 2014 were used. In this month, although it is drier and cooler in Tanzania, the coastal locations receive prolonged periods of light rainfall due to easterly waves and ridging high-pressure systems. The elevated areas and mountainous areas also normally receive precipitation during this month. These rains are always stratiform in nature, which results from layered clouds, such as stratus, stratocumulus, and altostratus or nimbostratus clouds.
2.2. Rainfall Events and Their Observations
Rainfall events include 20–22 December 2011 for the wet season and 22–25 July 2014 for the dry season.
2.2.1. Heavy Rain Event of 20–22 December 2011
During this heavy rain event, the well-established ITCZ was the dominant weather system. Several locations in the country received heavy rainfall due to the ITCZ. Most of the available global models and the national WRF model of the 19th 0000Z forecasts for these three days showed almost no sign of heavy precipitation over the coastal and central regions, and they forecasted less precipitation over the southern sector of the country. The 24-h forecast issued by the TMA expected only light showers for 20 December 2011.
Figure 2a shows satellite images that indicated deep convective clouds along the coastal, central, western, and northern areas of Tanzania [
3]. During this period, the Northern Hemisphere high-pressure cells, namely the Azores and the Arabian ridge, remained intense. The southern hemisphere high-pressure cells, namely Mascarene and St. Helena shown in
Figure 2b,c, were relatively weak. This resulted in southward movement of the ITCZ towards Tanzania.
2.2.2. Rain Event of 22–25 July 2014
For this event, the dominant weather system was the ridging high. A ridging high is a weather system that occurs in southeastern Africa when the region is dominated by a high-pressure system from the south that extends the ridge northward along the coast, as shown in
Figure 3. Normally, a ridging high is a low-level system and appears during the Southern Hemisphere’s winter (dry season in most areas). It is considered one of the off-season rainfall mechanisms. In most cases, ridging highs are associated with strong maritime (moist) low-level winds as the high-pressure systems are situated behind the frontal system, thereby causing a strong pressure gradient and strong winds. As these strong, moist, low-level winds encounter topographic barriers due to the nature of the topography of southeastern Africa, they deposit a large amount of moisture just before the topographic barriers; as the area is dominated by the high-pressure system, a cluster of low-level stratiform (layered clouds) clouds forms and results in light rainfall that can last for a prolonged period of time, such as an entire day, and causes flooding on the ground. The EUMETAST satellite imagery indicates the presence of clusters of layered clouds oriented north–south along the East African coast advected from the east over the Indian ocean (image not shown). The presence of layered clouds advected from the east indicates the easterly wave pattern [
4]. The synoptic-scale mean sea-level pressure analysis showed that the area was dominated by the ridge (high pressure) and the cold, dry southerly flow, thereby indicating that the clouds were layered (stratiform).
2.2.3. Observations
During the events described in previous subsections, the daily rainfall data have been observed by the TMA’s station network in
Figure 4. These data were observed from both automatic and manned stations. The data contained 24-h accumulated rainfall data as observed at 06:00 UTC on each of the event days. The station observational data were considered the most accurate available data for the case selection and for the verification of the forecasts by the WRF model in this study. Therefore, the rainfall accumulation data from a total of 42 stations were used for the verification of the model performance.
2.3. Default Model Configuration
At the TMA central forecasting office, WRF model version 3.8.1 [
5] was configured for East Africa and the Lake Victoria Basin. The domain was configured in the Mercator map projection with a central latitude of 10° S and longitude of 36° E. It had a one-way nested domain configuration consisting of an outer domain with 15 km resolution as shown in
Figure 5a, and an inner domain with 5 km resolution, as shown in
Figure 5b. The outer domain and inner domain had 200 × 200 and 235 × 160 grid points, respectively. Both had 32 vertical levels with a maximum height of 100 hPa. Also, in order for the mesoscale model to run, the initial and boundary condition data were needed. GFS initial and boundary condition data were downloaded from the NOAA Global Model GFS and were used to initialize the mesoscale WRF model in this study.
For physical parameterization, the default model used the same configuration that is utilized by the WRF-ARW model run by the TMA, Tanzania. The physical parameterization included the multiscale Kain–Fritsch cumulus physics [
6,
7], YSU scheme for planetary boundary layers (PBL) [
8], Lin et al. [
9] cloud microphysics, the Unified Noah land surface model [
10], and RRTM [
11] schemes for shortwave and longwave radiation, respectively.
Table 1 summarizes the default model configuration. The initial and boundary conditions for the domain were obtained from the GFS global model [
12] with 0.5 degree horizontal resolution.
2.4. Experimental Model Configuration
The experimental model configuration used was the same configuration as the control settings, except for the grid spacing, cloud microphysics, and cumulus physics.
Table 2 summarizes the experimental model configuration, and research design configuration. Apart from the 15/5 km outer and inner domain resolution, there were also 12/4 km resolution and 9/3 km resolution for outer and inner domains. For each model resolution, the cloud microphysics and cumulus physics schemes were comprised of seven different experimental configurations including the default model configuration in
Table 1 such that the Kessler and WRF double-moment 6-class [
13] were used for microphysics schemes and the Betts–Miller–Janjic [
14], ensemble Grell [
15,
16], and new simplified Arakawa [
17] schemes were used for the cumulus physics schemes. Note that for each rainfall event, there were a total of 21 runs seen in
Table 2, and that the simulation 1 has the same model configuration of the default seen in
Table 1. There are various parameterization schemes for microphysics including the WRF single-moment 5-class microphysics [
18], single-moment 6-class microphysics [
19], and cumulus parameterizations including old simplified Arakawa–Schubert [
20], Tiedtke [
21], new Tiedtke [
22], and multiscale KF [
23]. However, there are too many combinations to investigate, so we only considered selected the combinations that we are interested in.
2.5. Analysis Procedure
To assess the skill of the model, contingency tables and the associated scores were used to represent the relationship between the forecasted and observed rainfall [
24]. This technique consisted of a two-dimensional contingency table that displayed the discrete joint distribution of the forecasted and observed events in terms of their frequency of occurrence. In order to create the contingency table, a threshold value was first determined; this value had to be exceeded by the observation or the forecast for it to be termed as an event. Once each value of the data set was classified as an event or nonevent, they were distributed according to the intersection between forecast and observation, as indicated in
Table 3.
From
Table 3, the following indexes were calculated to assess the scores of the experiments. These included probability of detection (POD), frequency bias (BIAS), proportion correct (PC), threat score (TS), and false alarm ratio (FAR), which was calculated using the following formulas:
POD, also known as hit rate, is a statistical measure that examines events by measuring the proportion of observed events that were correctly forecasted. It ranges between 0 and 1, whereby 0 represents poor skill and 1 represents perfect skill. BIAS is another statistical measure that compares the frequency of forecasts (forecasted yes) to the frequency of actual occurrences (observed yes). It ranges between 0 and infinity, whereby 1 represents unbiased, BIAS > 1 represents over-forecasting, and BIAS < 1 represents under-forecasting. PC gives the fraction of all forecasts that were correct. It is sensitive to both hits and correct rejections. The value ranges between 0 and 1 and the perfect score is 1. TS, also known as the critical success index (CSI), is a statistical index that is widely used as a performance measure of rare events (e.g., rainfall). It ranges between 0 and 1, whereby TS = 1 represents perfect skill and TS = 0 represents poor skill. FAR gives the fraction of forecast events that were observed to be nonevents. The value varies between 0 and 1 and the perfect score is 0.
In order to compare the performance of different model configurations, the combined skill score was employed [
25]. Such a skill score plays a role in summarizing the above statistical scores that could be used together when there are many model results for different experiments, and the concept of such skill scores gives the readers overall evaluation and direct understandings of all statistical scores. Thus, we slightly modified the combined skill score such that a modified skill score employed the following scores: PC, POD, TS, and FAR. The purposes of the modified skill score below help us to understand the overall performance better and summarize the results for a final decision.
where the values of PC, POD, and TS are always positive and range between 0 and 1. The FAR range is also between 0 and 1. Thus, the result of 1–FAR is always positive and between 0 and 1. Thus, by its definition, the modified skill score above always ranges between 0 and 1. Note that the statistical skill scores are used here to assess overall performance in order to determine the best model configurations. To summarize all of the model performance results, we utilized the Taylor diagram for the combined skill score and FB to represent the overall performance of the model rainfall predictions [
26].
3. Numerical Results
The section below describes the numerical results for model horizontal resolution and physical parameterization. To do so, we first show the Taylor diagram to evaluate the overall performances for all 21 simulations in
Table 2. Then, we analyze the detailed results for the model horizontal resolution and then the effects on physical parameterization. Note that we only show the results with the threshold for an event of 10 mm or above for the wet season and 1 mm or above for the dry season.
Figure 6 shows the results of the combined score and BIAS for the wet season and the dry season, respectively. Note that the blue contour is related to the azimuthal angle, and the black dashed contours are the radial distance from the origin. Hence, when the model agrees with observations, both values of the combined score and BIAS for the models lie nearest to 1. The diagram for the different resolution and physical parameterization schemes indicated that models with the same resolution with different physics showed different performances for both seasons, and predicted differently for the two seasons. However, it is still notable that the square (9/3 km resolution) and triangle (15/5 km resolution) shapes are located closest to the value of 1. In addition, it showed relatively good skill scores of the models with colors of black, magenta, and yellow, respectively corresponding to the physics scheme for simulations 1, 4, and 7.
To show more details for model performances, in
Figure 7 we showed the combined score and BIAS for different resolutions in both wet and dry seasons. Combined scores showed that the models with the high resolution showed relatively better performances during both seasons, while during the dry season, the models with the coarsest resolution also showed good performance. The model BIAS score indicates whether the models systematically favor producing precipitation. Among different combinations of model physical parameterizations, the BIAS for the models with multiscale Kain–Fritsch cumulus physics together with Lin et al. microphysics showed comparatively more accurate scores than others for both seasons. Overall, according to the Taylor diagram and the statistical skill scores of the model simulations for both wet and dry seasons, it was clear that the model outputs with the finest resolution showed predictability that is more acceptable.
3.1. Model Resolution Effect
In this section, we simulated rainfall over Tanzania for both seasons to examine the impact of horizontal grid resolution on model predictability. It is generally assumed that models with higher resolution have higher skill than those with lower resolution in resolving small-scale convective systems. This is because most convective clouds occur at a small horizontal scale ranging from several hundreds of meters to a few kilometers. In this study, this behavior was clearly found for the wet season, but a significant difference was not found for the dry season.
Figure 8 showed that during the wet season, all the resolution settings which were 15/5, 12/4 and 9/3 km grid spacing showed reasonably good forecasts in terms of special coverage of the rainfall systems. Increased resolution especially showed improvements in the special coverage when compared to the low-resolution simulations. This is because during the wet season, when most of the precipitation activities over Tanzania are convective in nature, the horizontal grid spacing improved the skill of the forecast. This behavior was also the case for the other researches [
27]. That is, convective systems take place in small areas and are highly influenced by local features like topography and land cover, so the higher resolution models provide better skills in predicting predominant convective weather systems.
For the dry season however, there was no visible pattern regarding the improvement of the skills of the model, as the model resolution was increased over Tanzania, as shown in
Figure 9. This may have been due to the fact that during the dry season, most of the precipitation activities were stratiform in nature due to the synoptic-scale systems (ridging high and easterly waves). That is, the weather systems dominating the country during the dry season were mainly stratiform, occupying large areas and insubstantially affected by local features like area topography and land cover. Thus, increased resolution may have little impact in predicting stratiform systems. This is similar to the results of the European region [
28]. Even though the model resolutions did not show a big difference for predicting rainfall during the dry season, the results indicated a relatively positive effect for the dry season. Overall, analysis of the rainfall predictions from the experiments showed that increasing model resolution had an overall positive impact on rainfall prediction for both seasons.
Note that the precipitation with 15/5, and 9/3 km resolution are alike, while they are different from that with 12/4 km. That is, the precipitation locations and their magnitudes are not linearly related to the model resolutions. It is generally known that global models overestimate light precipitation and underestimate heavy rainfall due to the low horizontal resolution compared to the scale of the precipitation core [
29]. Also, in some regional studies, the heavy rainfall cores with different resolutions showed slight shifts to different locations and the magnitude of precipitation was underestimated [
30,
31]. This difference might be the effects of the initial conditions, which are significant for short-range forecasts, and higher forecast skill could be achieved either by generating an initial condition via data assimilation schemes or by expanding the region used for the outer domain. In this study, however, the unknown nonlinearity between model resolution and rainfall core has been not articulated yet, and an improvement in forecast skill should be pursed in a subsequent study.
3.2. Impact on Physical Parameterization
The statistical evaluation methods, POD, PC, TS, FAR, and BIAS, for rainfall occurrence and amount are very important. In the previous subsection, the models with the finest resolution respectively revealed better and similar rainfall predictability than the model with coarser resolution for the wet and dry seasons. In this section, we evaluated the effect of the physical parameterization for simulation 11, simulation 12, …, simulation 17, the models that were configured with 3-km resolution for the inner domain and 9-km resolution for the outer domain, as shown in
Table 2. In order to make the statistical results more significant, we added the 5–8 May 2017 heavy rain event for the wet season and the 20–23 August 2017 rain event for the dry seasons.
Table 4 shows averages of the statistical skill scores for the wet and dry seasons for those simulations indicated by simulation 1, simulation 2, …, simulation 7. For example, POD is a statistical measure that relates the likelihood of detection of the meteorological event by the forecast. It is calculated as the ratio of the number of events forecasted to the total number of forecasts (hit/[hit + miss]). In
Table 4, it is evident that for the wet season, the default setting (simulation 1), which was comprised of Lin et al. microphysics and multiscale Kain–Fritsch cumulus physics schemes, had the highest POD (0.59), followed by simulation 4 (Kessler microphysics and new simplified Arakawa cumulus physics schemes), which had an average POD of 0.44. On the other hand, for the dry season, the POD of the events was generally very low. This may have been due to the fact that there were few reports to verify, as most of the stations reported precipitation of less than 1 mm, especially for coastal areas where the number of observation stations is not sufficient. Other statistical scores showed slightly different results such that the highest score of PC, TS, and FAR, and the unbiased one of BIAS for the wet (dry) season was simulation 1 (simulation 5), simulation 1 (simulation 2), simulation 6 (simulations 5,6,7), and simulation 1 (simulation 1), respectively.
To assess overall performance in order to determine best model configurations, the mean absolute distance error (MDE) is defined by
where abs(
x) means the absolute value of
x. Note that the MADE value closest to 1 is the perfect skill score, and MDE measures the average magnitude of the absolute difference between model skill scores and the value 1, of which the combined score and BIAS represent the perfect skill and unbiased score. Thus, the smaller value of MDE indicates the better model predictability.
The purposes of the MDE score were to understand the overall performance better and summarize the results for a final decision. The overall ranking of the model simulations for the wet season showed that the default simulation (simulation 1) was first, followed by simulation 2, and simulation 6 showed the poorest skill. For the dry season, the overall ranking of the model simulations showed that simulation 1 was first, followed by simulation 2. Meanwhile, simulation 6 and simulation 7 showed the poorest skill because the FAR index was not assessable for these simulations due to a large number of false alarms.
4. Conclusions and Remarks
The performance of different schemes on the precipitation systems during the wet and dry seasons over Tanzania is evaluated in this study. Tanzania is located in the tropics, where during the wet season, the dominant weather system is the ITCZ, which produces deep convective systems, including thunderstorms. During the dry season, weather systems dominating the country are mainly stratiform, which occupy a large area and are not affected by local features such as topography and land cover. In order to evaluate the skill of the model predictability for different resolutions and various parameterizations, each model simulation was ranked based on the statistical indexes explained in the above section. For each index, the simulations were ranked for each season, and then the average rank of each simulation was calculated based on all the statistical indexes combined.
Generally, all the resolution settings, which were 15/5, 12/4, and 9/3 km grid spacing, showed reasonably good forecasting in terms of special coverage of the rainfall systems. Increased resolution (i.e., 9/3 km grid spacing) showed a greater improvement in the special coverage than the low-resolution simulations (i.e., 15/5 km grid spacing), especially during the wet season when the weather system was dominated by convective systems [
32]. Convective systems occur in small areas and are highly influenced by local features, such as topography and land cover; thus, the higher resolution models are better at predicting predominantly convective weather systems. For the dry season, there was no visible pattern regarding the improvement in the skill of the model, as the model resolution was increased over Tanzania. As noted above, the dominant weather systems during the dry seasons are stratiform in nature, which always occupy relatively large area thus not being affected by local scale features. Thus, increased resolution had little impact in predicting stratiform systems.
The default simulation containing the Lin et al. microphysics scheme with the multiscale Kain–Fritsch cumulus physics scheme showed greater success at resolving weather systems during the wet season. The microphysics scheme is able to take into consideration the deep convective systems and clouds in all phases, i.e., liquid, ice, and vapor processes; in this research, it was also proved to be better at resolving weather systems during the wet season. Following the findings of this study, it is recommended that the findings of this research be adopted for operational forecasting at the TMA. In addition, further research should be conducted with other physical parameterizations, such as radiation physics and planetary boundary layer physics, among others. Even with these findings, further research should be conducted to improve the physical equations and better resolve the weather systems in the country and the region as a whole.