Next Article in Journal
Enhancing Friction Pendulum Isolation Systems Using Passive and Semi-Active Dampers
Next Article in Special Issue
A Review of the Artificial Neural Network Models for Water Quality Prediction
Previous Article in Journal
Toughening Behavior of Carbon/Epoxy Laminates Interleaved by PSF/PVDF Composite Nanofibers
Previous Article in Special Issue
Simulation of Evapotranspiration at a 3-Minute Time Interval Based on Remote Sensing Data and SEBAL Model
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Comparison of Entropy Methods for an Optimal Rain Gauge Network: A Case Study of Daegu and Gyeongbuk Area in South Korea †

1
Department of Statistics, Daegu University, Gyeongbuk 38453, Korea
2
Korea Institute of Civil Engineering and Building Technology, Goyang 10223, Korea
3
Division of Mathematics and Big Data Science, Daegu University, Gyeongbuk 38453, Korea
*
Author to whom correspondence should be addressed.
This work was supported by Korea Hydro and Nuclear Power Co., Ltd. (NO. 2018-Tech-20).
Appl. Sci. 2020, 10(16), 5620; https://doi.org/10.3390/app10165620
Submission received: 20 July 2020 / Revised: 11 August 2020 / Accepted: 11 August 2020 / Published: 13 August 2020
(This article belongs to the Special Issue Hydrologic and Water Resources Investigations and Modeling)

Abstract

:
To reduce hydrological disasters, it is necessary to operate rain gauge stations at locations where the spatio-temporal characteristics of rainfall can be reflected. Entropy has been widely used to evaluate the designs and uncertainties associated with rain gauge networks. In this study, the optimal rain gauge network in the Daegu and Gyeongbuk area, which requires the efficient use of water resources due to low annual precipitation and severe drought damage, was determined using conditional and joint entropy, and the selected network was quantitatively evaluated using the root mean square error (RMSE). To consider spatial distribution, prediction errors were generated using kriging. Four estimators used in entropy calculations were compared, and weighted entropy was calculated by weighting the precipitation. The optimal number of rain gauge stations was determined by calculating the RMSE reduction and the reduction ratio according to the number of selected rain gauge stations. Our findings show that the results of conditional entropy were better than those of joint entropy. The optimal rain gauge stations showed a tendency wherein peripheral rain gauge stations were selected first, with central stations being added afterward.

1. Introduction

Recently, there has been a significant increase in cases of damage caused by natural disasters globally, due to an increase in extreme weather events caused by climate change. Some of the major causes of socioeconomic damage on the Korean Peninsula are natural disasters related to drought, home rain, flooding, and flooding of dams [1]. Droughts, in particular, are a type of natural disaster that can cause long-term damage. Due to limited annual precipitation, the Daegu and Gyeongbuk area is susceptible to severe drought damage [2]. To respond to the risk of such damage, it is important to efficiently utilize and manage water resources through hydrological analysis. The efficient management of water resources can counteract rain-related natural disasters. Water resources can be efficiently managed by identifying information about the rainfall characteristics by basin and the spatial distribution of rainfall. In other words, water resources can be managed using rain information. No wonder information about rainfall with high observational accuracy is important. We need to construct a rain gauge network with high accuracy. The more rain gauge stations there are, the higher the accuracy of the observations can be. However, the higher the number of rain gauge stations, the higher the cost. A rain gauge station should be properly positioned so that it can obtain a lot of information while minimizing costs to a small number of rain gauge stations. This could also allow a new rain gauge station to be operated at a point whereat observations are needed thought the cost saved. Therefore, the optimal rain gauge network must be constructed based on the spatial distribution of rainfall, and the appropriateness of the constructed network must be evaluated.
Information entropy approaches have been widely used to evaluate the design and uncertainties of networks for collecting hydrological data [3,4,5,6,7,8]. The basis for designing a rain gauge network using entropy is that the network must have the least amount of overlapping information possible, which means that all rain gauge stations must be independent of each other, as far as possible. One study [9] used the maximum entropy value of rainfall information to develop a model that predicts long-term. One study [10] used the maximum value of entropy to determine the threshold of rainfall. One study [11] objectively evaluated the groundwater quality monitoring network of the Gaza strip, Palestine, using the concept of entropy. The study [12] investigated network designs for different Canadian basins using marginal entropy, joint entropy, and transmitted information. The study [13] showed that the entropy theory can help design the optimal water-level station in the Netherlands. The study [14] developed the maximum information minimum redundancy criterion for the evaluation and planning of water level monitoring networks. Additionally, the study [14] used information theory to quantify the common information among all the monitoring stations of the system. The study [15] evaluated the optimal stations for rainfall data using the information theory of Shannon. The optimal gauge station combination was proposed based on the maximum information transmission using joint entropy. The study [4] calculated the uncertainty of hydrological data based on entropy and evaluated the degree of uncertainty reduction. The studies [16,17] evaluated the gauge station network in Louisiana, USA, and defined the optimal number of stations. The study [6] approached the optimal gauge network design by evaluating the information flow measurement between stations based on entropy. The study [18] used entropy to find locations that could maximize optimal watercourse and hydrological information. The study [19] investigated temporal variations in the quantity and uncertainty of rainfall events using entropy. They showed that entropy is a means of probabilistically comparing and evaluating regional precipitation characteristics and that it can be used as a measure of evaluating water resources.
Entropy cannot reflect the spatial distribution of rain gauge networks because it is calculated based on the probability distribution of observation data. As such, studies have been conducted to reflect the spatial distribution of rain gauge stations along with entropy. The study [20] used the minimum variance estimation method of kriging to select the optimal rain gauge station locations in the basin. The study [21] proposed a method for optimal data collection at random locations through the variance reduction analysis based on kriging. Moreover, studies have been conducted using the entropy theory, which is a concept of information quantity that quantitatively represents a reduction in uncertainty. The study [22] used kriging and entropy to evaluate the optimal rain gauge stations considering the spatial distribution of stations. Kriging was used in the reconstruction of the precipitation of a basin to determine the spatial distribution of rain gauge stations. The optimal gauge station locations were then selected from the stations constructed in the basin through joint entropy. The study [23] secured spatiality through rainfall prediction using kriging and determined the minimum number of rain gauge stations required in a Taiwanese basin using joint entropy. The study [24] used kriging and joint entropy for the optimal network design of Shanghai. They created rainfall data using kriging and proposed an optimal network by calculating entropy.
Rain gauge stations have been evaluated using various methods, such as kriging, regression analysis, correlation analysis, and entropy. Entropy-based studies used conditional and joint entropies to select the optimal stations. In addition, several studies focused on the spatiality of stations because the evaluation of stations using entropy is performed through the empirical probabilities of observation data and does not reflect the spatial distribution of stations. However, no quantitative comparison was conducted to assess how much performance the optimal rain gauge network actually has based on entropy. In statistics and meteorology, prediction models are developed and evaluated through differences between the observed and predicted values. In the case of continuous data, the root mean square error (RMSE) is mainly used as a model selection criterion. In this study, several estimators of entropy and weighted entropy were compared. A quantitative comparison of the rain gauge networks selected by various entropy methods allows the design of appropriate rain gauge networks. An entropy-based rain gauge network was quantitatively evaluated by predicting the precipitation values of unselected rain gauge stations based on the stations selected as the optimal rain gauge network using entropy. In addition, the performance of a rain gauge network selected as entropy was evaluated using the RMSEs of all rain gauge network combinations. Additionally, a method of determining the number of rain gauge stations using RMSE was proposed. The number of rain gauge stations that can yield the maximum efficiency was determined by calculating the RMSE reduction and the reduction ratio according to the number of rain gauge stations.
The structure of this paper is as follows. Section 2 presents the kriging theory for considering entropy and spatial distribution and discusses the methods of selecting the optimal rain gauge networks and determining the optimal number of rain gauge stations using RMSE. Entropy theory presents several estimators and weighted entropy. In Section 3, the optimal rain gauge network for 11 rain gauge stations in the Daegu and Gyeongbuk area is evaluated based on entropy, and the determination of the number of rain gauge stations for efficient operation is described. Conclusions are presented in Section 4.

2. Methodology

This section discusses the method of selecting the optimal rain gauge network using entropy. The optimal rain gauge network was selected through the conditional and joint entropies of the long-term precipitation observed from rain gauge stations. Entropy cannot consider the spatial distribution of rain gauge stations because uncertainty is calculated from the probability distribution of precipitation. To consider the spatial distribution, prediction errors were generated using ordinary kriging (OK) and leave-one-out cross validation (LOOCV). Rain gauge stations with small prediction errors were the ones with low entropy. The optimal rain gauge network was selected through the conditional and joint entropies of the prediction errors generated by OK. In addition, the precipitations of unselected stations were predicted using OK, and the optimal number of stations was determined using the RMSE. The operating system of the computer used in this study was Windows 10. The central processing unit (CPU) was i5-7500 and the RAM was 32 GB. The software used for data processing was R version 3.5.2. (Daegu university, Korea).

2.1. Research Flowchart

Figure 1 shows the methods used to select the optimal rain gauge stations considering conditional and joint entropies. Methods A and B were the calculation of conditional entropy from the empirical probability distribution of rain gauge data, and method C was the calculation of joint entropy. In addition, the prediction error of each rain gauge station was generated using kriging and LOOCV. As entropy decreases with an increase in the kriging interpolation performance, kriging and entropy were used for considering both the spatiality of rain gauge stations and the uncertainty of rain gauge data. Methods D and E were the calculation of conditional entropy from the probability distribution of prediction error, and method F was the calculation of joint entropy. Methods A and D were used for obtaining the entropy of unselected rain gauge stations using the stations selected through entropy. Methods B and E, on the other hand, were used for obtaining the entropy of selected rain gauge stations using unselected stations. These methods were calculated and compared for each of the entropy estimators. In addition, the entropy with weight was calculated and compared. We applied the monthly average precipitation, regional precipitation, and regional monthly precipitation as weights. Further, in order to focus on places with little rainfall in each region, 1-regional precipitation was applied as a weight, as opposed to regional precipitation. In addition, conditional and joint entropies based on the number of stations were calculated for each method, and the number that maximized the entropies was determined. The mean absolute error (MAE), RMSE, bias, and correlation coefficient (Corr) were calculated using the predicted precipitations of the rain gauge stations that were not selected in the combination of stations with high entropies. Finally, the optimal number of rain gauge stations was determined using the quality of the RMSE decrease with an increase in the number of stations.

2.2. Entropy

Entropy is a method of quantitatively evaluating the degree of randomness using the probability distribution of data [25]. Entropy includes marginal entropy, conditional entropy, and joint entropy. Marginal entropy proposed by [25] is the amount of uncertainty of discrete random variable X , as shown in the following equation.
H ( X ) = n = 1 N   p ( x n ) ln p ( x n ) ,
where p ( x n ) is the empirical probability of x n and H ( X ) is the uncertainty of marginal entropy X . n is the number of the class intervals of X .
If there are two random variables X and Y , conditional and joint entropies are calculated through conditional and joint probabilities. When variable Y is given, the entropy of variable X is conditional entropy H ( X | Y ) , as shown below.
H ( X | Y ) = n = 1 N   m = 1 M   p ( x n , y m ) ln p ( x n | y m ) ,
where n and m of X and Y are the number of class intervals. P ( x n , y m ) is the joint probability and p ( x n | y m ) is the conditional probability. Joint entropy H ( X , Y ) is calculated from the joint probability of random variables X and Y .
H ( X , Y ) = n = 1 N   m = 1 M   p ( x n , y m ) ln p ( x n , y m ) .
Weights can be applied according to the purpose and study characteristics determined by the experimenter. The weight may be independent of or dependent on p ( x n ) . Weight entropy is given by the following equation [26].
WH ( X ) = n = 1 N   w n p ( x n ) ln p ( x n ) ,
where w n is weight of event. Weight entropy is applicable to conditional and joint entropy. Rain gauge stations with high entropy are important as they have high uncertainties in rainfall observation.
The most common entropy calculation method estimates uncertainty from the empirical probability distribution of data. Several approaches have been developed and used to compensate for the shortcomings of these methods. In this study, three more methods were considered, other than the “emp” method using empirical probability.
“mm” is the bias-corrected empirical entropy estimator proposed by [27]. “shrink” is a shrinkage estimate of the entropy of a Dirichlet probability distribution [28]. “sg” is the entropy of a Dirichlet probability distribution proposed by [29]. In this study, entropy was calculated using the entropy, condentropy, and multiinformation functions of the infotheo package [30].

2.3. Ordinary Kriging (OK)

Kriging is a technique for predicting the values of unobserved points using the weighted combination of known surrounding values. Kriging was developed by [31] and mathematically established by [32]. Kriging defines the spatial interrelationship as a variogram calculated from the given data. The variogram 2 γ ( h ) is a measure of spatial autocorrelation that represents the similarity of data present at a constant distance h .
2 γ ( h ) = E [ ( z ( x ) z ( x + h ) ) 2 ] ,
OK predicts the unknown value of point z 0 using z i , which are n data values already known. OK resolves the bias problem of simple kriging by proposing the sum of weights λ i as 1.
E ( z 0 ) E ( z 0 ^ ) = E ( z 0 ) E ( i = 1 n   λ i z i ) = 0 ,
1 i = 1 n   λ i = 0 ,
Under this condition, weights are obtained so that the error variance can be minimized. The problem of obtaining the minimum and maximum values under constraints is solved by the Lagrange parameter method. The error variance equation of OK is as follows.
L ( λ 1 , λ 2 , , λ n : n ) = σ 2 2 i = 1 n   λ i σ 0 i 2 + i = 1 n   j = 1 n   λ i λ j σ ij 2 + 2 w ( 1 i = 1 n   λ i ) , where σ 2 = Var ( z 0 ) , σ oj 2 = Cov ( z 0 , z j ) , σ ij 2 = Cov ( z i , z j ) .
where w is the Lagrangian factor and L ( λ 1 , λ 2 , , λ n : n ) is the Lagrangian objective function. This study used a variogram based on the exponential model. In this study, OK was performed using the krige function of the gstat package [33].

2.4. Method for Selecting the Optimal Rain Gauge Network

The combination of two or more rain gauge stations that maximizes conditional entropy or joint entropy is considered an optimal combination. In this study, the observation data of unselected rain gauge stations were predicted using the selected rain gauge network and OK to evaluate the suitability of the selected combination of rain gauge stations. This is because the precipitation of unselected stations is quantitatively calculated when the selected rain gauge network is operated. The MAE, RMSE, bias, and Corr calculated from the differences between the observed and predicted values of unselected rain gauge stations were considered as the measures of evaluation.
MAE =   1 n   i = 1 n   ( | z ^ ( x i ) z ( x i ) | ) ,
RMSE =   1 n   i = 1 n   [ z ^ ( x i ) z ( x i ) ] 2 ,
bias =   1 n   i = 1 n   ( z ^ ( x i ) z ( x i ) ) ,
Corr =   i = 1 n   ( z ( x i ) z ( x ) ¯ ) ( z ^ ( x i ) z ^ ( x ) ¯ ) i = 1 n   ( z ( x i ) z ( x ) ¯ ) 2 i = 1 n   ( z ^ ( x i ) z ^ ( x ) ¯ ) 2 ,
where n is the number of data, z ^ ( x i ) is the predicted value at an arbitrary position x i , and z ( x i ) is the observed value. In addition, z ^ ( x i ) ¯ is the average of the predicted values and z ( x i ) ¯ is the average of the observed values.
The optimal combination according to the determined number of stations can be obtained using the maximum value of entropy. A previous study presented a method of obtaining the optimal number of stations using the maximum informational content [34]. The study selected the combination with the maximum informational content among the various combinations of rain gauge stations based on joint entropy. The prediction performance of the selected combination, however, could not be reflected. This study proposes a method of determining the optimal number of stations through the RMSE calculated using OK. As the number of rain gauge stations increases, the RMSE decreases due to the improvement in the spatial interpolation performance. In other words, the appropriate number of rain gauge stations can be determined by evaluating the degree of the RMSE reduction due to the increase in the number of stations.
First, we considered the difference in the RMSE between the optimal combination of i rain gauge stations and that of i 1 stations. The minimum value of the RMSE ( RMSE min ) was obtained when there were n 1 rain gauge stations, and the maximum value of RMSE ( RMSE max ) was obtained when there was one rain gauge station. When the number of rain gauge stations increased by one, the average RMSE reduction was obtained from the range of RMSE ( RMSE max RMSE min ), as follows.
RMSE ¯ =   RMSE max RMSE min n 1 .
Assuming that S i is the reduction ratio of the RMSE of i rain gauge stations, S i is calculated using the following equation.
S i =   RMSE i 1 RMSE i RMSE ¯   ,   where   i = 1 , 2 , , n 1 .
In other words, when S i is larger than 1, the number of rain gauge stations can be considered an efficient number because the RMSE reduction for i rain gauge stations is relatively larger than the average RMSE reduction. In addition, we calculated the RMSE reduction ratio R i for i rain gauge stations. R i represents the RMSE reduction when i rain gauge stations are operated, compared to the total RMSE reduction.
R i = RMSE max RMSE i RMSE max RMSE min   × 100 .
Assuming that x-axis is the number of stations and the y-axis is R i , y = 0 holds for x = 1 and y = 1 for x = n 1 ; the farthest point from the straight line of the two points is the optimal number of stations.

2.5. Study Area and Data

The Daegu and Gyeongbuk area is a large basin that occupies the southeast of the Korean Peninsula. Topographically, the Daegu and Gyeongbuk area appears to be affected by the mountains of the nearby Taebaek mountain range. The limited annual precipitation results in a scarcity of water in the area. Furthermore, climate change has led to an increase in the number of days with heavy rainfall, leading to an increase in disaster risk awareness in the area [35]. Hence, identification of the spatial distribution of rain gauge stations installed in the area and the evaluation of their efficiency are important.
The study used 11 rain gauge stations installed by the Korea Meteorological Administration (KMA) in the Automated Surface Observing System (ASOS) of the Daegu and Gyeongbuk area, as shown in Figure 2. The daily precipitation data from 1988 to 2016 obtained from the meteorological data open portal site (https://data.kma.go.kr) were used as research data. The daily maximum precipitation was 516.4 mm, and the average annual precipitation ranged from 1018.2 to 1328.7 mm. The northern part of the study area had higher precipitation than the southern part (Table 1).

3. Result and Discussion

3.1. Rain Gauge Network Using Entropy

The precipitation data were converted to discrete spaces to calculate entropy. Figure 3 compares the entropies for class intervals of 100, 200, 300, and 400. Figure 3a shows the entropy of the observed precipitation, and Figure 3b shows the entropy of the prediction error. For both cases, the amount of entropy information increased as the class interval increased; however, the pattern of the maximum information amount according to the number of stations was the same. This study was conducted for a fixed class interval of 200.
To find the optimal rain gauge stations of the study area using entropy, the first optimal rain gauge station was selected using marginal entropy. First, entropy was calculated using the basic “emp” method. The rain gauge station installed in Mungyeong was selected as the first station, as its marginal entropy (2.441) was higher than those of other regions. Daegu (2.766) was selected based on the marginal entropy that used the prediction error.
Two or more optimal rain gauge stations can be determined using conditional and joint entropies. As conditional entropy (methods A, B, D, and E) sequentially determines the optimal rain gauge stations, it takes a relatively short time compared with the joint entropy methods. Joint entropy (methods C and F) calculates the entropies of all combinations and selects those with the maximum values. Therefore, they require relatively more computing time than conditional entropy. In addition, the combinations of rain gauge stations were selected irregularly. In contrast, the joint entropy of the prediction error that considered the spatiality of stations sequentially selected the optimal rain gauge stations in a manner similar to conditional entropy (Table 2).
The “mm” and “shrink” methods showed the same results in all cases as the “emp” method (Table A1 and Table A2). The calculated entropy values were different, but the selected rainfall network was the same. However, there was a slight difference in the case of the “sg” method (Table A3). In the process of selecting two rain gauge stations with conditional entropy using prediction error (method E), the rain gauge network was selected differently. As a result, the estimator differed in terms of the estimation method, but there was no significant difference noted in this study.

3.2. Rain Gauge Network Using Weighted Entropy

We also considered weighted entropy, as precipitation varies seasonally, and entropy performance varies depending on the amount of precipitation. On heavy precipitation days, entropy was better to select the optimal rain gauge network. We applied monthly average precipitation, regional precipitation, 1-regional precipitation, and regional monthly precipitation as weights.
Table A4 shows the result of calculating the entropy using the monthly average precipitation as a weight. Methods A and F showed one + (better rank), method C showed two + and one − (worse rank). Method D showed one + and one −, and method E showed three − only. Method B did not show any change. In the calculation of entropy using observation data, we can see that a better rain gauge network was selected. However, it can be seen that the entropy calculation using the prediction error was slightly worse.
Table A5 shows the result of calculating the entropy using regional precipitation as a weight. Method A had one + and four − changes, method B had only two −, and method C had only four −. Methods D and E had two + and seven −, and method F had seven + and two −. Most methods showed worse results; however, method F showed improved results. Nevertheless, this was confirmed as a worse result than that of method B.
Table A6 shows the results of calculating the entropy using 1-regional precipitation as a weight. Method B had one −; method C had one +. Method D had four −, method E had two − and three +, and method F had five −. Method A was unchanged. There were some improvements in methods C and E, but most methods showed poor results.
Table A7 shows the results of calculating the entropy using regional monthly precipitation as a weight. Method A had one +, and Method B had two −. Method C had one + and two −, method D had one + and one −, and method E had two + and two −. Method F remained unchanged. There was a slight improvement in the results, except for those by method F, but there was also a deterioration. Although it looks similar, entropy had a bad appearance in method B, which was considered to be the best rank. The schematic of these results is shown in Table 3 below.

3.3. Discussion

Table 4 shows the top five station combinations by calculating the RMSEs for all combinations. In addition, the RMSEs of the rain gauge network combinations by the number of rain gauge stations are shown in Figure 4. When selecting one rain gauge station, the stations located in the centers of Daegu and Gyeongbuk regions showed excellent prediction performance. However, if one selected more than two rain gauge stations, you can see that stations located outside Daegu and Gyeongbuk were mainly selected. In other words, the station when selecting one rain gauge station was not often selected in more than one combination. Rain gauge stations located in Pohang (3) were frequently selected from a combination of two stations, and rain gauge stations in Bonghwa (5) were also frequently selected. From the combination of three rain gauge stations, Yeongju (6) was added and selected, and from the combination of five rain gauge stations, Uljin (1) can be confirmed to be included in all combinations. Pohang, Bonghwa, Yeongju, and Uljin are rain gauge stations located along the north and the coast in Daegu and Gyeongbuk region. In several combinations of rain gauge stations, Uiseong (9), Gumi (10), and Yeongcheon (11) were added, and the prediction performance of the rain gauge network was improved.
In the case of a rain gauge network using conditional entropy, Mungyeong (7) and Daegu (4) appear to have been selected first, and the performance seems to have deteriorated. In the case of method B, which was judged to be the best method for selecting the rain gauge network using entropy, the performance was improved with the addition of Bonghwa and Yeongju in the north. In other words, it is considered that the rain gauge stations in the north are important locations for rain gauge networks in Daegu and Gyeongbuk.

3.4. Optimal Number of Rain Gauge Stations

For comparing the six methods, the errors generated by predicting the average daily precipitation of the unselected stations based on the selected stations were evaluated through MAE, RMSE, bias, and Corr (Figure 5).
Conditional entropy methods (methods A, B, D, and E) generally exhibited similar results. For joint entropy methods (methods C and F), the results were similar until the number of stations was three; different results were observed for a higher number of stations. The conditional entropy methods exhibited better results than joint entropy because MAE and RMSE for these were lower, and Corr was higher. With regard to the bias, methods A and B showed a tendency to predict lower values than the observed values, while methods C, D, E, and F predicted higher values. For the RMSE, methods A and D exhibited similar results; however, they started to show different results when the number of stations reached nine. The RMSE of method A became lower and exhibited excellent performance, with Pohang being the location of the added station. For method D, Pohang was the last station added. This was because it exhibited the highest daily maximum precipitation of 516.4 mm based on the research data mentioned above. In method B, the rain gauge stations installed in coastal areas were first selected. As for the RMSE values of methods A and B, method A exhibited better results when the number of stations was less than six, and method B when the number of stations was six or more. In other words, methods A and B exhibited relatively good performance.
Table 5 and Figure 6 show the RMSE results for determining the optimal number of stations. S shows the actually decreasing amount under the assumption that RMSE decreases at equal intervals as the number of rain gauge stations increases. In other words, it is meaningful to find the values of S higher than one and higher than the average value. For method A, the numbers of stations where S was higher than one were 2, 3, 9, and 10. The number of stations that exhibited the highest value was two (4.436), followed by nine (1.334), three (1.173), and 10 (1.162). For method B, the numbers of stations where S was higher than one were 2, 4, 6, and 9. The number of stations that showed the highest value was two (4.119), followed by six (1.912), four (1.300), and nine (1.106).
R is the predictive power of i rain gauge stations by comparing the total RMSE reduction with the RMSE reduction when i stations are used. For method A, R was 83.45% and represented more than 80% when the number of stations was nine or higher. For method B, R was 81.26% and represented more than 80% when the number of stations was six or higher. Subsequently, R did not significantly increase even with an increase in the number of stations. Hence, six stations were judged to be appropriate. In addition, when the farthest R values from the average curve were compared, six and seven stations had the best predictive power.
The optimal number of stations was determined to increase the accuracy of observation while minimizing costs. If the minimum number of stations is operated considering the accuracy of rainfall observation, then the optimal number of stations is six. This number was selected based on the stations of method B, because method B exhibited excellent performance for up to six stations. The optimal rain gauge stations installed in the Daegu and Gyeongbuk area were Mungyeong, Pohang, Uljin, Daegu, Yeongdeok, and Bonghwa. Figure 7 shows the optimal rain gauge stations in the order of selection. The selected stations were located on the outskirts of the Daegu and Gyeongbuk area.

4. Conclusions

In this study, the uncertainties were calculated and compared according to the various estimators of entropy. There was no significant difference between the entropy estimators “emp,” “mm,” “shrink,” and “sg.” In addition, weighted entropy was calculated to determine the uncertainty by applying monthly average precipitation, regional precipitation, and regional monthly precipitation as weights. In order to focus on regions with little rainfall, 1-regional precipitation was applied as a weight as opposed to regional precipitation. In this case, there was a tendency to improve, but also a tendency to get worse. There was no significant difference noted when using method B, which showed the best results.
The RMSEs of all combinations of rain gauge stations were calculated to determine the patterns of the top five combinations. The main locations for rain gauge stations were Pohang, Bonghwa, Yeongju, and Uljin, which are located along the north and the coast of Daegu and Gyeongbuk regions. Uiseong, Gumi, and Yeongcheon in the central region can also be considered as locations for rain gauge stations.
The study also proposed a method for determining the optimal number of stations by calculating the RMSE reduction and ( S ) reduction ratio ( R ). The optimal rain gauge network was selected using entropy, and the optimal number of rain gauge stations was determined through RMSE for the efficient operation of rain gauge stations. RMSE was obtained by predicting the unselected rain gauge stations using the observation data of the stations selected according to the optimal number of stations. When the value of S was higher than one according to the number of rain gauge stations, the case with R higher than 80% was selected to determine the optimal number of rain gauge stations. When the optimal number of rain gauge stations was determined for the 11 stations in the Daegu and Gyeongbuk area, 80% performance could be obtained with six stations, and performance similar to that of all stations could be obtained with only six stations. The peripheral stations were selected first, followed by the central stations. The uncertainties of the peripheral stations were found to be higher because of large fluctuations.
It was confirmed that the conditional entropy of observation data was excellent for selecting the optimal rain gauge network and the proportion of RMSE was used for the selection of the optimal number of rain gauge stations. They can contribute to the improvement of the accuracy of hydrological research and the efficient operation of rain gauge stations. Future research will allow efficient rain gauge networks to be designed if radar data are used.

Author Contributions

Conceptualization, J.L. and S.Y. (Sanghoo Yoon); methodology, T.K., J.L. and S.Y. (Sanghoo Yoon); software, S.Y. (Seongsim Yoon); validation, T.K., S.Y. (Seongsim Yoon) and S.Y. (Sanghoo Yoon); formal analysis, T.K. and J.L.; investigation, T.K. and J.L; resources, S.Y. (Seongsim Yoon) and S.Y. (Sanghoo Yoon); data curation, T.K. and S.Y. (Sanghoo Yoon); writing—original draft preparation, T.K. and J.L; writing—review and editing, T.K., S.Y. (Seongsim Yoon) and S.Y. (Sanghoo Yoon); visualization, T.K.; supervision, S.Y. (Seongsim Yoon) and S.Y. (Sanghoo Yoon); project administration, T.K., S.Y. (Seongsim Yoon) and S.Y. (Sanghoo Yoon); funding acquisition, S.Y. (Seongsim Yoon) and S.Y. (Sanghoo Yoon). All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Korea Hydro and Nuclear Power Co., Ltd., grant number 2018-Tech-20.

Acknowledgments

I would like to express my gratitude to all those who contributed to this study.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Table A1. Results of a select optimal rain gauge network of methods (mm).
Table A1. Results of a select optimal rain gauge network of methods (mm).
Num.Conditional Entropy (1)CasesMethod ANum.Conditional Entropy (1)CasesMethod D
12.45511712.779114
21.686107,(11)22.045104,(7)
30.93797,11,(1)31.12594,7,(1)
40.51187,11,1,(5)40.52984,7,1,(10)
50.26077,11,1,5,(4)50.36174,7,1,10,(8)
60.20467,11,1,5,4,(8)60.27264,7,1,10,8,(5)
70.18057,11,1,5,4,8,(6)70.19154,7,1,10,8,5,(11)
80.12747,11,1,5,4,8,6,(10)80.12444,7,1,10,8,5,11,(9)
90.10437,11,1,5,4,8,6,10,(3)90.08134,7,1,10,8,5,11,9,(6)
100.08827,11,1,5,4,8,6,10,3,(2)100.06424,7,1,10,8,5,11,9,6,(2)
110.05117,11,1,5,4,8,6,10,3,2,(9)110.03114,7,1,10,8,5,11,9,6,2,(3)
Computing time: 0.88 sComputing time: 0.97 s
Num.Conditional Entropy (2)CasesMethod BNum.Conditional Entropy (2)CasesMethod E
12.45511712.779114
22.097107,(3)22.164104,(5)
32.72297,3,(1)33.28494,5,(3)
43.26587,3,1,(4)43.67184,5,3,(8)
53.66177,3,1,4,(8)54.12774,5,3,8,(9)
63.83567,3,1,4,8,(5)64.34764,5,3,8,9,(6)
74.10357,3,1,4,8,5,(9)74.45954,5,3,8,9,6,(2)
84.19647,3,1,4,8,5,9,(10)84.42844,5,3,8,9,6,2,(7)
94.26637,3,1,4,8,5,9,10,(6)94.65234,5,3,8,9,6,2,7,(10)
104.25827,3,1,4,8,5,9,10,6,(11)104.75924,5,3,8,9,6,2,7,10,(11)
114.35917,3,1,4,8,5,9,10,6,11,(2)114.86514,5,3,8,9,6,2,7,10,11,(1)
Computing time: 0.56 sComputing time: 0.63 s
Num.Joint EntropyCasesMethod CNum.Joint EntropyCasesMethod F
12.45511712.779114
20.856554,1120.927554,11
32.2531652,6,732.5311654,10,11
43.9403302,5,6,744.5933304,7,10,11
55.7244622,7,9,10,1156.8814621,4,7,10,11
67.5974622,6,7,9,10,1169.1474621,2,4,7,10,11
79.4813302,5,6,7,9,10,11711.3653301,2,4,6,7,10,11
811.3741652,4,5,6,7,9,10,11813.5751651,2,4,6,7,9,10,11
913.008552,4,5,6,7,8,9,10,11915.775551,2,4,6,7,8,9,10,11
1014.722111,2,4,5,6,7,8,9,10,111017.934111,2,4,5,6,7,8,9,10,11
1116.25011,2,3,4,5,6,7,8,9,10,111120.03111,2,3,4,5,6,7,8,9,10,11
Computing time: 23.17 sComputing time: 29.60 s
Table A2. Results of a select optimal rain gauge network of methods (shrink).
Table A2. Results of a select optimal rain gauge network of methods (shrink).
Num.Conditional Entropy (1)CasesMethod ANum.Conditional Entropy (1)CasesMethod D
12.43111712.760114
21.508107,(11)21.878104,(7)
30.82197,11,(1)30.95194,7,(1)
40.43887,11,1,(5)40.43984,7,1,(10)
50.21677,11,1,5,(4)50.31774,7,1,10,(8)
60.17367,11,1,5,4,(8)60.24264,7,1,10,8,(5)
70.15457,11,1,5,4,8,(6)70.16554,7,1,10,8,5,(11)
80.10647,11,1,5,4,8,6,(10)80.10644,7,1,10,8,5,11,(9)
90.08837,11,1,5,4,8,6,10,(3)90.06734,7,1,10,8,5,11,9,(6)
100.07227,11,1,5,4,8,6,10,3,(2)100.04524,7,1,10,8,5,11,9,6,(2)
110.04417,11,1,5,4,8,6,10,3,2,(9)110.02214,7,1,10,8,5,11,9,6,2,(3)
Computing time: 1.06 sComputing time: 1.31 s
Num.Conditional Entropy (2)CasesMethod BNum.Conditional Entropy (2)CasesMethod E
12.43111712.760114
21.963107,(3)22.026104,(5)
32.45597,3,(1)33.00194,5,(3)
42.92287,3,1,(4)43.29684,5,3,(8)
53.27177,3,1,4,(8)53.69474,5,3,8,(9)
63.40067,3,1,4,8,(5)63.87464,5,3,8,9,(6)
73.64257,3,1,4,8,5,(9)73.95054,5,3,8,9,6,(2)
83.71547,3,1,4,8,5,9,(10)83.88844,5,3,8,9,6,2,(7)
93.76337,3,1,4,8,5,9,10,(6)94.08534,5,3,8,9,6,2,7,(10)
103.74227,3,1,4,8,5,9,10,6,(11)104.18124,5,3,8,9,6,2,7,10,(11)
113.82917,3,1,4,8,5,9,10,6,11,(2)114.27114,5,3,8,9,6,2,7,10,11,(1)
Computing time: 0.59 sComputing time: 0.59 s
Num.Joint EntropyCasesMethod CNum.Joint EntropyCasesMethod F
12.43111712.760114
20.972554,1121.056554,11
32.5011652,6,732.8151654,10,11
44.2343302,5,6,744.9563304,7,10,11
56.0404622,7,9,10,1157.2874621,4,7,10,11
67.9264622,6,7,9,10,1169.5734621,2,4,7,10,11
79.8203302,5,6,7,9,10,11711.7973301,2,4,6,7,10,11
811.7161652,4,5,6,7,9,10,11814.0071651,2,4,6,7,9,10,11
913.353552,4,5,6,7,8,9,10,11916.213551,2,4,6,7,8,9,10,11
1015.077111,2,4,5,6,7,8,9,10,111018.371111,2,4,5,6,7,8,9,10,11
1116.60911,2,3,4,5,6,7,8,9,10,111120.46611,2,3,4,5,6,7,8,9,10,11
Computing time: 27.93 sComputing time: 21.57 s
Table A3. Results of a select optimal rain gauge network of methods (sg).
Table A3. Results of a select optimal rain gauge network of methods (sg).
Num.Conditional Entropy (1)CasesMethod ANum.Conditional Entropy (1)CasesMethod D
12.42911712.754114
21.506107,(11)21.873104,(7)
30.81997,11,(1)30.94894,7,(1)
40.43887,11,1,(5)40.43884,7,1,(10)
50.21677,11,1,5,(4)50.31274,7,1,10,(8)
60.17367,11,1,5,4,(8)60.23764,7,1,10,8,(5)
70.15357,11,1,5,4,8,(6)70.61654,7,1,10,8,5,(11)
80.10647,11,1,5,4,8,6,(10)80.10444,7,1,10,8,5,11,(9)
90.08837,11,1,5,4,8,6,10,(3)90.06734,7,1,10,8,5,11,9,(6)
100.07227,11,1,5,4,8,6,10,3,(2)100.04624,7,1,10,8,5,11,9,6,(2)
110.04417,11,1,5,4,8,6,10,3,2,(9)110.02214,7,1,10,8,5,11,9,6,2,(3)
Computing time: 0.89 sComputing time: 1.02 s
Num.Conditional Entropy (2)CasesMethod BNum.Conditional Entropy (2)CasesMethod E
12.42911712.754114
21.961107,(3)22.021104,(3)
32.45297,3,(1)32.80494,3,(5)
42.91887,3,1,(4)43.28384,3,5,(8)
53.26777,3,1,4,(8)53.67974,3,5,8,(9)
63.39667,3,1,4,8,(5)63.85964,3,5,8,9,(6)
73.63857,3,1,4,8,5,(9)73.93654,3,5,8,9,6,(2)
83.71047,3,1,4,8,5,9,(10)83.87044,3,5,8,9,6,2,(7)
93.75837,3,1,4,8,5,9,10,(6)94.06734,3,5,8,9,6,2,7,(10)
103.73627,3,1,4,8,5,9,10,6,(11)104.15924,3,5,8,9,6,2,7,10,(11)
113.82317,3,1,4,8,5,9,10,6,11,(2)114.24514,3,5,8,9,6,2,7,10,11,(1)
Computing time: 0.61 sComputing time: 0.54 s
Num.Joint EntropyCasesMethod CNum.Joint EntropyCasesMethod F
12.42911712.754114
20.971554,1121.056554,11
32.4981652,6,732.8121654,10,11
44.2293302,5,6,744.9533304,7,10,11
56.0354622,7,9,10,1157.2834621,4,7,10,11
67.9194622,6,7,9,10,1169.5654621,2,4,7,10,11
79.8123302,5,6,7,9,10,11711.7863301,2,4,6,7,10,11
811.7061652,4,5,6,7,9,10,11813.9951651,2,4,6,7,9,10,11
913.343552,4,5,6,7,8,9,10,11916.201551,2,4,6,7,8,9,10,11
1015.067111,2,4,5,6,7,8,9,10,111018.362111,2,4,5,6,7,8,9,10,11
1116.59811,2,3,4,5,6,7,8,9,10,111120.45411,2,3,4,5,6,7,8,9,10,11
Computing time: 24.27 sComputing time: 25.14 s
Table A4. Results of a select optimal rain gauge network of methods (weight: monthly average precipitation).
Table A4. Results of a select optimal rain gauge network of methods (weight: monthly average precipitation).
Num.Conditional Entropy (1)RankMethod ANum.Conditional Entropy (1)RankMethod D
14.7397715.07164
21.07237,(11)21.43574,(7)
30.590197,11,(1)30.554174,7,(1)
40.281647,11,1,(5)40.291100(−)4,7,1,(8)
50.13084(+)7,11,1,5,(8)50.21245(+)4,7,1,8,(5)
60.111947,11,1,5,8,(4)60.149544,7,1,8,5,(10)
70.0981767,11,1,5,8,4,(6)70.1011064,7,1,8,5,10,(11)
80.067917,11,1,5,8,4,6,(10)80.061984,7,1,8,5,10,11,(9)
90.055147,11,1,5,8,4,6,10,(3)90.033394,7,1,8,5,10,11,9,(6)
100.04127,11,1,5,8,4,6,10,3,(2)100.016114,7,1,8,5,10,11,9,6,(2)
110.031 7,11,1,5,8,4,6,10,3,2,(9)110.008 4,7,1,8,5,10,11,9,6,2,(3)
Computing time: 0.86 sComputing time: 0.92 s
Num.Conditional Entropy (2)RankMethod BNum.Conditional Entropy (2)RankMethod E
14.7397715.07164
21.58997,(3)21.67551(−)4,(3)
31.810537,3,(1)32.111214,3,(5)
42.137477,3,1,(4)42.413424,3,5,(8)
52.3772557,3,1,4,(8)52.684594,3,5,8,(9)
62.427277,3,1,4,8,(5)62.782124,3,5,8,9,(6)
72.61347,3,1,4,8,5,(9)72.807544,3,5,8,9,6,(2)
82.656157,3,1,4,8,5,9,(10)82.65665(−)4,3,5,8,9,6,2,(10)
92.66217,3,1,4,8,5,9,10,(6)92.72941(−)4,3,5,8,9,6,2,10,(11)
102.61417,3,1,4,8,5,9,10,6,(11)102.801104,3,5,8,9,6,2,10,11,(7)
112.667 7,3,1,4,8,5,9,10,6,11,(2)112.904 4,3,5,8,9,6,2,10,11,7,(1)
Computing time: 0.63 sComputing time: 0.57 s
Num.Joint EntropyRankMethod CNum.Joint EntropyRankMethod F
14.7397715.07164
23.67240(+)2,723.756504,11
37.7841562,6,738.1911614,10,11
411.9813302,5,6,7412.8552754,7,10,11
516.225413(+)2,5,6,7,11517.6352721,4,7,10,11
620.499437(−)2,5,6,7,9,11622.3023401,2,4,7,10,11
724.7843242,5,6,7,9,10,11726.9051891,2,4,6,7,10,11
829.0711652,4,5,6,7,9,10,11831.487105(+)1,2,4,6,7,8,10,11
933.132532,4,5,6,7,8,9,10,11936.077491,2,4,6,7,8,9,10,11
1037.287111,2,4,5,6,7,8,9,10,111040.631111,2,4,5,6,7,8,9,10,11
1141.228 1,2,3,4,5,6,7,8,9,10,111145.082 1,2,3,4,5,6,7,8,9,10,11
Computing time: 24.94 sComputing time: 23.36 s
Table A5. Results of a select optimal rain gauge network of methods (weight: precipitation by station).
Table A5. Results of a select optimal rain gauge network of methods (weight: precipitation by station).
Num.Conditional Entropy (1)RankMethod ANum.Conditional Entropy (1)RankMethod D
11.2297711.1437(−)7
20.80637,(11)20.89735(−)7,(1)
30.548197,11,(1)30.533177,1,(4)
40.379647,11,1,(5)40.432114(−)7,1,4,(6)
50.26953(+)7,11,1,5,(10)50.351172(+)7,1,4,6,(11)
60.218261(−)7,11,1,5,10,(6)60.270216(−)7,1,4,6,11,(2)
70.177231(−F)7,11,1,5,10,6,(4)70.200189(−)7,1,4,6,11,2,(10)
80.138917,11,1,5,10,6,4,(8)80.170132(−)7,1,4,6,11,2,10,(5)
90.12140(−)7,11,1,5,10,6,4,8,(2)90.09940(−)7,1,4,6,11,2,10,5,(8)
100.07511(−)7,11,1,5,10,6,4,8,2,(9)100.0822(+)7,1,4,6,11,2,10,5,8,(3)
110.061 7,11,1,5,10,6,4,8,2,9,(3)110.037 7,1,4,6,11,2,10,5,8,3,(9)
Computing time: 0.53 sComputing time: 0.70 s
Num.Conditional Entropy (2)RankMethod BNum.Conditional Entropy (2)RankMethod E
11.2297711.1437(−)7
21.10297,(3)21.02532(−)7,(9)
31.390537,3,(1)31.393138(−)7,9,(5)
41.806477,3,1,(4)41.784124(−)7,9,5,(8)
52.1432557,3,1,4,(8)51.95940(+)7,9,5,8,(3)
62.321277,3,1,4,8,(5)62.171248(−)7,9,5,8,3,(2)
72.58647,3,1,4,8,5,(9)72.440188(−)7,9,5,8,3,2,(10)
82.677157,3,1,4,8,5,9,(10)82.64693(−)7,9,5,8,3,2,10,(4)
92.74724(−)7,3,1,4,8,5,9,10,(2)92.801307,9,5,8,3,2,10,4,(6)
102.8383(−)7,3,1,4,8,5,9,10,2,(6)102.9923(+)7,9,5,8,3,2,10,4,6,(1)
112.969 7,3,1,4,8,5,9,10,2,6,(11)113.317 7,9,5,8,3,2,10,4,6,1,(11)
Computing time: 0.47 sComputing time: 0.40 s
Num.Joint EntropyRankMethod CNum.Joint EntropyRankMethod F
11.2297711.1437(−)7
20.46954(−)6,720.30954(−)6,7
30.962165(−)5,6,730.61876(+)6,7,11
41.5313302,5,6,741.07196(+)1,6,7,11
52.101462(−)2,5,6,7,951.568172(+)1,4,6,7,11
62.744462(−)2,5,6,7,9,1062.073204(+)1,4,6,7,10,11
73.4383242,5,6,7,9,10,1172.6231891,2,4,6,7,10,11
84.1511652,4,5,6,7,9,10,1183.14130(+)1,2,3,4,6,7,10,11
94.712532,4,5,6,7,8,9,10,1193.64629(+)1,2,3,4,6,7,9,10,11
105.336111,2,4,5,6,7,8,9,10,11104.1546(+)1,2,3,4,5,6,7,9,10,11
115.850 1,2,3,4,5,6,7,8,9,10,11114.624 1,2,3,4,5,6,7,8,9,10,11
Computing time: 19.20 sComputing time: 18.75 s
Table A6. Results of a select optimal rain gauge network of methods (weight: precipitation by 1-station).
Table A6. Results of a select optimal rain gauge network of methods (weight: precipitation by 1-station).
Num.Conditional Entropy (1)RankMethod ANum.Conditional Entropy (1)RankMethod D
12.3637712.70064
21.58037,(11)21.81074,(7)
30.893197,11,(1)30.98289(−)4,7,(10)
40.490647,11,1,(5)40.554984,7,10,(1)
50.2461257,11,1,5,(4)50.3742414,7,10,1,(8)
60.194947,11,1,5,4,(8)60.283363(−)4,7,10,1,8,(11)
70.1721767,11,1,5,4,8,(6)70.2331064,7,10,1,8,11,(5)
80.123917,11,1,5,4,8,6,(10)80.131984,7,10,1,8,11,5,(9)
90.099147,11,1,5,4,8,6,10,(3)90.08350(−)4,7,10,1,8,11,5,9,(2)
100.08327,11,1,5,4,8,6,10,3,(2)100.064114,7,10,1,8,11,5,9,2,(6)
110.048 7,11,1,5,4,8,6,10,3,2,(9)110.032 4,7,10,1,8,11,5,9,2,6,(3)
Computing time: 1.05 sComputing time: 1.81 s
Num.Conditional Entropy (2)RankMethod BNum.Conditional Entropy (2)RankMethod E
12.3637712.70064
21.97197,(3)22.08151(−)4,(3)
32.552537,3,(1)32.877214,3,(5)
43.074477,3,1,(4)43.53387(−)4,3,5,(6)
53.4502557,3,1,4,(8)53.72243(+)4,3,5,6,(8)
63.620277,3,1,4,8,(5)64.074124,3,5,6,8,(9)
73.87114(−)7,3,1,4,8,5,(10)74.234544,3,5,6,8,9,(2)
83.991157,3,1,4,8,5,10,(9)84.334574,3,5,6,8,9,2,(7)
94.03217,3,1,4,8,5,10,9,(6)94.5607(+)4,3,5,6,8,9,2,7,(1)
104.00917,3,1,4,8,5,10,9,6,(11)104.5255(+)4,3,5,6,8,9,2,7,1,(11)
114.098 7,3,1,4,8,5,10,9,6,11,(2)114.621 4,3,5,6,8,9,2,7,1,11,(10)
Computing time: 0.90 sComputing time: 1.08 s
Num.Joint EntropyRankMethod CNum.Joint EntropyRankMethod F
12.3637712.70064
20.894504,1120.960504,11
32.2731562,6,732.5631614,10,11
43.9013302,5,6,744.448295(−)2,4,10,11
55.6584362,7,9,10,1156.525419(−)2,4,7,10,11
67.4724232,6,7,9,10,1168.6573401,2,4,7,10,11
79.305323(+)2,4,6,7,9,10,11710.796308(−)1,2,4,7,9,10,11
811.1511652,4,5,6,7,9,10,11812.924162(−)1,2,4,7,8,9,10,11
912.746532,4,5,6,7,8,9,10,11914.99550(−)1,2,4,5,7,8,9,10,11
1014.420111,2,4,5,6,7,8,9,10,111016.981111,2,4,5,6,7,8,9,10,11
1115.907 1,2,3,4,5,6,7,8,9,10,111118.924 1,2,3,4,5,6,7,8,9,10,11
Computing time: 26.61 sComputing time: 32.86 s
Table A7. Results of a select optimal rain gauge network of methods (weight: monthly average precipitation by station).
Table A7. Results of a select optimal rain gauge network of methods (weight: monthly average precipitation by station).
Num.Conditional Entropy (1)RankMethod ANum.Conditional Entropy (1)RankMethod D
14.6537715.07264
21.16037,(11)21.43574,(7)
30.588197,11,(1)30.554174,7,(1)
40.281647,11,1,(5)40.291100(−)4,7,1,(8)
50.13084(+)7,11,1,5,(8)50.21245(+)4,7,1,8,(5)
60.111947,11,1,5,8,(4)60.149544,7,1,8,5,(10)
70.0981767,11,1,5,8,4,(6)70.1011064,7,1,8,5,10,(11)
80.067917,11,1,5,8,4,6,(10)80.061984,7,1,8,5,10,11,(9)
90.055147,11,1,5,8,4,6,10,(3)90.033394,7,1,8,5,10,11,9,(6)
100.04127,11,1,5,8,4,6,10,3,(2)100.016114,7,1,8,5,10,11,9,6,(2)
110.031 7,11,1,5,8,4,6,10,3,2,(9)110.008 4,7,1,8,5,10,11,9,6,2,(3)
Computing time: 0.87 sComputing time: 0.90 s
Num.Conditional Entropy (2)RankMethod BNum.Conditional Entropy (2)RankMethod E
14.6537715.07264
21.63397,(3)21.68451(−)4,(3)
31.867537,3,(1)32.113214,3,(5)
42.240477,3,1,(4)42.419424,3,5,(8)
52.3982557,3,1,4,(8)52.686594,3,5,8,(9)
62.448277,3,1,4,8,(5)62.784124,3,5,8,9,(6)
72.64314(−)7,3,1,4,8,5,(10)72.807544,3,5,8,9,6,(2)
82.698157,3,1,4,8,5,10,(9)82.66565(−)4,3,5,8,9,6,2,(10)
92.66617,3,1,4,8,5,10,9,(6)92.73211(+)4,3,5,8,9,6,2,10,(1)
102.6833(−)7,3,1,4,8,5,10,9,6,(2)102.8307(+)4,3,5,8,9,6,2,10,1,(11)
112.675 7,3,1,4,8,5,10,9,6,2,(11)112.881 4,3,5,8,9,6,2,10,1,11,(7)
Computing time: 0.95 sComputing time: 0.59 s
Num.Joint EntropyRankMethod CNum.Joint EntropyRankMethod F
14.6537715.07264
23.54854(−)6,723.754504,11
37.6101562,6,738.1801614,10,11
411.7853302,5,6,7412.8422754,7,10,11
516.003413(+)2,5,6,7,11517.6132721,4,7,10,11
620.265437(−)2,5,6,7,9,11622.2813401,2,4,7,10,11
724.5123242,5,6,7,9,10,11726.8821891,2,4,6,7,10,11
828.6951652,4,5,6,7,9,10,11831.4591441,2,4,6,7,9,10,11
932.736532,4,5,6,7,8,9,10,11936.046491,2,4,6,7,8,9,10,11
1036.834111,2,4,5,6,7,8,9,10,111040.598111,2,4,5,6,7,8,9,10,11
1140.731 1,2,3,4,5,6,7,8,9,10,111145.041 1,2,3,4,5,6,7,8,9,10,11
Computing time: 25.30 sComputing time: 25.14 s

References

  1. Lee, M.; Hong, J.H.; Kim, K.Y. Estimating damage costs from natural disasters in Korea. Nat. Hazards Rev. 2017, 18, 04017016. [Google Scholar] [CrossRef]
  2. Lee, J.H.; Kim, C.J. Derivation of drought severity-duration-frequency curves using drought frequency analysis. J. Korea Water Resour. Assoc. 2011, 44, 889–902. [Google Scholar] [CrossRef] [Green Version]
  3. Amorocho, J.; Espildora, B. Entropy in the assessment of uncertainty in hydrologic systems and models. Water Resour. Res. 1973, 9, 1511–1522. [Google Scholar] [CrossRef]
  4. Chapman, T.G. Entropy as a measure of hydrologic data uncertainty and model performance. J. Hydrol. 1986, 85, 111–126. [Google Scholar] [CrossRef]
  5. Harmancioglu, N.; Yevjevich, V. Transfer of hydrologic information among river points. J. Hydrol. 1987, 91, 103–118. [Google Scholar] [CrossRef]
  6. Yang, Y.; Burn, D.H. An entropy approach to data collection network design. J. Hydrol. 1994, 157, 307–324. [Google Scholar] [CrossRef]
  7. Singh, V.P. The use of entropy in hydrology and water resources. Hydrol. Process. 1997, 11, 587–626. [Google Scholar] [CrossRef]
  8. Kawachi, T.; Maruyama, T.; Singh, V.P. Rainfall entropy for delineation of water resources zones in Japan. J. Hydrol. 2001, 246, 36–44. [Google Scholar] [CrossRef]
  9. Krstanovic, P.F.; Singh, V.P. A univariate model for long-term streamflow forecasting. Stoch. Hydrol. Hydraul. 1991, 5, 189–205. [Google Scholar] [CrossRef]
  10. Montesarchio, V.; Ridolfi, E.; Russo, F.; Napolitano, F. Rainfall threshold definition using an entropy decision approach and radar data. Nat. Hazards Earth Syst. Sci. 2011, 7, 2061–2074. [Google Scholar] [CrossRef] [Green Version]
  11. Mogheir, Y.; Singh, V.P.; De Lima, J.L.M.P. Spatial assessment and redesign of a groundwater quality monitoring network using entropy theory, Gaza strip, Palestine. Hydrogeol. J. 2006, 14, 700–712. [Google Scholar] [CrossRef] [Green Version]
  12. Mishra, A.K.; Coulibaly, P. Hydrometric network evaluation for Canadian watersheds. J. Hydrol. 2010, 380, 420–437. [Google Scholar] [CrossRef]
  13. Alfonso, L.; Lobbrecht, A.; Price, R. Information theory-based approach for location of monitoring water level gauges in polders. Water Resour. Res. 2010, 46, 374–381. [Google Scholar] [CrossRef]
  14. Li, C.; Singh, V.P.; Mishra, A.K. Entropy theory-based criterion for hydrometric network evaluation and design: Maximum information minimum redundancy. Water Resour. Res. 2012, 48. [Google Scholar] [CrossRef]
  15. Caselton, W.F.; Husain, T. Hydrologic networks: Information transmission. J. Water Resour. Plan. Manag. Div. 1980, 106, 503–520. [Google Scholar]
  16. Krstanovic, P.F.; Singh, V.P. Evaluation of rainfall networks using entropy: I. Theoretical development. Water Resour. Manag. 1992, 6, 279–293. [Google Scholar] [CrossRef]
  17. Krstanovic, P.F.; Singh, V.P. Evaluation of rainfall networks using entropy: II. Application. Water Resour. Manag. 1992, 6, 295–314. [Google Scholar] [CrossRef]
  18. Al-Zahrani, M.; Husain, T. An algorithm for designing a precipitation network in the south-western region of Saudi Arabia. J. Hydrol. 1998, 205, 205–216. [Google Scholar] [CrossRef]
  19. Maruyama, T.; Kawachi, T. Evaluation of rainfall characteristics using entropy. J. Rainwater Catchment Syst. 1998, 4, 7–10. [Google Scholar] [CrossRef] [Green Version]
  20. Bastin, G.; Lorent, B.; Duque, C.; Gevers, M. Optimal estimation of the average rainfall and optimal selection of rain gauge locations. Water Resour. Res. 1984, 20, 463–470. [Google Scholar] [CrossRef]
  21. Rouhani, S. Variance reduction analysis. Water Resour. Res. 1985, 21, 837–846. [Google Scholar] [CrossRef]
  22. Awadallah, A.G. Selecting optimum locations of rainfall stations using kriging and entropy. Int. J. Civ. Environ. Eng. 2012, 12, 36–41. [Google Scholar]
  23. Chen, Y.C.; Wei, C.; Yeh, H.C. Rainfall network design using kriging and entropy. Hydrol. Process. Int. J. 2008, 22, 340–346. [Google Scholar] [CrossRef]
  24. Xu, P.; Wang, D.; Singh, V.P.; Wang, Y.; Wu, J.; Wang, L.; Zou, X.; Liu, J.; Zou, Y.; He, R. A kriging and entropy-based approach to rain gauge network design. Environ. Res. 2018, 161, 61–75. [Google Scholar] [CrossRef]
  25. Shannon, C.E. A mathematical theory of communication. Bell Syst. Tech. J. 1948, 27, 379–423. [Google Scholar] [CrossRef] [Green Version]
  26. Guiaşu, S. Weighted entropy. Rep. Math. Phys. 1971, 2, 165–179. [Google Scholar] [CrossRef]
  27. Miller, G. Note on the bias of information estimates. Available online: https://www.scienceopen.com/document?vid=357d299f-62fa-4bda-8dd2-e4d5b5abde5d (accessed on 20 July 2020).
  28. Hausser, J.; Strimmer, K. Entropy inference and the James-Stein estimator, with application to nonlinear gene association networks. J. Mach. Learn. Res. 2009, 10, 1469–1484. [Google Scholar]
  29. Schürmann, T.; Grassberger, P. Entropy estimation of symbol sequences. Chaos Interdiscip. J. Nonlinear Sci. 1996, 6, 414–427. [Google Scholar] [CrossRef]
  30. Meyer, P.E. Information-theoretic variable selection and network inference from microarray data. Univ. Libre Brux. 2008, 2, 261–274. [Google Scholar]
  31. Krige, D.G. A statistical approach to some basic mine valuation problems on the Witwatersrand. J. S. Afr. Inst. Min. Metall. 1951, 52, 119–139. [Google Scholar]
  32. Matheron, G. Principles of geostatistics. Econ. Geol. 1963, 58, 1246–1266. [Google Scholar] [CrossRef]
  33. Pebesma, E.J. Multivariable geostatistics in S: The gstat package. Comput. Geosci. 2004, 30, 683–691. [Google Scholar] [CrossRef]
  34. Lee, J.H.; Byun, H.; Kim, H.S.; Jun, H.D. Evaluation of a raingauge network considering the spatial distribution characteristics and entropy: A case study of Imha dam basin. J. Korean Soc. Hazard. Mitig. 2013, 13, 217–226. [Google Scholar] [CrossRef] [Green Version]
  35. Son, C.; Lee, S. Changes in precipitation characteristics over inland of Yeongnam region. J. Clim. Res. 2011, 6, 121–142. [Google Scholar]
Figure 1. Research flowchart.
Figure 1. Research flowchart.
Applsci 10 05620 g001
Figure 2. Location of rain gauge station in Daegu and Gyeongbuk (ASOS).
Figure 2. Location of rain gauge station in Daegu and Gyeongbuk (ASOS).
Applsci 10 05620 g002
Figure 3. Maximum information of empirical probability distribution: (a) observation data; (b) prediction error.
Figure 3. Maximum information of empirical probability distribution: (a) observation data; (b) prediction error.
Applsci 10 05620 g003
Figure 4. The performances of all combination gauge networks.
Figure 4. The performances of all combination gauge networks.
Applsci 10 05620 g004
Figure 5. Result of MAE, RMSE, bias, and correlation coefficient (Corr) variation according to the numbers of stations for the rain gauge network calculated by different methods.
Figure 5. Result of MAE, RMSE, bias, and correlation coefficient (Corr) variation according to the numbers of stations for the rain gauge network calculated by different methods.
Applsci 10 05620 g005
Figure 6. Optimal number of rain gauge networks using RMSE: (a) reduction of RMSE (S); (b) ratio of RMSE (R).
Figure 6. Optimal number of rain gauge networks using RMSE: (a) reduction of RMSE (S); (b) ratio of RMSE (R).
Applsci 10 05620 g006
Figure 7. Results of the optimal rain gauge network selection.
Figure 7. Results of the optimal rain gauge network selection.
Applsci 10 05620 g007
Table 1. Summary of ASOS.
Table 1. Summary of ASOS.
No.LocationsPeriodLongitudeLatitudeDaily Maximum PrecipitationAverage Annual Precipitation
1Uljin1988~201636.992129.413279.01171.1
2Andong36.573128.707170.71047.1
3Pohang36.032129.380516.41194.5
4Daegu35.878128.653225.81077.6
5Bonghwa36.944128.914250.01190.2
6Yeongju36.872128.517259.51328.7
7Mungyeong36.627128.148161.01281.5
8Yeongdeok36.533129.409296.01085.2
9Uiseong36.356128.688215.51018.2
10Gumi36.130128.320221.01091.9
11Yeongcheon35.977128.951159.01081.7
Table 2. Results of a select optimal rain gauge network of methods (emp).
Table 2. Results of a select optimal rain gauge network of methods (emp).
Num.Conditional Entropy (1)CasesMethod ANum.Conditional Entropy (1)CasesMethod D
12.44111712.766114
21.590107,(11)21.953104,(7)
30.87397,11,(1)31.02994,7,(1)
40.47187,11,1,(5)40.47984,7,1,(10)
50.23677,11,1,5,(4)50.33474,7,1,10,(8)
60.18767,11,1,5,4,(8)60.25364,7,1,10,8,(5)
70.16657,11,1,5,4,8,(6)70.17554,7,1,10,8,5,(11)
80.11647,11,1,5,4,8,6,(10)80.11344,7,1,10,8,5,11,(9)
90.09537,11,1,5,4,8,6,10,(3)90.07434,7,1,10,8,5,11,9,(6)
100.07927,11,1,5,4,8,6,10,3,(2)100.05424,7,1,10,8,5,11,9,6,(2)
110.04717,11,1,5,4,8,6,10,3,2,(9)110.02614,7,1,10,8,5,11,9,6,2,(3)
Computing time: 1.00 sComputing time: 1.03 s
Num.Conditional Entropy (2)CasesMethod BNum.Conditional Entropy (2)CasesMethod E
12.44111712.766114
22.025107,(3)22.087104,(5)
32.57697,3,(1)33.12794,5,(3)
43.07887,3,1,(4)43.46284,5,3,(8)
53.44877,3,1,4,(8)53.88574,5,3,8,(9)
63.59867,3,1,4,8,(5)64.08364,5,3,8,9,(6)
73.85157,3,1,4,8,5,(9)74.17654,5,3,8,9,6,(2)
83.93347,3,1,4,8,5,9,(10)84.12644,5,3,8,9,6,2,(7)
93.99137,3,1,4,8,5,9,10,(6)94.33534,5,3,8,9,6,2,7,(10)
103.97627,3,1,4,8,5,9,10,6,(11)104.43524,5,3,8,9,6,2,7,10,(11)
114.06917,3,1,4,8,5,9,10,6,11,(2)114.53014,5,3,8,9,6,2,7,10,11,(1)
Computing time: 0.52 sComputing time: 0.52 s
Num.Joint EntropyCasesMethod CNum.Joint EntropyCasesMethod F
12.44111712.766114
20.918554,1120.996554,11
32.3871652,6,732.6831654,10,11
44.0983302,5,6,744.7893304,7,10,11
55.8954622,7,9,10,1157.1014621,4,7,10,11
67.7744622,6,7,9,10,1169.3764621,2,4,7,10,11
79.6643302,5,6,7,9,10,11711.5963301,2,4,6,7,10,11
811.5581652,4,5,6,7,9,10,11813.8061651,2,4,6,7,9,10,11
913.193552,4,5,6,7,8,9,10,11916.010551,2,4,6,7,8,9,10,11
1014.913111,2,4,5,6,7,8,9,10,111018.170111,2,4,5,6,7,8,9,10,11
1116.44311,2,3,4,5,6,7,8,9,10,111120.26511,2,3,4,5,6,7,8,9,10,11
Computing time: 24.79 sComputing time: 23.67 s
Table 3. Results of increase and decrease of rank after application of weighted entropy.
Table 3. Results of increase and decrease of rank after application of weighted entropy.
Weight: Monthly Average PrecipitationWeight: Precipitation by 1-Station
Method+Method+
A10A00
B00B01
C21C10
D11D03
E03E32
F10F05
Weight: precipitation by stationWeight: monthly average precipitation by station
Method+Method+
A14A10
B02B02
C04C12
D27D11
E27E22
F72F00
Table 4. The best of all combinations of RMSE.
Table 4. The best of all combinations of RMSE.
Num.RankCombinationRMSENum.RankCombinationRMSE
11211.62611,3,4,5,6,96.94
2911.7721,3,4,5,7,97.00
31012.2331,3,4,6,7,97.06
41112.5041,3,4,5,7,107.10
5812.7651,3,5,6,9,117.13
216,119.57711,3,4,5,6,8,96.86
22,39.7521,3,4,5,6,7,96.89
37,119.7731,2,3,4,5,6,106.99
43,59.8241,3,4,5,7,8,97.02
55,119.8451,3,4,5,6,9,107.03
313,6,98.20811,3,4,5,6,7,8,96.66
24,6,88.3821,3,4,5,6,7,9,106.75
36,8,118.5331,2,3,4,5,6,8,106.88
44,5,88.5441,3,5,6,7,9,10,116.88
53,6,108.5651,2,3,4,5,6,7,106.90
411,3,6,97.72911,3,4,5,6,7,8,9,106.32
23,4,6,97.9321,2,3,4,5,6,7,8,106.52
31,3,4,67.9831,3,5,6,7,8,9,10,116.60
42,3,4,67.9841,3,4,5,6,7,8,9,116.70
51,3,6,108.0351,2,3,5,6,7,8,10,116.81
511,3,4,6,97.151011,3,4,5,6,7,8,9,10,116.19
21,2,3,4,67.2921,2,3,4,5,6,7,8,10,116.45
31,3,6,9,117.3031,2,3,4,5,6,7,8,9,106.58
41,3,4,5,77.4641,2,3,5,6,7,8,9,10,117.16
51,2,3,6,117.5351,2,3,4,5,6,7,8,9,117.29
Table 5. RMSE result to determine the optimal number of stations for the rain gauge network.
Table 5. RMSE result to determine the optimal number of stations for the rain gauge network.
Method A Method B
No. S R No. S R
24.43649.28%24.11945.77%
31.17362.32%30.65253.02%
40.33766.05%41.30067.46%
50.22068.50%5−0.67060.01%
60.38672.79%61.91281.26%
7−0.61365.98%70.62788.23%
80.23868.62%8−0.20685.94%
91.33483.45%91.10698.23%
101.16296.37%100.159100.00%

Share and Cite

MDPI and ACS Style

Kwon, T.; Lim, J.; Yoon, S.; Yoon, S. Comparison of Entropy Methods for an Optimal Rain Gauge Network: A Case Study of Daegu and Gyeongbuk Area in South Korea. Appl. Sci. 2020, 10, 5620. https://doi.org/10.3390/app10165620

AMA Style

Kwon T, Lim J, Yoon S, Yoon S. Comparison of Entropy Methods for an Optimal Rain Gauge Network: A Case Study of Daegu and Gyeongbuk Area in South Korea. Applied Sciences. 2020; 10(16):5620. https://doi.org/10.3390/app10165620

Chicago/Turabian Style

Kwon, Taeyong, Junghyun Lim, Seongsim Yoon, and Sanghoo Yoon. 2020. "Comparison of Entropy Methods for an Optimal Rain Gauge Network: A Case Study of Daegu and Gyeongbuk Area in South Korea" Applied Sciences 10, no. 16: 5620. https://doi.org/10.3390/app10165620

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop