4.1. Evaluation Results for the Index Layer
The values of these indicators (C11–C38) in
Table 2 are available from the Beijing Water Resources Bulletin (2000–2018) [
40], Beijing Water Statistical Yearbook (2016–2018) [
41] and Beijing Statistical Yearbook (2019) [
42], as shown in
Table 6.
The rationality of the indicator standards directly affects the accuracy of the Beijing SWRS evaluation. For the principle of grading, a five-grade standard division is adopted based on the relevant regulations of the state (
Table 7). In general, the standards for these indicators are identified with reference to (1) the oval global situation and (2) the overall situation in China. Here, only indicators C11–C15 are used as an example to illustrate how the standards are determined. The annual precipitation distribution in China varies from below 50 to above 3000 mm [
43], and the standards of C11 are determined to have five grades. According to the internationally recognized water shortage standards (<3000 m
3, mild; <2000 m
3, moderate; <1000 m
3, severe, and <500 m
3, extreme) and the per capita water resources in China (~2000 m
3), the five grades of C12 are divided as shown in
Table 7. According to the global per capita water resources in the Word Bank dataset [
44], the dividing point values for the grades 1–5 for C13 are set as 300,000, 120,000, 40,000, 15,000, and 5000. The C14 and C15 standards are set based on the urbanization levels in developing and developed countries around the world [
32] and the forest coverage of the countries [
44]. Due to space limitations, other indicators will not be described in detail here. The grading standards are shown in
Table 4.
According to the principle of SPA, the values of indicators for the index, criterion and goal layers form set
A, and the standard values of the five grades for these indicators form set
B. These two sets are combined to obtain the set pair
H(
A,
B). The connection degrees of the set pair
H(
A,
B) can be calculated using Equations (2), (A1) and (A2). Since the connectivity degree ranges from −1 to 1, a different grading scale is used to effectively depict the evaluation results. According to the SPA model, the connection degrees approaching 1 indicate excellent evaluation results. In contrast, the evaluation is poor when the connection degree is close to −1. Corresponding to the standard of the indicators in the index system of the SWRS (five grades), the evaluation standard for the connection degrees of the three layers is divided into five grades: very poor (grade 1), poor (grade 2), medium (grade 3), good (grade 4), and excellent (grade 5). The connection degree values in the range of [−1,1] and the corresponding evaluation layers are shown in
Table 8.
The connection degrees for the index layer are the basis of the evaluation of the goal and criterion layers and the analysis of the operating mechanism of the Beijing SWRS. According to the indicator values (
Table 6) and the grading standards (
Table 7), the identity, discrepancy and contradistinction degrees (
rpq1,
rpq2,
rpq3,
rpq4, and
rpq5) are determined using Equation (A4).
A value of −1 is set for the contradistinction degree, indicating that the indicator condition is extremely poor. According to the principle of equal division, the discrepancy degree coefficients (
i1,
i2, and
i3) are determined based on the equally dividing the interval of [−1,1]. Specifically,
i1 = 0.5,
i2 = 0, and
i3 = −0.5. The variations in the connection degrees of indicators C11–C38 and the corresponding grades are depicted in
Figure 4.
The top panel of
Figure 4 shows the variations in the connection degree of the indicators related to the dynamic component of the SWRS. C12 is the per capita local water resources in Beijing. The connection degree was −1 from 2000 to 2018 (red circles), belonging to grade 1, reflecting a poor performance. During this period, the per capita local water resources in Beijing varied between 90 and 165 m
3/person, which is much lower than the international standard of severe water scarcity (<500 m
3/person). Such a high population pressure on the water subsystem of Beijing could severely weaken the dynamics of the SWRS, negatively affect the sustainable utilization of water resources and hinder the development of socioeconomic status of the Beijing megacity. However, C14 (urbanization rate) displays an excellent performance. The urbanization rate of Beijing increased from 77.5 to 86.5%, reflecting rapid growth. The connection degree varied from 0.94 in 2000 to 1 in 2005 and remained at grade 5 during this period (dark cyan circles in
Figure 4). Therefore, rapid and high-level urbanization has greatly promoted the socioeconomic development of Beijing and is very favorable to the conservation, efficient utilization, and integrated management of water resources. The black circles (
Figure 4) indicate the variations in the connection degree of C11 (annual precipitation). Although the connection degree lightly increases overall, it fluctuates near grade 2, which reflects a poor situation. This result illustrates that the annual precipitation in Beijing, although relatively limited, has a stable influence on the SWRS among all the dynamic indicators. For indicators C13 (per capita GDP) and C15 (forest coverage), the connection degrees exhibit a similar trend. From 2000 to 2018, the per capita GDP of Beijing increased by a factor of nearly five, and the forest coverage increased by 50%. The connection degree of C13 varied from −0.31 in 2000 to 0.56 in 2018, and the corresponding grade increased from 2 (poor) to 4 (good). The connection degree of C15 ranged from −0.20 in 2000 to 0.28 in 2018 and the grade of this indicator changed from medium to good. Therefore, these two indicators were major factors improving the dynamic conditions of the SWRS in Beijing.
The middle panel of
Figure 4 depicts the variations in the connection degree of the indicators related to the resistance component of the SWRS. Although the water resource development ratio (C21) in Beijing decreased from 204 to 54%, it is still very high. Therefore, the connection degree (black squares) is still −1, and this indicator is at a very poor level. From 2000 to 2018, the proportion of domestic water consumption (C22) in Beijing increased due to the increase in the population. As a result, the connection degree decreased from grade 2 (poor) to grade 1 (very poor). The indicators other than C21 and C22 exhibited a good performance, and their connection degrees increased significantly. The change in the connection degree of C25 was the most significant, increasing from −1 to 1. The proportion of water supplied by the eco-environment in Beijing increased from 1.1% in 2000 to 34.1% in 2018, which fundamentally reversed the impact on the SWRS (i.e., shift from resistance to a driving force). C23 (proportion of domestic water consumption) and C26 (proportion of sewage discharge) also have a positive impact on the SWRS.
The final panel in
Figure 4 shows the variations in the connection degrees of the indicator related to the coordination component of the SWRS. One indicator, namely, per capita daily domestic water use (C32), displays a very poor performance. The connection degree of C32 remained at −1 and is classified as grade 1. The per capita daily domestic water use is between 205 and 260 m
3/(person day), which is far higher than the standard for urban residential use in China (85–140 L/(person day) in Beijing and other cities). In the early period, the connection degree of indicators C36 and C37 was −1, reflecting the very poor performance of the SWRS in Beijing. However, with the increase in water supply provided by the South to North Project and reclaimed water, the connection degrees of these two indicators increased significantly in the later period. The connection degree for C37 was excellent in 2014 and those for C36 and C37 have been stable at medium and good levels in recent years. The proportion of agriculture in Beijing is very small, and agricultural products have accounted for less than 1% of the total GDP since 2007. Additionally, the agricultural water use efficiency in Beijing is very high at between 3000 and ~4500 m
3/hm
−2. Hence, the connection degree of C34 is good to excellent (between 0.4 and 0.9). The connection degrees of other indicators (C31, C33, C35 and C38) have been increasing and had positive coordination effects on the SWRS of Beijing.
4.2. Dynamic Analysis of the Operating Mechanism of the Beijing’s SWRS
According to the steps in the AHP method, the weights of C22–C26, C31–C38 and B1–B3 can be determined. The weights of indicators in the index and criterion layers are shown in
Table 9. Given the weights in
Table 9 and the identity, discrepancy or contradistinction degrees of indicators in the index layer, the connection degrees of indicators in the criterion layer and the goal layer can be calculated using Equations (A2) and (A3), and the results are depicted in
Figure 5.
As shown by the red solid dots and lines in
Figure 5, the connection degree of the SWRS condition in Beijing annually increased from −0.52 in 2000 to −0.03 in 2018. However, the corresponding indicator grade only changed from grade 2 to grade 3, namely, from poor to medium. In addition, the connection degree was negative until 2018. These results illustrate that the sustainability of the Beijing SWRS is improving, but this improvement has not been very significant and the sustainability of the Beijing SWRS has not yet fundamentally changed.
The dynamic conditions (B1), resistance conditions (B2) and coordination conditions (B3) are the components of the goal layer (SWRS conditions (A)). The connection degrees of these three indicators increased overall, indicating that these indicators all have a positive influence on improving the sustainability of the Beijing SWRS.
Figure 5 shows that the rates of change of the connection degrees of these three indicators were relatively consistent in the former period (2000–2008) and began to diverge after 2008, with the coordination indicator increasing fastest, the dynamic conditions indicator increasing slowest and resistance conditions indicator increasing at a medium rate.
Table 9 shows that the indicator weight of the dynamic conditions indicators is 0.58, which is higher than those of the other two indictors. Therefore, the variations in the dynamic conditions mainly determine the overall trend of the SWRS conditions.
As shown by the blue half-filled dots and lines in
Figure 5, the connection degree of the dynamic conditions indicator slowly increases from −0.45 to −0.23, thus remaining at the grade 2 level (i.e., poor level). This finding suggests that although the dynamic conditions of the Beijing SWRS are gradually improving, this improvement is very slow and influences variations in the SWRS conditions. For the resistance conditions indicator, the connection degree annually increased from −0.61 to −0.03, and the indicator grade rose from grade 2 to 3 (i.e., from poor to medium). Thus, the negative influence of the resistance factors on the Beijing SWRS decreased, and the positive effect increased. For the three factors that influence the operating mechanism of the SWRS, the coordination conditions indicator has the most positive effect; notably, the corresponding connection degree significantly increases from −0.60 to 0.35, and the corresponding grade rises from 2 (poor) to 4 (good). This significant change from negative to positive is fundamental to the overall improvement in sustainability. As the indicators related to the coordination mechanism have improved, the effects of the coordination mechanism on the SWRS have become increasingly important.
4.3. Discussion on the Grading Threshold of Indicators
The selection and grading of indicators are subjective in the comprehensive evaluation of water resources, environment or disasters. This subjectivity has become a controversial issue that causes the objectivity of the evaluation results and is also a question that is often asked. It is important to look at this issue from two perspectives. On the one hand, this issue is influenced by many factors and is also very complex, which poses many difficulties and challenges to understanding and the accurately quantitative evaluation. On the other hand, a comprehensive evaluation is a study that involves the interdisciplinarity of social and natural sciences. It has its unique features, and requires a novel horizon and novel thinking to answer the faced questions.
There are many factors that make the selection and grading of indicators subjective, such as different geographical regions, different ecological protective objectives, different development stages and changes in management policies, etc. The existence and influence of these factors are objective and unavoidable. For example, the focus of an indicator system in selecting indicators is different in different geographical regions for the evaluation of water resources sustainability. In regions with abundant precipitation, water quality is a major factor influencing water shortages and indicators related to water quality should be selected. However, in arid and semiarid regions, precipitation may be a critical factor causing water shortage, so indicators related to precipitation are the focus of consideration. In addition, the grading of the same indicator should be different in different regions. For example, vegetation type and coverage degree in desert and rainforest regions are significantly different according to the laws of zoning in physical geography—desert vegetation is sparse and rainforest vegetation is dense. If the indicator grading of rainforest vegetation is employed to measure the vegetation in desert regions, the evaluation result is definitely very bad, which is not realistic. Under national conditions, desert vegetation should be sparse and is very healthy, which is determined by the local hydrothermal conditions. Therefore, the grading of vegetation indicator in the two geographical regions should be different and adapted based on the local conditions. The grading of indicators should vary with different evaluation objectives for the same topic. For example, the proportion of agricultural areas in a country may vary from 5 to 30% according to different nature protection goals [
45]. In addition, the allowable values of some indicators vary with different stages of socioeconomic development. For example, generally, in the early stage of economic development for a country, sewage treatment may not be the performance assess index of the local government. Environmental problems become more and more serious with economic development, and the government will demand higher and higher sewage treatment rates. In China, the urban sewage treatment rate has increased from 75.25% in 2009 to 91.90% in 2015 [
46]. If the sewage treatment rate of a city is 70% in China, ten years ago it would have been evaluated as a very good environmental performance, but now it would be a very poor rating. These subjectivity (or uncertainties), caused by the regions, objectives and development stages, can be controlled or increased by constructing a causal or overall framework for the selection and grading of indicators based on some general guidelines and they can be also corrected according to practical experience.
The rationality or explanation of the grading thresholds of some indicators has been studied. For example, a 20 and 40% annual water withdrawal to availability are generally categorized as the accepted thresholds for medium and high water stress, respectively [
47]. However, the rationale for these two thresholds has not been fully explained. Hanasaki et al. [
48] quantitatively investigated the empirical water stress thresholds using a state-of-the-art global hydrological model and addressed two well-known questions—(1) under what hydrological conditions do conventional water scarcity indicators represent water stress; (2) are there alternative methods to set the thresholds to reflect regional variations? In addition, more and more methods, such as a socio-ecohydrological thresholds framework, spatial statistical method, geostatistics model, etc., are being employed to study or determine the grading thresholds for some evaluation indicators [
49,
50]. Therefore, more and more studies on the rationale of the grading thresholds of evaluation indictors will enhance the accuracy and objectivity of the evaluation results.