Next Article in Journal
Research on Energy-Saving Production Scheduling Based on a Clustering Algorithm for a Forging Enterprise
Previous Article in Journal
Performance Evaluation for Sustainability of Strong Smart Grid by Using Stochastic AHP and Fuzzy TOPSIS Methods
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Water Quality Assessment and Pollution Source Identification of the Eastern Poyang Lake Basin Using Multivariate Statistical Methods

1
Key Laboratory of Watershed Geographic Sciences, Nanjing Institute of Geography and Limnology, Chinese Academy of Sciences, Nanjing 210008, China
2
AAAS Science and Technology Policy Fellow, U.S. Agency for International Development, Accra, Ghana
3
Institute of Agricultural Economics and Development, The Chinese Academy of Agricultural Sciences, Beijing 100081, China
4
Jiangsu Collaborative Innovation Center of Regional Modern Agriculture & Environmental Protection, Huaiyin Normal University, Huai’an 223300, China
*
Authors to whom correspondence should be addressed.
Sustainability 2016, 8(2), 133; https://doi.org/10.3390/su8020133
Submission received: 2 November 2015 / Revised: 16 January 2016 / Accepted: 19 January 2016 / Published: 30 January 2016
(This article belongs to the Section Environmental Sustainability and Applications)

Abstract

:
Multivariate statistical methods including cluster analysis (CA), discriminant analysis (DA) and component analysis/factor analysis (PCA/FA), were applied to explore the surface water quality datasets including 14 parameters at 28 sites of the Eastern Poyang Lake Basin, Jiangxi Province of China, from January 2012 to April 2015, characterize spatiotemporal variation in pollution and identify potential pollution sources. The 28 sampling stations were divided into two periods (wet season and dry season) and two regions (low pollution and high pollution), respectively, using hierarchical CA method. Four parameters (temperature, pH, ammonia-nitrogen (NH4-N), and total nitrogen (TN)) were identified using DA to distinguish temporal groups with close to 97.86% correct assignations. Again using DA, five parameters (pH, chemical oxygen demand (COD), TN, Fluoride (F), and Sulphide (S)) led to 93.75% correct assignations for distinguishing spatial groups. Five potential pollution sources including nutrients pollution, oxygen consuming organic pollution, fluorine chemical pollution, heavy metals pollution and natural pollution, were identified using PCA/FA techniques for both the low pollution region and the high pollution region. Heavy metals (Cuprum (Cu), chromium (Cr) and Zinc (Zn)), fluoride and sulfide are of particular concern in the study region because of many open-pit copper mines such as Dexing Copper Mine. Results obtained from this study offer a reasonable classification scheme for low-cost monitoring networks. The results also inform understanding of spatio-temporal variation in water quality as these topics relate to water resources management.

1. Introduction

Water scarcity is a growing threat to economic and social development and widespread water pollution in recent decades further complicates the threat, especially in developing countries [1,2,3]. Water pollution caused both by anthropogenic activities such as urbanization [4], industrial accidents [5,6,7], dam construction [8], and natural phenomena like soil erosion [9] and climate change [10], is a global issue that increases pressures on freshwater resources. Declining water quality is the result of spatio-temporal changes in sedimentation, temperature, pH, nutrients, heavy metals, toxic organic compounds and pesticides, and so on[11]. In order to safeguard water quality and alleviate the pressures on water resources, it is necessary to elaborate spatial-temporal changes in regional water quality and identify the potential pollution sources.
A series of monitoring programs and protocols have been developed to enable a reliable quantification of nutrient transport in the aquatic environment (e.g., the National Land with Water Information in Japan [12,13], the National Monitoring and Assessment Program (NOVA) in Denmark [14,15], the Harmonized Monitoring Scheme (HMS) in Britain [16,17], and the National Water-Quality Assessment (NAWQA) in the United States [18,19]) to generate a more comprehensive picture of water quality conditions and trends. Meanwhile, sophisticated data-driven analytical approaches (e.g., the projection pursuit technique [20] and neural networks [21,22]), multivariate statistical techniques [23] (e.g., discriminant analysis (DA), cluster analysis (CA) and principal component analysis/factor analysis (PCA/FA)), fuzzy theory[24] and hydrological models [11,25,26,27] have substantially improved water quality assessments. Among these methods, multivariate statistical techniques including CA, PCA/FA, and DA can be applied to easily extract important information in large water quality datasets and are therefore used widely to evaluate water quality and identify potential pollution sources [28].
In some regions in China, water quality impairment is severe, with important consequences for human health as well as sustainable economic and social development [29]. From 2004, the State Ministry of Environmental Protection started to focus on surface water quality monitoring systems in the main river basins, including the Yellow River Basin, the Yangtze River Basin, the Pearl River Basin, the Lake Taihu Basin, the Songhua River Basin, and the Southeastern Coastal Rivers. Meanwhile, local environmental administrations including national, provincial, prefectural (city), and local have been established to monitor and report on local surface water quality in recent years. As a consequence, a huge monitoring database, including nutrients, sediment, physical and chemical properties, toxic organic compounds and pesticides, heavy metals, etc. has been built for regional water resources management. However, lake water environment is still deteriorating. For example, Poyang Lake, the largest freshwater lake (3050 km2) in China, is polluted and most pollutants originated from five rivers including the Gan River, the Fu River, the Xin River, the Rao River, and the Xiu River. Therefore, it is necessary to investigate current situation of water pollution and identify pollution sources in rivers around the Poyang Lake. Moreover, the application of CA and PCA/FA in the Poyang Lake Basin is rare.
Therefore, several multivariate statistical approaches (DA, CA and PCA/FA) are applied to (1) illuminate temporal and spatial variations of water quality; and (2) identify the potential influencing factors that explain changes in water quality parameters of the Eastern Poyang Lake Basin. All results obtained from this study will can offer a reasonable classification scheme for low-cost monitoring networks and also inform understanding of spatio-temporal variation in water quality as these topics relate to water resources research and management.

2. Materials and Methods

2.1. Study Area

Poyang Lake, which is the largest freshwater lake (3050 km2) in China, is located on the south bank of the middle-lower Yangtze River in Jiangxi Province (Figure 1). It is shallow and connected with five main rivers, including the Gan River, the Fu River, the Xin River, the Rao River, and the Xiu River. All these river tributaries form the Poyang Lake Basin, which covers an area of 162,200 km2, accounting for nearly 97% of Jiangxi Province. The topography of the basin varies from highly mountainous regions (maximum elev. 2200 m) to alluvial plains in the lower reaches of the primary watercourses. The basin lies in a subtropical wet climate zone with a distinct alternation from a wet to a dry season (see Figure 2) with an annual mean precipitation of 1710 mm and annual mean temperature of 17.5 °C.
Figure 1. Study area and monitoring stations for the rivers in the Eastern Poyang Lake Basin.
Figure 1. Study area and monitoring stations for the rivers in the Eastern Poyang Lake Basin.
Sustainability 08 00133 g001
Figure 2. Monthly mean precipitation and temperature in the Poyang Lake Basin (1982–2006).
Figure 2. Monthly mean precipitation and temperature in the Poyang Lake Basin (1982–2006).
Sustainability 08 00133 g002
Water resources in the Poyang Lake Basin are rich, but water managers now face a series of difficulties as the awareness of global climate impacts on precipitation patterns increases. Meanwhile, with rapid economic development and the population explosion in the basin, human activities including dam construction [30] and land-use change significantly affect the water supply and patterns of water demand [31], while simultaneously being exacerbated by increased pollutant loading. In addition, mining activities caused heavy metal pollution, which is a serious issue. As a result, Poyang Lake faces a great deal of environmental problems including water quality deterioration and eutrophication. According to the Environmental Aspect Bulletin, Poyang Lake is only slightly polluted, underscoring the importance of investigating and assessing water quality for the protection of the lake water environment. In this study, the East Poyang Lake Basin (Figure 1) was chosen to conduct water quality investigations. The Eastern Poyang Lake Basin mainly contains the Xin River and Rao River (rising from two branches including the Chang River and Lean River. With rapid economic development in the past decades, dozens of non-ferrous metal mines have been intensively exploited in the East Poyang Lake Basin, and heavy metal pollution and related environmental changes have been gradually attracted by scientific community. For example, a great deal of acidic mine drainage (pH 2–3) and waste effluents containing copper (Cu) and Zinc (Zn) discharged from the neighboring Dexing Copper Mine and from dozens of smelters and mining/panning activities along rivers were poured continuously into the Lean River [32], where the aquatic environment has been contaminated.

2.2. Monitored Parameters and Analytical Methods

Twenty-eight monitoring stations were selected for taking water samples in this study; station X1–X13 are located in the Xin River, station L1–L9 are located in the Chang River, and station P1–P5 are located in east Poyang Lake (Figure 1). Water samples were taken every two months from Jan. 2012 to Apr. 2015. Considering pollution features and traditional water quality index in China, total 14 water quality parameters including temperature (TEMP), pH, ammonia-nitrogen (NH4+-N), 5-day biochemical oxygen demand (BOD), chemical oxygen demand (COD), dissolved oxygen (DO), total nitrogen (TN), total phosphorus (TP), fluoride (F), sulfide (S), copper (Cu), oil, chromium (Cr) and Zinc (Zn) were selected to analyze water quality in the Eastern Poyang Lake Basin. The process of sampling, preservation, transportation and analysis of the water samples were conducted strictly according to standard methods (State Environment Protection Bureau of China 2002). Table 1 shows the specific analytical method for each water quality parameters.
Table 1. Water quality parameters, units, analytical methods and lowest detected limit as measured from Jan. 2012 to Apr. 2015 for the Eastern Poyang Lake Basin.
Table 1. Water quality parameters, units, analytical methods and lowest detected limit as measured from Jan. 2012 to Apr. 2015 for the Eastern Poyang Lake Basin.
ParametersAbbreviationsUnitsAnalytical MethodsLowest Detected Limit
TemperatureTEMP°CThermometer-
pHpH Glass electrode-
Dissolved oxygenDOmg/LIodimetry0.2
Ammonia nitrogenNH4+-Nmg/LN-reagent colorimetry0.05
Biochemical oxygen demandBODmg/LDilution and inoculation test2
Chemical oxygen demandCODmg/LPotassium permanganate method0.5
Total nitrogenTNmg/LUltraviolet spectrophotometry0.05
Total phosphorusTPmg/LAmmonium molybdate spectrophotometry0.01
CuprumCumg/LAtomic absorption spectrometry0.001
ZincZnmg/LAtomic absorption spectrometry0.05
FluorideFmg/LIon chromatography0.05
OilOilmg/LInfrared spectrophotometry0.01
SulphideSmg/LMethylene blue spectrophotometric0.005
chromiumCrmg/Ldpc colorimetric0.004

2.3. Multivariate Statistical Methods

Spatio-temporal analysis of the water quality in the Eastern Poyang Lake Basin was analyzed by using CA, DA, and PCA/FA techniques. CA is the task of grouping a set of objects based on the characteristics they possess [33,34] and Ward's method is a criterion applied in hierarchical cluster analysis. The Ward’s Method of hierarchical clustering with Squared Euclidean Distance was applied to explore the grouping of the 28 sampling stations.
DA determines the variables that discriminate between two or more naturally occurring groups/clusters on the basis of the accuracy rate of discriminant functions (DFs). It constructs a discriminant function (DF) for each group [35]. DFs are calculated using the following equation:
f ( G i ) = k i + j = 1 n w i j P i j
where i represents the number of groups ( G ) , k i represents the constant inherent to each group, n represents the number of parameters, and w i j represents the weight coefficient assigned by DF analysis (DFA) to a given parameter ( P i j ).
DA was employed to calculate the mean of a variable to predict group membership. The standard, forward stepwise, and backward stepwise modes of DA were used to calculate DFs in two groups generated from CA to describe spatial variations in river water quality.
PCA is a dimensionality reduction technique that helps to simplify the data and make it easier to visualize by finding a set of principal components (PCs) [36,37]. PCs are orthogonal variables calculated by multiplying the original correlated variables with a list of coefficients, which can be described as
z i j = a i 1 x 1 j + a i 2 x 2 j + + a i m x m j
where z represents the component score, a represents the component loading, x represents the measured value of the variable, i represents the component number, j represents the sample number, and m represents the total number of variables.
FA was used to extract a lower dimensional linear structure from a set of data and thenfore provide a powerful means for detecting similarities among samples [38]. FA can reduce the contribution of less significant variables obtained from PCA and the new group of variables known as varifactors (VFs) is extracted through rotating the axis defined by PCA. The basic concept of FA is described as
z i j = a f 1 f 1 j + a f 2 f 21 + + a f m f m i + e f i
where z represents the measured value of a variable, a represents the factor loading, f represents the factor score, e represents the residual term accounting for errors or other sources of variation, i represents the number of sample, j represents the number of variable, and m represents the total number of factors.
Here, PCA/FA was applied to the normalized log-transformed data sets (14 variables) separately for the two different spatial regions (low pollution region and high pollution region) as delineated by the CA technique.

3. Results and Discussion

3.1. Temporal/Spatial Similarity and Grouping

Figure 3 shows the results of temporal cluster analysis, grouping the 6 months into two statistically significant clusters at (Dlink/Dmax) × 100 < 60. Cluster 1 (dry season), comprised of January and March, approximately correspond to the low flow period. Cluster 2 (wet season) contains two small clusters at (Dlink/Dmax) × 100 < 40; that is, May and November reflect the mean flow periods, and the remaining months (July and September) comprise another group, and reflect the high flow period. Notably, temporal variation of surface water quality was significantly affected by local climate seasons (spring, summer, autumn and winter) and hydrological conditions (low, mean, and high flow period). The Poyang Lake Basin lies in a subtropical wet climate zone with a distinct alternation from wet to a dry season, consistent with the temporal patterns of water quality (Figure 3).
Spatial CA also yielded a dendrogram with two statistically significant clusters at (Dlink/Dmax) × 100 < 60 (Figure 4). Group 1 included X1 and L1 to L4, and the remaining monitoring stations comprised Group 2. The X1 station and L1 to L4 stations in Group 1 are located at the upstream of the Xin River and the Lean River, respectively, which, due to low population density and the absence of industrial and commercial activity, are far from major point and non-point pollution sources. However, L1–L4 stations are located in the Dexing district, which is one of the largest copper and gold producing districts in China and metal pollution and associated mineral pollution are always a problem [39,40]. Despite relatively high Cu, S, and F concentrations were observed at the L4 Station in this study, Group 1 should be considered as moderate or low pollution. Group 2 corresponds to highly polluted stations, with highest average concentrations of NH4-N, oil, BOD, COD and TP. Most stations in this group were located at the middle to down-stream of the east Poyang Lake basin and received pollution from point sources including municipal sewage and industrial wastewater and non-point pollution sources.
Figure 3. Dendogram showing the temporal clustering of study periods.
Figure 3. Dendogram showing the temporal clustering of study periods.
Sustainability 08 00133 g003
Figure 4. Dendogram showing spatial clustering of study periods.
Figure 4. Dendogram showing spatial clustering of study periods.
Sustainability 08 00133 g004

3.2. Temporal/Spatial Variations in River Water Quality

Based on the temporal groups (wet season and dry season) from CA, DA was performed on raw data to further explore temporal changes in surface water quality. Table 2 and Table 3 indicate the discriminant functions (DFs) and classification matrices (CMs), which were calculated by the standard, forward stepwise and backward stepwise modes of DA. Variables are included step-by-step beginning with the more significant until no significant changes in the forward stepwise mode, but are removed step-by-step beginning with the less significant in the backward stepwise mode. Both the standard and forward stepwise mode DFs using 14 and 7 discriminant variables, respectively, produced the corresponding CMs assigning 96.43% of the cases correctly. In the backward stepwise mode, however, DA yielded a CM with approximately 97.86% correct assignations using only four discriminant parameters, showing that TEMP, pH, NH4-N, and TN. Thus, the temporal DA indicated that TEMP, pH, NH4-N, and TN were the most significant parameters to discriminate differences between the wet season and dry season, revealing that these four parameters could be used to account for the expected temporal changes in surface water quality in the Eastern Poyang Lake Basin.
Table 2. Classification functions coefficients for DA of temporal changes.
Table 2. Classification functions coefficients for DA of temporal changes.
ParametersStandard ModeForward Stepwise ModeBackward Stepwise Mode
Wet SeasonDry SeasonWet SeasonDry SeasonWet SeasonDry Season
TEMP1.9893.0333.2412.2072.6131.577
pH82.84784.58559.99558.09556.04854.185
NH4-N21.10524.16728.60225.45227.84625.129
BOD9.93410.125
COD−1.194−1.189
DO10.98110.72712.40912.687
TN−5.506−7.395−9.417−7.549−12.533−10.631
TP7.9966.712−3.095−1.832
F46.73446.243
S631.996634.440
Cu−255.025−250.116
Oil−148.788−206.92820.53275.009
Cr374.404403.601
Zn53.32752.592
Constant−378.134−403.979−302.864−276.639−235.610−206.896
Table 3. Classification matrix for DA of temporal changes.
Table 3. Classification matrix for DA of temporal changes.
Monitoring PeriodsPercent CorrectTemporal Groups
Wet SeasonDry Season
Standard mode
Wet Season95.53632115
Dry Season1000224
Total97.321321239
Forward stepwise mode
Wet Season95.53632115
Dry Season1000224
Total97.321321239
Backward stepwise mode
Wet Season96.42932412
Dry Season1000224
Total97.857324236
Box and whisker plots of the discriminate parameters identified by DA are indicated in Figure 5. The average temperature (Figure 5a) in wet season was clearly higher than in dry season because of the local climate. The same difference in pH was found in Figure 5b. In contrast, the average NH4-N and TN were higher in dry season than in wet season due to the local hydrologic conditions. The discharge in the wet season is much larger than in the dry season, which significantly dilutes the NH4-N and TN. Moreover, in wet season (typical in summer and autumn) there are more aquatic organisms than in the dry season, consuming more NH4-N.
Figure 5. Temporal changes: (a) temperature; (b) pH; (c) NH4-N; and (d) TN in east Poyang Lake basin.
Figure 5. Temporal changes: (a) temperature; (b) pH; (c) NH4-N; and (d) TN in east Poyang Lake basin.
Sustainability 08 00133 g005
Table 4. Classification functions coefficients for DA of spatial changes.
Table 4. Classification functions coefficients for DA of spatial changes.
ParametersStandard ModeForward Stepwise ModeBackward Stepwise Mode
Low PollutionHigh PollutionLow PollutionHigh PollutionLow PollutionHigh Pollution
TEMP1.0281.071.2141.260
pH87.55491.92579.99284.23072.66277.108
NH4-N15.38913.53416.82715.151
BOD10.17510.467
COD−0.833−0.584−0.409−0.146−0.376−0.146
DO10.64510.24610.1449.730
TN−1.1960.484−0.1401.5733.9715.296
TP10.6311.57
F41.52937.65142.88338.95459.00954.412
S518.07441.985479.059401.226416.961330.412
Cu−244.929−234.766−205.655−195.490
Oil−63.009−43.30977.283100.383
Cr435.253496.253588.508653.149
Zn54.77855.278
Constant−384.511−413.498−349.255−376.823−273.306−303.391
Just like temporal DA, the DFs and CMs for spatial DA were obtained from the standard, forward stepwise and backward stepwise modes on the basis of spatial groups (low pollution stations and high pollution stations), which are shown in Table 4 and Table 5. Both the standard and forward stepwise mode DFs using 14 and 11 discriminant variables, respectively, yielded the corresponding CMs assigning 95% of the cases correctly, whereas the backward stepwise DA gave CMs with about 93.75% correct assignations using only five discriminant parameters (Table 4 and Table 5). Backward stepwise DA showed that pH, COD, TN, F, and S were the most significant parameters to discriminate differences between the low pollution stations and high pollution stations.
Table 5. Classification matrix for discriminant analysis of spatial changes.
Table 5. Classification matrix for discriminant analysis of spatial changes.
Monitoring StationsPercent CorrectSpatial Groups
Low PollutionHigh Pollution
Standard mode
Low pollution72.0007228
High pollution100.0000460
Total95.0007228
Forward stepwise mode
Low pollution72.0007228
High pollution100.0000460
Total95.00072488
Backward stepwise mode
Low pollution69.0006931
High pollution99.1304456
Total93.75073487
Table 6. Loadings of experimental variables (14) on significant VFs for low pollution and high pollution.
Table 6. Loadings of experimental variables (14) on significant VFs for low pollution and high pollution.
ParametersLow Pollution (Six Significant Principal Components)High Pollution (four Significant Principal Components)
VF1VF2VF3VF4VF5VF6VF1VF2VF3VF4
TEMP0.0150.0260.0420.8740.2110.1070.206−0.362−0.623−0.017
pH0.7300.2300.201−0.068−0.243−0.169−0.657−0.149−0.114−0.081
NH4-N−0.144−0.078−0.0320.0840.9340.058−0.0690.8610.0220.038
BOD−0.013−0.8340.0360.167−0.0400.2490.103−0.054−0.3650.701
COD0.607−0.4180.362−0.3070.263−0.0140.0500.5620.2340.318
DO−0.1720.197−0.214−0.7450.1380.165−0.165−0.1070.794−0.012
TN0.7210.283−0.1270.1860.443−0.1240.2190.651−0.0030.091
TP0.1490.3550.7580.097−0.040−0.0910.4660.4130.1570.179
F−0.597−0.440−0.121−0.2530.3870.2610.3300.1130.6460.065
S−0.7790.1390.067−0.1520.155−0.052−0.6640.364−0.095−0.178
Cu0.0940.7410.156−0.023−0.1090.334−0.831−0.0790.224−0.055
Oil0.4200.0210.6110.165−0.253−0.1260.1310.2020.0260.626
Cr−0.188−0.1540.8870.0750.0650.1170.0730.1290.2660.639
Zn−0.123−0.011−0.026−0.0170.0610.9310.8350.098−0.1690.073
Eigenvalue3.5532.0001.7091.4691.3021.0073.2052.3641.4261.094
%Total variance25.38214.28712.20810.4919.3037.19022.89016.88310.1897.817
Cumulative% variance25.38239.66851.87662.36771.67178.86122.89039.77349.96257.779
Figure 6 shows the chosen discriminate parameters identified by spatial backward stepwise DA. The pH (Figure 6a) in low pollution regions was clearly less than in high pollution regions, which was not consistent with analyze results in Danjiangkou Reservoir Basin of China [41]. It maybe because river segments in this region receives a great deal of acidic mine drainage and waste effluents containing Cu and Zn discharged from the neighboring Dexing Copper Mine and from many smelters and mining/panning activities. The average COD and TN concentration (Figure 6b,c) in the low pollution region were also clearly less than in the high pollution region. Within high pollution regions, all stations were located in middle to down–stream reaches or near urban areas and therefore in proximity to municipal sewage and industrial waste water. The average F and S concentration (Figure 6d,e) in low pollution regions were also clearly higher than in high pollution region. Obviously, these excess acidic pollutants were main drivers that leading to lower pH. Figure 7 preferably illustrates spatial distribution of pH, DO, TN, and F at 27 stations in the east Poyang Lake basin.
Figure 6. Spatial changes: (a) pH; (b) COD; (c) TN; (d) F and (e) S in the east Poyang Lake basin.
Figure 6. Spatial changes: (a) pH; (b) COD; (c) TN; (d) F and (e) S in the east Poyang Lake basin.
Sustainability 08 00133 g006
Figure 7. Boxplots illustrating distribution of (a) pH; (b) DO (mg/L); (c) TN (mg/L); and (d) F (mg/L) at 27 stations in the east Poyang Lake basin.
Figure 7. Boxplots illustrating distribution of (a) pH; (b) DO (mg/L); (c) TN (mg/L); and (d) F (mg/L) at 27 stations in the east Poyang Lake basin.
Sustainability 08 00133 g007

3.3. Data Structure Determination and Source Identification

Based on the normalized log-transformed data sets, PCA/FA was used to further identify the potential pollution sources for the low pollution and high pollution regions. Before the PCA/FA analysis, the Kaiser–Meyer–Olkin (KMO) and Bartlett’s Sphericity tests were carried out on the parameter correlation matrix to examine the validity of PCA/FA. The KMO results for Group 1 and Group 2 were 0.61 and 0.71, respectively, and Bartlett’s Sphericity results were 547.92 and 1611.68 (p < 0.05), suggesting that PCA/FA analysis was reasonable to offer significant reductions in dimensionality. Six VFs were calculated for the low pollution region and four VFs for the high pollution region with the eigenvalues great than 1, explaining proximately 78.86% and 57.78% of the total variance in respective surface water quality data sets (Figure 8 and Table 6).
Figure 8. Scatter plot of loadings for the four VFs for group 1 (a and b) and group 2 (c and d).
Figure 8. Scatter plot of loadings for the four VFs for group 1 (a and b) and group 2 (c and d).
Sustainability 08 00133 g008
In the low pollution region, among six VFs, VF1, explaining about 25.38% of the total variance, had strong positive loadings of pH and TN and moderate positive loadings of COD, and strong negative loadings of S and moderate negative loadings of F. Generally, high concentrations of total nitrogen reflect agricultural runoff and municipal effluents [42,43]; COD is an indicator of organic pollution from industrial and domestic waste water[44]. The pH is regarded as one of the main reaction conditions for redox reactions involving organic matter, which can regulate the concentration of COD [45]. Sulphide and fluoride mainly originate from copper mines in this region (e.g., the Dexing Copper Mine in Dexing City), the components of which are very complex for dressing with high sulphur and fluoride [46]. VF1 included nutrient pollution, organic pollution and mining pollution. VF2 (14.29% of the total variance) had strong positive loadings of Cu and strong negative loadings of BOD, representing metal pollution. VF3, explaining 12.21% of the total variance, had strong positive loadings of Cr and TP and moderate positive loadings of Oil. This factor can be explained as representing influences from point sources, such as copper mines, industrial effluents and domestic wastewater. VF4, accounting for 10.49% of the total variance, had strong positive loadings of temperature and strong negative loadings of dissolved oxygen. The concentration of DO is controlled by temperature and therefore has both a seasonal and a daily cycle [47]. Therefore, the DO concentration is high in winter and early spring because of low temperature, and is low in summer and fall because of high temperature. VF5 (9.30%) had strong positive loadings on NH4-N representing non-point source pollution related to agricultural activities. VF6 (7.19%) had strong positive loadings of Zn indicating the metal pollution.
With respect to the data set pertaining to the high pollution region, among four VFs, VF1, explaining about 22.89% of the total variance, had strong positive loadings on Cu and moderate negative loadings on pH and S, basically representing metal pollution from the upstream. VF2 (16.88% of the total variance) had strong positive loadings of NH4-N and moderate positive loading of TN and COD. This factor can be explained as one typical kind of mixed pollution, which consists of point source pollution (e.g., industrial and domestic waste water) and non-point source pollution associated with agricultural activities and atmospheric deposition. VF3, explaining 10.19% of the total variance, had strong positive loadings of DO and moderate positive loadings of F and moderate negative loadings of TEMP. Generally, fluoride is from cement plants, fluorine chemical factories, and copper smelters in this region. The relationship between DO and temperature is explained in the same way as the explanation of VF4 in the low pollution region. VF4 (7.82%) had strong positive loadings of BOD and moderate positive loadings of Oil and Cr. The high concentration of BOD and Oil could represent organic pollution and oil pollution, and Cr is likely from cement plants and copper smelters in this region.
Based on the above analysis, five latent pollutants including nutrients, organics, chemicals, heavy metals and natural pollutants were identified in the study area. Firstly, nutrient pollution (ammonia nitrogen and total nitrogen) was mainly from non-point sources related to agricultural activities and atmospheric deposition and point sources including municipal effluents and fertilizer plant wastewater. In addition, organic pollution (BOD and COD) was usually from point sources (e.g., industrial and domestic waste water). Thirdly, chemical pollution was mainly from the petroleum industry (oil pollution) and copper mines and plating (S and F pollution). Fourthly, heavy metal pollution (Cu, Cr and Zn) was mainly from copper mines and plating. Finally, natural pollution was badly affected by meteorological variations such as the variation of water temperature and dissolved oxygen. Considering the types of pollution in the two regions (Table 6 and Figure 6), heavy metals (Cu, Cr and Zn), fluoride and sulfide stood out. Field investigations showed there are many copper mines in the Eastern Poyang Lake Basin, such as the Dexing Copper Mine, which are associated with mineral effluents including fluoride and sulfide.

4. Conclusions and Future Work

Different multivariate statistical techniques were applied to evaluate spatio-temporal variations in surface water quality of the Eastern Poyang Lake Basin in this study. Hierarchical CA grouped 6 months into two periods and 28 sampling stations into two groups on the basis of their similar water quality characteristics, which can provide a reasonable and useful classification for optimizing the future spatial monitoring network with lower cost. Based on the results obtained from hierarchical CA, spatial and temporal changes in surface water quality were analyzed through achieving discriminant functions and classification matrices using DA. For temporal changes, the temporal DA used only four discriminant parameters including TEMP, pH, NH4-N, and TN, with approximately 97.86% correct assignations. The spatial DA gave CMs with about 93.75% correct assignations using only five discriminant parameters (pH, COD, TN, F, and S). Thus, temporal and spatial DA analysis could be used to optimize future water quality monitoring programs by reducing the number of monitoring stations, monitoring parameters, and monitoring frequency. The results from PCA/FA analysis identified there were five latent pollutions including nutrients, organics, chemicals, heavy metals and natural pollution. Heavy metals (Cu, Cr and Zn), fluoride and sulfide were serious problems in the study region. In low pollution region, heavy metal and sulfide pollution were mainly from copper mines such as the Dexing Copper Mine, but from copper mines and plating in the high pollution region.
Although all results obtained from this study illustrate the utility of multivariate statistical techniques for extracting characteristics from large water quality data sets and identifying pollution sources to fully understand spatiotemporal variations in water quality, further study should be carried out in the future. First, some water quality parameters including TEMP, pH, NH4-N, COD, TN, F, and S should be further monitored more accuracy and controlled. In addition, it is necessary to quantitatively evaluate pollution sources to obtain contribution of different pollutants. Finally, further investigation of heavy metal pollution should be implemented, especially for Cu pollution.

Acknowledgments

This study was sponsored by the National Natural Science Foundation of China (No. 41471460, 41501552, and 41130750) and “One Hundred Talents Program” of the Chinese Academy of Sciences. The authors are grateful to some staff members of Shangrao Environmental Protection and Shangrao Development and Reform Commission for their helpful information for this study.

Author Contributions

Weili Duan had the original idea for the study and, with all co-authors carried out the design. Weili Duan drafted the manuscript, which was revised by all authors. All authors read and approved the final manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Vörösmarty, C.J.; Green, P.; Salisbury, J.; Lammers, R.B. Global water resources: Vulnerability from climate change and population growth. Science 2000, 289, 284–288. [Google Scholar] [CrossRef] [PubMed]
  2. Schwarzenbach, R.P.; Egli, T.; Hofstetter, T.B.; von Gunten, U.; Wehrli, B. Global water pollution and human health. Ann. Rev. Environ. Resour. 2010, 35, 109–136. [Google Scholar] [CrossRef]
  3. Hoekstra, A.Y.; Mekonnen, M.M. The water footprint of humanity. Proc. Natl. Acad. Sci. 2012, 109, 3232–3237. [Google Scholar] [CrossRef] [PubMed]
  4. Tu, J. Spatial variations in the relationships between land use and water quality across an urbanization gradient in the watersheds of northern Georgia, USA. Environ Manag. 2013, 51, 1–17. [Google Scholar] [CrossRef] [PubMed]
  5. Duan, W.; Chen, G.; Ye, Q.; Chen, Q. The situation of hazardous chemical accidents in China between 2000 and 2006. J. Hazard Mater. 2011, 186, 1489–1494. [Google Scholar] [CrossRef] [PubMed]
  6. Duan, W.; He, B.; Takara, K.; Luo, P.; Nover, D.; Sahu, N.; Yamashiki, Y. Spatiotemporal evaluation of water quality incidents in Japan between 1996 and 2007. Chemosphere 2013, 93, 946–953. [Google Scholar] [CrossRef] [PubMed]
  7. Duan, W.; He, B. Emergency response system for pollution accidents in chemical industrial parks, China. Int. J. Environ. Res. Public Health 2015, 12, 7868–7885. [Google Scholar] [CrossRef] [PubMed]
  8. Zhao, Q.; Liu, S.; Deng, L.; Yang, Z.; Dong, S.; Wang, C.; Zhang, Z. Spatio-temporal variation of heavy metals in fresh water after dam construction: A case study of the Manwan Reservoir, Lancang River. Environ. Monit. Assess. 2012, 184, 4253–4266. [Google Scholar] [CrossRef] [PubMed]
  9. Shi, H.; Shao, M. Soil and water loss from the Loess Plateau in China. J. Arid Environ. 2000, 45, 9–20. [Google Scholar] [CrossRef]
  10. Delpla, I.; Jung, A.; Baures, E.; Clement, M.; Thomas, O. Impacts of climate change on surface water quality in relation to drinking water production. Environ. Int. 2009, 35, 1225–1233. [Google Scholar] [CrossRef] [PubMed]
  11. Carr, G.M.; Neary, J.P. Water Quality for Ecosystem and Human Health; UNEP/Earthprint: Stevenage, UK, 2008. [Google Scholar]
  12. Duan, W.; Takara, K.; He, B.; Luo, P.; Nover, D.; Yamashiki, Y. Spatial and temporal trends in estimates of nutrient and suspended sediment loads in the Ishikari River, Japan, 1985 to 2010. Sci. Total Environ. 2013, 461, 499–508. [Google Scholar] [CrossRef] [PubMed]
  13. Luo, P.; He, B.; Takara, K.; Razafindrabe, B.H.; Nover, D.; Yamashiki, Y. Spatiotemporal trend analysis of recent river water quality conditions in Japan. J. Environ. Monitor. 2011, 13, 2819–2829. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Kronvang, B.; Jeppesen, E.; Conley, D.J.; Søndergaard, M.; Larsen, S.E.; Ovesen, N.B.; Carstensen, J. Nutrient pressures and ecological responses to nutrient loading reductions in Danish streams, lakes and coastal waters. J. Hydrol. 2005, 304, 274–288. [Google Scholar] [CrossRef]
  15. Conley, D.J.; Markager, S.; Andersen, J.; Ellermann, T.; Svendsen, L.M. Coastal eutrophication and the Danish national aquatic monitoring and assessment program. Estuar. Coasts 2002, 25, 848–861. [Google Scholar] [CrossRef]
  16. Morvan, X.; Saby, N.; Arrouays, D.; Le Bas, C.; Jones, R.; Verheijen, F.; Bellamy, P.H.; Stephens, M.; Kibblewhite, M.G. Soil monitoring in Europe: A review of existing systems and requirements for harmonisation. Sci. Total Environ. 2008, 391, 1–12. [Google Scholar] [CrossRef] [PubMed]
  17. Hurley, M.A.; Currie, J.E.; Gough, J.; Butterwick, C. A framework for the analysis of harmonised monitoring scheme data for England and Wales. Environmetrics 1996, 7, 379–390. [Google Scholar] [CrossRef]
  18. Gilliom, R.J.; Alley, W.M.; Gurtz, M.E. Design of National Water-Quality Assessment Program; United States Government Printing Office: Washington, DC, USA, 1995.
  19. Rosen, M.R.; Lapham, W.W. Introduction to the US Geological Survey National Water-Quality Assessment (NAWQA) of ground-water quality trends and comparison to other national programs. J. Environ. Qual. 2008, 37, 190. [Google Scholar]
  20. Zhang, C.; Sihui, D. A new water quality assessment model based on projection pursuit technique. J. Environ. Sci. 2009, 21, S154–S157. [Google Scholar] [CrossRef]
  21. He, B.; Oki, T.; Sun, F.; Komori, D.; Kanae, S.; Wang, Y.; Kim, H.; Yamazaki, D. Estimating monthly total nitrogen concentration in streams by using artificial neural network. J. Environ. Manag. 2011, 92, 172–177. [Google Scholar] [CrossRef] [PubMed]
  22. Wu, C.L.; Chau, K.W.; Li, Y.S. Methods to improve neural network performance in daily flows prediction. J. Hydrol. 2009, 372, 80–93. [Google Scholar] [CrossRef]
  23. Singh, K.P.; Malik, A.; Mohan, D.; Sinha, S. Multivariate statistical techniques for the evaluation of spatial and temporal variations in water quality of Gomti River (India)-a case study. Water Res. 2004, 38, 3980–3992. [Google Scholar] [CrossRef] [PubMed]
  24. Wang, W.; Xu, D.; Chau, K.; Lei, G. Assessment of river water quality based on theory of variable fuzzy sets and fuzzy binary comparison method. Water Resour. Manag. 2014, 28, 4183–4200. [Google Scholar] [CrossRef]
  25. Moriasi, D.N.; Wilson, B.N.; Douglas-Mankin, K.R.; Arnold, J.G.; Gowda, P.H. Hydrologic and water quality models: Use, calibration, and validation. Trans. ASABE 2012, 55, 1241–1247. [Google Scholar] [CrossRef]
  26. He, B.; Kanae, S.; Oki, T.; Hirabayashi, Y.; Yamashiki, Y.; Takara, K. Assessment of global nitrogen pollution in rivers using an integrated biogeochemical modeling framework. Water Res. 2011, 45, 2573–2586. [Google Scholar] [CrossRef] [PubMed]
  27. Chau, K.; Wu, C. A hybrid model coupled with singular spectrum analysis for daily rainfall prediction. J. Hydroinform. 2010, 12, 458–473. [Google Scholar] [CrossRef]
  28. Wang, Y.; Wang, P.; Bai, Y.; Tian, Z.; Li, J.; Shao, X.; Mustavich, L.F.; Li, B. Assessment of surface water quality via multivariate statistical techniques: A case study of the Songhua River Harbin region, China. J. Hydro-Environ. Res. 2013, 7, 30–40. [Google Scholar] [CrossRef]
  29. He, G.; Lu, Y.; Mol, A.P.; Beckers, T. Changes and challenges: China′s environmental management in transition. Environ. Dev. 2012, 3, 25–38. [Google Scholar] [CrossRef]
  30. Zhang, Q.; Li, L.; Wang, Y.G.; Werner, A.D.; Xin, P.; Jiang, T.; Barry, D.A. Has the Three-Gorges Dam made the Poyang Lake wetlands wetter and drier? Geophys. Res. Lett. 2012, 39, L20402. [Google Scholar] [CrossRef]
  31. Ye, X.; Zhang, Q.; Liu, J.; Li, X.; Xu, C. Distinguishing the relative impacts of climate change and human activities on variation of streamflow in the Poyang Lake catchment, China. J. Hydrol. 2013, 494, 83–95. [Google Scholar] [CrossRef]
  32. Liu, W.X.; Coveney, R.M.; Chen, J.L. Environmental quality assessment on a river system polluted by mining activities. Appl. Geochem. 2003, 18, 749–764. [Google Scholar] [CrossRef]
  33. Mckenna, J.E. An enhanced cluster analysis program with bootstrap significance testing for ecological community analysis. Environ. Model. Softw. 2003, 18, 205–220. [Google Scholar] [CrossRef]
  34. Shrestha, S.; Kazama, F. Assessment of surface water quality using multivariate statistical techniques: A case study of the Fuji river basin, Japan. Environ. Model. Softw. 2007, 22, 464–475. [Google Scholar] [CrossRef]
  35. Johnson, R.A.; Wichern, D.W. Applied Multivariate Statistical Analysis; Prentice hall: Englewood Cliffs, NJ, USA, 1992. [Google Scholar]
  36. Jolliffe, I. Principal Component Analysis; John Wiley & Sons Inc.: Charlottesville, VA, USA, 2002. [Google Scholar]
  37. Yidana, S.M.; Ophori, D.; Banoeng-Yakubo, B. A multivariate statistical analysis of surface water chemistry data-The Ankobra Basin, Ghana. J. Environ. Manag. 2008, 86, 80–87. [Google Scholar] [CrossRef] [PubMed]
  38. Reghunath, R.; Murthy, T.S.; Raghavan, B.R. The utility of multivariate statistical techniques in hydrogeochemical studies: An example from Karnataka, India. Water Res. 2002, 36, 2437–2442. [Google Scholar] [CrossRef]
  39. Xiao, X.; Wang, Y.; Zhang, H.; Yu, X. Effects of primary productivity and ecosystem size on food-chain length in Raohe River, China. Acta Ecol. Sin. 2015, 35, 29–34. [Google Scholar] [CrossRef]
  40. Chen, H.; Teng, Y.; Lu, S.; Wang, Y.; Wang, J. Contamination features and health risk of soil heavy metals in China. Sci. Total Environ. 2015, 512, 143–153. [Google Scholar] [CrossRef] [PubMed]
  41. Chen, P.; Li, L.; Zhang, H. Spatio-Temporal Variations and Source Apportionment of Water Pollution in Danjiangkou Reservoir Basin, Central China. Water 2015, 7, 2591–2611. [Google Scholar] [CrossRef]
  42. Yang, Y.; Zhou, F.; Guo, H.; Sheng, H.; Liu, H.; Dao, X.; He, C. Analysis of spatial and temporal water pollution patterns in Lake Dianchi using multivariate statistical methods. Environ. Monit. Assess. 2010, 170, 407–416. [Google Scholar] [CrossRef] [PubMed]
  43. Alexander, R.B.; Smith, R.A.; Schwarz, G.E. Effect of stream channel size on the delivery of nitrogen to the Gulf of Mexico. Nature 2000, 403, 758–761. [Google Scholar] [CrossRef] [PubMed]
  44. Singh, K.P.; Malik, A.; Sinha, S. Water quality assessment and apportionment of pollution sources of Gomti river (India) using multivariate statistical techniques-a case study. Anal. Chim. Acta 2005, 538, 355–374. [Google Scholar] [CrossRef]
  45. Juahir, H.; Zain, S.M.; Yusoff, M.K.; Hanidza, T.T.; Armi, A.M.; Toriman, M.E.; Mokhtar, M. Spatial water quality assessment of Langat River Basin (Malaysia) using environmetric techniques. Environ. Monit. Assess. 2011, 173, 625–641. [Google Scholar] [CrossRef] [PubMed]
  46. He, M.; Wang, Z.; Tang, H. Spatial and temporal patterns of acidity and heavy metals in predicting the potential for ecological impact on the Le An river polluted by acid mine drainage. Sci. Total Environ. 1997, 206, 67–77. [Google Scholar] [CrossRef]
  47. Ringler, N.H.; Hall, J.D. Effects of logging on water temperature, and dissolved oxygen in spawning Beds. Trans. Am. Fish Soc. 1975, 104, 111–121. [Google Scholar] [CrossRef]

Share and Cite

MDPI and ACS Style

Duan, W.; He, B.; Nover, D.; Yang, G.; Chen, W.; Meng, H.; Zou, S.; Liu, C. Water Quality Assessment and Pollution Source Identification of the Eastern Poyang Lake Basin Using Multivariate Statistical Methods. Sustainability 2016, 8, 133. https://doi.org/10.3390/su8020133

AMA Style

Duan W, He B, Nover D, Yang G, Chen W, Meng H, Zou S, Liu C. Water Quality Assessment and Pollution Source Identification of the Eastern Poyang Lake Basin Using Multivariate Statistical Methods. Sustainability. 2016; 8(2):133. https://doi.org/10.3390/su8020133

Chicago/Turabian Style

Duan, Weili, Bin He, Daniel Nover, Guishan Yang, Wen Chen, Huifang Meng, Shan Zou, and Chuanming Liu. 2016. "Water Quality Assessment and Pollution Source Identification of the Eastern Poyang Lake Basin Using Multivariate Statistical Methods" Sustainability 8, no. 2: 133. https://doi.org/10.3390/su8020133

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop