Research paper
Assessment of surface water quality via multivariate statistical techniques: A case study of the Songhua River Harbin region, China

https://doi.org/10.1016/j.jher.2012.10.003Get rights and content

Abstract

Multivariate statistical approaches, such as cluster analysis (CA) and principal component analysis/factor analysis (PCA/FA), were used to evaluate temporal/spatial variations in water quality and identify latent sources of water pollution in the Songhua River Harbin region. The dataset included data on 15 parameters for six different sites in the region over a five-year monitoring period (2005–2009). Hierarchical CA grouped the six monitored sites into three clusters based on their similarities, corresponding to regions of low pollution (LP), moderate pollution (MP) and high pollution (HP). PCA/FA of the three different groups resulted in five latent factors accounting for 70.08%, 67.54% and 76.99% of the total variance in the water quality datasets of LP, MP and HP, respectively. This indicates that the parameters responsible for water quality variation are primarily related to organic pollution and nutrients (non-point sources: animal husbandry and agricultural activities), temperature (natural), heavy metal and toxic pollution (point sources: industry) in relatively LP areas; oxygen-consuming organic pollution (point sources: industry and domestic wastewater), temperature (natural), heavy metal and petrochemical pollution (point source: industry), nutrients (non-point sources: agricultural activities, organic decomposition and geologic deposits) in MP areas; and heavy metal, oil and petrochemical pollution (point source: industry), oxygen-consuming organic pollution (point source: domestic sewage and wastewater treatment plants), nutrients (non-point sources: agricultural activities, runoff in soils) in HP areas of the Harbin region. Therefore, the identification of the main potential environmental hazards in different regions by this study will help managers make better and more informed decisions about how to improve water quality.

Highlights

► We analyze complicated dataset to assess surface water quality of Songhua River. ► Study area is divided into three different pollution regions rationally by CA. ► PCA/FA helps to identify the each five main latent pollution sources for three regions. ► Different proper priorities for each region can be more helpful to improve water quality.

Introduction

The quality of surface water is a major factor affecting human health and ecological systems, especially around urban areas, since rivers and their tributaries passing through cities receive a multitude of contaminants released from industrial, domestic/sewage, and agricultural effluents (Qadir et al., 2008). Anthropogenic influences such as urbanization, industrial and agricultural practices, chemical spill accidents, dam construction, and natural processes like erosion and climatic conditions, could each affect surface water quality. However, the degree to which each factor contributes to water quality is unclear (Zhang Y et al., 2009). Thus, in order to help managers prioritize and make rational decisions as to the best course of action for improving water quality, it is necessary to decrease this uncertainty by interpreting temporal and spatial variations in water quality (Wang et al., 2008) and identifying the latent pollution sources (Zhang Q et al., 2009).

Recently, some data-driven approaches, such as the projection pursuit technique and neural networks, have been applied to water quality assessment (Wang et al., 2006, 2009). However, compared with these approaches, multivariate statistical techniques such as cluster analysis (CA) and principal component analysis/factor analysis (PCA/FA) can be used to analyze large water quality datasets without losing important information. They can play the important role of verifying temporal and spatial variations caused by natural and anthropogenic factors (Liu et al., 2011; Shrestha and Kazama, 2007; Singh et al., 2009; Vega et al., 1998). In addition, they have been widely used to evaluate water quality, identify the latent sources that influence surface water, and offer a valuable tool for reliable management of water resources as well as effective solutions to pollution in the last decade (dos Santos et al., 2004; Kazi et al., 2009; Kim et al., 2005; Kumar et al., 2009; Mencio and Mas-Pla, 2008; Razmkhah et al., 2010; Simeonov et al., 2003; Varol and Sen, 2009; Zhou et al., 2007a,b).

In China, the State Ministry of Environmental Protection started to focus significantly on the environmental monitoring system in the Songhua River basin after the Songhua River Benzene Spill in December 2005. Many monitoring sections have been established around big cities like Harbin in recent years, and a huge monitoring data base, including organic properties, physical and chemical properties, nutrients, inorganic constituents and heavy metal etc. has been put in place through these programs. However, the large and complicated datasets are difficult to analyze and interpret because the transformation of water quality properties is a perplexing process with many uncertainties (Gaume et al., 1998) and there are many potential interrelationships among these properties and monitoring sites. Furthermore, little work has been done to explore the application of CA and PCA/FA to river studies in China and previous research in this field has neglected to consider the Songhua River near Harbin city.

In this study, the large data matrix generated under the 5-year (2005–2009) monitoring program is subjected to different multivariate statistical approaches (CA, PCA/FA) to (i) evaluate the contribution of water quality parameters to temporal and spatial variations in surface water quality, and (ii) identify the potential factors that explain variation in water quality parameters of the Songhua River near Harbin City. The results will not only help managers understand the main sources of pollution, but also our further studies on the impact of dam construction on the water quality of the Songhua River near Harbin City.

Section snippets

Study area

The Songhua River, the third biggest river in China, is generated from the conjunction of the Nen River and the Sec-Songhua River, which originate from the Changbai Mountains. It traverses a distance of 2214.3 km, with a basin area of 556,800 km2. The Songhua River mainstream flows across the downtown of Harbin and merges with two main tributaries – the Ashen River and the Hulan River – in the Harbin region (Fig. 1).

Harbin, the capital of the Heilongjiang Province, is located in the Song-Nen

Temporal similarity and period grouping

Temporal CA generated a dendrogram (Fig. 2), grouping 12 months into three clusters with significant differences by using a Sneath index of 2/3 Dmax. Cluster 1 (the first period) included January and February, which corresponds to the low flow period. In this period, all streams are so severely icebound that the depth of the ice layer above the water body usually reaches 0.8–1.5 m. Cluster 2 (the second period) included May, June, July, August, September and October, which closely corresponds

Conclusion

In this case study, different multivariate statistical techniques were successfully used to assess temporal and spatial variation in surface water quality of the Songhua River and identify the main contaminants and their sources of the sampling sites in the Harbin region.

Hierarchical cluster analysis grouped 6 monitoring sites into three regions (A, B and C) and classified 12 months into three periods that each share similar water quality characteristics. It offers reliable classification of

Acknowledgements

This study was funded in part by the Natural Science Foundation of China (50821002), Creative Research Groups of China (51121062), China Scholarship Council (2010612257) and the University of California Agricultural Experiment Station. The authors would like to sincerely thank Professor Wu-Seng Lung for some suggestions on the manuscript when he was visiting HIT in 2010. The monitoring work was conducted by the Harbin Environmental Protection Bureau.

Yi Wang. Ph.D candidate of Harbin Institute of Technology (HIT) and visiting scholar of University of California, Riverside (UCR). Research interests: Environment modeling and software; Environment management.

References (36)

Cited by (228)

  • Degradation of contaminants of PPCPs by photocatalysis for water purification: Kinetics, mechanisms, and cytotoxicity analysis

    2023, Chemical Engineering Journal
    Citation Excerpt :

    Considering that the general parameters in Table S1 were unable to completely depict the seasonal variation of water samples in SR, the results of other investigations into the seasonal variation of water quality in this water source may be consulted. According to Wang et al. [23], the OH requirements in the SR were significantly increased during the rainy alga season because of the production of more low-molecular-weight dissolved organic matter (LW-DOM). Moreover, LW-DOM can react faster with OH, allowing it to pass through the microfiltration or reverse osmosis pretreatment.

View all citing articles on Scopus

Yi Wang. Ph.D candidate of Harbin Institute of Technology (HIT) and visiting scholar of University of California, Riverside (UCR). Research interests: Environment modeling and software; Environment management.

Peng Wang. Ph.D. Associate Dean of School of Municipal and Environmental Engineering, HIT and Chief Professor of State Key Laboratory of Urban Water Resources and Environment in the research fields of urban water ecological security. Research interests: Environmental planning and management; Environment digital simulation, Warning and emergency decision support system; Environmental microwave chemistry.

Yujun Bai. Senior Engineer of Harbin Environmental Monitoring Center. Research interests: Ecological monitoring; Environment quality monitoring and assessment.

Zaixing Tian. Postgraduate, School of Municipal and Environmental Engineering, HIT. Research interests: Water quality modeling and simulation.

Jingwen Li. Postgraduate, School of Municipal and Environmental Engineering, HIT. Research interests: Environment planning and management; Sediment simulation.

Xue Shao. Postgraduate, School of Municipal and Environmental Engineering, HIT. Research interests: Environmental impact assessment.

Laura F. Mustavich. Visiting Research Associate, University of California, Riverside. Research interests: Mathematical modeling and biostatistics.

Bai-Lian Li. Professor and Director, University of California, Riverside. Research interests: Ecological complexity and modeling, ecosystem service assessment, mathematical and theoretical ecology, spatial dynamic modeling.

View full text