Skip to main content
Erschienen in: Sustainable Water Resources Management 2/2024

01.04.2024 | Original Article

Classifying arsenic-contaminated waters in Tarkwa: a machine learning approach

verfasst von: Mohammed Ayisha, Matthew Nkoom, Dzigbodi Adzo Doke

Erschienen in: Sustainable Water Resources Management | Ausgabe 2/2024

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Access to clean and safe drinking water is key to the improvement of social lives in most developing countries. Due to its hazardous nature and detrimental effects on human health, increased quantities of arsenic in water bodies have been a growing global health concern in recent years. In Ghana, elevated arsenic concentration is reported in some waters in Tarkwa. However, constant monitoring of arsenic concentrations in these water sources are inhibited by the associated huge expenses. To facilitate early detection, this study aimed at developing efficient machine learning models for classifying high, medium and low levels of arsenic contamination using physical water parameters, such as total dissolved solids, pH, electrical conductivity and turbidity. These parameters were selected, because they are relatively inexpensive to measure, their data were available and they may influence the concentration of arsenic in the water. Thus, three machine learning models, namely, extra trees, random forest and decision tree, were developed and assessed using evaluation metrics, such as accuracy, precision and sensitivity. The evaluation results justified the superiority of the extra trees and random forest models over decision tree. However, all developed machine learning models generally gave remarkable performance when classifying waters with high and low levels of arsenic contamination. Moreover, the variable importance analysis revealed that pH had the strongest influence in classifying arsenic contaminated waters followed by electrical conductivity. The outcome of the study has revealed the potency of machine learning algorithms in assisting water monitoring practitioners for monitoring arsenic concentration in water sources.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Literatur
Zurück zum Zitat Abbas G, Murtaza B, Bibi I et al (2018) Arsenic uptake, toxicity, detoxification, and speciation in plants: physiological, biochemical, and molecular aspects. Int J Environ Res Public Health 15:13CrossRef Abbas G, Murtaza B, Bibi I et al (2018) Arsenic uptake, toxicity, detoxification, and speciation in plants: physiological, biochemical, and molecular aspects. Int J Environ Res Public Health 15:13CrossRef
Zurück zum Zitat Acharyya SK, Lahiri S, Raymahashay BC, Bhowmik A (2000) Arsenic toxicity of groundwater in parts of the Bengal basin in India and Bangladesh: the role of Quaternary stratigraphy and Holocene sea-level fluctuation. Environ Geol 39:1127–1137CrossRef Acharyya SK, Lahiri S, Raymahashay BC, Bhowmik A (2000) Arsenic toxicity of groundwater in parts of the Bengal basin in India and Bangladesh: the role of Quaternary stratigraphy and Holocene sea-level fluctuation. Environ Geol 39:1127–1137CrossRef
Zurück zum Zitat Ampomah EK, Qin Z, Nyame G (2020) Evaluation of tree-based ensemble machine learning models in predicting stock price direction of movement. Information 11:332CrossRef Ampomah EK, Qin Z, Nyame G (2020) Evaluation of tree-based ensemble machine learning models in predicting stock price direction of movement. Information 11:332CrossRef
Zurück zum Zitat Ayotte JD, Nolan BT, Gronberg JA (2016) Predicting arsenic in drinking water wells of the Central Valley, California. Environ Sci Technol 50:7555–7563CrossRef Ayotte JD, Nolan BT, Gronberg JA (2016) Predicting arsenic in drinking water wells of the Central Valley, California. Environ Sci Technol 50:7555–7563CrossRef
Zurück zum Zitat Baah-Ennumh TY, Adom-Asamoah G (2019) Land use challenges in mining communities—the case of Tarkwa-Nsuaem municipality. Environ Ecol Res 7:139–152CrossRef Baah-Ennumh TY, Adom-Asamoah G (2019) Land use challenges in mining communities—the case of Tarkwa-Nsuaem municipality. Environ Ecol Res 7:139–152CrossRef
Zurück zum Zitat Breiman L (2017) Classification and regression trees. Routledge, LondonCrossRef Breiman L (2017) Classification and regression trees. Routledge, LondonCrossRef
Zurück zum Zitat Brus DJ, Kempen B, Heuvelink GBM (2011) Sampling for validation of digital soil maps. Eur J Soil Sci 62:394–407CrossRef Brus DJ, Kempen B, Heuvelink GBM (2011) Sampling for validation of digital soil maps. Eur J Soil Sci 62:394–407CrossRef
Zurück zum Zitat De Ville B, Neville P (2013) Decision trees for analytics: using SAS Enterprise miner. SAS Institute, Cary De Ville B, Neville P (2013) Decision trees for analytics: using SAS Enterprise miner. SAS Institute, Cary
Zurück zum Zitat Dehghan AA, Kazemi M (2013) Measurement and comparison of heavy metals concentration in vegetables used in Mashhad. Zahedan J Res Med Sci 15:3 Dehghan AA, Kazemi M (2013) Measurement and comparison of heavy metals concentration in vegetables used in Mashhad. Zahedan J Res Med Sci 15:3
Zurück zum Zitat Ewusi A, Ahenkorah I, Kuma JSY (2017) Groundwater vulnerability assessment of the Tarkwa mining area using SINTACS approach and GIS. Ghana Min J 17:18–30CrossRef Ewusi A, Ahenkorah I, Kuma JSY (2017) Groundwater vulnerability assessment of the Tarkwa mining area using SINTACS approach and GIS. Ghana Min J 17:18–30CrossRef
Zurück zum Zitat Ewusi A, Ahenkorah I, Aikins D (2021) Modelling of total dissolved solids in water supply systems using regression and supervised machine learning approaches. Appl Water Sci 11:1–16CrossRef Ewusi A, Ahenkorah I, Aikins D (2021) Modelling of total dissolved solids in water supply systems using regression and supervised machine learning approaches. Appl Water Sci 11:1–16CrossRef
Zurück zum Zitat Geurts P, Ernst D, Wehenkel L (2006) Extremely randomized trees. Mach Learn 63:3–42CrossRef Geurts P, Ernst D, Wehenkel L (2006) Extremely randomized trees. Mach Learn 63:3–42CrossRef
Zurück zum Zitat Guo P-T, Li M-F, Luo W et al (2015) Digital mapping of soil organic matter for rubber plantation at regional scale: an application of random forest plus residuals kriging approach. Geoderma 237:49–59CrossRef Guo P-T, Li M-F, Luo W et al (2015) Digital mapping of soil organic matter for rubber plantation at regional scale: an application of random forest plus residuals kriging approach. Geoderma 237:49–59CrossRef
Zurück zum Zitat Gupta P, Vishwakarma M, Rawtani PM (2009) Assesment of water quality parameters of Kerwa Dam for drinking suitability. Int J Theor Appl Sci 1:53–55 Gupta P, Vishwakarma M, Rawtani PM (2009) Assesment of water quality parameters of Kerwa Dam for drinking suitability. Int J Theor Appl Sci 1:53–55
Zurück zum Zitat Ibrahim B, Ewusi A, Ahenkorah I (2022a) Assessing the suitability of boosting machine-learning algorithms for classifying arsenic-contaminated waters: a novel model-explainable approach using Shapley additive explanations. Water 14:3509CrossRef Ibrahim B, Ewusi A, Ahenkorah I (2022a) Assessing the suitability of boosting machine-learning algorithms for classifying arsenic-contaminated waters: a novel model-explainable approach using Shapley additive explanations. Water 14:3509CrossRef
Zurück zum Zitat Kusimi JM, Kusimi BA (2012) The hydrochemistry of water resources in selected mining communities in Tarkwa. J Geochem Explor 112:252–261CrossRef Kusimi JM, Kusimi BA (2012) The hydrochemistry of water resources in selected mining communities in Tarkwa. J Geochem Explor 112:252–261CrossRef
Zurück zum Zitat Lombard MA, Bryan MS, Jones DK et al (2021) Machine learning models of arsenic in private wells throughout the conterminous United States as a tool for exposure assessment in human health studies. Environ Sci Technol 55:5012–5023CrossRef Lombard MA, Bryan MS, Jones DK et al (2021) Machine learning models of arsenic in private wells throughout the conterminous United States as a tool for exposure assessment in human health studies. Environ Sci Technol 55:5012–5023CrossRef
Zurück zum Zitat Mahjoobi J, Etemad-Shahidi A (2008) An alternative approach for the prediction of significant wave heights based on classification and regression trees. Appl Ocean Res 30:172–177CrossRef Mahjoobi J, Etemad-Shahidi A (2008) An alternative approach for the prediction of significant wave heights based on classification and regression trees. Appl Ocean Res 30:172–177CrossRef
Zurück zum Zitat Majeed F, Ziggah YY, Kusi-Manu C et al (2022) A novel artificial intelligence approach for regolith geochemical grade prediction using multivariate adaptive regression splines. Geosyst Geoenviron 1:100038CrossRef Majeed F, Ziggah YY, Kusi-Manu C et al (2022) A novel artificial intelligence approach for regolith geochemical grade prediction using multivariate adaptive regression splines. Geosyst Geoenviron 1:100038CrossRef
Zurück zum Zitat Medunić G, Fiket Ž, Ivanić M (2020a) Arsenic contamination status in Europe, Australia, and other parts of the world. Arsen Drink Water Food 1:183–233CrossRef Medunić G, Fiket Ž, Ivanić M (2020a) Arsenic contamination status in Europe, Australia, and other parts of the world. Arsen Drink Water Food 1:183–233CrossRef
Zurück zum Zitat Nordstrom DK (2002) Worldwide occurrences of arsenic in ground water. Science (80-) 296:2143–2145CrossRef Nordstrom DK (2002) Worldwide occurrences of arsenic in ground water. Science (80-) 296:2143–2145CrossRef
Zurück zum Zitat Pal M, Mather PM (2003) An assessment of the effectiveness of decision tree methods for land cover classification. Remote Sens Environ 86:554–565CrossRef Pal M, Mather PM (2003) An assessment of the effectiveness of decision tree methods for land cover classification. Remote Sens Environ 86:554–565CrossRef
Zurück zum Zitat Peiravi R, Dehghan AA, Vahedian M (2013) Heavy metals concentrations in Mashhad drinking water network. Zahedan J Res Med Sci 15:11 Peiravi R, Dehghan AA, Vahedian M (2013) Heavy metals concentrations in Mashhad drinking water network. Zahedan J Res Med Sci 15:11
Zurück zum Zitat Petrusevski B, Sharma S, Schippers JC, Shordt K (2007) Arsenic in drinking water. IRC International Water and Sanitation Centre, Delft, pp 36–44 Petrusevski B, Sharma S, Schippers JC, Shordt K (2007) Arsenic in drinking water. IRC International Water and Sanitation Centre, Delft, pp 36–44
Zurück zum Zitat Quinlan JR (2014) C4.5: programs for machine learning. Elsevier, London Quinlan JR (2014) C4.5: programs for machine learning. Elsevier, London
Zurück zum Zitat Rodriguez-Galiano VF, Ghimire B, Rogan J et al (2012) An assessment of the effectiveness of a random forest classifier for land-cover classification. ISPRS J Photogramm Remote Sens 67:93–104CrossRef Rodriguez-Galiano VF, Ghimire B, Rogan J et al (2012) An assessment of the effectiveness of a random forest classifier for land-cover classification. ISPRS J Photogramm Remote Sens 67:93–104CrossRef
Zurück zum Zitat Sahin EK (2022) Comparative analysis of gradient boosting algorithms for landslide susceptibility mapping. Geocarto Int 37:2441–2465CrossRef Sahin EK (2022) Comparative analysis of gradient boosting algorithms for landslide susceptibility mapping. Geocarto Int 37:2441–2465CrossRef
Zurück zum Zitat Tanha J, Abdi Y, Samadi N et al (2020) Boosting methods for multi-class imbalanced data classification: an experimental review. J Big Data 7:1–47CrossRef Tanha J, Abdi Y, Samadi N et al (2020) Boosting methods for multi-class imbalanced data classification: an experimental review. J Big Data 7:1–47CrossRef
Zurück zum Zitat Welch AH, Stollenwerk KG (2003) Arsenic in ground water: geochemistry and occurrence. Springer, New YorkCrossRef Welch AH, Stollenwerk KG (2003) Arsenic in ground water: geochemistry and occurrence. Springer, New YorkCrossRef
Zurück zum Zitat Welch AH, Westjohn DB, Helsel DR, Wanty RB (2000) Arsenic in ground water of the United States: occurrence and geochemistry. Groundwater 38:589–604CrossRef Welch AH, Westjohn DB, Helsel DR, Wanty RB (2000) Arsenic in ground water of the United States: occurrence and geochemistry. Groundwater 38:589–604CrossRef
Zurück zum Zitat WHO (2004) Guidelines for drinking-water quality. World Health Organization, Geneva WHO (2004) Guidelines for drinking-water quality. World Health Organization, Geneva
Zurück zum Zitat Zhang M, Shi W, Xu Z (2020) Systematic comparison of five machine-learning models in classification and interpolation of soil particle size fractions using different transformed data. Hydrol Earth Syst Sci 24:2505–2526CrossRef Zhang M, Shi W, Xu Z (2020) Systematic comparison of five machine-learning models in classification and interpolation of soil particle size fractions using different transformed data. Hydrol Earth Syst Sci 24:2505–2526CrossRef
Zurück zum Zitat Abhishek L (2020) Optical character recognition using ensemble of SVM, MLP and extra trees classifier. In: 2020 international conference for emerging technology (INCET). IEEE, New York, pp 1–4 Abhishek L (2020) Optical character recognition using ensemble of SVM, MLP and extra trees classifier. In: 2020 international conference for emerging technology (INCET). IEEE, New York, pp 1–4
Zurück zum Zitat Beauxis-Aussalet E, Hardman L (2014) Simplifying the visualization of confusion matrix. In: 26th Benelux conference on artificial intelligence (BNAIC) Beauxis-Aussalet E, Hardman L (2014) Simplifying the visualization of confusion matrix. In: 26th Benelux conference on artificial intelligence (BNAIC)
Zurück zum Zitat Derczynski L (2016) Complementarity, F-score, and NLP Evaluation. In: Proceedings of the tenth international conference on language resources and evaluation (LREC’16). pp 261–266 Derczynski L (2016) Complementarity, F-score, and NLP Evaluation. In: Proceedings of the tenth international conference on language resources and evaluation (LREC’16). pp 261–266
Zurück zum Zitat Dickson KB, Benneh G (1980) A new geography of Ghana Longmans Dickson KB, Benneh G (1980) A new geography of Ghana Longmans
Zurück zum Zitat Géron A (2017) Hands-on machine learning with scikit-learn and tensorflow: concepts. Tools, tech build intelligent system Géron A (2017) Hands-on machine learning with scikit-learn and tensorflow: concepts. Tools, tech build intelligent system
Zurück zum Zitat Ghana Statistical Service (2014) Population and housing census: district analytical report Tarkwa Nsuaem Municipality. Ghana Statistical Service Accra, Ghana, pp 16–18 Ghana Statistical Service (2014) Population and housing census: district analytical report Tarkwa Nsuaem Municipality. Ghana Statistical Service Accra, Ghana, pp 16–18
Zurück zum Zitat Hinkle SR, Polette DJ (1999) Arsenic in ground water of the Willamette Basin, Oregon. US Department of the Interior, US Geological Survey Hinkle SR, Polette DJ (1999) Arsenic in ground water of the Willamette Basin, Oregon. US Department of the Interior, US Geological Survey
Zurück zum Zitat Howard ME (2012) Investigation of arsenic in the transition zone basin of the Mojave River Howard ME (2012) Investigation of arsenic in the transition zone basin of the Mojave River
Zurück zum Zitat IARC (2004) Some drinking-water disinfectants and contaminants, including arsenic IARC (2004) Some drinking-water disinfectants and contaminants, including arsenic
Zurück zum Zitat Medunić G, Fiket Ž, Ivanić M (2020) Arsenic contamination status in Europe, Australia, and other parts of the world BT. In: Srivastava S (ed) Arsenic in drinking water and food. Springer, Singapore, pp 183–233 Medunić G, Fiket Ž, Ivanić M (2020) Arsenic contamination status in Europe, Australia, and other parts of the world BT. In: Srivastava S (ed) Arsenic in drinking water and food. Springer, Singapore, pp 183–233
Zurück zum Zitat Natasha, Shahid M, Imran M, et al (2020) Arsenic environmental contamination status in South Asia BT. In: Srivastava S (ed) Arsenic in drinking water and food. Springer, Singapore, pp 13–39 Natasha, Shahid M, Imran M, et al (2020) Arsenic environmental contamination status in South Asia BT. In: Srivastava S (ed) Arsenic in drinking water and food. Springer, Singapore, pp 13–39
Zurück zum Zitat Owusu AM (2013) Determination of total arsenic and the relationship between the arsenic levels and other determined physicochemical properties of some biological and environmental samples from selected towns in the Amansie West district of the Ashanti Region Owusu AM (2013) Determination of total arsenic and the relationship between the arsenic levels and other determined physicochemical properties of some biological and environmental samples from selected towns in the Amansie West district of the Ashanti Region
Zurück zum Zitat WHO (2017) 2017 WHO guidelines for drinking water quality: first addendum to the fourth edition. J Am Water Work Assoc 109:44–51 WHO (2017) 2017 WHO guidelines for drinking water quality: first addendum to the fourth edition. J Am Water Work Assoc 109:44–51
Metadaten
Titel
Classifying arsenic-contaminated waters in Tarkwa: a machine learning approach
verfasst von
Mohammed Ayisha
Matthew Nkoom
Dzigbodi Adzo Doke
Publikationsdatum
01.04.2024
Verlag
Springer International Publishing
Erschienen in
Sustainable Water Resources Management / Ausgabe 2/2024
Print ISSN: 2363-5037
Elektronische ISSN: 2363-5045
DOI
https://doi.org/10.1007/s40899-024-01042-1

Weitere Artikel der Ausgabe 2/2024

Sustainable Water Resources Management 2/2024 Zur Ausgabe