Skip to main content
Top
Published in: Earth Science Informatics 2/2024

12-02-2024 | RESEARCH

A SMOTified extreme learning machine for identifying mineralization anomalies from geochemical exploration data: a case study from the Yeniugou area, Xinjiang, China

Authors: Alina Shayilan, Yongliang Chen

Published in: Earth Science Informatics | Issue 2/2024

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Extreme learning Machine (ELM) is a novel supervised machine learning algorithm, which has the advantages of fast-learning speed, good generalization, high classification performance, and can avoid problems such as local minimum, unreasonable learning rate, excessive number of iterations and overfitting. However, its classification performance is affected by imbalanced training data. To solve this problem, the synthetic minority oversampling technique (SMOTE) was integrated with the ELM algorithm to construct a hybrid algorithm, called SMOTified ELM, to identify polymetallic mineralization anomalies from the 1: 50,000 drainage sediment survey data in the Yeniugou area of Tokexun County, Xinjiang, China. A comparison between the SMOTified ELM model and the ELM model shows that the SMOTified ELM model is superior to the ELM model in terms of receiver operating characteristic curves (ROCs) and area under the (ROC) curves (AUCs). The ROC curve of the SMOTified ELM model is closer to the upper left corner of the ROC space than that of the ELM model. The AUC value of the SMOTfied ELM model (0.963) is higher than that of the ELM model (0.898). The polymetallic mineralization anomalies identified by the SMOTified ELM model account for 10.61% of the study area and contain 100% of known polymetallic deposits. The polymetallic mineralization anomalies identified by the ELM model account for 8.00% of the study area and contain 89% of known polymetallic deposits. Therefore, the SMOTified ELM method is a potentially useful technique for building a supervised mineralization anomaly identification model with high performance.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
go back to reference Baglama J, Reichel L (2006) Restarted block Lanczos bidiagonalization methods. Numer Algorithm 43(3):251–272ADSMathSciNet Baglama J, Reichel L (2006) Restarted block Lanczos bidiagonalization methods. Numer Algorithm 43(3):251–272ADSMathSciNet
go back to reference Birch JB, Tukey JW (1978) Exploratory data analysis. J Am Stat Assoc 73:885–886 Birch JB, Tukey JW (1978) Exploratory data analysis. J Am Stat Assoc 73:885–886
go back to reference Cao Y, Wakil K, Alyousef R et al (2020) Application of extreme learning machine in behavior of beam to column connections. Structures 25:861–867 Cao Y, Wakil K, Alyousef R et al (2020) Application of extreme learning machine in behavior of beam to column connections. Structures 25:861–867
go back to reference Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP (2002) SMOTE: synthetic minority over-sampling technique. J Artif Intell Res 16:321–357 Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP (2002) SMOTE: synthetic minority over-sampling technique. J Artif Intell Res 16:321–357
go back to reference Chawla NV, Lazarevic A, Hall LO, Bowyer KW (2003) SMOTEBoost: improving prediction of the minority class in boosting. Lect Notes Comput Sci 2838:107–119 Chawla NV, Lazarevic A, Hall LO, Bowyer KW (2003) SMOTEBoost: improving prediction of the minority class in boosting. Lect Notes Comput Sci 2838:107–119
go back to reference Chen YL, An A (2016) Application of ant colony algorithm to geochemical anomaly detection. J Geochem Explor 164:75–85 Chen YL, An A (2016) Application of ant colony algorithm to geochemical anomaly detection. J Geochem Explor 164:75–85
go back to reference Chen YL, Shayilan A (2022) Dictionary learning for multivariate geochemical anomaly detection for mineral exploration targeting. J Geochem Explor 235:106958 Chen YL, Shayilan A (2022) Dictionary learning for multivariate geochemical anomaly detection for mineral exploration targeting. J Geochem Explor 235:106958
go back to reference Chen YL, Wu W (2016) A prospecting cost-benefit strategy for mineral potential mapping based on ROC curve analysis. Ore Geol Rev 74:26–38 Chen YL, Wu W (2016) A prospecting cost-benefit strategy for mineral potential mapping based on ROC curve analysis. Ore Geol Rev 74:26–38
go back to reference Chen YL, Wu W (2017a) Application of one-class support vector machine to quickly identify multivariate anomalies from geochemical exploration data. GEEA 17:231–238 Chen YL, Wu W (2017a) Application of one-class support vector machine to quickly identify multivariate anomalies from geochemical exploration data. GEEA 17:231–238
go back to reference Chen YL, Wu W (2017b) Mapping mineral prospectivity using an extreme learning machine regression. Ore Geol Rev 80:200–213 Chen YL, Wu W (2017b) Mapping mineral prospectivity using an extreme learning machine regression. Ore Geol Rev 80:200–213
go back to reference Chen YL, Wu W (2019a) Isolation forest as an alternative data-driven mineral prospectivity mapping method with a higher data-processing efficiency. Nat Resour Res 28:31–46ADS Chen YL, Wu W (2019a) Isolation forest as an alternative data-driven mineral prospectivity mapping method with a higher data-processing efficiency. Nat Resour Res 28:31–46ADS
go back to reference Chen YL, Wu W (2019b) Separation of geochemical anomalies from the sample data of unknown distribution population using Gaussian mixture model. Comput Geosci 125:9–18ADS Chen YL, Wu W (2019b) Separation of geochemical anomalies from the sample data of unknown distribution population using Gaussian mixture model. Comput Geosci 125:9–18ADS
go back to reference Chen YL, Lu LJ, Li XB (2014a) Kernel Mahalanobis distance for multivariate geochemical anomaly recognition. J Jilin Univ (Earth Sci) 44:396–408 (In Chinese with English Abstract) Chen YL, Lu LJ, Li XB (2014a) Kernel Mahalanobis distance for multivariate geochemical anomaly recognition. J Jilin Univ (Earth Sci) 44:396–408 (In Chinese with English Abstract)
go back to reference Chen YL, Lu LJ, Li XB (2014b) Application of continuous restricted Boltzmann machine to identify multivariate geochemical anomaly. J Geochem Explor 140:56–63 Chen YL, Lu LJ, Li XB (2014b) Application of continuous restricted Boltzmann machine to identify multivariate geochemical anomaly. J Geochem Explor 140:56–63
go back to reference Chen YL, Sui YH, Shayilan A (2023) Constructing a high-performance self-training model based on support vector classifiers to detect gold mineralization-related geochemical anomalies for gold exploration targeting. Ore Geol Rev 153:105265 Chen YL, Sui YH, Shayilan A (2023) Constructing a high-performance self-training model based on support vector classifiers to detect gold mineralization-related geochemical anomalies for gold exploration targeting. Ore Geol Rev 153:105265
go back to reference Cheng QM (1999) Multifractality and spatial statistics. Comput Geosci 25:949–961ADS Cheng QM (1999) Multifractality and spatial statistics. Comput Geosci 25:949–961ADS
go back to reference Cheng QM, Agterberg FP, Ballantyne SB (1994) The separation of geochemical anomalies from background by fractal methods. J Geochem Explor 51:109–130 Cheng QM, Agterberg FP, Ballantyne SB (1994) The separation of geochemical anomalies from background by fractal methods. J Geochem Explor 51:109–130
go back to reference Cheng QM, Agterberg FP, Bonham-Carter GF (1996) A spatial analysis method for geochemical anomaly separation. J Geochem Explor 56:183–195 Cheng QM, Agterberg FP, Bonham-Carter GF (1996) A spatial analysis method for geochemical anomaly separation. J Geochem Explor 56:183–195
go back to reference El-Makky AM (2011) Statistical analyses of La, Ce, Nd, Y, Nb, Ti, P, and Zr in bedrocks and their significance in geochemical exploration at the Um Garayat Gold Mine Area, Eastern Desert. Egypt Natural Resources Research 20:157–176 El-Makky AM (2011) Statistical analyses of La, Ce, Nd, Y, Nb, Ti, P, and Zr in bedrocks and their significance in geochemical exploration at the Um Garayat Gold Mine Area, Eastern Desert. Egypt Natural Resources Research 20:157–176
go back to reference Freund Y, Schapire R (1997) A decision-theoretic generalization of on-line learning and an application to boosting. J Comput Syst Sci 55(1):119–139MathSciNet Freund Y, Schapire R (1997) A decision-theoretic generalization of on-line learning and an application to boosting. J Comput Syst Sci 55(1):119–139MathSciNet
go back to reference Gałuszka A (2007) A review of geochemical background concepts and an example using data from Poland. Environ Geol 52:861–870ADS Gałuszka A (2007) A review of geochemical background concepts and an example using data from Poland. Environ Geol 52:861–870ADS
go back to reference Grunsky EC, Agterberg FP (1988) Spatial and multivariate analysis of geochemical data from metavolcanic rocks in the Ben Nevis area, Ontario. Math Geol 20:825–861 Grunsky EC, Agterberg FP (1988) Spatial and multivariate analysis of geochemical data from metavolcanic rocks in the Ben Nevis area, Ontario. Math Geol 20:825–861
go back to reference Han H, Wang WY, Mao BH (2005) Borderline-SMOTE: a new over-sampling method in imbalanced data sets learning. In: Huang DS, Zhang XP, Huang GB (Eds.): ICIC 2005, Part I, LNCS 3644, pp 878–887 Han H, Wang WY, Mao BH (2005) Borderline-SMOTE: a new over-sampling method in imbalanced data sets learning. In: Huang DS, Zhang XP, Huang GB (Eds.): ICIC 2005, Part I, LNCS 3644, pp 878–887
go back to reference Han CM, Xiao WJ, Wan B et al (2018) Late Palaeozoic-Mesozoic endogenetic metallogenic series and geodynamic evolution in the East Tianshan Mountains. Acta Petrologica Sinica 34(7):1914–1932 (In Chinese with English Abstract) Han CM, Xiao WJ, Wan B et al (2018) Late Palaeozoic-Mesozoic endogenetic metallogenic series and geodynamic evolution in the East Tianshan Mountains. Acta Petrologica Sinica 34(7):1914–1932 (In Chinese with English Abstract)
go back to reference He H, Yang B, Garcia EA et al (2008) ADASYN: adaptive synthetic sampling approach for imbalanced learning. Neural Networks, 2008. IJCNN 2008. (IEEE World Congress on Computational Intelligence). IEEE International Joint Conference on Computational Intelligence He H, Yang B, Garcia EA et al (2008) ADASYN: adaptive synthetic sampling approach for imbalanced learning. Neural Networks, 2008. IJCNN 2008. (IEEE World Congress on Computational Intelligence). IEEE International Joint Conference on Computational Intelligence
go back to reference Huang GB, Zhu QY, Siew CK (2004) Extreme learning machine: a new learning scheme of feedforward neural networks. In: 2004 IEEE International Joint Conference on Neural Networks (IEEE Cat. No.0-4CH37541). IEEE, Budapest, pp 985–990 Huang GB, Zhu QY, Siew CK (2004) Extreme learning machine: a new learning scheme of feedforward neural networks. In: 2004 IEEE International Joint Conference on Neural Networks (IEEE Cat. No.0-4CH37541). IEEE, Budapest, pp 985–990
go back to reference Huang GB, Zhu QY, Siew CK (2006) Extreme learning machine: theory and applications. Neurocomputing 70:489–501 Huang GB, Zhu QY, Siew CK (2006) Extreme learning machine: theory and applications. Neurocomputing 70:489–501
go back to reference Huang GB, Ding X, Zhou H (2010) Optimization method based extreme learning machine for classification. Neurocomputing 74:155–163 Huang GB, Ding X, Zhou H (2010) Optimization method based extreme learning machine for classification. Neurocomputing 74:155–163
go back to reference Huang GB, Wang DH, Lan Y (2011a) Extreme learning machines: a survey. Int J Mach Learn and Cyber 2:107–122 Huang GB, Wang DH, Lan Y (2011a) Extreme learning machines: a survey. Int J Mach Learn and Cyber 2:107–122
go back to reference Huang YW, Wu DG, Li J (2011b) Structural healthy monitoring data recovery based on extreme learning machine. Comput Eng 37(16):241–243 (In Chinese with English Abstract) Huang YW, Wu DG, Li J (2011b) Structural healthy monitoring data recovery based on extreme learning machine. Comput Eng 37(16):241–243 (In Chinese with English Abstract)
go back to reference Li MB, Huang GB, Saratchandran P, Sundararajan N (2005) Fully complex extreme learning machine. Neurocomputing 68:306–314 Li MB, Huang GB, Saratchandran P, Sundararajan N (2005) Fully complex extreme learning machine. Neurocomputing 68:306–314
go back to reference Liang NY, Saratchandran P, Huang GB, Sundararajan N (2006) Classification of mental tasks from eeg signals using extreme learning machine. Int J Neur Syst 16:29–38 Liang NY, Saratchandran P, Huang GB, Sundararajan N (2006) Classification of mental tasks from eeg signals using extreme learning machine. Int J Neur Syst 16:29–38
go back to reference Ma HF, Zhang ZM, Cai GQ et al (2002) Application Geochemical zoning characteristics and prospective prediction of gold deposits in the eastern part of the Southern Tianshan Mountains. Uranium Geology 5:282–286 (In Chinese with English Abstract) Ma HF, Zhang ZM, Cai GQ et al (2002) Application Geochemical zoning characteristics and prospective prediction of gold deposits in the eastern part of the Southern Tianshan Mountains. Uranium Geology 5:282–286 (In Chinese with English Abstract)
go back to reference O’Brien JJ, Spry PG, Nettleton D et al (2015) Using random forests to distinguish gahnite compositions as an exploration guide to broken hill-type Pb–Zn–Ag deposits in the Broken Hill domain, Australia. J Geochem Explor 149:74–86 O’Brien JJ, Spry PG, Nettleton D et al (2015) Using random forests to distinguish gahnite compositions as an exploration guide to broken hill-type Pb–Zn–Ag deposits in the Broken Hill domain, Australia. J Geochem Explor 149:74–86
go back to reference Parsa M, Maghsoudi A, Yousefi M (2018) A receiver operating characteristics-based geochemical data fusion technique for targeting undiscovered mineral deposits. Nat Resour Res 27:15–28 Parsa M, Maghsoudi A, Yousefi M (2018) A receiver operating characteristics-based geochemical data fusion technique for targeting undiscovered mineral deposits. Nat Resour Res 27:15–28
go back to reference Reichstein M, Camps-Valls G, Stevens B et al (2019) Deep learning and process understanding for data-driven earth system science. Nature 566:195–204ADSPubMed Reichstein M, Camps-Valls G, Stevens B et al (2019) Deep learning and process understanding for data-driven earth system science. Nature 566:195–204ADSPubMed
go back to reference Reimann C, Filzmoser P, Garrett RG (2002) Factor analysis applied to regional geochemical data: problems and possibilities. Appl Geochem 17:185–206ADS Reimann C, Filzmoser P, Garrett RG (2002) Factor analysis applied to regional geochemical data: problems and possibilities. Appl Geochem 17:185–206ADS
go back to reference Ren TX, Zhao Y, Zhang H et al (1984) A preliminary study on the utilization of regional geochemical prospecting method in the arid and desert area of Inner Mongolia. Geophysical and Geochemical Exploration 8:284–296 (In Chinese with English Abstract) Ren TX, Zhao Y, Zhang H et al (1984) A preliminary study on the utilization of regional geochemical prospecting method in the arid and desert area of Inner Mongolia. Geophysical and Geochemical Exploration 8:284–296 (In Chinese with English Abstract)
go back to reference Rubio B, Nombela MA, Vilas F (2000) Geochemistry of major and trace elements in sediments of the Ria de Vigo (NW Spain):an assessment of metal pollution. Mar Pollut Bull 40(1):968–980 Rubio B, Nombela MA, Vilas F (2000) Geochemistry of major and trace elements in sediments of the Ria de Vigo (NW Spain):an assessment of metal pollution. Mar Pollut Bull 40(1):968–980
go back to reference Shang YM, Lu LJ, Kang QK (2019) Identification model of geochemical anomaly based on isolation forest algorithm. Global Geology 22(3):159–166 (In Chinese with English Abstract) Shang YM, Lu LJ, Kang QK (2019) Identification model of geochemical anomaly based on isolation forest algorithm. Global Geology 22(3):159–166 (In Chinese with English Abstract)
go back to reference Si Y, Xu ZP, Gao BM (2011) Study of geophysical prospecting anomaly characteristics in Caihuagou Copper Deposit, Xinjiang Province. Resour Environ Eng 25: 364–367+379 (In Chinese with English Abstract) Si Y, Xu ZP, Gao BM (2011) Study of geophysical prospecting anomaly characteristics in Caihuagou Copper Deposit, Xinjiang Province. Resour Environ Eng 25: 364–367+379 (In Chinese with English Abstract)
go back to reference Sinclair AJ (1974) Selection of threshold values in geochemical data using probability graphs. J Geochem Explor 3:129–149 Sinclair AJ (1974) Selection of threshold values in geochemical data using probability graphs. J Geochem Explor 3:129–149
go back to reference Sinclair AJ, Tessari OJ (1981) Vein geochemistry, an exploration tool in KenoHill camp, Yukon Territory, Canada. J Geochem Explor 14:1–24 Sinclair AJ, Tessari OJ (1981) Vein geochemistry, an exploration tool in KenoHill camp, Yukon Territory, Canada. J Geochem Explor 14:1–24
go back to reference Suresh S, VenkateshBabu R, Kim HJ (2009) No-reference image quality assessment using modified extreme learning machine classifier. Appl Soft Comput 9:541–552 Suresh S, VenkateshBabu R, Kim HJ (2009) No-reference image quality assessment using modified extreme learning machine classifier. Appl Soft Comput 9:541–552
go back to reference Van HM, Miche Y, Oja E, Lendasse A (2011) GPU-accelerated and parallelized ELM ensembles for large-scale regression. Neurocomputing 74:2430–2437 Van HM, Miche Y, Oja E, Lendasse A (2011) GPU-accelerated and parallelized ELM ensembles for large-scale regression. Neurocomputing 74:2430–2437
go back to reference Wang H, Zuo RG (2015) A comparative study of trend surface analysis and spectrum–area multifractal model to identify geochemical anomalies. J Geochem Explor 155:84–90 Wang H, Zuo RG (2015) A comparative study of trend surface analysis and spectrum–area multifractal model to identify geochemical anomalies. J Geochem Explor 155:84–90
go back to reference Wu W, Chen YL (2018) Application of isolation forest to extract multivariate anomalies from geochemical exploration data. Global Geology 21(1):36–47 Wu W, Chen YL (2018) Application of isolation forest to extract multivariate anomalies from geochemical exploration data. Global Geology 21(1):36–47
go back to reference Yang L, Li J, Sun YM et al (2022) Analysis of geologic features and genetic type of Liuhuangshan Cu-Pb-Zn polymetallic mine in Toksun, Xinjiang. Chin Min Eng 51:83–88 (In Chinese with English Abstract) Yang L, Li J, Sun YM et al (2022) Analysis of geologic features and genetic type of Liuhuangshan Cu-Pb-Zn polymetallic mine in Toksun, Xinjiang. Chin Min Eng 51:83–88 (In Chinese with English Abstract)
go back to reference Yeu CWT, Lim MH, Huang GB, Agarwal A, Ong YS (2006) A new machine learning paradigm for terrain reconstruction. IEEE Geosci Remote Sens Lett 3(3):382–386ADS Yeu CWT, Lim MH, Huang GB, Agarwal A, Ong YS (2006) A new machine learning paradigm for terrain reconstruction. IEEE Geosci Remote Sens Lett 3(3):382–386ADS
go back to reference Zhang RX, Huang GB, Sundararajan N, Saratchandran P (2007) Multicategory classification using an extreme learning machine for microarray gene expression cancer diagnosis. IEEE/ACM Trans Comput Biol Bioinf 4(3):485–495 Zhang RX, Huang GB, Sundararajan N, Saratchandran P (2007) Multicategory classification using an extreme learning machine for microarray gene expression cancer diagnosis. IEEE/ACM Trans Comput Biol Bioinf 4(3):485–495
go back to reference Zhang ZJ, Zuo RG, Xiong YH (2021) Detection of the multivariate geochemical anomalies associated with mineralization using a deep convolutional neural network and a pixel-pair feature method. Appl Geochem 130:104994 Zhang ZJ, Zuo RG, Xiong YH (2021) Detection of the multivariate geochemical anomalies associated with mineralization using a deep convolutional neural network and a pixel-pair feature method. Appl Geochem 130:104994
go back to reference Zuo RG, Cheng QM, Agterberg FP, Xia Q (2009) Application of singularity mapping technique to identify local anomalies using stream sediment geochemical data, a case study from Gangdese, Tibet, western China. J Geochem Explor 101:225–235 Zuo RG, Cheng QM, Agterberg FP, Xia Q (2009) Application of singularity mapping technique to identify local anomalies using stream sediment geochemical data, a case study from Gangdese, Tibet, western China. J Geochem Explor 101:225–235
go back to reference Zuo RG, Xiong YH, Wang J, Carranza EJM (2019) Deep learning and its application in geochemical mapping. Earth Sci Rev 192:1–14ADS Zuo RG, Xiong YH, Wang J, Carranza EJM (2019) Deep learning and its application in geochemical mapping. Earth Sci Rev 192:1–14ADS
Metadata
Title
A SMOTified extreme learning machine for identifying mineralization anomalies from geochemical exploration data: a case study from the Yeniugou area, Xinjiang, China
Authors
Alina Shayilan
Yongliang Chen
Publication date
12-02-2024
Publisher
Springer Berlin Heidelberg
Published in
Earth Science Informatics / Issue 2/2024
Print ISSN: 1865-0473
Electronic ISSN: 1865-0481
DOI
https://doi.org/10.1007/s12145-024-01246-1

Other articles of this Issue 2/2024

Earth Science Informatics 2/2024 Go to the issue

Premium Partner