Skip to main content
Log in

Implementation of free and open-source semi-automatic feature engineering tool in landslide susceptibility mapping using the machine-learning algorithms RF, SVM, and XGBoost

  • Original Paper
  • Published:
Stochastic Environmental Research and Risk Assessment Aims and scope Submit manuscript

Abstract

Various machine learning (ML) techniques have been recommended and used in the literature to produce landslide susceptibility map (LSM). On the other hand, feature engineering (FE) is an important topic in ML studies, but the concept is ignored by most research. In this study, a novel FE framework, including feature selection, feature transformation, feature binning, and feature weighting, is proposed to produce LSMs using eXtreme gradient boosting (XGBoost), random forest (RF), and support vector machine (SVM). For this purpose, first, thirteen landslide conditioning factors used in data preprocessing were utilized for producing LSM models in the study area, Babadag district of Denizli Province in the Aegean region of Turkey. Second, two irrelevant factors eliminated from the input feature subset using the feature selection in the FE framework. Third, features determined as skewed data were converted into symmetric form by applying feature transformation analysis with log transformation. Then, the remaining factors having continuous values were turned into categorical values using the quantile classifier technique. During the feature weighting phase, four different feature weighting methods, namely, eXtreme Gradient Boosting, random forest (RF), non-negative least squares (NNLS), and Frequency Ratio, were utilized to calculate the weights in each subclass of each landslide-related factor. In addition, the proposed feature subsets were also compared with raw data. At the end of process, the XGBoost model constructed with a FR-selected subset (Overall Accuracy (Acc) = 0.907 and area under curve (AUC) = 0.9822) outperformed both raw (Acc = 0.874; AUC = 0.960) and other methods (i.e., RF–FR and SVM–NNLS). Consequently, the study results revealed that the proposed FE approach could be a useful framework to increase the performance of ML techniques in identifying and extracting relevant features to develop highly optimized and enriched models.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

References

  • Al-Najjar HAH, Pradhan B, Kalantar B, Sameen MI, Santosh M, Alamri A (2021) Landslide susceptibility modeling: an integrated novel method based on machine learning feature transformation. Remote Sens 13(16):3281

    Article  Google Scholar 

  • Alsahaf A, Azzopardi G, Ducro B, Veerkamp RF, Petkov N (2018) Predicting slaughter weight in pigs with regression tree ensembles. Int J Intell Syst 310:1–9

    Google Scholar 

  • Arabameri A, Saha S, Roy J, Chen W, Blaschke T, Bui DT (2020) Landslide susceptibility evaluation and management using different machine learning methods in the Gallicash River Watershed, Iran. Remote Sens 12:3

    Article  Google Scholar 

  • Araya-Polo M, Jennings J, Adler A, Dahlke T (2018) Deep-learning tomography. Lead Edge 37(1):58–66

    Article  Google Scholar 

  • Asadi M, Goli Mokhtari L, Shirzadi A, Shahabi H, Bahrami S (2022) A comparison study on the quantitative statistical methods for spatial prediction of shallow landslides (case study: Yozidar-Degaga Route in Kurdistan Province, Iran). Environ Earth Sci 81:51

    Article  Google Scholar 

  • Ayalew L, Yamagishi H (2005) The application of GIS-based logistic regression for landslide susceptibility mapping in the Kakuda-Yahiko mountains, Central Japan. Geomorphology 65(1–2):15–31

    Article  Google Scholar 

  • Ballabio C, Sterlacchini S (2012) Support vector machines for landslide susceptibility mapping: the Staffora river basin case study, Italy. Math Geosci 44(1):47–70

    Article  Google Scholar 

  • Blessie EC, Karthikeyan E (2012) Sigmis: A feature selection algorithm using correlation based method. J Algorithm Comput Technol 6(3):385–394

    Article  Google Scholar 

  • Bonavita M, Arcucci R, Carrassi A, Dueben P, Geer AJ, Le Saux B, Longepe N, Mathieu PP, Raynaud L (2021) Machine learning for earth system observation and prediction. B Am Meteorol Soc 102(4):E710–E716

    Article  Google Scholar 

  • Bonham-Carter GF (1994) Geographic information systems for geoscientists: Modelling with GIS. Pergamon/Elsevier, New York

    Google Scholar 

  • Boutsidis C, Drineas P (2009) Random projections for the nonnegative least-squares problem. Linear Algebra Appl 431(5–7):760–771

    Article  Google Scholar 

  • Breiman L (2001) Random forests. Mach Learn 45:5–32

    Article  Google Scholar 

  • Bressan TS, de Souza MK, Girelli TJ, Chemale F (2020) Evaluation of machine learning methods for lithology classification using geophysical data. Comput Geosci 139:104475

    Article  Google Scholar 

  • Bui DT, Tuan TA, Klempe H, Pradhan B, Revhaug I (2016) Spatial prediction models for shallow landslide hazards: A comparative assessment of the efficacy of support vector machines, artificial neural networks, kernel logistic regression, and logistic model tree. Landslides 13(2):361–378

    Article  Google Scholar 

  • Can R, Kocaman S, Gokceoglu C (2021) A comprehensive assessment of XGBoost algorithm for landslide susceptibility mapping in the upper basin of Ataturk dam, Turkey. Appl Sci-Basel 11(11):4993

    Article  CAS  Google Scholar 

  • Can T, Tekin S (2019) Landslide susceptibility assessment around Babadag (Denizli) town using Logistic Regression method. Kahramanmaraa Sutcu Imam University J Eng Sci 22:48–56

    Google Scholar 

  • Celik SB, Kumsar H, Aydan O (2019) Dynamic model tests on the Babadag-Gundogdu Landslide (Denizli-Turkey). ISRM Rock Dynamics Summit; 2019; ISRM-RDS-2019-083.

  • Chen TQ, Guestrin C (2016) XGBoost: a scalable tree boosting system. In: Kdd’16: proceedings of the 22nd ACM Sigkdd international conference on knowledge discovery and data mining, pp 785–794

  • Chen W, Li WP, Hou EK, Zhao Z, Deng ND, Bai HY, Wang DZ (2014) Landslide susceptibility mapping based on GIS and information value model for the Chencang District of Baoji, China. Arab J Geosci 7(11):4499–4511

    Article  Google Scholar 

  • Chen W, Xie XS, Wang JL, Pradhan B, Hong HY, Bui DT, Duan Z, Ma JQ (2017) A comparative study of logistic model tree, random forest, and classification and regression tree models for spatial prediction of landslide susceptibility. CATENA 151:147–160

    Article  Google Scholar 

  • Chowdhuri I, Pal SC, Arabameri A, Ngo PTT, Chakrabortty R, Malik S, Das B, Roy P (2020) Ensemble approach to develop landslide susceptibility map in landslide dominated Sikkim Himalayan region, India. Environ Earth Sci 79(20):476

    Article  Google Scholar 

  • Colkesen I, Sahin EK, Kavzoglu T (2016) Susceptibility mapping of shallow landslides using kernel-based Gaussian process, support vector machines and logistic regression. J Afr Earth Sci 118:53–64

    Article  Google Scholar 

  • Cramer H (1946) Mathematical methods of statistics. Princeton University Press, Princeton

    Google Scholar 

  • Demir S, Sahin EK (2022) Comparison of tree-based machine learning algorithms for predicting liquefaction potential using canonical correlation forest, rotation forest, and random forest based on CPT data. Soil Dyn Earthq Eng 154:107130

    Article  Google Scholar 

  • Dialameh M, Jahromi MZ (2017) A general feature-weighting function for classification problems. Expert Syst Appl 72:177–188

    Article  Google Scholar 

  • Dietterich TG (1998) Approximate statistical tests for comparing supervised classification learning algorithms. Neural Comput 10(7):1895–1923

    Article  CAS  Google Scholar 

  • Do HM, Yin KL, Guo ZZ (2020) A comparative study on the integrative ability of the analytical hierarchy process, weights of evidence and logistic regression methods with the flow-r model for landslide susceptibility assessment. Geomat Nat Haz Risk 11(1):2449–2485

    Article  Google Scholar 

  • Dong G, Liu H (2018) Feature engineering for machine learning and data analytics, 1st edn. CRC Press. https://doi.org/10.1201/9781315181080

  • Du GL, Zhang YS, Yang ZH, Guo CB, Yao X, Sun DY (2019) Landslide susceptibility mapping in the region of eastern Himalayan syntaxis, Tibetan Plateau, China: a comparison between analytical hierarchy process information value and logistic regression-information value methods. B Eng Geol Environ 78(6):4201–4215

    Article  Google Scholar 

  • Duboue P (2020) The art of feature engineering: essentials for machine learning. Cambridge University Press, Cambridge

    Book  Google Scholar 

  • Duman TY, Can T, Emre O, Kecer M, Dogan A, Ates S, Durmaz S (2005) Landslide inventory of northwestern Anatolia. Turkey Eng Geol 77(1–2):99–114

    Article  Google Scholar 

  • Fang ZC, Wang Y, Peng L, Hong HY (2021) A comparative study of heterogeneous ensemble-learning techniques for landslide susceptibility mapping. Int J Geogr Inf Sci 35(2):321–347

    Article  Google Scholar 

  • Felicísimo A, Cuartero A, Remondo J, Quiros E (2013) Mapping landslide susceptibility with logistic regression, multiple adaptive regression splines, classification and regression trees, and maximum entropy methods: a comparative study. Landslides 10(2):175–189

    Article  Google Scholar 

  • Fell R, Corominas J, Bonnard C, Cascini L, Leroi E, Savage WZ, Eng JJTCL (2008) Guidelines for landslide susceptibility, hazard and risk zoning for land-use planning commentary. Eng Geol 102(3–4):99–111

    Article  Google Scholar 

  • Feng C, Wang H, Lu N, Chen T, He H, Lu Y, Tu XM (2014) Log-transformation and its implications for data analysis. Shanghai Arch Psychiatry 26(2):105–109

    Google Scholar 

  • Fischer MM, Wang JF (2011) Spatial data analysis: models, methods and techniques. Spatial data analysis: models, methods and techniques. Springer, Berlin

    Book  Google Scholar 

  • Foody GM (2004) Thematic map comparison: evaluating the statistical significance of differences in classification accuracy. Photogramm Eng Rem S 70(5):627–633

    Article  Google Scholar 

  • Guyon I, Elisseeff A (2003) An introduction to variable and feature selection. J Mach Learn Res 3:1157–1182

    Google Scholar 

  • Guzzetti F, Mondini AC, Cardinali M, Fiorucci F, Santangelo M, Chang KT (2012) Landslide inventory maps: new tools for an old problem. Earth-Sci Rev 112(1–2):42–66

    Article  Google Scholar 

  • Guzzetti F, Reichenbach P, Ardizzone F, Cardinali M, Galli M (2006) Estimating the quality of landslide susceptibility models. Geomorphology 81(1–2):166–184

    Article  Google Scholar 

  • Hancer M (2013) Study of the structural evolution of the Babadağ-Honaz and Pamukkale fault zones and the related earthquake risk potential of the Buldan region in SW Anatolia, east of the Mediterranean. J Earth Sci 24:397–409

    Article  Google Scholar 

  • He R, Liu Y, Zhang H (2020) Study on automatic classification of arrhythmias. In: Liu C, Li J (eds) Feature engineering and computational intelligence in ECG monitoring. Springer, Singapore

    Google Scholar 

  • He X, Zhao KY, Chu XW (2021) AutoML: a survey of the state-of-the-art. Knowl-Based Syst 212:106622

    Article  Google Scholar 

  • Helmy T, Fatai A, Faisal K (2010) Hybrid computational models for the characterization of oil and gas reservoirs. Expert Syst Appl 37:5353–5363

    Article  Google Scholar 

  • Ho L, Legere M, Li TB, Levine S, Hao K, Valcarcel B, Pasinetti GM (2017) Autonomic nervous system dysfunctions as a basis for a predictive model of risk of neurological disorders in subjects with prior history of traumatic brain injury: implications in alzheimer’s disease. J Alzheimer’s Dis 56(1):305–315

    Article  Google Scholar 

  • Hong HY, Liu JZ, Bui DT, Pradhan B, Acharya TD, Pham BT, Zhu AX, Chen W, Bin Ahmad B (2018) Landslide susceptibility mapping using J48 Decision Tree with AdaBoost, Bagging and Rotation Forest ensembles in the Guangchang area (China). CATENA 163:399–413

    Article  Google Scholar 

  • Hussain MA, Chen ZL, Kalsoom I, Asghar A, Shoaib M (2022) Landslide susceptibility mapping using machine learning algorithm: a case study along Karakoram Highway (KKH). Pakistan J Indian Soc Remote 50(5):849–866

    Article  Google Scholar 

  • Jennifer JJ, Saravanan S (2021) Artificial neural network and sensitivity analysis in the landslide susceptibility mapping of Idukki district, India. Geocarto Int. https://doi.org/10.1080/10106049.2021.1923831

    Article  Google Scholar 

  • Joanes DN, Gill CA (1998) Comparing measures of sample skewness and kurtosis. J R Stat Soc D-Sta 47(1):183–189

    Google Scholar 

  • Joshi D, Patidar AK, Mishra A, Mishra A, Agarwal S, Pandey A, Dewangan BK, Choudhury T (2021) Prediction of sonic log and correlation of lithology by comparing geophysical well log data using machine learning principles. GeoJournal. https://doi.org/10.1007/s10708-021-10502-6

    Article  Google Scholar 

  • Kadavi PR, Lee CW, Lee S (2019) Landslide-susceptibility mapping in Gangwon-Do, South Korea, using logistic regression and decision tree models. Environ Earth Sci 78(4):1–17

    Article  Google Scholar 

  • Kalantar B, Ueda N, Saeidi V, Ahmadi K, Halin AA, Shabani F (2020) Landslide susceptibility mapping: Machine and ensemble learning based on remote sensing big data. Remote Sens-Basel 12(11):1737

    Article  Google Scholar 

  • Kamran KV, Feizizadeh B, Khorrami B, Ebadi Y (2021) A comparative approach of support vector machine kernel functions for GIS-based landslide susceptibility mapping. Appl Geomat 13(4):837–851

    Article  Google Scholar 

  • Kavzoglu T, Sahin EK, Colkesen I (2014) Landslide susceptibility mapping using GIS-based multi-criteria decision analysis, support vector machines, and logistic regression. Landslides 11(3):425–439

    Article  Google Scholar 

  • Kayihan K, Demirci R, Etiz A (2007) Babadag Ilcesi Gundogdu Mahallesi Heyelan Afeti Uygulamasi ve Yasal Surec. TMMOB Afet Sempozyumu, 5–7 December, 2007, Ankara 405-412 (in Turkish)

  • Keerthi SS, Lin CJ (2003) Asymptotic behaviors of support vector machines with gaussian kernel. Neural Comput 15(7):1667–1689

    Article  Google Scholar 

  • Khanam Z, Alkhaldi S (2019) An intelligent recommendation engine for selecting the University for Graduate Courses in KSA: SARS Student Admission Recommender System. In: International conference on inventive computation technologies. Springer, Cham, pp 711–722

  • Konak N, Goktas F (2004) 1/100.000 scaled geological maps of Turkey, Denizli M21 Quadrangle. Ankara, Turkey: MTA (in Turkish)

  • Kuhn M, Johnson K (2019) Feature engineering and selection: a practical approach for predictive models, 1st edn. Chapman and Hall/CRC, Berlin

    Book  Google Scholar 

  • Kumsar H, Aydan O, Tano H, Celik SB, Ulusay R (2016) An integrated geomechanical investigation, multi-parameter monitoring and analyses of Babadag-Gundogdu creep-like landslide. Rock Mech Rock Eng 49(6):2277–2299

    Article  Google Scholar 

  • Kutlug Sahin E, Ipbuker C, Kavzoglu T (2017) Investigation of automatic feature weighting methods (Fisher, Chi-square and Relief-F) for landslide susceptibility mapping. Geocarto Int 32(9):956–977

    Article  Google Scholar 

  • Lawson CL, Hanson RJ (1995) Solving least squares problems, vol 15. Society for Industrial Mathematics. https://doi.org/10.1137/1.9781611971217

  • Liu R, Li L, Pirasteh S, Lai Z, Yang X, Shahabi H (2021) The performance quality of LR, SVM, and RF for earthquake-induced landslides susceptibility mapping incorporating remote sensing imagery. Arab J Geosci 14(4):259

    Article  Google Scholar 

  • Lu D, Ricciuto D (2019) Efficient surrogate modeling methods for large-scale Earth system models based on machine-learning techniques. Geosci Model Dev 12(5):1791–1807

    Article  Google Scholar 

  • Mandal K, Saha S, Mandal S (2021) Applying deep learning and benchmark machine learning algorithms for landslide susceptibility modelling in Rorachu river basin of Sikkim Himalaya. India Geosci Front 12(5):101203

    Article  Google Scholar 

  • Marzan I, Marti D, Lobo A, Alcalde J, Ruiz M, Alvarez-Marron J, Carbonell R (2021) Joint interpretation of geophysical data: Applying machine learning to the modeling of an evaporitic sequence in Villar de Canas (Spain). Eng Geol 288:106126. https://doi.org/10.1016/j.enggeo.2021.106126

    Article  Google Scholar 

  • Merghadi A, Abderrahmane B, Bui DT (2018) Landslide susceptibility Assessment at Mila Basin (Algeria): a comparative assessment of prediction capability of advanced machine learning methods. ISPRS Int J Geo-Inf. 7(7):268

    Article  Google Scholar 

  • Meten M, PrakashBhandary N, Yatabe R (2015) Effect of landslide factor combinations on the prediction accuracy of landslide susceptibility maps in the Blue Nile Gorge of Central Ethiopia. Geoenviron Disasters 2(1):9

    Article  Google Scholar 

  • Meyer D, Leisch F, Hornik K (2003) The support vector machines under test. Neurocomputing 55:169–186

    Article  Google Scholar 

  • Micheletti N, Foresti L, Robert S, Leuenberger M, Pedrazzini A, Jaboyedoff M, Kanevski M (2014) Machine learning feature selection methods for landslide susceptibility mapping. Math Geosci 46(1):33–57

    Article  Google Scholar 

  • NASA Landsat Program, 2019, Landsat 8 OLI, LC08_L2SP_179034_20190825_20200826_02_T1, GeoCover, USGS, Turkey, 25/08/2019

  • Nguyen HT, Phan NYK, Luong HH, Le TP, Tran NC (2020a) Efficient discretization approaches for machine learning techniques to improve disease classification on gut microbiome composition data. Adv Sci Technol Eng Syst J 5(3):547–556

    Article  Google Scholar 

  • Nguyen PT, Ha DH, Jaafari A, Nguyen HD, Phong TV, Al-Ansari N, Prakash I, Le HV, Pham BT (2020b) Groundwater potential mapping combining artificial neural network and real adaboost ensemble technique: the Daknong Province case-study, Vietnam. Int J Env Res Pub Health 17(7):2473

    Article  Google Scholar 

  • Ochoa LH, Nino LF, Vargas CA (2018) Fast determination of earthquake depth using seismic records of a single station, implementing machine learning techniques. Ingenieria E Investigacion 38(2):97–103

    Google Scholar 

  • Osborne J (2008) Best practices in data transformation: the overlooked effect of minimum values. In: Osborne J (ed) Best practices in quantitative methods. SAGE Publications, Berlin, pp 197–204

    Chapter  Google Scholar 

  • Ozdemir MA, Cirak O, Bozyurt O, Kulaksiz EE (2021) GIS based landslide sensitivity analysis of Babadag district (Denizli/Turkey) landslide areas. Jass Studies 14(85):289–308

    Article  Google Scholar 

  • Ozdemir S, Susarla D (2018) Feature engineering made easy: identify unique features from your dataset in order to build powerful machine learning systems. Packt Publishing Ltd, Birmingham

    Google Scholar 

  • Pardeshi SD, Autade SE, Pardeshi SS (2013) Landslide hazard assessment: recent trends and techniques. Springerplus 2:523. https://doi.org/10.1186/2193-1801-2-523

    Article  Google Scholar 

  • Pham BT, Pradhan B, Bui DT, Prakash I, Dholakia MB (2016) A comparative study of different machine learning methods for landslide susceptibility assessment: a case study of Uttarakhand area (India). Environ Model Softw 84:240–250

    Article  Google Scholar 

  • Pham BT, Shirzadi A, Shahabi H, Omidvar E, Singh SK, Sahana M, Asl DT, Bin Ahmad B, Quoc NK, Lee S (2019) Landslide susceptibility assessment by novel hybrid machine learning algorithms. Sustainability 11(16):4386

    Article  Google Scholar 

  • Pourghasemi HR, Gayen A, Park S, Lee CW, Lee S (2018) Assessment of landslide-prone areas and their zonation using logistic regression, logitboost, and naivebayes machine-learning algorithms. Sustainability 11(16):4386. https://doi.org/10.3390/su11164386

    Article  Google Scholar 

  • Pradhan AMS, Kim YT (2014) Relative effect method of landslide susceptibility zonation in weathered granite soil: A case study in deokjeok-ri creek, south korea. Nat Hazards 72(2):1189–1217

    Article  Google Scholar 

  • Probst P, Wright MN, Boulesteix AL (2019) Hyperparameters and tuning strategies for random forest. Wiley Interdiscip Rev Data Min Knowl Discov 9(3):e1301

    Article  Google Scholar 

  • Qin C, Zhang Y, Bao F, Zhang C, Liu P, Liu P (2021) XGBoost optimized by adaptive particle swarm optimization for credit scoring. Math Probl Eng 2021:6655510

    Article  Google Scholar 

  • Reichenbach P, Rossi M, Malamud BD, Mihir M, Guzzetti F (2018) A review of statistically-based landslide susceptibility models. Earth-Sci Rev 180:60–91

    Article  Google Scholar 

  • Reichstein M, Camps-Valls G, Stevens B, Jung M, Denzler J, Carvalhais N, Prabhat (2019) Deep learning and process understanding for data-driven Earth system science. Nature 566(7743):195–204

    Article  CAS  Google Scholar 

  • Sahana M, Rehman S, Sajjad H, Hong HY (2020) Exploring effectiveness of frequency ratio and support vector machine models in storm surge flood susceptibility assessment: a study of Sundarban Biosphere Reserve, India. CATENA. https://doi.org/10.1016/j.catena.2019.104450

    Article  Google Scholar 

  • Sahin EK (2020) Assessing the predictive capability of ensemble tree methods for landslide susceptibility mapping using XGBoost, gradient boosting machine, and random forest. SN Appl Sci 2(7):1308

    Article  CAS  Google Scholar 

  • Sahin EK (2022) Comparative analysis of gradient boosting algorithms for landslide susceptibility mapping. Geocarto Int 37(9):2441–2465

    Article  Google Scholar 

  • Sahin EK, Colkesen I (2021) Performance analysis of advanced decision tree-based ensemble learning algorithms for landslide susceptibility mapping. Geocarto Int 36(11):1253–1275

    Article  Google Scholar 

  • Sahin EK, Colkesen I, Acmali SS, Akgun A, Aydinoglu AC (2020) Developing comprehensive geocomputation tools for landslide susceptibility mapping: LSM tool pack. Comput Geosci 144:104592

    Article  Google Scholar 

  • Sanz H, Valim C, Vegas E, Oller JM, Reverter F (2018) SVM-RFE: selection and visualization of the most relevant features through non-linear kernels. BMC Bioinformatics 19(1):1–18

    Article  Google Scholar 

  • Shahabi H, Hashim M (2015) Landslide susceptibility mapping using GIS-based statistical models and Remote sensing data in tropical environment. Sci Rep 5:1–15

    Article  Google Scholar 

  • Stewart J, Kennelly PJ (2010) Illuminated Choropleth Maps. Ann Assoc Am Geogr 100(3):513–534

    Article  Google Scholar 

  • Sun DL, Xu JH, Wen HJ, Wang DZ (2021) Assessment of landslide susceptibility mapping based on Bayesian hyperparameter optimization: a comparison between logistic regression and random forest. Eng Geol 281:105972. https://doi.org/10.1016/j.enggeo.2020.105972

    Article  Google Scholar 

  • Sun XH, Chen JP, Bao YD, Han XD, Zhan JW, Peng W (2018) Landslide susceptibility mapping using logistic regression analysis along the Jinsha River and its tributaries close to Derong and Deqin County. Southwestern China. ISPRS Int J Geo-Inf 7(11):438

    Article  Google Scholar 

  • Vakhshoori V, Zare M (2016) Landslide susceptibility mapping by comparing weight of evidence, fuzzy logic, and frequency ratio methods. Geomat Nat Haz Risk 7(5):1731–1752

    Article  Google Scholar 

  • Vali A, Comai S, Matteucci M (2020) Deep learning for land use and land cover classification based on hyperspectral and multispectral earth observation data: a review. Remote Sens-Basel 12(15):2495

    Article  Google Scholar 

  • van Westen CJ, Castellanos E, Kuriakose SL (2008) Spatial data for landslide susceptibility, hazard, and vulnerability assessment: an overview. Eng Geol 102(3–4):112–131

    Article  Google Scholar 

  • Walter JI, Ogwari P, Thiel A, Ferrer F, Woelfel I (2021) easyQuake: putting machine learning to work for your regional seismic network or local earthquake study. Seismol Res Lett 92(1):555–563

    Article  Google Scholar 

  • Wang SB, Zhuang JQ, Zheng J, Fan HY, Kong JX, Zhan JW (2021) Application of bayesian hyperparameter optimized random forest and XGboost model for landslide susceptibility mapping. Front Earth Sc. https://doi.org/10.3389/feart.2021.712240

    Article  Google Scholar 

  • Wu CY, Lin SY (2022) Performance assessment of event-based ensemble landslide susceptibility models in Shihmen watershed, Taiwan. Water 14(5):717

    Article  Google Scholar 

  • Xing Y, Yue J, Guo Z, Chen Y, Hu J, Travé A (2021) Large-Scale landslide susceptibility mapping using an integrated machine learning model: a case study in the Lvliang Mountains of China. Front Earth Sci 9:622. https://doi.org/10.3389/feart.2021.722491

    Article  Google Scholar 

  • Yao JY, Qin SW, Qiao SS, Che WC, Chen Y, Su G, Miao Q (2020) Assessment of landslide susceptibility combining deep learning with semi-supervised learning in Jiaohe County, Jilin Province, China. Appl Sci 10(16):5640

    Article  CAS  Google Scholar 

  • Ye P, Yu B, Chen WH, Liu K, Ye LZ (2022) Rainfall-induced landslide susceptibility mapping using machine learning algorithms and comparison of their performance in Hilly area of Fujian Province, China. Nat Hazards 113:965–995

    Article  Google Scholar 

  • Youssef AM, Pourghasemi HR (2021) Landslide susceptibility mapping using machine learning algorithms and comparison of their performance at Abha Basin, Asir Region. Saudi Arabia Geosci Front 12(2):639–655

    Article  Google Scholar 

  • Yu SW, Ma JW (2021) Deep learning for geophysics: current and future trends. Rev Geophys 59(3):e2021RG000742

    Article  Google Scholar 

  • Zhang H, Qiu D, Wu R, Deng Y, Ji D, Li T (2019) Novel framework for image attribute annotation with gene selection XGBoost algorithm and relative attribute model. Appl Soft Comput 80:57–79

    Article  Google Scholar 

  • Zhao S, Zhao Z (2021) A comparative study of landslide susceptibility mapping using SVM and PSO-SVM models based on grid and slope units. Math Probl Eng 2021:8854606. https://doi.org/10.1155/2021/8854606

    Article  Google Scholar 

  • Zhao W, Li X, Rong G, Lin M, Lin C, Yang Y (2020) DAFEE: a scalable distributed automatic feature engineering algorithm for relational datasets. Springer, Cham, pp 32–46

    Google Scholar 

  • Zhou SH, Chen GQ, Fang LG, Nie YW (2016) GIS-Based integration of subjective and objective weighting methods for regional landslides susceptibility mapping. Sustainability 8(4):334. https://doi.org/10.3390/su8040334

    Article  Google Scholar 

  • Zhu CH, Wang XP (2009) Landslide susceptibility mapping: a comparison of information and weights-of-evidence methods in Three Gorges Area. In: 2009 international conference on environmental science and information application technology, Vol III, proceedings, pp 342–346

Download references

Funding

The raw data used in this paper was obtained from the project “Development of ArcGIS Interfaces with R programming language for Landslide Susceptibility Mapping” (No. 118Y090) funded by The Scientific and Technological Research Council of Turkey (TUBITAK).

Author information

Authors and Affiliations

Authors

Contributions

Conceptualization, Methodology, Software, Writingreview & editing, Writing-original draft, Visualization.

Corresponding author

Correspondence to Emrehan Kutlug Sahin.

Ethics declarations

Competing interests

The authors declare no competing interests.

Conflict of interest

The author declares that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file 1 (PDF 3263 kb)

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Sahin, E.K. Implementation of free and open-source semi-automatic feature engineering tool in landslide susceptibility mapping using the machine-learning algorithms RF, SVM, and XGBoost. Stoch Environ Res Risk Assess 37, 1067–1092 (2023). https://doi.org/10.1007/s00477-022-02330-y

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00477-022-02330-y

Keywords

Navigation