Skip to main content
Erschienen in: Bulletin of Engineering Geology and the Environment 5/2022

01.05.2022 | Original Paper

Advanced hyperparameter optimization for improved spatial prediction of shallow landslides using extreme gradient boosting (XGBoost)

verfasst von: Taskin Kavzoglu, Alihan Teke

Erschienen in: Bulletin of Engineering Geology and the Environment | Ausgabe 5/2022

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Machine learning algorithms have progressively become a part of landslide susceptibility mapping practices owing to their robustness in dealing with complicated and non-linear mechanisms of landslides. However, the internal structures of such algorithms contain a set of hyperparameter configurations whose correct setting is crucial to get the highest achievable performance. This current study investigates the effectiveness and robustness of advanced optimization algorithms, including random search (RS), Bayesian optimization with Gaussian Process (BO-GP), Bayesian optimization with Tree-structured Parzen Estimator (BO-TPE), genetic algorithm (GA), and Hyperband method, for optimizing the hyperparameters of the eXtreme Gradient Boosting (XGBoost) algorithm in the spatial prediction of landslides. 12 causative factors were considered to produce landslide susceptibility maps (LSMs) for the Trabzon province of Turkey, where translational shallow landslides are ubiquitous. Five accuracy metrics, including overall accuracy (OA), precision, recall, F1-score, area under the receiver operating characteristic curve (AUC), and a statistical significance test were employed to measure the effectiveness of the optimization strategies on XGBoost algorithm. Compared to the XGBoost model with default setting, the optimized models provided a significant improvement of up to 13% in terms of overall accuracy, which was also ascertained by McNemar’s test. AUC analysis revealed that having statistically similar performances, GA (0.942) and Hyperband (0.922) methods had the highest predictive abilities, followed by BO-GP (0.920), BO-TPE (0.899), and RS (0.894). Analysis of computational cost efficiency showed that the Hyperband approach (40.3 s) was much faster (about 13 times) than the GA in hyperparameter tuning, and thus appeared to be the best optimization algorithm for the problem under consideration.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Bergstra J, Bengio Y (2012) Random search for hyper-parameter optimization. J Mach Learn Res 13:281–305 Bergstra J, Bengio Y (2012) Random search for hyper-parameter optimization. J Mach Learn Res 13:281–305
Zurück zum Zitat Chen T, Guestrin C (2016) XGBoost: A scalable tree boosting system. Proc. 22nd ACM SIGKDD Int Conf Knowl Discov Data Mining, San Fr. CA, USA 13–17 Augu, 785–794 Chen T, Guestrin C (2016) XGBoost: A scalable tree boosting system. Proc. 22nd ACM SIGKDD Int Conf Knowl Discov Data Mining, San Fr. CA, USA 13–17 Augu, 785–794
Zurück zum Zitat Devkota KC, Regmi AD, Pourghasemi HR, Yoshida K, Pradhan B, Ryu IC, Dhital MR, Althuwaynee OF (2013) Landslide susceptibility mapping using certainty factor, index of entropy and logistic regression models in GIS and their comparison at Mugling-Narayanghat road section in Nepal Himalaya. Nat Hazards 65:135–165. https://doi.org/10.1007/s11069-012-0347-6CrossRef Devkota KC, Regmi AD, Pourghasemi HR, Yoshida K, Pradhan B, Ryu IC, Dhital MR, Althuwaynee OF (2013) Landslide susceptibility mapping using certainty factor, index of entropy and logistic regression models in GIS and their comparison at Mugling-Narayanghat road section in Nepal Himalaya. Nat Hazards 65:135–165. https://​doi.​org/​10.​1007/​s11069-012-0347-6CrossRef
Zurück zum Zitat Haque U, Blum P, da Silva PF, Andersen P, Pilz J, Chalov SR, Malet JP, Auflič MJ, Andres N, Poyiadji E, Lamas PC, Zhang W, Peshevski I, Pétursson HG, Kurt T, Dobrev N, García-Davalillo JC, Halkia M, Ferri S, Gaprindashvili G, Engström J, Keellings D (2016) Fatal landslides in Europe. Landslides 13:1545–1554. https://doi.org/10.1007/s10346-016-0689-3CrossRef Haque U, Blum P, da Silva PF, Andersen P, Pilz J, Chalov SR, Malet JP, Auflič MJ, Andres N, Poyiadji E, Lamas PC, Zhang W, Peshevski I, Pétursson HG, Kurt T, Dobrev N, García-Davalillo JC, Halkia M, Ferri S, Gaprindashvili G, Engström J, Keellings D (2016) Fatal landslides in Europe. Landslides 13:1545–1554. https://​doi.​org/​10.​1007/​s10346-016-0689-3CrossRef
Zurück zum Zitat Jaedicke C, Van Den Eeckhaut M, Nadim F, Hervás J, Kalsnes B, Vangelsten BV, Smith JT, Tofani V, Ciurean R, Winter MG, Sverdrup-Thygeson K, Syre E, Smebye H (2014) Identification of landslide hazard and risk “hotspots” in Europe. Bull Eng Geol Environ 73:325–339. https://doi.org/10.1007/s10064-013-0541-0CrossRef Jaedicke C, Van Den Eeckhaut M, Nadim F, Hervás J, Kalsnes B, Vangelsten BV, Smith JT, Tofani V, Ciurean R, Winter MG, Sverdrup-Thygeson K, Syre E, Smebye H (2014) Identification of landslide hazard and risk “hotspots” in Europe. Bull Eng Geol Environ 73:325–339. https://​doi.​org/​10.​1007/​s10064-013-0541-0CrossRef
Zurück zum Zitat Kavzoglu T, Teke A, Bilucan F (2020) Effectiveness of machine learning algorithms in landslide susceptibility mapping: A case study of Trabzon Province, Turkey. In: Asian Conference on Remote Sensing (ACRS), November 2020. Deqing, China Kavzoglu T, Teke A, Bilucan F (2020) Effectiveness of machine learning algorithms in landslide susceptibility mapping: A case study of Trabzon Province, Turkey. In: Asian Conference on Remote Sensing (ACRS), November 2020. Deqing, China
Zurück zum Zitat Lundberg SM, Lee SI (2017) A unified approach to interpreting model predictions. Adv Neural Inf Process Syst 4766–4775 Lundberg SM, Lee SI (2017) A unified approach to interpreting model predictions. Adv Neural Inf Process Syst 4766–4775
Zurück zum Zitat Rong G, Alu S, Li K, Su Y, Zhang J, Zhang Y, Li T (2020) Rainfall induced landslide susceptibility mapping based on Bayesian optimized random forest and gradient boosting decision tree models—a case study of Shuicheng County, China. Water 12:3066. https://doi.org/10.3390/w12113066CrossRef Rong G, Alu S, Li K, Su Y, Zhang J, Zhang Y, Li T (2020) Rainfall induced landslide susceptibility mapping based on Bayesian optimized random forest and gradient boosting decision tree models—a case study of Shuicheng County, China. Water 12:3066. https://​doi.​org/​10.​3390/​w12113066CrossRef
Zurück zum Zitat Tagil S, Jeff J (2008) GIS-based automated landform classification and topographic, landcover and geologic attributes of landforms around the Yazoren Polje, Turkey. J Appl Sci 8:910–921CrossRef Tagil S, Jeff J (2008) GIS-based automated landform classification and topographic, landcover and geologic attributes of landforms around the Yazoren Polje, Turkey. J Appl Sci 8:910–921CrossRef
Zurück zum Zitat Teke A, Kavzoglu T (2021) Determination of effective predisposing factors using Random Forest-Based Gini Index in landslide susceptibility mapping. In: 2nd International Geoinformation Days (IGD), May 2021. Mersin, Turkey Teke A, Kavzoglu T (2021) Determination of effective predisposing factors using Random Forest-Based Gini Index in landslide susceptibility mapping. In: 2nd International Geoinformation Days (IGD), May 2021. Mersin, Turkey
Zurück zum Zitat Wu H (2021) Theory of ensemble forecasting - with applications in transport modeling. PhD Thesis. The University of Sydney Wu H (2021) Theory of ensemble forecasting - with applications in transport modeling. PhD Thesis. The University of Sydney
Metadaten
Titel
Advanced hyperparameter optimization for improved spatial prediction of shallow landslides using extreme gradient boosting (XGBoost)
verfasst von
Taskin Kavzoglu
Alihan Teke
Publikationsdatum
01.05.2022
Verlag
Springer Berlin Heidelberg
Erschienen in
Bulletin of Engineering Geology and the Environment / Ausgabe 5/2022
Print ISSN: 1435-9529
Elektronische ISSN: 1435-9537
DOI
https://doi.org/10.1007/s10064-022-02708-w

Weitere Artikel der Ausgabe 5/2022

Bulletin of Engineering Geology and the Environment 5/2022 Zur Ausgabe