Skip to main content

2018 | OriginalPaper | Buchkapitel

A Hybrid Approach for Preprocessing of Imbalanced Data in Credit Scoring Systems

verfasst von : Uma R. Salunkhe, Suresh N. Mali

Erschienen in: Intelligent Computing and Information and Communication

Verlag: Springer Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

During the last few years, classification task in machine learning is commonly used by various real-life applications. One of the common applications is credit scoring systems where the ability to accurately predict creditworthy or non-creditworthy applicants is critically important because incorrect predictions can cause major financial loss. In this paper, we aim to focus on skewed data distribution issue faced by credit scoring system. To reduce the imbalance between the classes, we apply preprocessing on the dataset which makes combined use of random re-sampling and dimensionality reduction. Experimental results on Australian and German credit datasets with the presented preprocessing technique has shown significant performance improvement in terms of AUC and F-measure.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Marqués, A. I., Vicente García, and Javier Salvador Sánchez. “Two-level classifier ensembles for credit risk assessment.” Expert Systems with Applications 39.12 (2012): 10916–10922. Marqués, A. I., Vicente García, and Javier Salvador Sánchez. “Two-level classifier ensembles for credit risk assessment.” Expert Systems with Applications 39.12 (2012): 10916–10922.
2.
Zurück zum Zitat BIS. Basel III: a global regulatory framework for more resilient banks and banking systems. (2011). Basel Committee on Banking Supervision, Bank for International Settlements, Basel. ISBN print: 92-9131-859-0. <http://www.bis.org/publ/bcbs189.pdf>. BIS. Basel III: a global regulatory framework for more resilient banks and banking systems. (2011). Basel Committee on Banking Supervision, Bank for International Settlements, Basel. ISBN print: 92-9131-859-0. <http://​www.​bis.​org/​publ/​bcbs189.​pdf>.
3.
Zurück zum Zitat Marqués, A. I., Vicente García, and Javier Salvador Sánchez. “Exploring the behaviour of base classifiers in credit scoring ensembles.” Expert Systems with Applications 39.11 (2012): 10244–10250. Marqués, A. I., Vicente García, and Javier Salvador Sánchez. “Exploring the behaviour of base classifiers in credit scoring ensembles.” Expert Systems with Applications 39.11 (2012): 10244–10250.
4.
Zurück zum Zitat Wu, Xiaojun, and SufangMeng. “E-commerce customer churn prediction based on improved SMOTE and AdaBoost.” Service Systems and Service Management (ICSSSM), 2016 13th International Conference on. IEEE, 2016. Wu, Xiaojun, and SufangMeng. “E-commerce customer churn prediction based on improved SMOTE and AdaBoost.” Service Systems and Service Management (ICSSSM), 2016 13th International Conference on. IEEE, 2016.
5.
Zurück zum Zitat Xiao, Hongshan, Zhi Xiao, and Yu Wang. “Ensemble classification based on supervised clustering for credit scoring.” Applied Soft Computing 43 (2016): 73–86. Xiao, Hongshan, Zhi Xiao, and Yu Wang. “Ensemble classification based on supervised clustering for credit scoring.” Applied Soft Computing 43 (2016): 73–86.
6.
Zurück zum Zitat Abellán, Joaquín, and Javier G. Castellano. “A comparative study on base classifiers in ensemble methods for credit scoring.” Expert Systems with Applications 73 (2017): 1–10. Abellán, Joaquín, and Javier G. Castellano. “A comparative study on base classifiers in ensemble methods for credit scoring.” Expert Systems with Applications 73 (2017): 1–10.
7.
Zurück zum Zitat Dal Pozzolo, Andrea, et al. “Learned lessons in credit card fraud detection from a practitioner perspective.” Expert systems with applications 41.10 (2014): 4915–4928. Dal Pozzolo, Andrea, et al. “Learned lessons in credit card fraud detection from a practitioner perspective.” Expert systems with applications 41.10 (2014): 4915–4928.
8.
Zurück zum Zitat Oreski, Stjepan, and Goran Oreski. “Genetic algorithm-based heuristic for feature selection in credit risk assessment.” Expert systems with applications 41.4 (2014): 2052–2064. Oreski, Stjepan, and Goran Oreski. “Genetic algorithm-based heuristic for feature selection in credit risk assessment.” Expert systems with applications 41.4 (2014): 2052–2064.
9.
Zurück zum Zitat Han, Lu, Liyan Han, and Hongwei Zhao. “Orthogonal support vector machine for credit scoring.” Engineering Applications of Artificial Intelligence 26.2 (2013): 848–862. Han, Lu, Liyan Han, and Hongwei Zhao. “Orthogonal support vector machine for credit scoring.” Engineering Applications of Artificial Intelligence 26.2 (2013): 848–862.
10.
Zurück zum Zitat Kim, Myoung-Jong, and Dae-Ki Kang. “Classifiers selection in ensembles using genetic algorithms for bankruptcy prediction.” Expert Systems with applications 39.10 (2012): 9308–9314. Kim, Myoung-Jong, and Dae-Ki Kang. “Classifiers selection in ensembles using genetic algorithms for bankruptcy prediction.” Expert Systems with applications 39.10 (2012): 9308–9314.
11.
Zurück zum Zitat Xiao, Jin, et al. “Dynamic classifier ensemble model for customer classification with imbalanced class distribution.” Expert Systems with Applications 39.3 (2012): 3668–3675. Xiao, Jin, et al. “Dynamic classifier ensemble model for customer classification with imbalanced class distribution.” Expert Systems with Applications 39.3 (2012): 3668–3675.
12.
Zurück zum Zitat Salunkhe, Uma R., and Suresh N. Mali. “Classifier Ensemble Design for Imbalanced Data Classification: A Hybrid Approach.” Procedia Computer Science 85 (2016): 725–732. Salunkhe, Uma R., and Suresh N. Mali. “Classifier Ensemble Design for Imbalanced Data Classification: A Hybrid Approach.” Procedia Computer Science 85 (2016): 725–732.
13.
Zurück zum Zitat Liu, Xu-Ying, Jianxin Wu, and Zhi-Hua Zhou. “Exploratory under-sampling for class-imbalance learning.” Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on 39.2 (2009): 539–550. Liu, Xu-Ying, Jianxin Wu, and Zhi-Hua Zhou. “Exploratory under-sampling for class-imbalance learning.” Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on 39.2 (2009): 539–550.
14.
Zurück zum Zitat Kamalloo, Ehsan, and Mohammad SanieeAbadeh. “An artificial immune system for extracting fuzzy rules in credit scoring.” Evolutionary Computation (CEC), 2010 IEEE Congress on. IEEE, 2010. Kamalloo, Ehsan, and Mohammad SanieeAbadeh. “An artificial immune system for extracting fuzzy rules in credit scoring.” Evolutionary Computation (CEC), 2010 IEEE Congress on. IEEE, 2010.
Metadaten
Titel
A Hybrid Approach for Preprocessing of Imbalanced Data in Credit Scoring Systems
verfasst von
Uma R. Salunkhe
Suresh N. Mali
Copyright-Jahr
2018
Verlag
Springer Singapore
DOI
https://doi.org/10.1007/978-981-10-7245-1_10

Premium Partner