Prediction of Potential Bank Customers: Application on Data Mining

Başarslan, Muhammet Sinan; Argun, İrem Düzdar

doi:10.1007/978-3-030-36178-5_9

Part of the book series: Lecture Notes on Data Engineering and Communications Technologies (LNDECT, volume 43)

Included in the following conference series:

The International Conference on Artificial Intelligence and Applied Mathematics in Engineering

Abstract

Banking is an important industry in which the financial transactions of everyday life are carried out, and banks are now used for all kinds of financial transactions. As competition increases, banks aim to acquire new customers through customer satisfaction, so studies on acquiring new customers by analyzing customer data have recently gained importance. As a result, data analysis units have been established in banks, as well as in other customer-focused industries such as insurance and telecommunications. In this study, classification models are built to predict potential bank customers on the bank telemarketing dataset from the UCI Machine Learning Repository, and their results are compared, with the aim of supporting more detailed and effective data analysis. The classification algorithms used are the C4.5 Decision Tree, Naive Bayes (NB), k-nearest neighbors (k-NN), Logistic Regression (LogReg), Random Forest (RanFor), and Adaptive Boosting (AdaBoostM1, Ada). To achieve consistent performance estimates, the dataset is divided into training and test sets by two different methods: k-fold cross-validation and holdout. In k-fold cross-validation, 5 and 10 folds are used. In the holdout method, the dataset is divided into training and test sets at ratios of 60–40%, 75–25%, and 80–20%. The resulting models are evaluated by Accuracy (ACC), Precision (PPV), Sensitivity (TPR), and F-measure (F). The performance results are similar under both separation methods. According to the Accuracy and F-measure criteria, the Random Forest model gives the highest results, whereas Naive Bayes gives the highest results according to the precision criterion and AdaBoostM1 yields better results than the other models according to the sensitivity criterion.
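The evaluation protocol described in the abstract can be illustrated with a minimal, standard-library Python sketch. It is not the authors' implementation: the function names (`holdout_split`, `k_fold_splits`, `binary_metrics`), the shuffling seed, and the toy sample size are assumptions made here for illustration only. It shows how holdout and k-fold partitions can be formed and how the four reported measures follow from the counts of true and false positives and negatives.

```python
import random


def holdout_split(n_samples, train_ratio, seed=0):
    """Shuffle indices 0..n_samples-1 and split them into (train, test)."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    cut = int(round(n_samples * train_ratio))
    return indices[:cut], indices[cut:]


def k_fold_splits(n_samples, k, seed=0):
    """Yield (train, test) index lists; every sample is tested exactly once."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    folds = [indices[i::k] for i in range(k)]  # k near-equal folds
    for i in range(k):
        test = folds[i]
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, test


def binary_metrics(y_true, y_pred, positive=1):
    """Return Accuracy, Precision (PPV), Sensitivity (TPR) and F-measure."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = len(pairs) - tp - fp - fn
    acc = (tp + tn) / len(pairs)
    ppv = tp / (tp + fp) if tp + fp else 0.0  # guard against no predicted positives
    tpr = tp / (tp + fn) if tp + fn else 0.0  # guard against no actual positives
    f = 2 * ppv * tpr / (ppv + tpr) if ppv + tpr else 0.0
    return {"ACC": acc, "PPV": ppv, "TPR": tpr, "F": f}


if __name__ == "__main__":
    # Holdout ratios named in the abstract, applied to a toy sample size.
    for ratio in (0.60, 0.75, 0.80):
        train, test = holdout_split(100, ratio)
        print(f"holdout {ratio:.2f}: {len(train)} train / {len(test)} test")
    # Fold counts named in the abstract.
    for k in (5, 10):
        sizes = [len(test) for _, test in k_fold_splits(100, k)]
        print(f"{k}-fold test sizes: {sizes}")
```

In practice each partition would be used to train and score one of the classifiers listed above; the sketch stops at index generation and metric computation so that it stays independent of any particular machine learning library.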

Author information

Authors and Affiliations

Doğuş University, İstanbul, Turkey
Muhammet Sinan Başarslan
Duzce University, Düzce, Turkey
İrem Düzdar Argun

Corresponding authors

Correspondence to Muhammet Sinan Başarslan or İrem Düzdar Argun.

Editor information

Editors and Affiliations

Department of ECE, Karunya University, Coimbatore, Tamil Nadu, India
D. Jude Hemanth
Department of Computer Engineering, Faculty of Engineering, Suleyman Demirel University, Isparta, Isparta, Turkey
Utku Kose

About this paper

Cite this paper

Başarslan, M.S., Argun, İ.D. (2020). Prediction of Potential Bank Customers: Application on Data Mining. In: Hemanth, D., Kose, U. (eds) Artificial Intelligence and Applied Mathematics in Engineering Problems. ICAIAME 2019. Lecture Notes on Data Engineering and Communications Technologies, vol 43. Springer, Cham. https://doi.org/10.1007/978-3-030-36178-5_9

DOI: https://doi.org/10.1007/978-3-030-36178-5_9
Published: 03 January 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-36177-8
Online ISBN: 978-3-030-36178-5
eBook Packages: Intelligent Technologies and Robotics (R0)