Skip to main content

2018 | OriginalPaper | Buchkapitel

An Efficient Method for Detecting Fraudulent Transactions Using Classification Algorithms on an Anonymized Credit Card Data Set

verfasst von : Sylvester Manlangit, Sami Azam, Bharanidharan Shanmugam, Krishnan Kannoorpatti, Mirjam Jonkman, Arasu Balasubramaniam

Erschienen in: Intelligent Systems Design and Applications

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Credit card fraudulent transactions are causing businesses and banks to lose time and money. Detecting fraudulent transactions before a transaction is finalized will help businesses and banks to save resources. This research aims to compare the fraud detection accuracy of different sampling techniques and classification algorithms. An efficient method of detecting fraud using machine learning is proposed. Anonymized data set from Kaggle was used for detecting fraudulent transactions. Each transaction has been labeled as either a fraudulent transaction or not. The severe imbalance between fraud and non-fraudulent data caused the algorithms to under-perform. This was addressed with the application of sampling techniques. The combination of undersampling and SMOTE raised the recall accuracy of the classification algorithm. k-NN algorithm showed the highest recall accuracy compared to the other algorithms.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Jha, S., Westland, J.C.: A descriptive study of credit card fraud pattern. Glob. Bus. Rev. 14, 373–384 (2013)CrossRef Jha, S., Westland, J.C.: A descriptive study of credit card fraud pattern. Glob. Bus. Rev. 14, 373–384 (2013)CrossRef
2.
Zurück zum Zitat Liñares-Zegarra, J., Wilson, J.O.S.: Credit card interest rates and risk: new evidence from US survey data. Eur. J. Financ. 20, 892–914 (2014)CrossRef Liñares-Zegarra, J., Wilson, J.O.S.: Credit card interest rates and risk: new evidence from US survey data. Eur. J. Financ. 20, 892–914 (2014)CrossRef
3.
Zurück zum Zitat Lepoivre, M.R., Avanzini, C.O., Bignon, G., Legendre, L., Piwele, A.K.: Credit card fraud detection with unsupervised algorithms (Report). J. Adv. Inf. Technol. 7, 34 (2016)CrossRef Lepoivre, M.R., Avanzini, C.O., Bignon, G., Legendre, L., Piwele, A.K.: Credit card fraud detection with unsupervised algorithms (Report). J. Adv. Inf. Technol. 7, 34 (2016)CrossRef
4.
Zurück zum Zitat Bhattacharyya, S., Jha, S., Tharakunnel, K., Westland, J.C.: Data mining for credit card fraud: a comparative study. Dec. Support Syst. 50, 602–613 (2011)CrossRef Bhattacharyya, S., Jha, S., Tharakunnel, K., Westland, J.C.: Data mining for credit card fraud: a comparative study. Dec. Support Syst. 50, 602–613 (2011)CrossRef
5.
Zurück zum Zitat Prakash, C.: A parameter optimized approach for improving credit card fraud detection. Int. J. Comput. Sci. Issues 10, 360–366 (2013) Prakash, C.: A parameter optimized approach for improving credit card fraud detection. Int. J. Comput. Sci. Issues 10, 360–366 (2013)
6.
Zurück zum Zitat Venkata Ratnam, G., Siva Naga Prasad, M.: Credit card fraud detection using anti-k nearest neighbor algorithm. Int. J. Comput. Sci. Eng. 4, 1035–1039 (2012) Venkata Ratnam, G., Siva Naga Prasad, M.: Credit card fraud detection using anti-k nearest neighbor algorithm. Int. J. Comput. Sci. Eng. 4, 1035–1039 (2012)
7.
Zurück zum Zitat Correa Bahnsen, A., Aouada, D., Stojanovic, A., Ottersten, B.: Feature engineering strategies for credit card fraud detection. Exp. Syst. Appl. 51, 134–142 (2016)CrossRef Correa Bahnsen, A., Aouada, D., Stojanovic, A., Ottersten, B.: Feature engineering strategies for credit card fraud detection. Exp. Syst. Appl. 51, 134–142 (2016)CrossRef
8.
Zurück zum Zitat Dal Pozzolo, A., Caelen, O., Le Borgne, Y.-A., Waterschoot, S., Bontempi, G.: Learned lessons in credit card fraud detection from a practitioner perspective. Exp. Syst. Appl. 41, 4915–4928 (2014)CrossRef Dal Pozzolo, A., Caelen, O., Le Borgne, Y.-A., Waterschoot, S., Bontempi, G.: Learned lessons in credit card fraud detection from a practitioner perspective. Exp. Syst. Appl. 41, 4915–4928 (2014)CrossRef
9.
Zurück zum Zitat Lee, Y.J., Yeh, Y.R., Wang, Y.C.F.: Anomaly detection via online oversampling principal component analysis. IEEE Trans. Knowl. Data Eng. 25, 1460–1470 (2013)CrossRef Lee, Y.J., Yeh, Y.R., Wang, Y.C.F.: Anomaly detection via online oversampling principal component analysis. IEEE Trans. Knowl. Data Eng. 25, 1460–1470 (2013)CrossRef
11.
Zurück zum Zitat Dal Pozzolo, A., Caelen, O., Johnson, R.A., Bontempi, G.: Calibrating probability with undersampling for unbalanced classification. In: 2015 IEEE Symposium Series on Computational Intelligence, pp. 159–166. IEEE (2015) Dal Pozzolo, A., Caelen, O., Johnson, R.A., Bontempi, G.: Calibrating probability with undersampling for unbalanced classification. In: 2015 IEEE Symposium Series on Computational Intelligence, pp. 159–166. IEEE (2015)
12.
Zurück zum Zitat Liu, X.-Y., Wu, J., Zhou, Z.-H.: Exploratory undersampling for class-imbalance learning. IEEE Trans. Syst. Man Cybern. Part B (Cybernetics) 39, 539–550 (2009)CrossRef Liu, X.-Y., Wu, J., Zhou, Z.-H.: Exploratory undersampling for class-imbalance learning. IEEE Trans. Syst. Man Cybern. Part B (Cybernetics) 39, 539–550 (2009)CrossRef
13.
Zurück zum Zitat Akbani, R., Kwek, S., Japkowicz, N.: Applying support vector machines to imbalanced datasets. In: European Conference on Machine Learning, pp. 39–50. Springer, Heidelberg (2004) Akbani, R., Kwek, S., Japkowicz, N.: Applying support vector machines to imbalanced datasets. In: European Conference on Machine Learning, pp. 39–50. Springer, Heidelberg (2004)
14.
Zurück zum Zitat Chawla, N.V., Bowyer, K.W., Hall, L.O., Kegelmeyer, W.P.: SMOTE: synthetic minority over-sampling technique. J. Artif. Intell. Res. 16, 321–357 (2002)MATH Chawla, N.V., Bowyer, K.W., Hall, L.O., Kegelmeyer, W.P.: SMOTE: synthetic minority over-sampling technique. J. Artif. Intell. Res. 16, 321–357 (2002)MATH
15.
Zurück zum Zitat Khyati, C., Bhawna, M.: Exploration of Data mining techniques in fraud detection: credit card. Int. J. Electron. Comput. Sci. Eng. 1, 1765–1771 (2012) Khyati, C., Bhawna, M.: Exploration of Data mining techniques in fraud detection: credit card. Int. J. Electron. Comput. Sci. Eng. 1, 1765–1771 (2012)
16.
Zurück zum Zitat Chawla, N.V.: Data mining for imbalanced datasets: an overview. In: Data Mining and Knowledge Discovery Handbook, pp. 853–867. Springer, Boston (2005) Chawla, N.V.: Data mining for imbalanced datasets: an overview. In: Data Mining and Knowledge Discovery Handbook, pp. 853–867. Springer, Boston (2005)
17.
Zurück zum Zitat Nadarajan, S., Ramanujam, B.: Encountering imbalance in credit card fraud detection with metaheuristics. Adv. Nat. Appl. Sci. 10, 33–41 (2016) Nadarajan, S., Ramanujam, B.: Encountering imbalance in credit card fraud detection with metaheuristics. Adv. Nat. Appl. Sci. 10, 33–41 (2016)
18.
Zurück zum Zitat Liaw, A., Wiener, M.: Classification and regression by randomForest. R News 2, 18–22 (2002) Liaw, A., Wiener, M.: Classification and regression by randomForest. R News 2, 18–22 (2002)
19.
Zurück zum Zitat Le Cessie, S., Van Houwelingen, J.C.: Ridge estimators in logistic regression. Appl. Stat. 41, 191–201 (1992)CrossRefMATH Le Cessie, S., Van Houwelingen, J.C.: Ridge estimators in logistic regression. Appl. Stat. 41, 191–201 (1992)CrossRefMATH
23.
Zurück zum Zitat Hand, D.J., Mannila, H., Smyth, P.: Principles of Data Mining. MIT Press, Cambridge (2001) Hand, D.J., Mannila, H., Smyth, P.: Principles of Data Mining. MIT Press, Cambridge (2001)
Metadaten
Titel
An Efficient Method for Detecting Fraudulent Transactions Using Classification Algorithms on an Anonymized Credit Card Data Set
verfasst von
Sylvester Manlangit
Sami Azam
Bharanidharan Shanmugam
Krishnan Kannoorpatti
Mirjam Jonkman
Arasu Balasubramaniam
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-319-76348-4_41