ABSTRACT
Machine learning and data mining techniques have been used extensively in order to detect credit card frauds. However, most studies consider credit card transactions as isolated events and not as a sequence of transactions.
In this article, we model a sequence of credit card transactions from three different perspectives, namely (i) does the sequence contain a Fraud? (ii) Is the sequence obtained by fixing the card-holder or the payment terminal? (iii) Is it a sequence of spent amount or of elapsed time between the current and previous transactions? Combinations of the three binary perspectives give eight sets of sequences from the (training) set of transactions. Each one of these sets is modelled with a Hidden Markov Model (HMM). Each HMM associates a likelihood to a transaction given its sequence of previous transactions. These likelihoods are used as additional features in a Random Forest classifier for fraud detection. This multiple perspectives HMM-based approach enables an automatic feature engineering in order to model the sequential properties of the dataset with respect to the classification task. This strategy allows for a 15% increase in the precision-recall AUC compared to the state of the art feature engineering strategy for credit card fraud detection.
- Bahnsen A. C., Aouada D., Stojanovic A., and Ottersten B. (2016) Feature engineering strategies for credit card fraud detection. Expert Systems With Applications.Google Scholar
- Bolton R. and Hand D. J. (2001). Unsupervised profiling methods for fraud detection. Credit scoring and credit control VII.Google Scholar
- Davis J. and Goadrich M. (2006). The relationship between precision-recall and roc curves. ICML 06 Proceedings of the 23rd international conference on Machine learning. Google ScholarDigital Library
- Whitrow C., Hand D. J., Juszczak P., Weston D. J., and Adams N. M. (2008). Transaction aggregation strategy for credit card fraud detection. Data Mining and Knowledge Discovery 18(1). Google ScholarDigital Library
Recommendations
A Behavior-cluster Based Imbalanced Classification Method for Credit Card Fraud Detection
DSIT 2019: Proceedings of the 2019 2nd International Conference on Data Science and Information TechnologyCredit card fraud detection has been paid more and more attention by researchers. The credit card transactions are represented by highly imbalanced data sets. The number of genuine transactions is far more than fraudulent transactions, which will ...
Real Time Data-Driven Approaches for Credit Card Fraud Detection
ICEBA 2018: Proceedings of the 2018 International Conference on E-Business and ApplicationsCredit card fraud causes many financial losses for customer and also for the organization. For this reason, in the past few years, many studies have been performed using machine learning techniques to detect and block fraudulent transactions. This paper ...
Credit Card Fraud Prediction Using XGBoost: An Ensemble Learning Approach
With the development of technology, the internet and eCommerce online payment has become an essential mode of payment. Nowadays, credit card payment is a convenient mode of payment online as well as offline transactions. As online credit card payment ...
Comments