Optimizing Machine Learning Classifiers for Credit Card Fraud Detection on Highly Imbalanced Datasets Using PCA and SMOTE Techniques

Authors

DOI:

https://doi.org/10.21015/vtcs.v12i2.1921

Abstract

Card fraud detection refers to the process of identifying unauthorized or suspicious transactions made using credit or debit cards. It employs machine learning models, rule-based systems, and anomaly detection techniques to detect patterns indicating potential fraud. There is a growing need for systems that can accurately predict and prevent fraudulent transactions. Reducing financial loss by Implementing advanced detection models to safeguard it from fraud or malicious transactions. Therefore, we proposed machine learning models that will predict credit card fraud at an early stage. Also, the study used feature scaling, Principal Component Analysis (PCA), and the Synthetic Minority Over-sampling Technique (SMOTE) to deal with the class imbalance on the dataset. Moreover, SMOTE is applied to balance the classes by synthesizing examples of the minority class, making classifiers more robust. The results show that LR, SVM, KNN, and XGBoost models correctly predict 97\% of fraudulent and non-fraudulent cases. The Decision Tree and the Random Forest models are capable of achieving at least 96\%, respectively. This research combines advanced machine learning methodologies with real-time processing to give insights into predictive analytics in financial fraud detection, which may enhance accuracy and efficiency in financial security systems.

References

A. Cherif, A. Badhib, H. Ammar, S. Alshehri, M. Kalkatawi, and A. Imine, "Credit card fraud detection in the era of disruptive technologies: A systematic review," Journal of King Saud University-Computer and Information Sciences, vol. 35, no. 1, pp. 145–174, 2023.

A. A. Khan, A. A. Laghari, H. Elmannai, A. A. Shaikh, S. Bourouis, M. Hadjouni, and R. Alroobaea, "Gan-iotvs: A novel internet of multimedia things-enabled video streaming compression model using gan and fuzzy logic," IEEE Sensors Journal, 2023.

A. A. Khan, A. A. Laghari, A. M. Baqasah, R. Alroobaea, A. Almadhor, G. A. Sampedro, and N. Kryvinska, "Blockchain-enabled infrastructural security solution for serverless consortium fog and edge computing," PeerJ Computer Science, vol. 10, 2024.

R. Bin Sulaiman, V. Schetinin, and P. Sant, "Review of machine learning approach on credit card fraud detection," Human-Centric Intelligent Systems, vol. 2, no. 1, pp. 55–68, 2022.

I. Khan, A. Jameel, I. Ullah, I. Khan, and H. Ullah, "The agi-cybersecurity nexus: Exploring implications and applications," in Artificial General Intelligence (AGI) Security: Smart Applications and Sustainable Technologies. Springer, 2024, pp. 271–289.

F. K. Alarfaj, I. Malik, H. U. Khan, N. Almusallam, M. Ramzan, and M. Ahmed, "Credit card fraud detection using state-of-the-art machine learning and deep learning algorithms," IEEE Access, vol. 10, pp. 39 700–39 715, 2022.

P. Tiwari, S. Mehta, N. Sakhuja, J. Kumar, and A. K. Singh, "Credit card fraud detection using machine learning: A study," arXiv preprint arXiv:2108.10005, 2021.

I. Ullah, F. Ali, S. Nazir, H. U. Khan, M. S. Anwar, and C. Choi, "Educating banking employees to ensure security in the cyberworld," in Cybersecurity Management in Education Technologies. CRC Press, 2023, pp. 49–63.

E. Esenogho, I. D. Mienye, T. G. Swart, K. Aruleba, and G. Obaido, "A neural network ensemble with feature engineering for improved credit card fraud detection," IEEE Access, vol. 10, pp. 16 400–16 407, 2022.

A. A. Khan, Y.-L. Chen, F. Hajjej, A. A. Shaikh, J. Yang, C. S. Ku, and L. Y. Por, "Digital forensics for the socio-cyber world (dfscw): A novel framework for deepfake multimedia investigation on social media platforms," Egyptian Informatics Journal, vol. 27, p. 100502, 2024.

J. I.-Z. Chen and K.-L. Lai, "Deep convolution neural network model for credit-card fraud detection and alert," Journal of Artificial Intelligence, vol. 3, no. 02, pp. 101–112, 2021.

A. A. Khan, A. A. Laghari, Z. A. Shaikh, Z. Dacko-Pikiewicz, and S. Kot, "Internet of things (iot) security with blockchain technology: A state-of-the-art review," IEEE Access, vol. 10, pp. 122 679–122 695, 2022.

R. Asha and S. K. KR, "Credit card fraud detection using artificial neural network," Global Transitions Proceedings, vol. 2, no. 1, pp. 35–41, 2021.

R. Sailusha, V. Gnaneswar, R. Ramesh, and G. R. Rao, "Credit card fraud detection using machine learning," in 2020 4th International Conference on Intelligent Computing and Control Systems (ICICCS). IEEE, 2020, pp. 1264–1270.

A. A. Khan, A. A. Laghari, A. M. Baqasah, R. Alroobaea, T. R. Gadekallu, G. A. Sampedro, and Y. Zhu, "Oran-b5g: A next-generation open radio access network architecture with machine learning for beyond 5g in industrial 5.0," IEEE Transactions on Green Communications and Networking, 2024.

N. S. Alfaiz and S. M. Fati, "Enhanced credit card fraud detection model using machine learning," Electronics, vol. 11, no. 4, p. 662, 2022.

J. F. Roseline, G. Naidu, V. S. Pandi, S. A. alias Rajasree, and N. Mageswari, "Autonomous credit card fraud detection using machine learning approach," Computers and Electrical Engineering, vol. 102, p. 108132, 2022.

J. Forough and S. Momtazi, "Ensemble of deep sequential models for credit card fraud detection," Applied Soft Computing, vol. 99, p. 106883, 2021.

A. A. Taha and S. J. Malebary, "An intelligent approach to credit card fraud detection using an optimized light gradient boosting machine," IEEE Access, vol. 8, pp. 25 579–25 587, 2020.

H. Fanai and H. Abbasimehr, "A novel combined approach based on deep autoencoder and deep classifiers for credit card fraud detection," Expert Systems with Applications, vol. 217, p. 119562, 2023.

Z. Salekshahrezaee, J. L. Leevy, and T. M. Khoshgoftaar, "The effect of feature extraction and data sampling on credit card fraud detection," Journal of Big Data, vol. 10, no. 1, p. 6, 2023.

E. Ileberi, Y. Sun, and Z. Wang, "Performance evaluation of machine learning methods for credit card fraud detection using smote and adaboost," IEEE Access, vol. 9, pp. 165 286–165 294, 2021.

N. Rtayli and N. Enneya, "Enhanced credit card fraud detection based on svm-recursive feature elimination and hyperparameters optimization," Journal of Information Security and Applications, vol. 55, p. 102596, 2020.

T. T. Nguyen, H. Tahir, M. Abdelrazek, and A. Babar, "Deep learning methods for credit card fraud detection," arXiv preprint arXiv:2012.03754, 2020.

F. Wahab, U. Hayat, M. Khan, I. Ullah, and M. Yasir, "Importance of machine learning and network security for communication systems," in Artificial General Intelligence (AGI) Security: Smart Applications and Sustainable Technologies. Springer, 2024, pp. 195–210.

A. A. Khan, A. A. Laghari, R. Alroobaea, A. M. Baqasah, M. Alsafyani, R. Bacarra, and J. A. J. Alsayaydeh, "Secure remote sensing data with blockchain distributed ledger technology: A solution for smart cities," IEEE Access, 2024.

Y. Lucas and J. Jurgovsky, "Credit card fraud detection using machine learning: A survey," arXiv preprint arXiv:2010.06479, 2020.

F. Mehmood, A. A. Khan, H. Wang, S. Karim, U. Khalid, and F. Zhao, "Blpca-ledger: A lightweight plenum consensus protocols for consortium blockchain based on the hyperledger indy," Computer Standards & Interfaces, vol. 91, p. 103876, 2025.

I. D. Mienye and Y. Sun, "A deep learning ensemble with data resampling for credit card fraud detection," IEEE Access, vol. 11, pp. 30 628–30 638, 2023.

S. Khatri, A. Arora, and A. P. Agrawal, "Supervised machine learning algorithms for credit card fraud detection: a comparison," in 2020 10th International Conference on Cloud Computing, Data Science & Engineering (Confluence). IEEE, 2020, pp. 680–683.

H. Tingfei, C. Luo, and J. Zhang, "Real-time credit card fraud detection using machine learning," Future Generation Computer Systems, vol. 119, pp. 274–284, 2021.

A. A. Khan, S. Dhabi, J. Yang, W. Alhakami, S. Bourouis, and L. Yee, "B-lpoet: A middleware lightweight proof-of-elapsed time (PoET) for efficient distributed transaction execution and security on blockchain using multithreading technology," Computers and Electrical Engineering, vol. 118, p. 109343, 2024.

P. Chatterjee, D. Das, and D. B. Rawat, "Digital twin for credit card fraud detection: Opportunities, challenges, and fraud detection advancements," Future Generation Computer Systems, 2024.

I. Ullah, A. Noor, S. Nazir, F. Ali, Y. Y. Ghadi, and N. Aslam, "Protecting IoT devices from security attacks using effective decision-making strategy of appropriate features," The Journal of Supercomputing, vol. 80, no. 5, pp. 5870–5899, 2024.

I. Benchaji, S. Douzi, B. El Ouahidi, and J. Jaafari, "Enhanced credit card fraud detection based on attention mechanism and LSTM deep model," Journal of Big Data, vol. 8, pp. 1–21, 2021.

T.-H. Lin and J.-R. Jiang, "Credit card fraud detection with autoencoder and probabilistic random forest," Mathematics, vol. 9, no. 21, p. 2683, 2021.

S. Bagga, A. Goyal, N. Gupta, and A. Goyal, "Credit card fraud detection using pipelining and ensemble learning," Procedia Computer Science, vol. 173, pp. 104–112, 2020.

Downloads

Published

2024-10-20

How to Cite

Zeeshan Ali Haider, Khan, F. M., Zafar, A., Nabila, & Khan, I. U. (2024). Optimizing Machine Learning Classifiers for Credit Card Fraud Detection on Highly Imbalanced Datasets Using PCA and SMOTE Techniques. VAWKUM Transactions on Computer Sciences, 12(2), 28–49. https://doi.org/10.21015/vtcs.v12i2.1921