Skip to main navigation Skip to search Skip to main content

Statistical Data-Generative Machine Learning-Based Credit Card Fraud Detection Systems

  • Macao Polytechnic University

Research output: Contribution to journalArticlepeer-review

3 Citations (Scopus)

Abstract

This study addresses the challenges of data imbalance and missing values in credit card transaction datasets by employing mode-based imputation and various machine learning models. We analyzed two distinct datasets: one consisting of European cardholders and the other from American Express, applying multiple machine learning algorithms, including Artificial Neural Networks, Convolutional Neural Networks, and Gradient Boosted Decision Trees, as well as others. Notably, the Gradient Boosted Decision Tree demonstrated superior predictive performance, with accuracy increasing by 4.53%, reaching 96.92% on the European cardholders dataset. Mode imputation significantly improved data quality, enabling stable and reliable analysis of merged datasets with up to 50% missing values. Hypothesis testing confirmed that the performance of the merged dataset was statistically significant compared to the original datasets. This study highlights the importance of robust data handling techniques in developing effective fraud detection systems, setting the stage for future research on combining different datasets and improving predictive accuracy in the financial sector.

Original languageEnglish
Article number2446
JournalMathematics
Volume13
Issue number15
DOIs
Publication statusPublished - Aug 2025

Keywords

  • credit card fraud
  • credit prediction
  • machine learning
  • predictive modeling
  • statistical data generation

Fingerprint

Dive into the research topics of 'Statistical Data-Generative Machine Learning-Based Credit Card Fraud Detection Systems'. Together they form a unique fingerprint.

Cite this