Skip to content

Exploration of binary classification models for credit card fraud detection from dataset of European cardholder transactions (September 2013). Dataset provided by Machine Learning Group of Université Libre de Bruxelles and Worldline collaboration.

License

Notifications You must be signed in to change notification settings

cristobalmitchell/credit-card-fraud

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

31 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Credit Card Fraud Detection

We will explore various models utilizing a credit card transaction data set from Kaggle and evaluating which has the best performance metrics for classifying an incoming transaction as fraudulent or normal. The data set is provided by The Machine Learning Group (MLG) of Université Libre de Bruxelles (ULB) and contains transactions made by european cardholders in September 2013.

The jupyter notebook walks readers through the end to end process of building machine learning models from exploratory data analysis, preprocessing, and variable relationship analysis to model creation and validation.

Models

  • Logistic Regression
  • Gradient Boosting Machine
  • Boosted Decision Tree
  • Extreme Gradient Boost
  • Random Forest
  • Neural Network

Results

Based on the criteria of high precision, high recall, and fast predictions the Logistic Regression model performed best and was the only model that did not show signs of overfitting.

About

Exploration of binary classification models for credit card fraud detection from dataset of European cardholder transactions (September 2013). Dataset provided by Machine Learning Group of Université Libre de Bruxelles and Worldline collaboration.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published