Skip to content

The Fraud Detection project aims to improve identification of fraudulent activities in e-commerce and banking by developing advanced machine learning models that analyze transaction data, employ feature engineering, and implement real-time monitoring for high accuracy fraud detection.

License

Notifications You must be signed in to change notification settings

Daniel-Andarge/AiML-financial-fraud-detection-model

Repository files navigation

Machine Learning-based Fraud Detection for E-commerce and Banking Transactions

This project aims to significantly enhance the identification of fraudulent activities within E-commerce and banking sectors. It focuses on developing advanced machine learning models that analyze transaction data, employ sophisticated feature engineering techniques, and implement real-time monitoring systems to achieve high accuracy in fraud detection.

Table of Contents

  1. Project Overview
  2. Data Collection and Preprocessing
  3. Exploratory Data Analysis (EDA)
  4. Feature Engineering
  5. Model Building and Training
  6. Model Explainability Using SHAP
  7. Model Deployment and API Development
  8. Project Report
  9. Contributing
  10. License

Project Overview

This project aims to significantly improve the identification of fraudulent activities within these sectors. It focuses on developing advanced machine learning models that analyze transaction data, employ sophisticated feature engineering techniques, and implement real-time monitoring systems to achieve high accuracy in fraud detection.

Data Collection and Preprocessing

Gather and preprocess transaction data to ensure it is clean and usable for analysis. This includes data cleaning, handling missing values, and normalization.

Exploratory Data Analysis (EDA)

Analyze customer transaction characteristics to identify patterns and trends influencing fraud detection.

Univariate Analysis

Univariate Analysis

Bivariate Analysis

For detailed insights and visualizations related to bivariate analysis, please refer to the EDA Notebook.

Feature Engineering

Create new features that enhance the predictive power of the models based on insights from EDA.

Feature Engineering

Model Building and Training

After training and testing multiple models, we selected the following:

Fraud-IP Dataset - XGBoost Model

XGBoost Model XGBoost Model Evaluation

Credit Card Dataset - Logistic Regression with StandardScaler

Logistic Regression Model Logistic Regression Model Evaluation

Model Explainability Using SHAP

Summary Plot

Summary Plot

Force Plot

Force Plot

Model Deployment and API Development

Running the Flask App

Running Flask App

Testing the API

Testing the API

Building Docker Image

Building Docker Image

Running Docker Container

Running Docker Container

Testing the API from Postman

Generated new instances and sent requests to the fraud detection model API.

Postman Testing

Project Report

For a comprehensive overview of the project, please refer to the project report: Project Report PDF.

Contributing

Contributions are welcome! Please fork the repository and submit a pull request.

License

This project is licensed under the MIT License. See the LICENSE file for details.

About

The Fraud Detection project aims to improve identification of fraudulent activities in e-commerce and banking by developing advanced machine learning models that analyze transaction data, employ feature engineering, and implement real-time monitoring for high accuracy fraud detection.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published