Machine Learning-based Fraud Detection for E-commerce and Banking Transactions

This project aims to significantly enhance the identification of fraudulent activities within E-commerce and banking sectors. It focuses on developing advanced machine learning models that analyze transaction data, employ sophisticated feature engineering techniques, and implement real-time monitoring systems to achieve high accuracy in fraud detection.

Project Overview
Data Collection and Preprocessing
Exploratory Data Analysis (EDA)
- Univariate Analysis
- Bivariate Analysis
Feature Engineering
Model Building and Training
- Fraud-IP Dataset - XGBoost Model
- Credit Card Dataset - Logistic Regression with StandardScaler
Model Explainability Using SHAP
- Summary Plot
- Force Plot
Model Deployment and API Development
Project Report
Contributing
License

Project Overview

This project aims to significantly improve the identification of fraudulent activities within these sectors. It focuses on developing advanced machine learning models that analyze transaction data, employ sophisticated feature engineering techniques, and implement real-time monitoring systems to achieve high accuracy in fraud detection.

Data Collection and Preprocessing

Gather and preprocess transaction data to ensure it is clean and usable for analysis. This includes data cleaning, handling missing values, and normalization.

Exploratory Data Analysis (EDA)

Analyze customer transaction characteristics to identify patterns and trends influencing fraud detection.

Univariate Analysis

Bivariate Analysis

For detailed insights and visualizations related to bivariate analysis, please refer to the EDA Notebook.

Feature Engineering

Create new features that enhance the predictive power of the models based on insights from EDA.

Model Building and Training

After training and testing multiple models, we selected the following:

Fraud-IP Dataset - XGBoost Model

Credit Card Dataset - Logistic Regression with StandardScaler

Model Explainability Using SHAP

Summary Plot

Force Plot

Model Deployment and API Development

Running the Flask App

Testing the API

Building Docker Image

Running Docker Container

Testing the API from Postman

Generated new instances and sent requests to the fraud detection model API.

Project Report

For a comprehensive overview of the project, please refer to the project report: Project Report PDF.

Contributing

Contributions are welcome! Please fork the repository and submit a pull request.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 65 Commits
.github/workflows		.github/workflows
assets		assets
flask-app		flask-app
notebooks		notebooks
scripts		scripts
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
setup.cfg		setup.cfg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Machine Learning-based Fraud Detection for E-commerce and Banking Transactions

Table of Contents

Project Overview

Data Collection and Preprocessing

Exploratory Data Analysis (EDA)

Univariate Analysis

Bivariate Analysis

Feature Engineering

Model Building and Training

Fraud-IP Dataset - XGBoost Model

Credit Card Dataset - Logistic Regression with StandardScaler

Model Explainability Using SHAP

Summary Plot

Force Plot

Model Deployment and API Development

Running the Flask App

Testing the API

Building Docker Image

Running Docker Container

Testing the API from Postman

Project Report

Contributing

License

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

Daniel-Andarge/AiML-financial-fraud-detection-model

Folders and files

Latest commit

History

Repository files navigation

Machine Learning-based Fraud Detection for E-commerce and Banking Transactions

Table of Contents

Project Overview

Data Collection and Preprocessing

Exploratory Data Analysis (EDA)

Univariate Analysis

Bivariate Analysis

Feature Engineering

Model Building and Training

Fraud-IP Dataset - XGBoost Model

Credit Card Dataset - Logistic Regression with StandardScaler

Model Explainability Using SHAP

Summary Plot

Force Plot

Model Deployment and API Development

Running the Flask App

Testing the API

Building Docker Image

Running Docker Container

Testing the API from Postman

Project Report

Contributing

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages