sofiamnc / credit_score_data_modeling Goto Github PK

View Code? Open in Web Editor NEW

Data analysis, preprocessing and custom ML model implementation for a credit score app.

Jupyter Notebook 99.28% Python 0.72%

credit_score_data_modeling's Introduction

Credit Scoring Data Modeling

Sofia Chevrolat (December 2020)

Overview

The financial firm "Prêt à dépenser" is a consumer credit company for people with few or no credit history.

In order to offer more transparency regarding its credit granting decisions, the company wants to develop an interactive dashboard based on a machine learning model scoring the default probability of a given client. This model should be based on a variety of data (behavioral, from other financial institutions...).

This repository contains several ordered notebooks presenting the steps taken to achieve the modeling of the input data :

Exploratory Data Analysis
Data Assembly
Balancing Method & Algorithm Selection
Feature selection
Hyperparameter Tuning & Final Model Explainability
Data Assembly for the Dashboard

The resulting model outputs the credit scoring for a given client on a scale from 0 to 100, 0 being the best value (0 risk of default), 100 being the worst value (no chance the client will pay back its credit).

Requirements

See requirements.txt

Usage

Download the dataset from Kaggle, and place the files under Notebooks/Resources/datasets/origin/.
Run the following in your terminal to install all required libraries :

pip3 install -r requirements.txt

Run each notebook one after the other, following the order indicated by the digits in each notebook's name.

For a complete overview of the modeling approach, please see the methodology note.

Credit

A big thank you to Will Koehrsen, whose notebooks were a huge help and inspiration for tackling this problem.

Recommend Projects

sofiamnc / credit_score_data_modeling Goto Github PK

credit_score_data_modeling's Introduction

Credit Scoring Data Modeling

Overview

Requirements

Usage

Credit

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent