loan_prediction's Introduction

Predict Loan Eligibility for Dream Housing Finance company

Problem statement

The Dream Housing Finance company deals in all kinds of home loans and has a presence across urban, semi-urban, and rural areas. A customer first applies for a home loan, and the company then validates the customer's eligibility. The company wants to automate the loan eligibility process in real time, based on the details customers provide when filling in the online application form. These details include Gender, Marital Status, Education, Number of Dependents, Income, Loan Amount, Credit History, and others. To automate this process, the company has provided a dataset for identifying the customer segments that are eligible for a loan, so that it can specifically target these customers.

Dataset

The dataset is downloaded from an Analytics Vidhya hackathon. It consists of the payment, loan approval, loan amount, and credit history details of every loan applicant.

Libraries

The packages used are pandas, to load and analyze the data; NumPy, to work with arrays; scikit-learn, to build and evaluate the models; seaborn and matplotlib, for data visualization; pydotplus, to visualize the decision tree; and xgboost, for the XGBoost model.

Data analysis

The dataset is explored to understand the type, quantity, and distribution of its data. The analysis checks for outliers, missing values, and categorical variables. Data visualization is done using the seaborn and matplotlib packages. Outliers are treated using the capping and flooring technique.
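As an illustration of capping and flooring, the sketch below clips a numeric column to bounds derived from the interquartile range. The README does not state which bounds the project uses, so the 1.5 x IQR rule, the helper name cap_and_floor, and the sample income values are all assumptions made for this example.

```python
import pandas as pd


def cap_and_floor(series: pd.Series, k: float = 1.5) -> pd.Series:
    """Clip values outside [Q1 - k*IQR, Q3 + k*IQR] to those bounds.

    Hypothetical helper: the 1.5 * IQR rule is an assumed choice,
    not necessarily the bounds used in the project.
    """
    q1 = series.quantile(0.25)
    q3 = series.quantile(0.75)
    iqr = q3 - q1
    lower = q1 - k * iqr  # floor: values below this are raised to it
    upper = q3 + k * iqr  # cap: values above this are lowered to it
    return series.clip(lower=lower, upper=upper)


# Made-up income values with one extreme outlier (81000).
incomes = pd.Series([2500, 3000, 3200, 3500, 4000, 4200, 81000])
capped = cap_and_floor(incomes)
print(capped.tolist())
```

Capping keeps every row in the dataset, which matters when the dataset is small; only the extreme values are pulled back to the chosen bounds.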

Creating training and testing data

Using scikit-learn's train_test_split function, the dataset is divided into 80% training data and 20% test data.
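A minimal sketch of the 80/20 split on placeholder data follows. The arrays X and y stand in for the prepared features and loan status labels; the random_state and stratify arguments are assumptions added for reproducibility and class balance, not settings confirmed by the project.

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Placeholder data: 100 rows, 2 features, balanced binary labels.
X = np.arange(200).reshape(100, 2)
y = np.array([0, 1] * 50)

# test_size=0.2 gives the 80% train / 20% test division.
# random_state and stratify are assumed, illustrative settings.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42, stratify=y
)

print(X_train.shape, X_test.shape)
```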

Creating model

Ten classifiers are built: nine with scikit-learn (logistic regression, LDA, decision tree, random forest, bagging, AdaBoost, gradient boosting, KNN, and SVM) and an XGBoost classifier from the xgboost package.
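One way to build and compare the scikit-learn classifiers is to keep them in a dictionary and fit each in a loop, as sketched below. The synthetic data from make_classification, the hyperparameters, and the variable names are assumptions for illustration; XGBoost is left out so the sketch depends only on scikit-learn.

```python
from sklearn.datasets import make_classification
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.ensemble import (
    AdaBoostClassifier,
    BaggingClassifier,
    GradientBoostingClassifier,
    RandomForestClassifier,
)
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in for the prepared loan data (assumption).
X, y = make_classification(n_samples=300, n_features=8, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0
)

# Hyperparameters here are illustrative defaults, not the project's settings.
models = {
    "Logistic regression": LogisticRegression(max_iter=1000),
    "LDA": LinearDiscriminantAnalysis(),
    "Decision tree": DecisionTreeClassifier(random_state=0),
    "Random forest": RandomForestClassifier(n_estimators=50, random_state=0),
    "Bagging": BaggingClassifier(n_estimators=20, random_state=0),
    "AdaBoost": AdaBoostClassifier(n_estimators=50, random_state=0),
    "Gradient boosting": GradientBoostingClassifier(random_state=0),
    "KNN": KNeighborsClassifier(n_neighbors=5),
    "SVM": SVC(),
}

# Fit each model and record its accuracy on the held-out test set.
scores = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    scores[name] = model.score(X_test, y_test)

for name, score in scores.items():
    print(f"{name}: {score:.3f}")
```

Keeping the models in one dictionary means the same fitting and scoring code runs for every classifier, which makes side-by-side comparison straightforward.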

Model evaluation

The models are evaluated using the metrics provided by the scikit-learn package: the accuracy score, recall, roc_curve, and the confusion matrix.
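The sketch below shows how these metrics can be computed with scikit-learn on a small hand-made example. The labels, the predicted probabilities, and the 0.5 decision threshold are invented for illustration; in the project they would come from a fitted model and the test set.

```python
import numpy as np
from sklearn.metrics import (
    accuracy_score,
    auc,
    confusion_matrix,
    recall_score,
    roc_curve,
)

# Made-up true labels and predicted probabilities for the positive class.
y_true = np.array([0, 0, 1, 1, 1, 0, 1, 0])
y_score = np.array([0.1, 0.4, 0.35, 0.8, 0.9, 0.2, 0.7, 0.6])

# Convert probabilities to class labels with an assumed 0.5 threshold.
y_pred = (y_score >= 0.5).astype(int)

acc = accuracy_score(y_true, y_pred)   # share of correct predictions
rec = recall_score(y_true, y_pred)     # share of actual positives found
cm = confusion_matrix(y_true, y_pred)  # rows: true class, columns: predicted

# roc_curve works on scores rather than hard labels.
fpr, tpr, thresholds = roc_curve(y_true, y_score)
roc_auc = auc(fpr, tpr)                # area under the ROC curve

print("accuracy:", acc)
print("recall:", rec)
print("confusion matrix:\n", cm)
print("AUC:", roc_auc)
```

Recall is worth tracking alongside accuracy because it measures the share of actual positive cases the model finds, which accuracy alone can hide when the classes are imbalanced.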

ROC curve

[Figures: ROC curve plots for the models (8 images)]

Decision Tree

[Figure: decision tree visualization]

loan_prediction's People

Contributors

thushar-marvel
