
rfm-analysis's Introduction

Customer Segmentation based on RFM Analysis

Adventure Works Cycles is a multinational manufacturing company. The company manufactures and sells metal and composite bicycles to North American, European and Asian commercial markets.

Data Understanding

AdventureWorksDW2019.bak, a backup of Microsoft's AdventureWorks sample data warehouse, can be downloaded from the internet.

Built a 5NF star schema (Fact, Customer, Product, Date, Location) from the imported database.


Data Preparation

Wrote SQL queries against these tables to retrieve the orders placed by each customer in 2013.

Calculated the R (recency), F (frequency) and M (monetary) values for each customer from the resulting data, relative to a reference date.
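As a minimal sketch of this step, the snippet below derives R, F and M from a list of order rows in plain Python. The order rows, the customer IDs, the choice of 1 January 2014 as the reference date, and the function name `compute_rfm` are all assumptions made for illustration; the project itself computed these values from its SQL query results.

```python
from collections import defaultdict
from datetime import date

# Hypothetical order rows: (customer_id, order_date, order_amount)
orders = [
    ("C1", date(2013, 1, 15), 120.0),
    ("C1", date(2013, 11, 20), 80.0),
    ("C2", date(2013, 3, 2), 2400.0),
    ("C3", date(2013, 12, 28), 35.0),
    ("C3", date(2013, 12, 5), 40.0),
    ("C3", date(2013, 6, 9), 25.0),
]

# Assumed reference date: the day after the end of the study year
REFERENCE_DATE = date(2014, 1, 1)


def compute_rfm(rows, reference_date):
    """Return {customer_id: {"recency", "frequency", "monetary"}}."""
    last_order = {}
    frequency = defaultdict(int)
    monetary = defaultdict(float)
    for customer_id, order_date, amount in rows:
        # Track the most recent order date per customer
        if customer_id not in last_order or order_date > last_order[customer_id]:
            last_order[customer_id] = order_date
        frequency[customer_id] += 1      # number of orders
        monetary[customer_id] += amount  # total amount spent
    return {
        cid: {
            "recency": (reference_date - last_order[cid]).days,
            "frequency": frequency[cid],
            "monetary": monetary[cid],
        }
        for cid in last_order
    }


rfm = compute_rfm(orders, REFERENCE_DATE)
for cid, values in sorted(rfm.items()):
    print(cid, values)
```

Recency here is the number of days between the customer's last order and the reference date, so a smaller value means a more recent customer.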

Depending on the company's objectives, customers can be segmented in several ways so that marketing campaigns remain financially viable.

The ideal customers for e-commerce companies are generally those who ordered most recently relative to the date of study (our reference date), who order very frequently and who spend enough.


Based on the RFM values, we assigned each customer a score between 1 and 3 for each of R, F and M. 3 is the best score and 1 is the worst.

Example: a customer who bought most recently, most often, and spent the most has an RFM score of 3–3–3.
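One way to produce such scores is to rank customers and split them into three equal groups. The sketch below does this with rank-based terciles; the text does not say how the project chose its thresholds, so the tercile rule, the sample values and the helper name `tercile_scores` are assumptions.

```python
# Hypothetical (recency_days, frequency, monetary) values per customer
rfm = {
    "C1": (42, 2, 200.0),
    "C2": (305, 1, 2400.0),
    "C3": (4, 3, 100.0),
    "C4": (120, 5, 900.0),
    "C5": (10, 8, 3100.0),
    "C6": (200, 1, 50.0),
}


def tercile_scores(values, higher_is_better=True):
    """Score each customer from 1 (worst) to 3 (best) by rank tercile.

    Ties are broken by input order, which is a simplification.
    """
    # Order customers from worst to best
    ordered = sorted(values, key=values.get, reverse=not higher_is_better)
    n = len(ordered)
    return {cid: 1 + (3 * rank) // n for rank, cid in enumerate(ordered)}


# For recency, fewer days since the last order is better
r = tercile_scores({c: v[0] for c, v in rfm.items()}, higher_is_better=False)
f = tercile_scores({c: v[1] for c, v in rfm.items()})
m = tercile_scores({c: v[2] for c, v in rfm.items()})

scores = {c: f"{r[c]}-{f[c]}-{m[c]}" for c in rfm}
for cid, score in sorted(scores.items()):
    print(cid, score)
```

Note that recency is scored in the opposite direction from frequency and monetary value, since a low number of days is the desirable outcome.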


Recommendations

Best Customers: Reward them for their multiple purchases. They can be early adopters of new products. Suggest “Refer a friend” to them. They are also likely to be the most loyal customers, with a habit of ordering.

Lost Cheap Customers: Send them personalized emails/messages/notifications to encourage them to order.

Big Spenders: Notify them about discounts to keep them spending on your products.

Loyal Customers: Create loyalty cards that earn points with each purchase; these points can then be converted into a discount.
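The segment names above can be attached to RFM scores with a small rule table. The rules in this sketch (3-3-3 for Best Customers, 1-1-1 for Lost Cheap Customers, F = 3 for Loyal Customers, M = 3 for Big Spenders) follow a common RFM convention and are assumptions; the text does not state the project's own segment definitions, and the `"Other"` fallback is hypothetical.

```python
def segment(r, f, m):
    """Map an (R, F, M) score triple to a segment name (assumed rules)."""
    if (r, f, m) == (3, 3, 3):
        return "Best Customers"
    if (r, f, m) == (1, 1, 1):
        return "Lost Cheap Customers"
    if f == 3:
        return "Loyal Customers"
    if m == 3:
        return "Big Spenders"
    return "Other"  # hypothetical catch-all for remaining combinations


for triple in [(3, 3, 3), (1, 1, 1), (2, 3, 2), (1, 1, 3), (2, 2, 2)]:
    print(triple, "->", segment(*triple))
```

The order of the checks matters: a 3-3-3 customer also satisfies the Loyal and Big Spender rules, so the most specific rule is tested first.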

Conclusion

Naive Bayes has an accuracy of 95% and an area under the ROC curve of 84%, but its true positive (TP) rate is 0.

Logistic Regression has an accuracy of 95% and an area under the ROC curve of 84%, but its TP rate is 0.

The Decision Tree model has an accuracy of 95% and an area under the ROC curve of 84%, but its TP rate is 3.3%.

The Random Forest model has an accuracy of 96% and an area under the ROC curve of 84%, and its TP rate is 12.5%.

Accuracy and area under the ROC curve are nearly identical across the four models, so the TP rate is what separates them. On that measure, the Random Forest model is better at identifying the customer segments that we need to target in our dataset.
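A high accuracy combined with a TP rate of 0 usually points to class imbalance: a model that never predicts the target class can still be right most of the time. The sketch below illustrates this with made-up confusion-matrix counts (1000 customers, 50 of them in the target segment); these counts are hypothetical and are not the project's results.

```python
def accuracy_and_tp_rate(tp, fp, tn, fn):
    """Return (accuracy, true positive rate) from confusion-matrix counts."""
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    # TP rate (recall): share of actual positives that the model finds
    tp_rate = tp / (tp + fn) if (tp + fn) else 0.0
    return accuracy, tp_rate


# Hypothetical model that always predicts "not in target segment"
always_negative = accuracy_and_tp_rate(tp=0, fp=0, tn=950, fn=50)

# Hypothetical model that finds a few of the target customers
finds_some = accuracy_and_tp_rate(tp=6, fp=4, tn=946, fn=44)

print("always negative:", always_negative)
print("finds some:     ", finds_some)
```

Both hypothetical models score about 95% on accuracy, yet only the second one identifies any target customers, which is why the TP rate is the more informative figure when the target segment is small.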

Contributors

surendra-tanniru
