Coder Social home page Coder Social logo

ecommerce-customer-churn-prediction's Introduction

Trường Đại học Công nghệ Thông tin | University of Information Technology

IS334.N21.TMCL - ECOMMERCE

Customer Churn Prediction

  • Class: ECOMMERCE - IS334.N21.TMCL
  • Lecturer: Mr. Do Duy Thanh

Introduction

  • This is my Ecommerce project, I build to only research and learn about Machine Learning. Understanding about How Machine Learning apply to ECommerce.

Context

In today's highly competitive business landscape, retaining customers is crucial for the success of an organization. A customer churn prediction model using various machine learning algorithms including logistic regression, random forest, Decision Tree

Dataset:

  • Content Each row represents a customer, each column contains customer’s attributes described on the column Metadata.

  • The data set includes information about:

    • Customers who left within the last month – the column is called Churn
    • Services that each customer has signed up for – phone, multiple lines, internet, online security, online backup, device protection, tech support, and streaming TV and movies
    • Customer account information – how long they’ve been a customer, contract, payment method, paperless billing, monthly charges, and total charges
    • Demographic info about customers – gender, age range, and if they have partners and dependents
  • Source: https://www.kaggle.com/datasets/blastchar/telco-customer-churn

Pipeline:

pipeline

Difficult Points:

different points

  • Imbalanced data: In fact, the data we collect will often not have a balance between classes (stayed/churned/...). This is understandable because if the 'churn' is balanced with the 'stayed' class, the organization is on the verge of bankruptcy. The harm of data imbalance is very serious.
    • Eg: Suppose we have 100 samples, 99 class 'non churn' and 1 class 'churn'. The model only needs to make predictions that are all 'non churn' to be 99% accurate, perfect. But in reality it is really useless.
    • To solve this problem, We have many methods to solve. The simplest is that we will change the evaluation metrics, specifically we will no longer use the 'accuracy' metrics.
  • Feature Featuring: The problem requires us to have the knowledge of an expert in the field we are solving. We need to understand the features of the data, which features will affect the desired output
  • Model selection: there are many methods to solve this problem, Logistic Regression, K algorithms, classification algorithms, neural networks,...

Procession:

Details of data preprocessing, training and evaluation of the model, I made a note in the notebook, please read it.

Note

  • Deep learning model notebook as extra (Deep neural is a complicated process. In reality, simple things is usually more effective), I have not understood it yet. I'm not sure it work correctly and effectively. Maybe it overfit and I do not see that :DD So I do not recommend to consult this one.
  • In Ecommerce project, We requied to create a dashboard to visualize data. this is the URL to Dashboard project: https://github.com/HungPham2002/Dashboard_ECommerce

References:

[1] Susan Li: Predict Customer Churn – Logistic Regression, Decision Tree and Random Forest (2017).

[2] Lalwani, P., Mishra, M. K., Chadha, J. S. and Sethi, P., “Customer churn prediction system: A machine learning approach”, 2022.

[3] Asthana P: A comparison of machine learning techniques for customer churn prediction. International Journal of Pure and Applied Mathematics (2018).

[4] Hyperparameters and Tuning Strategies for Random Forest by Philipp Probst, Marvin Wright and Anne-Laure Boulesteix (February 27, 2019)

[5] Qureshi, S.A., Rehman, A.S., Qamar, A.M., Kamal, A., Rehman, A.: Telecommunication subscribers’ churn prediction model using machine learning. In: Eighth International Conference on Digital Information Management (ICDIM 2013)

[7] Umayaparvathi V, Iyakutti K: A survey on customer churn prediction in the telecom industry: Datasets, methods and metrics. International Research Journal of Engineering and Technology (IRJET) (2016).

[9] Altexsoft: Customer Churn Prediction Using Machine Learning: Main Approaches and Models

[11] Zakria Saad, Customer churn prediction on Telecom dataset

[12] Germann, F, Lilien, G.L, Fiedler, L, and Kraus, M., “Do retailers benefit from deploying customer analytics?”, Journal of Retailing (2014).

[13] Mishra, A. and Reddy, U. S., “A novel approach for churn prediction using deep learning”, Proceedings of the IEEE International Conference on Computational Intelligence and Computing Research (ICCIC), Coimbatore India, 1–4, (2017).

[14] Using Decision Trees to Predict Customer Churn (Published On: 10 April 2020 - By Cybiant)

ecommerce-customer-churn-prediction's People

Contributors

hungpham2002 avatar

Stargazers

kishorekumar avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.