Employee Attrition Prediction - Bluechip Summit Hackathon Solution

This repository contains my solution for the Employee Attrition Prediction challenge presented at the Bluechip Summit hackathon. The objective of this challenge was to predict the probability of employee attrition within a given company using machine learning techniques.

Leaderboards

Public Leaderboard (20% Test Data)

The public leaderboard ranked submissions on approximately 20% of the test data while the hackathon was running. The final results, based on the remaining 80% of the test data, led to different standings.

Figure: Public leaderboard (20% of test data)

  • Position (out of 158): 85
  • Accuracy: 0.90164
  • Percentage Accuracy: 90.164%

Private Leaderboard (80% Test Data - Final Standings)

The private leaderboard gives the final standings in the Employee Attrition Prediction challenge, calculated on the remaining approximately 80% of the test data once the competition concluded.

Figure: Final leaderboard (80% of test data)

  • Position (out of 158): 65
  • Accuracy: 0.87396
  • Percentage Accuracy: 87.396%
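
A gap between public and private scores is what one would expect when the same predictions are scored on two disjoint subsets of the test set. The sketch below illustrates the idea with made-up labels. The function names, the random split, and the 0.2 fraction are illustrative assumptions, not the hackathon platform's actual procedure.

```python
import random


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    if not y_true or len(y_true) != len(y_pred):
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def split_scores(y_true, y_pred, public_fraction=0.2, seed=0):
    """Score a random 'public' subset and the remaining 'private' rows separately."""
    indices = list(range(len(y_true)))
    random.Random(seed).shuffle(indices)      # reproducible shuffle
    cut = int(len(indices) * public_fraction)
    public, private = indices[:cut], indices[cut:]

    def score(subset):
        return accuracy([y_true[i] for i in subset], [y_pred[i] for i in subset])

    return score(public), score(private)


# Made-up labels (1 = left the company) and predictions for ten employees.
y_true = [0, 0, 0, 0, 1, 1, 0, 0, 1, 0]
y_pred = [0, 0, 1, 0, 1, 0, 0, 0, 1, 0]
public_score, private_score = split_scores(y_true, y_pred)
print(f"public: {public_score:.3f}, private: {private_score:.3f}")
```

The two scores generally differ because each is computed on different rows, and the smaller public subset is the noisier estimate of the two.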

Problem Statement

Employee attrition poses a significant challenge for organizations, impacting productivity and incurring high recruitment costs. The task was to develop a predictive model that anticipates the likelihood of an employee leaving the company based on provided data.

File Structure

Employee-Attrition-Prediction-Bluechip-Hackathon-Solution/
├── assets/
│   └── images/
│       ├── private_leaderboard.png
│       └── public_leaderboard.png
├── data/
│   ├── train.csv
│   ├── test.csv
│   ├── sample_submission.csv
│   └── data_with_attrition2.csv
├── modeling/
│   └── models.ipynb
├── README.md
└── requirements.txt

Dataset

The dataset provided for this challenge includes three main files:

  • train.csv: Contains the training data with various features related to employees.
  • test.csv: Holds the test data for which predictions need to be made.
  • sample_submission.csv: A sample submission file format to follow for submitting predictions.

Solution Overview

The solution involved several key steps:

  • Exploratory Data Analysis (EDA): Understanding the data distribution, identifying patterns, and gaining insights into factors influencing attrition.
  • Data Preprocessing: Handling missing values, encoding categorical variables, and feature scaling.
  • Feature Engineering: Creating new features or transforming existing ones to improve model performance.
  • Model Building: Utilizing machine learning algorithms to predict the probability of employee attrition.
  • Evaluation: Assessing model performance using metrics such as accuracy, precision, and recall.
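
As a rough illustration of the evaluation step, the sketch below turns predicted probabilities into labels and computes accuracy, precision, and recall in plain Python. The 0.5 threshold, the function names, and the example data are assumptions made for illustration; the notebook may compute these metrics differently (for example, with a library).

```python
def to_labels(probabilities, threshold=0.5):
    """Convert predicted attrition probabilities into 0/1 labels."""
    return [1 if p >= threshold else 0 for p in probabilities]


def classification_metrics(y_true, y_pred, positive=1):
    """Compute accuracy, precision, and recall for binary labels."""
    pairs = list(zip(y_true, y_pred))
    if not pairs:
        raise ValueError("no labels to evaluate")
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = len(pairs) - tp - fp - fn
    return {
        "accuracy": (tp + tn) / len(pairs),
        # Defined as 0.0 here when there are no positive predictions or labels.
        "precision": tp / (tp + fp) if tp + fp else 0.0,
        "recall": tp / (tp + fn) if tp + fn else 0.0,
    }


# Made-up example: nine employees who stayed (0) and one who left (1).
y_true = [0] * 9 + [1]
probabilities = [0.1] * 10  # a model that never flags anyone
metrics = classification_metrics(y_true, to_labels(probabilities))
print(metrics)  # {'accuracy': 0.9, 'precision': 0.0, 'recall': 0.0}
```

In this made-up case accuracy is 0.9 even though no leaver is identified, which is why precision and recall are worth reporting alongside accuracy when the classes are imbalanced.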

Getting Started

Prerequisites

  • Python 3
  • Jupyter Notebook (needed only to run the provided notebook)
  • Required Python libraries (listed in requirements.txt)

Clone the Repository

git clone https://github.com/francisojeah/Employee-Attrition-Prediction-Bluechip-Hackathon-Solution.git

Install dependencies:

pip install -r requirements.txt

Usage

  • Place the provided train.csv and test.csv files in the project directory.
  • Run the provided notebook, modeling/models.ipynb, to reproduce the analysis, feature engineering, model building, and predictions.
  • Ensure that the submission file follows the format specified in sample_submission.csv.
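
One way to check the last point is to compare a submission file against sample_submission.csv before uploading. This is a minimal sketch using only the standard library; the function names are hypothetical, and it assumes the sample file has the same header and one row per test record as a valid submission.

```python
import csv


def load_rows(path):
    """Read a CSV file into a list of rows (each row is a list of strings)."""
    with open(path, newline="", encoding="utf-8") as handle:
        return list(csv.reader(handle))


def check_submission(submission_path, sample_path):
    """Compare a submission against the sample file and return a list of problems."""
    submission = load_rows(submission_path)
    sample = load_rows(sample_path)
    if not submission or not sample:
        return ["one of the files is empty"]

    problems = []
    if submission[0] != sample[0]:
        problems.append(
            f"header {submission[0]} does not match expected {sample[0]}"
        )
    if len(submission) != len(sample):
        problems.append(
            f"{len(submission) - 1} data rows, expected {len(sample) - 1}"
        )
    return problems
```

For example, check_submission("submission.csv", "data/sample_submission.csv") returns an empty list when the header and row count match; submission.csv is a placeholder name for whatever file the notebook writes.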

Results

The best-performing model achieved an accuracy of 90.87% on the test set, demonstrating its effectiveness in predicting employee attrition.

License

This project is licensed under the MIT License.
