Coder Social home page Coder Social logo

adityadj98 / analyzing-employee-s-performance-for-hr-analytics Goto Github PK

View Code? Open in Web Editor NEW

This project forked from swatikhedekar/analyzing-employee-s-performance-for-hr-analytics

0.0 0.0 0.0 1.51 MB

End-to-end real time project from hicounselor

Home Page: https://hicounselor.com/projects

License: Apache License 2.0

Jupyter Notebook 100.00%

analyzing-employee-s-performance-for-hr-analytics's Introduction

Project Title

Analyzing Employee’s Performance for HR Analytics

Description

This project will train you how to use SQL to analyze a real-world database, how to extract the most useful information from the dataset, how to pre-process the data using Python for improved performance, and how to use a structured query language to retrieve useful information from the database.

About Dataset

You will be using a real-world dataset of the Employee's records to complete this project. This project intends to evaluate the provided dataset, solve business problems on this dataset and mine information insights.

Data source

I have used the data available at HiCounselor website HR analytics data challenge, 2023. The dataset fulfils the requirements for the project and is in the CSV format.

Download Dataset

Module 1: Pre-processing the dataset

Module 2: Run SQL queries

Step by Step Process of Live Project Execution

  1. Data cleaning with Python:
  • removing duplicates and
  • handling null values,
  • deletion or transformation of irrelevant values,
  • data type transformation,
  • data validations.
  1. SQL Queries:
  • aggregating the data,
  • grouping the data,
  • ordering the data,
  • using case
  • use having
  • sub queries etc

Module 1: Data Preprocessing using python

Steps To Perform Data Preprocessing

  • Step 1: Removing duplicate rows.
  • Step 2: Removing rows for which numeric columns are having irrelevant data type values
  • Step 3: Remove irrelevant values from each column if any. Validation all values for a column, Check for any inconsistencies or discrepancies in data types, units, or formats.Feel free to add more validation checks which you might feel necessary for the dataset’s integrity
  • Step 4: Export the cleaned dataset as a .csv file: prefer UTF-8 encoding.

We get cleaned dataset in csv.Then perform following steps.

  • Step 5: Convert the pre-processed dataset into an SQL file.
  • Step 6: Manually generate a table by utilizing the database information provided in the "Database Info" tab.

Module 2:

In this module, you will be working on performing data analysis on the pre-processed data from the previous module and conducting Data Analysis using SQL. You will generate queries for given problem statements.

Task 1: Find the average age of employees in each department and gender group.

  • ( Round average age up to two decimal places if needed).

Task 2:List the top 3 departments with the highest average training scores.

  • ( Round average scores up to two decimal places if needed)

Task 3:Find the percentage of employees who have won awards in each region.

  • (Round percentages up to two decimal places if needed)

Task 4:Show the number of employees who have met more than 80% of KPIs for each recruitment channel and education level.

Task 5:Find the average length of service for employees in each department, considering only employees with previous year ratings greater than or equal to 4.

  • (Round percentages up to two decimal places if needed)

Task 6:List the top 5 regions with the highest average previous year ratings.

  • ( Round average ratings up to two decimal places if needed)

Task 7:List the departments with more than 100 employees having a length of service greater than 5 years.

Task 8:Show the average length of service for employees who have attended more than 3 training, grouped by department and gender.

  • ( Round average length up to two decimal places if needed)

Task 9:Find the percentage of female employees who have won awards, per department. Also show the number of female employees who won awards and total female employees.

  • ( Round percentage up to two decimal places if needed)

Task 10:Calculate the percentage of employees per department who have a length of service between 5 and 10 years.

  • ( Round percentage up to two decimal places if needed)

Task 11:Find the top 3 regions with the highest number of employees who have met more than 80% of their KPIs and received at least one award, grouped by department and region.

Task 12:Calculate the average length of service for employees per education level and gender, considering only those employees who have completed more than 2 trainings and have an average training score greater than 75.

  • ( Round average length up to two decimal places if needed)

Task 13:For each department and recruitment channel, find the total number of employees who have met more than 80% of their KPIs, have a previous_year_rating of 5, and have a length of service greater than 10 years.

Task 14:Calculate the percentage of employees in each department who have received awards, have a previous_year_rating of 4 or 5, and an average training score above 70, grouped by department and gender .

  • ( Round percentage up to two decimal places if needed).

Task 15:List the top 5 recruitment channels with the highest average length of service for employees who have met more than 80% of their KPIs, have a previous_year_rating of 5, and an age between 25 and 45 years, grouped by department and recruitment channel.

  • ( Round average length up to two decimal places if needed).

Environment Variables

To run this project, you will need to registered for this project with HiCounselor. They alloted a team and database with user and password. then you can access the sandbox.

Screenshots

hrinfo

Tech

Team

  1. Swati Khedekar
  2. Rohit Raj Sarraf
  3. Andrew Strain
  4. Koushal Kashyap

Feedback

If you have any feedback, please reach out to us linkedin

Thank You!

analyzing-employee-s-performance-for-hr-analytics's People

Contributors

swatikhedekar avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.