To write a program to implement the Decision Tree Classifier Model for Predicting Employee Churn.
- Hardware โ PCs
- Anaconda โ Python 3.7 Installation / Jupyter notebook
- Import pandas module and import the required data set.
- Find the null values and count them.
- Count number of left values.
- From sklearn import LabelEncoder to convert string values to numerical values.
- From sklearn.model_selection import train_test_split.
- Assign the train dataset and test dataset.
- From sklearn.tree import DecisionTreeClassifier.
- Use criteria as entropy.
- From sklearn import metrics.
- Find the accuracy of our model and predict the require values.
/*
Program to implement the Decision Tree Classifier Model for Predicting Employee Churn.
Developed by:Vasanthamukilan M
RegisterNumber:212222230167
*/
import pandas as pd
from sklearn.tree import plot_tree
import matplotlib.pyplot as plt
data=pd.read_csv("/content/Employee_EX6.csv")
data.head()
data.info()
data.isnull().sum()
data['left'].value_counts()
from sklearn.preprocessing import LabelEncoder
le=LabelEncoder()
data["salary"]=le.fit_transform(data["salary"])
data.head()
x=data[["satisfaction_level","last_evaluation","number_project","average_montly_hours","time_spend_company","Work_accident","promotion_last_5years","salary"]]
x.head()
y=data["left"]
from sklearn.model_selection import train_test_split
x_train,x_test,y_train,y_test=train_test_split(x,y,test_size=0.2,random_state=100)
from sklearn.tree import DecisionTreeClassifier
dt=DecisionTreeClassifier(criterion="entropy")
dt.fit(x_train,y_train)
y_pred=dt.predict(x_test)
from sklearn import metrics
accuracy=metrics.accuracy_score(y_test,y_pred)
print(accuracy)
dt.predict([[0.5,0.8,9,260,6,0,1,2]])
plt.figure(figsize=(18,6))
plot_tree(dt,feature_names=x.columns,class_names=['salary','left'],filled=True)
plt.show()
Thus the program to implement the Decision Tree Classifier Model for Predicting Employee Churn is written and verified using python programming.