Data preprocessing is a process of preparing the raw data and making it suitable for a machine learning model. A real-world data generally contains noises, missing values, and maybe in an unusable format which cannot be directly used for machine learning models. Data preprocessing is required tasks for cleaning the data and making it suitable for a machine learning model which also increases the accuracy and efficiency of a machine learning model. It involves below steps:
- Getting a dataset
- Importing the required libraries
- Importing the dataset
- Finding missing data
- Encoding categorical data
- Spliting dataset into train and test dataset
- Feature scaling