#Report:: ML_with_Python_Assigin
- Retrieve data from different sources as shown below and saved in .csv Format.
Data1 https://github.com/toUpperCase78/formula1-datasets/blob/master/formula1_2019season_drivers.csv
Data3 https://www.kaggle.com/datasets/deepcontractor/froza-horizon-5-cars-dataset
Data4 https://data.world/ogletree/tour-down-under-2018
Data5 https://github.com/toUpperCase78/formula1-datasets/blob/master/formula1_2021season_drivers.csv
- Data2 is taken for the Visualization and Data Pre-processing: a. Read the data and merged.
b. Checked Missing values: There was no missing values present.
c. Taken some of the columns data Visualization: Correlation Matrix: The correlation matrix displays the relationship between numerical variables in the data. The heatmap shows ‘timetaken_in_millisec' has a strong negative correlation with 'wins' and 'points' (i.e. inverse relation ).
d. Top 10 Car Companies by Wins: The bar chart displays the top 10 companies with the highest total wins. It shows that “Ferrari” has the highest number of wins, followed by ‘McLaren’ and 'Mercedes'.
e. Top 10 Nationalities by Wins: The bar chart displays the top 10 nationalities with the highest number of wins. It shows that drivers from 'British (Britain)' have the highest number of wins, followed by 'Great Britain' and 'Brazil'.
f. Wins by Company and Driver Nationality: The stacked bar chart displays the number of wins by company and driver nationality for the top 10 companies. It shows that 'British' has the highest number of wins across various driver nationalities, followed by 'Mercedes' and 'McLaren'.