This repository contains a notebook that predicts passenger survival on the Titanic for the well-known Kaggle competition. The primary purpose of this project was to improve my knowledge of data visualization techniques, feature engineering, and basic machine learning classification algorithms.
The original Kaggle submission can be found here.
The data used comes from the Kaggle competition, "Titanic: Machine Learning from Disaster".
- scikit-learn
- Plotly
- XGBoost
- NumPy
- Pandas
predicting-survival-on-titanic.ipynb
: Contains the code used to make the kernel, and outputs the submission file for the competition.
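As a quick sanity check on the notebook's output, the sketch below validates a submission file against the format the competition expects: a `PassengerId,Survived` header followed by one 0/1 prediction per row. The function name `check_submission` and the file name `submission.csv` are assumptions for illustration; the notebook may write its output under a different name.

```python
import csv
from pathlib import Path


def check_submission(path):
    """Return the number of prediction rows; raise ValueError if the file is malformed."""
    with Path(path).open(newline="") as handle:
        reader = csv.reader(handle)
        header = next(reader, None)
        if header != ["PassengerId", "Survived"]:
            raise ValueError(f"unexpected header: {header}")
        count = 0
        for line_number, row in enumerate(reader, start=2):
            if len(row) != 2:
                raise ValueError(f"line {line_number}: expected 2 columns, got {len(row)}")
            passenger_id, survived = row
            int(passenger_id)  # raises ValueError if the ID is not an integer
            if survived not in ("0", "1"):
                raise ValueError(f"line {line_number}: Survived must be 0 or 1")
            count += 1
    return count


if __name__ == "__main__":
    # "submission.csv" is an assumed file name, not necessarily the notebook's.
    target = Path("submission.csv")
    if target.exists():
        print(f"{target}: {check_submission(target)} predictions look well-formed")
    else:
        print(f"{target} not found; run the notebook first")
```

This uses only the standard library, so it can be run without installing the packages listed above.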
- Clone the repository
git clone https://github.com/mdylan2/titanic.git
- Download the data from here and store it in the relevant directory
- Run the notebook
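Before running the notebook, it can help to confirm that the downloaded files are where the notebook will look for them. The sketch below is one way to do that. The file names `train.csv` and `test.csv` are the ones the Kaggle competition distributes; the `data` directory and the helper name `missing_data_files` are hypothetical, so substitute the directory the notebook actually reads from.

```python
from pathlib import Path

# File names as distributed by the Kaggle competition.
EXPECTED_FILES = ("train.csv", "test.csv")


def missing_data_files(data_dir, expected=EXPECTED_FILES):
    """Return the expected file names that are not present in data_dir."""
    folder = Path(data_dir)
    return [name for name in expected if not (folder / name).is_file()]


if __name__ == "__main__":
    # "data" is a hypothetical location; use the directory the notebook reads from.
    missing = missing_data_files("data")
    if missing:
        print("Missing files:", ", ".join(missing))
    else:
        print("All expected data files found")
```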
Please reach out to me on GitHub if you have any questions.