This is the project for the course 'Fundamentals of Data Science' (Fall 2021).
We analyzed the length of stay of patients in hospitals across different cities. The dataset is publicly available on Kaggle.
Steps performed:
- Created data quality reports for both categorical and continuous variables.
- Identified data quality issues present in the dataset by studying data quality reports.
- Resolved the data quality issues using the methods taught in class.
- Prepared the data for further analysis. Visualized both categorical and continuous variables to identify underlying trends and the distribution of the data.
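
The data quality reports described in the steps above could be sketched roughly as follows. This is a minimal illustration using pandas, not the project's actual code: the function names `continuous_report` and `categorical_report`, the sample columns, and the choice of statistics are all assumptions made for the example.

```python
import pandas as pd


def continuous_report(df: pd.DataFrame) -> pd.DataFrame:
    """Summarize numeric columns: one row per column."""
    num = df.select_dtypes(include="number")
    return pd.DataFrame({
        "count": num.count(),                   # non-missing values
        "missing_pct": num.isna().mean() * 100,
        "cardinality": num.nunique(),           # distinct non-missing values
        "min": num.min(),
        "q1": num.quantile(0.25),
        "mean": num.mean(),
        "median": num.median(),
        "q3": num.quantile(0.75),
        "max": num.max(),
        "std": num.std(),
    })


def categorical_report(df: pd.DataFrame) -> pd.DataFrame:
    """Summarize non-numeric columns: one row per column."""
    cat = df.select_dtypes(exclude="number")
    rows = {}
    for col in cat.columns:
        counts = cat[col].value_counts()        # sorted, most frequent first
        rows[col] = {
            "count": cat[col].count(),
            "missing_pct": cat[col].isna().mean() * 100,
            "cardinality": cat[col].nunique(),
            "mode": counts.index[0] if len(counts) else None,
            "mode_freq": counts.iloc[0] if len(counts) else 0,
        }
    return pd.DataFrame.from_dict(rows, orient="index")


# Hypothetical sample rows; the real dataset's column names may differ.
sample = pd.DataFrame({
    "Age": ["31-40", "41-50", None, "31-40"],
    "Admission_Deposit": [4000.0, 5200.0, None, 4800.0],
    "Visitors": [2, 4, 2, 3],
})

cont = continuous_report(sample)
cat = categorical_report(sample)
print(cont)
print(cat)
```

Columns with a high `missing_pct`, or a cardinality of 1, are the kind of entries one would typically flag as data quality issues when reading such a report.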
Our analysis addressed the following questions:
- How many cases are there per hospital? //Done
- How many hospitals fall in each city? //Done
- How many cases did each hospital handle across cities?
- How often did patients visit the hospital? //Done
- Do patients prefer the same hospital every time? //Done
- Which is the most visited department? //Done
- Do older people stay longer? //Done
- Is the initial admission deposit related to the length of stay? //Done
- How long do patients stay, based on severity? //Done
- Are age and stay days related to severity?
- Are beds allotted based on severity?
- Are there more visitors for patients with severe illness?
- Do hospitals provide extra rooms for long stays?
- How many hospital types fall in each region?
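
Several of the questions above reduce to group-and-aggregate operations. The sketch below shows how a few of them might be answered with pandas. The column names (`hospital_code`, `city_code`, `department`, `severity`, `stay_days`) and the rows are hypothetical, and the sketch assumes length of stay is available as a number of days, which may not match the actual dataset.

```python
import pandas as pd

# Hypothetical records, one row per case; not taken from the real dataset.
visits = pd.DataFrame({
    "hospital_code": [1, 1, 2, 3, 3, 3],
    "city_code": [10, 10, 10, 20, 20, 20],
    "department": ["gynecology", "radiotherapy", "gynecology",
                   "gynecology", "anesthesia", "gynecology"],
    "severity": ["Minor", "Extreme", "Moderate",
                 "Extreme", "Minor", "Moderate"],
    "stay_days": [5, 30, 12, 41, 8, 15],
})

# Cases per hospital: count rows in each hospital group.
cases_per_hospital = visits.groupby("hospital_code").size()

# Hospitals per city: count distinct hospital codes within each city.
hospitals_per_city = visits.groupby("city_code")["hospital_code"].nunique()

# Most visited department: the most frequent department value.
most_visited_department = visits["department"].value_counts().idxmax()

# Stay by severity: median stay in days for each severity level.
median_stay_by_severity = visits.groupby("severity")["stay_days"].median()

print(cases_per_hospital)
print(hospitals_per_city)
print(most_visited_department)
print(median_stay_by_severity)
```

The median is used here rather than the mean because stay lengths tend to be skewed by a few very long admissions; that is a design choice for the illustration, not something stated in the project.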