Coder Social home page Coder Social logo

nadiarichards / full_stack_healthcare_project Goto Github PK

View Code? Open in Web Editor NEW

This project forked from noahgoldstein13/full_stack_data_analytics_project

0.0 0.0 0.0 9.74 MB

Jupyter Notebook 77.77% Python 8.24% HTML 3.08% JavaScript 9.09% CSS 1.82%

full_stack_healthcare_project's Introduction

Full Stack Healthcare Value of Care Analysis Project

By: Nadia Richards, Samori Emmanuel, Noah Goldstein, Perry Reynolds

Background


This project's main purpose was to make a full stack app that utilizes multiple tools including Python, SQL, Flask API and front-end work with HTML, CSS and JavaScript (Plotly, Leaflet, D3 libraries). We chose several datasets to achieve that specifically for healthcare revenue space. We originally pulled and cleaned data that had multiple data points to quantify value of care as well including readmission ratio in each facility based on different care categories (that table is in our PostreSQL, but we ended up not using it in our visualizations), clinical outcomes dataset per facility/zip code as well as dataset with hospital-acquired infections. We have also explored more detailed Census data including poverty rate per zipcode as well as total population and it's racial distribution. We wanted to explore a deeper relationshios between multitude of those variables, but were limited with just chosing to present 3 visualizations as well as limited timeframe. There is definitely space for further exploration within these topics.

Premise: In the wake of a global pandemic, public health and healthcare have risen to the forefront of the public eye. Questions have been raised and scenarios proposed about our current healthcare system and the flaws within it. We took an extensive look at the various care models across the country and how they compare in terms of Medicare/Medicaid payments per condition type, as well as how they vary across the country. By focusing specifically on Heart Failure, Pneumonia, Hip/Knee Replacements, and Heart Attacks, we were able to take an in depth look at medical data and develop the results detailed below.

Tools and Technology Used for Analysis


  • Visual Studio Code
  • Healthcare Data CSVs
  • Python
  • Jupyter Notebook
  • Pandas
  • QuickDBD
  • SQL
  • PostgreSQL
  • APIs
  • Javascript
  • D3
  • Plotly
  • Leaflet
  • MapBox
  • CSS
  • HTML
  • Flask App

Process Flow


  1. CSV and API into Pandas
  2. Cleanse data
  3. Push data to SQL database
  4. Create Flask API
  5. Pull data into browser using JavaScript
  6. Generate Dashboard using D3, MapBox, Plotly, and Leaflet
  7. HTML and CSS formatting / organization

Data Sources (1 CMS.gov Dataset, 1 Census.gov Dataset, 1 OpenDataSoft.com Dataset)


  1. National Payment and Value of Care Data by Hospital: see dataset HERE
  2. Census Data - Zipcodes, Population, Poverty Rate, and Median Income: see data source HERE
    • We used the following sources to utilize the census data API:
      • Library Documentation LINK
      • Desired data labels LINK
  3. US Zip Code Latitude and Longitude Coordinates: see dataset HERE

Dataset Values Utilized and Ensuing Relationships (QuickDBD)


Screenshot

Dashboard Charts & Visualizations


Screenshot Screenshot Screenshot

Summary of Findings


Key Insights From Dashboard

  1. When analyzing the heatmap visualization we noticed that the Midwest was rather unpopulated compared to the East and West coasts. We attributed this observation to the fact that the middle third of the United States is significantly less populous than the coasts.
  2. When analyzing the scatter plot, we noticed that although the density of zipcodes with Medicare/Medicaid payments is most populous in the $25,000 - $80,000 median household income range, the maximum payments by zipcode is actually highest in the more wealthy median zipcode areas. One potential reason for this observation could be that there is a high density of wealthier elderly patients utilizing Medicare/Medicaid in those regions.
  3. Pneumonia was the highest grossing Medicare/Medicaid primary ailment between 2016-2019

Main Roadblocks While Developing Full Stack Application

  1. While developing our table schema in PostgreSQL, we noticed that our zipcode fields dropped the leading "0" for all instances starting with "0" or "00" from categorizing the field as numeric data rather than string data.
  2. Cross Origin integration proved challenging when pulling data into D3 from our Flask API
  3. We encountered difficulty with conflicting HTML and CSS coding when formatting our dashboard visualizations, bootstrap cards, svgs, and images

Limitations to Our Data

  1. The data we utilized only provides gross Medicare/Medicaid payments and does not include self-pay or private insurance reimbursements. The percentage (%) breakdown between Medicare/Medicaid, self-pay and private insurance reimbursements may provide drastically different insights based upon the hospital and population that it serves.
  2. The data utilized was gathered from 2016 - 2019, so it is slightly dated
  3. There were approximately 50 zipcodes that did not have data fields populated

Additional Thoughts to be Explored Further

  1. Average population age per zipcode and its relationship to Medicare/Medicaid payments
  2. Variation between care facilities accepting Medicare/Medicaid vs private-only
  3. Correlation between Medicare/Medicaid payments with median household income
  4. Average cost by primary ailment category
  5. Analyze more current data to see how COVID-19 would have impacted our results
  6. Analyze similar topics categorized by age groups rather than by zipcode or median income

full_stack_healthcare_project's People

Contributors

nadiarichards avatar noahgoldstein13 avatar semmanu1 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.