Coder Social home page Coder Social logo

cs598dlh's Introduction

CS 598 DLH Final Project

Name: Fabio Vera

Team: 133

Project: 11

Github link: https://github.com/fverac/cs598dlh

Video link: https://www.youtube.com/watch?v=ymaahdX9844

Reproducing: "Predicting 30-days mortality for MIMIC-III patients with sepsis-3: a machine learning approach using XGboost."

Project setup / Installation

The two prerequisites needed to be able to run the notebook are to:

  1. Obtain MIMIC-III data, and
  2. Set up your environment.

MIMIC-III

Data is all from MIMIC III v1.4, which "is a large, freely-available database comprising deidentified health-related data associated with over forty thousand patients who stayed in critical care units of the Beth Israel Deaconess Medical Center between 2001 and 2012. The database includes information such as demographics, vital sign measurements made at the bedside (~1 data point per hour), laboratory test results, procedures, medications, caregiver notes, imaging reports, and mortality (including post-hospital discharge)."

Data can be obtained from this link after completing the required training. https://physionet.org/content/mimiciii/1.4/

Precomputed features like first-day vital signs and many more were obtained from Google BigQuery, which in addition to storing More information as well as underlying code for creating these precomputed/derived tables can be found here: https://github.com/MIT-LCP/mimic-code/tree/main/mimic-iii/concepts

Additionally, here are some resources for accessing MIMIC through BigQuery: https://mimic.mit.edu/docs/gettingstarted/cloud/bigquery/

Tables needed:

  1. Raw MIMIC-III tables
    1. D_ICD_DIAGNOSES
    2. DIAGNOSES_ICD
  2. Precomputed MIMIC concepts
    1. icustay_detail
    2. labs_first_day
    3. heightweight
    4. vitals_first_day
    5. gcs_first_day
    6. elixhauser_ahrq_v37
    7. ventilation_first_day
    8. rrt_first_day
    9. urine_output_first_day
    10. sapsii
    11. sofa
    12. qsofa

The notebook expects the raw uncompressed MIMIC-III csv's to reside in the mimic-iii-clinical-database-1.4 directory, and similarly expects the precomputed mimic concepts files to live in the mimic_concepts directory.

Setting up the environment

For convenience, an environment.yml file is provided for reproducibility, for use with Conda. To create the project environment one can use the following command

conda env create -f environment.yml
conda activate dlh_final_proj

In the event that Conda is not possible to use, key environment details are below:

python 3.9.12
sklearn 1.0.2
pandas 1.4.2
numpy 1.21.5
xgboost 2.0.3

cs598dlh's People

Contributors

fverac avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.