Light

kentchun33333 / kaggle_rsna2019_4th_solution Goto Github PK

View Code? Open in Web Editor NEW

This project forked from xuxusss/kaggle_rsna2019_4th_solution

0.0 1.0 0.0 130 KB

Shell 2.62% Python 97.38%

kaggle_rsna2019_4th_solution's Introduction

Hello!

Below you can find a outline of how to reproduce my solution for the RSNA Intracranial Hemorrhage Detection competition.

Visit kaggle forum for solution overview: Kaggle RSNA Intracranial Hemorrhage Detection: 4th Place Solution

Our code is based on Appian's repo: https://github.com/appian42/kaggle-rsna-intracranial-hemorrhage

HARDWARE

Ubuntu 16.04
NVIDIA 2080Ti

SOFTWARE

(python packages are detailed separately in requirements.txt)

Python 3.6.7
CUDA 10.1
CUDNN 7501
NVIDIA Drivers 418.67

START

Setup environment
Place the raw data into ./IFE_1/input folder.
1. The test data correspond to the test data provided in the Stage 2 of competition.
2. Use stage 2 training data to train the model.
cd IFE_1, run ./bin/preprocess.py to preprocess the training and test images and split the training data into five folds.
To train:
1. Train feature extraction models
  - Go to IFE_1, IFE_2, IFE_3, run ./bin/train.sh to train five fold models. Models are saved in /model/. Best models are saved as foldx_best.pt.
  - It will take about 24 ~ 48 hours to train each model for one fold.
2. Extract features
  - Go to IFE_1, IFE_2, IFE_3, run ./bin/gen_feat_train.sh and ./bin/gen_feat_test.sh to generate 1D (and 3D features). Use the best models generated from step 4.1.1.
  - It will take around 5 hours to extract one feature set (train/test TTA5).
3. Train classification models.
  - Go to folder cls_1, cls_2, cls_3, run ./bin/train.sh, train five fold models for each folder.
  - It will take around 3 hours to train 1D+3D model (single model), and around 1.5 hours to train 1D model (single model).
To infer:
1. Extract test features.
  - Go to folder IFE_1, IFE_2, IFE_3, run ./bin/gen_feat_test.sh to extract test features.
2. Predict classification probabilities
  - Go to folder cls_1, cls_2, cls_3, run ./bin/predict.sh to predict result using extracted features.
3. Ensemble
  - run ./libs/ensemble.sh to ensemble all the predictions.
Models and features are generated in sequence. If one follows the above mentioned steps in order, all the softlinks should be valid by the time they are referred.

kaggle_rsna2019_4th_solution's People

Contributors

Watchers

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.