Coder Social home page Coder Social logo

yc-coder-chen / detecting-cancer-on-gigapixel-images Goto Github PK

View Code? Open in Web Editor NEW
13.0 4.0 3.0 21.86 MB

Columbia University Applied Deep Learning Project: Detecting Cancer Metastases on Gigapixel Pathology Images

Jupyter Notebook 100.00%
deep-learning gigapixel-images detecting-cancer-metastases tensorflow tensorflow2 keras

detecting-cancer-on-gigapixel-images's Introduction

Detecting Cancer Metastases on Gigapixel Pathology Images

Columbia University Applied Deep Learning Course Project (Fall 2019)
Author: Yingxiang Chen
Columbia Uni: yc3526
Video Demo: https://youtu.be/h6wJMuvgd4M

Objective

Current Situation:

  • Microscopic examination of lymph nodes is crucial in breast cancer staging
  • Currently, the manual process requires highly skilled pathologists
  • The process is fairly time-consuming and error-prone, particularly for lymph nodes with either no or small tumors

So I want to follow the strategy in the paper and utilize the deep learning models & techniques to relieve the workload of physicians by creating a workflow to detect and locate tumor pixels in the images and offer automatic second opinions.

Dataset

  • Raw Data: 21 Gigapixel Pathology Images, each has a tumor slide and a corresponding mask from The CAMELYON16 challenge.
  • Trainset: 8000 (4000 each at two different zoom levels) image patches sampled from 16 Gigapixel Pathology Images.
  • Validation set: 1600 (800 each at two different zoom levels) image patches sampled from 2 Gigapixel Pathology Images.
  • Test set: 3 Gigapixel Pathology Images.

Image augmentation

Adopt similar augmentation strategies discussed in the paper.

  • Use Keras ImageDataGenerator to augment data

    • Horizontal_flip
    • Vertical_flip
    • Rescale
    • Width_shift
    • Height_shift
    • Rotation
  • Use TensorFlow image random function to augment data

    • Random brightness
    • Random saturation
    • Random hue
    • Random contrast

Model

  • Transfer Learning: Used two pre-trained inception v3 models on Imagenet to speed up the training process.
  • Global Pooling: Applied GlobalAveragePooling layer after the inception model to significantly reduce parameters.

img

Result

  • The result on slide 075

img

  • THe result on slide 091

img

  • The result on slide 096

img

Final Deliverable

detecting-cancer-on-gigapixel-images's People

Contributors

yc-coder-chen avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.