Coder Social home page Coder Social logo

danielvelaj / medclip Goto Github PK

View Code? Open in Web Editor NEW
0.0 1.0 0.0 73.29 MB

Medical image captioning and feature extraction. Deep learning pretreating task comparison.

License: MIT License

Python 0.50% Jupyter Notebook 99.49% Shell 0.01%
deep-learning medical-imaging captioning-images

medclip's Introduction

MedClip

This is the official repository for the MedClip research project from MindKind research group. Mindkind logo

The project investigates and compares different pretraining tasks for medical image feature extraction and captioning.

Workflow

The experiments are structured as follows:

download datasets ---> prepare data ---> build models ---> run experiments

Each stage is run by a python script of its own that allows to custumize options in every step.

Downloading datasets

To download datasets use the download_all.py script from the src directory. This folder will download and extract the zip files for each dataset and sort their contents into the data/raw directory.

python download_all.py

Preparing data

The downloaded data is used to produce clean dataframes aswell as model training material. Each downloaded dataset generates a dataframe as shown below

File Modality Anatomy Patient history Findings Impression Diagnosis
path to file imaging modality imaged anatomy clinicla history of patient in natural language findings in natural language diagnostic impression concise diagnosis

For some datasets there may be extra columns or missing columns but the names are consistent across all generated datasets. To prepare the data run the prepare_data.py script from the src directory:

python prepare_data.py

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

License

MIT

medclip's People

Contributors

danielvelaj avatar solangead avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.