OpenFDA BigData Pipeline


OpenFDA BigData Pipeline enables collection, processing, and real-time presentation of adverse drug event data from the openFDA database.

The solution uses Apache Kafka as the message broker, MongoDB as the document store, and Spring Boot for the services. All components are Dockerized.
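The collection step starts from the public openFDA drug adverse event endpoint. As a rough illustration of what a request for reports in a date range can look like, here is a minimal sketch. It is not the openfda-producer code: the function name `build_event_query` and its parameters are hypothetical, and only the endpoint URL and the `receivedate` search field come from the openFDA API itself.

```python
from urllib.parse import urlencode

# Public openFDA endpoint for drug adverse event reports.
OPENFDA_EVENT_URL = "https://api.fda.gov/drug/event.json"


def build_event_query(start: str, end: str, limit: int = 100, skip: int = 0) -> str:
    """Return a URL for adverse event reports received between two dates.

    Hypothetical helper for illustration only.
    Dates use the YYYYMMDD form expected by the receivedate field.
    """
    params = {
        "search": f"receivedate:[{start} TO {end}]",  # openFDA range syntax
        "limit": limit,  # number of reports per page
        "skip": skip,    # offset used for paging
    }
    # urlencode percent-encodes the brackets and colon and turns spaces into '+'.
    return f"{OPENFDA_EVENT_URL}?{urlencode(params)}"
```

A producer along these lines would fetch each page and publish the returned reports to a Kafka topic; how the actual service pages through results is not described here.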

Contents

This repository contains the code for the openFDA BigData Pipeline solution.

Architecture

[Figure: Pipeline Architecture]

Configuration

The project runs with the default configuration defined in each of the services and in pipeline.yml. For more details, refer directly to the configuration of each service and to pipeline.yml.

Running solution locally in Docker

If you want to try running the project yourself, I have put together a pipeline.yml configuration that can help you get started.

Running the following command

docker-compose -f pipeline.yml up

will:

  • Start the openfda-producer container
  • Start the zookeeper container
  • Start the kafka container
  • Start the mongodb container
  • Start the openfda-consumer container
  • Start the openfda-live-dashboard container, which exposes port 8050
  • Start the jupyter-notebook container, which exposes port 8888

Accessing the application

Once all the Docker containers are up and running, you can access the openfda-live-dashboard web dashboard in a browser at the following URL:

http://localhost:8050

In addition, you can access Jupyter Notebook (the jupyter-notebook container) in a browser at the following URL:

http://localhost:8888
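Containers can take a while to start, so it may help to check that both ports accept connections before opening the URLs. The following is a minimal sketch using only the Python standard library. The helper names `port_open` and `wait_for_ports` are hypothetical and not part of the repository. Only ports 8050 and 8888 come from the setup described above.

```python
import socket
import time


def port_open(host: str, port: int, timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to host:port succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable hosts.
        return False


def wait_for_ports(ports, host="localhost", attempts=30, delay=2.0):
    """Poll until every port accepts connections.

    Returns the list of ports that are still closed after the last attempt
    (an empty list means everything is reachable).
    """
    pending = list(ports)
    for _ in range(attempts):
        pending = [p for p in pending if not port_open(host, p)]
        if not pending:
            break
        time.sleep(delay)
    return pending
```

For example, `wait_for_ports([8050, 8888])` returns an empty list once both the dashboard and Jupyter Notebook are reachable. An open port only shows that something is listening, not that the application has finished initializing.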

Example graphs

Top 20 patient reactions reported between 2020-01-01 and 2022-01-01

[Figure: Top patient reactions]

Top 20 patient medical products reported between 2020-01-01 and 2022-01-01

[Figure: Top medical products]
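Both graphs are top-N counts over reports received in a date range. The sketch below shows one way such a ranking could be computed in plain Python. It is an assumption about the approach, not the dashboard's actual implementation, which may well aggregate in MongoDB instead. The field names (`receivedate`, `patient.reaction[].reactionmeddrapt`, `patient.drug[].medicinalproduct`) follow the openFDA adverse event schema. The helper `top_terms` and the sample records are made up for illustration.

```python
from collections import Counter


def top_terms(records, start, end, list_field, term_field, n=20):
    """Rank the values of term_field found under patient[list_field].

    Only reports whose receivedate (YYYYMMDD) lies within [start, end] count.
    YYYYMMDD strings sort chronologically, so plain string comparison works.
    """
    counts = Counter()
    for record in records:
        if not (start <= record.get("receivedate", "") <= end):
            continue
        for item in record.get("patient", {}).get(list_field, []):
            term = item.get(term_field)
            if term:
                counts[term] += 1
    return counts.most_common(n)


# Made-up records shaped like openFDA adverse event reports (not real data).
sample = [
    {"receivedate": "20200315",
     "patient": {"reaction": [{"reactionmeddrapt": "Nausea"},
                              {"reactionmeddrapt": "Headache"}],
                 "drug": [{"medicinalproduct": "PRODUCT A"}]}},
    {"receivedate": "20210702",
     "patient": {"reaction": [{"reactionmeddrapt": "Nausea"}],
                 "drug": [{"medicinalproduct": "PRODUCT A"},
                          {"medicinalproduct": "PRODUCT B"}]}},
    {"receivedate": "20190101",  # outside the date range, so ignored
     "patient": {"reaction": [{"reactionmeddrapt": "Rash"}],
                 "drug": [{"medicinalproduct": "PRODUCT C"}]}},
]

reactions = top_terms(sample, "20200101", "20220101", "reaction", "reactionmeddrapt")
products = top_terms(sample, "20200101", "20220101", "drug", "medicinalproduct")
```

Here `reactions` ranks "Nausea" first with two reports, and `products` ranks "PRODUCT A" first. The 2019 record is excluded by the date filter.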

Issues and contribution

Bug reports and pull requests are welcome on GitHub at https://github.com/koziolk/openfda-bigdata-pipeline
