Coder Social home page Coder Social logo

dataliftoff / federated_learning_on_health_data Goto Github PK

View Code? Open in Web Editor NEW
1.0 1.0 0.0 344 KB

The smartphones of the people probably carry the most valueable but also private data. Since using data promisses to be one of the best ways to fight back against COVID-19, it is highly desirable to get access. By using a Federated Learning approach with PySyft it is possible to learn from the private data right on the smartphone, with the data never leaving the device.

Jupyter Notebook 100.00%
machine-learning deep-learning federated-learning wirvsvirus health-data distributed-learning coronavirus

federated_learning_on_health_data's Introduction

Federated Learning On Distributed Private Health Data On Smartphones

The smartphones of the people probably carry the most valueable but also private data. Since using data promisses to be one of the best ways to fight back against COVID-19, it is highly desirable to get access.

By using a Federated Learning approach with PySyft it is possible to learn from the private data right on the smartphone, with the data never leaving the device.

A short YouTube-Video has been created for this project.
There is also an Devpost-project.

Approach

  1. Since there is no private dataset with health data during a virus outbreak, a simulated dataset has been used to show the prove of concept.
  2. The dataset contains the health status of each person (e.g. temperature, movement, ... ) for several days during the virus outbreak.
  3. Using PySyft-Workers the data for each single person is distributed to a worker (virtual smartphone). Therefor each worker only knows its own health status.
  4. A simple feedforward network is send to each worker during the training process. The learning takes place directly on the virtual smartphone itself and an updated network is returned to the host. This way the data did never leave the smartphone and stays protected.
  5. The target variable to predict is the total number of infected people in this notebook.

Conclusion

It is possible to make us of the private health data of the people without lowering the protection of the data.
The notebook can be seen as prove of concept that learning on distributed individual health data can start a learning process in neural network.

Limitations (With Possible Solutions)

Limitation:
Simulated data without a connection to the real world has been used.
Solution:
Exchanging the dataset and adjusting the code should be pretty easy.

Limitation:
A trusted App with permission to store private health data on the device is needed on many smartphones.
Solution:
Probably another team created a similar App during the hackathon or there is an existing one already out there. Merging this approach with such an App is necessary.

Limitation:
The hardware limitations have been very strict for this notebook.
Solution:
Running the simulation and training process on a much larger scale should indicate if the approach is promising.

federated_learning_on_health_data's People

Contributors

dataliftoff avatar

Stargazers

 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.