Coder Social home page Coder Social logo

n0rdlicht / rki-vaccination-scraper Goto Github PK

View Code? Open in Web Editor NEW
15.0 1.0 1.0 18.57 MB

A scraper to incrementally add published vaccination data by the RKI (Robert-Koch-Institut).

Home Page: https://api.vaccination-tracker.app

License: GNU Affero General Public License v3.0

Makefile 0.26% Python 89.07% JavaScript 2.25% Vue 8.42%
frictionlessdata covid19-data

rki-vaccination-scraper's People

Contributors

n0rdlicht avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar

Forkers

nikobergemann

rki-vaccination-scraper's Issues

Keys in use from / until

Publish a document showing when which key was added / deprecated and hence has data associated with it.

Migration from Goodtables to Frictionless Repository

Hi @n0rdlicht,

Goodtables.io is going to be deprecated in 2022, we, therefore, recommend migrating to the new Frictionless Repository (https://repository.frictionlessdata.io/) continuous data validation system provided by Frictionless Data. The core difference between the two projects is that Frictionless Repository doesn't rely on any hosted infrastructure except for Github Actions which makes this project more sustainable. Also, it uses a newer Frictionless Framework under the hood that brought many improvements over the old goodtables-py library in terms of validation quality and performance.

As usual, if you have any doubts or questions, please come and ask in our Discord chat or in the GitHub Discussion.

Backfill missing values

Some values are not yet in de-vaccinations due to missing keys. Backfilling will be done from git history.

Failing pipeline runner

Due to completly new structure of the underlying RKI excel file, the current pipeline is broken.

  • Now two relevant tabs instead of one, one for sums and one for different indications
  • New: Daily numbers for initial and booster vaccinations
  • New: Daily numbers for different vaccine types (currently BioNTech and Moderna)

Migrate to frictionless-py framework

To future proof and simplify the code should be migrated to frictionless-py Pipelines/Transformations as the API is already using it this would greatly simplify dependencies.

Numbers for 2021-04-26 seem to be wrong

e.g.
Germany,DE,nation,sum_initial_moderna,3597780,83166711,7.260192121821434,2021-04-26
while on 2021-04-25:
Germany,DE,nation,sum_initial_moderna,1069222,83166711,7.166621029416445,2021-04-25
was 2.5 million less...
rki excel reports 1099371 on 2021-04-26

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.