Comments (24)
Hey all,
I have been working on https://coronadatascraper.com/ (https://github.com/lazd/coronadatascraper) in my own time, and I am also a Sanity user, both professionally and personally.
We've been building scrapers over there that use only official sources: no news, no aggregators, just governments directly (yes, this is a pain, since many governments like to publish free-text press releases, sometimes with useful numbers written out, like "thirty-five").
If there are any sources on there that aren't primary sources (government departments), please raise an issue on that GitHub repo and we'll work to sort it out.
There's a Slack for that project too, if anyone wants to jump on and chat with those of us working on it.
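For anyone curious about the written-out numbers mentioned above, here is a minimal sketch of how such text could be turned into integers. It assumes English text and values from 0 to 99 only; `words_to_int` is a hypothetical helper for illustration and is not taken from the coronadatascraper code.

```python
# Lookup tables for English number words (assumption: English only, 0-99).
UNITS = {
    word: value
    for value, word in enumerate(
        "zero one two three four five six seven eight nine ten eleven twelve "
        "thirteen fourteen fifteen sixteen seventeen eighteen nineteen".split()
    )
}
TENS = {
    word: 10 * value
    for value, word in enumerate(
        "twenty thirty forty fifty sixty seventy eighty ninety".split(), start=2
    )
}


def words_to_int(text):
    """Convert a number written in words, e.g. 'thirty-five', to an int."""
    parts = text.lower().replace("-", " ").split()
    if len(parts) == 1:
        word = parts[0]
        if word in UNITS:
            return UNITS[word]
        if word in TENS:
            return TENS[word]
    elif len(parts) == 2:
        tens, unit = parts
        # Only accept "<tens> <one..nine>", such as "thirty five".
        if tens in TENS and unit in UNITS and 1 <= UNITS[unit] <= 9:
            return TENS[tens] + UNITS[unit]
    raise ValueError(f"cannot parse number words: {text!r}")


print(words_to_int("thirty-five"))  # 35
```

Anything outside that narrow range raises `ValueError`, so a scraper using something like this would fail loudly rather than record a wrong count.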
from covid19_scenarios.
That's really cool @ManuelB. Thanks for sharing.
For Brazil, I saw that the data available at https://brasil.io/dataset/covid19/ have been used. Great!
However, some data is outdated. For example, today, the last record in "BRA-Distrito Federal.tsv" is for 2020-04-30.
Who is working with the Brazilian data? I'm willing to help if needed.
I would also like to take this opportunity to thank the project's team! We used Covid Scenarios in a publication [1] that had a significant local impact.
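A quick way to spot stale files like the one mentioned above is to read the last date out of each TSV. This is only a sketch: it assumes the file has a header row and an ISO-formatted date column, here called `time`, which is a guess at the column name. The two sample rows are made up, apart from the 2020-04-30 date quoted above.

```python
import csv
import io
from datetime import date


def latest_record_date(tsv_file, date_column="time"):
    """Return the most recent date found in a tab-separated file object."""
    reader = csv.DictReader(tsv_file, delimiter="\t")
    dates = [
        date.fromisoformat(row[date_column])
        for row in reader
        if row.get(date_column)
    ]
    if not dates:
        raise ValueError("no dated rows found")
    return max(dates)


# Made-up sample standing in for "BRA-Distrito Federal.tsv".
sample = io.StringIO("time\tcases\n2020-04-29\t10\n2020-04-30\t12\n")
print(latest_record_date(sample))  # 2020-04-30
```

Comparing the returned date with the current date would show how many days behind a given region is.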
One way might be to crowdsource the search for data.
There are many COVID-19 and SARS-CoV-2-related projects on the web. Some of them may contain data, APIs or just interesting ideas that can help us to make our application better.
Here are some examples:
- https://github.com/search?q=covid+OR+covid-19+OR+coronavirus+OR+ncov+OR+SARS-CoV-2
- https://github.com/soroushchehresa/awesome-coronavirus
- http://open-source-covid-19.weileizeng.com/
https://www.ecdc.europa.eu/en/publications-data/download-todays-data-geographic-distribution-covid-19-cases-worldwide seems like a good data source? Example data: https://www.ecdc.europa.eu/sites/default/files/documents/COVID-19-geographic-disbtribution-worldwide-2020-03-20.xlsx Data seems to be global and well structured. It only counts cases and deaths, though (no hospitalized, ICU, recovered)
Would this be enough? It's data that's refreshed daily at 9am EST: https://www.tableau.com/covid-19-coronavirus-data-resources
https://coronadatascraper.com/ there's a fair bit of data available there as well
For Spain, this is a good data source, containing national and regional cases, deaths, ICU and recovered, updated on a daily basis: https://github.com/datadista/datasets/tree/master/COVID%2019
I have a finished pull request for the ECDC dataset pending now, replacing the WHO data and parser.
> https://coronadatascraper.com/ there's a fair bit of data available there as well
There is an amazing amount of data on that API, but I guess it is not an official source. Should be easy to write a parser for, if required.
Good point!
I have checked a few of their scrapers; they all seem to be directed at government pages, e.g. https://www.health.gov.au/news/health-alerts/novel-coronavirus-2019-ncov-health-alert/coronavirus-covid-19-current-situation-and-case-numbers for Australia,
https://www.canada.ca/en/public-health/services/diseases/2019-novel-coronavirus-infection.html for Canada, and so on,
or at GitHub repos that are official sources, like https://github.com/opencovid19-fr/data for France.
If we were to go down this road, it shouldn't take too long to vet each source, I guess.
Spain's data: neherlab/covid19_scenarios_data#11
I have now written a first parser for coronadatascraper.com (in my forked repo). In the latest version, it should also contain correct entries for regions such as USA-OK-Love County. Everything is stored in a global .tsv (and JSON as well).
Re: source quality of coronadatascraper: Germany's numbers are pulled from the app of a tabloid... I don't think it will be possible to vet the sources of such an API, as they can change things as they see fit.
Edit: To be more precise, Germany's numbers are themselves aggregated from the official sources (RKI) and from newspapers, of which at least one is more or less a tabloid (Morgenpost).
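To illustrate the kind of output described above, here is a rough sketch of building a region key such as USA-OK-Love County and serialising the same records as both TSV and JSON. It is not the parser from the fork: the function names, the column names (`region`, `time`, `cases`) and the sample record with its placeholder values are all assumptions.

```python
import csv
import io
import json


def region_key(country, state=None, county=None):
    """Join the non-empty location parts with '-', e.g. 'USA-OK-Love County'."""
    return "-".join(part for part in (country, state, county) if part)


def to_tsv(records, columns):
    """Serialise a list of dicts as tab-separated text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=columns, delimiter="\t", lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()


# Placeholder record: the date and case count are not real data.
records = [
    {"region": region_key("USA", "OK", "Love County"), "time": "2020-01-01", "cases": 0}
]

print(to_tsv(records, ["region", "time", "cases"]))
print(json.dumps(records, indent=2))
```

Keeping one list of records and writing it out twice means the TSV and JSON outputs cannot drift apart.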
Hi, thank you all for the work.
Here is my small contribution: I added data for France (neherlab/covid19_scenarios_data#18).
Take care
I have collected a lot of data for Germany:
https://github.com/ManuelB/covid-19-vis/tree/gh-pages/germany
It is used to run a full simulation for 417 districts in Germany and runs on the command line.
Details of what I am doing are described here:
https://youtu.be/lwUDvNfVeEo
If the data were integrated into the data repository, the select box would show more than 400 items. I think that would be too many.
I can provide you with an API that gives COVID-19 data for all countries. It is also updated frequently.
fetch('https://corona.lmao.ninja/all')
Hope it will help you guys.
> For Brazil, I saw that the data available at https://brasil.io/dataset/covid19/ have been used. Great!
> However, some data is outdated. For example, today, the last record in "BRA-Distrito Federal.tsv" is for 2020-04-30.
> Who is working with the Brazilian data? I'm willing to help if needed.
Hi @pauloangelo, thanks for highlighting this. The data needs to be updated manually by the maintainers of this project, and that has just not been done in the last 3 days. I am sure they will do this soon!
Thank you, @noleti. I'm available to help, if needed. Thank you all for this remarkable initiative!
Hey @pauloangelo, sorry for the delay. We will update the data now and re-release soon!
Thank you, @nnoll!
If you compile population sizes for Brazilian regions and their hospital capacities, we can add them as presets.
Hi all,
The counts for "BRA-Distrito Federal" include cases from other regions that were detected in Distrito Federal. The Brasil.io dataset registers these external cases as "Importados/Indefinidos". I suggest counting only the local cases. For example, for 29-May-2020 there are 142 local deaths, while the TSV counts 154.
Best regards,
PA
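To make the suggestion concrete, here is a minimal sketch of excluding the "Importados/Indefinidos" rows before summing. The column names (`city`, `deaths`) and the row layout are assumptions about how the Brasil.io export might look, and the 12 imported deaths are simply inferred as 154 - 142 from the figures above.

```python
IMPORTED_LABEL = "Importados/Indefinidos"


def local_total(rows, value_column="deaths", place_column="city"):
    """Sum a count column over all rows that are not imported/undefined cases."""
    return sum(
        int(row[value_column])
        for row in rows
        if row[place_column] != IMPORTED_LABEL
    )


# Illustrative rows for 29-May-2020; the split is inferred, not copied from the dataset.
rows = [
    {"city": "Brasília", "deaths": "142"},
    {"city": IMPORTED_LABEL, "deaths": "12"},
]

print(local_total(rows))  # 142
print(sum(int(row["deaths"]) for row in rows))  # 154
```

The same filter would apply to the case counts by passing a different `value_column`.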
> If you compile population sizes for Brazilian regions and their hospital capacities, we can add them as presets.
Hi @rneher, I will have a look at it. For the hospital capacities, unfortunately, we don't have reliable data, because the government keeps changing this information. I believe we can provide the population sizes, R0, etc., at least for "BRA-Distrito Federal". Below is the link to the data that we have been using in our weekly report.
Weekly reports created by our observatory (the parameters are also motivated there):
https://www.prepidemia.org/boletins-quinzenais-prepidemia
@pauloangelo I created a separate issue for this, let's continue there
#718