Coder Social home page Coder Social logo

Comments (9)

gwincr11 avatar gwincr11 commented on August 14, 2024

When you say cache it locally, do you actually mean check it in with a pull request?

from covasim.

inc0 avatar inc0 commented on August 14, 2024

Or directly commit into repo, at least for now, via deploy key. @cliffckerr are you ok with this approach?

Every day (or hour) we could pull latest data from JHL, process it and cache it locally. We'd need to create deploy key and save it as action secret

from covasim.

cliffckerr avatar cliffckerr commented on August 14, 2024

@inc0 if we're committing data, this should probably not be in the covasim repo -- potentially synthpops but maybe best to have a separate module/repo for it?

from covasim.

inc0 avatar inc0 commented on August 14, 2024

or actually we can just have something like python download_data.py and entry to .gitignore?

from covasim.

ckerr-IDM avatar ckerr-IDM commented on August 14, 2024

works for me! we can have a gitignored data folder and a script to update it

from covasim.

gwincr11 avatar gwincr11 commented on August 14, 2024

I am working on a script to pull in the neherlab data as part of: #45

I ended up with a folder structure of:
covasim/datasets/sources/ Where the data will live
covasim/dataset/translators.py Which is holding the translators to get datasets to the format we want
covasim/datasets/update_sources.py Which pull in the source files when run.

I was also thinking about adding a reference file that could tell us which of the source files contain which countries data. This could be generated by update sources and dropped in the sources file.

I was speaking with Scott Ayers and he could also use the reference file for some front end work he is doing. So it serves a few purposes.

from covasim.

hamelsmu avatar hamelsmu commented on August 14, 2024

Some comments/questions

  • The John Hopkins data is only updated daily, no need to pull more often
  • Why do we need to check in a copy of the data into this repo? We can reference the URL of the data directly. Most data science libraries allow you to reference the URLs of CSV files as a filehandle

from covasim.

ckerr-IDM avatar ckerr-IDM commented on August 14, 2024

@hamelsmu yes, i think we're in agreement -- we'd have a script to manually update the data into a git-ignored folder

from covasim.

hamelsmu avatar hamelsmu commented on August 14, 2024

Ah yes, sorry I misread the thread -- my apologies

from covasim.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.