seasonal-flu's Introduction

nextstrain.org/flu

This is the Nextstrain build for seasonal influenza viruses, available online at nextstrain.org/flu.

The build encompasses fetching data, preparing it for analysis, doing quality control, performing analyses, and saving the results in a format suitable for visualization (with auspice). This involves running components of Nextstrain such as fauna and augur.

All influenza virus specific steps and functionality for the Nextstrain pipeline should be housed in this repository.

This build is more complicated than other standard Nextstrain builds because all four currently circulating seasonal influenza lineages (A/H3N2, A/H1N1pdm, B/Vic and B/Yam) are analyzed using the same Snakefile with appropriate wildcards. In addition, we run analyses of both the HA and NA segments of the influenza virus genome and analyze datasets that span different time intervals (e.g., 2, 3, or 6 years).

Furthermore, the Nextstrain analysis of influenza virus evolution uses antigenic and serological data from different WHO collaborating centers. These antigenic data come in four flavors, depending on the assay used and the passage history of the antigens. The influenza virus output files have the wildcard set

{center}_{lineage}_{segment}_{resolution}_{passage}_{assay}

that currently uses the following values:

  • center: [who, cdc, crick, niid, vidrl]
  • lineage: [h3n2, h1n1pdm, vic, yam]
  • segment: [ha, na]
  • resolution: [6m, 2y, 3y, 6y, 12y]
  • assay: [hi, fra]
  • passage: [cell, egg]

Intermediate files follow this wildcard ordering but may omit irrelevant wildcards, e.g., filtered_h3n2_ha.fasta.
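The naming scheme above can be illustrated with a short Python sketch. It is not code from this repository: the `WILDCARDS` dictionary simply restates the values listed above, and `build_names` is a hypothetical helper. It also assumes every combination is valid, which the actual builds may not require.

```python
from itertools import product

# Wildcard values as listed in this README.
WILDCARDS = {
    "center": ["who", "cdc", "crick", "niid", "vidrl"],
    "lineage": ["h3n2", "h1n1pdm", "vic", "yam"],
    "segment": ["ha", "na"],
    "resolution": ["6m", "2y", "3y", "6y", "12y"],
    "passage": ["cell", "egg"],
    "assay": ["hi", "fra"],
}

# Ordering of wildcards in output file names.
ORDER = ["center", "lineage", "segment", "resolution", "passage", "assay"]


def build_names(fields=ORDER, **fixed):
    """Return one underscore-joined name per wildcard combination.

    `fields` selects which wildcards appear (always emitted in ORDER);
    keyword arguments pin a wildcard to a single value.
    Hypothetical helper for illustration only.
    """
    selected = [f for f in ORDER if f in fields]
    choices = [[fixed[f]] if f in fixed else WILDCARDS[f] for f in selected]
    return ["_".join(combo) for combo in product(*choices)]


if __name__ == "__main__":
    # Intermediate files may use only a subset of the wildcards.
    for stem in build_names(fields=["lineage", "segment"], lineage="h3n2"):
        print(f"filtered_{stem}.fasta")
```

Restricting `fields` to `lineage` and `segment` reproduces stems like the `filtered_h3n2_ha.fasta` example, while calling `build_names()` with no arguments enumerates every full combination.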

To manage builds for both the general public and the different WHO collaborating centers, the workflow is split across several Snakefiles. Snakefile_base contains the rules for the core analysis, and it sits alongside:

  • Snakefile for the standard "live" build housed at nextstrain.org/flu
  • Snakefile_WHO for the WHO CC builds
  • Snakefile_report to generate figures and additional analysis for the biannual reports to the WHO

These three Snakefiles import the rules specified in Snakefile_base, define additional rules, and specify the build targets.

fauna / RethinkDB credentials

This build starts by pulling sequences from our live fauna database (a RethinkDB instance). This requires environment variables RETHINK_HOST and RETHINK_AUTH_KEY to be set.
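Before starting a build, it can help to confirm that both variables are present. The following is a minimal sketch, not part of the build itself; `missing_credentials` is a hypothetical helper, and it only checks that the variables are non-empty, not that the database accepts them.

```python
import os

# Environment variables this README says the fauna download needs.
REQUIRED_VARS = ("RETHINK_HOST", "RETHINK_AUTH_KEY")


def missing_credentials(env=None):
    """Return the names of required variables that are unset or empty.

    Hypothetical helper for illustration; `env` defaults to os.environ.
    """
    env = os.environ if env is None else env
    return [name for name in REQUIRED_VARS if not env.get(name)]


if __name__ == "__main__":
    missing = missing_credentials()
    if missing:
        print("Missing environment variables: " + ", ".join(missing))
    else:
        print("fauna credentials found.")
```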

If you don't have access to our database, you can run the build using the example data provided in this repository. Before running the build, copy the example sequences into the data/ directory like so:

mkdir -p data/
cp example_data/* data/

Then run the build via:

nextstrain build . targets/flu_seasonal_h3n2_ha_12y
