Coder Social home page Coder Social logo

hzi-bifo / phylogeography_paper Goto Github PK

View Code? Open in Web Editor NEW
6.0 8.0 1.0 14.44 MB

This repository contains all data and code for the manuscript "Phylogeographic reconstruction using air transportation data and its application to the 2009 H1N1 influenza A pandemic".

License: Apache License 2.0

R 90.08% Shell 9.92%

phylogeography_paper's Introduction

Phylogeographic reconstruction using air transportation data

This repository contains all data and software for the manuscript "Phylogeographic reconstruction using air transportation data and its application to the 2009 H1N1 influenza A pandemic" published in PLOS Computational Biology.

Data

This folder contains data for all steps of the simulation and evaluation, including:

  • the gvd file specifying the model for the geographic simulation in GLEAMviz, as well as the output of one simulation used in this publication
  • the input and configuration for the simulation of the sampling, the tree and the sequences with FAVITES
  • the output for all 50 simulations, including trees, sequences and geographic locations for all nodes of the tree
  • the result of the parsimonious phylogeographic reconstruction for all 50 simulations
  • the xml file used for the analysis with BEAST stating all parameters, as well as the maximum clade credibility trees and trees with the most probable locations for each nodes that were used for the evaluation

For further details on all available files, please see the readme in the data directory.

Figures&Tables

This folder contains all figures and tables of the manuscript, as well as some additional tables stating all Fréchet tree distances.

Software

This folder contains all codes for:

  • the simulation pipeline that prepares the input for FAVITES based on the GLEAMviz output, calls FAVITES to simulate sampling, tree and sequences and postprocesses the output
  • the parsimonious phylogeographic reconstruction using the Sankoff algorithm
  • the summary of the maximum clade credibility tree created by BEAST

For details stating dependencies, necessary input files as well as instructions how to run the software, please see the readme for the simulation as well as the readme for the reconstruction.

The distance matrices used for the parsimonious reconstruction can be found in this data repository.

Software for the comparison of the reconstructed spread to the simulated one can be found in the FrechetTreeDistance repository.

phylogeography_paper's People

Contributors

susannereimering avatar

Stargazers

Felix S avatar Rongzhang avatar Biopig avatar Liuyong Ding avatar Liam avatar Huang avatar

Watchers

James Cloos avatar Alice McHardy avatar Andreas Bremges avatar Tiago Andrade Borges Santos avatar Gary Robertson avatar  avatar  avatar Liam avatar

Forkers

liamxg

phylogeography_paper's Issues

anything that I missed?

Dear @alicemchardy,

Please see below:

bash simulation_pipeline.sh ../../Data/Gleamviz/seedings.tsv ../../Data/Gleamviz/metadata_cities.tsv ../../FAVITES/run_favites_docker.py ../../Data/FAVITES/flu_spread_config.json simulation_output
Traceback (most recent call last):
File "/Software/Simulation/../../FAVITES/run_favites_docker.py", line 111, in
o = check_output(['docker','images']).decode().splitlines()
File "/Users/lixingguang/opt/miniconda3/lib/python3.9/subprocess.py", line 424, in check_output
return run(*popenargs, stdout=PIPE, timeout=timeout, check=True,
File "/Users/lixingguang/opt/miniconda3/lib/python3.9/subprocess.py", line 505, in run
with Popen(*popenargs, **kwargs) as process:
File "/Users/lixingguang/opt/miniconda3/lib/python3.9/subprocess.py", line 951, in init
self._execute_child(args, executable, preexec_fn, close_fds,
File "/Users/lixingguang/opt/miniconda3/lib/python3.9/subprocess.py", line 1821, in _execute_child
raise child_exception_type(errno_num, err_msg, err_filename)
FileNotFoundError: [Errno 2] No such file or directory: 'docker'
gunzip: can't stat: FAVITES_output/error_free_files/sequence_data.fasta.gz (FAVITES_output/error_free_files/sequence_data.fasta.gz.gz): No such file or directory
gunzip: can't stat: FAVITES_output/error_free_files/transmission_network.txt.gz (FAVITES_output/error_free_files/transmission_network.txt.gz.gz): No such file or directory
gunzip: can't stat: FAVITES_output/error_free_files/phylogenetic_trees/tree_0.tre.gz (FAVITES_output/error_free_files/phylogenetic_trees/tree_0.tre.gz.gz): No such file or directory
gunzip: can't stat: FAVITES_output/error_free_files/phylogenetic_trees/tree_0.time.tre.gz (FAVITES_output/error_free_files/phylogenetic_trees/tree_0.time.tre.gz.gz): No such file or directory
gunzip: can't stat: FAVITES_output/seed_sequences.tsv.gz (FAVITES_output/seed_sequences.tsv.gz.gz): No such file or directory
Error in file(file, "r") : cannot open the connection
Calls: read.tree -> scan -> file
In addition: Warning message:
In file(file, "r") :
cannot open file 'FAVITES_output/error_free_files/phylogenetic_trees/tree_0.tre': No such file or directory
Execution halted
cp: FAVITES_output/error_free_files/sequence_data.fasta: No such file or directory
head: FAVITES_output/seed_sequences.tsv: No such file or directory
head: FAVITES_output/seed_sequences.tsv: No such file or directory

ask for help

Rscript phylogeo_sankoff.R ../../Data/Simulations/Sim_1/simulated_seq_cds.labeled.phy ../../Data/Simulations/Sim_1/tipdata.txt /path/to/effective.distance.matrix.csv reconstruction_effective

Could you tell me where to find the effective.distance.matrix.csv file, thansk.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.