The wevtyto from susanaguix

wevtyto's Introduction

Here we present a tool to analyse enterovirus sequencing data. It was desinged focusing on wastewater samples and Oxford Nanopore data.

Briefly, VSEARCH tool is used to filter and cluster the generated raw reads and then perform a BLAST search against a custom reference database. A final Excel file per sample is generated containing information about the Enterovirus types found as well as its abundance.

ev_typing_environment.yml creates a Conda environment that contains all the required packages to run the pipeline

ev_typing_nix.py is the script to analyse those samples that were amplified following the Nix et.al., 2006 protocol targeting all Enterovirus.

ev_typing_shaw.py is the script to analyse those samples that were amplified following the Shaw et.al., 2020 protocol targeting Cluster C Enterovirus.

ev_reference_sequences.fasta.gz contains a FASTA file with reference sequences from all Enterovirus types. There are 6303 references. If preferred, a custom reference FASTA file can be used always keeping the same header structure as the provided file.

Recommend Projects

susanaguix / wevtyto Goto Github PK

wevtyto's Introduction

wevtyto's People

Contributors

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent