Coder Social home page Coder Social logo

genid / tissueid Goto Github PK

View Code? Open in Web Editor NEW
0.0 2.0 0.0 8.45 MB

A novel taxonomy-independent deep learning microbiome approach allow for accuaret classification of human epithelial materials

License: GNU General Public License v3.0

Python 100.00%
microbiome 16s-rrna deep-neural-networks artificial-intelligence prediction classification

tissueid's Introduction

Novel taxonomy-independent deep learning microbiome approach allows for accurate classification of human epithelial materials

Celia Díez Lópeza, Athina Vidakia, Arwin Ralfa, Diego Montiel Gonzáleza, Djawad Radjabzadehb, Robert Kraaijb,c, André G. Uitterlindenb,c, Cordula Haasd, Oscar Laoe,f, and Manfred Kaysera

  • a Department of Genetic Identification, Erasmus MC University Medical Center Rotterdam, Rotterdam, the Netherlands
  • b Department of Internal Medicine, Erasmus MC University Medical Center Rotterdam, Rotterdam, the Netherlands
  • c Department of Epidemiology, Erasmus MC University Medical Center Rotterdam, Rotterdam, the Netherlands
  • d Zurich Institute of Forensic Medicine, University of Zurich, Zurich, Switzerland
  • e CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
  • f Universitat Pompeu Fabra (UPF), Barcelona, Spain

Installation requirements

Operating system: Linux only. Tested on Ubuntu 16.04LTS, but should also work on newer version of Ubuntu. It should be easy to made it work on other Linux distributions. 

Install the following dependencies
        
apt-get install bwa

SAMtools: We recommend the newests versions of SAMtools (e.g. > 1.4.1)

wget https://github.com/samtools/samtools/releases/download/1.4.1/samtools-1.4.1.tar.bz2 -O samtools.tar.bz2
tar -xjvf samtools.tar.bz2 
cd samtools-1.4.1/
./configure
make
make install

Required python 3 and Anaconda with following packages (skip if already installed)

conda install -c conda-forge pandas==0.23.4;
conda install -c conda-forge scikit-learn==0.20.0;
conda install -c conda-forge tensorflow==1.10.0;

Usage

python TissueID.py 

[-fasta sample.fasta] \         file or path directory with one or more samples

[-fastq Sample.fastq] \         file or path directory with one or more samples

-out output.tsv \               output file including probabilities in tsv format

-model Model/ \                 folder containing the training 50 training ENSEMBLE models

-pos pos_file.bed \             relevant positions based on E.coli K12

-ref ref/Ecoli_K12_ref.fasta \  reference Genome E.coli K12 

[-t 4] \                        Number of Cpus to use during alignment 

See complete manual at the website: https://www.erasmusmc.nl/genetic_identification/resources/

Comments and bug report

Please send an email at [email protected] for any comment and if there is a problem getting the software up and running.

Reference

C. Díez López et al., Novel taxonomy-independent deep learning microbiome approach allows for accuare classification of human epithelial materials (2019)

https://doi.org/10.1016/j.fsigen.2019.03.015

tissueid's People

Contributors

dmontielg avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.