Coder Social home page Coder Social logo

dianaoaxaca / bradyrhizobium-analysis Goto Github PK

View Code? Open in Web Editor NEW
1.0 1.0 0.0 46 KB

In this repository you can find data, results and scripts used in the paper Diversity and specificity of rhizobia from nodules of tropical tree-legumes

License: GNU General Public License v3.0

Shell 100.00%

bradyrhizobium-analysis's Introduction

Diversity and specificity of rhizobia from nodules of tropical tree-legumes


In this repository, you will find the workflow of the analyzes made in the Diversity and specificity of rhizobia from nodules of tropical tree-legumes article.


Data

In this study we analized 9 bacterial strains, isolated from nodules of three different leguminous plants and different geographical locations:

Strain Source nodule plant isolation Geographical locations
CCGB01 Lysiloma sp. Cuernavaca, Morelos. México
CCGB20 Lysiloma sp. Cuernavaca, Morelos. México
CCGB12 Lysiloma sp. Cuernavaca, Morelos. México
CCGUVB4N Inga vera Xico, Veracruz. Mexico
CCGUVB14 Inga vera Xico, Veracruz. Mexico
CCGUVB23 Inga vera Xico, Veracruz. Mexico
CCGUVB1N3 Inga vera Xico, Veracruz. Mexico
B51278 Lysiloma divaricatum Edimburgo, Escocia
B51279 Lysiloma divaricatum Edimburgo, Escocia

Structure

|-- analysis
|-- data
|-- figures
|-- logs_results
|-- metadata
|-- src
|-- .gitignore
|-- LICENSE
|-- README.md

analysis/ Contains results of each analysis in full version.

data/ Directory with raw sequences and symbolic links of clean sequences sorted by long and short reads.

figures/ Directory with images of the results obtained.

logs_results/ This contains the reports of each analysis in order to summarize the results.

metadata/ Contains tables with particularities of each sample, tables with information about data downloaded and used for some analyzes and tables with some summary results.

src/ Contains all scripts to perform each analysis

.gitignore File with information not visible in the repository due to space issues, for example the data/ directory. This information will be available when the data is made public and we will put the access numbers, so that they are available.

LICENSE File with license specifications.

README File with workflow and repository details.

Workflow

00.Get sequences, and verify integrity

bash src/00.md5sum.sh

01. QC preprocesing

In this step we checked the quality of the short reads with FastQC. We also eliminated adapters and poor quality reads with TrimGalore v0.6.7.

bash src/01.qc_preprocessing.sh

Tablas de lecturas antes y despues de la limpieza

02. Assemblies

The sequenced genomes were assembled in two ways:

  • Hybrid assembly: When you had long reads (MinION) and short reads (Illumina). This was done with Unicycler v0.4.8
  • Simple assembly: When only short reads (Illumina) were obtained. This was done with SPAdes v3.13.1
#Hybrid assembly
bash src/02.hybrid_assembly.sh
#Simple assembly
bash src/02.spades_assembly.sh

The hybrid assembly of 23-inga strain showed plasmid putative. So, we verified it all strains with RFPlasmid v0.0.18.

bash src/02.find_plasmids.sh
  • Plasmid assembly: 23-inga and 1N3-inga strains were plasmid positives. We assembled the plasmids from these two strains with plasmidSPAdes v3.13.1and include one of the strains that does not have a plasmid, as a negative control.
bash src/02.plasmid_spades_assembly.sh
  • Separate plasmid from chromosome: Map plasmid assembly to reads of 23-inga and 1N3-inga strains with bowtie2, minimap2, samtools and bedtools software; to separate plasmid of chromosome.
bash src/02.plasmid_separate.sh
  • Chromosome assembly again: After clean plasmid, assembly chromosome.
bash src/02.assembly_chr_fltr.sh

03. Functional annotation

The prediction and annotation of genes was done using the online server RAST v2.0 with RAStk toolkit, a complementary annotation was made on the web server EggNogg-Mapper v2.17 and Prokka v1.14.6

The code lines generated by each web service are displayed

cat src/03.annotation_server.sh
bash src/03.annot_prokka.sh

bradyrhizobium-analysis's People

Contributors

dianaoaxaca avatar

Stargazers

Juan Alfredo Hernández-García avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.