alnsl

A Nextflow pipeline for aligning short whole-genome sequencing (WGS) reads.

Preparing index sequences

  • Prepare the reference index for bwa-mem2:

    bwa-mem2 index ref.fa

  • Prepare the elprep files using the following commands:

    # reference file
    elprep fasta-to-elfasta ref.fa ref.fa.elfasta
    # known-variant files from the GATK4 bundle, used for base quality score recalibration
    elprep vcf-to-elsites <vcf-file> <elsites-file>
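
When several known-sites VCFs need converting, the second command can be generated once per file. The sketch below only prints the commands so they can be reviewed first. `elsites_commands` is a hypothetical helper and the VCF names are placeholders, not files shipped with the pipeline. It assumes uncompressed `.vcf` inputs and paths without spaces.

```shell
# Sketch: emit one elprep conversion command per known-sites VCF.
# Only the elprep subcommand comes from the steps above; file names are placeholders.
elsites_commands() {
    for vcf in "$@"; do
        # Derive the output name by swapping the .vcf suffix for .elsites
        printf 'elprep vcf-to-elsites %s %s\n' "$vcf" "${vcf%.vcf}.elsites"
    done
}

# Print the commands for two hypothetical VCFs:
elsites_commands dbsnp138.vcf mills_indels.vcf
```

Once the printed commands look right, they can be run by piping the output to `sh`.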
    

Conda Environment

Build a micromamba environment with the required software:

micromamba create -f conda.yml 

Param files

Here we provide an example for mapping short reads to hg38. The params file has the following content:

dbsnp: /mnt/beegfs/labs/DiGenomaLab/databases/references/human/GATK_Bundle/Homo_sapiens_assembly38.dbsnp138.elsites
dbindel: /mnt/beegfs/labs/DiGenomaLab/databases/references/human/GATK_Bundle/Mills_and_1000G_gold_standard.indels.hg38.elsites
ref: /mnt/beegfs/labs/DiGenomaLab/databases/references/human/bwa2/hs38DH.fa
elpre_ref: /mnt/beegfs/labs/DiGenomaLab/databases/references/human/hs38DH.fa.elfasta
alt_js: /mnt/beegfs/home/adigenova/micromamba/envs/aln/bin/bwa-postalt.js
bqsr: true

Save the above content in a file (e.g., aln-params.yml).
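
Because the params file mostly holds absolute paths, a quick existence check before launching can catch a mistyped path early. The following is a minimal sketch, assuming the flat `key: value` layout shown above. `check_params` is a hypothetical helper, not part of the pipeline, and it checks only the listed paths, not the bwa-mem2 index files next to the reference.

```shell
# Sketch: list params-file entries whose value is an absolute path that does not exist.
# check_params is a hypothetical helper, not part of the pipeline.
check_params() {
    params_file="$1"
    missing=0
    while IFS= read -r line || [ -n "$line" ]; do
        key="${line%%:*}"      # text before the first colon
        value="${line#*: }"    # text after the first ": "
        case "$value" in
            /*)
                if [ ! -e "$value" ]; then
                    echo "missing: $key -> $value"
                    missing=1
                fi
                ;;
        esac
    done < "$params_file"
    # Exit status is 0 when every path exists, 1 otherwise
    return "$missing"
}

# Example: check_params aln-params.yml
```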

Read file

Provide a CSV file with the following information:

sampleId,read1,read2
test2,./test_reads/test2.R1.fq.gz,./test_reads/test2.R2.fq.gz
test3,./test_reads/test3.R1.fq.gz,./test_reads/test3.R2.fq.gz
test,./test_reads/test.R1.fq.gz,./test_reads/test.R2.fq.gz

Currently, if a sample is split across several files, the reads must be merged before running the pipeline.

Save the above content in a file (e.g., reads.csv).
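
One way to do the merge for a sample split across lanes is to concatenate the gzipped files, since concatenated gzip streams form a valid gzip file. The sketch below assumes a lane naming scheme of `<sample>_L*_R1.fq.gz` and `<sample>_L*_R2.fq.gz`. That pattern, the `merge_sample` helper, and the directory names are illustrative assumptions, not something the pipeline requires.

```shell
# Sketch: merge per-lane FASTQ files for one sample and print its CSV row.
# The lane naming pattern (<sample>_L*_R1.fq.gz) is an assumption for illustration.
merge_sample() {
    sample="$1"; indir="$2"; outdir="$3"
    mkdir -p "$outdir" || return 1
    for mate in R1 R2; do
        # The glob expands in sorted order, so lanes are concatenated
        # in the same order for both mates and read pairs stay in sync.
        cat "$indir/${sample}"_L*_"${mate}".fq.gz > "$outdir/${sample}.${mate}.fq.gz" || return 1
    done
    echo "${sample},${outdir}/${sample}.R1.fq.gz,${outdir}/${sample}.R2.fq.gz"
}

# Example usage (hypothetical directories):
#   echo "sampleId,read1,read2" > reads.csv
#   merge_sample test ./raw_reads ./test_reads >> reads.csv
```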

Running the pipeline

nextflow run main.nf --csv reads.csv -profile uoh -params-file aln-params.yml

If the run fails, resume it with:

nextflow run main.nf --csv reads.csv -profile uoh -params-file aln-params.yml -resume

The pipeline writes its output to a directory called results.

Creating an aggregated report

To create an aggregated report across all samples, run MultiQC on the results directory:

  • Load the environment:

    micromamba activate aln

  • Run MultiQC:

    multiqc .
