Coder Social home page Coder Social logo

rapid-cns2_nf's Introduction

Screenshot 2024-03-17 at 10 41 16 PM

Rapid-CNS2 workflow

Overview

The Rapid-CNS2 nextflow pipeline is a bioinformatics workflow designed for comprehensive analysis of genomic and epigenomic data generated using adaptive sampling based sequencing of central nervous system (CNS) tumours. It performs tasks such as basecalling, variant calling, methylation analysis, structural variant calling, copy number variation calling, and provides a comprehensive molecular diagnostic-ready report.

This pipeline is implemented using Nextflow, allowing for easy execution and scalability on various compute environments, including local machines, clusters, and cloud platforms.

Features

  • Modular architecture for easy customization and extension.
  • Supports both basecalling from raw ONT POD5s and analysis of pre-aligned BAM files.
  • Accelerated variant calling with Clara Parabricks supported Deepvariant and Sniffles2
  • Annotation and filtering of clinically relevant variants
  • Includes methylation analysis with Rapid-CNS2 classifier and MGMT promoter methylation status determination.
  • Automated report generation for summarizing analysis results.
  • Prepare input files for the MNP-Flex classifier (optional)

Requirements

  • Nextflow (version 3.0.0 or later)
  • Conda, Docker or Singularity (optional, for containerized execution of tools)
  • Required input data:
    • Raw ONT POD5 data (for basecalling) or pre-aligned BAM files
    • Reference genome file (hg38 required)

Usage

  1. Clone this repository:

    git clone https://github.com/areebapatel/Rapid-CNS2_nf.git
  2. Edit the nextflow.config file to configure pipeline parameters according to your requirements.

  3. Run the pipeline using Nextflow:

    nextflow run main.nf --input <input_directory> --id <sample_identifier> [--options]

    Replace <input_directory> with the path to the directory containing ONT POD5 data or pre-aligned BAM files, and <sample_identifier> with a unique identifier for the sample.

    Additional options can be specified to customize pipeline behavior. Use the --help option to view available options and their descriptions.

  4. Monitor pipeline progress and access results in the specified output directory.

Sequencing

This pipeline analyses CNS tumour data generated through Nanopore adaptive sampling using ReadFish or adaptive sampling on MinKNOW. It is compatible with data generated on MinION, GridION and PromethION

Parameters

Parameter Description Default Value
--input Path to the directory POD5 files for Dorado basecalling and minimap2 alignment or BAM file if available (Required)
--id Sample identifier (Required)
--ref Path to hg19 reference file null
--tmp_dir Directory to store temporary files. If it does not exist it will be created tempDir
--out_dir Directory path to store all the outputs output
--log_dir Directory to store log files logDir
--minimum_mgmt_cov Minimum coverage for MGMT promoter methylation analysis 5
--model_config Basecalling model to be used [email protected]
--port Port for basecall server 8887
--num_gpu Number of GPUs to use 3
--num_clients Number of clients par.num_gpu * 3
--help Show help message null
--test Run in test mode null
--reads Samtools addreplacerg -r option. Specify as -r "SM:GM24385" -r "ID:GM24385" null
--basecalling Enable basecalling from raw ONT POD5 data. If provided, --input should point to the directory containing raw data. false
--mnp-flex Prepare input file for the MNP-Flex classifier. false

Acknowledgements

We are extremely grateful to all our lab members and collaborators for their support! Keeping up with AI to make our life easier and to compensate for our (Areeba's) art skills, our logo was generated by DALL-E.

Citation

If you use this pipeline, please cite our preprint:

Felix Sahm, Areeba Patel, Kirsten Göbel et al. Versatile, accessible cross-platform molecular profiling of central nervous system tumors: web-based, prospective multi-center validation, 10 April 2024, PREPRINT (Version 1) available at Research Square [https://doi.org/10.21203/rs.3.rs-4182910/v1]

Contributions

Contributions are welcome! If you encounter any issues, have suggestions for improvements, or would like to contribute new features, please open an issue or pull request on this repository.

License

This project is licensed under the MIT License.

rapid-cns2_nf's People

Watchers

 avatar

rapid-cns2_nf's Issues

SUPPORT FOR OTHER JOB SCHEDULERS

Hi @areebapatel
I just came across this wonderful tool you made and I am trying to deploy it. I was wondering if the support for job schedulers highlighted in the nextflow.config file (LSF and SLURM) are the only limited ones or any scheduler can be used. This is because my particular HPC uses PBS as a job scheduler. Please advice

George

MISSING ```.nf``` file report rendering and ```mnpFlex.nf```

Dear @areebapatel ,

As i was waiting for a response for the first issue, I tried deploying the pipeline in a local machine. I did come across alot of errors that are more of bugs concerning the variables in calling the processes in main.nf file vs how they are actually named in the respective ./nextflow/*.nf files. Kindly look into it to make the necessary updates.
Additionally, there is a problem in the basecalling.nf, the runtime: keyword in

runtime:
    docker pod5Docker
    cpus runtime_attributes.nThreads
    memory "${runtime_attributes.gbRAM} GB"
    time 10.hours

keeps being flagged as invalid, so i replaced : with {} brackets as shown:

runtime {
    docker pod5Docker
    cpus runtime_attributes.nThreads
    memory "${runtime_attributes.gbRAM} GB"
    time 10.hours
    }

Subsequently, the whole reportRendering.nf file in include { reportRendering } from './nextflow/reportRendering.nf' is missing from the .nextflow directory.From there, I could not proceed anymore. I request the addition of the reportRendering.nf file so I can proceed.

Thanks,
George

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.