
msk-access / genotype_variants


Project to genotype SNV, INDEL and SV.

Home Page: https://genotype-variants.rtfd.io

License: Other

Languages: Python 94.14%, Makefile 3.06%, Dockerfile 2.80%

Topics: genotyping, msk-access

genotype_variants's Introduction

genotype_variants


Project to genotype SNV, INDELS and SV.

Features

Currently this module supports only genotyping and merging small variants (SNVs and INDELs).

This is provided by a command-line submodule called small_variants, which has the following sub-commands:

  • generate: Runs GetBaseCountMultiSample version 1.2.5 on the given BAM files.
  • merge: Merges MAF-format files with respect to the counts produced by the generate command.
  • all: Runs both of the sub-commands above, generate and merge, together.
  • multiple-samples: Runs the all sub-command for multiple samples listed in the provided metadata file.

Please read the USAGE section (https://genotype-variants.readthedocs.io/en/latest/usage.html) of the documentation for more information.
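As a rough sketch of how the four sub-commands are addressed, the snippet below builds the argument list for each one and asks it for its help text. The `--help` flag is an assumption (it is the usual convention for command-line tools, but it is not confirmed by this README), and `help_command` is a hypothetical helper, not part of the package. The USAGE page remains the authority on the real options.

```python
import shutil
import subprocess

# Sub-command names as listed in the Features section.
SUBCOMMANDS = ("generate", "merge", "all", "multiple-samples")


def help_command(subcommand):
    """Build the argument list that asks one sub-command for its help text.

    Hypothetical helper; the --help flag is assumed, not taken from the README.
    """
    if subcommand not in SUBCOMMANDS:
        raise ValueError(f"unknown sub-command: {subcommand}")
    return ["genotype_variants", "small_variants", subcommand, "--help"]


if __name__ == "__main__":
    for name in SUBCOMMANDS:
        cmd = help_command(name)
        if shutil.which(cmd[0]) is None:
            # The package is not installed here, so only show the command.
            print("not installed, would run:", " ".join(cmd))
        else:
            subprocess.run(cmd, check=False)
```

Building the command as a list rather than a single string avoids shell quoting issues once real file paths are added.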

Requires GetBaseCountMultiSample v1.2.5 or above.

To Do

  • Tagging genotyped files for thresholds
  • Genotyping normal buffy coats
  • Genotyping structural variant calls

Credits

This package was created with Cookiecutter and the audreyr/cookiecutter-pypackage project template.

genotype_variants's People

Contributors

buehlere, ionox0, murphycj2, ndbrown6, rhshah


genotype_variants's Issues

create_all_maf_dataframe.py

  • genotype_variants version: current
  • Python version: 3.7.4 installed via Miniconda
  • Operating System: CentOS-7 running on Juno

Description

I ran genotype_variants small_variants all. The underlying file create_all_maf_dataframe.py contains errors that prevent Python from combining the MAF files produced by the previous steps.

What I Did

1/ Line #111
df_ds is the combined duplex-simplex data frame. The operation on this line stores its result in df_s and subtracts a df_ds column from another field in df_s. However, df_ds does not contain a column named t_total_count_forward_standard, since only df_s was renamed at lines #73-81.

2/ Lines #133 and #147: df_d is undefined.
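The first point can be reproduced in miniature. This is only a sketch, assuming the data frames are pandas DataFrames: the two tiny frames, the pre-rename column name t_total_count_forward, and the helper `missing_columns` are all hypothetical, and the subtraction is a simplified stand-in for the real line #111. It shows that renaming a column on df_s alone leaves df_ds without that name, so indexing df_ds with it raises a KeyError.

```python
import pandas as pd

# Hypothetical miniature frames; real MAF tables have many more columns.
df_s = pd.DataFrame({"t_total_count_forward": [10, 20]})
df_ds = pd.DataFrame({"t_total_count_forward": [4, 5]})

# Only df_s receives the "_standard" suffix, mirroring the rename in the report.
df_s = df_s.rename(
    columns={"t_total_count_forward": "t_total_count_forward_standard"}
)


def missing_columns(frame, required):
    """Return the required column names that are absent from frame."""
    return [name for name in required if name not in frame.columns]


# Checking up front gives a clearer message than a bare KeyError later on.
missing = missing_columns(df_ds, ["t_total_count_forward_standard"])
print("missing from df_ds:", missing)

try:
    # Simplified stand-in for the failing subtraction.
    diff = (
        df_s["t_total_count_forward_standard"]
        - df_ds["t_total_count_forward_standard"]
    )
except KeyError as err:
    print("KeyError:", err)
```

A check like this before the arithmetic would point directly at the frame and column that need attention.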
