Coder Social home page Coder Social logo

afc's Introduction

aFC

allelic Fold Change

Calculates allelic Fold Change (aFC) using standard input files for fastQTL.

Please see our preprint in bioRxiv for details and benchmarking of the method.

Method developed by Pejman Mohmammadi, software by Stephane E. Castel both in the Lappalainen Lab at the New York Genome Center and Columbia University Department of Systems Biology.

Runs on Python 2.7.x and has the following dependencies: pandas, statsmodels, scikits-bootstrap, NumPy.

#Usage Requires tabix indexed gzip compressed VCF file containing genotypes and BED file containing phenotypes, identical to the inputs of fastQTL, and a list of QTL to calculate aFC for. If provided, covariates will be regressed out of the phenotype values. Outputs the aFC and corresponding 95% confidence interval for each input QTL.

#Arguments ##Required

  • --vcf - Tabix indexed and gzipped VCF file containing sample genotypes. See fastQTL for format details.
  • --pheno - Tabix indexed and gzipped BED file containing sample phenotypes. See fastQTL for format details.
  • --qtl - File containing QTL to calculate allelic fold change for. Should contain tab separated columns 'pid' with phenotype (gene) IDs and 'sid' with SNP IDs. Optionally can include the columns 'sid_chr' and 'sid_pos', which will facilitate tabix retrieval of genotypes, greatly reducing runtime.
  • --geno - Which field in VCF to use as the genotype. By default 'GT' = genotype. Setting to 'DS' will use dosage rounded to the nearest integer (IE 1.75 = 2 = 1|1).
  • --chr - Limit to a specific chromosome.
  • --log_xform - The data has been log transformed (1/0). If so, please set --log_base.
  • --o - Output file.

##Optional

  • --cov () - Covariates file. See fastQTL for format details.
  • --matrix_o () - Output the raw data matrix used to calculate aFC for each QTL into the specific folder.
  • --boot (100) - Number of bootstraps to perform for effect size confidence interval. Can be set to 0 to skip confidence interval calculation, which will greatly reduce runtimes.
  • --ecap (log2(100)) - Absolute aFC cap in log2.
  • --log_base (2) - Base of log applied to data. If other than 2, data will be converted to log2.

#Output File

  • 1 - sid - Variant ID.
  • 2 - pid - Phenotype (gene) ID.
  • 3 - log2_aFC - allelic Fold Change in log2.
  • 4 - log2_aFC_lower - Lower estimate of 95% confidenace interval of log2(aFC).
  • 5 - log2_aFC_upper - Upper estimate of 95% confidenace interval of log2(aFC).

afc's People

Contributors

secastel avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.