Coder Social home page Coder Social logo

quantgen's People

Contributors

nrode avatar timflutre avatar timflutre-perso avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

Forkers

nrode laugay asboldt

quantgen's Issues

handle day when showing elasped time

In "utils.cpp", the function elapsedTime() doesn't return the number of days if the program ran for more than one day, and instead the number of hours starts again from 0.

compute permutation P-values as (k+1)/(P+1)

In "get_summary_stats.cpp", no permutation P-value should be equal to 0. That's why Brian Browning explains in his paper describing PRESTO (BMC Bioinformatics 2008): "If k out of P permutations have a maximum test statistic greater than the maximum test statistic for the original data, the multiple-testing adjusted P-value for the experiment is (k+1)/(P+1)".

verbosity level test_demultiplex.py

verbosity level for demultiplex.py = verbosity level of test_demultiplex.py -1
verbosity argument could be an indicator variable (0 or 1)

use Fisher transformation of Spearman coef

In "get_summary_stats.cpp", use the Fisher transformation of the Spearman rank correlation coefficient (see here). Compare to the approximation leading to a Student distribution, with the Fisher transformation I can get a Z score, as used by Fu et al.

in gbs.py, step 1, one should give 1 lane ID per (pair of) fastq file(s)

If there is one (pair of) fastq file(s) per lane, then one simply indicates the lane ID, say "4", in the "lane" column of the sample file.

If there is more than one (pair of) fastq file(s) per lane, say 3, e.g. if a double barcoding was used, then one can concatenate the lane ID "4", with other numbers, say "41", "42" and "43", in the "lane" column of the sample file.

This could be mentioned in the help message.

compute summary stats by reading SNPs one by one

In "get_summary_stats.cpp", allow to skip parameter "--links" usually used to precise which SNPs are in cis of which genes. To do so efficiently, compute summary stats for each SNP by reading the genotype file line by line. This will therefore allow to easily compute summary stats for all SNPs, whether they are in cis or trans of any gene.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.