Coder Social home page Coder Social logo

disty's Introduction

Disty McMatrixface

Compute a distance matrix from a core genome alignment file.

Prerequisities

  • GCC 4.8+ or equivalent
  • ZLib

Getting started

git clone https://github.com/c2-d2/disty
cd disty && make
./disty tests/test2.fa

Output:

	seq1	seq2	seq3
seq1	0	3	1
seq2	3	0	1
seq3	1	1	0

Installation

Using Bioconda:

conda config --add channels defaults
conda config --add channels conda-forge
conda config --add channels bioconda

conda install disty

Using brew:

brew install tseemann/bioinformatics-linux/disty

From Github:

git clone https://github.com/c2-d2/disty
cd disty && make && make install

Command line parameters

Usage:   disty <alignment.fa>

Options:
  -n  FLOAT  skip columns having frequency of N > FLOAT [1.00]
  -i  INT    input format [0]
                 0: ACGT
                 1: 01
  -s  INT    strategy to deal with N's [0]
                 0: ignore pairwisely
                 1: ignore pairwisely and normalize
                 2: ignore globally
                 3: replace by the major allele
                 4: replace by the closest individual (not implemented yet)
  -h         print help message and exit
  -v         print version and exit

Author

Karel Brinda <[email protected]>

disty's People

Contributors

karel-brinda avatar

Stargazers

Felipe Marques de Almeida avatar Tom Stanton avatar Alan Orth avatar  avatar Peter Menzel avatar Taj Azarian avatar Carl Mathias Kobel avatar Shane Hogle avatar Rodrigo O. Polo avatar Thanh Lee avatar  avatar Shaun Jackman avatar  avatar

Watchers

James Cloos avatar  avatar

disty's Issues

Core dump on mismatched lengths - handle better?

>seq1
AGTCAGTC
>seq2
AGGCAGTC
>seq3
AGTGAGTA
>seq4
TGTTAGA

disty test.aln
Loading sequences from test.aln
disty: src/main.cpp:251: void load_sequences(const string&, T&, T&) [with T = std::vector<std::basic_string<char> >; std::string = std::basic_string<char>]: Assertion `len==static_cast<int>(s.size())' failed.
Aborted (core dumped)

Name clash with similar EMBOSS tool!

I think you need to rename your repo?

% distmat --help

Create a distance matrix from a multiple sequence alignment
Version: EMBOSS:6.6.0.0

   Standard (Mandatory) qualifiers (* if not always prompted):
  [-sequence]          seqset     File containing a sequence alignment.
*  -nucmethod          menu       [0] Multiple substitution correction methods
                                  for nucleotides. (Values: 0 (Uncorrected);
                                  1 (Jukes-Cantor); 2 (Kimura); 3 (Tamura); 4
                                  (Tajima-Nei); 5 (Jin-Nei Gamma))
*  -protmethod         menu       [0] Multiple substitution correction methods
                                  for proteins. (Values: 0 (Uncorrected); 1
                                  (Jukes-Cantor); 2 (Kimura Protein))
  [-outfile]           outfile    [*.distmat] Phylip distance matrix output
                                  file

   Additional (Optional) qualifiers (* if not always prompted):
*  -ambiguous          boolean    [N] Option to use the ambiguous codes in the
                                  calculation of the Jukes-Cantor method or
                                  if the sequences are proteins.
*  -gapweight          float      [0.] Option to weight gaps in the
                                  uncorrected (nucleotide) and Jukes-Cantor
                                  distance methods. (Any numeric value)
*  -position           integer    [123] Choose base positions to analyse in
                                  each codon i.e. 123 (all bases), 12 (the
                                  first two bases), 1, 2, or 3 individual
                                  bases. (Any integer value)
*  -calculatea         boolean    [N] This will force the calculation of
                                  parameter 'a' in the Jin-Nei Gamma distance
                                  calculation, otherwise the default is 1.0
                                  (see -parametera option).
*  -parametera         float      [1.0] User defined parameter 'a' to be use
                                  in the Jin-Nei Gamma distance calculation.
                                  The suggested value to be used is 1.0 (Jin
                                  et al.) and this is the default. (Any
                                  numeric value)

   Advanced (Unprompted) qualifiers: (none)
   General qualifiers:
   -help               boolean    Report command line options and exit. More
                                  information on associated and general
                                  qualifiers can be found with -help -verbose

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.