Coder Social home page Coder Social logo

oops's Introduction

OOPS: Optimal Orthology for Phylogenomic Studies

Given a set of orthologous sequences, OOPS reduces the set to one sequence per taxon using one of the 26 supported criteria (see below) to pick the optimal sequences.

install

Copy oops.pl to a convient location in your PATH and ensure that a Perl version 5 interpreter is available (tested with 5.18).

  • MAFFT version 7 must be in your PATH if criteria E–Z are employed.
  • The script alignAA.pl must be in your PATH if criterion E is employed.
  • The script editAA.pl must be in your PATH if criteria G–H are employed.
  • The script pairsAA.pl must be in your PATH if criteria I–L are employed.
  • T-Coffee version 11 must be in your PATH if criterion M is employed.
  • TNT version 1.5 must be in your PATH if criteria N–V are employed.
  • RAxML version 8 or FastTree version 2 must be in your PATH if criteria W–Z are employed (the -s option uses RAxML in place of the default FastTree).

input files

The input FASTA format file should contain amino acid sequences from multiple species and sequence names should be in the form of ‘>genus_species#sequenceID’. The octothorpe (number, hash) symbol should not be used in the taxon name or the sequence identification number.

>Paradonea_presleyi#D123
ELVISISDEAD
>Preseucoela_imallshookupis#L456
ELVISLIVES

use

Options -i (input file) and -o (output directory) are required. The other options control the analyses parameters and optimality criteria used.

-c	Optimality criteria(on) for paralogs selection (default all):
		(A) mean sequence length
		(B) variance of sequence length
		(C) relative sequence length (s.d. above/below mean)
		(D) random
		(E) align score
		(F) alignment length
		(G) mean edit distance
		(H) variance of edit distance
		(I) sum of pairs
		(J) sum of pairs BLOSUM45
		(K) sum of pairs BLOSUM50
		(L) sum of pairs BLOSUM62
		(M) TCS (transitive consistency score)
		(N) maximum possible parsimony steps
		(O) minimum possible parsimony steps
		(P) parsimony information
		(Q) parsimony tree length
		(R) ensemble CI
		(S) ensemble RI
		(T) normalized sum of parsimony jackknife
		(U) mean parsimony jackknife
		(V) variance of parsimony jackknife
		(W) ML tree score
		(X) normalized sum of ML support
		(Y) mean ML support
		(Z) variance of ML support
-d	Do not keep output directory clean (default = keep clean).
-i	Input sequences in FASTA format (names should be '>genus_species#sequenceID').
-m	Use the LG or WAG model for FastTree calculations (default = lg; RAxML
	conducts model testing).
-n	Limit analysis to n combinations (default = all).
-o	Output directory.
-r	RAxML executable name (default = raxml).
-s	Use RAxML in place of the default FastTree (slower);
	bootstrap is used in place of FastTree’s SH–like local supports.
-t	Number of threads (default = 1) for MAFFT (and RAxML if -s is used).
-w	Report analysis times (default = do not).

citation

If you use this software, please cite: Eshel, G., M. Tessler, A. Narechania, K. Varala, C.W. Nelson, C. Zegar, S.–O. Kolokotronis, D.Wm. Stevenson, G. Coruzzi, R. DeSalle, and D.P. Little. Submitted. Correlation among algorithms as a powerful tool to optimize phylogenomic pipelines. Bioinformatics.

license

GPL

oops's People

Contributors

dpl10 avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.