Coder Social home page Coder Social logo

jlsteenwyk / biokit Goto Github PK

View Code? Open in Web Editor NEW
21.0 3.0 10.0 41.69 MB

a versatile toolkit for processing and analyzing diverse types of sequence data

Home Page: https://jlsteenwyk.com/BioKIT/

License: MIT License

Makefile 0.25% Python 99.75%
bioinformatics comparative-genomics evolution evolutionary-biology evolutionary-genomics genomics python

biokit's Introduction

Logo

Docs · Report Bug · Request Feature

follow on Twitter

BioKIT is a UNIX shell toolkit for processing molecular sequence data.

If you found biokit useful, please cite the following: BioKIT: a versatile toolkit for processing and analyzing diverse types of sequence data. Genetics. doi: 10.1101/2021.10.02.462868.


This documentation covers downloading and installing BioKIT. Details about each function as well as tutorials for using BioKIT are available in the online documentation.


Installation

If you are having trouble installing BioKIT, please contact the lead developer, Jacob L. Steenwyk, via email or twitter to get help.

To install using pip, we strongly recommend building a virtual environment to avoid software dependency issues. To do so, execute the following commands:

# create virtual environment
python -m venv .venv
# activate virtual environment
source .venv/bin/activate
# install biokit
pip install jlsteenwyk-biokit

Note, the virtual environment must be activated to use biokit.

After using biokit, you may wish to deactivate your virtual environment and can do so using the following command:

# deactivate virtual environment
deactivate

Similarly, to install from source, we strongly recommend using a virtual environment. To do so, use the following commands:

# download
git clone https://github.com/JLSteenwyk/BioKIT.git
cd biokit/
# create virtual environment
python -m venv .venv
# activate virtual environment
source .venv/bin/activate
# install
make install

To deactivate your virtual environment, use the following command:

# deactivate virtual environment
deactivate

Note, the virtual environment must be activated to use biokit.


To install via anaconda, execute the following command:

conda install -c jlsteenwyk jlsteenwyk-biokit

Visit here for more information: https://anaconda.org/JLSteenwyk/jlsteenwyk-biokit


To test biokit installation, launch the help message

biokit -h

biokit's People

Contributors

jlsteenwyk avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

biokit's Issues

RSCU issue

Hi,

I'm noticing some inconsistencies with rscu results.

Here's an example sequence:

>seq
ATGCGTCTGTACTTGATTCTGTCAGCATCGATGGTCATCAGCTATTTTGTGACCTGTAACGCGGCTACGAGTTCCGATCA
GACCAAGCTCTCGAGAATGGAGCCATCACTCCTCGTCCACTCACTCGGTGCAGCACTGAACGTCGATGCCGTCGACAGAA
GATTTCTTCGAGAGCGAAAAGAAAGTATGGAAGAGCGAGGATATCATCTGACGCTAAAAGGGGAAGTAAAAGCATTGGCC
AAGAAAATCATTGCAGATTTCGACACGGCCGACGACGTGTACAAGAAGTGGAATGAGAACGGCCACTCGTTGAACAAAAT
CGCGAACTTGCTGAAAGTGTCGGAAAAGGAAAAATACGTACCGGTGTACAACGGCTATCTGGCCTACCTGAACAGGATAA
ATAGCTGA

And the biokit rscu output:

CGA	4.5
AGA	4.5
ACG	4.0
UGA	3.0
ACC	2.6667
CUG	2.4706
UCG	2.0
CCA	2.0
CCG	2.0
CAG	2.0
...
...

As an example codon, I think the RSCU of CGA should be 2.25?
3/(8/6)

Looking through the code, it looks like the observed_sum value is wrong as it is reset to 0 when a KeyError exception is thrown:

for _, codons in codon_table.items():
observed_sum = 0
for codon in codons:
try:
observed_sum += codon_counts[codon]
except KeyError:
observed_sum = 0

If I replace this with a pass statement, I get the expected result 2.25.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.