astrobiomike / fegenie

This project forked from arkadiy-garber/fegenie

HMM-based identification and categorization of iron genes and iron gene operons in genomes and metagenomes

Python 92.39% Shell 1.70% R 5.37% Dockerfile 0.54%

FeGenie

Please see the Wiki page for introduction and tutorial on how to use this tool.

Special thanks to Michael Lee (https://github.com/AstrobioMike) for helping to put together the setup.sh script, which significantly eases installation.

Tutorial (Binder)

FeGenie introductory slideshow:

Content | Video presentation

FeGenie video tutorial:

Content | Video presentation

To start the tutorial, hit the 'launch binder' button below, and follow the commands in 'Walkthrough'

Binder (Initially forked from here. Thank you to the awesome binder team!)

Walkthrough

Enter the main FeGenie directory

cd FeGenie

Print the FeGenie help menu

FeGenie.py -h

Run FeGenie on the test dataset

FeGenie.py -bin_dir genomes/ -bin_ext fna -out fegenie_out

Go into the output directory and check out the output files

cd fegenie_out
less FeGenie-geneSummary-clusters.csv
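
As an alternative to paging through the summary with less, the first few rows can be read programmatically. The sketch below is a standalone helper, not part of FeGenie; the function name preview_csv is hypothetical, and because the column layout of FeGenie-geneSummary-clusters.csv is not described here, it returns raw rows without interpreting them.

```python
import csv
from itertools import islice


def preview_csv(path, n_rows=5):
    """Return the first n_rows rows of a CSV file as lists of strings.

    No assumption is made about headers or column meaning; rows are
    returned exactly as the csv module parses them.
    """
    with open(path, newline="") as handle:
        return list(islice(csv.reader(handle), n_rows))


if __name__ == "__main__":
    # Assumed to be run from inside the output directory created above.
    for row in preview_csv("FeGenie-geneSummary-clusters.csv"):
        print(row)
```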

Run FeGenie on gene calls

FeGenie.py -bin_dir ORFs/ -bin_ext faa -out fegenie_out --orfs

Run FeGenie on gene calls, using a reference database (RefSeq sub-sample) for cross-validation

FeGenie.py -bin_dir ORFs/ -bin_ext faa -out fegenie_out --orfs -ref refseq_db/refseq_nr.sample.faa

Easy Installation (if you have Conda installed)

git clone https://github.com/Arkadiy-Garber/FeGenie.git
cd FeGenie
bash setup.sh
conda activate fegenie
FeGenie.py -h

Easy Installation (if you don't have Conda)

git clone https://github.com/Arkadiy-Garber/FeGenie.git
cd FeGenie
bash setup_noconda.sh
./FeGenie.py -h

Quick-start

FeGenie.py -bin_dir /directory/of/bins/ -bin_ext fasta -t 16

The -bin_ext argument must match the filename extension of the FASTA files in the selected directory that you want analyzed (e.g. fa, fasta, fna).
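
Before launching a run, it can help to confirm which files in the directory actually carry the extension you plan to pass to -bin_ext. The sketch below is a standalone helper, not part of FeGenie; the function name files_with_extension is hypothetical, and it assumes matching is done on the final filename suffix, which may differ from FeGenie's own logic.

```python
from pathlib import Path


def files_with_extension(bin_dir, bin_ext):
    """Return sorted names of files in bin_dir whose suffix equals bin_ext.

    Assumption: the extension is compared against the final suffix only
    (e.g. "fna" matches "genome1.fna"); FeGenie's own matching may differ.
    """
    wanted = "." + bin_ext.lstrip(".")
    return sorted(
        p.name
        for p in Path(bin_dir).iterdir()
        if p.is_file() and p.suffix == wanted
    )
```

For example, files_with_extension("/directory/of/bins/", "fasta") lists the files that a -bin_ext fasta run would be expected to pick up under this assumption. An empty list suggests the extension is misspelled or the wrong directory was given.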

Quick-start (if you installed using the 'setup_noconda.sh' script)

./FeGenie.py -hmm_lib hmms/iron -bin_dir /directory/of/bins/ -bin_ext fasta -t 16 -out output_fegenie

The hmms/iron directory can be found within FeGenie's main repository. -t 16 means that 16 threads will be used for HMMER and BLAST. If you have fewer than 16 available on your system, set this number lower (default = 1).
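
One way to choose a safe value for -t is to cap the requested thread count at the number of CPUs the system reports. This is a minimal sketch, not FeGenie code; the function name pick_threads is hypothetical.

```python
import os


def pick_threads(requested, available=None):
    """Cap the requested thread count at the number of available CPUs.

    Always returns at least 1, which matches the documented default for -t.
    """
    if available is None:
        # os.cpu_count() may return None when the count cannot be determined.
        available = os.cpu_count() or 1
    return max(1, min(requested, available))


if __name__ == "__main__":
    # Prints a value that could be passed to -t on this machine.
    print(pick_threads(16))
```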

Running with docker

To run FeGenie with Docker, the only dependency you need installed is Docker itself (installation guide).

With docker installed you can run FeGenie in the following way:

docker run -it -v $(pwd):/data note/fegenie-deps ./FeGenie.py -bin_dir test_dataset -bin_ext txt -out fegenie_out -hmm_lib hmms/iron -t $(nproc)

Everything from ./FeGenie.py onward follows the normal, non-dockerized argument syntax.

Beware that you need to mount the directories containing the files FeGenie is supposed to read. If you are not familiar with Docker, run the docker run command from the directory into which you cloned the FeGenie repository. If all the files you pass to FeGenie are inside this directory and you use relative file paths (e.g. hmms/iron), everything will work just fine.
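
The mounting rule above can be checked before Docker is invoked. The sketch below assembles the same docker run command shown earlier as an argument list and refuses input paths that are absolute or fall outside the mounted directory. It is an illustration only: the function name docker_fegenie_command is hypothetical, and the image name, flags, and /data mount point are copied from the command above rather than verified independently.

```python
from pathlib import Path


def docker_fegenie_command(bin_dir, bin_ext, out_dir, repo_dir=".", threads=1):
    """Build a docker run argument list for FeGenie.

    Raises ValueError if bin_dir is absolute or resolves outside repo_dir,
    because such a path would not be visible inside the container.
    """
    repo = Path(repo_dir).resolve()
    target = (repo / bin_dir).resolve()
    inside_repo = repo in (target, *target.parents)
    if Path(bin_dir).is_absolute() or not inside_repo:
        raise ValueError(f"{bin_dir!r} must be a relative path inside {repo}")
    return [
        "docker", "run", "-it",
        "-v", f"{repo}:/data",          # mount the cloned repository
        "note/fegenie-deps",            # image name taken from the command above
        "./FeGenie.py",
        "-bin_dir", bin_dir,
        "-bin_ext", bin_ext,
        "-out", out_dir,
        "-hmm_lib", "hmms/iron",
        "-t", str(threads),
    ]
```

The returned list can be passed to subprocess.run from the repository directory. Building a list rather than a single string avoids shell quoting problems with paths that contain spaces.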

Upcoming Updates (we welcome more suggestions, which can be submitted as an Issue)

  1. Ability to accept previously-annotated genomes and gene-calls.
  2. Include Cytochrome 579 (and possibly rusticyanin).
  3. Improve delineation between MtrA and MtoA for better resolution with respect to identification of iron reduction and iron oxidation, respectively.
  4. Option to report absolute values for gene counts (rather than normalized gene counts)
  5. Include option to release all results (regardless of whether rules for reporting were met)
  6. Identification of iron-sulfur proteins.

Citing FeGenie:

Garber AI, Nealson KH, Okamoto A, McAllister SM, Chan CS, Barco RA and Merino N (2020) FeGenie: A Comprehensive Tool for the Identification of Iron Genes and Iron Gene Neighborhoods in Genome and Metagenome Assemblies. Front. Microbiol. 11:37. doi: 10.3389/fmicb.2020.00037

