Comments (2)
Hi @qkqk-hub
You can absolutely do that.
- Filter out very short sequences (e.g. less than 1.5 kb), since the classification of those is not super reliable and they won't bin well anyway.
- Classify the whole metagenome, so you will have all viruses, including the non-binned. This is important because there might be complete viral genomes in a single scaffold (you won't need bins for those).
- Identify the bins with viral contigs and take an averaged mean of the virus score (averaged by the contig length) for each bin. This averaged mean will prevent you from flagging a bacterial bin with a single phage contig as a phage bin.
In the future I might implement native support for bins. In the meantime, the steps above should give you some good results.
I have a lot of memory and cpu, can I speed things up?
geNomad should be able to leverage your hardware with default parameters :)
from genomad.
Thank you very much for your reply. I understand.
from genomad.
Related Issues (20)
- Error downloading database HOT 2
- Inquiry on virus from MAG HOT 4
- [feature request] query database clustering HOT 1
- Whether measures have been taken by genomad to avoid identifying genomic islands as viruses? HOT 5
- AMR annotations on chromsome? HOT 1
- Errors when download and the same issue when running genomad -h HOT 3
- The virus identified by genomad weren't annotated as virus sequence by VIBRANT? HOT 3
- geNomad taxonomy about Baltimore classification HOT 1
- Error with geNomad v1.8.0, missing tensorflow.keras HOT 5
- mmseqs2 error HOT 3
- Different protein number from genomad and pyrodigal-gv HOT 2
- Small (reference) data for testing HOT 9
- Error while classifying sequences HOT 6
- Error mmseqs prefilter HOT 4
- genomad annotate fastq file is empty or contains multiple entries HOT 3
- plasmid classified as virus? HOT 7
- Optimization Request for Analyzing Large Number of MAGs with geNomad HOT 5
- Fewer viral contigs identified from genomad vs virsorter2 HOT 4
- The question about --disable-nn-classification HOT 1
- Provirus detection in genomad vs checkv HOT 6
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from genomad.