Comments (3)
Alright, closing the issue then!
from metatag.
Hi there,
According to hmmsearch's error messages, it seems like you could change hmmsearch's parameter --cut_nc by --cut_ga (or drop it altogether if that doesn't work). These parameters control the model-specific score thresholding, according to hmmsearch documentation (pag. 76).
To modify the setting, go to makedatabase.py
and change line 86, additional_args='--cut_nc' by additional_args='--cut_ga' or additional_args=None:
The previous are additional arguments to hmmsearch, so any valid argument to hmmsearch can be added this way.
If that doesn't work we will consider other options :)
from metatag.
It worked well, although the problem was that you have to write the value twice:
TC 207.409 207.409
I added at the end the value from the COG webpage as the TC value. According to the HMM documentation (page 76.), the "TC thresholds are generally considered to be the score of the lowest-scoring known true positive that is above all known false positives." This seems to "correspond" to the definition of the value COG webpage, the Threshold Bit Score defined as:
"the domain specific threshold score (shown as a bit score) that an RPS-BLAST hit must meet or exceed in order to be considered a specific hit, which represents a high confidence association between a protein query sequence and a conserved domain and therefore a high confidence level for the inferred function of the protein query sequence. The threshold is equal to the weakest E-value (and highest bit score) among self-hits of a domains member protein sequences to the resulting domain model (illustrated example). Domain-specific threshold scores are calculated only for NCBI-curated domains."
The code in makedatabase.py
was modified accordingly:
print('* Making peptide-specific reference database...')
with TemporaryFilePath() as tempfasta, TemporaryFilePath() as tempfasta2:
filterFASTAByHMM(
hmm_model=args.hmm,
input_fasta=args.data,
output_fasta=tempfasta,
hmmer_output=hmmer_output,
additional_args=f'--cut_tc'
)
from metatag.
Related Issues (20)
- Build Docs
- Use logger instead of print
- Fix dependency versions in conda environment
- Update Docs
- Update docstrings
- Integrate JPLACEparser
- Add usage docs and examples
- new error using papara HOT 7
- problem to print help of makedatabase.py HOT 3
- Modify automated sequence data type checking HOT 1
- error when relabelling the newick tree with the placements HOT 1
- Integrate Pynteny in makedatabase.py
- Refactor code prior to repo publication
- Add pynteny to makedatabase_struc
- Install phyloplacement with poetry? HOT 1
- Re-structure project and rename tool
- User logger instead of print throughout code base
- Handle log / terminal messages
- Compose minimal pipeline
- Use pathlib instead of os
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from metatag.