Comments (7)
I've poked around a bit more, and it only happens when using `-Y` for target/decoy competition (I'm searching against a concatenated DB with multiple files, so I use `-Y` to avoid mixmax because of duplicate scan numbers), so I'm wondering if it is intentional. Overall, of a dataset of 22 PSMs, 13 were retained in my tests. Indeed, there are only 13 scan numbers in the input, but multiple sequences and 22 different psm_ids, which differ in their SIR (SpectrumIdentificationResult id?). When not using `-Y`, all of them are in the XML output.
I guess this kind of explains it: #152 (comment) , but I'm still wondering if one should take into account the entire PSM ID or not.
from percolator.
I think you can get the behavior you want by setting `--search-input concatenated`. You don't need the `-Y` flag in that case (it will just be ignored).
Aha, yes, that indeed works! It is a bit quieter in the stderr about the FDR method (it says "separate searches input detected, but overridden by -I flag..."). I don't know what the difference is between `-Y` and `-I`, though. Does it still use the relevant FDR method for concatenated searches?
Yes, `--search-input concatenated` uses the FDR method for concatenated searches. We specifically introduced this flag to deal with non-standard scenarios such as yours.
Too many options not to get confused :)
So, if I have understood correctly, I used to run `-Y` on my concatenated search, which avoids mixmax, but it is actually meant for when one runs separate searches and does the competition step inside percolator? So when I instead run `-I concatenated`, it will use the same FDR method (non-mixmax) but not do actual competition based on PSM scan numbers.
And when one runs `msgf2pin` without `-m`, the behaviour of `-Y` and `-I concatenated` would be more or less identical, since it does not have multiple solutions for a given scan number.
Hope I have understood it now, thanks for the quick help! :)
Yes, that's correct :)
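To make the difference discussed above concrete, here is a minimal sketch of competition keyed on scan number only. It is not Percolator's actual code. It assumes, as the thread suggests, that with `-Y` only the best-scoring PSM per scan number survives, whereas with `-I concatenated` every PSM is kept. The records, the psm_id format, and the function name `compete_by_scan` are hypothetical and chosen purely for illustration.

```python
# Hypothetical PSM records: (psm_id, scan_number, score, is_decoy).
# The psm_id format is invented; real ids come from the converter output.
psms = [
    ("file1_SIR_1_scan_100", 100, 2.1, False),
    ("file2_SIR_7_scan_100", 100, 1.4, True),
    ("file1_SIR_2_scan_101", 101, 0.9, False),
    ("file2_SIR_8_scan_102", 102, 1.7, True),
    ("file1_SIR_3_scan_102", 102, 1.2, False),
]


def compete_by_scan(records):
    """Keep only the highest-scoring PSM for each scan number.

    Assumption: competition ignores the rest of the psm_id, so PSMs from
    different files that share a scan number compete with each other.
    """
    best = {}
    for rec in records:
        scan, score = rec[1], rec[2]
        if scan not in best or score > best[scan][2]:
            best[scan] = rec
    # Return winners ordered by scan number for readable output.
    return sorted(best.values(), key=lambda r: r[1])


retained = compete_by_scan(psms)
print(len(psms), "PSMs in,", len(retained), "retained after competition")
for psm_id, scan, score, is_decoy in retained:
    print(psm_id, scan, score, "decoy" if is_decoy else "target")
```

In this toy input, five PSMs share three scan numbers, so three survive. That mirrors the 22 PSMs and 13 scan numbers reported in the first comment, under the stated assumption.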
Great, thanks very much!