Coder Social home page Coder Social logo

hgb-bin-proteomics / msannika Goto Github PK

View Code? Open in Web Editor NEW
4.0 4.0 0.0 120.12 MB

MS Annika is a crosslink search engine based on MS Amanda, aimed at identifying crosslinks of MS2-cleavable crosslinkers from MS2 and MS3 spectra.

Home Page: https://ms.imp.ac.at/?action=ms-annika

License: Other

cross-linking crosslinker crosslinking mass mass-spectrometry ms2 ms3 proteomics search-engine spectrometry tandem-ms xlms

msannika's People

Contributors

michabirklbauer avatar

Stargazers

 avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

msannika's Issues

[Version: Any] Specifying crosslinker as fixed/variable modification leads to abort of search

If the crosslinker is specified in the MS Annika Search Node as a fixed or variable modifications (in addition to specifying it as the crosslink modification in the MS Annika Detector Node), an error is thrown. This is intended behaviour because in (almost) all cases this is not what you want. The modifications happening to the two cross-linked peptides during cross-linking are already accounted for by the setting in the MS Annika Detector Node.

For the rare case that you actually additionally want to consider that these peptides might carry an additional crosslinker on one of the other residues (e.g., as a monolink modification) you would have to duplicate the crosslinker in the "Chemical Modifications" window in Proteome Discoverer and give it another unique name and select this one instead. That should solve the issue and the search will run through completely.

[Version: >= 3.0.2 Non-Cleavable Search] Maximum peptide length > 50 slows down search for big databases

  • Due to the limitations of our matrix search it is recommended to set a maximum peptide length of less than 50
  • For small databases this doesn't matter
  • For big databases this matters a lot, peptides longer than 50 amino acids trigger re-allocations of the sparse matrix which are very costly and a search that would take 30s suddenly takes 20min
  • Going with the default maximum peptide length in MS Annika is usually a good choice!

[Version: >= 2.2.2] Can't add protein scores in Protein Scorer node

In Proteome Discoverer 3.1 using the Protein Scorer node in the consensus workflow will yield an error with MS Annika. This error does not terminate the search (or the creation of the result) and does not affect crosslink results!

However, as a result it currently might not be possible to calculate Protein FDR with MS Annika in Proteome Discoverer (in case that would be something you are interested in - we don't have experience with that).

We recommend using xiFDR in combination with our xiFDR exporter for higher-level FDR calculations.
Nevertheless, we are working on a fix!

[Version: Any] Proteome Discoverer doesn't parse MGF scan numbers correctly

Proteome Discoverer doesn't read scan numbers from MGF files correctly and just numbers spectra from 0 to n. This messes up down-stream analysis as CSMs can't be properly mapped back to the spectra they originate from. This can't be fixed in MS Annika as MS Annika doesn't read the MGF file, Proteome Discoverer does - however it is possible to remap output files with the this tool we implemented: https://github.com/hgb-bin-proteomics/Proteome_Discoverer_MGF_Scan_Number_Repair_Tool

[Version: Any] Duplicate proteins crash MS Annika

If multiple fasta files are selected for search in MS Annika, users have to make sure that they don't overlap - so do not contain the same protein twice (or more often). Duplicate proteins may lead to a crash in MS Annika as Proteome Discoverer enforces that proteins must be unique. A good practice is to merge fasta files beforehand (so pasting all sequences into one fasta file) and only use one fasta file for search in MS Annika, as Proteome Discoverer makes sure that each fasta file only contains unique proteins.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.