Coder Social home page Coder Social logo

percolator / percolator Goto Github PK

View Code? Open in Web Editor NEW
124.0 24.0 36.0 57.92 MB

Semi-supervised learning for peptide identification from shotgun proteomics datasets

Home Page: http://percolator.ms

License: Other

CMake 8.31% Shell 2.56% C++ 81.47% Python 1.50% C 1.64% Batchfile 1.74% Makefile 2.79%

percolator's Introduction

Percolator

Build status Contributor Covenant

This repository holds the source code for percolator and format converters software packages; a software for postprocessing of shotgun proteomics data.

The source code is hosted at github, binary downloads can be found in the release repository: https://github.com/percolator/percolator/releases

The building procedure is cmake-based. Helper scripts for building on different platforms can be found in: admin/builders/

The cleanest way to build is to use the vagrant virtual machine system, which can be setup using the script: admin/vagrant/manager.sh

percolator's People

Contributors

acquayefrank avatar alinar avatar berglunddaniel avatar br903 avatar charlesegrant avatar ekvall93 avatar eriksjolund avatar fcyu avatar fu avatar ground-beetle avatar jfnavarro avatar johnhalloran321 avatar jwakely avatar kfattila avatar lkuchenb avatar luminita avatar marcus1337 avatar matthewthe avatar percolator avatar theold190 avatar viktorg avatar wfondrie avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

percolator's Issues

faulty psm statistics without -U flag

without the -U flag, weedOutRedundant() was being invoked at both peptide and psm level! Without the -U flag (by default!) the psm statistics are unreliable.

broken gtest

removing redundant variable "base" from CMakeLists causes problems at compile time when -DGTEST="TRUE"

independent versioning for pin and pout schemata

Currently the versioning for pin and pout files is regulated by the
SCHEMA_VERSION_MAJOR and SCHEMA_VERSION_MINOR
variables in the main CMakeLists.
Allow for independent versioning by creating:
_for pin: PIN_VERSION_MAJOR and PIN_VERSION_MINOR
_for pout: POUT_VERSION_MAJOR and POUT_VERSION_MINOR

32bit packages: xsd architecture issue

The 32 bit packages are statically including a 64 bit version of the xsd library. The package installs correctly but any attempt to run percolator will result in the following:
"/usr/bin/percolator: cannot execute binary file"

Wrong file permission on installed xml schemes

When I install the deb-packages on my 64bit ubuntu box percolator complains that it cant read the xml-scheme. Checking the file permission I that the file is owned by user tomasoni:

lukask@kall:$ percolator svn/tda/exp/general/worm/perc/10ppm.pin
Percolator version 2.01, Build Date Jun 30 2011 15:09:22
Copyright (c) 2006-9 University of Washington. All rights reserved.
Written by Lukas Käll ([email protected]) in the
Department of Genome Sciences at the University of Washington.
Issued command:
percolator svn/tda/exp/general/worm/perc/10ppm.pin
Started Tue Jul 5 15:04:54 2011
Hyperparameters fdr=0.01, Cpos=0, Cneg=0, maxNiter=10
WARNING: no local xml schema is available to validate the input.
If further errors are encountered, please reinstall Percolator.
:0:0 error: unsupported protocol in URL
svn/tda/exp/general/worm/perc/10ppm.pin:2:255 error: fatal error during schema scan
WARNING: no local xml schema is available to validate the input.
If further errors are encountered, please reinstall Percolator.
:0:0 error: unsupported protocol in URL
svn/tda/exp/general/worm/perc/10ppm.pin:2:255 error: fatal error during schema scan
lukask@kall:
$
lukask@kall:~$ ls -al /usr/share/percolator/xml-pin-1-2/percolator_in.xsd
-r-----r-- 1 tomasoni users 13890 2011-06-22 11:46 /usr/share/percolator/xml-pin-1-2/percolator_in.xsd

solveInPlace: failed assertion (HEAD)

....
Merging results from 3 datasets
Selecting pi_0=0.651
Calibrating statistics - calculating q values
percolator: /home/lukask/git/percolator/src/BaseSpline.cpp:378: static
void BaseSpline::solveInPlace(Matrix&, Vector&): Assertion
`isfinite(val)' failed.
Aborted

Sqt2pin Seg fault

When executing the fms2 file
/ext/behemot_public/ms2/mattia/08183.ms2 I get a seg-fault. Is it due to absent EZ lines?

stdout of sqt2pin

sqt2pin -o fname generates xml to stdout. Please generate xml to output only when there are no -o option given

low pi_0 estimation causes assert(q1>median) to fail

Low pi_0 estimation causes assert(q1>median) to fail during percolator's normalization step

[from Torsten Ueckert's email - 8th March 2011 ]

I get an exit code 255 from percolator.exe wile processing the
attached input file.

This is data from a Mascot search with the OR20070924_S_mix7_06.RAW file.

Percolator astray

for "some" input, Percolator gives the warning:
"Reseting score vector, using default vector".
The weights calculated for XCorr and/or deltaCn are negative

See the five subfolders of:
dirac.scilifelab.se:/scratch/temp/vagisha_percolatorAstray/yrc_5expts_w_perc_wrng_after_tide_fix/

make percolator-converters dependent on percolator

; lsb_release -a
Distributor ID: Ubuntu
Description: Ubuntu 10.10
Release: 10.10
Codename: maverick

$ sudo dpkg --force-all --install percolator-converters-v2_00-linux-amd64.deb 
(Reading database ... 315483 files and directories currently installed.)
Unpacking percolator-converters (from percolator-converters-v2_00-linux-amd64.deb) ...
dpkg: warning: overriding problem because --force enabled:
 trying to overwrite '/usr/share/percolator/xml-pin-1-2/percolator_in.xsd', which is also in package percolator 2.00.0
Setting up percolator-converters (2.00.0) ...

Processing messages from sqt2pin are confusing

I get the following processing info on stderr:

Sqt2Pin version 2.01, Build Date Jun 30 2011 12:52:26
Copyright (c) 2010 Lukas Käll. All rights reserved.
Written by Lukas Käll ([email protected]) in the
Department of Biochemistry and Biophysics at the Stockholm University.
Issued command:
sqt2pin ../target200ppm/Celegans_Jupter_C12_120min_grad.sqt ../reverse200ppm/Celegans_Jupter_C12_120min_grad.sqt

Reading from sqt file(s) and writing to database(s):
reading ../target200ppm/Celegans_Jupter_C12_120min_grad.sqt
reading ../reverse200ppm/Celegans_Jupter_C12_120min_grad.sqt

Writing database(s) to pin file:
writing ../target200ppm/Celegans_Jupter_C12_120min_grad.sqt (and correspondent decoy file)

Termination successful.

  1. write sqt2pin and not Sqt2pin
  2. We are hopefully not writing to the sqt file writing ../target200ppm/Celegans_Jupter_C12_120min_grad.sqt (and correspondent decoy file) -- instead we are writing the contense from those files to stdout
  3. What does Termination successful mean?

unimod ids for PTMs

The current PTM representation is sequest style * @ etc...
To be changed to unimod id's (eg MY[unimod:1]PEPTIDE)

Sqt2pin: psm grouping

Current psm grouping (by scan number) seems to cause information from different files to be retained in memory longer than necessary. Consider instead splitting the fragSpectrumScan: scanNumber="[scan #]_[Filename]".

[from Barbara's email, 7th March 2011]

I noticed that the schema groups psms first by scan number. So within the
tag, there are multiple PSMs all for spectra with the
same scan number but with different charge states and from different files.
For example (I've removed some of the attributes)

<fragSpectrumScan experimentalMassToCharge="912.6508" scanNumber="35">
  <peptideSpectrumMatch  id="FileName_35_2_1">
    <features>  ...   </features>
    <peptide>   ...   </peptide>
  </peptideSpectrumMatch>

  <peptideSpectrumMatch  id="FileName_35_4_1">
    <features>  ...   </features>
    <peptide>   ...   </peptide>
  </peptideSpectrumMatch>

  <peptideSpectrumMatch  id="DifferentFileName_35_2_1">
    <features>  ...   </features>
    <peptide>   ...   </peptide>
  </peptideSpectrumMatch>
</fragSpectrumScan>

Seems like this would require that all the files be read in and stored
before the pin.xml can be written out. Perhaps percolator requires that the
psms be grouped in this way, but if not maybe it's possible to change the
schema slightly so that the pin.xml file is written as the .sqt files are
read so that there's no need to store much in memory.

percolator stdin

I am missing an option to read pin files from stdin. Ot would be nice to execute a pipe structure such as "sqt2pin target.sqt decoy.sqt | percolator -E - -p"

new pin-1-2

pin: experimentalMassToCharge element is duplicated in fragSpectrumScan element and psm element. Remove it from fragSpectrumScan

<exp_mass> and <calc_mass> missing from pout

Scores.cpp line 75.
Output <exp_mass> and <calc_mass> whenever the information is available; boolean MassHandler::monoisotopic connected to old -M option is no longer meaningful.

Deprecated options

The options -e -c -y -o, -s, -Y, -2 of percolator 1.15 seem redundant - please remove or explain better.

report on vm configurations

Write a report with as many details as possible about the virtual machines you have for cross-compiling and testing.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.