Open Mass Spectrometry Search Algorithm
2004/12/27
39203001生化所 甘瑞麒
Abstract:
Geer et.al. developed a probability-based algorithm to identify peptides from tandem mass spectra and protein sequence database. The method used Poisson distribution to model frequency of product ion matches between the measured and calculated one. The null hypothesis of this distribution is "all peptide matches to a ms/ms spectrum is random". They used the model to score the randomness of ms/ms spectrum match to a peptide in database and used the score to determine significance of a hit between spectrum and peptide sequence. They showed better efficiency, sensitivity, specificity, speed compare to another popular probability-based peptide identification software Mascot.
Source:
Geer LY, Markey SP, Kowalak JA, Wagner L, Xu M, Maynard DM, Yang X, Shi W, Bryant SH. (2004) J Proteome Res 3: 958-964.
Reference:
1. Sadygov RG, Yates JR 3rd. (2003) A hypergeometric probability model for protein identification and validation using tandem mass spectral data and protein sequence databases. Anal. Chem. 75: 3792-3798.
2. Aebersold R, Mann M.(2003) Mass spectrometry-based proteomics. Nature 422: 198-207.
3. Perkins DN, Pappin DJ, Creasy DM, Cottrell JS. (1999) Probability-based protein identification by searching sequence databases using mass spectrometry data. Electrophoresis 20: 3551-3567.