doi:10.1016/j.jchromb.2004.12.009
Copyright © 2004 Elsevier B.V. All rights reserved.
Database independent detection of isotopically labeled MS/MS spectrum peptide pairs
References and further reading may be available for this article. To view references and further reading you must
purchase this article.
Frank Potthasta, 1,
,
, Jiri Ocenasekb, 1, Dorothea Rutishausera, Martin Pelikanc and Ralph Schlapbacha
aFunctional Genomics Center Zurich, Uni/ETH Zurich, Winterthurer Strasse 190, 8057 Zurich, Switzerland
bComputational Laboratory, ETH Zurich, Hirschengraben 84, 8092 Zurich, Switzerland
cDepartment of Mathematics and Computer Science, University of Missouri at St. Louis, 320 CCB 8001 Natural Bridge Road, St. Louis, MO 63121, USA
Received 1 April 2004;
accepted 7 December 2004.
Available online 30 December 2004.
Abstract
Mass spectrometry data generated in differential profiling of complex protein samples are classically exploited using database searches. In addition, quantitative profiling is performed by various methods, one of them using isotopically coded affinity tags, where one typically uses a light and a heavy tag. Here, we present a new algorithm, ICATcher, which detects pairs of light/heavy peptide MS/MS spectra independent of sequence databases. The method can be used for de novo sequencing and detection of posttranslational modifications. ICATcher is distributed as open source software.
Keywords: Proteomics; ICATcher; ICAT; Peptide sequencing; PTM detection; Differential analysis; Chromatography
Fig. 1. Example of a hypercluster consisting of four clusters for a human gamma-actin peptide (Swissprot P02571). The peptide contains one cysteine which might be labeled with a light or a heavy ICAT tag, and one methionine which may be oxidized or not; i.e. there are four possible modification forms and therefore four possible clusters in total. Each of the four levels in the graph presents one modification form of the peptide. The light/heavy forms of ICAT-labeled cysteine are denoted with C/C*, respectively. Oxidized methionine is denoted with M#. In this example, all peptide forms have been detected several times, e.g. there were eight spectra measured on the C*M# precursor ion of this peptide. Indicated masses are average masses as calculated by Sequest.
 |
Fig. 2. Quality control of the classification algorithm. (A) Scan-number distance (corresponding to elution time difference) histograms (bin size 100) of all light/heavy pairs passing the first and those passing the second filter. The triangle indicates the noise model. It is clearly visible that pairs passing the second filter have a strong tendency to co-elute, which is in line with experimental knowledge of the ICAT reagent. The total distribution is close to the expected random distribution plus the second filter ICATcher signal, which is a footprint of a good classification. (B) Histogram of ICATcher scores (bin size 0.05). The two step-histograms are the distribution of all ICATcher scores passing the first filter, and (smaller one) the distribution of scores for all correct pairs identified by Sequest and Peptide Prophet. The continuous curve is a scaled Gaussian model fit for those peptide links which pass the first filter but are false positives. Parameters: σ ≈ 0.13, maximum at μ ≈ 0.54. Only pairs with an ICATcher score below 0.2 are classified as true Δ = 9.03 light/heavy pairs.

Corresponding author. Tel.: +41 1 6353905; fax: +41 1 6353922.
1 Equally contributing authors.