TY - JOUR
T1 - Calculating absolute and relative protein abundance from mass spectrometry-based protein expression data
AU - Vogel, Christine
AU - Marcotte, Edward M.
N1 - Funding Information:
C.V. acknowledges support by the International Human Frontier Science Program. We thank John Braisted and Srilatha Kuntumalla from JCVI for many useful discussions regarding the APEX calculations. This work was supported by grants from the Welch (F-1515) and Packard Foundations, the National Science Foundation and National Institutes of Health.
PY - 2008
Y1 - 2008
N2 - Mass spectrometry (MS)-based shotgun proteomics allows protein identifications even in complex biological samples. Protein abundances can then be estimated from the counts of tandem MS (MS/MS) spectra attributable to each protein, provided one accounts for differential MS detectability of contributing peptides. We developed a method, APEX, which calculates Absolute Protein EXpression levels based upon learned correction factors, MS/MS spectral counts and each protein's probability of correct identification. This protocol describes APEX-based calculations in three parts. (i) Using training data, peptide sequences and their sequence properties, a model is built to estimate MS detectability (Oi) for any given protein. (ii) Absolute protein abundances are calculated from spectral counts, identification probabilities and the learned Oi-values. (iii) Simple statistics allow calculation of differential expression in two distinct biological samples, i.e., measuring relative protein abundances. APEX-based protein abundances span 3-4 orders of magnitude and are applicable to mixtures of 100s to 1,000s of proteins.
AB - Mass spectrometry (MS)-based shotgun proteomics allows protein identifications even in complex biological samples. Protein abundances can then be estimated from the counts of tandem MS (MS/MS) spectra attributable to each protein, provided one accounts for differential MS detectability of contributing peptides. We developed a method, APEX, which calculates Absolute Protein EXpression levels based upon learned correction factors, MS/MS spectral counts and each protein's probability of correct identification. This protocol describes APEX-based calculations in three parts. (i) Using training data, peptide sequences and their sequence properties, a model is built to estimate MS detectability (Oi) for any given protein. (ii) Absolute protein abundances are calculated from spectral counts, identification probabilities and the learned Oi-values. (iii) Simple statistics allow calculation of differential expression in two distinct biological samples, i.e., measuring relative protein abundances. APEX-based protein abundances span 3-4 orders of magnitude and are applicable to mixtures of 100s to 1,000s of proteins.
UR - http://www.scopus.com/inward/record.url?scp=51649124498&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=51649124498&partnerID=8YFLogxK
U2 - 10.1038/nprot.2008.132
DO - 10.1038/nprot.2008.132
M3 - Article
C2 - 18772871
AN - SCOPUS:51649124498
SN - 1754-2189
VL - 3
SP - 1444
EP - 1451
JO - Nature Protocols
JF - Nature Protocols
IS - 9
ER -