QPROT: Statistical method for testing differential expression using protein-level intensity data in label-free quantitative proteomics

H Choi, S Kim, D Fermin, CC Tsou, AI Nesvizhskii - Journal of proteomics, 2015 - Elsevier
H Choi, S Kim, D Fermin, CC Tsou, AI Nesvizhskii
Journal of proteomics, 2015Elsevier
We introduce QPROT, a statistical framework and computational tool for differential protein
expression analysis using protein intensity data. QPROT is an extension of the QSPEC suite,
originally developed for spectral count data, adapted for the analysis using continuously
measured protein-level intensity data. QPROT offers a new intensity normalization
procedure and model-based differential expression analysis, both of which account for
missing data. Determination of differential expression of each protein is based on the …
Abstract
We introduce QPROT, a statistical framework and computational tool for differential protein expression analysis using protein intensity data. QPROT is an extension of the QSPEC suite, originally developed for spectral count data, adapted for the analysis using continuously measured protein-level intensity data. QPROT offers a new intensity normalization procedure and model-based differential expression analysis, both of which account for missing data. Determination of differential expression of each protein is based on the standardized Z-statistic based on the posterior distribution of the log fold change parameter, guided by the false discovery rate estimated by a well-known Empirical Bayes method. We evaluated the classification performance of QPROT using the quantification calibration data from the clinical proteomic technology assessment for cancer (CPTAC) study and a recently published Escherichia coli benchmark dataset, with evaluation of FDR accuracy in the latter.
Biological significance
QPROT is a statistical framework with computational software tool for comparative quantitative proteomics analysis. It features various extensions of QSPEC method originally built for spectral count data analysis, including probabilistic treatment of missing values in protein intensity data. With the increasing popularity of label-free quantitative proteomics data, the proposed method and accompanying software suite will be immediately useful for many proteomics laboratories.
This article is part of a Special Issue entitled: Computational Proteomics.
Elsevier