Articles, Abstracts, and Reports

A Protein Standard That Emulates Homology for the Characterization of Protein Inference Algorithms.

Publication Title

Journal of proteome research

Document Type

Article

Publication Date

5-4-2018

Keywords

Algorithms; Benchmarking; Escherichia coli; Humans; Peptide Fragments; Peptides; Proteins; Proteomics; Sequence Homology, Amino Acid; Trypsin

Abstract

A natural way to benchmark the performance of an analytical experimental setup is to use samples of known composition and see to what degree one can correctly infer the content of such a sample from the data. For shotgun proteomics, one of the inherent problems of interpreting data is that the measured analytes are peptides and not the actual proteins themselves. As some proteins share proteolytic peptides, there might be more than one possible causative set of proteins resulting in a given set of peptides and there is a need for mechanisms that infer proteins from lists of detected peptides. A weakness of commercially available samples of known content is that they consist of proteins that are deliberately selected for producing tryptic peptides that are unique to a single protein. Unfortunately, such samples do not expose any complications in protein inference. Hence, for a realistic benchmark of protein inference procedures, there is a need for samples of known content where the present proteins share peptides with known absent proteins. Here, we present such a standard, that is based on E. coli expressed human protein fragments. To illustrate the application of this standard, we benchmark a set of different protein inference procedures on the data. We observe that inference procedures excluding shared peptides provide more accurate estimates of errors compared to methods that include information from shared peptides, while still giving a reasonable performance in terms of the number of identified proteins. We also demonstrate that using a sample of known protein content without proteins with shared tryptic peptides can give a false sense of accuracy for many protein inference methods.

Specialty/Research Institute

Institute for Systems Biology

Download

Providence Full Text

Included in

Genetics and Genomics Commons

COinS

Articles, Abstracts, and Reports

A Protein Standard That Emulates Homology for the Characterization of Protein Inference Algorithms.

Publication Title

Document Type

Publication Date

Keywords

Abstract

Specialty/Research Institute

Included in

Browse

Links

Search

Providence Research

Articles, Abstracts, and Reports

A Protein Standard That Emulates Homology for the Characterization of Protein Inference Algorithms.

Publication Title

Authors

Document Type

Publication Date

Keywords

Abstract

Specialty/Research Institute

Included in

Share

Browse

Links

Search

Providence Research