Evaluation of PE5 Entries on the Human Proteome Project Target List.
Publication Title
Journal of proteome research
Document Type
Article
Publication Date
7-4-2025
Keywords
Humans; Proteome; Databases, Protein; Molecular Sequence Annotation; Proteomics; HUPO; human proteome; human proteome project; protein evidence.; washington; isb
Abstract
One aim of the international Human Proteome Organization (HUPO) Human Proteome Project (HPP) is to obtain high-confidence translation evidence for every human protein-coding gene established in its target list of 19,433 entries based on the protein-coding genes from Ensembl-GENCODE. However, 76 are annotated in UniProtKB (as of release 2024_06) with PE5, indicating skepticism in the protein's existence from a manual curator, so it is unclear if these entries belong in the HPP target list. Here, we review these 76 entries by assembling evidence from the literature, reference databases, and genome alignments with other species to conclude whether these entries should be freed from their PE5 status to become annotated with PE1-4 in UniProtKB. We find that 17 of these have credible translation evidence and therefore should be upgraded to PE1. Another 15 lack translation evidence but have transcription evidence, the evolutionary hallmarks of protein-coding genes, and are presumed to produce functional proteins. 41 have no translational or transcriptional evidence, although they still bear the evolutionary hallmarks of protein-coding genes; currently, it remains unclear if these are protein-coding, so their representation becomes a matter of policy. Only 3 entries still seem best categorized as PE5 and excluded from the HUPO-HPP target list.
Specialty/Research Institute
Institute for Systems Biology
DOI
10.1021/acs.jproteome.5c00167