Multi-institutional Assessment of Pathologist Scoring HER2 Immunohistochemistry.

Document Type


Publication Date


Publication Title

Modern pathology : an official journal of the United States and Canadian Academy of Pathology, Inc


oregon; portland


The HercepTest was approved 20+ years ago as the companion diagnostic test for trastuzumab in human epidermal growth factor 2 (HER2) or ERBB2 gene-amplified/overexpressing breast cancers. Subsequent HER2 immunohistochemistry (IHC) assays followed, including the now most common Ventana 4B5 assay. Although this IHC assay has become the clinical standard, its reliability, reproducibility, and accuracy have largely been approved and accepted on the basis of concordance among small numbers of pathologists without validation in a real-world setting. In this study, we evaluated the concordance and interrater reliability of scoring HER2 IHC in 170 breast cancer biopsies by 18 breast cancer-specialized pathologists from 15 institutions. We used the Observers Needed to Evaluate Subjective Tests method to determine the plateau of concordance and the minimum number of pathologists needed to estimate interrater agreement values for large numbers of raters, as seen in the real-world setting. We report substantial discordance within the intermediate categories (<1% agreement for 1+ and 3.6% agreement for 2+) in the 4-category HER2 IHC scoring system. The discordance within the IHC 0 cases is also substantial with an overall percent agreement (OPA) of only 25% and poor interrater reliability metrics (0.49 Fleiss' kappa, 0.55 intraclass correlation coefficient). This discordance can be partially reduced by using a 3-category system (28.8% vs 46.5% OPA for 4-category and 3-category scoring systems, respectively). Observers Needed to Evaluate Subjective Tests plots suggest that the OPA for the task of determining a HER2 IHC score 0 from not 0 plateaus statistically around 59.4% at 10 raters. Conversely, at the task of scoring HER2 IHC as 3+ or not 3+ pathologists' concordance was much higher with an OPA that plateaus at 87.1% with 6 raters. This suggests that legacy HER2 IHC remains valuable for finding the patients in whom the ERBB2 gene is amplified but unacceptably discordant in assigning HER2-low or HER2-negative status for the emerging HER2-low therapies.

Clinical Institute


Clinical Institute

Women & Children


Pathology & Laboratory Medicine