Clinical and Multiomic Features Differentiate Young Black and White Breast Cancer Cohorts Derived by Machine Learning Approaches.

Publication Title

Clinical breast cancer

Document Type

Article

Publication Date

11-28-2024

Keywords

Biclustering; Breast cancer molecular subtype; Cohort analysis; Health Disparities; Multi-omics; washington; isb; diversity; machine learning

Abstract

BACKGROUND: There are documented differences in Breast cancer (BrCA) presentations and outcomes between Black and White patients. In addition to molecular factors, socioeconomic, racial, and clinical factors result in disparities in outcomes for women in the United States. Using machine learning and unsupervised biclustering methods within a multiomics framework, here we sought to shed light on the biological and clinical underpinnings of observed differences between Black and White BrCA patients.

MATERIALS AND METHODS: We examined The Cancer Genome Atlas BrCA samples from stage II patients aged 50 or younger that are Black (BAA50) or White (W50) (n = 139 patients; 36 BAA50 and 103 W50) These patients were chosen because marked differences in survival were observed in an earlier study. A variety of multiomic data sets were analyzed to further characterize the clinical and molecular disparities for insights.

RESULTS: We coupled RNAseq data with protein-protein interaction as well as BrCA-specific protein co-expression network data to identify 2 novel biclusters. These biclusters are significantly associated with clinical features including race, number of lymph nodes involved with disease, estrogen receptor status, progesterone receptor status and menopausal status. There were also differentially mutated genes. Using DNA methylation data, we identified differentially methylated genes. Machine learning algorithms were trained on differential methylation values of driver genes. The trained algorithms were successful in predicting the bicluster assignment of each sample.

CONCLUSION: These results demonstrate that there was a significant association between the cluster membership and BAA50 and W50 cohorts, indicating that these biclusters accurately stratify these cohorts.

Area of Special Interest

Cancer

Specialty/Research Institute

Oncology

Specialty/Research Institute

Population Health

DOI

10.1016/j.clbc.2024.11.015

Share

COinS