Colorectal Cancer Detection via Metabolites and Machine Learning.
Publication Title
Current issues in molecular biology
Document Type
Article
Publication Date
4-30-2024
Keywords
california; santa monica; pacific neurosci; machine learning
Abstract
Today, colorectal cancer (CRC) diagnosis is performed using colonoscopy, which is the current, most effective screening method. However, colonoscopy poses risks of harm to the patient and is an invasive process. Recent research has proven metabolomics as a potential, non-invasive detection method, which can use identified biomarkers to detect potential cancer in a patient's body. The aim of this study is to develop a machine-learning (ML) model based on chemical descriptors that will recognize CRC-associated metabolites. We selected a set of metabolites found as the biomarkers of CRC, confirmed that they participate in cancer-related pathways, and used them for training a machine-learning model for the diagnostics of CRC. Using a set of selective metabolites and random compounds, we developed a range of ML models. The best performing ML model trained on Stage 0-2 CRC metabolite data predicted a metabolite class with 89.55% accuracy. The best performing ML model trained on Stage 3-4 CRC metabolite data predicted a metabolite class with 95.21% accuracy. Lastly, the best-performing ML model trained on Stage 0-4 CRC metabolite data predicted a metabolite class with 93.04% accuracy. These models were then tested on independent datasets, including random and unrelated-disease metabolites. In addition, six pathways related to these CRC metabolites were also distinguished
Area of Special Interest
Neurosciences (Brain & Spine)
Area of Special Interest
Digestive Health
Area of Special Interest
Cancer
Specialty/Research Institute
Gastroenterology
Specialty/Research Institute
Neurosciences
Specialty/Research Institute
Oncology
DOI
10.3390/cimb46050254