Predictive models may aid oncologists with making critical treatment decisions. A machine learning model is built using gene expression data from patients’ primary tumor tissues to predict whether a patient will respond positively or negatively to two chemotherapeutics: 5-Fluorouracil and Gemcitabine. It was also found that the genes most informative for predicting drug response were enriched in well-known cancer signaling pathways and their potential significance in chemotherapy prognosis was highlighted. As Fluorouracil and Gemcitabine provide the largest numbers of patients within TCGA they were chosen for this model. Normalized gene expression data were clustered and used as the input features. Matching clinical trial data was used to ascertain the response of the patients via multiple classification methods. Multiple clustering and classification methods were compared for prediction accuracy of drug response. Clara and random forest were found to be the best clustering and classification methods,respectively. The models predict with up to 86% accuracy.
Input variables : Patients’ clinical trial data and transcriptomic data from patients’ primary tumor samples
Output Variables : Patient's response to two chemotherapeutics: 5-Fluorouracil and Gemcitabine (Positive/ Negative)
Statistical | : | Somers D | Accuracy | Precision and Recall | Confusion Matrix | F1 Score | Roc and Auc | Prevalence | Detection Rate | Balanced Accuracy | Cohen's Kappa | Concordance | Gini Coefficent | KS Statistic | Youden's J Index |
Business | : | # of Adverse Events |
Infrastructure | : | Log Bytes | Logging/User/IAMPolicy | Logging/User/VPN | CPU Utilization | Memory Usage | Error Count | Prediction Count | Prediction Latencies | Private Endpoint Prediction Latencies | Private Endpoint Response Count |
Visit Model : github.com
Additional links : researchgate.net
Model Category | : | Public |
Date Published | : | September, 2020 |
Healthcare Domain | : | Life Sciences |
Code | : | github.com |
Clinical Trials |
Adverse Drug Events |