Elson S. Floyd College of Medicine Washington State University Spokane, WA
E. Tizpa1, A. Tam2, S. Maroongroge2, A. Amini2, S. M. Glaser2, S. V. Dandapani2, B. Yuh3, J. Yoshida4, S. Liu5, T. B. Dorff6, S. K. Pal6, J. Yamzon3, A. Zhumkhawala3, R. Satterthwaite3, J. Montez3, P. Lee7, J. Y. C. Wong2, Y. R. Li2, and C. J. Ladbury2; 1Washington State University Elson S. Floyd College of Medicine, Spokane, WA, 2Department of Radiation Oncology, City of Hope National Medical Center, Duarte, CA, 3Department of Surgery, City of Hope National Medical Center, Duarte, CA, 4Department of Surgery, Lennar Foundation Comprehensive Cancer Center, City of Hope National Medical Center, Irvine, CA, 5Department of Medical Oncology, Lennar Foundation Comprehensive Cancer Center, City of Hope National Medical Center, Irvine, CA, 6Department of Medical Oncology & Therapeutics Research, City of Hope National Medical Center, Duarte, CA, 7Department of Radiation Oncology, Lennar Foundation Comprehensive Cancer Center, City of Hope National Medical Center, Irvine, CA
Purpose/Objective(s):A critique of genomic risk classifiers is potential correlation with readily available clinical data. For these classifiers to enhance clinical decision making, they must demonstrate additional discrimination from clinical variables alone with regards to prognostic and/or predictive value. A 22-gene classifier is increasingly employed to inform treatment decisions in prostate cancer. This study aimed to assess whether its risk score remained independent of available clinical variables when using machine learning (ML) to predict genomic risk score outputs.Materials/
Methods: This was a retrospective study of males with localized prostate cancer treated at one of twenty sites within a single hospital network. Patients whose tumors were sent for genomic risk profiling were eligible. Clinical features including year of biopsy, age, clinical stage, prostate specific antigen (PSA), Gleason score, and National Comprehensive Cancer Network (NCCN) risk group were extracted from the medical record and genomic risk score/category were extracted from the pathology results. Logistic regression for binary classification and linear regression for continuous classification plus 5 ML models were trained to predict the risk score, low-risk disease, and high-risk disease. Model performance was measured using area under the curve (AUC) for binary classification and Spearman rho (?) for regression. The best-performing model was explained using SHapley Additive exPlanation (SHAP) values. Results: A total of 354 patients with biopsy specimens obtained between 2010 and 2024 were identified. Median age was 66.7 (IQR: 61.4-73.2). A total of 27.1%, 57.9%, and 15.0% of patients were NCCN low, intermediate, and high risk, respectively. Median genomic risk score was 0.385 (IQR: 0.26-0.58). A total of 57.6%, 18.1%, and 24.3% of patients had genomic risk classified as low, intermediate, and high, respectively. An extreme gradient boosting tree achieved the best performance at predicting genomic risk score (?: 0.526; 95%CI: 0.355-0.668). A random forest model achieved the best performance at predicting high-risk (AUC: 0.790; 95%CI: 0.671-0.909) and low-risk (AUC: 0.749; 95%CI: 0.631-0.867) genomic score. The most important variables for predicting risk score were primary Gleason, NCCN risk category, and total Gleason. Risk factors predicting high-risk disease included primary Gleason, NCCN risk group, and total Gleason. For low-risk disease they were primary Gleason, age, and total Gleason. Conclusion: ML predicted the output of genomic risk classifiers with favorable albeit imperfect performance using clinical variables alone. Future analyses should evaluate whether genomic risk classifiers may be particularly useful in the subset of patients whose genomic risk score differs from what was predicted using clinical variables alone. ML in combination with genomic risk should also be evaluated as synergistic tools to predict actuarial outcomes once sufficient follow-up is available.