본문으로 건너뛰기
← 뒤로

Interpretable machine-learning prediction of severe myelosuppression in colorectal cancer patients receiving chemotherapy using XGBoost and SHAP: a retrospective study with a web-based calculator.

Frontiers in oncology 2026 Vol.16() p. 1785146

Ding L, Peng L, Xu Z, Cui Z, Wang Z

📝 환자 설명용 한 줄

[BACKGROUND] Patients with colorectal cancer (CRC) are susceptible to severe myelosuppression (SMS) after chemotherapy.

이 논문을 인용하기

BibTeX ↓ RIS ↓
APA Ding L, Peng L, et al. (2026). Interpretable machine-learning prediction of severe myelosuppression in colorectal cancer patients receiving chemotherapy using XGBoost and SHAP: a retrospective study with a web-based calculator.. Frontiers in oncology, 16, 1785146. https://doi.org/10.3389/fonc.2026.1785146
MLA Ding L, et al.. "Interpretable machine-learning prediction of severe myelosuppression in colorectal cancer patients receiving chemotherapy using XGBoost and SHAP: a retrospective study with a web-based calculator.." Frontiers in oncology, vol. 16, 2026, pp. 1785146.
PMID 41939468

Abstract

[BACKGROUND] Patients with colorectal cancer (CRC) are susceptible to severe myelosuppression (SMS) after chemotherapy. Conventional linear models may have limited performance and may fail to capture complex, nonlinear risk patterns, which can hinder early risk stratification and timely intervention. We aimed to develop an interpretable machine-learning model to predict SMS and to build a web-based calculator for individualized risk assessment.

[METHODS] We retrospectively enrolled 987 CRC patients who received capecitabine plus oxaliplatin with or without targeted therapy at our hospital between March 2021 and November 2025. Nine predictors were selected using least absolute shrinkage and selection operator (LASSO) regression. We developed and compared several models, including extreme gradient boosting (XGBoost), random forest, decision tree, and support vector machine. Model interpretability was assessed using SHapley Additive exPlanations (SHAP) at both the global and individual levels to characterize nonlinear effects and feature interactions. A web-based, real-time risk calculator was also implemented.

[RESULTS] On the validation set, the XGBoost model achieved the best balance of predictive performance (AUC = 0.906; sensitivity = 0.864). SHAP analysis quantified the contribution of each feature, with the top three contributors being white blood cell count, number of chemotherapy cycles, and Karnofsky Performance Status score. Nonlinear threshold effects were observed for continuous variables, including white blood cell count, platelet count, and serum albumin. Interactions were identified between white blood cell count and performance status, as well as between white blood cell count and number of chemotherapy cycles. The web-based calculator enables real-time individualized risk estimation. Decision curve analysis indicated favorable net clinical benefit across a range of decision thresholds.

[CONCLUSION] We developed a high-performing and interpretable model for predicting SMS in CRC patients receiving chemotherapy. The accompanying web-based calculator may provide a practical tool for early risk stratification and individualized management of chemotherapy-related SMS.

같은 제1저자의 인용 많은 논문 (5)