Development and external validation of interpretable machine learning models for personalized multiple treatment recommendations in non-small cell lung cancer.
[OBJECTIVE] Treatment decision-making for non-small cell lung cancer (NSCLC) is complex, necessitating individualized decision-support tools to improve prognosis.
APA
Luo Q, Zheng Z, et al. (2026). Development and external validation of interpretable machine learning models for personalized multiple treatment recommendations in non-small cell lung cancer.. International journal of medical informatics, 206, 106160. https://doi.org/10.1016/j.ijmedinf.2025.106160
MLA
Luo Q, et al.. "Development and external validation of interpretable machine learning models for personalized multiple treatment recommendations in non-small cell lung cancer.." International journal of medical informatics, vol. 206, 2026, pp. 106160.
PMID
41124941
Abstract
[OBJECTIVE] Treatment decision-making for non-small cell lung cancer (NSCLC) is complex, necessitating individualized decision-support tools to improve prognosis. This study aimed to develop and externally validate interpretable machine learning models to predict multiple treatment recommendations (surgery, radiotherapy and chemotherapy) and assess their prognostic implications.
[METHODS] We utilized data from 31,873 NSCLC patients (aged 45 and older, single primary lesion, 2010-2020) from the Surveillance, Epidemiology, and End Results (SEER) database to develop predictive models, and an additional 11,220 patients formed the external validation cohort. Individual models for surgery, radiotherapy and chemotherapy recommendations were developed and validated using nine machine learning algorithms. Model performance was evaluated using the area under the receiver operating characteristic curve (AUROC), accuracy, F1 score and Brier score. Shapley Additive exPlanations (SHAP) analysis was performed for model interpretability. Propensity score matching and Cox proportional hazards models were employed to quantify the survival benefits associated with each treatment modality.
[RESULTS] LightGBM demonstrated superior performance across all three treatment modalities, achieving AUROC values of 0.902 (surgery), 0.726 (radiotherapy), and 0.829 (chemotherapy) in the development cohort, with robust external validation AUROC values of 0.906, 0.721, and 0.801, respectively. Decision curve analyses revealed that all three models conferred higher net clinical benefits. SHAP analyses identified M stage, tumor stage, N stage and age as critical predictors. Survival analyses revealed significant overall survival benefits from surgery in early-stage patients, whereas radiotherapy and chemotherapy markedly improved outcomes in advanced-stage patients.
[CONCLUSION] The interpretable machine learning models accurately predicted individualized recommendations across multiple treatment modalities, providing valuable insights to support personalized decision-making and optimize patient outcomes for NSCLC patients.
[METHODS] We utilized data from 31,873 NSCLC patients (aged 45 and older, single primary lesion, 2010-2020) from the Surveillance, Epidemiology, and End Results (SEER) database to develop predictive models, and an additional 11,220 patients formed the external validation cohort. Individual models for surgery, radiotherapy and chemotherapy recommendations were developed and validated using nine machine learning algorithms. Model performance was evaluated using the area under the receiver operating characteristic curve (AUROC), accuracy, F1 score and Brier score. Shapley Additive exPlanations (SHAP) analysis was performed for model interpretability. Propensity score matching and Cox proportional hazards models were employed to quantify the survival benefits associated with each treatment modality.
[RESULTS] LightGBM demonstrated superior performance across all three treatment modalities, achieving AUROC values of 0.902 (surgery), 0.726 (radiotherapy), and 0.829 (chemotherapy) in the development cohort, with robust external validation AUROC values of 0.906, 0.721, and 0.801, respectively. Decision curve analyses revealed that all three models conferred higher net clinical benefits. SHAP analyses identified M stage, tumor stage, N stage and age as critical predictors. Survival analyses revealed significant overall survival benefits from surgery in early-stage patients, whereas radiotherapy and chemotherapy markedly improved outcomes in advanced-stage patients.
[CONCLUSION] The interpretable machine learning models accurately predicted individualized recommendations across multiple treatment modalities, providing valuable insights to support personalized decision-making and optimize patient outcomes for NSCLC patients.
MeSH Terms
Humans; Carcinoma, Non-Small-Cell Lung; Machine Learning; Lung Neoplasms; Female; Male; Middle Aged; Precision Medicine; Aged; SEER Program; Prognosis
같은 제1저자의 인용 많은 논문 (5)
- Contrast-enhanced CT-based radiomics for predicting visceral pleural invasion in early-stage non-small cell lung cancer.
- Age-Based Left-Digit Bias in the Treatment of Pancreatic Adenocarcinoma.
- Associations of serum lipid traits with DLBCL: a prospective cohort study from the UK Biobank.
- NCAPD2 Modulates MHC-I Antigen Presentation via the PI3K/AKT Axis to Drive Metastatic Progression in Gastric Cancer.
- A New Integrative Modeling Approach for Generating Counterfactual Projections of Colorectal Cancer Incidence Rates in the Absence of Organized Screening in Australia.