Development of a machine learning model for preoperative prediction of spread through air spaces in resectable non-small cell lung cancer: A single-center retrospective study.
Spread through air spaces (STAS) is a pathological feature associated with poor prognosis in non-small cell lung cancer (NSCLC).
APA
Yang C, Ding G, et al. (2026). Development of a machine learning model for preoperative prediction of spread through air spaces in resectable non-small cell lung cancer: A single-center retrospective study.. Oncology letters, 31(2), 60. https://doi.org/10.3892/ol.2025.15413
MLA
Yang C, et al.. "Development of a machine learning model for preoperative prediction of spread through air spaces in resectable non-small cell lung cancer: A single-center retrospective study.." Oncology letters, vol. 31, no. 2, 2026, pp. 60.
PMID
41383980
Abstract
Spread through air spaces (STAS) is a pathological feature associated with poor prognosis in non-small cell lung cancer (NSCLC). However, its diagnosis currently depends exclusively on postoperative histopathological examination, limiting its utility for preoperative surgical planning. The present study aimed to develop an interpretable machine learning (ML) model using preoperative clinical and semantic CT features to predict STAS in surgically resectable NSCLC. The present study retrospectively analyzed 584 patients with pathologically confirmed NSCLC who underwent surgical resection. A total of five ML algorithms were developed using routinely available preoperative data and evaluated using repeated 5-fold cross-validation to ensure model robustness and mitigate overfitting. The optimal model was selected based on area under receiver operating characteristic curve (AUC). Feature importance was assessed using SHapley Additive exPlanations (SHAP) analysis for interpretability. Among the five models, eXtreme Gradient Boosting (XGBoost) demonstrated the highest predictive performance (mean cross-validated AUC=0.868 on training set; AUC=0.764 on test set). SHAP analysis identified nodule type, lobulation and smoking history as the most influential features associated with STAS. In conclusion, the present study developed a clinically interpretable XGBoost model capable of predicting STAS using readily accessible preoperative features. This model holds promise as a decision-support tool to potentially guide personalized surgical strategies in NSCLC in the future.
같은 제1저자의 인용 많은 논문 (5)
- Estimating major pathological response in non-small cell lung cancer patients with post-neoadjuvant therapy using MMT-net.
- Long-term fine particulate air pollution exposure and risk of gastric cancer mortality in Taiwan.
- A trispecific antibody engaging T cells with tumour and myeloid cells augments antitumour immunity.
- Lactylation as a metabolic-epigenetic switch in cancer: dual roles in cell death resistance and therapeutic vulnerability.
- PANX1-mediated NLRP3 inflammasome activation promotes an adaptive doxorubicin resistance through IL-1β signaling in breast cancer.