본문으로 건너뛰기
← 뒤로

Development of a machine learning model for preoperative prediction of spread through air spaces in resectable non-small cell lung cancer: A single-center retrospective study.

Oncology letters 2026 Vol.31(2) p. 60

Yang C, Ding G, Zhan B, Xuan L, Cheng F, Yang Y

📝 환자 설명용 한 줄

Spread through air spaces (STAS) is a pathological feature associated with poor prognosis in non-small cell lung cancer (NSCLC).

이 논문을 인용하기

BibTeX ↓ RIS ↓
APA Yang C, Ding G, et al. (2026). Development of a machine learning model for preoperative prediction of spread through air spaces in resectable non-small cell lung cancer: A single-center retrospective study.. Oncology letters, 31(2), 60. https://doi.org/10.3892/ol.2025.15413
MLA Yang C, et al.. "Development of a machine learning model for preoperative prediction of spread through air spaces in resectable non-small cell lung cancer: A single-center retrospective study.." Oncology letters, vol. 31, no. 2, 2026, pp. 60.
PMID 41383980

Abstract

Spread through air spaces (STAS) is a pathological feature associated with poor prognosis in non-small cell lung cancer (NSCLC). However, its diagnosis currently depends exclusively on postoperative histopathological examination, limiting its utility for preoperative surgical planning. The present study aimed to develop an interpretable machine learning (ML) model using preoperative clinical and semantic CT features to predict STAS in surgically resectable NSCLC. The present study retrospectively analyzed 584 patients with pathologically confirmed NSCLC who underwent surgical resection. A total of five ML algorithms were developed using routinely available preoperative data and evaluated using repeated 5-fold cross-validation to ensure model robustness and mitigate overfitting. The optimal model was selected based on area under receiver operating characteristic curve (AUC). Feature importance was assessed using SHapley Additive exPlanations (SHAP) analysis for interpretability. Among the five models, eXtreme Gradient Boosting (XGBoost) demonstrated the highest predictive performance (mean cross-validated AUC=0.868 on training set; AUC=0.764 on test set). SHAP analysis identified nodule type, lobulation and smoking history as the most influential features associated with STAS. In conclusion, the present study developed a clinically interpretable XGBoost model capable of predicting STAS using readily accessible preoperative features. This model holds promise as a decision-support tool to potentially guide personalized surgical strategies in NSCLC in the future.

같은 제1저자의 인용 많은 논문 (5)