Construction of a Lung Cancer Screening Risk Prediction Model Based on Machine Learning Algorithms.
1/5 보강
[OBJECTIVE] Utilizing lung cancer risk prediction models at the screening stage can enhance the accuracy of identifying high-risk individuals eligible for lung cancer screening.
- 95% CI 0.574-0.720
APA
Zhang T, Chen Y, et al. (2026). Construction of a Lung Cancer Screening Risk Prediction Model Based on Machine Learning Algorithms.. Journal of evidence-based medicine, 19(1), e70104. https://doi.org/10.1111/jebm.70104
MLA
Zhang T, et al.. "Construction of a Lung Cancer Screening Risk Prediction Model Based on Machine Learning Algorithms.." Journal of evidence-based medicine, vol. 19, no. 1, 2026, pp. e70104.
PMID
41498190 ↗
Abstract 한글 요약
[OBJECTIVE] Utilizing lung cancer risk prediction models at the screening stage can enhance the accuracy of identifying high-risk individuals eligible for lung cancer screening. However, there is a relative lack of research on such prediction models in China, particularly regarding machine learning algorithms.
[METHODS] A stratified random sampling method was employed to randomly divide the dataset into a training set (70%) and a validation set (30%). Key variables were screened using LASSO regression. Then logistic regression and XGBoost algorithm were utilized to construct a lung cancer risk prediction model in the training set and validate it in the validation set, respectively.
[RESULTS] A lung cancer risk prediction model was constructed using 11,708 participants enrolled in a prospective cohort, the Guangzhou Lung-Care Project Program. In the constructed lung cancer risk prediction models, the AUC of the logistic regression model in the validation set was 0.647 (95% CI: 0.574-0.720); in contrast, the AUC of the XGBoost model based on the machine learning algorithm in the validation set was 0.658 (95% CI: 0.589-0.727), demonstrating slightly better discriminative ability compared to the logistic regression model. In addition, this study found the important effect of childhood exposure to cooking fuels on the risk of lung cancer, which has been rarely considered in previous research.
[CONCLUSION] The lung cancer risk prediction model constructed based on the XGBoost algorithm is better than the logistic regression algorithm in terms of prediction accuracy and robustness, aiding in the risk assessment of individuals undergoing screening.
[METHODS] A stratified random sampling method was employed to randomly divide the dataset into a training set (70%) and a validation set (30%). Key variables were screened using LASSO regression. Then logistic regression and XGBoost algorithm were utilized to construct a lung cancer risk prediction model in the training set and validate it in the validation set, respectively.
[RESULTS] A lung cancer risk prediction model was constructed using 11,708 participants enrolled in a prospective cohort, the Guangzhou Lung-Care Project Program. In the constructed lung cancer risk prediction models, the AUC of the logistic regression model in the validation set was 0.647 (95% CI: 0.574-0.720); in contrast, the AUC of the XGBoost model based on the machine learning algorithm in the validation set was 0.658 (95% CI: 0.589-0.727), demonstrating slightly better discriminative ability compared to the logistic regression model. In addition, this study found the important effect of childhood exposure to cooking fuels on the risk of lung cancer, which has been rarely considered in previous research.
[CONCLUSION] The lung cancer risk prediction model constructed based on the XGBoost algorithm is better than the logistic regression algorithm in terms of prediction accuracy and robustness, aiding in the risk assessment of individuals undergoing screening.
🏷️ 키워드 / MeSH 📖 같은 키워드 OA만
같은 제1저자의 인용 많은 논문 (5)
- Metabolomic profiling of genotype-derived ABO blood group, secretor status and Lewis antigens and association with pancreatic ductal adenocarcinoma risk.
- The molecular mechanisms and potential therapeutic implications of the crosstalk between DNA methylation and metabolic reprogramming in thyroid cancer.
- Pathology of Cutaneous T Cell Lymphoma: A Narrative Review.
- Dropping out or Moving on? A Systematic Review With Meta-Aggregation of the Experience of Prehabilitation Among Patients With Cancer.
- NR4A1 mediates chemotherapy‑induced senescence via the PI3K/AKT pathway in gastric cancer cells.
🏷️ 같은 키워드 · 무료전문 — 이 논문 MeSH/keyword 기반
- A Phase I Study of Hydroxychloroquine and Suba-Itraconazole in Men with Biochemical Relapse of Prostate Cancer (HITMAN-PC): Dose Escalation Results.
- Self-management of male urinary symptoms: qualitative findings from a primary care trial.
- Clinical and Liquid Biomarkers of 20-Year Prostate Cancer Risk in Men Aged 45 to 70 Years.
- Diagnostic accuracy of Ga-PSMA PET/CT versus multiparametric MRI for preoperative pelvic invasion in the patients with prostate cancer.
- Comprehensive analysis of androgen receptor splice variant target gene expression in prostate cancer.
- Clinical Presentation and Outcomes of Patients Undergoing Surgery for Thyroid Cancer.