Assessing in-hospital mortality risk in ICU lung cancer patients using machine learning: An analysis based on the MIMIC-IV database.

Wang J; Lin L; Qiu LP; Zheng LL; Wu LX; Lv H; Xie H

doi:10.1371/journal.pone.0341259

← 뒤로

Assessing in-hospital mortality risk in ICU lung cancer patients using machine learning: An analysis based on the MIMIC-IV database.

PloS one 2026 Vol.21(1) p. e0341259

Wang J, Lin L, Qiu LP, Zheng LL, Wu LX, Lv H, Xie H

PMC 전문 ↗ 원문 ↗ DOI ↗ BibTeX ↓ RIS ↓

📝 환자 설명용 한 줄

[BACKGROUND] Patients with advanced lung cancer admitted to the intensive care unit (ICU) face a substantially elevated risk of in-hospital mortality.

🔬 핵심 임상 통계 (초록에서 자동 추출 — 원문 검증 권장)

95% CI 0.840-0.891

이 논문을 인용하기

BibTeX ↓ RIS ↓

APA Wang J, Lin L, et al. (2026). Assessing in-hospital mortality risk in ICU lung cancer patients using machine learning: An analysis based on the MIMIC-IV database.. PloS one, 21(1), e0341259. https://doi.org/10.1371/journal.pone.0341259

MLA Wang J, et al.. "Assessing in-hospital mortality risk in ICU lung cancer patients using machine learning: An analysis based on the MIMIC-IV database.." PloS one, vol. 21, no. 1, 2026, pp. e0341259.

PMID 41569981

DOI 10.1371/journal.pone.0341259

Abstract

[BACKGROUND] Patients with advanced lung cancer admitted to the intensive care unit (ICU) face a substantially elevated risk of in-hospital mortality. Early identification of high-risk individuals is essential to support timely clinical decision-making. This study aimed to develop and validate a predictive model using machine learning (ML) techniques to estimate in-hospital mortality in this patient population.

[METHODS] Clinical data were obtained from the Medical Information Mart for Intensive Care-IV (MIMIC-IV) database. Feature selection was performed using least absolute shrinkage and selection operator (LASSO) regression, enabling the construction of eight ML models: logistic regression (LR), support vector machine (SVM), gradient boosting machine (GBM), artificial neural network (ANN), extreme gradient boosting (XGBoost), k-nearest neighbors (k-NN), adaptive boosting (AdaBoost), and random forest (RF). Model performance was assessed using the area under the receiver operating characteristic curve (AUC), as well as accuracy, sensitivity, specificity, and F1 score. Discrimination, calibration, and clinical utility were also evaluated. The final model incorporated 27 clinically interpretable variables, including not only established severity scores (e.g., SAPS II) but also dynamic treatment factors (e.g., vasopressin, mechanical ventilation duration) that reflect real-world ICU practice. SHAP analysis was employed to enhance interpretability, allowing clinicians to understand both the magnitude and directionality of key predictors-an improvement over black-box ML applications in prior studies.

[RESULTS] Among the 1,755 patients included, 368 (21%) died during hospitalization in the training cohort.Notably, older individuals, particularly those of Caucasian descent, demonstrated a higher susceptibility to mortality during their hospital stay. Lasso regression revealed that 27 variables demonstrated a significant correlation with lung cancer, such as gender, hospital stay duration The XGBoost model achieved the highest predictive performance, achieving an accuracy of 0.783, an F1 score of 0.595, and an AUC of 0.865 (95% CI: 0.840-0.891)within the training cohort. The performance metrics for the test cohort reflected similar trends, with an accuracy of 0.719, an F1 score of 0.543, and an AUC of 0.790(95% CI: 0.741-0.840). Key predictors identified consistently across models (LR, SVM, ANN, and XGBoost) included hospital stay duration, Simplified Acute Physiology Score II (SAPS II), use of norepinephrine and vasopressin, prothrombin time (PT), mechanical ventilation duration, white blood cell count (WBC), and blood urea nitrogen (BUN). The SHAP summary plot further illustrated the direction and magnitude of influence for the top 15 predictors.

[CONCLUSION] The XGBoost-based model showed the best performance in predicting in-hospital mortality among critically ill lung cancer patients. Hospital stay duration and SAPS II score emerged as the most influential predictors,which can serve as the basis for a simplified clinical risk score. These findings may support early risk stratification and guide clinical decision-making in the ICU. The analysis, relying exclusively on internal divisions from MIMIC-IV, restricts the model's generalizability and, consequently, its applicability in broader clinical contexts.

MeSH Terms

Humans; Lung Neoplasms; Male; Female; Intensive Care Units; Machine Learning; Hospital Mortality; Aged; Middle Aged; Databases, Factual; ROC Curve; Risk Assessment; Neural Networks, Computer; Risk Factors; Support Vector Machine; Aged, 80 and over

같은 제1저자의 인용 많은 논문 (5)

"A diamond-shaped" penoplasty technique with or without concurrent suprapubic liposuction for adult-acquired buried penis: clinical outcomes and patient satisfaction rates.
Asian journal of andrology 2025 cited 1
A Systematic Review of Patient-Reported Outcomes for Cosmetic Indications of Botulinum Toxin Treatment.
Dermatologic surgery : official publication for American Society for Dermatologic Surgery [et al.] 2019 cited 1
Impact of diagnosis-related group systems on inpatient expenditures and medical quality for children with leukemia: evidence from real-world data.
BMC health services research 2026
A preliminary study to evaluate efficacy and safety of Lugol's solution following radioiodine for remnant ablation in differentiated thyroid cancer.
Frontiers in oncology 2026
Lentinan inhibits breast cancer cell growth through the dual downregulation of tumor-promoting effectors CD133 and SCGB2A2.
International journal of biological macromolecules 2026