From complex algorithms to clinical practice: a multicenter machine learning model and simplified decision tree for predicting cachexia risk in gastric cancer.
[BACKGROUND] Cachexia is a frequent, specific metabolic syndrome that severely compromises survival in gastric cancer (GC).
- 표본수 (n) 920
- p-value P < 0.01
APA
Zhao J, Deng Y, et al. (2026). From complex algorithms to clinical practice: a multicenter machine learning model and simplified decision tree for predicting cachexia risk in gastric cancer.. Frontiers in oncology, 16, 1767547. https://doi.org/10.3389/fonc.2026.1767547
MLA
Zhao J, et al.. "From complex algorithms to clinical practice: a multicenter machine learning model and simplified decision tree for predicting cachexia risk in gastric cancer.." Frontiers in oncology, vol. 16, 2026, pp. 1767547.
PMID
41883970
Abstract
[BACKGROUND] Cachexia is a frequent, specific metabolic syndrome that severely compromises survival in gastric cancer (GC). While early diagnosis is paramount, existing screening methods are limited by complexity and suboptimal accuracy. There is an urgent need for an efficient, data-driven tool derived from routine clinical parameters.
[METHODS] In this multicenter retrospective study, we analyzed data from three independent hospitals. Variable selection was performed using univariable and multivariable analyses. We constructed and compared multiple machine learning (ML) models to predict cachexia risk. The models' discriminative ability, calibration, and clinical net benefit were comprehensively evaluated via AUC, calibration plots, and Decision Curve Analysis (DCA).
[RESULTS] The study included 1,570 GC patients (cachexia prevalence: 30.3%). Patients were divided into training (n=920), internal testing (n=350), and external validation (n=300) cohorts. Cachexia was significantly associated with poor nutritional status, elevated inflammation, and inferior overall survival (P < 0.01). The Random Forest (RF) model yielded the best performance, maintaining excellent stability across the internal test set (AUC = 0.898) and external validation set (AUC = 0.913). To enhance clinical utility, we further derived a simplified decision tree model based on three accessible markers: CA19-9, CEA, and albumin. This simplified tool retained high diagnostic accuracy (AUC > 0.783) and demonstrated significant positive net benefits in DCA.
[CONCLUSION] We successfully established and externally validated a high-performance ML model for predicting GC-associated cachexia. Crucially, the derived simplified decision tree offers a convenient, highly generalizable tool for clinicians to identify high-risk patients using routine laboratory tests, enabling earlier precision nutritional management.
[METHODS] In this multicenter retrospective study, we analyzed data from three independent hospitals. Variable selection was performed using univariable and multivariable analyses. We constructed and compared multiple machine learning (ML) models to predict cachexia risk. The models' discriminative ability, calibration, and clinical net benefit were comprehensively evaluated via AUC, calibration plots, and Decision Curve Analysis (DCA).
[RESULTS] The study included 1,570 GC patients (cachexia prevalence: 30.3%). Patients were divided into training (n=920), internal testing (n=350), and external validation (n=300) cohorts. Cachexia was significantly associated with poor nutritional status, elevated inflammation, and inferior overall survival (P < 0.01). The Random Forest (RF) model yielded the best performance, maintaining excellent stability across the internal test set (AUC = 0.898) and external validation set (AUC = 0.913). To enhance clinical utility, we further derived a simplified decision tree model based on three accessible markers: CA19-9, CEA, and albumin. This simplified tool retained high diagnostic accuracy (AUC > 0.783) and demonstrated significant positive net benefits in DCA.
[CONCLUSION] We successfully established and externally validated a high-performance ML model for predicting GC-associated cachexia. Crucially, the derived simplified decision tree offers a convenient, highly generalizable tool for clinicians to identify high-risk patients using routine laboratory tests, enabling earlier precision nutritional management.
같은 제1저자의 인용 많은 논문 (5)
- The Effect of CDT-Based Rehabilitation Nursing on Breast Cancer-related Lymphedema.
- Prognostic value of BMI, prognostic nutritional index, and CRP in patients with lymphoma after autologous hematopoietic stem cell transplantation.
- Analysis of independent prognostic factors for the occurrence and severity of heart failure after chemotherapy in patients with small cell lung cancer.
- Silymarin sensitizes human colorectal cancer cells to 5-ALA-mediated photodynamic therapy by enhancing cytotoxicity and apoptotic signaling in vitro.
- Exploratory Investigation Into Perioperative Treatment Strategies for Potentially Resectable Stage III-N2 Driver Gene-Negative Non-Small Cell Lung Cancer in the Immunotherapy Era.