Identification and validation of an explainable machine learning model for hepatocellular carcinoma at high risk: a retrospective multicenter cohort study.
[BACKGROUND] Currently, there is still a lack of noninvasive, automated, and accurate machine learning (ML) model for prognostic risk stratification of high-risk hepatocellular carcinoma (hHCC) after
- 표본수 (n) 1220
- p-value P < 0.001
APA
Hao S, Liang X, et al. (2026). Identification and validation of an explainable machine learning model for hepatocellular carcinoma at high risk: a retrospective multicenter cohort study.. International journal of surgery (London, England), 112(1), 1164-1176. https://doi.org/10.1097/JS9.0000000000003480
MLA
Hao S, et al.. "Identification and validation of an explainable machine learning model for hepatocellular carcinoma at high risk: a retrospective multicenter cohort study.." International journal of surgery (London, England), vol. 112, no. 1, 2026, pp. 1164-1176.
PMID
40990650
Abstract
[BACKGROUND] Currently, there is still a lack of noninvasive, automated, and accurate machine learning (ML) model for prognostic risk stratification of high-risk hepatocellular carcinoma (hHCC) after transarterial chemoembolization (TACE).
[PURPOSE] To develop and test an ML-based model that integrates various clinical variables for prediction of early recurrence (ER, defined as recurrence within 24 months after treatment) after TACE for hHCC.
[METHODS] From May 2008 to October 2022, consecutive patients with unresectable HCC who underwent initial TACE were reviewed at multicenter. We input 39 preoperative and 5 treatment-related variables into five ML-based models for ER prediction. Furthermore, we compared their performances using area under the receiver operating characteristic curve (AUC) with DeLong test between these ML models and the major staging systems including Barcelona Clinic Liver Cancer, China National Liver Cancer, American Joint Committee on Cancer, Japan Society of Hepatology, and Hong Kong Liver Cancer. Overall survival (OS) and recurrence-free survival (RFS) were analyzed using the Kaplan-Meier method with log-rank tests.
[RESULTS] A total of 2068 eligible patients with hHCC were categorized into the training set ( n = 1220), the internal test set ( n = 306), and the external test set ( n = 542). ER rate was found in 78.4%, 78.8 %, and 62.5% of patients in three sets. Among all ML models, the eXtreme gradient boosting (XGBoost) model using top seven variables yielded the highest AUC of 0.888 in the internal test set and 0.854 in the external test set. Furthermore, it yielded better discriminatory ability for RFS with higher time-dependent AUC than all major staging systems (DeLong test, all P < 0.001). Kaplan-Meier analysis revealed significant OS and RFS risk stratification using the XGBoost model in three sets, respectively.
[CONCLUSION] The XGBoost model demonstrated superior predictive performance compared with other four ML models and all clinical staging systems in identifying ER, which may support individualized TACE management by guiding therapy selection.
[PURPOSE] To develop and test an ML-based model that integrates various clinical variables for prediction of early recurrence (ER, defined as recurrence within 24 months after treatment) after TACE for hHCC.
[METHODS] From May 2008 to October 2022, consecutive patients with unresectable HCC who underwent initial TACE were reviewed at multicenter. We input 39 preoperative and 5 treatment-related variables into five ML-based models for ER prediction. Furthermore, we compared their performances using area under the receiver operating characteristic curve (AUC) with DeLong test between these ML models and the major staging systems including Barcelona Clinic Liver Cancer, China National Liver Cancer, American Joint Committee on Cancer, Japan Society of Hepatology, and Hong Kong Liver Cancer. Overall survival (OS) and recurrence-free survival (RFS) were analyzed using the Kaplan-Meier method with log-rank tests.
[RESULTS] A total of 2068 eligible patients with hHCC were categorized into the training set ( n = 1220), the internal test set ( n = 306), and the external test set ( n = 542). ER rate was found in 78.4%, 78.8 %, and 62.5% of patients in three sets. Among all ML models, the eXtreme gradient boosting (XGBoost) model using top seven variables yielded the highest AUC of 0.888 in the internal test set and 0.854 in the external test set. Furthermore, it yielded better discriminatory ability for RFS with higher time-dependent AUC than all major staging systems (DeLong test, all P < 0.001). Kaplan-Meier analysis revealed significant OS and RFS risk stratification using the XGBoost model in three sets, respectively.
[CONCLUSION] The XGBoost model demonstrated superior predictive performance compared with other four ML models and all clinical staging systems in identifying ER, which may support individualized TACE management by guiding therapy selection.
MeSH Terms
Humans; Carcinoma, Hepatocellular; Liver Neoplasms; Female; Male; Machine Learning; Retrospective Studies; Middle Aged; Chemoembolization, Therapeutic; Aged; Risk Assessment; Neoplasm Recurrence, Local; Prognosis
같은 제1저자의 인용 많은 논문 (5)
- Exploring the potential of heterocyclic-containing tail approach in the development of novel covalent inhibitors dual-targeting EGFR and HER-2: design, synthesis and biological evaluation.
- PD-1 inhibitor improves radiosensitivity by tumor vessel normalization.
- Jiedu Xiaozhen Granules for Epidermal Growth Factor Receptor Tyrosine Kinase Inhibitor-Mediated Skin Toxicity: Protocol for a Randomized Controlled Trial.
- Migration-adjusted lung cancer burden in China: a population data-based Bayesian spatial modeling approach.
- Targeting rare oncogenic mutations in resectable non-small cell lung cancer: emerging perioperative strategies.