본문으로 건너뛰기
← 뒤로

Identification and validation of an explainable machine learning model for hepatocellular carcinoma at high risk: a retrospective multicenter cohort study.

International journal of surgery (London, England) 2026 Vol.112(1) p. 1164-1176

Hao S, Liang X, Li X, Zuo M, Fu Y, Ouyang Y, Zhu G, Li C, Chen J, Wang H, An C, Li J, Liu W

📝 환자 설명용 한 줄

[BACKGROUND] Currently, there is still a lack of noninvasive, automated, and accurate machine learning (ML) model for prognostic risk stratification of high-risk hepatocellular carcinoma (hHCC) after

🔬 핵심 임상 통계 (초록에서 자동 추출 — 원문 검증 권장)
  • 표본수 (n) 1220
  • p-value P < 0.001

이 논문을 인용하기

BibTeX ↓ RIS ↓
APA Hao S, Liang X, et al. (2026). Identification and validation of an explainable machine learning model for hepatocellular carcinoma at high risk: a retrospective multicenter cohort study.. International journal of surgery (London, England), 112(1), 1164-1176. https://doi.org/10.1097/JS9.0000000000003480
MLA Hao S, et al.. "Identification and validation of an explainable machine learning model for hepatocellular carcinoma at high risk: a retrospective multicenter cohort study.." International journal of surgery (London, England), vol. 112, no. 1, 2026, pp. 1164-1176.
PMID 40990650

Abstract

[BACKGROUND] Currently, there is still a lack of noninvasive, automated, and accurate machine learning (ML) model for prognostic risk stratification of high-risk hepatocellular carcinoma (hHCC) after transarterial chemoembolization (TACE).

[PURPOSE] To develop and test an ML-based model that integrates various clinical variables for prediction of early recurrence (ER, defined as recurrence within 24 months after treatment) after TACE for hHCC.

[METHODS] From May 2008 to October 2022, consecutive patients with unresectable HCC who underwent initial TACE were reviewed at multicenter. We input 39 preoperative and 5 treatment-related variables into five ML-based models for ER prediction. Furthermore, we compared their performances using area under the receiver operating characteristic curve (AUC) with DeLong test between these ML models and the major staging systems including Barcelona Clinic Liver Cancer, China National Liver Cancer, American Joint Committee on Cancer, Japan Society of Hepatology, and Hong Kong Liver Cancer. Overall survival (OS) and recurrence-free survival (RFS) were analyzed using the Kaplan-Meier method with log-rank tests.

[RESULTS] A total of 2068 eligible patients with hHCC were categorized into the training set ( n = 1220), the internal test set ( n = 306), and the external test set ( n = 542). ER rate was found in 78.4%, 78.8 %, and 62.5% of patients in three sets. Among all ML models, the eXtreme gradient boosting (XGBoost) model using top seven variables yielded the highest AUC of 0.888 in the internal test set and 0.854 in the external test set. Furthermore, it yielded better discriminatory ability for RFS with higher time-dependent AUC than all major staging systems (DeLong test, all P < 0.001). Kaplan-Meier analysis revealed significant OS and RFS risk stratification using the XGBoost model in three sets, respectively.

[CONCLUSION] The XGBoost model demonstrated superior predictive performance compared with other four ML models and all clinical staging systems in identifying ER, which may support individualized TACE management by guiding therapy selection.

MeSH Terms

Humans; Carcinoma, Hepatocellular; Liver Neoplasms; Female; Male; Machine Learning; Retrospective Studies; Middle Aged; Chemoembolization, Therapeutic; Aged; Risk Assessment; Neoplasm Recurrence, Local; Prognosis

같은 제1저자의 인용 많은 논문 (5)