Predicting hepatocellular carcinoma in people with hepatitis B: a comparison between Cox proportional hazard and machine learning models.
2/5 보강
PICO 자동 추출 (휴리스틱, conf 2/4)
유사 논문P · Population 대상 환자/모집단
환자: chronic HBV infection
I · Intervention 중재 / 시술
추출되지 않음
C · Comparison 대조 / 비교
추출되지 않음
O · Outcome 결과 / 결론
[CONCLUSIONS] In this dataset with a limited sample size and strongly imbalanced outcome, traditional CPH models provided robust, interpretable, and computationally efficient predictions for HCC risk. ML and DL methods did not outperform traditional models, reinforcing the validity of traditional statistical approaches in small to medium datasets.
OpenAlex 토픽 ·
Hepatitis B Virus Studies
Liver Disease Diagnosis and Treatment
Artificial Intelligence in Healthcare
[BACKGROUND AND AIM] Hepatocellular carcinoma (HCC) is a leading cause of cancer-related death worldwide, with chronic hepatitis B virus (HBV) infection being a major risk factor.
APA
Clémence Ramier, Marie Guyomard, et al. (2026). Predicting hepatocellular carcinoma in people with hepatitis B: a comparison between Cox proportional hazard and machine learning models.. Journal of epidemiology and population health, 74(4), 203387. https://doi.org/10.1016/j.jeph.2026.203387
MLA
Clémence Ramier, et al.. "Predicting hepatocellular carcinoma in people with hepatitis B: a comparison between Cox proportional hazard and machine learning models.." Journal of epidemiology and population health, vol. 74, no. 4, 2026, pp. 203387.
PMID
41962179 ↗
Abstract 한글 요약
[BACKGROUND AND AIM] Hepatocellular carcinoma (HCC) is a leading cause of cancer-related death worldwide, with chronic hepatitis B virus (HBV) infection being a major risk factor. To date, existing predictive scores of HCC are mainly based on traditional Cox proportional hazard (CPH) models. This study aimed to compare the variable selection process and performance of CPH models with those of machine learning (ML) and deep learning (DL) algorithms in predicting HCC among patients with chronic HBV infection.
[METHODS] We used data from 4,370 individuals with chronic HBV infection enrolled in the French prospective multicentre ANRS CO22 HEPATHER cohort, of which 56 (1.3%) developed an HCC. Two published CPH-based scores (ADAPTT and SADAPTT) were compared to Random Survival Forest (RSF), Survival Support Vector Machine (SVM), Survival XGBoost, and DeepSurv algorithms. Models were evaluated using Harrell's C-index, Inverse-Probability-of-Censoring Weighting win ratio statistic, and time-dependent area under the ROC curve at 3, 5, and 8 years. The same set of covariables was used to build all the models.
[RESULTS] CPH models demonstrated similar or higher performances (C-index [95% confidence interval]: 0.84 [0.82-0.85]) for HCC prediction compared to ML and DL models, with less overfitting. Survival SVM and RSF performed similarly (0.81 [0.79-0.83] and 0.81 [0.79-0.82], respectively) without outperforming CPH models. Variable selection was consistent across top-performing models, though CPH models more effectively captured the predictive value of certain behavioural factors, such as soft drink intake.
[CONCLUSIONS] In this dataset with a limited sample size and strongly imbalanced outcome, traditional CPH models provided robust, interpretable, and computationally efficient predictions for HCC risk. ML and DL methods did not outperform traditional models, reinforcing the validity of traditional statistical approaches in small to medium datasets.
[METHODS] We used data from 4,370 individuals with chronic HBV infection enrolled in the French prospective multicentre ANRS CO22 HEPATHER cohort, of which 56 (1.3%) developed an HCC. Two published CPH-based scores (ADAPTT and SADAPTT) were compared to Random Survival Forest (RSF), Survival Support Vector Machine (SVM), Survival XGBoost, and DeepSurv algorithms. Models were evaluated using Harrell's C-index, Inverse-Probability-of-Censoring Weighting win ratio statistic, and time-dependent area under the ROC curve at 3, 5, and 8 years. The same set of covariables was used to build all the models.
[RESULTS] CPH models demonstrated similar or higher performances (C-index [95% confidence interval]: 0.84 [0.82-0.85]) for HCC prediction compared to ML and DL models, with less overfitting. Survival SVM and RSF performed similarly (0.81 [0.79-0.83] and 0.81 [0.79-0.82], respectively) without outperforming CPH models. Variable selection was consistent across top-performing models, though CPH models more effectively captured the predictive value of certain behavioural factors, such as soft drink intake.
[CONCLUSIONS] In this dataset with a limited sample size and strongly imbalanced outcome, traditional CPH models provided robust, interpretable, and computationally efficient predictions for HCC risk. ML and DL methods did not outperform traditional models, reinforcing the validity of traditional statistical approaches in small to medium datasets.
🏷️ 키워드 / MeSH 📖 같은 키워드 OA만
🏷️ 같은 키워드 · 무료전문 — 이 논문 MeSH/keyword 기반
- Raman Spectroscopic Signatures of Hepatic Carcinoma: Progress and Future Prospect.
- Methionine metabolism and the NOP2 methyltransferase are essential for MYC-Driven liver tumorigenesis.
- Next-generation dynamic and combinatorial nanotherapies for liver cancer: mechanisms, current advances and future perspectives.
- Deficiency of PTEN Confers Hypersensitivity to Fatty Acid-Mediated ER Stress in Transformed Hepatocytes.
- Targeting Fatty Acids in Liver Cancer: Molecular Insights and Drug Approaches.
- Observed Immune Response Following Histotripsy in Metastatic Hepatocellular Carcinoma.