Real world deployment of a pancreatic cancer risk model: impact of refitting, imputation, and computational burden.
[BACKGROUND] Early detection is a major clinical challenge in pancreatic cancer due to its nonspecific symptoms and frequent late-stage diagnosis.
- Sensitivity 18.41%
APA
Chen W, Zhou B, et al. (2026). Real world deployment of a pancreatic cancer risk model: impact of refitting, imputation, and computational burden.. EBioMedicine, 124, 106118. https://doi.org/10.1016/j.ebiom.2025.106118
MLA
Chen W, et al.. "Real world deployment of a pancreatic cancer risk model: impact of refitting, imputation, and computational burden.." EBioMedicine, vol. 124, 2026, pp. 106118.
PMID
41512677
Abstract
[BACKGROUND] Early detection is a major clinical challenge in pancreatic cancer due to its nonspecific symptoms and frequent late-stage diagnosis. While predictive models using electronic health record (EHR) data show promise, their real world implementation remains underexplored. We previously developed a random survival forest (RSF) model to estimate pancreatic cancer risk using structured EHR data from 2007 to 2017. This study evaluates practical considerations for deploying such a model in a prospective clinical context.
[METHODS] We refit the original RSF model using a cohort from 2018 to 2019 and evaluated its performance on a 2020 cohort. We assessed how model refitting and different imputation strategies influenced predictive performance and compared execution times to evaluate computational feasibility. Three imputation strategies were tested: sub-model estimation (SME), stacked multiple imputation (SMI), and imputation via fixed chained equations (IFCE). To simulate real time use, we applied the model to 53 sequential weekly patient batches (with average batch size 190,206).
[FINDINGS] Refitting improved discrimination and calibration. Without refitting, the C-index ranged from 0.69 to 0.84 depending on imputation method; with refitting, it ranged from 0.79 to 0.83. The IFCE method achieved the best balance between performance (C-index: 0.83 with refit) and runtime (19.54 min). SME had the highest C-index (0.85) and sensitivity (18.41%) but required construction of multiple sub-models. SMI was the most computationally intensive, limiting its scalability in routine use. Calibration improved markedly with refitting. Model performance differed across racial and ethnic groups; calibration was poorest among Black patients but improved with SMI. Execution time varied substantially across methods.
[INTERPRETATION] Model refitting and appropriate handling of missing data improve the real world performance of predictive models. Among imputation approaches, IFCE offers the best trade-off between computational efficiency and predictive accuracy. These findings provide practical, implementation-focused guidance for deploying risk prediction models in prospective clinical settings.
[FUNDING] Research reported in this publication was supported by the National Cancer Institute of the National Institutes of Health under Award Number R01CA230442.
[METHODS] We refit the original RSF model using a cohort from 2018 to 2019 and evaluated its performance on a 2020 cohort. We assessed how model refitting and different imputation strategies influenced predictive performance and compared execution times to evaluate computational feasibility. Three imputation strategies were tested: sub-model estimation (SME), stacked multiple imputation (SMI), and imputation via fixed chained equations (IFCE). To simulate real time use, we applied the model to 53 sequential weekly patient batches (with average batch size 190,206).
[FINDINGS] Refitting improved discrimination and calibration. Without refitting, the C-index ranged from 0.69 to 0.84 depending on imputation method; with refitting, it ranged from 0.79 to 0.83. The IFCE method achieved the best balance between performance (C-index: 0.83 with refit) and runtime (19.54 min). SME had the highest C-index (0.85) and sensitivity (18.41%) but required construction of multiple sub-models. SMI was the most computationally intensive, limiting its scalability in routine use. Calibration improved markedly with refitting. Model performance differed across racial and ethnic groups; calibration was poorest among Black patients but improved with SMI. Execution time varied substantially across methods.
[INTERPRETATION] Model refitting and appropriate handling of missing data improve the real world performance of predictive models. Among imputation approaches, IFCE offers the best trade-off between computational efficiency and predictive accuracy. These findings provide practical, implementation-focused guidance for deploying risk prediction models in prospective clinical settings.
[FUNDING] Research reported in this publication was supported by the National Cancer Institute of the National Institutes of Health under Award Number R01CA230442.
MeSH Terms
Humans; Pancreatic Neoplasms; Male; Female; Electronic Health Records; Middle Aged; Risk Assessment; Aged; Risk Factors; Computational Biology
같은 제1저자의 인용 많은 논문 (5)
- FcγRIIb deficiency inhibits tumor development by attenuating the immunosuppressive phenotype of MDSCs.
- Effectiveness of KAP-based nursing program in managing digestive symptoms in colorectal cancer patients undergoing chemotherapy: A retrospective controlled study.
- Oligometastatic Prostate and Bladder Cancer: An Integrative Narrative Review.
- IRF1 suppresses gastric tumorigenesis via dual PI3K/AKT-ERK pathway modulation and functional antagonism of oncogenic MX2.
- Androgen Receptor Signaling Inhibitors for Metastatic Hormone Sensitive Prostate Cancer in Asians: Indirect Comparison.