Reliability of data-driven versus expert-driven composite indicators in between-hospital comparisons on quality of oesophagogastric cancer surgery: a population-based retrospective cohort study.
코호트
1/5 보강
[OBJECTIVE] To construct a data-driven composite from (a subset of) currently used quality indicators for oesophagogastric cancer surgery and to evaluate whether this approach enhances the reliability
- 표본수 (n) 3588
- 연구 설계 cohort study
APA
van der Linde M, Eijkenaar F, et al. (2025). Reliability of data-driven versus expert-driven composite indicators in between-hospital comparisons on quality of oesophagogastric cancer surgery: a population-based retrospective cohort study.. BMJ open, 15(11), e104832. https://doi.org/10.1136/bmjopen-2025-104832
MLA
van der Linde M, et al.. "Reliability of data-driven versus expert-driven composite indicators in between-hospital comparisons on quality of oesophagogastric cancer surgery: a population-based retrospective cohort study.." BMJ open, vol. 15, no. 11, 2025, pp. e104832.
PMID
41198208 ↗
Abstract 한글 요약
[OBJECTIVE] To construct a data-driven composite from (a subset of) currently used quality indicators for oesophagogastric cancer surgery and to evaluate whether this approach enhances the reliability of between-hospital comparisons on outcome relative to the expert-driven composite indicator 'textbook outcome (TO)'.
[DESIGN] In this retrospective cohort study, we applied Item Response Theory (IRT) to construct a data-driven continuous composite indicator reflecting a single latent variable-the quality of surgical care-and estimated latent variable scores for all individual patients. Reliability was compared between the expert-driven (TO) and data-driven (IRT) composite indicators.
[SETTING] All Dutch hospitals providing oesophagogastric cancer surgery.
[PARTICIPANTS] All patients who underwent oesophagectomy (n=3588) or gastrectomy (n=1782) between 2018 and 2022 as registered in the Dutch Upper GI Cancer Audit (DUCA).
[PRIMARY AND SECONDARY OUTCOME MEASURES] We evaluated the reliability of between-hospital comparisons using 'rankability', which quantifies the proportion of observed variation in indicator scores between hospitals not attributable to chance.
[RESULTS] Seven out of 15 quality indicators were included in the IRT composite indicator. Most of the patients were assigned the artificial maximum of the continuous quality score (ie, ceiling effect), resulting in similar average hospital scores. Relative to TO, rankability increased when using the IRT composite for oesophagectomy (57% vs 41%) but declined for gastrectomy (38% vs 47%).
[CONCLUSIONS] The selected seven quality indicators for oesophageal and gastric cancer surgery represent a single latent variable but are not yet optimal for differentiating surgical care quality due to ceiling effects. Despite using fewer indicators, the continuous IRT score showed a promising increase in rankability for oesophagectomy, suggesting that data-driven composite indicators may enhance hospital benchmarking reliability.
[DESIGN] In this retrospective cohort study, we applied Item Response Theory (IRT) to construct a data-driven continuous composite indicator reflecting a single latent variable-the quality of surgical care-and estimated latent variable scores for all individual patients. Reliability was compared between the expert-driven (TO) and data-driven (IRT) composite indicators.
[SETTING] All Dutch hospitals providing oesophagogastric cancer surgery.
[PARTICIPANTS] All patients who underwent oesophagectomy (n=3588) or gastrectomy (n=1782) between 2018 and 2022 as registered in the Dutch Upper GI Cancer Audit (DUCA).
[PRIMARY AND SECONDARY OUTCOME MEASURES] We evaluated the reliability of between-hospital comparisons using 'rankability', which quantifies the proportion of observed variation in indicator scores between hospitals not attributable to chance.
[RESULTS] Seven out of 15 quality indicators were included in the IRT composite indicator. Most of the patients were assigned the artificial maximum of the continuous quality score (ie, ceiling effect), resulting in similar average hospital scores. Relative to TO, rankability increased when using the IRT composite for oesophagectomy (57% vs 41%) but declined for gastrectomy (38% vs 47%).
[CONCLUSIONS] The selected seven quality indicators for oesophageal and gastric cancer surgery represent a single latent variable but are not yet optimal for differentiating surgical care quality due to ceiling effects. Despite using fewer indicators, the continuous IRT score showed a promising increase in rankability for oesophagectomy, suggesting that data-driven composite indicators may enhance hospital benchmarking reliability.
🏷️ 키워드 / MeSH 📖 같은 키워드 OA만
같은 제1저자의 인용 많은 논문 (1)
🏷️ 같은 키워드 · 무료전문 — 이 논문 MeSH/keyword 기반
- A Phase I Study of Hydroxychloroquine and Suba-Itraconazole in Men with Biochemical Relapse of Prostate Cancer (HITMAN-PC): Dose Escalation Results.
- Self-management of male urinary symptoms: qualitative findings from a primary care trial.
- Clinical and Liquid Biomarkers of 20-Year Prostate Cancer Risk in Men Aged 45 to 70 Years.
- Diagnostic accuracy of Ga-PSMA PET/CT versus multiparametric MRI for preoperative pelvic invasion in the patients with prostate cancer.
- Comprehensive analysis of androgen receptor splice variant target gene expression in prostate cancer.
- Clinical Presentation and Outcomes of Patients Undergoing Surgery for Thyroid Cancer.