Assessing the representativeness of single-center EMR data on ten cancer types: A comparative analysis with national statistics from South Korea (2011-2021).
APA
Jung-Hyun Won, Howard R. Lee (2026). Assessing the representativeness of single-center EMR data on ten cancer types: A comparative analysis with national statistics from South Korea (2011-2021). International Journal of Medical Informatics, 214, 106401. https://doi.org/10.1016/j.ijmedinf.2026.106401
PMID: 41895025
Abstract
[BACKGROUND] Real-world data (RWD) from electronic medical records (EMRs) is increasingly utilized in oncology to complement evidence from clinical trials by reflecting routine clinical practice and diverse patient populations. However, many EMR-based studies rely on single-center data, limiting the generalizability of their findings. We aimed to evaluate the representativeness of single-center EMR data from Seoul National University Hospital (SNUH) by comparing it with national cancer data from the Korean Statistical Information Service (KOSIS).
[METHODS] We compared annual cancer statistics from the SNUH EMR and KOSIS (2011-2021) for ten cancer types: breast, gallbladder/biliary tract, gastric, kidney, liver, lung, pancreatic, prostate, and thyroid cancers, and leukemia. We calculated the coverage proportion of cancer cases in the SNUH EMR relative to KOSIS. Differences in age and gender distributions between the two databases were analyzed. Annual trends in cancer cases were compared between the two databases.
[RESULTS] From 2011 to 2021, SNUH data included 8.2% of national incident cases and 10.7% of national prevalent cases, with high coverage for liver (20.4%) and pancreatic (20.3%) cancers. No significant differences in age and gender distributions were found for any cancer type (p > 0.05), with high cosine similarity (>0.8). Strong correlations in annual trends were observed for breast, lung, and pancreatic cancers (r > 0.9), while negative correlations were found for thyroid cancer prevalence (r = -0.62) and liver cancer incidence (r = -0.59).
[CONCLUSION] Single-center EMR data can be a valuable resource for oncology research in South Korea. However, external factors including changes in clinical guidelines should be considered when generalizing findings from such data to broader populations.
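The three quantities reported above (coverage proportion, cosine similarity of demographic distributions, and correlation of annual trends) can be sketched as follows. This is a minimal illustration, not the authors' code: the abstract does not state which correlation coefficient was used, so Pearson's r is assumed here, and the function names and all input numbers are hypothetical placeholders rather than SNUH or KOSIS data.

```python
import math


def coverage_proportion(center_cases, national_cases):
    """Percentage of national cases that are captured in the single-center data."""
    if national_cases <= 0:
        raise ValueError("national_cases must be positive")
    return 100.0 * center_cases / national_cases


def cosine_similarity(u, v):
    """Cosine similarity between two count vectors (e.g. cases per age/gender band)."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        raise ValueError("vectors must be non-zero")
    return dot / (norm_u * norm_v)


def pearson_r(x, y):
    """Pearson correlation between two annual series (assumed choice of coefficient)."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("series must have equal length of at least 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("series must not be constant")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical placeholder inputs, for illustration only.
center_by_age_band = [120, 340, 510, 280]        # single-center cases per age band
national_by_age_band = [1500, 4100, 6000, 3600]  # national cases per age band
center_by_year = [900, 950, 1010, 1080]          # single-center annual cases
national_by_year = [11000, 11400, 12100, 12900]  # national annual cases

print(coverage_proportion(sum(center_by_age_band), sum(national_by_age_band)))
print(cosine_similarity(center_by_age_band, national_by_age_band))
print(pearson_r(center_by_year, national_by_year))
```

Cosine similarity compares only the shape of the two distributions and ignores their scale, which is why a single center holding roughly a tenth of national cases can still score close to 1. A negative r, as reported for thyroid cancer prevalence and liver cancer incidence, means the single-center and national annual counts moved in opposite directions over the period.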