Automated Flow and local LLM-Driven clinical Context Engineering: Precision colorectal cancer recurrence registry.

Yi-Wen Yang; Che-Yuan Chang; Yu-zu Lin; Hou-Hsuan Cheng; Sheng-Chieh Huang; Hung-Hsin Lin; Chun-Chi Lin; Yuan Lan; Huann-Sheng Wang; Shih-Ching Chang; Shung-Haur Yang; Wei-Shone Chen; Jeng-Kai Jiang

doi:10.1016/j.ijmedinf.2026.106383

← 뒤로

Automated Flow and local LLM-Driven clinical Context Engineering: Precision colorectal cancer recurrence registry.

2/5 보강

International journal of medical informatics 2026 Vol.213() p. 106383 Colorectal Cancer Screening and Dete

OpenAlex 토픽 · Colorectal Cancer Screening and Detection AI in cancer detection Machine Learning in Healthcare

Yang YW, Chang CY, Lin YZ, Cheng HH, Huang SC, Lin HH, Lin CC, Lan YT, Wang HS, Chang SC, Yang SH, Chen WS, Jiang JK

원문 ↗ DOI ↗ BibTeX ↓ RIS ↓

📝 환자 설명용 한 줄

[OBJECTIVE] This study develops and validates a deployable, privacy-preserving automated tool to address the labor-intensive nature of colorectal cancer (CRC) recurrence registration.

🔬 핵심 임상 통계 (초록에서 자동 추출 — 원문 검증 권장)

표본수 (n) 602
Specificity 87.7%

이 논문을 인용하기

BibTeX ↓ RIS ↓

APA Yi-Wen Yang, Che-Yuan Chang, et al. (2026). Automated Flow and local LLM-Driven clinical Context Engineering: Precision colorectal cancer recurrence registry.. International journal of medical informatics, 213, 106383. https://doi.org/10.1016/j.ijmedinf.2026.106383

MLA Yi-Wen Yang, et al.. "Automated Flow and local LLM-Driven clinical Context Engineering: Precision colorectal cancer recurrence registry.." International journal of medical informatics, vol. 213, 2026, pp. 106383.

PMID 41849918

DOI 10.1016/j.ijmedinf.2026.106383

Abstract

[OBJECTIVE] This study develops and validates a deployable, privacy-preserving automated tool to address the labor-intensive nature of colorectal cancer (CRC) recurrence registration. Our tool utilizes a reproducible, two-stage 'Clinical Context Engineering' workflow to mimic expert clinical reasoning and overcome the limitations of handling longitudinal clinical data ambiguity.

[METHODS] We retrospectively studied 3053 CRC patients (2010-2018). A local Large Language Model (LLM) (Qwen3:14B) analyzed ∼ 19,900 pathology and ∼ 43,900 imaging reports using iterative, patch-based analysis ("raw LLM"). Clinical validation rules were applied to generate "rule-based LLM" output, enhancing explainability and trustworthiness. Both automated methods and the manual Taiwan Cancer Registry (TCR) database were compared against a 20% manual reference standard (N = 602). Full prompts and validation code are provided for complete reproducibility.

[RESULTS] Under a strict 60-day temporal tolerance, the Rule-Based LLM achieved 90.7% accuracy, comparable to standard TCR processes (92.0%), and 77.2% sensitivity. The application of clinical validation rules significantly improved specificity from 87.7% (Raw LLM) to 93.9% (Rule-Based LLM). In time window analysis, the Rule-Based LLM identified 87.1% of recurrences within 60 days of the reference date.

[CONCLUSION] Our locally deployed, privacy-preserving, and explainable Clinical Context Engineering framework offers a viable, non-inferior alternative to standard TCR processes, reducing workload while maintaining data quality and fostering trust in AI-assisted cancer registry automation.

MeSH Terms

Humans; Colorectal Neoplasms; Registries; Neoplasm Recurrence, Local; Retrospective Studies; Taiwan; Female; Male; Reproducibility of Results; Natural Language Processing