Use of Large Language Models to Determine the Surveillance Colonoscopy Interval: A Bi-Institutional Validation Study.

Vedant Acharya; Shivan J. Mehta; Daniel A. Sussman; Vignesh Kumaresan; Jonathan England; Tessa S. Cook; S. Barry Issenberg; Amar R. Deshpande

doi:10.14309/ajg.0000000000003864

The American Journal of Gastroenterology, 2026, Vol. 121(4), pp. 950-963. Cited by 1.

TL;DR: The study measures LLM performance in determining the guideline-concordant postpolypectomy surveillance interval on a cohort of 1,000 real-world colonoscopy and pathology report impressions.

OpenAlex topics: Colorectal Cancer Screening and Detection; Data-Driven Disease Surveillance; AI in cancer detection

Cite this paper

APA: Vedant Acharya, Shivan J. Mehta, et al. (2026). Use of Large Language Models to Determine the Surveillance Colonoscopy Interval: A Bi-Institutional Validation Study. The American Journal of Gastroenterology, 121(4), 950-963. https://doi.org/10.14309/ajg.0000000000003864

MLA: Vedant Acharya, et al. "Use of Large Language Models to Determine the Surveillance Colonoscopy Interval: A Bi-Institutional Validation Study." The American Journal of Gastroenterology, vol. 121, no. 4, 2026, pp. 950-963.

PMID 41351229

Abstract

[INTRODUCTION] To determine the appropriate postpolypectomy colonoscopy surveillance interval, endoscopists synthesize information from colonoscopy and pathology report impressions and subsequently apply guideline-recommended interval algorithms, such as those developed by the United States Multi-Society Task Force. Given the complexity of these guidelines, this manual process is error-prone, necessitating automated tools, including large language models (LLMs), to improve guideline adherence. The primary aim of this study was to identify the LLM performance in determining the guideline-concordant postpolypectomy surveillance interval on a cohort of 1,000 real-world colonoscopy and pathology report impressions.

[METHODS] The data of patients who underwent a screening or surveillance colonoscopy in 2023-2024 at 2 academic health centers were included. Using a custom prompt outlining the US Multi-Society Task Force postpolypectomy surveillance algorithm, the LLM (GPT-4o) was asked to determine the appropriate surveillance interval for all 1,000 examples in the data set. This experiment, using the same model, prompt, and data set, was repeated 10 times; all experiments were conducted between January 27, 2025, and February 3, 2025.

[RESULTS] Across 10 experiments, the average accuracy was 94.6%. There was no significant difference in accuracy based on the institution from which the data originated or the presence of synchronous upper gastrointestinal endoscopy data within the pathology report impression. Examples with 1-3 colon polyps had an average accuracy of 95.8% whereas examples with 4 or more colon polyps had an average accuracy of 88.2%, combined P value < 0.001.

[DISCUSSION] LLMs with a custom prompt achieve consistently high accuracy in determining the guideline-based surveillance colonoscopy interval.
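
The abstract reports accuracy averaged over 10 repeated experiments, plus a comparison between examples with 1-3 polyps and examples with 4 or more. The sketch below shows one way such figures could be computed from stored predictions. It is not the authors' code: the function names, the data layout (one list of predicted intervals per run), and the toy values are all assumptions made for illustration, and the interval numbers are arbitrary placeholders rather than guideline recommendations.

```python
from statistics import mean


def accuracy(predictions, reference):
    """Fraction of examples whose predicted interval matches the reference."""
    if len(predictions) != len(reference):
        raise ValueError("predictions and reference must have the same length")
    matches = sum(p == r for p, r in zip(predictions, reference))
    return matches / len(reference)


def summarize_runs(runs, reference, polyp_counts):
    """Average accuracy over repeated runs, overall and by polyp-count subgroup.

    runs:         list of prediction lists, one per repeated experiment
    reference:    guideline-concordant interval for each example
    polyp_counts: number of colon polyps for each example
    """
    per_run = [accuracy(run, reference) for run in runs]

    # Subgroups mirror the split reported in the abstract (1-3 vs. 4 or more).
    few = [i for i, n in enumerate(polyp_counts) if 1 <= n <= 3]
    many = [i for i, n in enumerate(polyp_counts) if n >= 4]

    def subgroup_mean(indices):
        if not indices:
            return None  # no examples in this subgroup
        ref = [reference[i] for i in indices]
        return mean([accuracy([run[i] for i in indices], ref) for run in runs])

    return {
        "per_run": per_run,
        "mean": mean(per_run),
        "polyps_1_3": subgroup_mean(few),
        "polyps_4_plus": subgroup_mean(many),
    }


# Toy data (hypothetical): 4 examples, 2 runs. Values are placeholder labels.
reference = [10, 3, 5, 1]
polyp_counts = [1, 2, 4, 5]
runs = [
    [10, 3, 5, 1],  # run 1: all four match the reference
    [10, 3, 5, 3],  # run 2: the last example differs
]

summary = summarize_runs(runs, reference, polyp_counts)
print(summary)
```

Averaging per-run accuracies, rather than pooling all predictions, keeps each repeated experiment visible, which matters when the same model and prompt can return different answers across runs. A significance test for the subgroup difference, such as the one behind the reported P value, is outside this sketch.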
