Multi-center benchmarking of large language models for clinical decision support in lung cancer screening.
단면연구
1/5 보강
Large language models (LLMs) are increasingly explored for clinical applications, but their ability to generate management recommendations for lung cancer screening remains uncertain.
- 연구 설계 cross-sectional
APA
Duan Z, Huang X, et al. (2025). Multi-center benchmarking of large language models for clinical decision support in lung cancer screening.. Cell reports. Medicine, 6(12), 102465. https://doi.org/10.1016/j.xcrm.2025.102465
MLA
Duan Z, et al.. "Multi-center benchmarking of large language models for clinical decision support in lung cancer screening.." Cell reports. Medicine, vol. 6, no. 12, 2025, pp. 102465.
PMID
41274285 ↗
Abstract 한글 요약
Large language models (LLMs) are increasingly explored for clinical applications, but their ability to generate management recommendations for lung cancer screening remains uncertain. In this cross-sectional, multi-center study, 148 anonymized low-dose computed tomography (CT) reports from three healthcare institutions are used to assess the readability, accuracy, and consistency of four widely adopted models (GPT-3.5, GPT-4, Claude 3 Sonnet, and Claude 3 Opus). Among them, Claude 3 Opus produces the most readable recommendations, while GPT-4 achieves the highest clinical accuracy. Importantly, performance dose not differ significantly across institutions, underscoring the robustness of these models to variations in reporting templates and their utility in diverse healthcare settings. In an exploratory analysis, two state-of-the-art models, proprietary GPT-4o and its open-source counterpart DeepSeek-R1, show comparable performance to GPT-4, outperforming GPT-3.5. These findings highlight the potential role of LLMs to enhance clinical decision support in lung cancer screening across diverse healthcare settings.
🏷️ 키워드 / MeSH 📖 같은 키워드 OA만
같은 제1저자의 인용 많은 논문 (5)
- Development and validation of a machine learning-based model for predicting sentinel lymph node metastasis in patients with clinically node-negative breast cancer following neoadjuvant chemotherapy.
- Identification and experimental validation of core genes associated with breast cancer brain metastasis via machine learning.
- Correction: PDZK1 inhibits MRP2-mediated oxaliplatin chemosensitivity in hepatocellular carcinoma.
- Relationships among social support, decision self-efficacy, and decision regret in colorectal chemotherapy cancer patients: A mediating model.
- PROX1 is an early driver of lineage plasticity in prostate cancer.
🏷️ 같은 키워드 · 무료전문 — 이 논문 MeSH/keyword 기반
- A Phase I Study of Hydroxychloroquine and Suba-Itraconazole in Men with Biochemical Relapse of Prostate Cancer (HITMAN-PC): Dose Escalation Results.
- Self-management of male urinary symptoms: qualitative findings from a primary care trial.
- Clinical and Liquid Biomarkers of 20-Year Prostate Cancer Risk in Men Aged 45 to 70 Years.
- Diagnostic accuracy of Ga-PSMA PET/CT versus multiparametric MRI for preoperative pelvic invasion in the patients with prostate cancer.
- Association of patient health education with the postoperative health related quality of life in low- intermediate recurrence risk differentiated thyroid cancer patients.
- Early local immune activation following intra-operative radiotherapy in human breast tissue.