[A comparative study on the application of DeepSeek-R1 and ChatGPT in multidisciplinary treatment decision-making for advanced gastric cancer].

Zhonghua wei chang wai ke za zhi = Chinese journal of gastrointestinal surgery, 2026, Vol. 29(1), p. 114-119

Huang JB, Li HZ, Zong Z, Liu QL, Gu TF, Liang B


Cite this article

APA: Huang JB, Li HZ, et al. (2026). [A comparative study on the application of DeepSeek-R1 and ChatGPT in multidisciplinary treatment decision-making for advanced gastric cancer]. Zhonghua wei chang wai ke za zhi = Chinese journal of gastrointestinal surgery, 29(1), 114-119. https://doi.org/10.3760/cma.j.cn441530-20250409-00149
MLA: Huang JB, et al. "[A comparative study on the application of DeepSeek-R1 and ChatGPT in multidisciplinary treatment decision-making for advanced gastric cancer]." Zhonghua wei chang wai ke za zhi = Chinese journal of gastrointestinal surgery, vol. 29, no. 1, 2026, pp. 114-119.
PMID: 41566190

Abstract

The aim of this study was to compare the accuracy and comprehensiveness of DeepSeek-R1 and ChatGPT-4o in generating treatment recommendations for advanced gastric cancer. The study comprised three steps: (1) evaluating the answers to ten key clinical questions; (2) analyzing clinical cases from the multidisciplinary team (MDT) of our center; and (3) reviewing rare gastric cancer cases retrieved from PubMed. The case material consisted of MDT data from 95 patients with advanced gastric cancer treated at the Second Affiliated Hospital of Nanchang University from November 2022 to July 2024, together with 14 rare cases retrieved from PubMed. Prompts based on these cases were submitted to DeepSeek-R1 and ChatGPT-4o in a standardized format. The accuracy and completeness of the outputs were scored on a structured 4-point Likert scale, and inter-rater consistency was calculated to ensure the objectivity of the evaluation.

DeepSeek-R1 outperformed ChatGPT-4o in both accuracy and completeness on the ten key clinical questions, the practical MDT cases from our center, and the rare cases from PubMed. Stratified analysis showed that DeepSeek-R1 had advantages in answers related to surgical recommendations, chemotherapy suggestions, and chemotherapy regimens. Inter-rater reliability was high (key clinical questions: W=0.696 for accuracy and W=0.632 for completeness; practical MDT cases from our center: W=0.657 and W=0.634, respectively; rare cases from PubMed: W=0.683 for accuracy; all P<0.001).

DeepSeek-R1 performs slightly better than ChatGPT-4o in generating treatment recommendations for advanced gastric cancer cases.
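The abstract reports inter-rater agreement as W but does not name the statistic. Assuming W denotes Kendall's coefficient of concordance, a minimal sketch of how such a value could be computed from Likert scores is given below. The function names and the example scores are hypothetical illustrations, not the authors' code or data, and the tie correction is included because a 4-point scale produces many tied ratings.

```python
from collections import Counter


def average_ranks(scores):
    """Rank scores from 1..n, giving tied values the mean of their positions."""
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    ranks = [0.0] * len(scores)
    pos = 0
    while pos < len(order):
        end = pos
        # Extend the run while the next value equals the current one.
        while end + 1 < len(order) and scores[order[end + 1]] == scores[order[pos]]:
            end += 1
        mean_rank = (pos + end) / 2 + 1  # positions are 0-based, ranks are 1-based
        for k in range(pos, end + 1):
            ranks[order[k]] = mean_rank
        pos = end + 1
    return ranks


def kendalls_w(ratings):
    """Tie-corrected Kendall's W; ratings[j][i] is rater j's score for item i."""
    m = len(ratings)      # number of raters
    n = len(ratings[0])   # number of rated items (answers or cases)
    rank_rows = [average_ranks(row) for row in ratings]
    totals = [sum(row[i] for row in rank_rows) for i in range(n)]
    mean_total = m * (n + 1) / 2
    s = sum((t - mean_total) ** 2 for t in totals)
    # Tie term: sum of (t^3 - t) over every group of tied scores, per rater.
    ties = sum(sum(c ** 3 - c for c in Counter(row).values()) for row in ratings)
    denom = m ** 2 * (n ** 3 - n) - m * ties
    if denom == 0:
        raise ValueError("W is undefined when every rater gives identical scores")
    return 12 * s / denom


# Hypothetical accuracy scores (1-4) from three raters for six model answers.
example = [
    [4, 3, 2, 4, 1, 3],
    [4, 3, 3, 4, 2, 3],
    [3, 3, 2, 4, 1, 2],
]
print(round(kendalls_w(example), 3))
```

W ranges from 0 (no agreement among raters) to 1 (identical rankings), so the reported values of roughly 0.63 to 0.70 would sit in the upper part of that range under this assumption.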
