
ChatGPT-4o outperforms Gemini Advanced in assisting multidisciplinary decision-making for advanced gastric cancer.

European journal of surgical oncology : the journal of the European Society of Surgical Oncology and the British Association of Surgical Oncology, 2025, Vol. 51(8), p. 110096

Li H, Huang J, Liu K, Liu J, Liu Q, Zhou Z, Zong Z, Mao S


Cite this article

APA Li H, Huang J, et al. (2025). ChatGPT-4o outperforms Gemini Advanced in assisting multidisciplinary decision-making for advanced gastric cancer. European journal of surgical oncology : the journal of the European Society of Surgical Oncology and the British Association of Surgical Oncology, 51(8), 110096. https://doi.org/10.1016/j.ejso.2025.110096
MLA Li H, et al. "ChatGPT-4o outperforms Gemini Advanced in assisting multidisciplinary decision-making for advanced gastric cancer." European journal of surgical oncology : the journal of the European Society of Surgical Oncology and the British Association of Surgical Oncology, vol. 51, no. 8, 2025, pp. 110096.
PMID 40294561

Abstract

[BACKGROUND & AIMS] The treatment of advanced gastric cancer (GC) requires precise and comprehensive clinical decision-making. Artificial intelligence (AI) chatbots offer potential tools to enhance multidisciplinary team (MDT) discussions. This study aims to compare the performance of ChatGPT-4o and Gemini Advanced in generating treatment recommendations for advanced GC.

[METHODS] The study involved three steps: (1) evaluating responses to ten critical clinical questions, (2) analyzing clinical cases from MDT meetings at our institution, and (3) reviewing rare GC cases from PubMed. It included 95 advanced GC patients discussed between November 2022 and July 2024, and 14 rare cases from PubMed. Prompts designed from advanced GC cases were submitted to ChatGPT-4o and Gemini Advanced using a standardized format. Outputs were evaluated for accuracy and completeness using a structured 4-point Likert scale. Interrater reliability was calculated to ensure consistency among evaluators.
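The abstract does not say which interrater reliability statistic was used. As a minimal sketch, assuming two evaluators and unweighted Cohen's kappa (one common choice for categorical ratings such as a 4-point Likert scale), the agreement calculation could look like the following. The function name `cohen_kappa` and the scores are hypothetical and are not taken from the study.

```python
from collections import Counter


def cohen_kappa(rater_a, rater_b):
    """Unweighted Cohen's kappa for two raters scoring the same items."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must score the same non-empty set of items")
    n = len(rater_a)
    # Observed agreement: fraction of items given the identical score.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement: product of each rater's marginal score frequencies.
    count_a = Counter(rater_a)
    count_b = Counter(rater_b)
    expected = sum(count_a[c] * count_b[c] for c in count_a) / (n * n)
    if expected == 1.0:
        # Both raters used a single identical category for every item.
        return 1.0
    return (observed - expected) / (1.0 - expected)


# Hypothetical 4-point Likert scores (1 = worst, 4 = best) for eight responses.
scores_rater_1 = [4, 3, 4, 2, 1, 4, 3, 3]
scores_rater_2 = [4, 3, 3, 2, 1, 4, 3, 2]
kappa = cohen_kappa(scores_rater_1, scores_rater_2)
print(f"Cohen's kappa: {kappa:.3f}")
```

Because Likert scores are ordinal, a weighted kappa or an intraclass correlation coefficient may suit such data better; with more than two evaluators, Fleiss' kappa is a frequent alternative.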

[RESULTS] For the ten clinical questions, ChatGPT-4o performed better than Gemini Advanced. In MDT cases, ChatGPT-4o provided more valuable recommendations on surgical suggestions, chemotherapy recommendations, and chemotherapy regimens. Subgroup analysis confirmed these findings in both routine and complex cases, with high interrater reliability. ChatGPT-4o also outperformed Gemini Advanced in the analysis of rare GC cases from PubMed, showing superior accuracy with high interrater reliability.

[CONCLUSIONS] While our findings suggest that AI chatbots can generate clinically relevant and guideline-based treatment recommendations, their use in MDT decision-making should be viewed as supportive rather than autonomous. We emphasize that although AI chatbots have potential as decision-support tools, they should be integrated only under expert supervision in a real-world clinical context.
