ChatGPT-4o outperforms gemini advanced in assisting multidisciplinary decision-making for advanced gastric cancer.
APA
Li H, Huang J, et al. (2025). ChatGPT-4o outperforms gemini advanced in assisting multidisciplinary decision-making for advanced gastric cancer. European journal of surgical oncology: the journal of the European Society of Surgical Oncology and the British Association of Surgical Oncology, 51(8), 110096. https://doi.org/10.1016/j.ejso.2025.110096
MLA
Li H, et al. "ChatGPT-4o outperforms gemini advanced in assisting multidisciplinary decision-making for advanced gastric cancer." European journal of surgical oncology: the journal of the European Society of Surgical Oncology and the British Association of Surgical Oncology, vol. 51, no. 8, 2025, pp. 110096.
PMID
40294561
Abstract
[BACKGROUND & AIMS] The treatment of advanced gastric cancer (GC) requires precise and comprehensive clinical decision-making. Artificial intelligence (AI) chatbots offer potential tools to enhance multidisciplinary team (MDT) discussions. This study aims to compare the performances of ChatGPT-4o and Gemini Advanced in generating treatment recommendations for advanced GC.
[METHODS] The study involved three steps: (1) evaluating responses to ten critical clinical questions, (2) analyzing clinical cases from MDT meetings at our institution, and (3) reviewing rare GC cases from PubMed. It included 95 advanced GC patients discussed between November 2022 and July 2024, and 14 rare cases from PubMed. Prompts designed from advanced GC cases were submitted to ChatGPT-4o and Gemini Advanced using a standardized format. Outputs were evaluated for accuracy and completeness using a structured 4-point Likert scale. Interrater reliability was calculated to ensure consistency among evaluators.
[RESULTS] For the ten clinical questions, ChatGPT-4o performed better than Gemini Advanced. In MDT cases, ChatGPT-4o provided more valuable recommendations for surgical suggestions, chemotherapy recommendations, and chemotherapy regimens. Subgroup analysis confirmed these findings in both routine and complex cases, with high interrater reliability. ChatGPT-4o also outperformed Gemini Advanced in the analysis of rare GC cases from PubMed, showing superior accuracy with high interrater reliability.
[CONCLUSIONS] While our findings suggest that AI chatbots can generate clinically relevant and guideline-based treatment recommendations, their use in MDT decision-making should be viewed as supportive rather than autonomous. We emphasize that although AI chatbots have potential as decision-support tools, they should be integrated only under expert supervision in a real-world clinical context.
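The abstract reports that outputs were scored on a 4-point Likert scale and that interrater reliability was calculated, but it does not name the statistic used. As a minimal sketch only, assuming two evaluators and unweighted Cohen's kappa (one common choice, not necessarily the authors'), agreement could be computed as below. The function name and the ratings are hypothetical illustrations, not study data.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Unweighted Cohen's kappa for two raters scoring the same items.

    Assumption: the abstract does not say which reliability statistic
    was used; kappa is shown here purely as an illustration.
    """
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must score the same non-empty item list")

    n = len(rater_a)
    # Observed agreement: share of items given the same score by both raters.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n

    # Chance agreement: product of each rater's marginal score frequencies.
    counts_a = Counter(rater_a)
    counts_b = Counter(rater_b)
    expected = sum(counts_a[score] * counts_b[score] for score in counts_a) / (n * n)

    if expected == 1.0:
        # Both raters used a single identical score for every item.
        return 1.0
    return (observed - expected) / (1.0 - expected)


# Hypothetical 4-point Likert scores (1 = worst, 4 = best) for eight responses.
evaluator_1 = [4, 4, 3, 2, 4, 1, 3, 4]
evaluator_2 = [4, 3, 3, 2, 4, 1, 3, 4]

kappa = cohens_kappa(evaluator_1, evaluator_2)
print(f"Cohen's kappa: {kappa:.3f}")
```

Because Likert scores are ordinal, a weighted kappa or an intraclass correlation coefficient may be more appropriate when near-misses should count as partial agreement; the unweighted form is used here only to keep the sketch short.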
Highly cited papers by the same first author (5)
- Epidemiological Analysis of Foodborne Botulism Outbreaks - China, 2004-2020.
- Comparison of efficacy of eight treatments for plantar fasciitis: A network meta-analysis.
- A Transformer-Based Deep Learning Model for predicting Early Recurrence in Hepatocellular Carcinoma After Hepatectomy Using Intravoxel Incoherent Motion Images.
- The multifaceted roles of the ACSL family in cancer: Metabolic reprogramming, ferroptosis regulation and tumour immune microenvironment remodelling.
- Neutrophil-inspired CRISPR/dCas9 nanomedicine to program self-destructing and bystander killing of tumor cell for selective cancer therapy.
Free full-text papers with the same keywords (based on this paper's MeSH terms and keywords)
- A Phase I Study of Hydroxychloroquine and Suba-Itraconazole in Men with Biochemical Relapse of Prostate Cancer (HITMAN-PC): Dose Escalation Results.
- Self-management of male urinary symptoms: qualitative findings from a primary care trial.
- Clinical and Liquid Biomarkers of 20-Year Prostate Cancer Risk in Men Aged 45 to 70 Years.
- Diagnostic accuracy of Ga-PSMA PET/CT versus multiparametric MRI for preoperative pelvic invasion in the patients with prostate cancer.
- Clinical Presentation and Outcomes of Patients Undergoing Surgery for Thyroid Cancer.
- Association of patient health education with the postoperative health related quality of life in low- intermediate recurrence risk differentiated thyroid cancer patients.