Scoring Physician Risk Communication in Prostate Cancer Using Large Language Models.

Lopez-Garcia G; Xu D; Luu M; Zheng R; Daskivich TJ; Gonzalez-Hernandez G

doi:10.1101/2025.08.07.25333034

← 뒤로

Scoring Physician Risk Communication in Prostate Cancer Using Large Language Models.

1/5 보강

medRxiv : the preprint server for health sciences 📖 저널 OA 100% 2023~2026 2025

Lopez-Garcia G, Xu D, Luu M, Zheng R, Daskivich TJ, Gonzalez-Hernandez G

📖 무료 전문 🟢 PMC 전문 PMC12363684

PubMed ↗ DOI ↗ BibTeX ↓ RIS ↓

📝 환자 설명용 한 줄

Effective risk communication is essential to shared decision-making in prostate cancer care.

이 논문을 인용하기

↓ .bib ↓ .ris

APA Lopez-Garcia G, Xu D, et al. (2025). Scoring Physician Risk Communication in Prostate Cancer Using Large Language Models.. medRxiv : the preprint server for health sciences. https://doi.org/10.1101/2025.08.07.25333034

MLA Lopez-Garcia G, et al.. "Scoring Physician Risk Communication in Prostate Cancer Using Large Language Models.." medRxiv : the preprint server for health sciences, 2025.

PMID 40832413 ↗

DOI 10.1101/2025.08.07.25333034

Abstract

Effective risk communication is essential to shared decision-making in prostate cancer care. However, the quality of physician communication of key tradeoffs varies widely in real-world consultations. Manual evaluation of communication is labor-intensive and not scalable. We present a structured, rubric-based framework that uses large language models (LLMs) to automatically score the quality of risk communication in prostate cancer consultations. Using transcripts from 20 clinical visits, we curated and annotated 487 physician-spoken sentences that referenced five decision-making domains: cancer prognosis, life expectancy, and three treatment side effects (erectile dysfunction, incontinence, and irritative urinary symptoms). Each sentence was assigned a score from 0 to 5 based on the precision and patient-specificity of communicated risk, using a validated scoring rubric. We modeled this task as five multiclass classification problems and evaluated both fine-tuned transformer baselines and GPT-4o with rubric-based and chain-of-thought (CoT) prompting. Our best performing approach, which combined rubric-based CoT prompting with few-shot learning, achieved micro averaged F1 scores between 85.0 and 92.0 across domains, outperforming supervised baselines and matching inter-annotator agreement. These findings establish a scalable foundation for AI-driven evaluation of physician-patient communication in oncology and beyond.

🏷️ 키워드 / MeSH 📖 같은 키워드 OA만

같은 제1저자의 인용 많은 논문 (1)

Scoring Physician Risk Communication in Prostate Cancer Using Large Language Models.
Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing 2026

🏷️ 같은 키워드 · 무료전문 — 이 논문 MeSH/keyword 기반

Prostate Cancer Care for Men with an Intellectual Disability: A Population-based Cohort Study of Symptoms, Diagnosis, Treatment, and Survival.
European urology oncology 2027 Kennedy OJ 외 📖 unpaywall
Association between polygenic risk scores and cardiovascular events in prostate cancer patients receiving androgen deprivation therapy in Han Chinese.
Cardio-oncology (London, England) 2026 Nian QY 외 📖 unpaywall
Diagnostic accuracy of Ga-PSMA PET/CT versus multiparametric MRI for preoperative pelvic invasion in the patients with prostate cancer.
Science progress 2026 Qin Z 외 📖 unpaywall
Comprehensive analysis of androgen receptor splice variant target gene expression in prostate cancer.
Biochimica et biophysica acta. Molecular cell research 2026 Wüstmann N 외 📖 unpaywall
Nanotechnology-Assisted Molecular Profiling: Emerging Advances in Circulating Tumor DNA Detection.
International journal of nanomedicine 2026 Kang J 외 📖 OA
Artificial intelligence and breast cancer screening in Serbia: a dual-perspective qualitative study among radiologists and screening-aged women.
Frontiers in radiology 2026 Jovanović S 외 📖 OA

이 논문을 인용하기

Abstract 한글 요약

🏷️ 키워드 / MeSH 📖 같은 키워드 OA만

같은 제1저자의 인용 많은 논문 (1)

🏷️ 같은 키워드 · 무료전문 — 이 논문 MeSH/keyword 기반

Abstract