Clinical validation of AI-assisted contouring in prostate radiation therapy treatment planning: Highlighting automation bias and the need for standardized quality assurance.
1/5 보강
[PURPOSE] This study evaluated the impact of a commercial AI-assisted contouring tool on intra- and inter-observer variability in prostate radiation therapy and assessed the dosimetric consequences of
APA
Arjmandi N, Sebzari AR, et al. (2026). Clinical validation of AI-assisted contouring in prostate radiation therapy treatment planning: Highlighting automation bias and the need for standardized quality assurance.. Journal of applied clinical medical physics, 27(1), e70425. https://doi.org/10.1002/acm2.70425
MLA
Arjmandi N, et al.. "Clinical validation of AI-assisted contouring in prostate radiation therapy treatment planning: Highlighting automation bias and the need for standardized quality assurance.." Journal of applied clinical medical physics, vol. 27, no. 1, 2026, pp. e70425.
PMID
41414851 ↗
Abstract 한글 요약
[PURPOSE] This study evaluated the impact of a commercial AI-assisted contouring tool on intra- and inter-observer variability in prostate radiation therapy and assessed the dosimetric consequences of geometric contour differences.
[METHODS] Two experienced radiation oncologists independently delineated clinical target volume (CTV) and organs at risk (OARs) for prostate cancer patients. Manual contours (C) and AI-generated contours (C) were compared with adjusted AI contours (C). A consensus reference (C) served as the benchmark. To evaluate clinical impact, treatment plans were recalculated and replanned on each contour set under identical beam geometries to assess dose-volume histogram (DVH) parameters.
[RESULTS] AI-assisted contouring significantly improved both intra- and inter-observer agreement. Inter-observer analysis revealed that the Dice similarity coefficient (DSCs) for CTV increased from 0.78 (± 0.11) for C to 0.89 (± 0.09) for C. Similarly, intra-observer analysis revealed that both oncologists showed significantly higher DSCs for C compared to C. A thorough geometric comparison to the C revealed that while adjustments to C improved accuracy, they generally did not surpass C for CTV and rectum. Dosimetric analyses demonstrated that, under fixed plan geometry, both C and C contours yielded lower planning target volume (PTV) D95% values compared with C, whereas after replanning, all plans met institutional criteria with no clinically significant differences among contour sets.
[CONCLUSION] AI-assisted contouring in prostate radiotherapy reduced intra- and inter-observer variability and improved contouring consistency. However, C did not consistently surpass C, especially for the CTV and rectum, where automation bias or selective clinical acceptance may have influenced edits. Fixed-plan recalculations revealed dose differences from minor geometric deviations. These findings underscore the importance of structured quality assurance (QA) and human oversight to mitigate automation bias while leveraging AI's efficiency. The single-institution design with two oncologists and one AI software limits generalizability, underscoring the need for multi-observer validation.
[METHODS] Two experienced radiation oncologists independently delineated clinical target volume (CTV) and organs at risk (OARs) for prostate cancer patients. Manual contours (C) and AI-generated contours (C) were compared with adjusted AI contours (C). A consensus reference (C) served as the benchmark. To evaluate clinical impact, treatment plans were recalculated and replanned on each contour set under identical beam geometries to assess dose-volume histogram (DVH) parameters.
[RESULTS] AI-assisted contouring significantly improved both intra- and inter-observer agreement. Inter-observer analysis revealed that the Dice similarity coefficient (DSCs) for CTV increased from 0.78 (± 0.11) for C to 0.89 (± 0.09) for C. Similarly, intra-observer analysis revealed that both oncologists showed significantly higher DSCs for C compared to C. A thorough geometric comparison to the C revealed that while adjustments to C improved accuracy, they generally did not surpass C for CTV and rectum. Dosimetric analyses demonstrated that, under fixed plan geometry, both C and C contours yielded lower planning target volume (PTV) D95% values compared with C, whereas after replanning, all plans met institutional criteria with no clinically significant differences among contour sets.
[CONCLUSION] AI-assisted contouring in prostate radiotherapy reduced intra- and inter-observer variability and improved contouring consistency. However, C did not consistently surpass C, especially for the CTV and rectum, where automation bias or selective clinical acceptance may have influenced edits. Fixed-plan recalculations revealed dose differences from minor geometric deviations. These findings underscore the importance of structured quality assurance (QA) and human oversight to mitigate automation bias while leveraging AI's efficiency. The single-institution design with two oncologists and one AI software limits generalizability, underscoring the need for multi-observer validation.
🏷️ 키워드 / MeSH 📖 같은 키워드 OA만
- Humans
- Radiotherapy Planning
- Computer-Assisted
- Prostatic Neoplasms
- Male
- Radiotherapy Dosage
- Organs at Risk
- Quality Assurance
- Health Care
- Radiotherapy
- Intensity-Modulated
- Artificial Intelligence
- Automation
- Automation bias
- Auto‐contouring
- Deep learning
- Inter‐observer variability
- Intra‐observer variability
- Quality assurance
🏷️ 같은 키워드 · 무료전문 — 이 논문 MeSH/keyword 기반
- A Phase I Study of Hydroxychloroquine and Suba-Itraconazole in Men with Biochemical Relapse of Prostate Cancer (HITMAN-PC): Dose Escalation Results.
- Self-management of male urinary symptoms: qualitative findings from a primary care trial.
- Clinical and Liquid Biomarkers of 20-Year Prostate Cancer Risk in Men Aged 45 to 70 Years.
- Diagnostic accuracy of Ga-PSMA PET/CT versus multiparametric MRI for preoperative pelvic invasion in the patients with prostate cancer.
- Association of patient health education with the postoperative health related quality of life in low- intermediate recurrence risk differentiated thyroid cancer patients.
- Early local immune activation following intra-operative radiotherapy in human breast tissue.