The Validity, Accuracy, and Comprehensiveness of Artificial Intelligence for Generating Resident Educational Material and Discussion Guides.
Abstract
[INTRODUCTION] The current education of plastic surgery residents is hamstrung by reliance on outmoded teacher-learner methods including the incorporation of a passive system of lectures and talks into the formal conferences. The hypothesis was that artificial intelligence (AI), if trained, could generate satisfactory topical, specific subject, faculty-resident conference discussions or guides.
[METHODS] Three topics, namely, breast reduction, frontal sinus fracture, and flexor tendon injury, were selected, and an AI platform was trained with a vetted bibliography to generate discussion guides. A set of consensual cardinal teaching points were established by 3 faculty, and the AI-generated discussion guides were assessed by 6 residents with the criteria of completeness, specificity, and absence of hallucinations on a scale of 1 to 10. The same set of residents assessed faculty-generated discussion guides with the same criteria.
[RESULTS] The interrater reliability was 0.85. The mean and standard deviations of the rating values for breast reduction, frontal sinus, and flexor tendon repair were 4.39 ± 1.91, 3.39 ± 2.55, and 2.56 ± 1.62, respectively. The resident assessment of faculty-generated discussion guides yielded means and standard deviations of 8.94 ± 0.73, 8.56 ± 1.2, and 8.94 ± 0.9, respectively. A statistical comparison of the 2 groups of means and standard deviations revealed a significant difference.
[CONCLUSIONS] The platform generated pages of text and language that lacked content or, worse, was nonsensical. These results call into question the utility value of AI in the current development to contribute to the more sophisticated realm of plastic surgery resident education.
[METHODS] Three topics, namely, breast reduction, frontal sinus fracture, and flexor tendon injury, were selected, and an AI platform was trained with a vetted bibliography to generate discussion guides. A set of consensual cardinal teaching points were established by 3 faculty, and the AI-generated discussion guides were assessed by 6 residents with the criteria of completeness, specificity, and absence of hallucinations on a scale of 1 to 10. The same set of residents assessed faculty-generated discussion guides with the same criteria.
[RESULTS] The interrater reliability was 0.85. The mean and standard deviations of the rating values for breast reduction, frontal sinus, and flexor tendon repair were 4.39 ± 1.91, 3.39 ± 2.55, and 2.56 ± 1.62, respectively. The resident assessment of faculty-generated discussion guides yielded means and standard deviations of 8.94 ± 0.73, 8.56 ± 1.2, and 8.94 ± 0.9, respectively. A statistical comparison of the 2 groups of means and standard deviations revealed a significant difference.
[CONCLUSIONS] The platform generated pages of text and language that lacked content or, worse, was nonsensical. These results call into question the utility value of AI in the current development to contribute to the more sophisticated realm of plastic surgery resident education.
추출된 의학 개체 (NER)
| 유형 | 영어 표현 | 한국어 / 풀이 | UMLS CUI | 출처 | 등장 |
|---|---|---|---|---|---|
| 시술 | breast reduction
|
유방성형술 | dict | 2 | |
| 해부 | breast
|
유방 | dict | 2 | |
| 해부 | flexor tendon
|
scispacy | 1 | ||
| 합병증 | frontal sinus
|
scispacy | 1 | ||
| 약물 | [INTRODUCTION] The
|
scispacy | 1 | ||
| 약물 | [CONCLUSIONS]
|
scispacy | 1 | ||
| 질환 | frontal sinus fracture
|
C0272460
Fracture of frontal sinus
|
scispacy | 1 | |
| 질환 | flexor tendon
|
C0224848
Structure of flexor tendon
|
scispacy | 1 | |
| 질환 | hallucinations
|
C0018524
Hallucinations
|
scispacy | 1 |
MeSH Terms
Internship and Residency; Surgery, Plastic; Humans; Artificial Intelligence; Reproducibility of Results; Education, Medical, Graduate; Female
🔗 함께 등장하는 도메인
이 논문이 속한 카테고리와 같은 논문에서 자주 함께 다뤄지는 카테고리들
관련 논문
- The impact of three-dimensional simulation and virtual reality technologies on surgical decision-making and postoperative satisfaction in aesthetic surgery: a preliminary study.
- Cutaneous fistula of the breast: A complication of cosmetic autologous fat transfer.
- Epidermal inclusion cyst after breast reduction mammoplasty.
- Clinical outcomes of synthetic absorbable mesh use in breast surgery: First case series in reconstruction and aesthetic mastopexy.
- Implant-based versus autologous mastopexy after massive weight loss: Complications and patient satisfaction.