본문으로 건너뛰기
← 뒤로

Extracting structured data from unstructured breast imaging reports with transformer-based models.

Frontiers in digital health 2025 Vol.7() p. 1718330

Carrilero-Mardones M, Pérez-Martín J, Díez FJ, Bermejo Delgado I

📝 환자 설명용 한 줄

[BACKGROUND AND OBJECTIVE] Structured clinical data is essential for research and informed decision-making, yet medical reports are frequently stored as unstructured free text.

이 논문을 인용하기

BibTeX ↓ RIS ↓
APA Carrilero-Mardones M, Pérez-Martín J, et al. (2025). Extracting structured data from unstructured breast imaging reports with transformer-based models.. Frontiers in digital health, 7, 1718330. https://doi.org/10.3389/fdgth.2025.1718330
MLA Carrilero-Mardones M, et al.. "Extracting structured data from unstructured breast imaging reports with transformer-based models.." Frontiers in digital health, vol. 7, 2025, pp. 1718330.
PMID 41586210

Abstract

[BACKGROUND AND OBJECTIVE] Structured clinical data is essential for research and informed decision-making, yet medical reports are frequently stored as unstructured free text. This study compared the performance of BERT-based and generative language models in converting unstructured breast imaging reports into structured, tabular data suitable for clinical and research applications.

[METHODS] A dataset of 286 anonymised breast imaging reports in Spanish was translated into English and used to evaluate five transformer-based models pre-trained in medical data: BlueBERT, BioBERT, BioMedBERT, BioGPT and ClinicalT5. Two natural language processing approaches were explored: classification of 19 categorical variables (e.g. diagnostic technique, report type, family history, BI-RADS category, tumour shape and margin) and extractive question answering of four entities (patient age, patient history, parenchymal distortion or asymmetries, and tumour size). Multiple fine-tuning strategies and input configurations were tested for each model, and performance was evaluated using accuracy and macro F1 scores.

[RESULTS] BioGPT demonstrated the best performance in classification tasks, achieving an overall accuracy of and a macro F1 score of . This was significantly better than BERT-based models ( for accuracy and for F1), particularly in underrepresented categories such as tumour descriptors. In extractive question answering tasks, BioGPT achieved an average accuracy of , which is slightly lower than that of BioMedBERT and ClinicalT5, but not significantly so. Notably, BioGPT could perform classification and extractive question answering simultaneously, which is a capability unavailable in BERT-like models.

[CONCLUSIONS] Generative models, particularly BioGPT, offer a robust and scalable approach to automating the extraction of structured information from unstructured breast imaging reports. Their superior performance, combined with their ability to handle multiple tasks concurrently, highlights their potential to reduce the manual effort required for clinical data curation and to enable the efficient integration of imaging data into research and clinical workflows.