TEMSET-24K: Densely Annotated Dataset for Indexing Multipart Endoscopic Videos using Surgical Timeline Segmentation.

Scientific data 2025 Vol.12(1) p. 1424

Bilal M, Alam M, Bapu D, Korsgen S, Lal N, Bach S, Hajiyavand AM, Ali M, Soomro K, Qasim I, Capik P, Khan A, Khan Z, Vohra H, Caputo M, Beggs AD, Qayyum A, Qadir J, Ashraf SQ

관련 도메인

Abstract

Indexing endoscopic surgical videos is vital in surgical data science, forming the basis for systematic retrospective analysis and clinical performance evaluation. Despite its significance, current video analytics rely on manual indexing, a time-consuming process. Advances in computer vision, particularly deep learning, offer automation potential, yet progress is limited by the lack of publicly available, densely annotated surgical datasets. To address this, we present TEMSET-24K, an open-source dataset comprising 24,306 trans-anal endoscopic microsurgery (TEMS) video microclips. Each clip is meticulously annotated by clinical experts using a novel hierarchical labeling taxonomy encompassing "phase, task, and action" triplets, capturing intricate surgical workflows. To validate this dataset, we benchmarked deep learning models, including transformer-based architectures. Our in silico evaluation demonstrates high accuracy (up to 0.99) and F1 scores (up to 0.99) for key phases like "Setup" and "Suturing." The STALNet model, tested with ConvNeXt, ViT, and SWIN V2 encoders, consistently segmented well-represented phases. TEMSET-24K provides a critical benchmark, propelling state-of-the-art solutions in surgical data science.

추출된 의학 개체 (NER)

유형영어 표현한국어 / 풀이UMLS CUI출처등장
기법 endoscopic 내시경 dict 3
시술 microsurgery 미세수술 dict 1
질환 transformer-based scispacy 1
기타 TEMSET-24K scispacy 1
기타 ViT scispacy 1

MeSH Terms

Video Recording; Humans; Deep Learning; Microsurgery; Endoscopy; Abstracting and Indexing

🔗 함께 등장하는 도메인

이 논문이 속한 카테고리와 같은 논문에서 자주 함께 다뤄지는 카테고리들

관련 논문