AI performance for nodule volume doubling time in the follow-up of the UKLS lung cancer screening study compared to expert consensus and histological validation.
[AIM] To validate an artificial intelligence (AI) software for automated assessment of nodule growth by volume doubling time measurement (VDT) on protocol-mandated follow-up low-dose CT (LDCT) scans f
APA
Jiang B, Lancaster HL, et al. (2026). AI performance for nodule volume doubling time in the follow-up of the UKLS lung cancer screening study compared to expert consensus and histological validation.. European journal of cancer (Oxford, England : 1990), 232, 116137. https://doi.org/10.1016/j.ejca.2025.116137
MLA
Jiang B, et al.. "AI performance for nodule volume doubling time in the follow-up of the UKLS lung cancer screening study compared to expert consensus and histological validation.." European journal of cancer (Oxford, England : 1990), vol. 232, 2026, pp. 116137.
PMID
41319449
Abstract
[AIM] To validate an artificial intelligence (AI) software for automated assessment of nodule growth by volume doubling time measurement (VDT) on protocol-mandated follow-up low-dose CT (LDCT) scans from the UK lung cancer screening (UKLS) trial.
[METHODS] This validation study included 710 UKLS participants with 939 LDCT follow-up scans (361 3-month and 578 12-month). Follow-up scans were assessed independently by both AI and human readers. A positive finding warranting referral was defined as the largest nodule with a solid component ≥ 100 mm showing VDT ≤ 400 days at follow-up. Performance was benchmarked against the European expert panel (reference standard) and then the histological outcomes (gold standard).
[RESULTS] Against the expert panel, AI achieved the lowest 3-month negative misclassification (NM) rate (1/11, 9.1 %), versus human readers (range: 18.2-63.6 %). AI's positive misclassification (PM) rate was initially 7.8 % (28/361) at 3 months but decreased to 0.9 % (5/578) at 12 months. Against histological outcomes of 9 screen-detected lung cancers, AI identified VDT ≤ 400 days in all 4 cancers also deemed positive by the expert panel at the earliest 3-month follow-up, while human readers missed or delayed referrals in 1-3 of these. AI also identified VDT ≤ 400 days in 3 of 5 cancers that the panel classified as negative, primarily due to their sub-threshold volume (<100mm³).
[CONCLUSIONS] The automated AI system showed strong VDT assessment performance in follow-up screening, outperforming human readers in the early identification of rapid growth in histologically-confirmed cancers, thus supporting its potential to enhance risk stratification and facilitate earlier lung cancer detection.
[METHODS] This validation study included 710 UKLS participants with 939 LDCT follow-up scans (361 3-month and 578 12-month). Follow-up scans were assessed independently by both AI and human readers. A positive finding warranting referral was defined as the largest nodule with a solid component ≥ 100 mm showing VDT ≤ 400 days at follow-up. Performance was benchmarked against the European expert panel (reference standard) and then the histological outcomes (gold standard).
[RESULTS] Against the expert panel, AI achieved the lowest 3-month negative misclassification (NM) rate (1/11, 9.1 %), versus human readers (range: 18.2-63.6 %). AI's positive misclassification (PM) rate was initially 7.8 % (28/361) at 3 months but decreased to 0.9 % (5/578) at 12 months. Against histological outcomes of 9 screen-detected lung cancers, AI identified VDT ≤ 400 days in all 4 cancers also deemed positive by the expert panel at the earliest 3-month follow-up, while human readers missed or delayed referrals in 1-3 of these. AI also identified VDT ≤ 400 days in 3 of 5 cancers that the panel classified as negative, primarily due to their sub-threshold volume (<100mm³).
[CONCLUSIONS] The automated AI system showed strong VDT assessment performance in follow-up screening, outperforming human readers in the early identification of rapid growth in histologically-confirmed cancers, thus supporting its potential to enhance risk stratification and facilitate earlier lung cancer detection.
MeSH Terms
Aged; Female; Humans; Male; Middle Aged; Artificial Intelligence; Consensus; Early Detection of Cancer; Follow-Up Studies; Lung Neoplasms; Time Factors; Tomography, X-Ray Computed; Tumor Burden; United Kingdom; Consensus Statements as Topic
같은 제1저자의 인용 많은 논문 (5)
- Obesity-driven thyroid cancer burden in middle-aged and older populations: temporal trends and projected trajectories based on the Global Burden of Disease study.
- Long-term follow-up of predominantly Asian patients with relapsed/refractory FLT3-mutated acute myeloid leukemia in the phase 3 COMMODORE trial.
- Correction to: Long-term follow-up of predominantly Asian patients with relapsed/refractory FLT3-mutated acute myeloid leukemia in the phase 3 COMMODORE trial.
- Indolent NK-Cell Lymphoproliferative Disorder of the Nasal Cavity: A Case Report and Review of the Literature.
- Letter to the editor 1 on "MET promotes hepatocellular carcinoma development through the promotion of TRIB3-mediated FOXO1 degradation".