Multi-view deep learning for automated lymphoma staging from F-FDG PET/CT: physician-level accuracy with high-throughput workflow.
[BACKGROUND] Lymphoma staging plays a pivotal role in guiding risk-adapted therapeutic strategies.
APA
Li D, Li R, et al. (2026). Multi-view deep learning for automated lymphoma staging from F-FDG PET/CT: physician-level accuracy with high-throughput workflow.. EJNMMI research, 16(1). https://doi.org/10.1186/s13550-025-01357-w
MLA
Li D, et al.. "Multi-view deep learning for automated lymphoma staging from F-FDG PET/CT: physician-level accuracy with high-throughput workflow.." EJNMMI research, vol. 16, no. 1, 2026.
PMID
41686381
Abstract
[BACKGROUND] Lymphoma staging plays a pivotal role in guiding risk-adapted therapeutic strategies. While the Ann Arbor system is the standard for staging lymphoma, current manual PET/CT evaluations suffer from interobserver variability and time inefficiencies. Recent advancements in artificial intelligence (AI) offer opportunities to objectively automate this workflow. This study aimed to develop a deep-learning model based on multi-view 2D maximum intensity projection (MIP) images from F-FDG PET/CT to automate lymphoma staging and compare its performance with nuclear medicine physicians.
[METHODS] We retrospectively analyzed baseline F-FDG PET/CT scans of 227 patients with lymphoma. The deep learning models were constructed with 2D MIP, three dimensional (3D) PET and 3D CT images for binary classication (limited vs. advanced stage). The models were evaluated by 5-fold cross-validation. Ground truth were the stage results of senior nuclear medicine physicians (5–10 years). Staging performance of different models was assessed by Receiver operating characteristic (ROC) curves. McNemar’s chi-squared test was used to compare the staging differences between the best-performing model and the nuclear medicine physicians with early-career nuclear medicine physicians (1–3 and 3–5 years, respectively).
[RESULTS] The developed AI model, LASM-mMIP, utilized multi-view 2D MIP images across coronal, sagittal, left anterior oblique (LAO), and right anterior oblique (RAO) perspectives. It achieved an accuracy of 93.0% and an AUC of 0.942 for staging, outperforming other AI models based on single-view MIP images and 3D volumetric PET/CT data. LASM-mMIP exhibited significantly higher staging accuracy compared to junior nuclear medicine physicians (1–3 years, 84.58%, < 0.05) and similar accuracy to intermediate-experience physicians (3–5 years, 96.9%, = 0.095). Moreover, the model reduced staging time by four orders of magnitude compared to physicians [(2.13 ± 0.31) seconds vs. (17918.90 ± 56310.53) seconds, < 0.05]. Failure analysis revealed that most false negatives were due to occult lesions, while false positives arose from physiological hypermetabolism.
[CONCLUSIONS] The LASM-mMIP model demonstrated high accuracy and significant temporal efficiency in lymphoma staging, offering performance comparable to early-career nuclear medicine physicians. By overcoming interobserver variability and reducing manual interpretation time, this AI-driven tool could streamline workflows and enhance decision-making in clinical settings. Future work should expand patient cohorts, explore additional clinical predictors, and refine stage-specific classification to further personalize lymphoma management.
[METHODS] We retrospectively analyzed baseline F-FDG PET/CT scans of 227 patients with lymphoma. The deep learning models were constructed with 2D MIP, three dimensional (3D) PET and 3D CT images for binary classication (limited vs. advanced stage). The models were evaluated by 5-fold cross-validation. Ground truth were the stage results of senior nuclear medicine physicians (5–10 years). Staging performance of different models was assessed by Receiver operating characteristic (ROC) curves. McNemar’s chi-squared test was used to compare the staging differences between the best-performing model and the nuclear medicine physicians with early-career nuclear medicine physicians (1–3 and 3–5 years, respectively).
[RESULTS] The developed AI model, LASM-mMIP, utilized multi-view 2D MIP images across coronal, sagittal, left anterior oblique (LAO), and right anterior oblique (RAO) perspectives. It achieved an accuracy of 93.0% and an AUC of 0.942 for staging, outperforming other AI models based on single-view MIP images and 3D volumetric PET/CT data. LASM-mMIP exhibited significantly higher staging accuracy compared to junior nuclear medicine physicians (1–3 years, 84.58%, < 0.05) and similar accuracy to intermediate-experience physicians (3–5 years, 96.9%, = 0.095). Moreover, the model reduced staging time by four orders of magnitude compared to physicians [(2.13 ± 0.31) seconds vs. (17918.90 ± 56310.53) seconds, < 0.05]. Failure analysis revealed that most false negatives were due to occult lesions, while false positives arose from physiological hypermetabolism.
[CONCLUSIONS] The LASM-mMIP model demonstrated high accuracy and significant temporal efficiency in lymphoma staging, offering performance comparable to early-career nuclear medicine physicians. By overcoming interobserver variability and reducing manual interpretation time, this AI-driven tool could streamline workflows and enhance decision-making in clinical settings. Future work should expand patient cohorts, explore additional clinical predictors, and refine stage-specific classification to further personalize lymphoma management.
같은 제1저자의 인용 많은 논문 (5)
- Comprehensive complication index: A new reporting standard for postoperative complications of free-flap reconstruction in head and neck cancer patients.
- PEGylated Cu-doped WS hybrid nanosheets for targeted multimodal cancer therapy.
- Causes of Death Among Waldenström Macroglobulinemia Patients: A Population-Based Study.
- Cell-Autonomous AR Dependence in Luminal Prostatic Epithelium Governs Survival and Lineage Plasticity.
- Ensemble-learning-assisted exhaled gas disease analysis based on in-situ construction of MOF-derived MO/GaN heterojunction sensor arrays.