Leveraging Machine Learning for Rapid and Accurate Diagnosis of Acute Leukemia.
[CONTEXT] Early detection of acute leukemia (AL) is crucial for timely intervention and improved outcomes.
- Sensitivity 94%
APA
Maddirala BP, Oberoi G, et al. (2026). Leveraging Machine Learning for Rapid and Accurate Diagnosis of Acute Leukemia.. International journal of laboratory hematology, 48(1), 130-139. https://doi.org/10.1111/ijlh.14555
MLA
Maddirala BP, et al.. "Leveraging Machine Learning for Rapid and Accurate Diagnosis of Acute Leukemia.." International journal of laboratory hematology, vol. 48, no. 1, 2026, pp. 130-139.
PMID
40922620
Abstract
[CONTEXT] Early detection of acute leukemia (AL) is crucial for timely intervention and improved outcomes. Machine learning (ML) models provide a promising approach for early screening and rapid diagnosis of AL, minimizing delays in referral.
[OBJECTIVES] To assess the utility of leukocyte cell population data (CPD) through ML models for detecting AL. To subclassify AL into acute myeloid leukemia (AML) and acute lymphoblastic leukemia (ALL) using CPD morphometry at a pre-microscopic level. To perform feature analysis on the ML prediction model.
[METHODS] We analyzed 1211 cases, including 810 confirmed AL cases (by morphology, immunophenotype, or molecular methods) and 401 benign cases. Leukocyte parameters and CPD from a Sysmex XN1000 analyzer (WDF Channel) were used for classification. ML models-LightGBM, CatBoost, TabNet, and XGBoost-were trained, and the optimal model was selected based on accuracy from 5-fold cross-validation. Feature contributions were evaluated using SHAP.
[RESULTS] Heat maps and UMAP projections effectively differentiated AL from benign cases and AML from ALL. XGBoost achieved the best performance with 88% sensitivity and 94% specificity. ROC-AUC scores were 0.88 for AML, 0.87 for ALL, and 0.99 for benign. Key features identified included NE-WY, MO-WZ, LYMPH, NE-WZ, NEUT, and MONO#.
[CONCLUSION] ML models based on leukocyte and CPD parameters enhance the predictability of AL detection and lineage differentiation at a pre-microscopic level. Integrating these models into hematology analyzers provides a cost-effective, novel tool for detection and differentiation. Interpretable predictions assist experts, reducing subjectivity and expediting final diagnosis through immunophenotyping and molecular studies.
[OBJECTIVES] To assess the utility of leukocyte cell population data (CPD) through ML models for detecting AL. To subclassify AL into acute myeloid leukemia (AML) and acute lymphoblastic leukemia (ALL) using CPD morphometry at a pre-microscopic level. To perform feature analysis on the ML prediction model.
[METHODS] We analyzed 1211 cases, including 810 confirmed AL cases (by morphology, immunophenotype, or molecular methods) and 401 benign cases. Leukocyte parameters and CPD from a Sysmex XN1000 analyzer (WDF Channel) were used for classification. ML models-LightGBM, CatBoost, TabNet, and XGBoost-were trained, and the optimal model was selected based on accuracy from 5-fold cross-validation. Feature contributions were evaluated using SHAP.
[RESULTS] Heat maps and UMAP projections effectively differentiated AL from benign cases and AML from ALL. XGBoost achieved the best performance with 88% sensitivity and 94% specificity. ROC-AUC scores were 0.88 for AML, 0.87 for ALL, and 0.99 for benign. Key features identified included NE-WY, MO-WZ, LYMPH, NE-WZ, NEUT, and MONO#.
[CONCLUSION] ML models based on leukocyte and CPD parameters enhance the predictability of AL detection and lineage differentiation at a pre-microscopic level. Integrating these models into hematology analyzers provides a cost-effective, novel tool for detection and differentiation. Interpretable predictions assist experts, reducing subjectivity and expediting final diagnosis through immunophenotyping and molecular studies.
MeSH Terms
Humans; Machine Learning; Leukemia, Myeloid, Acute; Precursor Cell Lymphoblastic Leukemia-Lymphoma; Female; Male; Leukocytes; Adult; Middle Aged; Adolescent; Immunophenotyping