본문으로 건너뛰기
← 뒤로

Machine learning-based DNA microarray analysis for disease detection using the MICRO-AI framework.

Science progress 2026 Vol.109(2) p. 368504261436834

Othman MA

📝 환자 설명용 한 줄

DNA microarray is a transformative technique in genomics, enabling simultaneous examination of thousands of gene expression levels.

🔬 핵심 임상 통계 (초록에서 자동 추출 — 원문 검증 권장)
  • Sensitivity 95.2%
  • Specificity 97.4%

이 논문을 인용하기

BibTeX ↓ RIS ↓
APA Othman MA (2026). Machine learning-based DNA microarray analysis for disease detection using the MICRO-AI framework.. Science progress, 109(2), 368504261436834. https://doi.org/10.1177/00368504261436834
MLA Othman MA. "Machine learning-based DNA microarray analysis for disease detection using the MICRO-AI framework.." Science progress, vol. 109, no. 2, 2026, pp. 368504261436834.
PMID 41925147

Abstract

DNA microarray is a transformative technique in genomics, enabling simultaneous examination of thousands of gene expression levels. However, noise, high dimensionality (typically 12,000-22,000 genes), small sample sizes (155-1097 samples) and class imbalance complicate the extraction of meaningful diagnostic patterns. This paper presents MICRO-AI (Microarray Classification and Recognition using Artificial Intelligence), a comprehensive machine learning framework for DNA microarray analysis and automated disease diagnosis. The framework integrates advanced preprocessing (quantile normalisation, ComBat batch correction, KNN imputation), attention-weighted adaptive feature selection using recursive feature elimination with cross-validation, and heterogeneous ensemble classification combining gradient boosting machines, random forests and support vector machines with adaptive weight optimisation. A novel attention-based feature fusion mechanism dynamically prioritises discriminative gene expression signatures, reducing dimensionality by over 99% (from ∼20,000 to ∼127 genes) without loss of biological significance. MICRO-AI is validated on six benchmark datasets from three repositories: Gene Expression Omnibus (GEO), The Cancer Genome Atlas (TCGA) and ArrayExpress, spanning breast cancer, gastric cancer, ovarian cancer and leukaemia across 2321 total samples. Experimental results demonstrate average classification accuracy of 96.8%, sensitivity of 95.2%, specificity of 97.4%, F1-score of 96.0%, Matthews correlation coefficient of 0.928, and area under the receiver operating characteristic curve of 0.983. Comparative benchmarking against 10 state-of-the-art methods shows that MICRO-AI achieves 1.2-7.5% higher accuracy with an average training time of 52.3 s, representing 2.4-6.0× faster execution than deep learning alternatives. The modular architecture enables seamless integration with medical informatics systems for scalable clinical diagnostic deployment.

MeSH Terms

Humans; Machine Learning; Oligonucleotide Array Sequence Analysis; Gene Expression Profiling; Neoplasms; Breast Neoplasms