A deep learning model leveraging semantic features fusion for DNase I hypersensitive sites identification in the human genome.
1/5 보강
[BACKGROUND AND OBJECTIVE] DNase I hypersensitive sites (DHSs) are chromatin regions that are extremely sensitive to the DNase I enzyme, increasing their accessibility for cellular processes.
APA
Alarfaj FK, Tahir M, Srivastava G (2026). A deep learning model leveraging semantic features fusion for DNase I hypersensitive sites identification in the human genome.. Computer methods and programs in biomedicine, 273, 109127. https://doi.org/10.1016/j.cmpb.2025.109127
MLA
Alarfaj FK, et al.. "A deep learning model leveraging semantic features fusion for DNase I hypersensitive sites identification in the human genome.." Computer methods and programs in biomedicine, vol. 273, 2026, pp. 109127.
PMID
41151193 ↗
Abstract 한글 요약
[BACKGROUND AND OBJECTIVE] DNase I hypersensitive sites (DHSs) are chromatin regions that are extremely sensitive to the DNase I enzyme, increasing their accessibility for cellular processes. DHSs are crucial for understanding transcriptional regulation mechanisms and contain genetic variations linked to various diseases such as breast cancer, coronary artery disease, Alzheimer's disease, autoimmune disorders, and neurological conditions. However, conventional DHSs identification methods are labor-intensive and resource-heavy, necessitating the need for alternative cost-effective approaches with high performance.
[METHODS] In this study, we propose various computational models, namely the CNN model, the CNN-GRU fusion model, the CNN-kmer fusion model, and the CNN-GRU-kmer fusion model, to overcome the challenges associated with DHSs prediction. The CNN Model is based on a simple 1-dimensional convocational neural network (CNN). The CNN-GRU fusion model is based on a simple 1-dimensional CNN and gated recurrent unit (GRU) and then fuses the feature maps of CNN and GRU. The CNN-kmer fusion model is based on a simple 1-dimensional CNN and k-mer features. First, we input the k-mer features to a dense layer; the output of the dense layer is fused with CNN features. In the CNN-GRU-kmer fusion model, based on simple 1-dimensional CNN, GRU, and k-mer features, first we input the k-mer features to a dense layer; the output of the dense layer is fused with CNN features and GRU features and fed to a dense layer with a sigmoid function for prediction.
[RESULTS] The proposed models were validated in the publicly available dataset, obtaining an accuracy of 0.8631, a sensitivity of 0.7209, a specificity of 0.9353, an MCC of 0.6468, an AUC ROC of 0.8528, and an AUC PR of 0.7530. These results surpass all performance evaluation metrics of state-of-the-art models.
[CONCLUSIONS] This study presents that the model integrates semantic vector-based feature fusion representation, which effectively captures both local and global patterns with inherited spatio-temporal dependencies within complex DHSs sequences. The model's performance was validated both with and without semantic feature fusion, followed by quantitative and statistical analyses against individual models, significantly enhancing feature representation and classification performance. Source code and datasets are available at: https://github.com/malikmtahir/DNase/tree/main.
[METHODS] In this study, we propose various computational models, namely the CNN model, the CNN-GRU fusion model, the CNN-kmer fusion model, and the CNN-GRU-kmer fusion model, to overcome the challenges associated with DHSs prediction. The CNN Model is based on a simple 1-dimensional convocational neural network (CNN). The CNN-GRU fusion model is based on a simple 1-dimensional CNN and gated recurrent unit (GRU) and then fuses the feature maps of CNN and GRU. The CNN-kmer fusion model is based on a simple 1-dimensional CNN and k-mer features. First, we input the k-mer features to a dense layer; the output of the dense layer is fused with CNN features. In the CNN-GRU-kmer fusion model, based on simple 1-dimensional CNN, GRU, and k-mer features, first we input the k-mer features to a dense layer; the output of the dense layer is fused with CNN features and GRU features and fed to a dense layer with a sigmoid function for prediction.
[RESULTS] The proposed models were validated in the publicly available dataset, obtaining an accuracy of 0.8631, a sensitivity of 0.7209, a specificity of 0.9353, an MCC of 0.6468, an AUC ROC of 0.8528, and an AUC PR of 0.7530. These results surpass all performance evaluation metrics of state-of-the-art models.
[CONCLUSIONS] This study presents that the model integrates semantic vector-based feature fusion representation, which effectively captures both local and global patterns with inherited spatio-temporal dependencies within complex DHSs sequences. The model's performance was validated both with and without semantic feature fusion, followed by quantitative and statistical analyses against individual models, significantly enhancing feature representation and classification performance. Source code and datasets are available at: https://github.com/malikmtahir/DNase/tree/main.
🏷️ 키워드 / MeSH 📖 같은 키워드 OA만
🏷️ 같은 키워드 · 무료전문 — 이 논문 MeSH/keyword 기반
- A Phase I Study of Hydroxychloroquine and Suba-Itraconazole in Men with Biochemical Relapse of Prostate Cancer (HITMAN-PC): Dose Escalation Results.
- Self-management of male urinary symptoms: qualitative findings from a primary care trial.
- Clinical and Liquid Biomarkers of 20-Year Prostate Cancer Risk in Men Aged 45 to 70 Years.
- Diagnostic accuracy of Ga-PSMA PET/CT versus multiparametric MRI for preoperative pelvic invasion in the patients with prostate cancer.
- Clinical Presentation and Outcomes of Patients Undergoing Surgery for Thyroid Cancer.
- Association of patient health education with the postoperative health related quality of life in low- intermediate recurrence risk differentiated thyroid cancer patients.