본문으로 건너뛰기
← 뒤로

Compression and k-mer Based Approach for Anticancer Peptide Analysis.

IEEE transactions on computational biology and bioinformatics 2026 Vol.PP()

Ali S, Ali TE, Chourasia P, Patterson M

📝 환자 설명용 한 줄

Anti-cancer peptide (ACP) sequence classification is crucial for cancer treatment development.

이 논문을 인용하기

BibTeX ↓ RIS ↓
APA Ali S, Ali TE, et al. (2026). Compression and k-mer Based Approach for Anticancer Peptide Analysis.. IEEE transactions on computational biology and bioinformatics, PP. https://doi.org/10.1109/TCBBIO.2026.3685197
MLA Ali S, et al.. "Compression and k-mer Based Approach for Anticancer Peptide Analysis.." IEEE transactions on computational biology and bioinformatics, vol. PP, 2026.
PMID 41996441

Abstract

Anti-cancer peptide (ACP) sequence classification is crucial for cancer treatment development. Current neural network approaches achieve high accuracy but require substantial parameters and training data. Recent compression-based methods compress entire sequences, potentially missing fine grained neighboring information critical for classification. We propose a novel approach integrating Gzip compression with an incremental k-mer strategy. Unlike conventional methods, we compress individual k-mers and incrementally build subsequence compressions, preserving amino acid-level context. Using Normalized Compression Distance (NCD) and kernel-based embeddings, our parameter-free method achieves state-of-the-art performance on breast and lung cancer ACP datasets, outperforming deep neural networks and large language models without requiring custom features or pre-trained models. Our approach provides a practical, efficient alternative to computationally intensive methods, proving effective even in low-resource environments.

같은 제1저자의 인용 많은 논문 (5)