본문으로 건너뛰기
← 뒤로

Machine Learning-Based Lung Cancer Classification Using Blood-Derived Microbial DNA: A Comparative Analysis of Taxonomic Profiling Strategies.

2/5 보강
Diagnostics (Basel, Switzerland) 📖 저널 OA 100% 2021: 4/4 OA 2022: 16/16 OA 2023: 20/20 OA 2024: 45/45 OA 2025: 135/135 OA 2026: 136/136 OA 2021~2026 2026 Vol.16(7) OA Cancer Genomics and Diagnostics
Retraction 확인
출처
PubMed DOI PMC OpenAlex 마지막 보강 2026-04-30
OpenAlex 토픽 · Cancer Genomics and Diagnostics Lung Cancer Diagnosis and Treatment Gut microbiota and health

Goh CJ, Park J, Kim Y, Park D, Kim J, Kwon SJ, Kim MJ, Lee MS

📝 환자 설명용 한 줄

: Blood-derived circulating cell-free microbial DNA (cfmDNA) has emerged as a potential non-invasive biomarker source for cancer detection.

이 논문을 인용하기

↓ .bib ↓ .ris
APA Chul-Jun Goh, Jiwoo Park, et al. (2026). Machine Learning-Based Lung Cancer Classification Using Blood-Derived Microbial DNA: A Comparative Analysis of Taxonomic Profiling Strategies.. Diagnostics (Basel, Switzerland), 16(7). https://doi.org/10.3390/diagnostics16071079
MLA Chul-Jun Goh, et al.. "Machine Learning-Based Lung Cancer Classification Using Blood-Derived Microbial DNA: A Comparative Analysis of Taxonomic Profiling Strategies.." Diagnostics (Basel, Switzerland), vol. 16, no. 7, 2026.
PMID 41975790 ↗

Abstract

: Blood-derived circulating cell-free microbial DNA (cfmDNA) has emerged as a potential non-invasive biomarker source for cancer detection. However, low biomass and high susceptibility to analytical variability raise concerns regarding the stability and interpretability of inferred microbial signatures. This study aimed to evaluate how different taxonomic profiling strategies influence downstream machine learning-based classification and feature interpretation in lung cancer. : cfDNA sequencing data from 168 individuals (80 lung cancer patients and 88 non-cancer controls) were analyzed using two taxonomic profiling workflows: a Bracken-based abundance estimation approach and a BLAST-refined alignment-based strategy. Microbial profiles derived from each pipeline were evaluated using supervised machine learning models within a nested cross-validation framework. Feature stability and fold-change trends were compared across profiling strategies. : A Random Forest model achieved robust classification performance under both workflows (AUC 0.852 for Bracken-derived data and 0.906 for BLAST-derived data). However, substantial pipeline-dependent variation was observed in feature selection patterns and quantitative fold-change directionality. Although 13 genera were consistently selected across cross-validation folds in both workflows, the magnitude and direction of abundance differences were not uniformly concordant. : Blood-derived microbial DNA profiles can support machine learning-based lung cancer classification; however, feature-level interpretation remains sensitive to taxonomic assignment strategy. These findings underscore the importance of pipeline-aware interpretation and methodological transparency in low-biomass blood microbiome research.

🏷️ 키워드 / MeSH 📖 같은 키워드 OA만

🏷️ 같은 키워드 · 무료전문 — 이 논문 MeSH/keyword 기반

🟢 PMC 전문 열기