Detecting and mitigating doppelgänger bias in microbiome data: impacts on machine learning and disease classification.
1/5 보강
Highly similar microbiome samples - so-called "doppelgänger pairs" - can distort analysis outcomes, yet are rarely addressed in microbiome studies.
APA
Zhou R, Ng SK, et al. (2025). Detecting and mitigating doppelgänger bias in microbiome data: impacts on machine learning and disease classification.. Gut microbes, 17(1), 2554196. https://doi.org/10.1080/19490976.2025.2554196
MLA
Zhou R, et al.. "Detecting and mitigating doppelgänger bias in microbiome data: impacts on machine learning and disease classification.." Gut microbes, vol. 17, no. 1, 2025, pp. 2554196.
PMID
40888678 ↗
Abstract 한글 요약
Highly similar microbiome samples - so-called "doppelgänger pairs" - can distort analysis outcomes, yet are rarely addressed in microbiome studies. Here, we demonstrate that even a small proportion of such pairs (1-10% of samples) can substantially inflate machine learning performance across diverse disease cohorts including colorectal cancer (CRC), inflammatory bowel diseases (IBD), infection (CDI), and obesity. Doppelgänger pairs also bias statistical tests and distort microbial network topology. In predictive models, classification accuracy was artificially boosted by 15-30% points across KNN, SVM, and Random Forest classifiers. In association testing, doppelgängers increased false-positive rates and decreased effect size stability; their removal reduced bootstrap variance by up to 28.3%. Moreover, the removal of doppelgängers yielded more stable networks. These effects were consistently observed across 16S, shotgun metagenomic, and simulated datasets. By accounting for highly similar samples, we reduce analytical noise and false discoveries, ultimately enabling more accurate and biologically meaningful microbiome insights.
🏷️ 키워드 / MeSH 📖 같은 키워드 OA만
같은 제1저자의 인용 많은 논문 (5)
- BTX-A Rejuvenation: Regional Botulinum Toxin-A Injection of the Platysma in Patients with Facial Sagging.
- High accuracy breast cancer classification with BIRADS and coclustering.
- DNMT2 inhibits anaplastic thyroid cancer progression by downregulating 5'tiRNA production.
- Zhilining formula suppresses ferroptosis in colonic epithelial cells by inhibiting ALOX15/15(S)-HPETE to repress colorectal tumorigenesis and progression.
- Circular RNA-based HPV16 therapeutic vaccine elicits potent and durable antitumor immunity.
🏷️ 같은 키워드 · 무료전문 — 이 논문 MeSH/keyword 기반
- A Phase I Study of Hydroxychloroquine and Suba-Itraconazole in Men with Biochemical Relapse of Prostate Cancer (HITMAN-PC): Dose Escalation Results.
- Self-management of male urinary symptoms: qualitative findings from a primary care trial.
- Clinical and Liquid Biomarkers of 20-Year Prostate Cancer Risk in Men Aged 45 to 70 Years.
- Diagnostic accuracy of Ga-PSMA PET/CT versus multiparametric MRI for preoperative pelvic invasion in the patients with prostate cancer.
- Clinical Presentation and Outcomes of Patients Undergoing Surgery for Thyroid Cancer.
- Association of patient health education with the postoperative health related quality of life in low- intermediate recurrence risk differentiated thyroid cancer patients.