Machine learning-based proteogenomic data modeling identifies circulating plasma biomarkers for early detection of lung cancer.
1/5 보강
PICO 자동 추출 (휴리스틱, conf 2/4)
유사 논문P · Population 대상 환자/모집단
[RESULTS] Using the identified proteins, machine learning models achieved median cross-validated AUCs of 0.85-0.88 (0-4 years before diagnosis [YBD]), 0.81-0.84 (5-9 YBD), and 0.80-0.86 (0-9 YBD).
I · Intervention 중재 / 시술
추출되지 않음
C · Comparison 대조 / 비교
추출되지 않음
O · Outcome 결과 / 결론
[CONCLUSIONS] While these findings do not establish mechanistic causality, they highlight proteomic alterations reflecting systemic changes preceding the diagnosis. Our study contributes to understanding genome-proteome relationships in lung cancer and identifies circulating proteins warranting further investigation as potential early biomarkers for screening and risk stratification.
[BACKGROUND] Genetic aberrations are among the critical driving factors of lung cancer.
APA
Johnson MA, Nieves-Rodriguez S, et al. (2026). Machine learning-based proteogenomic data modeling identifies circulating plasma biomarkers for early detection of lung cancer.. Communications medicine. https://doi.org/10.1038/s43856-026-01500-1
MLA
Johnson MA, et al.. "Machine learning-based proteogenomic data modeling identifies circulating plasma biomarkers for early detection of lung cancer.." Communications medicine, 2026.
PMID
41826692 ↗
Abstract 한글 요약
[BACKGROUND] Genetic aberrations are among the critical driving factors of lung cancer. Importantly, the impact of genetic variations on proteomic dysregulations with the goal of characterizing potential diagnostic biomarkers at the population-level requires additional investigation. Modeling such proteogenomic interactions is crucial in understanding early-stage biological disruptions to inform biomarker discovery, successful clinical trials, and developing effective therapeutics.
[METHODS] We investigated two complementary aspects of lung cancer risk. First, we performed a genome-wide association study of lung cancer using population-scale datasets, then examined whether lung cancer risk-associated variants influence plasma protein levels using the UK Biobank Pharma Proteomics Project data. Second, we identified plasma proteomic dysregulations in presymptomatic and symptomatic patients with the objective of pinpointing diagnostic biomarkers through leveraging machine learning methods.
[RESULTS] Using the identified proteins, machine learning models achieved median cross-validated AUCs of 0.85-0.88 (0-4 years before diagnosis [YBD]), 0.81-0.84 (5-9 YBD), and 0.80-0.86 (0-9 YBD). Performing survival analyses within the 5-9 YBD group, elevated levels of eight proteins, such as CALCB, PLAUR, and CD74, were found to significantly associate with lower survival. We identified 22 disease-associated proteins, of which 14 have been previously implicated in lung cancer, including CEACAM5, CXCL17, GDF15, WFDC2 along with 8 novel proteins. These proteins were enriched in pathways related to cytokine signaling, interleukin regulation, neutrophil degranulation, and lung fibrosis.
[CONCLUSIONS] While these findings do not establish mechanistic causality, they highlight proteomic alterations reflecting systemic changes preceding the diagnosis. Our study contributes to understanding genome-proteome relationships in lung cancer and identifies circulating proteins warranting further investigation as potential early biomarkers for screening and risk stratification.
[METHODS] We investigated two complementary aspects of lung cancer risk. First, we performed a genome-wide association study of lung cancer using population-scale datasets, then examined whether lung cancer risk-associated variants influence plasma protein levels using the UK Biobank Pharma Proteomics Project data. Second, we identified plasma proteomic dysregulations in presymptomatic and symptomatic patients with the objective of pinpointing diagnostic biomarkers through leveraging machine learning methods.
[RESULTS] Using the identified proteins, machine learning models achieved median cross-validated AUCs of 0.85-0.88 (0-4 years before diagnosis [YBD]), 0.81-0.84 (5-9 YBD), and 0.80-0.86 (0-9 YBD). Performing survival analyses within the 5-9 YBD group, elevated levels of eight proteins, such as CALCB, PLAUR, and CD74, were found to significantly associate with lower survival. We identified 22 disease-associated proteins, of which 14 have been previously implicated in lung cancer, including CEACAM5, CXCL17, GDF15, WFDC2 along with 8 novel proteins. These proteins were enriched in pathways related to cytokine signaling, interleukin regulation, neutrophil degranulation, and lung fibrosis.
[CONCLUSIONS] While these findings do not establish mechanistic causality, they highlight proteomic alterations reflecting systemic changes preceding the diagnosis. Our study contributes to understanding genome-proteome relationships in lung cancer and identifies circulating proteins warranting further investigation as potential early biomarkers for screening and risk stratification.