본문으로 건너뛰기
← 뒤로

Construction and Validation of Multi-Omics Predictive Models for Colorectal Cancer Using Machine-Learning Approaches.

Pharmacogenomics and personalized medicine 2026 Vol.19() p. 566928

Lu Z, Li X, Liang Z, Zhang X, Tan Y, Kuang Y, Li K, Zhu X

📝 환자 설명용 한 줄

[OBJECTIVE] To construct and externally validate a multi-omics nomogram that uses only routine clinicopathological variables to predict tumor mutational burden (TMB), microsatellite instability (MSI),

🔬 핵심 임상 통계 (초록에서 자동 추출 — 원문 검증 권장)
  • 표본수 (n) 398

이 논문을 인용하기

BibTeX ↓ RIS ↓
APA Lu Z, Li X, et al. (2026). Construction and Validation of Multi-Omics Predictive Models for Colorectal Cancer Using Machine-Learning Approaches.. Pharmacogenomics and personalized medicine, 19, 566928. https://doi.org/10.2147/PGPM.S566928
MLA Lu Z, et al.. "Construction and Validation of Multi-Omics Predictive Models for Colorectal Cancer Using Machine-Learning Approaches.." Pharmacogenomics and personalized medicine, vol. 19, 2026, pp. 566928.
PMID 41947871

Abstract

[OBJECTIVE] To construct and externally validate a multi-omics nomogram that uses only routine clinicopathological variables to predict tumor mutational burden (TMB), microsatellite instability (MSI), NTRK/PIK3CA mutation status and overall survival (OS) in colorectal cancer (CRC).

[METHODS] TCGA data (n=398) served as the training set and 120 consecutive CRC patients who underwent radical resection at Yuebei People's Hospital formed the prospective validation set. After z-score normalization, 21demographic, clinical and pathological features were screened for multicollinearity (VIF<5) and redundancy via least absolute shrinkage and selection operator (LASSO) regression with 10-fold cross-validation. Optimal hyper-parameters for each algorithm were tuned by nested 10-fold grid search. Four machine-learning algorithms, logistic regression (LR), support-vector machine (SVM), decision tree (DT) and random forest (RF), were compared by area under the receiver-operating-characteristic curve (AUC), F1 score and decision-curve analysis. The best model was externally validated and calibrated with bootstrapping.

[RESULTS] The results showed that the TMB prediction model included in the MSI index had the best power when constructed by the RF method, with an area under the ROC curve value of 0.9597. For the MSI state prediction model which includes three indicators of TMB, had the best power when constructed by RF method, with AUC value of 0.8225. The and gene status prediction model, which included three indicators of TMB and MSI status, had the best power when constructed using the RF method.

[CONCLUSION] The prediction model constructed in this study can help clinicians quickly identify high-risk patients and provide a basis for formulating a reasonable treatment plan. Further optimization of the model and expansion of the sample size are required to verify its power in the future.

같은 제1저자의 인용 많은 논문 (5)