Exploring the recurrence and metastasis of breast invasive ductal carcinoma based on machine learning and survival analysis.
[OBJECTIVE] Invasive ductal carcinoma (IDC), the predominant histopathological subtype comprising about 80% of breast malignancies, continues to pose a significant clinical challenge due to frequent r
- 표본수 (n) 303
- Sensitivity 96.3%
- Specificity 79.6%
- 추적기간 5.7 years
APA
Xu A, Weng X, et al. (2026). Exploring the recurrence and metastasis of breast invasive ductal carcinoma based on machine learning and survival analysis.. Frontiers in oncology, 16, 1734379. https://doi.org/10.3389/fonc.2026.1734379
MLA
Xu A, et al.. "Exploring the recurrence and metastasis of breast invasive ductal carcinoma based on machine learning and survival analysis.." Frontiers in oncology, vol. 16, 2026, pp. 1734379.
PMID
41907633
Abstract
[OBJECTIVE] Invasive ductal carcinoma (IDC), the predominant histopathological subtype comprising about 80% of breast malignancies, continues to pose a significant clinical challenge due to frequent recurrence. Existing relapse prediction models remain limited in accuracy and generalizability. This study aimed to construct and validate machine learning-based models for predicting 5-year (short- to medium term) recurrence and metastasis risk in IDC, based on recurrence-free survival (RFS) analysis.
[METHODS] A total of 640 IDC cases diagnosed between January 2017 and December 2019 were enrolled, data were partitioned into three sets: the training set (n = 303) from Fudan University Shanghai Cancer Center; the validation set (n = 217) from Shaoxing Central Hospital; and the test set (n = 120) from Zhejiang Cancer Hospital. Independent prognostic factors were identified through univariate and multivariate Cox regression analyses. Three predictive strategies were implemented: evaluating recurrence risk, distinguishing local from distant recurrence, and identifying metastatic sites. Light Gradient Boosting Machine (LGBM), XGBoost (XGB), Random Forest (RF), k-Nearest Neighbor (KNN), Neural Network (NN), and Support Vector Machine (SVM) were trained and validated.
[RESULTS] The median follow-up duration was 5.7 years. Multivariate Cox regression analyses identified multiple factors significantly associated with RFS, including the rad-score, Ki-67 index, lymph node metastasis, tumor histological grade, and breast cancer family history in first- or second-degree relatives (all < 0.05). In contrast, age, menopausal status, and molecular subtype showed no significant association with recurrence risk in this cohort ( 0.987, 0.987, and 0.960, respectively). The clinical-radiomic nomogram demonstrated strong in predictive IDC recurrence. The XGBoost model demonstrated robust and consistent predictive performance across all cohorts, achieving AUCs of 0.842, 0.848, and 0.912 on the training, validation, and test sets, respectively. On the independent test set, the model attained an accuracy of 93.8%, sensitivity of 96.3%, and specificity of 79.6%.Furthermore, density plots of the radiomic score and Ki-67 index effectively differentiated between local recurrence, bone metastasis, and metastases to other organs. Patients with lymph node metastasis and high histological grade demonstrated a higher frequency of metastases to distant organs, accounting for most cases and emphasizing the contrast with local recurrence and bone metastasis. Patients with a breast cancer family history displayed a distinct pattern of bone metastasis.
[CONCLUSION] This study underscores the utility of machine learning models in forecasting recurrence and metastatic behavior in IDC. The clinical-radiomic nomograms proved valuable for individualized surgical and therapeutic decision-making in IDC patients.
[METHODS] A total of 640 IDC cases diagnosed between January 2017 and December 2019 were enrolled, data were partitioned into three sets: the training set (n = 303) from Fudan University Shanghai Cancer Center; the validation set (n = 217) from Shaoxing Central Hospital; and the test set (n = 120) from Zhejiang Cancer Hospital. Independent prognostic factors were identified through univariate and multivariate Cox regression analyses. Three predictive strategies were implemented: evaluating recurrence risk, distinguishing local from distant recurrence, and identifying metastatic sites. Light Gradient Boosting Machine (LGBM), XGBoost (XGB), Random Forest (RF), k-Nearest Neighbor (KNN), Neural Network (NN), and Support Vector Machine (SVM) were trained and validated.
[RESULTS] The median follow-up duration was 5.7 years. Multivariate Cox regression analyses identified multiple factors significantly associated with RFS, including the rad-score, Ki-67 index, lymph node metastasis, tumor histological grade, and breast cancer family history in first- or second-degree relatives (all < 0.05). In contrast, age, menopausal status, and molecular subtype showed no significant association with recurrence risk in this cohort ( 0.987, 0.987, and 0.960, respectively). The clinical-radiomic nomogram demonstrated strong in predictive IDC recurrence. The XGBoost model demonstrated robust and consistent predictive performance across all cohorts, achieving AUCs of 0.842, 0.848, and 0.912 on the training, validation, and test sets, respectively. On the independent test set, the model attained an accuracy of 93.8%, sensitivity of 96.3%, and specificity of 79.6%.Furthermore, density plots of the radiomic score and Ki-67 index effectively differentiated between local recurrence, bone metastasis, and metastases to other organs. Patients with lymph node metastasis and high histological grade demonstrated a higher frequency of metastases to distant organs, accounting for most cases and emphasizing the contrast with local recurrence and bone metastasis. Patients with a breast cancer family history displayed a distinct pattern of bone metastasis.
[CONCLUSION] This study underscores the utility of machine learning models in forecasting recurrence and metastatic behavior in IDC. The clinical-radiomic nomograms proved valuable for individualized surgical and therapeutic decision-making in IDC patients.
같은 제1저자의 인용 많은 논문 (5)
- Comparing the impact of three-dimensional digital visualization technology versus traditional microscopy on microsurgeons in microsurgery: a prospective self-controlled study.
- SUMO1 promotes malignancy and chemoresistance in colorectal cancer: Clinical and functional evidence.
- Emotion Expression in Breast Cancer Support Seeking: Empirical Study of an Online Community.
- Development of Teleoperated Robotic System for Remote Intraocular Microsurgery.
- CIRC_0000075 targeting MIR-218-5P is involved in the prognosis and mechanism of acute myeloid leukemia.