Development and validation of machine learning models for predicting lung metastasis risk in differentiated thyroid cancer based on two databases.
1/5 보강
PICO 자동 추출 (휴리스틱, conf 2/4)
유사 논문P · Population 대상 환자/모집단
환자: lung metastasis (LM) have a poor prognosis
I · Intervention 중재 / 시술
추출되지 않음
C · Comparison 대조 / 비교
추출되지 않음
O · Outcome 결과 / 결론
[CONCLUSIONS] The GBM model emerged as an effective tool for identifying high-risk LM populations in DTC, with the potential to guide clinical practice and facilitate the development of individualized treatment plans. Further research to validate these findings across more diverse patient populations and clinical settings is recommended.
[BACKGROUND] Differentiated thyroid cancer (DTC) progresses slowly, but patients with lung metastasis (LM) have a poor prognosis.
APA
Shen H, Yang C, et al. (2024). Development and validation of machine learning models for predicting lung metastasis risk in differentiated thyroid cancer based on two databases.. Gland surgery, 13(11), 2174-2188. https://doi.org/10.21037/gs-24-481
MLA
Shen H, et al.. "Development and validation of machine learning models for predicting lung metastasis risk in differentiated thyroid cancer based on two databases.." Gland surgery, vol. 13, no. 11, 2024, pp. 2174-2188.
PMID
39678420 ↗
Abstract 한글 요약
[BACKGROUND] Differentiated thyroid cancer (DTC) progresses slowly, but patients with lung metastasis (LM) have a poor prognosis. The aim of this study was to develop and evaluate the predictive ability of machine learning (ML) models in estimating the risk of LM in patients with DTC and to identify the independent risk factors specific to different age and gender subgroups.
[METHODS] The demographic and clinicopathological data of patients with DTC were obtained from two databases: firstly, the National Institutes of Health Surveillance, Epidemiology, and End Results (SEER) database [2010-2015], which provides extensive epidemiological and clinical information on cancer patients; secondly, the Zhangzhou Municipal Hospital Affiliated to Fujian Medical University [2014-2017], which focuses more on patients' specific clinicopathological characteristics and treatment outcomes. Common variables from both databases were extracted. The data were then split into training, testing and validation sets. The training set was used to build and train ML models, while the testing and validation set were employed to assess the performance of these models. In terms of model development, we established five different ML models: logistic regression (LR), random forest (RF), decision tree (DT), extreme gradient boosting (XGBoost), and gradient boosting machine (GBM). For model validation, we utilized various evaluation metrics, including accuracy, precision, recall, F1 score, Brier score, area under the receiver operating characteristic (ROC) curve (AUROC), area under the precision-recall (PR) curve (PR-AUC), calibration curve, and decision curve analysis (DCA). The importance of various features was ranked and visualized for the top-performing models.
[RESULTS] The analysis identified age, gender, tumor size, T stage, N stage, and histologic type as significant independent risk factors for LM. The effects of gender, T stage, and histological type on the risk of LM varied across the different age subgroups. In the female population, tumor size was an independent risk factor for LM, while it was not in the male population. GBM achieved an AUROC of 0.982, a Brier score of 0.047, an accuracy of 0.818, and an F1 score of 0.818 in the validation set, outperforming the other models.
[CONCLUSIONS] The GBM model emerged as an effective tool for identifying high-risk LM populations in DTC, with the potential to guide clinical practice and facilitate the development of individualized treatment plans. Further research to validate these findings across more diverse patient populations and clinical settings is recommended.
[METHODS] The demographic and clinicopathological data of patients with DTC were obtained from two databases: firstly, the National Institutes of Health Surveillance, Epidemiology, and End Results (SEER) database [2010-2015], which provides extensive epidemiological and clinical information on cancer patients; secondly, the Zhangzhou Municipal Hospital Affiliated to Fujian Medical University [2014-2017], which focuses more on patients' specific clinicopathological characteristics and treatment outcomes. Common variables from both databases were extracted. The data were then split into training, testing and validation sets. The training set was used to build and train ML models, while the testing and validation set were employed to assess the performance of these models. In terms of model development, we established five different ML models: logistic regression (LR), random forest (RF), decision tree (DT), extreme gradient boosting (XGBoost), and gradient boosting machine (GBM). For model validation, we utilized various evaluation metrics, including accuracy, precision, recall, F1 score, Brier score, area under the receiver operating characteristic (ROC) curve (AUROC), area under the precision-recall (PR) curve (PR-AUC), calibration curve, and decision curve analysis (DCA). The importance of various features was ranked and visualized for the top-performing models.
[RESULTS] The analysis identified age, gender, tumor size, T stage, N stage, and histologic type as significant independent risk factors for LM. The effects of gender, T stage, and histological type on the risk of LM varied across the different age subgroups. In the female population, tumor size was an independent risk factor for LM, while it was not in the male population. GBM achieved an AUROC of 0.982, a Brier score of 0.047, an accuracy of 0.818, and an F1 score of 0.818 in the validation set, outperforming the other models.
[CONCLUSIONS] The GBM model emerged as an effective tool for identifying high-risk LM populations in DTC, with the potential to guide clinical practice and facilitate the development of individualized treatment plans. Further research to validate these findings across more diverse patient populations and clinical settings is recommended.
🏷️ 키워드 / MeSH 📖 같은 키워드 OA만
같은 제1저자의 인용 많은 논문 (5)
- Flavonoids and polycystic ovary syndrome.
- Impact of Fusion Partners and Transplantation Benefit in Intensively Treated -Rearranged Acute Myeloid Leukemia.
- Proteomic landscape of colorectal cancer liver metastasis: molecular signatures and novel therapeutic targets.
- A nanoscale cascade system integrating calcium interference with phototherapy-chemotherapy synergy for enhanced antitumor efficacy.
- Contrastive report and multiparametric dual-region magnetic resonance imaging learning for the preoperative prediction of axillary lymph node metastasis in breast cancer.
🏷️ 같은 키워드 · 무료전문 — 이 논문 MeSH/keyword 기반
- Correction: Survival disparities and predictors in gastric cancer: a population-based study from Kazakhstan (2012-2023).
- AllergoOncology in Review: Harnessing Allergy in the Field of Oncology to Improve Patient Outcomes.
- Patterns and prognostic implications of cutaneous metastasis in Hong Kong: A multicenter analysis.
- Integrative Molecular Insights Into Epidemiological, Genetic, and Metabolic Risk Factors of Gallbladder Cancer: Implications for Biomarkers, Therapeutic Targeting, and Future Perspectives.
- Real-world outcomes of inotuzumab ozogamicin treatment for adult relapsed or refractory acute lymphoblastic leukemia: a result from Korea post-marketing surveillance.
- Enhancing access to treatment and programmes for viral hepatitis in an endemic country: a narrative review of literature from 2000 to 2025 (Mongolia).