
Development and validation of a multi-center prognostic model for predicting survival in non-small cell lung cancer using pulmonary and hematological data.

Journal of Thoracic Disease (open access), 2025, Vol. 17(11), pp. 9411-9424

PICO automatic extraction (heuristic, confidence 2/4)

P · Population
1,013 patients with histologically confirmed NSCLC treated at Sichuan Cancer Hospital, Dazhu County People's Hospital, and West China Hospital between January 2014 and December 2020.
I · Intervention
Not extracted
C · Comparison
Not extracted
O · Outcome
The prognostic model provides significant clinical implications, facilitating tailored treatment planning and prognostic evaluations for NSCLC patients. Its integration into routine clinical practice could enhance decision-making processes and potentially improve patient outcomes.

Hu P, Gu H, Tang Z, Li W, Liu X, Li Q


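Key clinical statistics (automatically extracted from the abstract; verify against the original)
  • Sample size (n): 513 (training set; 1,013 patients enrolled in total)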

Cite this article

APA: Hu P, Gu H, et al. (2025). Development and validation of a multi-center prognostic model for predicting survival in non-small cell lung cancer using pulmonary and hematological data. Journal of Thoracic Disease, 17(11), 9411-9424. https://doi.org/10.21037/jtd-2025-700
MLA: Hu P, et al. "Development and validation of a multi-center prognostic model for predicting survival in non-small cell lung cancer using pulmonary and hematological data." Journal of Thoracic Disease, vol. 17, no. 11, 2025, pp. 9411-9424.
PMID: 41376927

Abstract

[BACKGROUND] Prognostic stratification in non-small cell lung cancer (NSCLC) remains challenging due to heterogeneous outcomes. This study aimed to develop and validate a clinically applicable prognostic model using multi-dimensional clinical data to improve survival prediction and support personalized therapeutic decisions.

[METHODS] We retrospectively enrolled 1,013 patients with histologically confirmed NSCLC treated at Sichuan Cancer Hospital, Dazhu County People's Hospital, and West China Hospital between January 2014 and December 2020. Inclusion criteria comprised adults with untreated, non-metastatic NSCLC, while those with asthma, chronic obstructive pulmonary disease, severe comorbidities, or concurrent malignancies were excluded. We utilized demographic, clinicopathological, and biochemical data, with follow-ups conducted via telephone. Overall survival (OS) was the primary endpoint. Predictors included pulmonary function [forced expiratory volume in one second (FEV1), maximum voluntary ventilation (MVV)], blood biomarkers [total serum bilirubin (TBIL)], and clinicopathological features. Variables were selected via backward stepwise regression with Akaike's information criterion. Performance was assessed using the C-index, calibration curves, decision curve analysis (DCA), and the area under the curve (AUC).

[RESULTS] The model was developed using a Cox proportional hazards model on a training set (n=513), tested on an internal set (n=219), and externally validated on a cohort from two other hospitals (n=281). FEV1, MVV, smoking, pathological stage, and TBIL emerged as significant prognostic factors, with C-index values of 0.740, 0.734, and 0.746 in the training, testing, and validation sets, respectively. The AUC values for 3- and 5-year OS predictions exceeded 0.70, highlighting strong model performance. Calibration plots confirmed predictive accuracy across datasets, and DCA highlighted clinical utility, especially in long-term risk stratification.

[CONCLUSIONS] We developed a prognostic model for NSCLC integrating pulmonary function, biochemical, and clinicopathological data. The prognostic model provides significant clinical implications, facilitating tailored treatment planning and prognostic evaluations for NSCLC patients. Its integration into routine clinical practice could enhance decision-making processes and potentially improve patient outcomes.


Introduction

Epidemiological data from authoritative sources, including the Global Burden of Cancer database, show that lung cancer has the highest incidence and mortality among cancers (1). In China, this malignancy imposes a substantial public health burden (2). Five-year survival rates for advanced stages are extremely low (3). Over the past decade, multiple prognostic models have been developed based on clinicopathological variables such as tumor stage, histological subtype, performance status, and smoking history. Nevertheless, these models generally demonstrate only moderate predictive accuracy and limited external validation, restricting their clinical utility. To enhance prediction performance, molecular and genetic biomarkers [such as EGFR and KRAS mutations, ATR expression, and programmed death ligand 1 (PD-L1) expression] have been incorporated into prognostic models (4,5). While biologically informative, such models face barriers to implementation due to cost, availability, and heterogeneity in testing platforms. More recently, studies have explored hematological markers and lymph node skip metastasis as prognostic indicators, given their accessibility and biological relevance (6,7). However, no existing model combines preoperative pulmonary function parameters with hematological variables.
Pulmonary function parameters, including vital capacity (VC), forced vital capacity (FVC), forced expiratory volume in one second (FEV1), and maximum voluntary ventilation (MVV), serve as critical biomarkers for respiratory health assessment. These metrics quantify physiological lung performance and are associated with both lung cancer development and clinical outcomes (8-11), which highlights their potential utility as prognostic indicators in oncology practice.
Likewise, blood tests are a promising approach that may, in theory and with further research, overcome tumor heterogeneity and provide comprehensive tumor information, especially in advanced metastatic non-small cell lung cancer (NSCLC) (12). Several recent studies have attempted to establish prognostic models based on hematological factors, yet their discriminative power has often been modest. For instance, Ma and Wang constructed a prognostic model for NSCLC integrating inflammation and nutritional indexes, yet the area under the curve (AUC) was only about 0.7 (13). Xie et al. developed a novel inflammatory and nutritional index to predict pathological complete response and survival prognosis in patients with NSCLC; although the index showed potential in predicting immunochemotherapy response, its ability to discriminate survival outcomes remained suboptimal (AUC =0.68) (14). Total serum bilirubin (TBIL) is a clinical indicator for evaluating hepatic function and biliary disorders (15). Notably, emerging evidence highlights significant associations between dysregulated TBIL levels and colorectal carcinogenesis (16), suggesting its potential as a prognostic biomarker in oncology.
In this study, using an established clinical database, we propose a new clinical evaluation framework that combines respiratory function indices with blood test indicators to predict the prognosis of lung cancer. We aimed to provide precision medicine strategies for patients in different strata and lay a theoretical foundation for further research. We present this article in accordance with the TRIPOD reporting checklist (available at https://jtd.amegroups.com/article/view/10.21037/jtd-2025-700/rc).

Methods


Participant screening and study design
In this retrospective study, we enrolled patients with NSCLC diagnosed between January 2014 and December 2020 at Sichuan Cancer Hospital and Institute. The sample size was determined to ensure statistical power based on previous studies and aimed to reach a comprehensive representation of the NSCLC population. We included cases with pathological confirmation through biopsy or surgical resection. Pathological staging was determined based on the 8th edition National Comprehensive Cancer Network Lung Cancer Classification criteria; patients with clinical stage I–IV or pathological stage I–IV NSCLC were included. Regarding pathologic type, patients diagnosed with adenocarcinoma or squamous cell carcinoma were included in the analyses. Exclusion criteria were: (I) history of bronchial asthma; (II) history of chronic obstructive pulmonary disease (a heterogeneous lung condition characterised by chronic respiratory symptoms due to abnormalities of the airways and/or alveoli that cause persistent, often progressive, airflow obstruction); (III) comorbid diabetes mellitus, cardiac dysfunction (a complex clinical syndrome that results from any structural or functional impairment of ventricular filling or ejection of blood, whose cardinal manifestations are dyspnea and fatigue), or severe hepatic impairment; (IV) non-pulmonary primary tumors or concurrent malignancies; and (V) loss to follow-up. For external validation, cohorts from West China Hospital and Dazhu County People’s Hospital were established under identical criteria to verify model generalizability. Follow-up was conducted by telephone interview to capture overall survival (OS), the primary endpoint. The study was conducted in accordance with the Declaration of Helsinki and its subsequent amendments. The study was approved by the Ethics Committee of Sichuan Cancer Hospital and Institute (No. SCCHEC-02-2021-064). All participating hospitals were informed of and agreed to the conduct of this study. Informed consent was obtained from all patients.
Clinical data were retrieved from electronic medical records, comprising demographic characteristics (sex, age), medical history, smoking status, and oncological profiles including lesion dimensions, histopathological classification, and staging criteria. Respiratory functional parameters were analyzed using standardized pulmonary function tests, while hematological biomarkers were quantified by biochemical assays of blood specimens. Pulmonary function tests and hematologic tests were conducted before the initiation of any treatment, with samples collected in a fasting state between 7 a.m. and 9 a.m. (fasting was defined as refraining from eating after 10 p.m. the previous evening until blood sample collection was completed).

Variable selection
Variables were selected through backward stepwise regression guided by the Akaike information criterion (AIC). This method starts with a full model and, at each step, removes the variable whose exclusion lowers the AIC the most, until no removal lowers it further. The procedure began by incorporating all candidate predictors (including age, sex, clinicopathological characteristics, pulmonary function parameters, and serum biochemical biomarkers) into the preliminary model. The goodness-of-fit of this model (quantified by the log-likelihood) and the corresponding AIC value were computed as baseline metrics. In each subsequent iteration, every remaining predictor was excluded in turn and the AIC of each reduced model was recalculated. The variable whose exclusion produced the largest AIC reduction was removed. This cycle was repeated until excluding any further variable no longer reduced the AIC. The final model, reached when the AIC could not be lowered further, was taken as the optimal predictive feature subset. This systematic approach retains variables with prognostic relevance while maintaining model parsimony.
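The elimination loop described above can be sketched in a few lines. This is a minimal illustration, not the authors' R code: the function names `aic`, `backward_stepwise`, and `toy_aic` are hypothetical, and the `TOY_GAIN` numbers are made up so that the example runs without fitting a real Cox model. In practice the `score` callable would refit the model on each candidate subset and return its AIC.

```python
from typing import Callable, FrozenSet, Iterable, List, Tuple

Subset = FrozenSet[str]


def aic(log_likelihood: float, n_params: int) -> float:
    """AIC = 2k - 2*logL; lower values mean a better fit/complexity trade-off."""
    return 2 * n_params - 2 * log_likelihood


def backward_stepwise(
    candidates: Iterable[str],
    score: Callable[[Subset], float],
) -> Tuple[Subset, List[Tuple[Subset, float]]]:
    """Start from the full model and drop one variable per step while AIC improves."""
    current: Subset = frozenset(candidates)
    best = score(current)
    history = [(current, best)]
    while current:
        # Try removing each remaining variable; keep the removal with the lowest AIC.
        trial_aic, drop = min((score(current - {v}), v) for v in sorted(current))
        if trial_aic >= best:
            break  # no single removal lowers the AIC any further
        current = current - {drop}
        best = trial_aic
        history.append((current, best))
    return current, history


# Made-up log-likelihood contributions (illustration only, not study data).
TOY_GAIN = {"stage": 40.0, "FEV1": 6.0, "age": 0.3, "sex": 0.1}


def toy_aic(subset: Subset) -> float:
    """Stand-in for refitting a model on `subset` and returning its AIC."""
    log_likelihood = -500.0 + sum(TOY_GAIN[v] for v in subset)
    return aic(log_likelihood, len(subset))


selected, steps = backward_stepwise(TOY_GAIN, toy_aic)
print(sorted(selected))  # ['FEV1', 'stage']
```

In this toy setup a variable is dropped whenever it adds less than one unit of log-likelihood, because removing it saves 2 AIC points for the parameter but costs twice its log-likelihood contribution.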

Development and validation of the prediction model
The cohort was split into training and testing sets in a 7:3 ratio for model construction and internal validation. External validation used independent datasets from collaborating hospitals. Model discrimination was assessed using Harrell’s concordance index (C-index), complemented by 500 bootstrap iterations generating time-dependent receiver operating characteristic (ROC) curves and the respective AUCs across datasets. These analyses characterized the 3- and 5-year survival prediction performance. Model calibration and practical utility were further evaluated using calibration plots and decision curve analyses (DCAs). Patients were stratified by model-derived risk scores, and survival in the resulting groups was compared using Kaplan-Meier curves and two-sided log-rank tests. A nomogram was constructed to visualize model utility in clinical settings.
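As a reminder of what the C-index measures, the following is a small pure-Python sketch of Harrell's concordance index for right-censored data. The function name `harrell_c_index` and the four-patient example are hypothetical; the study itself used R, and ties in survival time are ignored here for brevity.

```python
from typing import Sequence


def harrell_c_index(
    times: Sequence[float],
    events: Sequence[int],
    risk_scores: Sequence[float],
) -> float:
    """Fraction of comparable pairs in which the higher-risk patient dies first.

    A pair (i, j) is comparable when patient i has an observed death
    (events[i] == 1) at a time strictly earlier than patient j's time.
    Tied risk scores count as half-concordant.
    """
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        if not events[i]:
            continue  # a censored patient cannot be the earlier member of a pair
        for j in range(n):
            if times[i] < times[j]:
                comparable += 1
                if risk_scores[i] > risk_scores[j]:
                    concordant += 1.0
                elif risk_scores[i] == risk_scores[j]:
                    concordant += 0.5
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# Hypothetical data: survival in months, event flag (1 = death), model risk score.
example = harrell_c_index([5, 10, 15, 20], [1, 1, 0, 1], [0.9, 0.7, 0.8, 0.1])
print(example)  # 0.8
```

A value of 0.5 corresponds to random ordering and 1.0 to perfect ordering, which is the scale on which the reported C-indices should be read.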

Statistical analysis
Statistical analyses were conducted using R software (version 4.4.1). Continuous variables deviating from normal distribution criteria (assessed by normality tests) were reported as medians with interquartile ranges (IQRs) (25th–75th percentiles) [M (Q1–Q3)]. The pathological stage was analyzed as an ordinal categorical variable, comparing groups using the Kruskal-Wallis test. Categorical data were expressed as n (%) and analyzed through Chi-squared tests. Statistical significance was considered for two-tailed P values <0.05.

Results


Baseline characteristics
Figure 1 illustrates the research flow, with 1,013 eligible patients with lung cancer meeting the inclusion criteria. The cohort comprised 732 cases from Sichuan Cancer Hospital and Institute (divided into 513 training and 219 testing samples using a 7:3 ratio) and 281 external validation cases from West China Hospital and Dazhu County People’s Hospital. Median ages were 59 (IQR, 51–66), 58 (IQR, 49–65), and 61 (IQR, 53–67) years for the training, testing, and validation cohorts, respectively. Survival time was measured in months from the initial medical visit to the date of death for patients who had died, and from the initial medical visit to the last follow-up for patients who were still alive. Demographic data revealed female sex predominance (70.2%, n=711). Tumor staging distribution showed 40.6% (n=411) stage I, 15.8% (n=160) stage II, 29.3% (n=297) stage III, and 14.3% (n=145) stage IV malignancies. Complete clinical characteristics are detailed in Table 1.
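The survival-time definition given in this paragraph (initial visit to death, or to last follow-up for patients still alive) translates directly into a time/event pair. The sketch below is illustrative only: `survival_record` is a hypothetical helper, and the days-to-months conversion factor is an assumption, since the source does not state how months were computed.

```python
from datetime import date
from typing import Optional, Tuple

# Assumed conversion factor; the source does not specify one.
DAYS_PER_MONTH = 365.25 / 12


def survival_record(
    first_visit: date,
    last_follow_up: date,
    death: Optional[date] = None,
) -> Tuple[float, int]:
    """Return (survival time in months, event flag).

    event = 1 if death was observed, 0 if the patient was alive at last
    follow-up (right-censored).
    """
    end = death if death is not None else last_follow_up
    if end < first_visit:
        raise ValueError("end date precedes the initial visit")
    months = (end - first_visit).days / DAYS_PER_MONTH
    return months, int(death is not None)


# Hypothetical patients (dates are invented for illustration).
died = survival_record(date(2015, 1, 1), date(2020, 1, 1), death=date(2017, 1, 1))
alive = survival_record(date(2015, 1, 1), date(2015, 1, 31))
print(died, alive)
```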

Backward stepwise regression combined with the AIC criteria for variable selection
The initial model included the clinical characteristics, pathological characteristics, blood biochemical test results, and pulmonary function test results of the patients. Specifically, these were sex, age, pathological stage (stage), maximum tumor diameter (size), ground-glass nodule (GGN), tumor classification, smoking status (smoking), TBIL, direct bilirubin (DBIL), free bilirubin (FBIL), total protein (TP), albumin (ALB), globulin (GLB), alanine aminotransferase (ALT), aspartate aminotransferase (AST), blood urea nitrogen (UREA), creatinine (CREA), glucose (GLU), total cholesterol (TC), triglycerides (TGs), FVC, FEV1, peak expiratory flow (PEF), maximal mid-expiratory flow (MMEF), MVV, VC, and the diffusing capacity of the lung for carbon monoxide to alveolar volume ratio (DLCO/VA). According to the AIC, FEV1, MVV, smoking, stage, TBIL, and tumor classification were selected. Table 2 shows the AIC screening process. Table 3 presents the hazard ratios and P values for the six included variables. Pathological stage, smoking status, FEV1, and MVV were statistically significant variables. Pathological stage and FEV1 were the two greatest risk factors, while smoking status also demonstrated a considerable effect. The 95% confidence intervals (CIs) for pathological stage, smoking status, FEV1, TBIL, and MVV did not cross 1, indicating that the association between these variables and increased risk was both significant and reliable. In contrast, tumor classification did not reach statistical significance. Table S1 presents the variance inflation factors (VIFs) for the six variables. Pathological stage, tumor classification, smoking status, and TBIL exhibited negligible collinearity. FEV1 and MVV showed mild collinearity. Table S2 shows the C-index values of the model after individual variables were removed.
After FEV1 was removed and the model was refitted with the remaining variables, the C-indices were 0.732 (training), 0.724 (test), and 0.735 (validation). Similarly, after MVV was removed, the C-indices were 0.733 (training), 0.729 (test), and 0.733 (validation). Both sets of values were lower than those obtained with both variables included. Table S3 shows the tests of the proportional hazards assumption, performed with Schoenfeld residuals. Neither the test for any individual covariate nor the global test was statistically significant; therefore, proportional hazards can be assumed. Figure S1 shows the Schoenfeld residual test results for the six variables. Figure S2 shows the plot of deviance residuals. The regression coefficients changed very little after removing each observation, indicating that no single observation had a particularly influential effect on the model. Overall, all variables satisfied the proportional hazards assumption at conventional significance levels.

Establishing a prognostic model using six variables
A Cox proportional hazards model was constructed using six variables selected by backward stepwise regression: FEV1, MVV, TBIL, smoking status, tumor classification, and pathological stage. The model achieved C-indices of 0.740 (95% CI: 0.707–0.773), 0.734 (95% CI: 0.681–0.787), and 0.746 (95% CI: 0.701–0.791) in the training set, test set, and external validation set, respectively. Figure 2A-2C display the time-dependent ROC curves, with AUC values exceeding 0.7 across all three cohorts, indicating robust predictive performance of the model. In the training set (Figure 2D), the AUC values for predicting 3- and 5-year OS were 0.770 and 0.774, respectively. Similarly, in the test set (Figure 2E), the model showed AUCs of 0.770 and 0.789 for 3- and 5-year OS predictions, respectively. External validation (Figure 2F) confirmed the model’s reliability, with AUCs of 0.763 and 0.783 for 3- and 5-year OS, respectively.
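For readers less familiar with the model class, the standard form of a Cox proportional hazards model is written out below. The notation is introduced here for illustration and is not taken from the article: x stands for the vector of the six selected predictors, β for their fitted coefficients, and h_0 and S_0 for the baseline hazard and baseline survival function. The fitted coefficient values are not given in this text and are not assumed here.

```latex
h(t \mid \mathbf{x}) = h_0(t)\,\exp\!\left(\boldsymbol{\beta}^{\top}\mathbf{x}\right),
\qquad
S(t \mid \mathbf{x}) = S_0(t)^{\exp\left(\boldsymbol{\beta}^{\top}\mathbf{x}\right)}
```

Under this form, the predicted 3- and 5-year OS probabilities for a patient are obtained by evaluating S(t | x) at t = 36 and t = 60 months, and the linear predictor β^T x serves as the risk score used to rank patients when computing the C-index.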

The Cox model exhibits strong calibration and clinical utility
Calibration plots for 3- and 5-year OS predictions in patients with NSCLC are presented in Figure 3A (training cohort), Figure 3B (test cohort), and Figure 3C (external validation cohort). The model consistently demonstrated robust calibration accuracy across all cohorts. To evaluate clinical utility, DCA was performed. Figure 3D-3F illustrate clinically actionable threshold probabilities for 3-year OS predictions in the training, test, and external validation cohorts, respectively, while Figure 3G-3I display corresponding thresholds for 5-year OS predictions. Notably, the model exhibited enhanced clinical applicability for 5-year OS risk stratification compared to shorter-term predictions.
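The quantity plotted in a decision curve is the net benefit at each threshold probability. The sketch below shows the usual net-benefit formula for a binary outcome, under the simplifying assumption that every patient's status at the time horizon is known (no censoring); survival DCA as used in the study needs a censoring-aware estimate instead. The function names and the four-patient data are hypothetical.

```python
from typing import Sequence


def net_benefit(
    predicted_risk: Sequence[float],
    outcome: Sequence[int],
    threshold: float,
) -> float:
    """Net benefit = TP/n - FP/n * pt / (1 - pt) at threshold probability pt.

    Patients with predicted risk >= pt are treated as high risk.
    """
    if not 0.0 < threshold < 1.0:
        raise ValueError("threshold must lie strictly between 0 and 1")
    n = len(outcome)
    tp = sum(1 for p, y in zip(predicted_risk, outcome) if p >= threshold and y == 1)
    fp = sum(1 for p, y in zip(predicted_risk, outcome) if p >= threshold and y == 0)
    return tp / n - fp / n * threshold / (1.0 - threshold)


def net_benefit_treat_all(outcome: Sequence[int], threshold: float) -> float:
    """Reference strategy: classify every patient as high risk."""
    prevalence = sum(outcome) / len(outcome)
    return prevalence - (1.0 - prevalence) * threshold / (1.0 - threshold)


# Hypothetical predictions and outcomes (1 = event by the horizon).
risk = [0.9, 0.8, 0.3, 0.2]
died = [1, 1, 0, 0]
print(net_benefit(risk, died, 0.5), net_benefit_treat_all(died, 0.5))  # 0.5 0.0
```

A model is considered clinically useful over the range of thresholds where its net benefit exceeds both the treat-all reference and the treat-none reference (net benefit zero).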

Constructing a visual nomogram
FEV1, MVV, TBIL, smoking status, tumor classification, and pathological stage data of the 1,013 patients with NSCLC included in the study were integrated to generate a nomogram. Each variable was assigned points according to its contribution to the outcome in the Cox model, and the points for all variables were added to obtain the total score. The mapping between the total score and the outcome was then used to read off the predicted survival (Figure 4). For example, a patient with stage II adenocarcinoma, no history of smoking, a TBIL value of 35, an FEV1 value of 4, and an MVV value of 80 had a total score of 103. Accordingly, his predicted 3- and 5-year survival rates were more than 90%.

Comparative predictive performance of the risk stratification model
Patients were stratified into high- and low-risk cohorts based on the median predicted OS derived from the model. In the training (Figure 5A), test (Figure 5B), and validation (Figure 5C) cohorts alike, the low-risk group exhibited significantly higher OS rates than the high-risk group (P<0.001). Subsequent stratified analyses were used to evaluate the model’s discriminative capacity across clinical stages of NSCLC. In both the modeling (Figure 5D) and external validation (Figure 5E) cohorts, the model effectively differentiated survival outcomes among patients with stage I–IV NSCLC (P<0.001), demonstrating robust performance across disease severities.
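The stratification and survival comparison described above can be illustrated with a median split followed by a Kaplan-Meier estimate per group. This is a sketch under assumptions: splitting on the median of a model-derived risk score is one plausible reading of the text, the helper names are hypothetical, and the log-rank test used in the study is not reproduced here.

```python
from statistics import median
from typing import List, Sequence, Tuple


def stratify_by_median(risk_scores: Sequence[float]) -> List[str]:
    """Label each patient 'high' if the risk score exceeds the cohort median."""
    cutoff = median(risk_scores)
    return ["high" if r > cutoff else "low" for r in risk_scores]


def kaplan_meier(
    times: Sequence[float],
    events: Sequence[int],
) -> List[Tuple[float, float]]:
    """Kaplan-Meier estimate: list of (event time, survival probability)."""
    curve = []
    survival = 1.0
    at_risk = len(times)
    for t in sorted(set(times)):
        deaths = sum(1 for ti, e in zip(times, events) if ti == t and e)
        leaving = sum(1 for ti in times if ti == t)  # deaths plus censored at t
        if deaths:
            survival *= 1.0 - deaths / at_risk
            curve.append((t, survival))
        at_risk -= leaving
    return curve


# Hypothetical cohort: risk scores, survival in months, event flag (1 = death).
groups = stratify_by_median([0.1, 0.4, 0.6, 0.9])
curve = kaplan_meier([1, 2, 3, 4], [1, 0, 1, 1])
print(groups)  # ['low', 'low', 'high', 'high']
print(curve)   # [(1, 0.75), (3, 0.375), (4, 0.0)]
```

Plotting one such curve per risk group gives the kind of comparison shown in Figure 5; a formal comparison would then apply a log-rank test to the two groups.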

Discussion

In this study, we developed a Cox proportional hazards model integrating pulmonary function parameters (FEV1 and MVV), hematological indices (TBIL), and clinicopathological data to predict survival outcomes in patients with NSCLC, highlighting the synergistic impact of respiratory function and systemic metabolic status on prognosis. This multidimensional approach offers novel insights for personalized risk stratification in clinical practice.
FEV1, a cornerstone metric for assessing chronic obstructive pulmonary disease severity, is a critical predictor of surgical tolerance and postoperative pulmonary complications in lung resection candidates. Wang et al. (17) demonstrated that optimizing FEV1 reduces postoperative atelectasis risk. Emerging evidence further implicates reduced FEV1 as an independent risk factor for lung cancer incidence (18), with meta-analyses indicating that a decrease in FEV1 by 10% is associated with a 20% (95% CI: 17–23%) increase in lung cancer risk (19). FEV1 is a prognostic factor for lung cancer (20), but its clinical utility remains controversial due to inconsistent findings from underpowered studies. Our analysis of 1,013 cases across multiple centers provides robust validation of FEV1’s predictive value. Notably, the inclusion of non-surgical patients (diagnosed via biopsy without operative indications) suggests that reduced FEV1 may worsen prognosis through the airway inflammatory microenvironment.
MVV, a core parameter of pulmonary ventilatory reserve, was identified for the first time in our study as a prognostic indicator for lung cancer. MVV reflects integrated respiratory mechanics, including respiratory muscle strength, thoracic compliance, lung elasticity, and airway resistance. Targeted respiratory muscle training may enhance MVV (21,22). We hypothesize that reduced MVV may indicate respiratory muscle atrophy, potentially exacerbating tumorigenesis through hypoxia-driven pathways. Early respiratory rehabilitation in patients with lung cancer and impaired MVV could improve survival outcomes. Another point to bear in mind is the collinearity between FEV1 and MVV. We found that whether one of them was removed or both were excluded and the model was refitted, the C-index was lower than when both were included. Moreover, FEV1 and MVV are both obtained from the same pulmonary function test, so including both does not incur any additional economic cost. We also consulted clinical experts, who indicated that it is difficult to prioritize one over the other. After comprehensive consideration, we decided to include both in our model.
TBIL, a key hepatic function parameter, is typically elevated in hepatic injury, biliary obstruction, or hemolytic disorders. In the course of immunotherapy for lung cancer, some patients develop immune checkpoint inhibitor-related hepatotoxicity due to the accumulation of immune checkpoint inhibitors, with its incidence ranging from 1% to 20% (23,24). TBIL is an important monitoring index. The survival rate of patients with NSCLC and liver injury after immunotherapy is reduced (25). Emerging evidence links TBIL to prognosis across diverse diseases. For instance, Cao et al. (26) identified TBIL as a prognostic marker of colorectal cancer. Similar associations have been observed in bladder cancer (27), idiopathic pulmonary fibrosis (28), and ovarian cancer (29), underscoring its broader biological significance. Notably, Atasoy et al. (30) demonstrated a three-fold survival advantage in patients with lung cancer with high versus low TBIL levels. A prospective study conducted in Belgium by Temme et al. demonstrated a significant inverse association between elevated total bilirubin (TBIL) levels and cancer-related mortality (31). This finding may be attributed to the antioxidant, anti-inflammatory, and antiproliferative properties of bilirubin, as oxidative stress is a key contributor to carcinogenesis (32). Mechanistically, bilirubin has been shown to suppress the mammalian target of rapamycin (mTOR) signaling pathway by modulating adenosine monophosphate activated protein kinase (AMPK) activity, thereby exerting antiproliferative effects. AMPK plays a pivotal role in maintaining cellular energy homeostasis, and its regulation under elevated bilirubin levels may contribute positively to these outcomes (33).
Our findings corroborate TBIL as a predictor of NSCLC prognosis. Mechanistically, this may relate to bilirubin’s role as an endogenous antioxidant—low TBIL levels may compromise antioxidant defenses, exacerbating oxidative stress, deoxyribonucleic acid (DNA) damage, and carcinogenesis. This metric could aid in immunotherapy decision-making. However, TBIL can be influenced by various physiological and pathological factors, such as liver diseases, biliary obstruction, iron-deficiency anemia, and others. In clinical practice, we need to differentiate among patients and exclude these confounding factors.
Traditional prognostic prediction for NSCLC predominantly relies on clinical tumor node metastasis (TNM) staging. However, real-world clinical observations reveal limitations to this approach because factors such as severe pulmonary dysfunction may portend poor outcomes even in early-stage disease, further underscoring the need for biomarker-integrated models. Our study addresses this gap by incorporating readily accessible clinical parameters (TNM stage, histological subtype, smoking status, TBIL levels, FEV1, and MVV) into a machine learning-derived prognostic tool trained on a large-scale cohort. Compared with TNM staging alone, the model provides finer risk stratification of patients, exemplified by a 5-year survival rate <30% for patients with nomogram scores >180. Such stratification supports tailored management: low-risk cohorts may benefit from quality-of-life optimization, while high-risk groups warrant early multidisciplinary intervention.
We excluded patients with chronic obstructive pulmonary disease to mitigate confounding effects. The model’s utility extends beyond prognosis prediction; for instance, the assessment of lung function can guide the exercise plan for patients with lung cancer, and the quality of life can be improved following the improvement in lung function through respiratory muscle training and other approaches.
This study has some limitations. First, the model was based on baseline data, but the prognosis of patients with lung cancer may change dynamically with treatment response and complications. Second, our dataset did not include detailed treatment-related information such as specific surgical techniques, postoperative outcomes, and systemic therapy regimens. These are important factors that can influence patient prognosis, and the absence of such detailed data may introduce some residual confounding. In addition, fluctuations in TBIL levels during treatment are not captured by our current static model, which limits our understanding of how these dynamic changes affect patient outcomes. However, the framework established in our study provides a foundation that future research can build upon by incorporating more dynamic and comprehensive datasets. This will enable a deeper exploration of how detailed treatment modalities and biomarker fluctuations influence long-term prognostic outcomes, potentially enhancing the precision and applicability of prognostic models in clinical practice. Finally, although our study included data from three research centers, all of these data were collected within China. We are acutely aware of the importance of ethnic diversity in research outcomes. In the future, we will promote collaboration with more international institutions and include participants from different ethnicities and regions to validate and broaden the generalizability of our research conclusions.

Conclusions

In summary, FEV1, MVV, TBIL, smoking status, tumor classification, and pathological stage data can be used to construct a clinical prognosis prediction model for NSCLC. It is a low-cost and easy-to-obtain prognostic tool that can help clinicians make decisions and achieve more accurate individualized treatment. Future studies should elucidate its biological mechanism and explore its translational value in clinical practice.


Source: PubMed Central (JATS). Licensing follows the original publisher's policy; please credit the original article when citing.
