T-cell receptor dynamics in digestive system cancers: a multi-layer machine learning approach for tumor diagnosis and staging.
[BACKGROUND] T-cell receptor (TCR) repertoires provide insights into tumor immunology, yet their variations across digestive system cancers are not well understood.
- 연구 설계 cohort study
APA
Yuan C, Wang B, et al. (2025). T-cell receptor dynamics in digestive system cancers: a multi-layer machine learning approach for tumor diagnosis and staging.. Frontiers in immunology, 16, 1556165. https://doi.org/10.3389/fimmu.2025.1556165
MLA
Yuan C, et al.. "T-cell receptor dynamics in digestive system cancers: a multi-layer machine learning approach for tumor diagnosis and staging.." Frontiers in immunology, vol. 16, 2025, pp. 1556165.
PMID
40264789
Abstract
[BACKGROUND] T-cell receptor (TCR) repertoires provide insights into tumor immunology, yet their variations across digestive system cancers are not well understood. Characterizing TCR differences between colorectal cancer (CRC) and gastric cancer (GC), as well as developing machine learning models to distinguish cancer types, metastatic status, and disease stages are crucial for guiding clinical practices.
[METHODS] A cohort study of 143 tumor patients (96 CRC, 47 GC) was conducted. High-throughput TCR sequencing was performed to capture TCR beta (TRB), delta (TRD), and gamma (TRG) chain data. Tissue-specific patterns in TCR repertoire features, such as V-J gene recombination, complementarity-determining region 3 (CDR3) sequences, and motif distributions, were analyzed. Multi-layer machine learning-based diagnostic models were developed by leveraging motif-based feature and deep learning-based feature extraction using ProteinBERT from the 100 most abundant CDR3 sequences per sample. These models were used to differentiate CRC from GC, distinguish between primary and metastatic CRC lesions, and predict disease stages in CRC.
[RESULTS] Tissue-specific differences in TCR repertoires were observed across CRC, GC, and between primary and metastatic lesions, as well as across disease stages in CRC. Distinct V-J gene recombination patterns were identified, with CRC showing enrichment in *-* combinations, while GC exhibited higher levels of γδT-cell-related recombination. Primary and metastatic lesions of CRC patients displayed distinct V-J recombination preferences (e.g., / higher in metastatic; / higher in primary) and CDR3 sequence differences, with metastatic having shorter TRG CDR3 lengths (-value = 0.019). Across CRC stages, later stages (III-IV) showed higher clonal diversity (-value < 0.05) and stage-specific V-J patterns, alongside distinct CDR3 amino acid preferences at N-terminal (positions 1-2) and central positions (positions 5-12). Multi-dimensional machine learning models demonstrated exceptional diagnostic performance across all classification tasks. For distinguishing CRC from GC, the model achieved an accuracy of 97.9% and an area under the curve (AUC) of 0.996. For differentiating primary from metastatic CRC, the model achieved 100% accuracy with an AUC of 1.000. In predicting CRC disease stages, the model attained an accuracy of 96.9% and an AUC of 0.993. Extensive validation using simulated and publicly available datasets, confirmed the robustness and reliability of the models, demonstrating consistent performance across diverse datasets and experimental conditions.
[CONCLUSIONS] Our investigation provides novel insights into TCR repertoire variations in digestive system tumors, and highlight the potential of immune repertoire features as powerful diagnostic tools for understanding cancer progression and potentially improving clinical decision-making.
[METHODS] A cohort study of 143 tumor patients (96 CRC, 47 GC) was conducted. High-throughput TCR sequencing was performed to capture TCR beta (TRB), delta (TRD), and gamma (TRG) chain data. Tissue-specific patterns in TCR repertoire features, such as V-J gene recombination, complementarity-determining region 3 (CDR3) sequences, and motif distributions, were analyzed. Multi-layer machine learning-based diagnostic models were developed by leveraging motif-based feature and deep learning-based feature extraction using ProteinBERT from the 100 most abundant CDR3 sequences per sample. These models were used to differentiate CRC from GC, distinguish between primary and metastatic CRC lesions, and predict disease stages in CRC.
[RESULTS] Tissue-specific differences in TCR repertoires were observed across CRC, GC, and between primary and metastatic lesions, as well as across disease stages in CRC. Distinct V-J gene recombination patterns were identified, with CRC showing enrichment in *-* combinations, while GC exhibited higher levels of γδT-cell-related recombination. Primary and metastatic lesions of CRC patients displayed distinct V-J recombination preferences (e.g., / higher in metastatic; / higher in primary) and CDR3 sequence differences, with metastatic having shorter TRG CDR3 lengths (-value = 0.019). Across CRC stages, later stages (III-IV) showed higher clonal diversity (-value < 0.05) and stage-specific V-J patterns, alongside distinct CDR3 amino acid preferences at N-terminal (positions 1-2) and central positions (positions 5-12). Multi-dimensional machine learning models demonstrated exceptional diagnostic performance across all classification tasks. For distinguishing CRC from GC, the model achieved an accuracy of 97.9% and an area under the curve (AUC) of 0.996. For differentiating primary from metastatic CRC, the model achieved 100% accuracy with an AUC of 1.000. In predicting CRC disease stages, the model attained an accuracy of 96.9% and an AUC of 0.993. Extensive validation using simulated and publicly available datasets, confirmed the robustness and reliability of the models, demonstrating consistent performance across diverse datasets and experimental conditions.
[CONCLUSIONS] Our investigation provides novel insights into TCR repertoire variations in digestive system tumors, and highlight the potential of immune repertoire features as powerful diagnostic tools for understanding cancer progression and potentially improving clinical decision-making.
MeSH Terms
Humans; Machine Learning; Neoplasm Staging; Male; Stomach Neoplasms; Female; Middle Aged; Receptors, Antigen, T-Cell; Complementarity Determining Regions; Colorectal Neoplasms; Aged; Digestive System Neoplasms; High-Throughput Nucleotide Sequencing
같은 제1저자의 인용 많은 논문 (5)
- Corrigendum to "circSCYL2 antagonizes hnRNPA2B1-mediated m6A-chromatin crosstalk to promote ferroptosis and inhibit metastasis of breast cancer to brain" [Int. J. Biol. Macromol., 343 (2026) 149699].
- circSCYL2 antagonizes hnRNPA2B1-mediated m6A-chromatin crosstalk to promote ferroptosis and inhibit metastasis of breast cancer to brain.
- Erratum: Author correction to "Self-illuminating liposome-derived triggerable photodynamic therapy combining radionuclide therapy for synergistic treatment of lung cancer" [Acta Pharm Sin B 15 (2025) 4973-4994].
- Clinical relevance and mechanistic investigation of miR-1908-5p as a prognostic biomarker in colorectal cancer.
- The preoperative albumin-to-carcinoembryonic antigen ratio (ACR) predicts prognosis and facilitates risk stratification in gastric cancer: a retrospective cohort study.