본문으로 건너뛰기
← 뒤로

A transformer and 3D CNN-based feature fusion network with interpretable ability for Raman spectra analysis: improving the diagnosis of thyroid cancer.

2/5 보강
Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy 📖 저널 OA 7.8% 2023: 0/1 OA 2024: 0/1 OA 2025: 0/13 OA 2026: 5/49 OA 2023~2026 2026 Vol.354() p. 127623 Spectroscopy Techniques in Biomedica
TL;DR A novel multimodal deep learning framework that synergistically integrates 1D spectral and 2D spatiotemporal features that establishes a novel method for thyroid cancer diagnosis, achieving both exceptional diagnostic performance and significantly enhanced model interpretability.
Retraction 확인
출처
PubMed DOI OpenAlex Semantic 마지막 보강 2026-04-28
OpenAlex 토픽 · Spectroscopy Techniques in Biomedical and Chemical Research AI in cancer detection Gold and Silver Nanoparticles Synthesis and Applications

Sun Y, Fan D, Jia C, Li Q, Ma P, Zhang X

📝 환자 설명용 한 줄

A novel multimodal deep learning framework that synergistically integrates 1D spectral and 2D spatiotemporal features that establishes a novel method for thyroid cancer diagnosis, achieving both excep

이 논문을 인용하기

↓ .bib ↓ .ris
APA Yu Sun, Dandan Fan, et al. (2026). A transformer and 3D CNN-based feature fusion network with interpretable ability for Raman spectra analysis: improving the diagnosis of thyroid cancer.. Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy, 354, 127623. https://doi.org/10.1016/j.saa.2026.127623
MLA Yu Sun, et al.. "A transformer and 3D CNN-based feature fusion network with interpretable ability for Raman spectra analysis: improving the diagnosis of thyroid cancer.." Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy, vol. 354, 2026, pp. 127623.
PMID 41762798 ↗

Abstract

Accurate differentiation of benign and malignant thyroid lesions continues to pose a significant clinical challenge. Raman spectroscopy offers label-free molecular fingerprints of cells, yet the identification of diagnostic spectral patterns remains challenging. While artificial intelligence has been applied to analyze Raman data as one-dimensional (1D) signals, such approaches may overlook subtle nonlinear relationships across wavenumbers, particularly in cases involving spectrally similar constituents. Converting 1D spectral data into two-dimensional (2D) representations can preserve both amplitude and positional correlations, thereby uncovering latent temporal and structural features. However, such transformations risk incurring information loss, the extent of which is contingent upon the encoding strategy employed. To address this, we propose a novel multimodal deep learning framework that synergistically integrates 1D spectral and 2D spatiotemporal features, representing the first application in Raman-based thyroid cancer detection. Our model uniquely combines a Transformer to capture global dependencies in 1D spectra and a 3D-CNN to extract local spatial patterns from multiple 2D spectral transformations. These dual-modality features are adaptively fused through a multi-head cross-attention mechanism, enabling dynamic feature integration. The multimodal model ultimately achieves an accuracy of 94.7% in the identification of thyroid lesions, outperforming the unimodal Transformer and 3D-CNN models, which achieve accuracies of 91.0% and 89.4%, respectively. Notably, the multimodal model enhances interpretability by identifying contributions of key Raman peaks to the classification decision. Thus, the integration of SERS with explainable deep learning establishes a novel method for thyroid cancer diagnosis, achieving both exceptional diagnostic performance and significantly enhanced model interpretability.

🏷️ 키워드 / MeSH 📖 같은 키워드 OA만

같은 제1저자의 인용 많은 논문 (5)

🏷️ 같은 키워드 · 무료전문 — 이 논문 MeSH/keyword 기반