본문으로 건너뛰기
← 뒤로

LLM2image: A novel framework for accurate diagnosis of diarrhea viruses using pathological images and semantic information.

1/5 보강
Computerized medical imaging and graphics : the official journal of the Computerized Medical Imaging Society 📖 저널 OA 4% 2023: 0/1 OA 2025: 0/9 OA 2026: 1/12 OA 2023~2026 2026 Vol.129() p. 102735
Retraction 확인
출처

Liu L, Niu Z, Zhao F, Zhang L

📝 환자 설명용 한 줄

Accurate diagnosis of diarrheal viruses from histopathological images is critical for veterinary medicine and animal disease control.

이 논문을 인용하기

↓ .bib ↓ .ris
APA Liu L, Niu Z, et al. (2026). LLM2image: A novel framework for accurate diagnosis of diarrhea viruses using pathological images and semantic information.. Computerized medical imaging and graphics : the official journal of the Computerized Medical Imaging Society, 129, 102735. https://doi.org/10.1016/j.compmedimag.2026.102735
MLA Liu L, et al.. "LLM2image: A novel framework for accurate diagnosis of diarrhea viruses using pathological images and semantic information.." Computerized medical imaging and graphics : the official journal of the Computerized Medical Imaging Society, vol. 129, 2026, pp. 102735.
PMID 41734695 ↗

Abstract

Accurate diagnosis of diarrheal viruses from histopathological images is critical for veterinary medicine and animal disease control. Existing deep learning approaches, however, often rely solely on visual features, lacking integration of clinical semantics and struggling with limited annotated data. To overcome these limitations, we propose LLM2image, a multimodal framework that enhances pathological image classification by integrating visual features with semantically rich textual descriptions generated by large language models (LLMs). The framework comprises: (1) a pixel-level MLP encoder for image representation, (2) GPT-4.0 for generating class-specific pathological descriptions, and (3) a cross-modal fusion transformer that aligns visual and textual features via multi-head attention. Evaluated on an in-house dataset of 516 porcine intestinal images across four categories healthy, ETEC, PDCoV, and PoRVour method achieved an accuracy of 89.51%, surpassing state-of-the-art models and matching the diagnostic performance of veterinarians with 10 years of experience. External validation on the public PAIP2020 colorectal cancer dataset further confirmed its generalizability, with 87.42% accuracy. Ablation studies and visual attention analysis demonstrate that the inclusion of LLM-generated text significantly improves both classification accuracy and interpretability. The model has been deployed as a lightweight Android application, supporting offline rapid diagnosis, highlighting its potential for real-world veterinary and medical applications.

🏷️ 키워드 / MeSH 📖 같은 키워드 OA만

같은 제1저자의 인용 많은 논문 (5)

🏷️ 같은 키워드 · 무료전문 — 이 논문 MeSH/keyword 기반