본문으로 건너뛰기
← 뒤로

Evaluating an information theoretic approach for selecting multimodal data fusion methods.

1/5 보강
Journal of biomedical informatics 2025 Vol.167() p. 104833
Retraction 확인
출처

PICO 자동 추출 (휴리스틱, conf 2/4)

유사 논문
P · Population 대상 환자/모집단
환자: non-small cell lung cancer, prostate cancer, and glioblastoma
I · Intervention 중재 / 시술
추출되지 않음
C · Comparison 대조 / 비교
추출되지 않음
O · Outcome 결과 / 결론
We found that, while PID metrics are informative, solely relying on these metrics to decide on a fusion approach does not always yield a machine learning model with optimal performance.

Zhang T, Ding R, Luong KD, Hsu W

📝 환자 설명용 한 줄

[OBJECTIVE] Interest has grown in combining radiology, pathology, genomic, and clinical data to improve the accuracy of diagnostic and prognostic predictions toward precision health.

이 논문을 인용하기

↓ .bib ↓ .ris
APA Zhang T, Ding R, et al. (2025). Evaluating an information theoretic approach for selecting multimodal data fusion methods.. Journal of biomedical informatics, 167, 104833. https://doi.org/10.1016/j.jbi.2025.104833
MLA Zhang T, et al.. "Evaluating an information theoretic approach for selecting multimodal data fusion methods.." Journal of biomedical informatics, vol. 167, 2025, pp. 104833.
PMID 40354908

Abstract

[OBJECTIVE] Interest has grown in combining radiology, pathology, genomic, and clinical data to improve the accuracy of diagnostic and prognostic predictions toward precision health. However, most existing works choose their datasets and modeling approaches empirically and in an ad hoc manner. A prior study proposed four partial information decomposition (PID)-based metrics to provide a theoretical understanding of multimodal data interactions: redundancy, uniqueness of each modality, and synergy. However, these metrics have only been evaluated in a limited collection of biomedical data, and the existing work does not elucidate the effect of parameter selection when calculating the PID metrics. In this work, we evaluate PID metrics on a wider range of biomedical data, including clinical, radiology, pathology, and genomic data, and propose potential improvements to the PID metrics.

[METHODS] We apply the PID metrics to seven different modality pairs across four distinct cohorts (datasets). We compare and interpret trends in the resulting PID metrics and downstream model performance in these multimodal cohorts. The downstream tasks being evaluated include predicting the prognosis (either overall survival or recurrence) of patients with non-small cell lung cancer, prostate cancer, and glioblastoma.

[RESULTS] We found that, while PID metrics are informative, solely relying on these metrics to decide on a fusion approach does not always yield a machine learning model with optimal performance. Of the seven different modality pairs, three had poor (0%), three had moderate (66%-89%), and only one had perfect (100%) consistency between the PID values and model performance. We propose two improvements to the PID metrics (determining the optimal parameters and uncertainty estimation) and identified areas where PID metrics could be further improved.

[CONCLUSION] The current PID metrics are not accurate enough for estimating the multimodal data interactions and need to be improved before they can serve as a reliable tool. We propose improvements and provide suggestions for future work. Code: https://github.com/zhtyolivia/pid-multimodal.

🏷️ 키워드 / MeSH

같은 제1저자의 인용 많은 논문 (5)