Evaluating an information theoretic approach for selecting multimodal data fusion methods.
1/5 보강
PICO 자동 추출 (휴리스틱, conf 2/4)
유사 논문P · Population 대상 환자/모집단
환자: non-small cell lung cancer, prostate cancer, and glioblastoma
I · Intervention 중재 / 시술
추출되지 않음
C · Comparison 대조 / 비교
추출되지 않음
O · Outcome 결과 / 결론
We found that, while PID metrics are informative, solely relying on these metrics to decide on a fusion approach does not always yield a machine learning model with optimal performance.
[OBJECTIVE] Interest has grown in combining radiology, pathology, genomic, and clinical data to improve the accuracy of diagnostic and prognostic predictions toward precision health.
APA
Zhang T, Ding R, et al. (2025). Evaluating an information theoretic approach for selecting multimodal data fusion methods.. Journal of biomedical informatics, 167, 104833. https://doi.org/10.1016/j.jbi.2025.104833
MLA
Zhang T, et al.. "Evaluating an information theoretic approach for selecting multimodal data fusion methods.." Journal of biomedical informatics, vol. 167, 2025, pp. 104833.
PMID
40354908
Abstract
[OBJECTIVE] Interest has grown in combining radiology, pathology, genomic, and clinical data to improve the accuracy of diagnostic and prognostic predictions toward precision health. However, most existing works choose their datasets and modeling approaches empirically and in an ad hoc manner. A prior study proposed four partial information decomposition (PID)-based metrics to provide a theoretical understanding of multimodal data interactions: redundancy, uniqueness of each modality, and synergy. However, these metrics have only been evaluated in a limited collection of biomedical data, and the existing work does not elucidate the effect of parameter selection when calculating the PID metrics. In this work, we evaluate PID metrics on a wider range of biomedical data, including clinical, radiology, pathology, and genomic data, and propose potential improvements to the PID metrics.
[METHODS] We apply the PID metrics to seven different modality pairs across four distinct cohorts (datasets). We compare and interpret trends in the resulting PID metrics and downstream model performance in these multimodal cohorts. The downstream tasks being evaluated include predicting the prognosis (either overall survival or recurrence) of patients with non-small cell lung cancer, prostate cancer, and glioblastoma.
[RESULTS] We found that, while PID metrics are informative, solely relying on these metrics to decide on a fusion approach does not always yield a machine learning model with optimal performance. Of the seven different modality pairs, three had poor (0%), three had moderate (66%-89%), and only one had perfect (100%) consistency between the PID values and model performance. We propose two improvements to the PID metrics (determining the optimal parameters and uncertainty estimation) and identified areas where PID metrics could be further improved.
[CONCLUSION] The current PID metrics are not accurate enough for estimating the multimodal data interactions and need to be improved before they can serve as a reliable tool. We propose improvements and provide suggestions for future work. Code: https://github.com/zhtyolivia/pid-multimodal.
[METHODS] We apply the PID metrics to seven different modality pairs across four distinct cohorts (datasets). We compare and interpret trends in the resulting PID metrics and downstream model performance in these multimodal cohorts. The downstream tasks being evaluated include predicting the prognosis (either overall survival or recurrence) of patients with non-small cell lung cancer, prostate cancer, and glioblastoma.
[RESULTS] We found that, while PID metrics are informative, solely relying on these metrics to decide on a fusion approach does not always yield a machine learning model with optimal performance. Of the seven different modality pairs, three had poor (0%), three had moderate (66%-89%), and only one had perfect (100%) consistency between the PID values and model performance. We propose two improvements to the PID metrics (determining the optimal parameters and uncertainty estimation) and identified areas where PID metrics could be further improved.
[CONCLUSION] The current PID metrics are not accurate enough for estimating the multimodal data interactions and need to be improved before they can serve as a reliable tool. We propose improvements and provide suggestions for future work. Code: https://github.com/zhtyolivia/pid-multimodal.
🏷️ 키워드 / MeSH
같은 제1저자의 인용 많은 논문 (5)
- Metabolomic profiling of genotype-derived ABO blood group, secretor status and Lewis antigens and association with pancreatic ductal adenocarcinoma risk.
- The molecular mechanisms and potential therapeutic implications of the crosstalk between DNA methylation and metabolic reprogramming in thyroid cancer.
- Pathology of Cutaneous T Cell Lymphoma: A Narrative Review.
- Dropping out or Moving on? A Systematic Review With Meta-Aggregation of the Experience of Prehabilitation Among Patients With Cancer.
- NR4A1 mediates chemotherapy‑induced senescence via the PI3K/AKT pathway in gastric cancer cells.