ME-p: A Deep Learning Method with Multimodal Learning for Protein p Prediction.
1/5 보강
Proteins are crucial in biological processes and are important substances that mediate biochemical reactions, regulate cellular processes, and facilitate drug binding through their active sites and su
APA
Shi S, Miao R, et al. (2026). ME-p: A Deep Learning Method with Multimodal Learning for Protein p Prediction.. Journal of chemical theory and computation, 22(2), 1149-1163. https://doi.org/10.1021/acs.jctc.5c01747
MLA
Shi S, et al.. "ME-p: A Deep Learning Method with Multimodal Learning for Protein p Prediction.." Journal of chemical theory and computation, vol. 22, no. 2, 2026, pp. 1149-1163.
PMID
41528986
Abstract
Proteins are crucial in biological processes and are important substances that mediate biochemical reactions, regulate cellular processes, and facilitate drug binding through their active sites and surface residues. The p values of proteins determine the protonation state of ionizable amino acids under specific pH conditions, profoundly impacting protein structure, function, and drug design. However, experimental determination of p values is normally laborious and complex. Moreover, existing prediction methods are limited by the data quantity and quality, as well as their inability to address the intricate structural and physicochemical attributes of proteins, thereby hindering accuracy and generalization, especially in predicting p values for buried residues. In this study, we developed a multimodal protein p prediction model named ME-p (Multimodal ESM p), which leverages the multimodal information and employs a multifidelity learning strategy to speedily and accurately predict molecular p values. The ME-p method facilitates data augmentation by integrating the local environmental attributes of amino acids with the FASTA sequence characteristics of proteins. Furthermore, the incorporation of multifidelity learning has addressed the challenge of limited data availability to some extent. Our ME-p model outperforms several state-of-the-art models in predicting protein p values, achieving impressive results with a low RMSE of 0.845 ± 0.09 and MAE of 0.641 ± 0.07, a high of 0.921 ± 0.02, and R of 0.959 ± 0.01 on the PE-p data set. Notably, ME-p demonstrated balanced and robust performance across major ionizable residue types (ASP, GLU, HIS, LYS). It demonstrates superior accuracy in predicting p values for buried residues (RSA < 0.2), achieving the lowest MAE values of 0.921 ± 0.05 on the PE-p data set and 0.911 ± 0.06 on the Small Set data set, which collectively excel in capturing complex environmental influences on p. Moreover, our method confirmed pH-dependent binding of PD-L1 antibodies mediated by the protonation state of His-69 in PD-L1, emphasizing the critical role of amino acid protonation states in drug design. The source code of ME-p can be found at https://github.com/yzjyg215/ME-pKa.
MeSH Terms
Deep Learning; Proteins; Hydrogen-Ion Concentration; Amino Acids
같은 제1저자의 인용 많은 논문 (5)
- Effect of botulinum toxin type A on flap surgery in animal models: a systematic review and meta-analysis.
- Effects of Ropivacaine-Based Thoracic Paravertebral Block versus Erector Spinae Plane Block on Quality of Recovery in Laparoscopic Colorectal Cancer Surgery: A Randomized Controlled Trial.
- Latent Physical Activity Patterns and Their Related Factors in Breast Cancer Patients During Chemotherapy Intervals.
- Construction of a homologous targeting exosome-based drug delivery system for colorectal cancer and evaluation of its anti-tumor efficacy.
- Biowaste-Archetyped Hierarchical Calcium Carbonate Nanoreactors Induce Tumor Bioenergetic Crisis and Reverse Cisplatin Resistance via Mitochondrial Metabolic Reprogramming.