Universal Baseline for Selection of Genetically Encoded Libraries.
Genetically encoded (GE) libraries enable identification of high-affinity ligands for diverse molecular targets through iterative selection and DNA sequencing or next-generation sequencing (NGS).
APA
Yan K, Lima GM, et al. (2026). Universal Baseline for Selection of Genetically Encoded Libraries.. bioRxiv : the preprint server for biology. https://doi.org/10.64898/2026.02.14.705946
MLA
Yan K, et al.. "Universal Baseline for Selection of Genetically Encoded Libraries.." bioRxiv : the preprint server for biology, 2026.
PMID
41727091
Abstract
Genetically encoded (GE) libraries enable identification of high-affinity ligands for diverse molecular targets through iterative selection and DNA sequencing or next-generation sequencing (NGS). Despite their impact in therapeutic development, a systematic framework for evaluating reproducibility in GE-molecular discoveries remains limited. To aid such analysis, we introduce the concept of baseline response, which reproducibly partitions active and inactive members of selection. The baseline response is provided by spiking a random DNA-barcoded population. We calibrated the baseline concept using Bioconductor EdgeR differential enrichment (DE) analysis of NGS of phage-displayed selection on oligosaccharide chitin and hepatitis virus NS3a* protease as model targets. We further show that mixing discovery campaigns also offers an effective baseline: chitin-enriched peptides serve as a baseline for DE-analysis of NS3a* selection and NS3a*-enriched peptides serve as a baseline for chitin binders. We applied baseline-stratified DE-analysis to 66 parallel selections performed in 3-5 replicates across 22 extracellular targets, including HER1-3, EpCAM, CAIX, PD-L1, and eight integrin receptors. Automated DE-analysis across hundreds of NGS files produced hits validated in a secondary screen and yielded synthetic macrocyclic ligands with mid-nanomolar affinity confirmed in 2-3 biophysical assays. For PD-L1, we further demonstrated how baseline-calibrated NGS data provide decision-enabling information for optimization of peptide macrocycles to yield potent single-digit nanomolar ligands for the cell-surface receptor. We anticipate that baseline-based analyses of NGS data from selection procedures will offer a scalable framework for reproducible hit discovery and standardized analysis across diverse selection campaigns.
같은 제1저자의 인용 많은 논문 (5)
- The Effect of Self-efficacy-Enhancing Interventions on Quality of Life of Cancer Survivors: A Systematic Review.
- Clinical Accuracy and Postoperative Stability of 3D-Printed Surgical Guides in Bimaxillary Orthognathic Surgery: A Prospective Evaluation.
- Retraction notice to "Novel biosynthesis of gold nanoparticles for multifunctional applications: Electrochemical detection of hydrazine and treatment of gastric cancer" [Environ. Res. 238 (2023) 117081].
- The Mediating Effects in the Relationship Among Symptoms, Self-Efficacy, and Quality of Life in Cancer Survivors: Evidence From a Scoping Review.
- Pathological Image Segmentation of Breast Cancer via Template Matching.