Issue #38: Evaluating zero-shot prediction of monomeric protein design success by AlphaFold, ESMFold, and ProteinMPNN.
Protein Design Digest - 2026-02-02 - Evaluating zero-shot prediction of monomeric protein design success by AlphaFold, ESMFold, and ProteinMPNN.

Building something in Protein Design?
I love collaborating on new challenges. Let's build together.
Subscribe to Protein Design Digest
Daily curated signals from arXiv, PubMed, and BioRxiv.
Signal of the Day
Evaluating zero-shot prediction of monomeric protein design success by AlphaFold, ESMFold, and ProteinMPNN.
De novo protein design has enabled the creation of proteins with diverse functionalities that are not found in nature. Despite recent advances, experimental success rates remain inconsistent and context-dependent, posing a bottleneck for broader applications of de novo design. To overcome this, structure and sequence prediction models have been applied to assess design quality prior to experimental testing to save time and resources. In this study, we examined the extent to which AlphaFold, Protein MPNN, and ESMFold can discriminate between experimentally successful and unsuccessful designs. We first curated a benchmark dataset of 614 experimentally characterized de novo designed monomers from 11 different design studies between 2012 and 2021. All predictive models demonstrated moderate ability to discriminate experimental successes (expressed, soluble, monomeric, and fold with the correct secondary structure) from failures. Still, many failed designs have better confidence metrics than successful designs, and confidence metrics were topology-dependent. Among all computational models evaluated, ESMFold average predicted local-distance difference test (pLDDT) yielded the best individual performance at distinguishing between successful and unsuccessful designs. A logistic regression model combining all confidence metrics provided only modest improvement over ESMFold pLDDT alone. Overall, these results show that these models can serve as an initial filtering strategy prior to experimental validation; however, their utility at accurately predicting experimentally successful designs remains limited without task-specific training.
Why this matters: Critical for improving fold accuracy and reducing structural uncertainty in de novo design.
Also Worth Reading
Comparison of In Vitro Multiple Physiological Activities of Cys-Tyr-Gly-Ser-Arg (CYGSR) Linear and Cyclic Peptides and Analysis Based on Molecular Docking.
Peptide cyclization is a strategy to improve biological stability and functional activity, but direct comparison between linear and cyclic peptides with the same sequence is still limited. In this study, linear (L-CR5) and cyclic (C-CR5) forms were synthesized, and biological functions such as antioxidant, whitening, and anti-wrinkle activity were compared and evaluated. C-CR5 showed about 22.3 times of DPPH radical scavenging activity, which was significantly stronger than L-CR5, and tyrosinase inhibition increased rapidly in C-CR5 to reach inhibition of 95% or more, whereas L-CR5 showed only moderate activity in the same range (about 6.5 times). MMP-1 expression in the evaluation of anti-wrinkle activity did not show a decreasing trend in L-CR5 at all, while C-CR5 showed an anti-wrinkle effect, which was reduced by about 92.8% at 400 μg/mL. As a result of molecular docking analysis, C-CR5 exhibited lower MolDock scores than L-CR5 toward both tyrosinase and MMP-1, indicating a potentially higher binding affinity and improved binding stability. This is expected to be due to reduced structural flexibility and optimized residue directions (especially Tyr and Arg). These results indicate that peptide cyclization is an example of enhanced functional bioactivity of CYGSR and provides a positive case for the structure-activity relationship.
Decrypting potential mechanisms linking ochratoxin A to hepatocellular carcinoma: an integrated approach combining toxicology, machine learning, molecular docking, and molecular dynamics simulation.
Background Ochratoxin A (OTA), a common food-borne mycotoxin, is a potential human carcinogen, yet the specific molecular mechanisms linking it to hepatocellular carcinoma (HCC) remain unclear. Methods We integrated network toxicology to predict OTA targets and intersected them with HCC transcriptomic data to identify key candidate genes. Functional enrichment analysis was then conducted. Multiple machine learning algorithms were applied to screen and validate core genes. Furthermore, molecular docking and molecular dynamics (MD) simulations were employed to evaluate the binding stability between OTA and key target proteins. Results A total of 50 key genes were identified as potential targets for potential OTA-associated hepatocarcinogenesis. Enrichment analysis revealed their significant involvement in critical processes such as xenobiotic metabolism and oxidative stress response. Machine learning analysis prioritized eight core genes (AURKA, GABARAPL1, CA2, PARP1, LMNA, SLC27A5, EPHX2, and GSTP1), and a combined diagnostic model demonstrated outstanding performance (AUC = 0.986). Structural analyses via molecular docking and MD simulations confirmed stable binding interactions between OTA and these core targets. Conclusions This integrated computational study identifies a set of candidate genes through which OTA may potentially interact with HCC-associated molecular networks. The robust binding predicted between OTA and the core targets provides a structural basis for these interactions. These findings offer a prioritized list of targets and a theoretical framework for subsequent experimental validation and investigation into OTA’s toxicological role in HCC.
Study on the Mechanism of Ku Diding in the Treatment of Diabetes based on Network Pharmacology, Molecular Docking Technology, and Molecular Dynamics.
Introduction To explore how Ku Diding (KDD) works in managing Diabetes Mellitus (DM), researchers utilized network pharmacology, molecular docking, and molecular dynamics methodologies. Methods Key active components of KDD were identified using the Traditional Chinese Medicine Systematic Pharmacology Database and Analysis Platform (TCMSP). Data for diabetesrelated targets were retrieved from the Human Genetic Comprehensive Databases (Genecards) and the Online Mendelian Inheritance in Man (OMIM) database. The intersection of these targets was analyzed to determine potential therapeutic targets for diabetes treatment. Proteinprotein interaction networks (PPI) were constructed using the STRING database and Cytoscape software, followed by Gene Ontology (GO) enrichment and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analysis. Molecular docking between the components and key targets was performed using the AutoDock Vina platform. Results This study identified that Dihydrosanguinarine, (S)-Scoulerine, among others, are the main active ingredients of KDD for treating DM, showing high affinity for critical targets like PTGS2 and PRKACA, through multiple pathways including vascular regulation, neuromodulation, metabolic regulation, and endocrine regulation. The molecular docking results showed that there are interactions between the active ingredients and the key targets, with the majority of the effective components exhibiting a stronger binding affinity than Metformin. Among them, (S)-Scoulerine and Dihydrosanguinarine demonstrated high docking affinity with the key target proteins PTGS2 and PRKACA. Discussion DM is closely linked to oxidative stress, chronic inflammation, and insulin signaling dysregulation. This study reveals that KDD exerts anti-diabetic effects via a multi-target network involving proteins such as PRKACA, PTGS2, ESR1, FOS, and DRD2. These targets are associated with glucose metabolism, inflammation, oxidative stress, and neural regulation. Modulation of these pathways likely enhances insulin sensitivity, lowers blood glucose, suppresses inflammation, and protects against oxidative damage. GO and KEGG analyses further indicate involvement in MAPK signaling, synaptic transmission, and vascular regulation, forming a multidimensional “metabolism-inflammation-neural” regulatory network. Compared to Metformin, most KDD-derived compounds showed stronger binding, highlighting their therapeutic potential. Molecular dynamics simulations support the stability of the observed binding conformations, suggesting their potential as therapeutic targets. These findings underscore KDD’s ability to simultaneously target multiple pathological mechanisms, offering a holistic treatment strategy for DM. Conclusion This study provides preliminary evidence that KDD is characterized by a multicomponent, multi-target, and multi-pathway approach in the treatment of diabetes mellitus (DM), thereby establishing a scientific foundation for further in-depth exploration of KDD’s molecular mechanisms.
Research & AI Updates
- $2M Grant Backs AI Initiative to Protect Crops from Disease - Seed World — $2M Grant Backs AI Initiative to Protect Crops from Disease Seed World.
- AI Tools Are Finally Decoding Proteins That Defied Scientists for Decades - BioTecNika — AI Tools Are Finally Decoding Proteins That Defied Scientists for Decades BioTecNika.
- Nobel laureates in Dubai: AI can speed up science, but can’t replace human discovery - Gulf News — Nobel laureates in Dubai: AI can speed up science, but can’t replace human discovery Gulf News.
From the Industry
- AbCellera Advances ABCL635 Menopause Candidate As Shares Trade Below Targets - simplywall.st — AbCellera Advances ABCL635 Menopause Candidate As Shares Trade Below Targets simplywall.st.
- J&K Ingredients, Pallas Biotech partnership spurs clean-label innovation - Commercial Baking — J&K Ingredients, Pallas Biotech partnership spurs clean-label innovation Commercial Baking.
- Biologics Contract Research Organizations Market Trends Analysis and Forecast Report 2021-2025 & 2025-2033 - GlobeNewswire — Biologics Contract Research Organizations Market Trends Analysis and Forecast Report 2021-2025 & 2025-2033 GlobeNewswire.
- Summit Therapeutics Announces U.S. FDA Acceptance of Biologics License Application (BLA) Seeking Approval for Ivonescimab in Combination with Chemotherapy in Treatment of Patients with EGFRm NSCLC Post-TKI Therapy - Business Wire — Summit Therapeutics Announces U.S.
- Repertoire adds Lilly to range of Big Pharma partners in autoimmune deal worth up to $1.9B - Fierce Biotech — Repertoire adds Lilly to range of Big Pharma partners in autoimmune deal worth up to $1.9B Fierce Biotech.
- Layoff Tracker: Gilead Continues California Cuts With 34 Laid Off - BioSpace — Layoff Tracker: Gilead Continues California Cuts With 34 Laid Off BioSpace.
- Fierce Biotech Fundraising Tracker ‘26: Breakthru emerges with $60M; Tenpoint raises $85M - Fierce Biotech — Fierce Biotech Fundraising Tracker ‘26: Breakthru emerges with $60M; Tenpoint raises $85M Fierce Biotech.
Quick Reads
Vision-Language Controlled Deep Unfolding for Joint Medical Image Restoration and Segmentation
We propose VL-DUN, a principled framework for joint All-in-One Medical Image Restoration and Segmentation (AiOMIRS) that bridges the gap between low-level signal recovery and high-level semantic understanding. Read more →
Computational Screening, ADME Study, and Evaluation of Benzothiazole Derivatives as Potential Anticancer Agents.
Benzothiazole derivatives have garnered considerable interest in medicinal chemistry due to their diverse biological activities, including anticancer potential. Read more →
Novel and sustainable microfabricated Cu ion selective sensor doped with ionophore and supported with docking study for determination of vonoprazan fumarate in tablet dosage form.
In the field of drug analysis, there is a growing emphasis on developing techniques that are environmentally friendly, cost-effective, and efficient. Read more →
ME-PFP: An Ensemble Learning Approach Fusing Multi-Source Features for Protein Function Prediction.
Proteins, as essential components of living organisms, play a critical role in both drug discovery and disease mechanism research. Read more →
Impact of clove oil on behavioral and biochemical parameters in restrained rats by using docking and experimental approaches.
Background Acute restraint stress activates the (HPA) axis, elevating corticosterone and influencing cognitive function. Read more →
Enhancing Drug Repurposing with Consensus Docking: Discovery of Novel Discoidin Domain Receptor 1 Inhibitors.
Developing novel drugs is a long and difficult process, particularly in oncology, where high attrition rates make clinical trials costly and time-consuming. Read more →
Regarding “The molecular mechanisms through which psilocybin prevents suicide: evidence from network pharmacology and molecular docking analyses”.
Enzymatic preparation of epigallocatechin gallate diglucoside and bioactivity assessment.
Epigallocatechin gallate (EGCG), the predominant catechin in green tea, has limited application due to its poor water solubility and instability. Read more →
Pipeline Tip
Employ HADDOCK for ambiguous restraints in protein-protein docking.
Resources & Tools
- Dataset: AlphaFold Structure Database - 200M+ predicted structures from DeepMind/EMBL-EBI.
- Dataset: Uniprot Knowledgebase - The world’s most comprehensive resource for protein sequence and annotation.
- Tool: MultiFOLD/IntFOLD - High-performance protein structure prediction and quality assessment server. View all tools →
- Tool: PyMOL - Gold standard for molecular visualization and publication-quality imaging. View all tools →
- Event: Structural Biology Events (Open)
- Event: Protein Design Hub (LinkedIn Group) (Ongoing)
- Job: Art Director - Workable at Workable
- Job: Scientist - Omics Sampling & Perturbation - Workable at Workable
Deep learning is not a magic wand, but a powerful lens for structural biology. — Recep Adiyaman