recep.adiyaman
Daily Signal May 22, 2026 · 11 min read

Issue #114: Benchmarking generative scaffold design methods for peptide engineering in TCR-MHC complexes

Protein Design Daily

Daily curated signals from arXiv, PubMed, and bioRxiv.

Signal of the Day

Benchmarking generative scaffold design methods for peptide engineering in TCR-MHC complexes

De novo peptide design at T cell receptor-peptide-major histocompatibility complex (TCR-pMHC) interfaces is a central challenge in computational immunology, with direct implications for vaccine development, cancer immunotherapy, and autoimmune disease. Despite rapid advances in generative protein modeling, there is currently no systematic benchmark evaluating these methods in the highly constrained and immunologically relevant setting of peptide-MHC presentation and TCR recognition. Here, we present two complementary contributions. First, we introduce a multi-stage computational pipeline for peptide design in predefined TCR-pMHC contexts, integrating generative modeling with sequence optimization and structure-based filtering. Second, we establish a benchmark for evaluating generative peptide design methods in TCR-pMHC complexes. Using a curated dataset of high-quality crystal structures deposited after the AlphaFold3 training cutoff, we assess state-of-the-art generative approaches for peptide backbone generation, sequence design, and the enrichment of near-native solutions. We explicitly examine whether different backbone generation strategies respect the geometric constraints of the MHC binding groove and recover native-like peptide conformations. Our results reveal substantial method-dependent differences: some generative strategies fail systematically in the groove-bound peptide setting, whereas others generate physically plausible backbones with varying accuracy and conformational diversity. We further show that enforcing anchor constraints strongly influences peptide conformations at non-anchor positions, highlighting a trade-off between structural accuracy and conformational sampling. 
To enable fair and reproducible comparison, we introduce a standardized, multi-stage scoring protocol that integrates MHC binding prediction, physics-based energy evaluation, and independent structure prediction confidence metrics to enrich near-native designs from large candidate pools. Together, this work establishes the first comprehensive pipeline and benchmark for generative peptide design at TCR-pMHC interfaces and provides practical guidelines for developing peptide design workflows and evaluating generative models in immunologically constrained protein design settings.
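The multi-stage scoring idea described above can be sketched as a simple filter-then-rank funnel. This is a minimal illustration, not the authors' protocol: the `Candidate` fields, the score conventions (lower rank and lower energy are better, higher confidence is better), and every threshold are hypothetical placeholders, and in practice each score would come from a separate predictor.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Candidate:
    """One designed peptide with three precomputed scores (all hypothetical)."""
    sequence: str
    mhc_rank: float     # predicted MHC binding percentile rank, lower is better
    energy: float       # physics-based energy, arbitrary units, lower is better
    confidence: float   # structure-prediction confidence in [0, 1], higher is better


def enrich(candidates: List[Candidate],
           max_rank: float = 2.0,
           energy_keep_fraction: float = 0.5,
           top_n: int = 2) -> List[Candidate]:
    """Three-stage funnel: binding filter, energy cut, confidence ranking."""
    # Stage 1: drop peptides that are not predicted to bind the MHC.
    binders = [c for c in candidates if c.mhc_rank <= max_rank]
    if not binders:
        return []

    # Stage 2: keep the lowest-energy fraction of the surviving pool.
    by_energy = sorted(binders, key=lambda c: c.energy)
    keep = max(1, int(len(by_energy) * energy_keep_fraction))
    low_energy = by_energy[:keep]

    # Stage 3: rank what is left by independent structure confidence.
    ranked = sorted(low_energy, key=lambda c: c.confidence, reverse=True)
    return ranked[:top_n]
```

Applying the cheapest filter first keeps the more expensive scoring stages restricted to a smaller pool, which is one common reason to order a funnel this way.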

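Why this matters: Generative peptide design for TCR-pMHC interfaces has lacked a systematic benchmark. This one tests methods on crystal structures deposited after the AlphaFold3 training cutoff and shows that some backbone generators fail systematically in the MHC groove, which is useful to know before building them into vaccine or immunotherapy design workflows.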


Also Worth Reading

Exploring the mechanism of saffron in treating viral myocarditis using network pharmacology and molecular docking.

Viral myocarditis (VM) is a cardiovascular disorder that can lead to heart failure and cardiogenic shock. Saffron, a traditional Chinese medicinal herb, has shown therapeutic potential against VM in numerous studies. However, the mechanisms through which saffron exerts its effects on VM remain poorly understood. Thus, this study aimed to elucidate the active compounds, molecular targets, and signaling pathways involved in saffron’s therapeutic action against VM by employing network pharmacology and molecular docking approaches. The active compounds and corresponding targets of saffron were retrieved from the Traditional Chinese Medicine Systems Pharmacology database. VM-associated targets were sourced from the GeneCards database. Overlapping targets between saffron and VM were then identified. Protein-protein interaction networks were established and analyzed utilizing the STRING platform and Cytoscape software to determine core targets. Furthermore, Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses were carried out utilizing Bioconductor in R to explore the potential biological activities and signaling pathways through which saffron may act against VM. Finally, molecular docking and model visualization were carried out using AutoDock Tools and PyMOL open-source software. From the database, we identified 4 active compounds in saffron with potential effects against VM: crocetin, isorhamnetin, kaempferol, and quercetin. A total of 60 corresponding targets were observed, with TNF, IL-6, IL-1β, CXCL8, and JUN emerging as core targets. KEGG enrichment analysis revealed 155 regulatory signaling pathways, among which the TNF, AGE-RAGE, and IL-17 signaling pathways, lipid metabolism, and atherosclerosis were the most prominent. Molecular docking results indicated that quercetin showed the strongest binding affinity toward IL-1β and CXCL8. 
The therapeutic effect of saffron against VM is not driven by a single factor, but rather involves multiple active compounds, targets, and signaling pathways.

Unraveling the anti-neuroinflammatory mechanisms of Cervus cucumis polypeptide injection in Alzheimer’s disease: insights from network pharmacology, molecular docking, molecular dynamics simulation, and experimental validation.

Objective: Alzheimer’s disease (AD) is a progressive neurodegenerative disorder with increasing global prevalence, in which neuroinflammation serves as a critical pathological driver exacerbating cognitive decline. While current therapies offer limited symptomatic relief, multi-target strategies are urgently needed. Cervus cucumis polypeptide injection (CCPI), a traditional Chinese medicine (TCM) formulation, has demonstrated anti-inflammatory properties; however, its mechanisms of action against AD remain unclear. This study aimed to elucidate the anti-AD potential mechanisms of CCPI using an integrated approach combining network pharmacology, molecular docking, molecular dynamics (MD) simulation, and experimental validation. Methods: Active components and corresponding targets of CCPI were retrieved from the TCMSP database, while AD-related targets were collected from GeneCards, OMIM, and DrugBank. Potential therapeutic targets were identified by intersecting drug and disease targets, followed by protein-protein interaction (PPI) network construction, Gene Ontology (GO), and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses. Molecular docking and MD simulations were performed to evaluate interactions between potential active components and key targets. In vitro experiments were conducted on Aβ25-35-induced BV2 microglial cells to assess cell viability (CCK-8 assay), inflammatory cytokine levels (ELISA), and protein expression (Western blot) related to the neuroinflammation pathway and microglial polarization. Results: A total of 28 active components and 50 common targets of CCPI for AD treatment were identified. Linoleic acid (LA) was determined to be a potential active component, with IL-6 as the key target based on PPI network topology. Molecular docking and MD simulation confirmed a stable binding affinity between LA and IL-6. 
KEGG analysis revealed significant enrichment in the HIF-1 signaling pathway, particularly the IL-6/STAT3/VEGF signaling pathway. In vitro, CCPI treatment significantly enhanced cell viability and attenuated the pro-inflammatory response, as evidenced by reduced levels of IL-6, IL-1β, and TNF-α and decreased expression of the pro-inflammatory marker iNOS. Concurrently, it elevated the expression of the anti-inflammatory/repair-associated marker CD206. Western blot analysis further verified that CCPI suppressed IL-6/STAT3 activation while upregulating VEGF expression. Additionally, LA alone significantly reduced IL-6 levels and STAT3 phosphorylation, decreased the expression of iNOS, and increased the expression of CD206, with therapeutic efficacy comparable to CCPI. Conclusion: CCPI exerts neuroprotective effects in AD models by regulating the IL-6/STAT3/VEGF pathway, downregulating the expression of the inflammation-related iNOS protein, upregulating the expression of the CD206 protein associated with anti-inflammatory and reparative functions, remodeling the functional state of microglia, inhibiting their pro-inflammatory responses, and enhancing their reparative functions. Its potential active component, LA, likely mediates this effect by stably binding to and inhibiting IL-6, thus suppressing the downstream STAT3 phosphorylation that drives inflammatory activation.

A multimodal approach integrating spectroscopy, deep learning guided molecular docking, and molecular dynamics simulation for predictive assessment of pioglitazone to albumin binding for formulation development.

Binding affinity is a critical parameter that can influence the state of the drug in vivo and help to define the formulation strategy. The current study implements a multimodal approach to analyse the binding affinity between human serum albumin (HSA) and pioglitazone. Ultraviolet (UV) absorbance and fluorescence spectrometry analyses were performed on different combinations of HSA and pioglitazone complexes, and the absorbance and fluorescence intensities were mapped to calculate the binding constant. DynamicBind, a distinct deep-learning artificial intelligence tool, was implemented to perform in silico docking studies using a non-conventional approach. Furthermore, molecular dynamics simulation was also performed to generate root mean square deviation, radius of gyration, and root mean square fluctuation values, followed by principal component analysis, probability distribution function, and free energy landscape analysis. The simulation output was analysed to interpret the binding affinity and associated conformation of the protein-active pharmaceutical ingredient (API) complex. The binding constant calculated through UV analysis was 1.1 × 10⁴ M⁻¹. Fluorescence spectroscopic analysis derived a value of 1.7 × 10⁵ M⁻¹. At the same time, DynamicBind predicted the cLDDT score for the top predicted model to be 0.634, and a binding affinity value of greater than 5, indicating a relatively moderate binding between pioglitazone and HSA. The results from molecular dynamics simulations further complemented our earlier observations, indicating non-covalent binding interactions and a stable protein-API complex, which is desirable for developing a formulation using HSA as a carrier polymer. 
This orthogonal approach also provided critical information on the fate of the API and possible considerations that needed to be made during the design of the formulation process, highlighting the need for similar approaches that could provide multifaceted advantages and help in optimising R&D costs and timelines.
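To put the two reported binding constants on a common scale, they can be converted to a standard binding free energy with ΔG° = -RT ln K. The sketch below is only a worked conversion: the temperature of 298.15 K is an assumption (the abstract does not state one), and the function name is illustrative. Under that assumption the UV value gives roughly -23 kJ/mol and the fluorescence value roughly -30 kJ/mol.

```python
import math

GAS_CONSTANT = 8.314462618  # J / (mol K)


def binding_free_energy_kj(k_assoc_per_molar: float,
                           temperature_k: float = 298.15) -> float:
    """Standard binding free energy (kJ/mol) from an association constant in M^-1.

    Uses dG = -R * T * ln(K) with a 1 M standard state.
    The default temperature is an assumption, not a value from the study.
    """
    if k_assoc_per_molar <= 0:
        raise ValueError("association constant must be positive")
    return -GAS_CONSTANT * temperature_k * math.log(k_assoc_per_molar) / 1000.0


# Binding constants quoted in the abstract above.
dg_uv = binding_free_energy_kj(1.1e4)            # from UV absorbance
dg_fluorescence = binding_free_energy_kj(1.7e5)  # from fluorescence
```

The roughly 15-fold difference between the two constants corresponds to a little under 7 kJ/mol at this temperature.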


Research & AI Updates


Quick Reads

Studies of design, synthesis and biological properties, ADMET profiling, molecular docking, network pharmacology and molecular dynamics simulation of novel chalcone derivatives containing benzoyl-piperazin.

A close causal relationship exists between Alzheimer’s disease (AD) and inflammation, where chronic inflammation serves as a critical driver in AD pathogenesis.

Structure Based Identification of Some Potential Pan-Coronavirus Main Protease Inhibitors via Pharmacophore Modeling and Molecular Dynamics Simulation Within a One Health Framework

Background: The main protease (Mpro/3CLpro) of coronaviruses (CoVs) is an essential enzyme involved in viral replication and represents an attractive target for antiviral drug discovery.

EnzySeek: Efficient Exploration of Enzyme Reaction Pathways Using AI Agents

With the rapid development of Large Language Models (LLMs) and Agent technologies, AI can assist in solving a variety of real-world problems across multiple domains, such as autonomous driving, drug discovery, and materials design.

ESMDisPred: A Structure-Aware CNN-Transformer Architecture for Intrinsically Disordered Protein Prediction

Intrinsically disordered proteins (IDPs) lack stable three-dimensional structures, yet play vital roles in key biological processes, including signaling, transcription regulation, and molecular scaffolding.

Phytochemical profiling, metabolomics, and molecular docking studies of Atriplex halimus aerial parts revealing potential insecticidal activity against the malaria vector Anopheles pharoensis.

Atriplex halimus is traditionally used in folk medicine for the treatment of skin infections, inflammation, and parasitic disorders.

Integrating network pharmacology, molecular docking, machine learning, and experimental validation: puerarin improves sepsis-induced acute kidney injury via the Sirt1-Nrf2-HO-1 pathway.

Acute kidney injury (AKI) associated with sepsis has a high clinical mortality rate, and there is a lack of effective therapeutic targets; uncontrolled oxidative stress and inflammatory responses are central pathological mechanisms.

Comprehensive Multispectroscopic and Molecular Docking Investigation of the Anticancer Drug Idarubicin-Bovine Serum Albumin Interaction.

Understanding the binding mechanism of idarubicin (IDA), a chemotherapeutic agent used in the treatment of acute myeloid leukemia (AML), with bovine serum albumin (BSA) is essential for elucidating its toxicity and pharmacodynamic and pharmacokinetic behavior and for supporting the development of novel drugs.

Structure-Based Identification of Natural Product-Derived BCL-2 Inhibitors as Potential Therapeutics for Hepatocellular Carcinoma.

Introduction/objective: Hepatocellular carcinoma (HCC), the most common form of primary liver cancer, remains a major therapeutic challenge due to its poor prognosis and resistance to conventional therapies.

Pipeline Tip

Check for missing residues in PDB files using PDBFixer before simulation.
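As a quick pre-check before reaching for a dedicated tool, gaps in residue numbering can be flagged with the standard library alone. The sketch below is not PDBFixer's API (PDBFixer has its own `findMissingResidues()` method, which works from the SEQRES records). It is a dependency-free heuristic with a hypothetical function name that reads the fixed-column ATOM records and reports jumps in residue number per chain. Numbering jumps are only a hint: insertion codes and deliberately non-sequential numbering can produce false positives.

```python
from collections import defaultdict
from typing import Dict, List, Tuple


def find_numbering_gaps(pdb_text: str) -> Dict[str, List[Tuple[int, int]]]:
    """Return {chain_id: [(last_present, next_present), ...]} for numbering jumps.

    Reads only ATOM records, using the fixed PDB columns:
    chain ID in column 22 and residue number in columns 23-26.
    """
    residues = defaultdict(list)
    for line in pdb_text.splitlines():
        if not line.startswith("ATOM") or len(line) < 26:
            continue
        chain = line[21]
        try:
            res_seq = int(line[22:26])
        except ValueError:
            continue  # malformed residue number, skip the record
        # Record each residue number once, in file order.
        if not residues[chain] or residues[chain][-1] != res_seq:
            residues[chain].append(res_seq)

    gaps = {}
    for chain, numbers in residues.items():
        chain_gaps = [(a, b) for a, b in zip(numbers, numbers[1:]) if b - a > 1]
        if chain_gaps:
            gaps[chain] = chain_gaps
    return gaps
```

A result such as `{"A": [(2, 5)]}` would mean chain A jumps from residue 2 to residue 5, so residues 3 and 4 deserve a closer look before simulation.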


Resources & Tools

The protein structure is the language of life; design is its poetry. — Recep Adiyaman
