Issue #17: Discovery of PPARγ Partial Agonists for Treatment of Type 2 Diabetes Based on an Integrated Virtual Screening Strategy that Combines Fragment Molecular Orbital Calculations, Machine Learning, Molecular Docking, Interaction Fingerprint Filtering, and Molecular Dynamics Simulations.

Subscribe to Protein Design Digest

Daily curated signals from arXiv, PubMed, and BioRxiv.

Signal of the Day

Discovery of PPARγ Partial Agonists for Treatment of Type 2 Diabetes Based on an Integrated Virtual Screening Strategy that Combines Fragment Molecular Orbital Calculations, Machine Learning, Molecular Docking, Interaction Fingerprint Filtering, and Molecular Dynamics Simulations.

Peroxisome proliferator-activated receptor γ (PPARγ) is a key therapeutic target for type 2 diabetes and cardiovascular diseases due to its central role in regulating glucose and lipid metabolism. While full PPARγ agonists exhibit efficacy, they are linked to adverse effects; in contrast, PPARγ partial agonists retain metabolic regulatory functions with improved safety, representing promising candidates for type 2 diabetes treatment. However, their action mechanisms and structure-activity relationships remain unclear. Herein, we developed an integrated virtual screening strategy combining fragment molecular orbital (FMO) calculations, machine learning, molecular docking, interaction fingerprint (IFP) filtering, and molecular dynamics (MD) simulations to identify potential PPARγ partial agonists and elucidate their interaction mechanisms. FMO analysis first confirmed interaction differences between PPARγ agonist classes at the binding pocket, pinpointing critical residues (CYS285, ARG288, ILE341, and SER342) for partial agonist activity. Using three machine learning algorithms (random forest, extra trees, and XGBoost) with extended connectivity fingerprints (ECFP), we constructed QSAR classification models and screened 9630 compounds. SHAP analysis highlighted key fingerprint fragments (positions 45, 1034, and 1243) governing bioactivity. Molecular docking and IFP refinement yielded six high-potency candidates, whose binding stability and partial agonist properties were validated via MD simulations, MM/PBSA binding free energy calculations, hydrogen bond analysis, and FMO calculations. Notably, these candidates did not directly interact with the AF2 domain, consistent with the canonical partial agonist mode of action. This multidisciplinary approach provides a framework for rational design of novel PPARγ partial agonists, and the identified molecules serve as promising leads for type 2 diabetes therapeutics.

Why this matters: Expands the searchable sequence space for novel folds and high-affinity binders.

Also Worth Reading

AlphaFold for Docking Screens.

AlphaFold is an AI system developed by Google DeepMind to generate three-dimensional structures of proteins without experimental data. The models created with AlphaFold are available on the AlphaFold Protein Structure Database (AlphaFoldDB) ( https://alphafold.ebi.ac.uk/ ). The AlphaFold database is searchable by sequence and protein identification. This chapter focuses on an AlphaFold model and its use for docking screens using Molegro Virtual Docker. We rely on Jupyter Notebooks to integrate docking simulations and build regression models based on the atomic coordinates of protein-pose complexes. Our study focuses on constructing a neural network regression model to predict the inhibition of cyclin-dependent kinase 19 (CDK19). This enzyme is a target for anticancer drugs and does not have experimental data for its atomic coordinates. We utilize the Molegro Data Modeller to construct a regression model based on docking results of inhibitors for which binding affinity data is available. All CDK19 datasets and Jupyter Notebooks discussed in this work are available at GitHub: https://github.com/azevedolab/docking#readme .

Geometric deep learning assists protein engineering. Opportunities and Challenges.

Protein engineering is experiencing a paradigmatic transformation through the integration of geometric deep learning (GDL) into computational design workflows. While traditional approaches such as rational design and directed evolution have achieved significant progress, they remain constrained by the vastness of sequence space and the cost of experimental validation. GDL overcomes these limitations by operating on non-Euclidean domains and by capturing the spatial, topological, and physicochemical features that govern protein function. This perspective provides a comprehensive and critical overview of GDL applications in stability prediction, functional annotation, molecular interaction modeling, and de novo protein design. It consolidates methodological principles, architectural diversity, and performance trends across representative studies, emphasizing how GDL enhances interpretability and generalization in protein science. Aimed at both computational method developers and experimental protein engineers, the review bridges algorithmic concepts with practical design considerations, offering guidance on data representation, model selection, and evaluation strategies. By integrating explainable artificial intelligence and structure-based validation within a unified conceptual framework, this work highlights how GDL can serve as a foundation for transparent, interpretable, and autonomous protein design. As GDL converges with generative modeling, molecular simulation, and high-throughput experimentation, it is poised to become a cornerstone technology for next-generation protein engineering and synthetic biology.

Modeling Protein-Protein Complexes by Combining pyDock and AlphaFold.

The lack of experimental structures for the majority of protein-protein complexes has motivated the development of a variety of strategies for the structural modeling of protein complexes, such as computational docking, in active development for the last decades, and the more recent artificial intelligence (AI)-based ground-breaking methodologies. Among the existing computational docking methods, Python docking (pyDock) has shown competitive predictive rates and high robustness over the years. However, the field has dramatically changed with the appearance of artificial intelligence (AI)-based methods, like AlphaFold. While structure prediction of individual proteins is virtually solved by this program, the focus is now on how to improve the prediction of challenging cases like antibody-antigen complexes, multiprotein complexes, weak interactions, or highly flexible interacting proteins. Successful strategies are based on the generation of more diverse sets of models and the integration with other “classical” approaches that facilitate the identification of the correct models. Here, we will show in practical terms how to combine the structural modeling capabilities of AlphaFold with the energy-based scoring function in pyDock to improve structural predictions in challenging protein-protein complexes.

Research & AI Updates

The Genesis Mission: How AI is Revolutionizing Fundamental Sciences Through DOE Partnerships - QUASA Connect — The Genesis Mission: How AI is Revolutionizing Fundamental Sciences Through DOE Partnerships QUASA Connect.
Egli awarded the Richard Armstrong Professorship of Innovation in Biochemistry - Vanderbilt University — Egli awarded the Richard Armstrong Professorship of Innovation in Biochemistry Vanderbilt University.
Topos Bio Emerges from Stealth to Tackle Disordered Proteins with $10.5M and Frontier AI - HIT Consultant — Topos Bio Emerges from Stealth to Tackle Disordered Proteins with $10.5M and Frontier AI HIT Consultant.
Exclusive: An MIT trio built some of biotech’s favorite AI models. Now, they’re building a business - Endpoints News — Exclusive: An MIT trio built some of biotech’s favorite AI models.

From the Industry

Fierce Pharma Asia—China biotech deal spree rolls on; Shionogi buys Tanabe’s ALS business - Fierce Pharma — Fierce Pharma Asia—China biotech deal spree rolls on; Shionogi buys Tanabe’s ALS business Fierce Pharma.
Aktis raises $318M in 2026’s first biotech IPO - BioPharma Dive — Aktis raises $318M in 2026’s first biotech IPO BioPharma Dive.
Boltz takes off with $28M seed, partners with Pfizer on AI drug discovery - FirstWord — Boltz takes off with $28M seed, partners with Pfizer on AI drug discovery FirstWord.
Boltz Bags $28M Funding and Pfizer Partnership for Biomolecular AI Boost - TechRepublic — Boltz Bags $28M Funding and Pfizer Partnership for Biomolecular AI Boost TechRepublic.
Parabilis Medicines raises $305 million as CEO warms to an IPO - statnews.com — Parabilis Medicines raises $305 million as CEO warms to an IPO statnews.com.
Former Genentech leaders’ protein degrader startup nets $107M - Endpoints News — Former Genentech leaders’ protein degrader startup nets $107M Endpoints News.
‘Unprecedented’ collaboration news not what Auris ordered - The Pharma Letter — ‘Unprecedented’ collaboration news not what Auris ordered The Pharma Letter.

Quick Reads

Exploring the Anti-Inflammatory Molecular Mechanism of Gentiana szechenyii Kanitz. Based on UPLC-MS/MS Combined With Network Pharmacology, Molecular Docking, and Molecular Dynamics Simulation.

This study explored the anti-inflammatory mechanisms of Gentiana szechenyii Kanitz. Read more →

SMARTDock: A Toolkit for the Automated Development of Target-Specific Scoring Functions Using Bioactivity Data.

Molecular docking has become an essential tool in the early stages of structure-based drug discovery, enabling rapid virtual screening of large compound libraries against biological targets. Read more →

Establishing FDA-approved oncology drugs as GPR176 inhibitor through homology modelling, molecular docking, MMGBSA, DFT, and molecular dynamics simulation.

Molecular docking and dynamic simulation of escherichia coli K-12 Elements as a Biosensor for Detecting 2,4,6-Trinitrotoluene (TNT).

Trinitrotoluene (TNT) is widely used in military and industrial fields due to its strong explosive properties and chemical stability. Read more →

Assessing the validity of leucine zipper constructs predicted by AlphaFold.

AP-1 transcription factors are a network of cellular regulators that combine in different dimer pairs to control a range of pathways involved in differentiation, growth, and cell death. Read more →

A screening strategy for bioactive components from Amaranth: An integrated approach of network pharmacology, molecular docking and molecular dynamics simulation.

Amaranth is a traditional medicinal and forage plant with promising anti-inflammatory properties. Read more →

Network pharmacology and molecular docking reveal mechanisms of amiodarone-induced pulmonary fibrosis.

Pulmonary fibrosis is a common end-stage outcome of various chronic lung diseases, characterized by excessive extracellular matrix deposition, alveolar structural destruction, and progressive loss of pulmonary function. Read more →

In silico characterization and molecular docking of the MIOX gene in Nile tilapia (Oreochromis niloticus).

Myo-inositol oxygenase (MIOX) plays an essential role in metabolic pathways and cell processes, controls oxidative stress response mechanisms, and balances osmotic stress in aquatic organisms. Read more →

Pipeline Tip

Verify FASTA headers for special characters that break Rosetta pipelines.

Resources & Tools

Dataset: SCOPe - Curated structural classification of proteins for fold analysis.
Dataset: Pfam - Protein families database with curated multiple sequence alignments.
Tool: Chai-1 - Multi-modal foundation model for molecular structure prediction. View all tools →
Tool: Boltz-1 - Open-source biomolecular structure prediction model. View all tools →
Event: Structural Biology Events (Open)
Event: Protein Design Hub (LinkedIn Group) (Ongoing)
Job: Bioinformatics Associate II - Indeed at Indeed Jobs
Job: Senior Bioinformatics Software Engineer - Indeed at Indeed Jobs

Deep learning is not a magic wand, but a powerful lens for structural biology. — Recep Adiyaman

Building something in Protein Design?