DeepPSA: A Geometric Deep Learning Model for PROTAC Synthetic Accessibility Prediction

Saved in:
Bibliographic Details
Title: DeepPSA: A Geometric Deep Learning Model for PROTAC Synthetic Accessibility Prediction
Authors: Ran Zhang, Shihang Wang, Lin Wang, Siyuan Tian, Yilin Tang, Fang Bai
Publication Year: 2025
Subject Terms: Biochemistry, Genetics, Molecular Biology, Biotechnology, Biological Sciences not elsewhere classified, Information Systems not elsewhere classified, target proteins via, molecules remain underdeveloped, https :// bailab, generative artificial intelligence, garnered significant attention, demonstrate superior performance, based partitioned datasets, based model built, https :// github, exceptional generalization ability, drug design due, experimental synthetic data, protac synthetic accessibility, synthetic accessibility, github repository, web server, test set, targeting chimeras, protac synthesis, prediction accuracy, novel compounds, house dataset, essential tool, driven approach
Description: Proteolysis-targeting chimeras (PROTACs) have garnered significant attention in drug design due to their ability to induce the degradation of the target proteins via the ubiquitin–proteasome system. However, the synthesis of PROTACs remains a challenging process, requiring the consideration of factors such as chemical complexity and accessibility. With the rise of generative artificial intelligence, several PROTAC generation models have been introduced, but tools to evaluate the synthetic accessibility of these molecules remain underdeveloped. To address this gap, we propose a deep learning-based computational model named DeepPSA (Deep learning-based PROTAC Synthetic Accessibility) designed to predict the synthetic accessibility of PROTACs. DeepPSA offers a systematic, data-driven approach to assess the feasibility of PROTAC synthesis, providing an essential tool for the design and screening of novel compounds. DeepPSA is a graph-based model built on a graph neural network architecture, trained on an in-house dataset of 3644 PROTACs with experimental synthetic data. As the first model specifically focused on PROTAC synthetic accessibility, DeepPSA demonstrates impressive performance, achieving 92.9% prediction accuracy and an Area Under the Receiver Operating Characteristic curve (AUROC) of 0.963 on the test set, indicating its ability to capture key structural characteristics of PROTACs. Moreover, DeepPSA continues to demonstrate superior performance on the structure-based partitioned datasets, further validating its exceptional generalization ability and robustness. DeepPSA is available online at a web server (https://bailab.siais.shanghaitech.edu.cn/psa) and at GitHub repository (https://github.com/Zhang-Ran-0119/DeepPSA).
Document Type: article in journal/newspaper
Language: unknown
Relation: https://figshare.com/articles/journal_contribution/DeepPSA_A_Geometric_Deep_Learning_Model_for_PROTAC_Synthetic_Accessibility_Prediction/29403711
DOI: 10.1021/acs.jcim.5c00366.s001
Availability: https://doi.org/10.1021/acs.jcim.5c00366.s001
https://figshare.com/articles/journal_contribution/DeepPSA_A_Geometric_Deep_Learning_Model_for_PROTAC_Synthetic_Accessibility_Prediction/29403711
Rights: CC BY-NC 4.0
Accession Number: edsbas.B063B86E
Database: BASE
Be the first to leave a comment!
You must be logged in first