A machine learning framework for QSPR modeling of drug-like compounds using graph invariants.

Gespeichert in:
Bibliographische Detailangaben
Titel: A machine learning framework for QSPR modeling of drug-like compounds using graph invariants.
Autoren: Alzahrani, Ebraheem, Hanif, Muhammad Farhan
Quelle: AIMS Mathematics; 2025, Vol. 10 Issue 10, p1-40, 40p
Schlagwörter: DRUG discovery, MOLECULAR structure, MACHINE learning, COMPUTER simulation, CHEMICAL structure, PREDICTIVE validity, BIOACTIVE compounds, CHEMICAL properties
Abstract: Quantitative structure property relationship (QSPR) is a computational modeling approach that correlates the chemical structure of compounds with their physicochemical or biological properties. Accurate estimation of physicochemical and other biological parameters of drug molecules is a critical factor in drug discovery. In the present work, we developed a graph-based QSPR model for molecular structures which employed molecular structural invariants as predicting features. Degree and distance topological indices were derived from molecular graphs and combined with random forest (RF), gradient boosting, and multiple line regression (MLR) for prediction of predictive performance on the diverse drug datasets. The proposed RF model obtained an approximate 18–25% improvement in R 2 and a reduction of about 30% in RMSE over the classical linear regression models, showing better generalization performance. In the context of drug screening, the model accurately predicted early physicochemical properties including molar refractivity and polarizability, rendering it a tool to assess rapidly compounds for neurological and anticancer therapeutics. In addition, the computational model was about 15 times faster on average than existing QSPR approaches, achieving its excellent efficiency and applicability. The final results demonstrated that the molecular structural invariants functioned as good descriptors for generating reliable, interpretable, and predictive QSPR models relevant to early-stage drug discovery. [ABSTRACT FROM AUTHOR]
Copyright of AIMS Mathematics is the property of American Institute of Mathematical Sciences and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Datenbank: Complementary Index
Beschreibung
Abstract:Quantitative structure property relationship (QSPR) is a computational modeling approach that correlates the chemical structure of compounds with their physicochemical or biological properties. Accurate estimation of physicochemical and other biological parameters of drug molecules is a critical factor in drug discovery. In the present work, we developed a graph-based QSPR model for molecular structures which employed molecular structural invariants as predicting features. Degree and distance topological indices were derived from molecular graphs and combined with random forest (RF), gradient boosting, and multiple line regression (MLR) for prediction of predictive performance on the diverse drug datasets. The proposed RF model obtained an approximate 18–25% improvement in R 2 and a reduction of about 30% in RMSE over the classical linear regression models, showing better generalization performance. In the context of drug screening, the model accurately predicted early physicochemical properties including molar refractivity and polarizability, rendering it a tool to assess rapidly compounds for neurological and anticancer therapeutics. In addition, the computational model was about 15 times faster on average than existing QSPR approaches, achieving its excellent efficiency and applicability. The final results demonstrated that the molecular structural invariants functioned as good descriptors for generating reliable, interpretable, and predictive QSPR models relevant to early-stage drug discovery. [ABSTRACT FROM AUTHOR]
ISSN:24736988
DOI:10.3934/math.20251093