A general approach for developing system-specific functions to score protein-ligand docked complexes using support vector inductive logic programming

Despite the increased recent use of protein–ligand and protein–protein docking in the drug discovery process due to the increases in computational power, the difficulty of accurately ranking the binding affinities of a series of ligands or a series of proteins docked to a protein receptor remains la...

Full description

Saved in:
Bibliographic Details
Published in:Proteins, structure, function, and bioinformatics Vol. 69; no. 4; pp. 823 - 831
Main Authors: Amini, Ata, Shrimpton, Paul J., Muggleton, Stephen H., Sternberg, Michael J. E.
Format: Journal Article
Language:English
Published: Hoboken Wiley Subscription Services, Inc., A Wiley Company 01.12.2007
Subjects:
ISSN:0887-3585, 1097-0134, 1097-0134
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Despite the increased recent use of protein–ligand and protein–protein docking in the drug discovery process due to the increases in computational power, the difficulty of accurately ranking the binding affinities of a series of ligands or a series of proteins docked to a protein receptor remains largely unsolved. This problem is of major concern in lead optimization procedures and has lead to the development of scoring functions tailored to rank the binding affinities of a series of ligands to a specific system. However, such methods can take a long time to develop and their transferability to other systems remains open to question. Here we demonstrate that given a suitable amount of background information a new approach using support vector inductive logic programming (SVILP) can be used to produce system‐specific scoring functions. Inductive logic programming (ILP) learns logic‐based rules for a given dataset that can be used to describe properties of each member of the set in a qualitative manner. By combining ILP with support vector machine regression, a quantitative set of rules can be obtained. SVILP has previously been used in a biological context to examine datasets containing a series of singular molecular structures and properties. Here we describe the use of SVILP to produce binding affinity predictions of a series of ligands to a particular protein. We also for the first time examine the applicability of SVILP techniques to datasets consisting of protein–ligand complexes. Our results show that SVILP performs comparably with other state‐of‐the‐art methods on five protein–ligand systems as judged by similar cross‐validated squares of their correlation coefficients. A McNemar test comparing SVILP to CoMFA and CoMSIA across the five systems indicates our method to be significantly better on one occasion. The ability to graphically display and understand the SVILP‐produced rules is demonstrated and this feature of ILP can be used to derive hypothesis for future ligand design in lead optimization procedures. The approach can readily be extended to evaluate the binding affinities of a series of protein–protein complexes. Proteins 2007. © 2007 Wiley‐Liss, Inc.
Bibliography:istex:F2760B7462F15528E882D50D8F5339B08D56108A
ark:/67375/WNG-FXTF2RG5-D
BBSRC
ArticleID:PROT21782
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0887-3585
1097-0134
1097-0134
DOI:10.1002/prot.21782