Cheminformatics meets molecular mechanics: a combined application of knowledge-based pose scoring and physical force field-based hit scoring functions improves the accuracy of structure-based virtual screening

Poor performance of scoring functions is a well-known bottleneck in structure-based virtual screening (VS), which is most frequently manifested in the scoring functions' inability to discriminate between true ligands vs known nonbinders (therefore designated as binding decoys). This deficiency...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of chemical information and modeling Jg. 52; H. 1; S. 16
Hauptverfasser: Hsieh, Jui-Hua, Yin, Shuangye, Wang, Xiang S, Liu, Shubin, Dokholyan, Nikolay V, Tropsha, Alexander
Format: Journal Article
Sprache:Englisch
Veröffentlicht: United States 23.01.2012
Schlagworte:
ISSN:1549-960X, 1549-960X
Online-Zugang:Weitere Angaben
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Abstract Poor performance of scoring functions is a well-known bottleneck in structure-based virtual screening (VS), which is most frequently manifested in the scoring functions' inability to discriminate between true ligands vs known nonbinders (therefore designated as binding decoys). This deficiency leads to a large number of false positive hits resulting from VS. We have hypothesized that filtering out or penalizing docking poses recognized as non-native (i.e., pose decoys) should improve the performance of VS in terms of improved identification of true binders. Using several concepts from the field of cheminformatics, we have developed a novel approach to identifying pose decoys from an ensemble of poses generated by computational docking procedures. We demonstrate that the use of target-specific pose (scoring) filter in combination with a physical force field-based scoring function (MedusaScore) leads to significant improvement of hit rates in VS studies for 12 of the 13 benchmark sets from the clustered version of the Database of Useful Decoys (DUD). This new hybrid scoring function outperforms several conventional structure-based scoring functions, including XSCORE::HMSCORE, ChemScore, PLP, and Chemgauss3, in 6 out of 13 data sets at early stage of VS (up 1% decoys of the screening database). We compare our hybrid method with several novel VS methods that were recently reported to have good performances on the same DUD data sets. We find that the retrieved ligands using our method are chemically more diverse in comparison with two ligand-based methods (FieldScreen and FLAP::LBX). We also compare our method with FLAP::RBLB, a high-performance VS method that also utilizes both the receptor and the cognate ligand structures. Interestingly, we find that the top ligands retrieved using our method are highly complementary to those retrieved using FLAP::RBLB, hinting effective directions for best VS applications. We suggest that this integrative VS approach combining cheminformatics and molecular mechanics methodologies may be applied to a broad variety of protein targets to improve the outcome of structure-based drug discovery studies.
AbstractList Poor performance of scoring functions is a well-known bottleneck in structure-based virtual screening (VS), which is most frequently manifested in the scoring functions' inability to discriminate between true ligands vs known nonbinders (therefore designated as binding decoys). This deficiency leads to a large number of false positive hits resulting from VS. We have hypothesized that filtering out or penalizing docking poses recognized as non-native (i.e., pose decoys) should improve the performance of VS in terms of improved identification of true binders. Using several concepts from the field of cheminformatics, we have developed a novel approach to identifying pose decoys from an ensemble of poses generated by computational docking procedures. We demonstrate that the use of target-specific pose (scoring) filter in combination with a physical force field-based scoring function (MedusaScore) leads to significant improvement of hit rates in VS studies for 12 of the 13 benchmark sets from the clustered version of the Database of Useful Decoys (DUD). This new hybrid scoring function outperforms several conventional structure-based scoring functions, including XSCORE::HMSCORE, ChemScore, PLP, and Chemgauss3, in 6 out of 13 data sets at early stage of VS (up 1% decoys of the screening database). We compare our hybrid method with several novel VS methods that were recently reported to have good performances on the same DUD data sets. We find that the retrieved ligands using our method are chemically more diverse in comparison with two ligand-based methods (FieldScreen and FLAP::LBX). We also compare our method with FLAP::RBLB, a high-performance VS method that also utilizes both the receptor and the cognate ligand structures. Interestingly, we find that the top ligands retrieved using our method are highly complementary to those retrieved using FLAP::RBLB, hinting effective directions for best VS applications. We suggest that this integrative VS approach combining cheminformatics and molecular mechanics methodologies may be applied to a broad variety of protein targets to improve the outcome of structure-based drug discovery studies.Poor performance of scoring functions is a well-known bottleneck in structure-based virtual screening (VS), which is most frequently manifested in the scoring functions' inability to discriminate between true ligands vs known nonbinders (therefore designated as binding decoys). This deficiency leads to a large number of false positive hits resulting from VS. We have hypothesized that filtering out or penalizing docking poses recognized as non-native (i.e., pose decoys) should improve the performance of VS in terms of improved identification of true binders. Using several concepts from the field of cheminformatics, we have developed a novel approach to identifying pose decoys from an ensemble of poses generated by computational docking procedures. We demonstrate that the use of target-specific pose (scoring) filter in combination with a physical force field-based scoring function (MedusaScore) leads to significant improvement of hit rates in VS studies for 12 of the 13 benchmark sets from the clustered version of the Database of Useful Decoys (DUD). This new hybrid scoring function outperforms several conventional structure-based scoring functions, including XSCORE::HMSCORE, ChemScore, PLP, and Chemgauss3, in 6 out of 13 data sets at early stage of VS (up 1% decoys of the screening database). We compare our hybrid method with several novel VS methods that were recently reported to have good performances on the same DUD data sets. We find that the retrieved ligands using our method are chemically more diverse in comparison with two ligand-based methods (FieldScreen and FLAP::LBX). We also compare our method with FLAP::RBLB, a high-performance VS method that also utilizes both the receptor and the cognate ligand structures. Interestingly, we find that the top ligands retrieved using our method are highly complementary to those retrieved using FLAP::RBLB, hinting effective directions for best VS applications. We suggest that this integrative VS approach combining cheminformatics and molecular mechanics methodologies may be applied to a broad variety of protein targets to improve the outcome of structure-based drug discovery studies.
Poor performance of scoring functions is a well-known bottleneck in structure-based virtual screening (VS), which is most frequently manifested in the scoring functions' inability to discriminate between true ligands vs known nonbinders (therefore designated as binding decoys). This deficiency leads to a large number of false positive hits resulting from VS. We have hypothesized that filtering out or penalizing docking poses recognized as non-native (i.e., pose decoys) should improve the performance of VS in terms of improved identification of true binders. Using several concepts from the field of cheminformatics, we have developed a novel approach to identifying pose decoys from an ensemble of poses generated by computational docking procedures. We demonstrate that the use of target-specific pose (scoring) filter in combination with a physical force field-based scoring function (MedusaScore) leads to significant improvement of hit rates in VS studies for 12 of the 13 benchmark sets from the clustered version of the Database of Useful Decoys (DUD). This new hybrid scoring function outperforms several conventional structure-based scoring functions, including XSCORE::HMSCORE, ChemScore, PLP, and Chemgauss3, in 6 out of 13 data sets at early stage of VS (up 1% decoys of the screening database). We compare our hybrid method with several novel VS methods that were recently reported to have good performances on the same DUD data sets. We find that the retrieved ligands using our method are chemically more diverse in comparison with two ligand-based methods (FieldScreen and FLAP::LBX). We also compare our method with FLAP::RBLB, a high-performance VS method that also utilizes both the receptor and the cognate ligand structures. Interestingly, we find that the top ligands retrieved using our method are highly complementary to those retrieved using FLAP::RBLB, hinting effective directions for best VS applications. We suggest that this integrative VS approach combining cheminformatics and molecular mechanics methodologies may be applied to a broad variety of protein targets to improve the outcome of structure-based drug discovery studies.
Author Dokholyan, Nikolay V
Tropsha, Alexander
Wang, Xiang S
Hsieh, Jui-Hua
Liu, Shubin
Yin, Shuangye
Author_xml – sequence: 1
  givenname: Jui-Hua
  surname: Hsieh
  fullname: Hsieh, Jui-Hua
  organization: Laboratory for Molecular Modeling, Division of Medicinal Chemistry and Natural Products, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, USA
– sequence: 2
  givenname: Shuangye
  surname: Yin
  fullname: Yin, Shuangye
– sequence: 3
  givenname: Xiang S
  surname: Wang
  fullname: Wang, Xiang S
– sequence: 4
  givenname: Shubin
  surname: Liu
  fullname: Liu, Shubin
– sequence: 5
  givenname: Nikolay V
  surname: Dokholyan
  fullname: Dokholyan, Nikolay V
– sequence: 6
  givenname: Alexander
  surname: Tropsha
  fullname: Tropsha, Alexander
BackLink https://www.ncbi.nlm.nih.gov/pubmed/22017385$$D View this record in MEDLINE/PubMed
BookMark eNpNUctuFDEQtFAQSRYO_ADyjdOA7Xl4hhtakYAUiQuRuK087XbG4LEHP4L2M_mjOGKDuPSrqqta6kty5oNHQl5z9o4zwd-DFYyJnsln5IL33dRMA_t-9l99Ti5T-sFY206DeEHOhWBctmN_Qf7sF1ytNyGuKltIdEXMNQaHUJyKtYdF-Yp8oIpCWGfrUVO1bc5C3QieBkN_-vDbob7DZlapwltISBOEaP0dVb4OlmOqfEerESA1Fp0-cReb_1FN8fComahdtxjuMdG8IFUAJSo4PlqlHAvkEp-s7m3MpQoniIi-irwkz41yCV-d8o7cXn36tv_c3Hy9_rL_eNOobuC5MfMw8k7ynmEvZpzHjiuhudbIOy01QqdmGIyS_SDHVoNEYfioNAxsFGaexI68_atbD_1VMOXDahOgc8pjKOkwcdlL2dYH7cibE7PMK-rDFu2q4vHw9AXxAPxbkeE
CitedBy_id crossref_primary_10_1016_j_chemolab_2021_104384
crossref_primary_10_1016_j_cclet_2013_06_016
crossref_primary_10_1016_j_jbiotec_2019_01_023
crossref_primary_10_1007_s10822_013_9637_7
crossref_primary_10_3390_molecules29081826
crossref_primary_10_1002_minf_202000105
crossref_primary_10_1002_qua_24400
crossref_primary_10_2174_0109298673334469241017053508
crossref_primary_10_1016_j_biomaterials_2013_07_011
crossref_primary_10_1080_17460441_2021_1929921
crossref_primary_10_1111_cbdd_12631
crossref_primary_10_1016_j_ymeth_2014_11_015
crossref_primary_10_1002_ps_7700
crossref_primary_10_1002_prot_24110
crossref_primary_10_1080_14756366_2018_1437156
crossref_primary_10_1002_minf_201500038
crossref_primary_10_1002_jmr_2471
ContentType Journal Article
DBID CGR
CUY
CVF
ECM
EIF
NPM
7X8
DOI 10.1021/ci2002507
DatabaseName Medline
MEDLINE
MEDLINE (Ovid)
MEDLINE
MEDLINE
PubMed
MEDLINE - Academic
DatabaseTitle MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
MEDLINE - Academic
DatabaseTitleList MEDLINE - Academic
MEDLINE
Database_xml – sequence: 1
  dbid: NPM
  name: PubMed
  url: http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 2
  dbid: 7X8
  name: MEDLINE - Academic
  url: https://search.proquest.com/medline
  sourceTypes: Aggregation Database
DeliveryMethod no_fulltext_linktorsrc
Discipline Chemistry
EISSN 1549-960X
ExternalDocumentID 22017385
Genre Research Support, American Recovery and Reinvestment Act
Research Support, Non-U.S. Gov't
Journal Article
Research Support, N.I.H., Extramural
GrantInformation_xml – fundername: NIGMS NIH HHS
  grantid: GM066940
– fundername: NIGMS NIH HHS
  grantid: R01 GM066940
– fundername: NIGMS NIH HHS
  grantid: R01GM080742
– fundername: NIGMS NIH HHS
  grantid: R01 GM080742
– fundername: NIGMS NIH HHS
  grantid: GM066940-06S1
GroupedDBID ---
-~X
4.4
55A
5GY
5VS
7~N
AABXI
ABBLG
ABJNI
ABLBI
ABMVS
ABQRX
ABUCX
ACGFS
ACIWK
ACNCT
ACS
ADHLV
AEESW
AENEX
AFEFF
AHGAQ
ALMA_UNASSIGNED_HOLDINGS
AQSVZ
CGR
CUPRZ
CUY
CVF
D0L
DU5
EBS
ECM
ED~
EIF
EJD
F5P
GGK
GNL
IH9
JG~
LG6
NPM
P2P
PQQKQ
RNS
ROL
UI2
VF5
VG9
W1F
7X8
ID FETCH-LOGICAL-a461t-fb68147150e52beb841a2d1dde14d7dec4abc6fa756783dc7e2f18adc6082fb92
IEDL.DBID 7X8
ISICitedReferencesCount 39
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000299351600003&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 1549-960X
IngestDate Thu Jul 10 22:00:24 EDT 2025
Mon Jul 21 06:03:15 EDT 2025
IsDoiOpenAccess false
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 1
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-a461t-fb68147150e52beb841a2d1dde14d7dec4abc6fa756783dc7e2f18adc6082fb92
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ObjectType-Undefined-3
OpenAccessLink https://www.ncbi.nlm.nih.gov/pmc/articles/3264743
PMID 22017385
PQID 917577310
PQPubID 23479
ParticipantIDs proquest_miscellaneous_917577310
pubmed_primary_22017385
PublicationCentury 2000
PublicationDate 2012-01-23
PublicationDateYYYYMMDD 2012-01-23
PublicationDate_xml – month: 01
  year: 2012
  text: 2012-01-23
  day: 23
PublicationDecade 2010
PublicationPlace United States
PublicationPlace_xml – name: United States
PublicationTitle Journal of chemical information and modeling
PublicationTitleAlternate J Chem Inf Model
PublicationYear 2012
SSID ssj0033962
Score 2.2174492
Snippet Poor performance of scoring functions is a well-known bottleneck in structure-based virtual screening (VS), which is most frequently manifested in the scoring...
SourceID proquest
pubmed
SourceType Aggregation Database
Index Database
StartPage 16
SubjectTerms Algorithms
Binding Sites
Biomechanical Phenomena
Databases, Factual
Drug Discovery - methods
Humans
Informatics
Ligands
Molecular Conformation
Molecular Dynamics Simulation
Peptide Hydrolases - chemistry
Protease Inhibitors - chemistry
Protein Binding
Research Design
Thermodynamics
User-Computer Interface
Title Cheminformatics meets molecular mechanics: a combined application of knowledge-based pose scoring and physical force field-based hit scoring functions improves the accuracy of structure-based virtual screening
URI https://www.ncbi.nlm.nih.gov/pubmed/22017385
https://www.proquest.com/docview/917577310
Volume 52
WOSCitedRecordID wos000299351600003&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText
inHoldings 1
isFullTextHit
isPrint
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV07T-QwELY4QLpreMPx1BS0FrHzcEKDEAJRwIoCpO1W9tgRKS5ZNrtI_Mz7Rzd2khXNiYImysPxRBlr5rNn_A1j56LQuTap5MamGU8spjzHUvPAFVboSGFHmf-gRqN8PC6e-tyctk-rHGxiMNS2Qb9GfkHTilQpAiNX0zfui0b54GpfQeMHW4sJyfiMLjVeBhHiuAj1RD0JGSegPh6IhaS4wEoG76_-DyyDg7nb_OanbbGNHlnCdTcUttmKq3fYz5uhoNsu-xvO6x6lYgt_nJvTcSiQS9d-GzA9uQQNJJomzc7CpxA3NCUsF-G4d4AWpk3roMWQxwe6phu95oEEoYOQIte3fa3my6beoYYxD1VY13AtEBYFjbiYafzwojpu28VsEPVezfxuF-rCJwtRJ3vs5e72-eae9xUduE4yMeelyXJB7jCNXCqNM3kitLSCTKxIrLIOE20wK7VKyYfGFpWTpci1xYyQSmkKuc9W66Z2vxlE5FatETFaXSRR4vLUosQyNiKzNo_sIYNBWRP6zz4MomvXLNrJUl2H7KBT-GTaMXtMJMEhT-9z9PXLx-wXNfZpLVzGJ2ytJGvhTtk6vs-rdnYWRiIdR0-P_wDhzPE0
linkProvider ProQuest
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Cheminformatics+meets+molecular+mechanics%3A+a+combined+application+of+knowledge-based+pose+scoring+and+physical+force+field-based+hit+scoring+functions+improves+the+accuracy+of+structure-based+virtual+screening&rft.jtitle=Journal+of+chemical+information+and+modeling&rft.au=Hsieh%2C+Jui-Hua&rft.au=Yin%2C+Shuangye&rft.au=Wang%2C+Xiang+S&rft.au=Liu%2C+Shubin&rft.date=2012-01-23&rft.eissn=1549-960X&rft.volume=52&rft.issue=1&rft.spage=16&rft_id=info:doi/10.1021%2Fci2002507&rft_id=info%3Apmid%2F22017385&rft_id=info%3Apmid%2F22017385&rft.externalDocID=22017385
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1549-960X&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1549-960X&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1549-960X&client=summon