Efficient calculation of compound similarity based on maximum common subgraphs and its application to prediction of gene transcript levels

Properties of a chemical entity, both physical and biological, are related to its structure. Since compound similarity can be used to infer properties of novel compounds, in chemoinformatics much attention has been paid to ways of calculating structural similarity. A useful metric to capture the str...

Full description

Saved in:
Bibliographic Details
Published in:International journal of bioinformatics research and applications Vol. 9; no. 4; p. 407
Main Authors: Berlo, Rogier J P Van, Winterbach, Wynand, Groot, Marco J L De, Bender, Andreas, Verheijen, Peter J T, Reinders, Marcel J T, Ridder, Dick De
Format: Journal Article
Language:English
Published: Switzerland 2013
Subjects:
ISSN:1744-5485
Online Access:Get more information
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Properties of a chemical entity, both physical and biological, are related to its structure. Since compound similarity can be used to infer properties of novel compounds, in chemoinformatics much attention has been paid to ways of calculating structural similarity. A useful metric to capture the structural similarity between compounds is the relative size of the Maximum Common Subgraph (MCS). The MCS is the largest substructure present in a pair of compounds, when represented as graphs. However, in practice it is difficult to employ such a metric, since calculation of the MCS becomes computationally intractable when it is large. We propose a novel algorithm that significantly reduces computation time for finding large MCSs, compared to a number of state-of-the-art approaches. The use of this algorithm is demonstrated in an application predicting the transcriptional response of breast cancer cell lines to different drug-like compounds, at a scale which is challenging for the most efficient MCS-algorithms to date. In this application 714 compounds were compared.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1744-5485
DOI:10.1504/IJBRA.2013.054688