ProDomAs, protein domain assignment algorithm using center-based clustering and independent dominating set
ABSTRACT Decomposition of structural domains is an essential task in classifying protein structures, predicting protein function, and many other proteomics problems. As the number of known protein structures in PDB grows exponentially, the need for accurate automatic domain decomposition methods bec...
Gespeichert in:
| Veröffentlicht in: | Proteins, structure, function, and bioinformatics Jg. 82; H. 9; S. 1937 - 1946 |
|---|---|
| Hauptverfasser: | , , , |
| Format: | Journal Article |
| Sprache: | Englisch |
| Veröffentlicht: |
United States
Blackwell Publishing Ltd
01.09.2014
Wiley Subscription Services, Inc |
| Schlagworte: | |
| ISSN: | 0887-3585, 1097-0134, 1097-0134 |
| Online-Zugang: | Volltext |
| Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
| Zusammenfassung: | ABSTRACT
Decomposition of structural domains is an essential task in classifying protein structures, predicting protein function, and many other proteomics problems. As the number of known protein structures in PDB grows exponentially, the need for accurate automatic domain decomposition methods becomes more essential. In this article, we introduce a bottom‐up algorithm for assigning protein domains using a graph theoretical approach. This algorithm is based on a center‐based clustering approach. For constructing initial clusters, members of an independent dominating set for the graph representation of a protein are considered as the centers. A distance matrix is then defined for these clusters. To obtain final domains, these clusters are merged using the compactness principle of domains and a method similar to the neighbor‐joining algorithm considering some thresholds. The thresholds are computed using a training set consisting of 50 protein chains. The algorithm is implemented using C++ language and is named ProDomAs. To assess the performance of ProDomAs, its results are compared with seven automatic methods, against five publicly available benchmarks. The results show that ProDomAs outperforms other methods applied on the mentioned benchmarks. The performance of ProDomAs is also evaluated against 6342 chains obtained from ASTRAL SCOP 1.71. ProDomAs is freely available at http://www.bioinf.cs.ipm.ir/software/prodomas. Proteins 2014; 82:1937–1946. © 2014 Wiley Periodicals, Inc. |
|---|---|
| Bibliographie: | istex:7598EC3298E4E2B60F2EF3A12CF0616B89E8DEB3 ArticleID:PROT24547 ark:/67375/WNG-PLJ7CWT0-C School of Biological Science, Institute for Research in Fundamental Science (IPM) ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23 ObjectType-Article-2 ObjectType-Feature-1 |
| ISSN: | 0887-3585 1097-0134 1097-0134 |
| DOI: | 10.1002/prot.24547 |