Protein structural domain-disease association prediction based on heterogeneous networks

Background Domains can be viewed as portable units of protein structure, folding, function, evolution, and design. Small proteins are often found to be composed of only a single domain, while most large proteins consist of multiple domains for achieving various composite cellular functions. A dysfun...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:BMC genomics Ročník 23; číslo Suppl 6; s. 869 - 15
Hlavní autoři: Zhang, Jingpu, Deng, Lianping, Deng, Lei
Médium: Journal Article
Jazyk:angličtina
Vydáno: London BioMed Central 10.04.2025
BioMed Central Ltd
Springer Nature B.V
BMC
Témata:
ISSN:1471-2164, 1471-2164
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Abstract Background Domains can be viewed as portable units of protein structure, folding, function, evolution, and design. Small proteins are often found to be composed of only a single domain, while most large proteins consist of multiple domains for achieving various composite cellular functions. A dysfunction in domains may affect the function of proteins in some disease. Inferring the disease-related domains will help our understanding of the mechanism of human complex diseases. Results In this study, we firstly build a global heterogeneous information network based on structural-based domains, proteins, and diseases. Then the topological features of the network are extracted according to the meta-paths between domain and disease nodes. Finally, we train a binary classifier based on the XGBOOST (eXtreme Gradient Boosting) algorithm to predict the potential associations between domains and diseases. The results show that the binary classification model using the XGBOOST algorithm performs significantly better than models using other machine learning algorithms, achieving an AUC (Area Under Curve) score of 0.94 in the leave-one-out cross-validation experiment. Conclusions We develop a method to build a binary classifier using the topological features based on meta-paths and predict the potential associations between domains and diseases. Based on its predictive performance in independent test sets, the method is proved to be powerful. Moreover, representing domains and diseases through integrating more multi-omic data will further optimize predictive performance.
AbstractList BackgroundDomains can be viewed as portable units of protein structure, folding, function, evolution, and design. Small proteins are often found to be composed of only a single domain, while most large proteins consist of multiple domains for achieving various composite cellular functions. A dysfunction in domains may affect the function of proteins in some disease. Inferring the disease-related domains will help our understanding of the mechanism of human complex diseases.ResultsIn this study, we firstly build a global heterogeneous information network based on structural-based domains, proteins, and diseases. Then the topological features of the network are extracted according to the meta-paths between domain and disease nodes. Finally, we train a binary classifier based on the XGBOOST (eXtreme Gradient Boosting) algorithm to predict the potential associations between domains and diseases. The results show that the binary classification model using the XGBOOST algorithm performs significantly better than models using other machine learning algorithms, achieving an AUC (Area Under Curve) score of 0.94 in the leave-one-out cross-validation experiment.ConclusionsWe develop a method to build a binary classifier using the topological features based on meta-paths and predict the potential associations between domains and diseases. Based on its predictive performance in independent test sets, the method is proved to be powerful. Moreover, representing domains and diseases through integrating more multi-omic data will further optimize predictive performance.
Domains can be viewed as portable units of protein structure, folding, function, evolution, and design. Small proteins are often found to be composed of only a single domain, while most large proteins consist of multiple domains for achieving various composite cellular functions. A dysfunction in domains may affect the function of proteins in some disease. Inferring the disease-related domains will help our understanding of the mechanism of human complex diseases.BACKGROUNDDomains can be viewed as portable units of protein structure, folding, function, evolution, and design. Small proteins are often found to be composed of only a single domain, while most large proteins consist of multiple domains for achieving various composite cellular functions. A dysfunction in domains may affect the function of proteins in some disease. Inferring the disease-related domains will help our understanding of the mechanism of human complex diseases.In this study, we firstly build a global heterogeneous information network based on structural-based domains, proteins, and diseases. Then the topological features of the network are extracted according to the meta-paths between domain and disease nodes. Finally, we train a binary classifier based on the XGBOOST (eXtreme Gradient Boosting) algorithm to predict the potential associations between domains and diseases. The results show that the binary classification model using the XGBOOST algorithm performs significantly better than models using other machine learning algorithms, achieving an AUC (Area Under Curve) score of 0.94 in the leave-one-out cross-validation experiment.RESULTSIn this study, we firstly build a global heterogeneous information network based on structural-based domains, proteins, and diseases. Then the topological features of the network are extracted according to the meta-paths between domain and disease nodes. Finally, we train a binary classifier based on the XGBOOST (eXtreme Gradient Boosting) algorithm to predict the potential associations between domains and diseases. The results show that the binary classification model using the XGBOOST algorithm performs significantly better than models using other machine learning algorithms, achieving an AUC (Area Under Curve) score of 0.94 in the leave-one-out cross-validation experiment.We develop a method to build a binary classifier using the topological features based on meta-paths and predict the potential associations between domains and diseases. Based on its predictive performance in independent test sets, the method is proved to be powerful. Moreover, representing domains and diseases through integrating more multi-omic data will further optimize predictive performance.CONCLUSIONSWe develop a method to build a binary classifier using the topological features based on meta-paths and predict the potential associations between domains and diseases. Based on its predictive performance in independent test sets, the method is proved to be powerful. Moreover, representing domains and diseases through integrating more multi-omic data will further optimize predictive performance.
Domains can be viewed as portable units of protein structure, folding, function, evolution, and design. Small proteins are often found to be composed of only a single domain, while most large proteins consist of multiple domains for achieving various composite cellular functions. A dysfunction in domains may affect the function of proteins in some disease. Inferring the disease-related domains will help our understanding of the mechanism of human complex diseases. In this study, we firstly build a global heterogeneous information network based on structural-based domains, proteins, and diseases. Then the topological features of the network are extracted according to the meta-paths between domain and disease nodes. Finally, we train a binary classifier based on the XGBOOST (eXtreme Gradient Boosting) algorithm to predict the potential associations between domains and diseases. The results show that the binary classification model using the XGBOOST algorithm performs significantly better than models using other machine learning algorithms, achieving an AUC (Area Under Curve) score of 0.94 in the leave-one-out cross-validation experiment. We develop a method to build a binary classifier using the topological features based on meta-paths and predict the potential associations between domains and diseases. Based on its predictive performance in independent test sets, the method is proved to be powerful. Moreover, representing domains and diseases through integrating more multi-omic data will further optimize predictive performance.
Background Domains can be viewed as portable units of protein structure, folding, function, evolution, and design. Small proteins are often found to be composed of only a single domain, while most large proteins consist of multiple domains for achieving various composite cellular functions. A dysfunction in domains may affect the function of proteins in some disease. Inferring the disease-related domains will help our understanding of the mechanism of human complex diseases. Results In this study, we firstly build a global heterogeneous information network based on structural-based domains, proteins, and diseases. Then the topological features of the network are extracted according to the meta-paths between domain and disease nodes. Finally, we train a binary classifier based on the XGBOOST (eXtreme Gradient Boosting) algorithm to predict the potential associations between domains and diseases. The results show that the binary classification model using the XGBOOST algorithm performs significantly better than models using other machine learning algorithms, achieving an AUC (Area Under Curve) score of 0.94 in the leave-one-out cross-validation experiment. Conclusions We develop a method to build a binary classifier using the topological features based on meta-paths and predict the potential associations between domains and diseases. Based on its predictive performance in independent test sets, the method is proved to be powerful. Moreover, representing domains and diseases through integrating more multi-omic data will further optimize predictive performance. Keywords: Heterogeneous networks, Meta-path topological feature, Domain-disease association prediction
Domains can be viewed as portable units of protein structure, folding, function, evolution, and design. Small proteins are often found to be composed of only a single domain, while most large proteins consist of multiple domains for achieving various composite cellular functions. A dysfunction in domains may affect the function of proteins in some disease. Inferring the disease-related domains will help our understanding of the mechanism of human complex diseases. In this study, we firstly build a global heterogeneous information network based on structural-based domains, proteins, and diseases. Then the topological features of the network are extracted according to the meta-paths between domain and disease nodes. Finally, we train a binary classifier based on the XGBOOST (eXtreme Gradient Boosting) algorithm to predict the potential associations between domains and diseases. The results show that the binary classification model using the XGBOOST algorithm performs significantly better than models using other machine learning algorithms, achieving an AUC (Area Under Curve) score of 0.94 in the leave-one-out cross-validation experiment. We develop a method to build a binary classifier using the topological features based on meta-paths and predict the potential associations between domains and diseases. Based on its predictive performance in independent test sets, the method is proved to be powerful. Moreover, representing domains and diseases through integrating more multi-omic data will further optimize predictive performance.
Abstract Background Domains can be viewed as portable units of protein structure, folding, function, evolution, and design. Small proteins are often found to be composed of only a single domain, while most large proteins consist of multiple domains for achieving various composite cellular functions. A dysfunction in domains may affect the function of proteins in some disease. Inferring the disease-related domains will help our understanding of the mechanism of human complex diseases. Results In this study, we firstly build a global heterogeneous information network based on structural-based domains, proteins, and diseases. Then the topological features of the network are extracted according to the meta-paths between domain and disease nodes. Finally, we train a binary classifier based on the XGBOOST (eXtreme Gradient Boosting) algorithm to predict the potential associations between domains and diseases. The results show that the binary classification model using the XGBOOST algorithm performs significantly better than models using other machine learning algorithms, achieving an AUC (Area Under Curve) score of 0.94 in the leave-one-out cross-validation experiment. Conclusions We develop a method to build a binary classifier using the topological features based on meta-paths and predict the potential associations between domains and diseases. Based on its predictive performance in independent test sets, the method is proved to be powerful. Moreover, representing domains and diseases through integrating more multi-omic data will further optimize predictive performance.
Background Domains can be viewed as portable units of protein structure, folding, function, evolution, and design. Small proteins are often found to be composed of only a single domain, while most large proteins consist of multiple domains for achieving various composite cellular functions. A dysfunction in domains may affect the function of proteins in some disease. Inferring the disease-related domains will help our understanding of the mechanism of human complex diseases. Results In this study, we firstly build a global heterogeneous information network based on structural-based domains, proteins, and diseases. Then the topological features of the network are extracted according to the meta-paths between domain and disease nodes. Finally, we train a binary classifier based on the XGBOOST (eXtreme Gradient Boosting) algorithm to predict the potential associations between domains and diseases. The results show that the binary classification model using the XGBOOST algorithm performs significantly better than models using other machine learning algorithms, achieving an AUC (Area Under Curve) score of 0.94 in the leave-one-out cross-validation experiment. Conclusions We develop a method to build a binary classifier using the topological features based on meta-paths and predict the potential associations between domains and diseases. Based on its predictive performance in independent test sets, the method is proved to be powerful. Moreover, representing domains and diseases through integrating more multi-omic data will further optimize predictive performance.
ArticleNumber 869
Audience Academic
Author Deng, Lei
Zhang, Jingpu
Deng, Lianping
Author_xml – sequence: 1
  givenname: Jingpu
  orcidid: 0000-0002-8006-747X
  surname: Zhang
  fullname: Zhang, Jingpu
  organization: School of Computer and Data Science, Henan University of Urban Construction
– sequence: 2
  givenname: Lianping
  surname: Deng
  fullname: Deng, Lianping
  organization: School of Computer Science and Engineering, Central South University
– sequence: 3
  givenname: Lei
  surname: Deng
  fullname: Deng, Lei
  email: leideng@csu.edu.cn
  organization: School of Computer Science and Engineering, Central South University
BackLink https://www.ncbi.nlm.nih.gov/pubmed/40211147$$D View this record in MEDLINE/PubMed
BookMark eNp9kktv1DAUhSNURB_wB1igSGxgkeJXbGdZVRRGqgTiIbGzbpyb4CETD7Yj6L_HMyktgxD2wlfOd459nXNaHE1-wqJ4Ssk5pVq-ipRpKSrCREXzUBV5UJxQoWjFqBRHf9THxWmMa0Ko0qx-VBwLwrJAqJPiy_vgE7qpjCnMNs0BxrLzG3BT1bmIELGEGL11kJyfym3Aztl92eZvXZmLr5gw-AEn9HMsJ0w_fPgWHxcPexgjPrldz4rPV68_Xb6trt-9WV1eXFe25k2q2q4GQjRXoBsqrOVUMaFlzYRCLaglfc2YbYBrK5RivbakbjUKSXgDPVh-VqwW387D2myD20C4MR6c2W_4MBgIydkRTS-5sFJx2TEupGigtYL0jIrGAmmYzl4vFq9t8N9njMlsXLQ4jrDvzXCqtaaSCp7R53-haz-HKXdqOCM0X5rV9J4aIJ_vpt6nAHZnai40r4lWlO28zv9B5dnhxtn8z3uX9w8ELw8EmUn4Mw0wx2hWHz8css9uLzq3G-zuXuh3BDLAFsAGH2PA_g6hxOxyZpacmZwzs8-ZIVnEF1HM8DRguG__P6pf9BzQhg
Cites_doi 10.1093/nar/gkq973
10.1371/journal.pcbi.1007052
10.48550/arXiv.1911.08676
10.1038/35001165
10.1038/nature13835
10.1186/1471-2164-14-S3-S5
10.1093/database/bay064
10.1049/iet-syb.2009.0037
10.1038/s41586-019-1879-7
10.1016/j.ajhg.2008.02.013
10.1093/bib/bbr013
10.1371/journal.pcbi.1002819
10.1111/j.1467-9906.2008.00438.x
10.1161/CIRCRESAHA.116.304723
10.1038/nrg1522
10.1093/nar/gku1205
10.1371/journal.pone.0056653
10.1093/database/bau017
10.1093/nar/gkx973
10.1093/bioinformatics/btq447
10.1080/2150704X.2017.1280200
10.1126/science.8091226
10.1016/S0022-2836(05)80134-2
10.1073/pnas.0810772105
10.1186/s12859-016-1005-x
10.1016/S0140-6736(05)17741-1
10.1186/s12859-019-2644-5
10.1038/nmeth.3484
10.1186/s12918-015-0247-y
10.1109/TCBB.2015.2389213
10.1002/hep.23759
10.1038/nbt1203
10.1038/s41598-019-39905-9
10.1186/1752-0509-5-55
ContentType Journal Article
Copyright The Author(s) 2025
2025. The Author(s).
COPYRIGHT 2025 BioMed Central Ltd.
2025. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Copyright_xml – notice: The Author(s) 2025
– notice: 2025. The Author(s).
– notice: COPYRIGHT 2025 BioMed Central Ltd.
– notice: 2025. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
DBID C6C
AAYXX
CITATION
CGR
CUY
CVF
ECM
EIF
NPM
ISR
3V.
7QP
7QR
7SS
7TK
7U7
7X7
7XB
88E
8AO
8FD
8FE
8FH
8FI
8FJ
8FK
ABUWG
AEUYN
AFKRA
AZQEC
BBNVY
BENPR
BHPHI
C1K
CCPQU
DWQXO
FR3
FYUFA
GHDGH
GNUQQ
HCIFZ
K9.
LK8
M0S
M1P
M7P
P64
PHGZM
PHGZT
PIMPY
PJZUB
PKEHL
PPXIY
PQEST
PQGLB
PQQKQ
PQUKI
PRINS
RC3
7X8
DOA
DOI 10.1186/s12864-024-11117-0
DatabaseName Springer Nature OA Free Journals
CrossRef
Medline
MEDLINE
MEDLINE (Ovid)
MEDLINE
MEDLINE
PubMed
Gale In Context: Science
ProQuest Central (Corporate)
Calcium & Calcified Tissue Abstracts
Chemoreception Abstracts
Entomology Abstracts (Full archive)
Neurosciences Abstracts
Toxicology Abstracts
Health & Medical Collection
ProQuest Central (purchase pre-March 2016)
Medical Database (Alumni Edition)
ProQuest Pharma Collection
Technology Research Database
ProQuest SciTech Collection
ProQuest Natural Science Collection
Hospital Premium Collection
Hospital Premium Collection (Alumni Edition)
ProQuest Central (Alumni) (purchase pre-March 2016)
ProQuest Central (Alumni)
ProQuest One Sustainability
ProQuest Central UK/Ireland
ProQuest Central Essentials
Biological Science Database (Proquest)
ProQuest Central
Natural Science Collection
Environmental Sciences and Pollution Management
ProQuest One Community College
ProQuest Central
Engineering Research Database
Health Research Premium Collection
Health Research Premium Collection (Alumni)
ProQuest Central Student
SciTech Premium Collection
ProQuest Health & Medical Complete (Alumni)
Biological Sciences
Health & Medical Collection (Alumni Edition)
PML(ProQuest Medical Library)
Biological Science Database
Biotechnology and BioEngineering Abstracts
ProQuest Central Premium
ProQuest One Academic
Publicly Available Content Database
ProQuest Health & Medical Research Collection
ProQuest One Academic Middle East (New)
ProQuest One Health & Nursing
ProQuest One Academic Eastern Edition (DO NOT USE)
ProQuest One Applied & Life Sciences
ProQuest One Academic (retired)
ProQuest One Academic UKI Edition
ProQuest Central China
Genetics Abstracts
MEDLINE - Academic
Directory of Open Access Journals (DOAJ)
DatabaseTitle CrossRef
MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
Publicly Available Content Database
ProQuest Central Student
Technology Research Database
ProQuest One Academic Middle East (New)
ProQuest Central Essentials
ProQuest Health & Medical Complete (Alumni)
ProQuest Central (Alumni Edition)
SciTech Premium Collection
ProQuest One Community College
ProQuest One Health & Nursing
ProQuest Natural Science Collection
ProQuest Pharma Collection
ProQuest Central China
Environmental Sciences and Pollution Management
ProQuest Central
ProQuest One Applied & Life Sciences
ProQuest One Sustainability
ProQuest Health & Medical Research Collection
Genetics Abstracts
Health Research Premium Collection
Health and Medicine Complete (Alumni Edition)
Natural Science Collection
ProQuest Central Korea
Health & Medical Research Collection
Biological Science Collection
Chemoreception Abstracts
ProQuest Central (New)
ProQuest Medical Library (Alumni)
ProQuest Biological Science Collection
Toxicology Abstracts
ProQuest One Academic Eastern Edition
ProQuest Hospital Collection
Health Research Premium Collection (Alumni)
Biological Science Database
ProQuest SciTech Collection
Neurosciences Abstracts
ProQuest Hospital Collection (Alumni)
Biotechnology and BioEngineering Abstracts
Entomology Abstracts
ProQuest Health & Medical Complete
ProQuest Medical Library
ProQuest One Academic UKI Edition
Engineering Research Database
ProQuest One Academic
Calcium & Calcified Tissue Abstracts
ProQuest One Academic (New)
ProQuest Central (Alumni)
MEDLINE - Academic
DatabaseTitleList Publicly Available Content Database
MEDLINE - Academic
MEDLINE





Database_xml – sequence: 1
  dbid: DOA
  name: DOAJ Directory of Open Access Journals
  url: https://www.doaj.org/
  sourceTypes: Open Website
– sequence: 2
  dbid: NPM
  name: PubMed
  url: http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 3
  dbid: PIMPY
  name: Publicly Available Content Database
  url: http://search.proquest.com/publiccontent
  sourceTypes: Aggregation Database
DeliveryMethod fulltext_linktorsrc
Discipline Biology
EISSN 1471-2164
EndPage 15
ExternalDocumentID oai_doaj_org_article_f634c6736d234649abc40f2149ca0928
A835087123
40211147
10_1186_s12864_024_11117_0
Genre Journal Article
GeographicLocations China
GeographicLocations_xml – name: China
GrantInformation_xml – fundername: National Natural Science Foundation of China
  grantid: 62172140
  funderid: http://dx.doi.org/10.13039/501100001809
– fundername: National Natural Science Foundation of China
  grantid: 62172140
GroupedDBID ---
0R~
23N
2WC
2XV
53G
5VS
6J9
7X7
88E
8AO
8FE
8FH
8FI
8FJ
AAFWJ
AAHBH
AAJSJ
AASML
ABDBF
ABUWG
ACGFO
ACGFS
ACIHN
ACIWK
ACPRK
ACUHS
ADBBV
ADUKV
AEAQA
AENEX
AEUYN
AFKRA
AFPKN
AFRAH
AHBYD
AHMBA
AHYZX
ALMA_UNASSIGNED_HOLDINGS
AMKLP
AMTXH
AOIJS
BAPOH
BAWUL
BBNVY
BCNDV
BENPR
BFQNJ
BHPHI
BMC
BPHCQ
BVXVI
C6C
CCPQU
CS3
DIK
DU5
E3Z
EAD
EAP
EAS
EBD
EBLON
EBS
EMB
EMK
EMOBN
ESX
F5P
FYUFA
GROUPED_DOAJ
GX1
HCIFZ
HMCUK
IAO
IGS
IHR
INH
INR
ISR
ITC
KQ8
LK8
M1P
M7P
M~E
O5R
O5S
OK1
OVT
P2P
PGMZT
PHGZM
PHGZT
PIMPY
PJZUB
PPXIY
PQGLB
PQQKQ
PROAC
PSQYO
PUEGO
RBZ
RNS
ROL
RPM
RSV
SBL
SOJ
SV3
TR2
TUS
U2A
UKHRP
W2D
WOQ
WOW
XSB
AAYXX
AFFHD
CITATION
ALIPV
CGR
CUY
CVF
ECM
EIF
NPM
3V.
7QP
7QR
7SS
7TK
7U7
7XB
8FD
8FK
AZQEC
C1K
DWQXO
FR3
GNUQQ
K9.
M48
P64
PKEHL
PQEST
PQUKI
PRINS
RC3
7X8
ID FETCH-LOGICAL-c539t-bd5a00837a8914cc31724865247e841c0f522c9a38c4772f8c05b8e46039afac3
IEDL.DBID DOA
ISICitedReferencesCount 0
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=001463540200001&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 1471-2164
IngestDate Fri Oct 03 12:52:57 EDT 2025
Fri Sep 05 17:40:03 EDT 2025
Sat Nov 29 15:01:50 EST 2025
Tue Nov 11 10:49:43 EST 2025
Tue Nov 04 18:16:17 EST 2025
Thu Nov 13 16:00:15 EST 2025
Mon Apr 14 01:52:01 EDT 2025
Sat Nov 29 08:01:23 EST 2025
Sat Sep 06 07:21:41 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue Suppl 6
Keywords Domain-disease association prediction
Meta-path topological feature
Heterogeneous networks
Language English
License 2025. The Author(s).
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c539t-bd5a00837a8914cc31724865247e841c0f522c9a38c4772f8c05b8e46039afac3
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ORCID 0000-0002-8006-747X
OpenAccessLink https://doaj.org/article/f634c6736d234649abc40f2149ca0928
PMID 40211147
PQID 3201522251
PQPubID 44682
PageCount 15
ParticipantIDs doaj_primary_oai_doaj_org_article_f634c6736d234649abc40f2149ca0928
proquest_miscellaneous_3188816143
proquest_journals_3201522251
gale_infotracmisc_A835087123
gale_infotracacademiconefile_A835087123
gale_incontextgauss_ISR_A835087123
pubmed_primary_40211147
crossref_primary_10_1186_s12864_024_11117_0
springer_journals_10_1186_s12864_024_11117_0
PublicationCentury 2000
PublicationDate 2025-04-10
PublicationDateYYYYMMDD 2025-04-10
PublicationDate_xml – month: 04
  year: 2025
  text: 2025-04-10
  day: 10
PublicationDecade 2020
PublicationPlace London
PublicationPlace_xml – name: London
– name: England
PublicationTitle BMC genomics
PublicationTitleAbbrev BMC Genomics
PublicationTitleAlternate BMC Genomics
PublicationYear 2025
Publisher BioMed Central
BioMed Central Ltd
Springer Nature B.V
BMC
Publisher_xml – name: BioMed Central
– name: BioMed Central Ltd
– name: Springer Nature B.V
– name: BMC
References S Oliver (11117_CR17) 2000; 403
G Fu (11117_CR25) 2016; 17
H Zhang (11117_CR34) 2017; 8
11117_CR5
S Köhler (11117_CR19) 2008; 82
K Rahimi (11117_CR28) 2015; 116
11117_CR11
M Claussnitzer (11117_CR3) 2020; 577
C Savojardo (11117_CR13) 2019; 20
JS Amberger (11117_CR27) 2015; 43
11117_CR30
K Lage (11117_CR4) 2008; 105
JE Shim (11117_CR32) 2019; 15
N Dorius (11117_CR29) 2009; 31
11117_CR14
11117_CR12
KKH Farh (11117_CR6) 2015; 518
WY Wang (11117_CR2) 2005; 6
TA Peterson (11117_CR9) 2013; 14
S Aerts (11117_CR18) 2006; 24
H Yang (11117_CR7) 2015; 12
AG Murzin (11117_CR24) 1995; 247
W Zhang (11117_CR21) 2011; 5
11117_CR8
L Deng (11117_CR22) 2015; 12
ES Lander (11117_CR1) 1994; 265
FW Ortiz (11117_CR33) 2019; 9
PM Kearney (11117_CR35) 2005; 365
M Krassowski (11117_CR16) 2018; 46
TA Peterson (11117_CR10) 2010; 26
W Wang (11117_CR20) 2010; 4
11117_CR26
JT Dudley (11117_CR15) 2011; 12
G Wu (11117_CR31) 2019; 20
11117_CR23
References_xml – ident: 11117_CR26
  doi: 10.1093/nar/gkq973
– volume: 15
  start-page: e1007052
  issue: 5
  year: 2019
  ident: 11117_CR32
  publication-title: PLoS Comput Biol.
  doi: 10.1371/journal.pcbi.1007052
– ident: 11117_CR8
  doi: 10.48550/arXiv.1911.08676
– volume: 20
  start-page: 1530
  issue: 7
  year: 2019
  ident: 11117_CR13
  publication-title: Proc Natl Acad Sci.
– volume: 403
  start-page: 601
  issue: 6770
  year: 2000
  ident: 11117_CR17
  publication-title: Nature.
  doi: 10.1038/35001165
– volume: 518
  start-page: 337
  issue: 7539
  year: 2015
  ident: 11117_CR6
  publication-title: Nature.
  doi: 10.1038/nature13835
– volume: 14
  start-page: S5
  issue: S3
  year: 2013
  ident: 11117_CR9
  publication-title: BMC Genomics.
  doi: 10.1186/1471-2164-14-S3-S5
– ident: 11117_CR12
  doi: 10.1093/database/bay064
– volume: 4
  start-page: 212
  issue: 3
  year: 2010
  ident: 11117_CR20
  publication-title: IET Systems Biol.
  doi: 10.1049/iet-syb.2009.0037
– volume: 577
  start-page: 179
  issue: 7789
  year: 2020
  ident: 11117_CR3
  publication-title: Nature.
  doi: 10.1038/s41586-019-1879-7
– volume: 82
  start-page: 949
  issue: 4
  year: 2008
  ident: 11117_CR19
  publication-title: Am J Hum Genet.
  doi: 10.1016/j.ajhg.2008.02.013
– volume: 12
  start-page: 303
  issue: 4
  year: 2011
  ident: 11117_CR15
  publication-title: Brief Bioinforma.
  doi: 10.1093/bib/bbr013
– ident: 11117_CR11
  doi: 10.1371/journal.pcbi.1002819
– volume: 31
  start-page: 97
  issue: 1
  year: 2009
  ident: 11117_CR29
  publication-title: J Urban Affairs.
  doi: 10.1111/j.1467-9906.2008.00438.x
– volume: 116
  start-page: 925
  issue: 6
  year: 2015
  ident: 11117_CR28
  publication-title: Circ Res.
  doi: 10.1161/CIRCRESAHA.116.304723
– volume: 6
  start-page: 109
  issue: 2
  year: 2005
  ident: 11117_CR2
  publication-title: Nat Rev Genet.
  doi: 10.1038/nrg1522
– volume: 43
  start-page: D789
  issue: D1
  year: 2015
  ident: 11117_CR27
  publication-title: Nucleic Acids Res.
  doi: 10.1093/nar/gku1205
– ident: 11117_CR30
  doi: 10.1371/journal.pone.0056653
– ident: 11117_CR5
  doi: 10.1093/database/bau017
– volume: 46
  start-page: D901
  issue: D1
  year: 2018
  ident: 11117_CR16
  publication-title: Nucleic Acids Res.
  doi: 10.1093/nar/gkx973
– volume: 26
  start-page: 2458
  issue: 19
  year: 2010
  ident: 11117_CR10
  publication-title: Bioinformatics.
  doi: 10.1093/bioinformatics/btq447
– volume: 8
  start-page: 438
  issue: 5
  year: 2017
  ident: 11117_CR34
  publication-title: Remote Sens Lett.
  doi: 10.1080/2150704X.2017.1280200
– volume: 265
  start-page: 2037
  issue: 5181
  year: 1994
  ident: 11117_CR1
  publication-title: Science.
  doi: 10.1126/science.8091226
– volume: 247
  start-page: 536
  issue: 4
  year: 1995
  ident: 11117_CR24
  publication-title: J Mol Biol.
  doi: 10.1016/S0022-2836(05)80134-2
– volume: 105
  start-page: 20870
  issue: 52
  year: 2008
  ident: 11117_CR4
  publication-title: Proc Natl Acad Sci.
  doi: 10.1073/pnas.0810772105
– volume: 17
  start-page: 160
  issue: 1
  year: 2016
  ident: 11117_CR25
  publication-title: BMC Bioinformatics.
  doi: 10.1186/s12859-016-1005-x
– volume: 365
  start-page: 217
  issue: 9455
  year: 2005
  ident: 11117_CR35
  publication-title: Lancet.
  doi: 10.1016/S0140-6736(05)17741-1
– volume: 20
  start-page: 134
  issue: 3
  year: 2019
  ident: 11117_CR31
  publication-title: BMC Bioinformatics.
  doi: 10.1186/s12859-019-2644-5
– volume: 12
  start-page: 841
  issue: 9
  year: 2015
  ident: 11117_CR7
  publication-title: Nat Methods.
  doi: 10.1038/nmeth.3484
– ident: 11117_CR14
  doi: 10.1186/s12918-015-0247-y
– volume: 12
  start-page: 902
  issue: 4
  year: 2015
  ident: 11117_CR22
  publication-title: IEEE/ACM Trans Comput Biol Bioinforma.
  doi: 10.1109/TCBB.2015.2389213
– ident: 11117_CR23
  doi: 10.1002/hep.23759
– volume: 24
  start-page: 537
  issue: 5
  year: 2006
  ident: 11117_CR18
  publication-title: Nat Biotechnol.
  doi: 10.1038/nbt1203
– volume: 9
  start-page: 1
  issue: 1
  year: 2019
  ident: 11117_CR33
  publication-title: Sci Rep.
  doi: 10.1038/s41598-019-39905-9
– volume: 5
  start-page: 55
  issue: 1
  year: 2011
  ident: 11117_CR21
  publication-title: BMC Syst Biol.
  doi: 10.1186/1752-0509-5-55
SSID ssj0017825
Score 2.4571447
Snippet Background Domains can be viewed as portable units of protein structure, folding, function, evolution, and design. Small proteins are often found to be...
Domains can be viewed as portable units of protein structure, folding, function, evolution, and design. Small proteins are often found to be composed of only a...
Background Domains can be viewed as portable units of protein structure, folding, function, evolution, and design. Small proteins are often found to be...
BackgroundDomains can be viewed as portable units of protein structure, folding, function, evolution, and design. Small proteins are often found to be composed...
Abstract Background Domains can be viewed as portable units of protein structure, folding, function, evolution, and design. Small proteins are often found to...
SourceID doaj
proquest
gale
pubmed
crossref
springer
SourceType Open Website
Aggregation Database
Index Database
Publisher
StartPage 869
SubjectTerms Algorithms
Animal Genetics and Genomics
Biomedical and Life Sciences
China
Commuting
Computational Biology - methods
Data processing
Datasets
Disease
Disease - genetics
Diseases
Domain-disease association prediction
Genes
Health aspects
Heterogeneous networks
Humans
Life Sciences
Machine Learning
Medical research
Medicine, Experimental
Meta-path topological feature
Microarrays
Microbial Genetics and Genomics
Physiological aspects
Plant Genetics and Genomics
Protein Domains
Protein folding
Protein structure
Proteins
Proteins - chemistry
Proteins - metabolism
Proteomics
Semantics
Structure-function relationships
Topology
SummonAdditionalLinks – databaseName: Biological Science Database
  dbid: M7P
  link: http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV1Lb9QwELaggNQL70egoICQOIBVJ3YS-4QKogIJVSte2pvlTOKyB5LtZovEv2fGcXZZEFy4RYkTxZnP8_BMvmHsqZa6FJU3PCt9QWlGwWswwBvXytqAa_JQPP7lfXVyoudzM4sbbkMsq5x0YlDUTQ-0R34o0VIVFJxkL5dnnLpGUXY1ttC4yC4RS0IeSvdmmywCWr9i-lFGl4cD6uJScbRKnBRFxcWOMQqc_X9q5l9M02-50mCCjq_978tfZ1ej85kejWi5wS603U12ZWxH-eMWm8-ItGHRpSOpLBFypE3_zS06HvM4qdtKM12uKMkTDskWNikefKXqmh5B2fbnQ9qNNebDbfb5-M2n12957LzAoZBmzeumcOScVU6bTAGgk5ErXRa5qlqtMhAeZwPGSQ0K3XOvQRS1blUppHHegbzD9rq-a--xlCIm5dGPBAlKFdI1ui7QaawyyFtvioQ9n0RglyPBhg2BiS7tKDCLAguBSmVFwl6RlDYjiRw7nOhXpzauNetLqYDq1ZpcqlIZV4MSHkGH2BMm1wl7QjK2RH_RUX3NqTsfBvvu4wd7hA6pwBgylwl7Fgf5fr1y4OLvCjgrYszaGXmwMxLXJ-xenjBho34Y7BYQCXu8uUx3Us1bkJFFbas1OuQKH3F3hOBm3hj14wdRVcJeTJjcPvzvn-_-v9_lAdvPqb0xUVmKA7aHYGsfssvwfb0YVo_C2voJbWoniQ
  priority: 102
  providerName: ProQuest
– databaseName: SpringerLINK Contemporary 1997-Present
  dbid: RSV
  link: http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1Lb9QwELZQAYkL70egoICQOICFEz9iHwuiAglVVQvV3ixnEpc9kFSbLRL_nhkn2bI8DnCLNpNoM57HN5rxZ8aeW2mNqKLjhYma2oyC1-CAN6GVtYPQlGl4_ORjdXBgFwt3OG0KG-Zp97klmSJ1cmtrXg8YSY3imFM4uXnFsVC_jOnO0oENR8cnm94B5jw9b4_543NbKSgx9f8ej39KSL90SFPi2b_xf3_5Jrs-Ac18b7SMW-xS291mV8ejJ7_fYYtDImhYdvlIIEvkG3nTfw3Ljk89mzxcrFx-tqKGTrqkvNfkePGFJml6NMC2Px_ybpwnH-6yz_vvPr19z6dTFjho6da8bnQgIFYF6woFgICiVNboUlWtVQWIiBANXJAWFELxaEHo2rbKCOlCDCDvsZ2u79oHLKfqSEXEjCBBKS1DY2uNALEqoGyj0xl7OSven41kGj4VIdb4UVcedZWKksqLjL2htdlIEhF2-qFfnfrJr3w0UgHNpjWlVEa5UIMSEQ0M7Uy40mbsGa2sJ6qLjmZpTsP5MPgPx0d-D8GnwHqxlBl7MQnFfr0KEKatCfhVxI61Jbm7JYm-CNu3ZwPyUywYvESMpamsLjL2dHObnqT5trRGHiOrtQi-Fb7i_mh4m-_GCh8VoqqMvZqt7OLlf1ffw38Tf8SulXS0MdFYil22g8bXPmZX4Nt6OayeJA_7AYIXHcM
  priority: 102
  providerName: Springer Nature
Title Protein structural domain-disease association prediction based on heterogeneous networks
URI https://link.springer.com/article/10.1186/s12864-024-11117-0
https://www.ncbi.nlm.nih.gov/pubmed/40211147
https://www.proquest.com/docview/3201522251
https://www.proquest.com/docview/3188816143
https://doaj.org/article/f634c6736d234649abc40f2149ca0928
Volume 23
WOSCitedRecordID wos001463540200001&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVADU
  databaseName: Open Access: BioMedCentral Open Access Titles
  customDbUrl:
  eissn: 1471-2164
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0017825
  issn: 1471-2164
  databaseCode: RBZ
  dateStart: 20000101
  isFulltext: true
  titleUrlDefault: https://www.biomedcentral.com/search/
  providerName: BioMedCentral
– providerCode: PRVAON
  databaseName: DOAJ Directory of Open Access Journals
  customDbUrl:
  eissn: 1471-2164
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0017825
  issn: 1471-2164
  databaseCode: DOA
  dateStart: 20000101
  isFulltext: true
  titleUrlDefault: https://www.doaj.org/
  providerName: Directory of Open Access Journals
– providerCode: PRVHPJ
  databaseName: ROAD: Directory of Open Access Scholarly Resources
  customDbUrl:
  eissn: 1471-2164
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0017825
  issn: 1471-2164
  databaseCode: M~E
  dateStart: 20000101
  isFulltext: true
  titleUrlDefault: https://road.issn.org
  providerName: ISSN International Centre
– providerCode: PRVPQU
  databaseName: Health & Medical Collection
  customDbUrl:
  eissn: 1471-2164
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0017825
  issn: 1471-2164
  databaseCode: 7X7
  dateStart: 20090101
  isFulltext: true
  titleUrlDefault: https://search.proquest.com/healthcomplete
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: ProQuest Biological Science
  customDbUrl:
  eissn: 1471-2164
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0017825
  issn: 1471-2164
  databaseCode: M7P
  dateStart: 20090101
  isFulltext: true
  titleUrlDefault: http://search.proquest.com/biologicalscijournals
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: ProQuest Central
  customDbUrl:
  eissn: 1471-2164
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0017825
  issn: 1471-2164
  databaseCode: BENPR
  dateStart: 20090101
  isFulltext: true
  titleUrlDefault: https://www.proquest.com/central
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: Publicly Available Content Database
  customDbUrl:
  eissn: 1471-2164
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0017825
  issn: 1471-2164
  databaseCode: PIMPY
  dateStart: 20090101
  isFulltext: true
  titleUrlDefault: http://search.proquest.com/publiccontent
  providerName: ProQuest
– providerCode: PRVAVX
  databaseName: Springer Nature Link Journals
  customDbUrl:
  eissn: 1471-2164
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0017825
  issn: 1471-2164
  databaseCode: RSV
  dateStart: 20001201
  isFulltext: true
  titleUrlDefault: https://link.springer.com/search?facet-content-type=%22Journal%22
  providerName: Springer Nature
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1Lb9QwELaggMQF8SZQVgEhcQCrTmzH9rFFragEq2gL1XKyHCeGHkiqzRaJf8-Mk2y7IMSFS5SHYyUzY883mvFnQl5prgumgqFZESSmGRmtvPG0dg2vjHd1HovHTz-o-Vwvl6a8stUX1oQN9MCD4PZCwYXH4qM656IQxlVesAA9QEfM5HGZL1NmCqbG_AH4PTktkdHFXg-zcCEo-COKU4SibMsNRbb-P-fkK07ptyxpdD5Hd8mdETWm-8PX3iPXmvY-uTXsI_nzAVmWyLZw1qYDGywyaaR19x2CfjomYFJ3qYb0fIXZmXiKTqxO4eQblsV0YE1Nd9Gn7VAc3j8kn48OP717T8ctE6iX3KxpVUuHqEo5bTLhPaCDXOhC5kI1WmSeBcBb3jiuvQBcHbRnstKNKBg3LjjPH5GdtmubJyTFUEcEAICeeyEkd7WuJKA9lfm8CUYm5M0kQXs-MGPYGFHowg7ytiDvGGEoyxJygELetERW63gDdG1HXdt_6TohL1FFFnkrWiyM-eou-t4enyzsPiBJBsFfzhPyemwUuvXKeTeuM4C_QqqrrZa7Wy1hYPntx5Ml2HFg95YDYJIYI2cJebF5jG9isVrUkYVpUmtA0gK6eDxY0Oa_IVwHgQiVkLeTSV12_nfxPf0f4ntGbue4ezEyVbJdsgMm2TwnN_2P9Vm_mpHraqniUc_IjYPDebmYxUE1w3rYEu6Vxx_LL3C1ODn9BeywH9g
linkProvider Directory of Open Access Journals
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMw1V1Lj9MwEB4tXRBceD8CCwQE4gDWurGT2AeElsdqq-1WFSyonIzjJEsPJKXpgvZP8RuZyaOlILjtgVvUuG4cfzPfTGc8A_BICRXxONesH-UhhRk5S5x2LLWZSLSzaVAnj38YxqORmkz0eAN-dGdhKK2y04m1ok5LR_-RbwtkqpCck_6L2VdGXaMoutq10GhgsZ-dfEeXrXo-eI37-zgIdt8cvtpjbVcB5kKhFyxJQ0uGR2yV7kvnkEADqaIwkHGmZN_xHH_GaSuUk2h65srxMFGZjLjQNrdO4LxnYFMS2HuwOR4cjD8u4xbIt2F3NEdF2xVq_0gy5EFGqilmfI3-6i4Bf3LBL2T4W3S2Jr3dS__b67oMF1vz2t9p5OEKbGTFVTjXNNw8uQaTMZWlmBZ-UzaXSo74afnFTgvWRqp8u8KrP5tTGKu-JLZPfbz4TPlDJYpdVh5XftFk0VfX4f2pLOsG9IqyyG6BTz6hzNFSdsJJGQqbqiREszjuuyDLdejB027LzawpIWJq10tFpgGIQYDUrlhsuAcvCRXLkVT-u_6gnB-ZVpuYPBLSUUZeGggZSW0TJ3mOYoXSxXWgPHhImDJU4KOgDKIje1xVZvDurdlBk5ujlxwID560g_JyMbfOtgcycFVUE2xt5NbaSNRAbv12h0HTasDKrADowYPlbfomZfXVe2SQT5RCl0PiFDcbyC_XLdH4RF899uBZJwOryf_--m7_-1nuw_m9w4OhGQ5G-3fgQkDNnKlwJ9-CHgIvuwtn3bfFtJrfayXbh0-nLR0_AZ7tg3Y
linkToPdf http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV1Lb9QwELZQeYgL70egQEBIHKhVJ7YT-1geKyqq1YpCtTfLceKyB5LVZovEv2fGTrZdHgfELUomUeyZ8XyjGX8m5KXiqmCl1zQrvMQyI6OV047WtuGVdrbOQ_P4yVE5nar5XM8u7OIP3e5jSTLuaUCWpna9v6x9dHFV7PewqhaCQnyh6PIlhaT9ssBGeszXj082dQSIf3LcKvPH97bCUWDt_31tvhCcfqmWhiA0ufn_v3-L3BgAaHoQLeY2udS0d8jVeCTlj7tkPkPihkWbRmJZJOVI6-6bXbR0qOWk9lyj6XKFhZ5wifGwTuHiK3bYdGCYTXfWp23sM-_vkS-T95_ffqDD6QvUSa7XtKqlRYBWWqUz4RwAjVyoQuaibJTIHPMA3Zy2XDkBEN0rx2SlGlEwrq23jt8nO23XNg9JilmT8IAlHXdCSG5rVUkAjmXm8sZrmZDXoxLMMpJsmJCcqMLEuTIwVyFZKQ1LyBvU00YSCbLDjW51agZ_M77gwmHPWp1zUQhtKyeYB8MD-2M6Vwl5gVo2SIHRYo_NqT3re3N4_MkcAChlkEfmPCGvBiHfrVfW2WHLAowKWbO2JHe3JMFH3fbj0ZjMsEb0hgP2kphuZwl5vnmMb2LfW9CRgRVXKQDlAj7xIBrhZtyQ-cOEiDIhe6PFnX_879P36N_En5Frs3cTc3Q4_fiYXM_x9GNkumS7ZAfssHlCrrjv60W_ehoc7ydEBymL
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Protein+structural+domain-disease+association+prediction+based+on+heterogeneous+networks&rft.jtitle=BMC+genomics&rft.au=Zhang%2C+Jingpu&rft.au=Deng%2C+Lianping&rft.au=Deng%2C+Lei&rft.date=2025-04-10&rft.pub=BioMed+Central+Ltd&rft.issn=1471-2164&rft.eissn=1471-2164&rft.volume=23&rft.issue=Suppl+6&rft_id=info:doi/10.1186%2Fs12864-024-11117-0&rft.externalDocID=A835087123
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1471-2164&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1471-2164&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1471-2164&client=summon