Optimizing weighted gene co-expression network analysis with a multi-threaded calculation of the topological overlap matrix

Biomolecular networks are often assumed to be scale-free hierarchical networks. The weighted gene co-expression network analysis (WGCNA) treats gene co-expression networks as undirected scale-free hierarchical weighted networks. The WGCNA R software package uses an Adjacency Matrix to store a networ...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:Statistical applications in genetics and molecular biology Ročník 20; číslo 4-6; s. 145
Hlavní autoři: Shuai, Min, He, Dongmei, Chen, Xin
Médium: Journal Article
Jazyk:angličtina
Vydáno: Germany 31.12.2021
Témata:
ISSN:1544-6115, 1544-6115
On-line přístup:Zjistit podrobnosti o přístupu
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Abstract Biomolecular networks are often assumed to be scale-free hierarchical networks. The weighted gene co-expression network analysis (WGCNA) treats gene co-expression networks as undirected scale-free hierarchical weighted networks. The WGCNA R software package uses an Adjacency Matrix to store a network, next calculates the topological overlap matrix (TOM), and then identifies the modules (sub-networks), where each module is assumed to be associated with a certain biological function. The most time-consuming step of WGCNA is to calculate TOM from the Adjacency Matrix in a single thread. In this paper, the single-threaded algorithm of the TOM has been changed into a multi-threaded algorithm (the parameters are the default values of WGCNA). In the multi-threaded algorithm, Rcpp was used to make R call a C++ function, and then C++ used OpenMP to start multiple threads to calculate TOM from the Adjacency Matrix. On shared-memory MultiProcessor systems, the calculation time decreases as the number of CPU cores increases. The algorithm of this paper can promote the application of WGCNA on large data sets, and help other research fields to identify sub-networks in undirected scale-free hierarchical weighted networks. The source codes and usage are available at https://github.com/do-somethings-haha/multi-threaded_calculate_unsigned_TOM_from_unsigned_or_signed_Adjacency_Matrix_of_WGCNA.
AbstractList Biomolecular networks are often assumed to be scale-free hierarchical networks. The weighted gene co-expression network analysis (WGCNA) treats gene co-expression networks as undirected scale-free hierarchical weighted networks. The WGCNA R software package uses an Adjacency Matrix to store a network, next calculates the topological overlap matrix (TOM), and then identifies the modules (sub-networks), where each module is assumed to be associated with a certain biological function. The most time-consuming step of WGCNA is to calculate TOM from the Adjacency Matrix in a single thread. In this paper, the single-threaded algorithm of the TOM has been changed into a multi-threaded algorithm (the parameters are the default values of WGCNA). In the multi-threaded algorithm, Rcpp was used to make R call a C++ function, and then C++ used OpenMP to start multiple threads to calculate TOM from the Adjacency Matrix. On shared-memory MultiProcessor systems, the calculation time decreases as the number of CPU cores increases. The algorithm of this paper can promote the application of WGCNA on large data sets, and help other research fields to identify sub-networks in undirected scale-free hierarchical weighted networks. The source codes and usage are available at https://github.com/do-somethings-haha/multi-threaded_calculate_unsigned_TOM_from_unsigned_or_signed_Adjacency_Matrix_of_WGCNA.
Biomolecular networks are often assumed to be scale-free hierarchical networks. The weighted gene co-expression network analysis (WGCNA) treats gene co-expression networks as undirected scale-free hierarchical weighted networks. The WGCNA R software package uses an Adjacency Matrix to store a network, next calculates the topological overlap matrix (TOM), and then identifies the modules (sub-networks), where each module is assumed to be associated with a certain biological function. The most time-consuming step of WGCNA is to calculate TOM from the Adjacency Matrix in a single thread. In this paper, the single-threaded algorithm of the TOM has been changed into a multi-threaded algorithm (the parameters are the default values of WGCNA). In the multi-threaded algorithm, Rcpp was used to make R call a C++ function, and then C++ used OpenMP to start multiple threads to calculate TOM from the Adjacency Matrix. On shared-memory MultiProcessor systems, the calculation time decreases as the number of CPU cores increases. The algorithm of this paper can promote the application of WGCNA on large data sets, and help other research fields to identify sub-networks in undirected scale-free hierarchical weighted networks. The source codes and usage are available at https://github.com/do-somethings-haha/multi-threaded_calculate_unsigned_TOM_from_unsigned_or_signed_Adjacency_Matrix_of_WGCNA.Biomolecular networks are often assumed to be scale-free hierarchical networks. The weighted gene co-expression network analysis (WGCNA) treats gene co-expression networks as undirected scale-free hierarchical weighted networks. The WGCNA R software package uses an Adjacency Matrix to store a network, next calculates the topological overlap matrix (TOM), and then identifies the modules (sub-networks), where each module is assumed to be associated with a certain biological function. The most time-consuming step of WGCNA is to calculate TOM from the Adjacency Matrix in a single thread. In this paper, the single-threaded algorithm of the TOM has been changed into a multi-threaded algorithm (the parameters are the default values of WGCNA). In the multi-threaded algorithm, Rcpp was used to make R call a C++ function, and then C++ used OpenMP to start multiple threads to calculate TOM from the Adjacency Matrix. On shared-memory MultiProcessor systems, the calculation time decreases as the number of CPU cores increases. The algorithm of this paper can promote the application of WGCNA on large data sets, and help other research fields to identify sub-networks in undirected scale-free hierarchical weighted networks. The source codes and usage are available at https://github.com/do-somethings-haha/multi-threaded_calculate_unsigned_TOM_from_unsigned_or_signed_Adjacency_Matrix_of_WGCNA.
Author He, Dongmei
Shuai, Min
Chen, Xin
Author_xml – sequence: 1
  givenname: Min
  surname: Shuai
  fullname: Shuai, Min
  organization: Pharmacy College, Chengdu University of Traditional Chinese Medicine, Ministry of Education Key Laboratory of Standardization of Chinese Herbal Medicine, Key Laboratory of Systematic Research, Development and Utilization of Chinese Medicine Resources in Sichuan Province - Key Laboratory Breeding Base of Co-founded by Sichuan Province and MOST, Chengdu 611137, China
– sequence: 2
  givenname: Dongmei
  surname: He
  fullname: He, Dongmei
  organization: Center for Post-doctoral research, National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, State Key Laboratory Breeding Base of Dao-di Herbs, Beijing 100700, China
– sequence: 3
  givenname: Xin
  surname: Chen
  fullname: Chen, Xin
  organization: Pharmacy College, Chengdu University of Traditional Chinese Medicine, Ministry of Education Key Laboratory of Standardization of Chinese Herbal Medicine, Key Laboratory of Systematic Research, Development and Utilization of Chinese Medicine Resources in Sichuan Province - Key Laboratory Breeding Base of Co-founded by Sichuan Province and MOST, Chengdu 611137, China
BackLink https://www.ncbi.nlm.nih.gov/pubmed/34757703$$D View this record in MEDLINE/PubMed
BookMark eNpNkDtPwzAUhS1URB8wsyGPLAE_4qQdUcVLQuoCc2THN4nBsUPs0Bb-PEEUCZ3hHul85w5njibOO0DonJIrKqi4DrJuVcIIowkhTByhGRVpmmSUisk_P0XzEF5HgjJOTtCUp7nIc8Jn6GvTRdOaT-NqvAVTNxE0rsEBLn0Cu66HEIx32EHc-v4NSyftPpiAtyY2WOJ2sNEkselB6rFZSlsOVsafiq9wbABH33nrazNG2H9Ab2WHWxl7sztFx5W0Ac4Od4Fe7m6f1w_J0-b-cX3zlJR8SWNScV0yrUiWphUhlCogfDkmNOWgc7VUGdFE5yzViks2SmohyhyqUsrVaqXYAl3-_u16_z5AiEVrQgnWSgd-CAUTq4wwJng-ohcHdFAt6KLrTSv7ffE3GPsGuFJy4A
CitedBy_id crossref_primary_10_1186_s12870_022_03786_4
crossref_primary_10_3389_fimmu_2022_944286
crossref_primary_10_3389_fcell_2022_919637
crossref_primary_10_1186_s12870_025_06496_9
crossref_primary_10_3389_fphar_2025_1529525
crossref_primary_10_1007_s12035_025_04970_x
crossref_primary_10_3389_fmolb_2025_1637980
crossref_primary_10_3389_fcell_2022_901207
crossref_primary_10_1007_s43032_025_01917_4
crossref_primary_10_3389_fimmu_2025_1570903
crossref_primary_10_1007_s41060_024_00534_9
crossref_primary_10_3389_fmolb_2023_1182512
crossref_primary_10_1177_09287329251322278
crossref_primary_10_31083_j_fbl2908294
ContentType Journal Article
Copyright 2021 Walter de Gruyter GmbH, Berlin/Boston.
Copyright_xml – notice: 2021 Walter de Gruyter GmbH, Berlin/Boston.
DBID CGR
CUY
CVF
ECM
EIF
NPM
7X8
DOI 10.1515/sagmb-2021-0025
DatabaseName Medline
MEDLINE
MEDLINE (Ovid)
MEDLINE
MEDLINE
PubMed
MEDLINE - Academic
DatabaseTitle MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
MEDLINE - Academic
DatabaseTitleList MEDLINE
MEDLINE - Academic
Database_xml – sequence: 1
  dbid: NPM
  name: PubMed
  url: http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 2
  dbid: 7X8
  name: MEDLINE - Academic
  url: https://search.proquest.com/medline
  sourceTypes: Aggregation Database
DeliveryMethod no_fulltext_linktorsrc
Discipline Biology
EISSN 1544-6115
ExternalDocumentID 34757703
Genre Research Support, Non-U.S. Gov't
Journal Article
GroupedDBID ---
-~S
0R~
123
1WD
AAAEU
AAAVF
AACIX
AAFPC
AAGVJ
AAILP
AAKRG
AALGR
AAOWA
AAPJK
AAQCX
AASQH
AAWFC
AAXCG
AAXMT
ABABW
ABAOT
ABAQN
ABFKT
ABIQR
ABJNI
ABLVI
ABMIY
ABPLS
ABRDF
ABUVI
ABVMU
ABWLS
ABXMZ
ABYBW
ACDEB
ACEFL
ACGFO
ACGFS
ACHNZ
ACONX
ACPMA
ACZBO
ADEQT
ADGQD
ADGYE
ADOZN
AEDGQ
AEGVQ
AEICA
AEJQW
AEMOE
AENEX
AEQDQ
AEQLX
AERZL
AEXIE
AFBAA
AFBQV
AFCXV
AFQUK
AFYRI
AGBEV
AGQYU
AGWTP
AHCWZ
AHVWV
AHXUK
AIAGR
AIERV
AJATJ
AJPIC
AKXKS
ALMA_UNASSIGNED_HOLDINGS
ALUKF
ALWYM
ASYPN
BAKPI
BBCWN
BBDJO
BCIFA
CGR
CS3
CUY
CVF
DASCH
DBYYV
DU5
EBS
ECM
EIF
EMOBN
F5P
HZ~
IY9
J9A
K.~
KDIRW
MV1
NPM
NQBSW
O9-
P2P
QD8
SA.
SLJYH
T2Y
UK5
WTRAM
7X8
ABDRH
ACUND
ACYCL
ADNPR
AECWL
AFBDD
AIWOI
CKPZI
DSRVY
ID FETCH-LOGICAL-c381t-f3dc2db0644f0011be038381143ed7b8b60d0d724db3a2a2aad55c7efcaa999b2
IEDL.DBID 7X8
ISICitedReferencesCount 17
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000737352300003&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 1544-6115
IngestDate Fri Sep 05 08:43:10 EDT 2025
Wed Feb 19 02:26:48 EST 2025
IsPeerReviewed true
IsScholarly true
Issue 4-6
Keywords adjacency matrix
graph theory
parallel computing
weighted gene co-expression network
topological overlap matrix
Language English
License 2021 Walter de Gruyter GmbH, Berlin/Boston.
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c381t-f3dc2db0644f0011be038381143ed7b8b60d0d724db3a2a2aad55c7efcaa999b2
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
PMID 34757703
PQID 2596022537
PQPubID 23479
ParticipantIDs proquest_miscellaneous_2596022537
pubmed_primary_34757703
PublicationCentury 2000
PublicationDate 2021-12-31
PublicationDateYYYYMMDD 2021-12-31
PublicationDate_xml – month: 12
  year: 2021
  text: 2021-12-31
  day: 31
PublicationDecade 2020
PublicationPlace Germany
PublicationPlace_xml – name: Germany
PublicationTitle Statistical applications in genetics and molecular biology
PublicationTitleAlternate Stat Appl Genet Mol Biol
PublicationYear 2021
SSID ssj0021230
Score 2.3546762
Snippet Biomolecular networks are often assumed to be scale-free hierarchical networks. The weighted gene co-expression network analysis (WGCNA) treats gene...
SourceID proquest
pubmed
SourceType Aggregation Database
Index Database
StartPage 145
SubjectTerms Algorithms
Gene Regulatory Networks
Software
Title Optimizing weighted gene co-expression network analysis with a multi-threaded calculation of the topological overlap matrix
URI https://www.ncbi.nlm.nih.gov/pubmed/34757703
https://www.proquest.com/docview/2596022537
Volume 20
WOSCitedRecordID wos000737352300003&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText
inHoldings 1
isFullTextHit
isPrint
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV1LS8QwEA7qKnjx_VhfRPAato-kaU8i4uLFdQ8KeytpHrLgtqutuuqfd5Jm9SQIUugtIWQmM99MJvMhdBaGUmZG2XdbSUKoSilJ09AQDqaQKg2YjheObIIPBulolA19wq32ZZVzm-gMtaqkzZH3AKYn4G9YzM-nT8SyRtnbVU-hsYg6MUAZezD56PsWwVpl9yCSUQohUsh8ax9w4b1aPEwKUJEIgukgYr_jS-dn-uv_XeEGWvMIE1-0KrGJFnS5hVZazsn3bfR5C0ZiMv4Al4XfXF5UKwxapLGsiJ75utgSl219OBa-awm2GVsssKtAJA2ogFAwEkQsPQMYrgwGOImblnfBSh_bAtFHMcUTywQw20H3_au7y2viGRiIBE_eEBMrGakCYAs1FjwWOoCINoUYKtaKF2mRBCpQPKKqiEUEn1CMSa6NFAKQZxHtoqWyKvU-whCnwpxCCZA_DUSSMc7BTzBqssBQTbvodL6rOWi4vbYQpa5e6vxnX7torxVNPm1bceQx5TBPEB_8YfQhWnXydl0aj1DHwPnWx2hZvjbj-vnEqQ78B8ObLweL0QI
linkProvider ProQuest
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Optimizing+weighted+gene+co-expression+network+analysis+with+a+multi-threaded+calculation+of+the+topological+overlap+matrix&rft.jtitle=Statistical+applications+in+genetics+and+molecular+biology&rft.au=Shuai%2C+Min&rft.au=He%2C+Dongmei&rft.au=Chen%2C+Xin&rft.date=2021-12-31&rft.issn=1544-6115&rft.eissn=1544-6115&rft.volume=20&rft.issue=4-6&rft.spage=145&rft_id=info:doi/10.1515%2Fsagmb-2021-0025&rft.externalDBID=NO_FULL_TEXT
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1544-6115&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1544-6115&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1544-6115&client=summon