Optimizing weighted gene co-expression network analysis with a multi-threaded calculation of the topological overlap matrix
Biomolecular networks are often assumed to be scale-free hierarchical networks. The weighted gene co-expression network analysis (WGCNA) treats gene co-expression networks as undirected scale-free hierarchical weighted networks. The WGCNA R software package uses an Adjacency Matrix to store a networ...
Uloženo v:
| Vydáno v: | Statistical applications in genetics and molecular biology Ročník 20; číslo 4-6; s. 145 |
|---|---|
| Hlavní autoři: | , , |
| Médium: | Journal Article |
| Jazyk: | angličtina |
| Vydáno: |
Germany
31.12.2021
|
| Témata: | |
| ISSN: | 1544-6115, 1544-6115 |
| On-line přístup: | Zjistit podrobnosti o přístupu |
| Tagy: |
Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
|
| Abstract | Biomolecular networks are often assumed to be scale-free hierarchical networks. The weighted gene co-expression network analysis (WGCNA) treats gene co-expression networks as undirected scale-free hierarchical weighted networks. The WGCNA R software package uses an Adjacency Matrix to store a network, next calculates the topological overlap matrix (TOM), and then identifies the modules (sub-networks), where each module is assumed to be associated with a certain biological function. The most time-consuming step of WGCNA is to calculate TOM from the Adjacency Matrix in a single thread. In this paper, the single-threaded algorithm of the TOM has been changed into a multi-threaded algorithm (the parameters are the default values of WGCNA). In the multi-threaded algorithm, Rcpp was used to make R call a C++ function, and then C++ used OpenMP to start multiple threads to calculate TOM from the Adjacency Matrix. On shared-memory MultiProcessor systems, the calculation time decreases as the number of CPU cores increases. The algorithm of this paper can promote the application of WGCNA on large data sets, and help other research fields to identify sub-networks in undirected scale-free hierarchical weighted networks. The source codes and usage are available at https://github.com/do-somethings-haha/multi-threaded_calculate_unsigned_TOM_from_unsigned_or_signed_Adjacency_Matrix_of_WGCNA. |
|---|---|
| AbstractList | Biomolecular networks are often assumed to be scale-free hierarchical networks. The weighted gene co-expression network analysis (WGCNA) treats gene co-expression networks as undirected scale-free hierarchical weighted networks. The WGCNA R software package uses an Adjacency Matrix to store a network, next calculates the topological overlap matrix (TOM), and then identifies the modules (sub-networks), where each module is assumed to be associated with a certain biological function. The most time-consuming step of WGCNA is to calculate TOM from the Adjacency Matrix in a single thread. In this paper, the single-threaded algorithm of the TOM has been changed into a multi-threaded algorithm (the parameters are the default values of WGCNA). In the multi-threaded algorithm, Rcpp was used to make R call a C++ function, and then C++ used OpenMP to start multiple threads to calculate TOM from the Adjacency Matrix. On shared-memory MultiProcessor systems, the calculation time decreases as the number of CPU cores increases. The algorithm of this paper can promote the application of WGCNA on large data sets, and help other research fields to identify sub-networks in undirected scale-free hierarchical weighted networks. The source codes and usage are available at https://github.com/do-somethings-haha/multi-threaded_calculate_unsigned_TOM_from_unsigned_or_signed_Adjacency_Matrix_of_WGCNA. Biomolecular networks are often assumed to be scale-free hierarchical networks. The weighted gene co-expression network analysis (WGCNA) treats gene co-expression networks as undirected scale-free hierarchical weighted networks. The WGCNA R software package uses an Adjacency Matrix to store a network, next calculates the topological overlap matrix (TOM), and then identifies the modules (sub-networks), where each module is assumed to be associated with a certain biological function. The most time-consuming step of WGCNA is to calculate TOM from the Adjacency Matrix in a single thread. In this paper, the single-threaded algorithm of the TOM has been changed into a multi-threaded algorithm (the parameters are the default values of WGCNA). In the multi-threaded algorithm, Rcpp was used to make R call a C++ function, and then C++ used OpenMP to start multiple threads to calculate TOM from the Adjacency Matrix. On shared-memory MultiProcessor systems, the calculation time decreases as the number of CPU cores increases. The algorithm of this paper can promote the application of WGCNA on large data sets, and help other research fields to identify sub-networks in undirected scale-free hierarchical weighted networks. The source codes and usage are available at https://github.com/do-somethings-haha/multi-threaded_calculate_unsigned_TOM_from_unsigned_or_signed_Adjacency_Matrix_of_WGCNA.Biomolecular networks are often assumed to be scale-free hierarchical networks. The weighted gene co-expression network analysis (WGCNA) treats gene co-expression networks as undirected scale-free hierarchical weighted networks. The WGCNA R software package uses an Adjacency Matrix to store a network, next calculates the topological overlap matrix (TOM), and then identifies the modules (sub-networks), where each module is assumed to be associated with a certain biological function. The most time-consuming step of WGCNA is to calculate TOM from the Adjacency Matrix in a single thread. In this paper, the single-threaded algorithm of the TOM has been changed into a multi-threaded algorithm (the parameters are the default values of WGCNA). In the multi-threaded algorithm, Rcpp was used to make R call a C++ function, and then C++ used OpenMP to start multiple threads to calculate TOM from the Adjacency Matrix. On shared-memory MultiProcessor systems, the calculation time decreases as the number of CPU cores increases. The algorithm of this paper can promote the application of WGCNA on large data sets, and help other research fields to identify sub-networks in undirected scale-free hierarchical weighted networks. The source codes and usage are available at https://github.com/do-somethings-haha/multi-threaded_calculate_unsigned_TOM_from_unsigned_or_signed_Adjacency_Matrix_of_WGCNA. |
| Author | He, Dongmei Shuai, Min Chen, Xin |
| Author_xml | – sequence: 1 givenname: Min surname: Shuai fullname: Shuai, Min organization: Pharmacy College, Chengdu University of Traditional Chinese Medicine, Ministry of Education Key Laboratory of Standardization of Chinese Herbal Medicine, Key Laboratory of Systematic Research, Development and Utilization of Chinese Medicine Resources in Sichuan Province - Key Laboratory Breeding Base of Co-founded by Sichuan Province and MOST, Chengdu 611137, China – sequence: 2 givenname: Dongmei surname: He fullname: He, Dongmei organization: Center for Post-doctoral research, National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, State Key Laboratory Breeding Base of Dao-di Herbs, Beijing 100700, China – sequence: 3 givenname: Xin surname: Chen fullname: Chen, Xin organization: Pharmacy College, Chengdu University of Traditional Chinese Medicine, Ministry of Education Key Laboratory of Standardization of Chinese Herbal Medicine, Key Laboratory of Systematic Research, Development and Utilization of Chinese Medicine Resources in Sichuan Province - Key Laboratory Breeding Base of Co-founded by Sichuan Province and MOST, Chengdu 611137, China |
| BackLink | https://www.ncbi.nlm.nih.gov/pubmed/34757703$$D View this record in MEDLINE/PubMed |
| BookMark | eNpNkDtPwzAUhS1URB8wsyGPLAE_4qQdUcVLQuoCc2THN4nBsUPs0Bb-PEEUCZ3hHul85w5njibOO0DonJIrKqi4DrJuVcIIowkhTByhGRVpmmSUisk_P0XzEF5HgjJOTtCUp7nIc8Jn6GvTRdOaT-NqvAVTNxE0rsEBLn0Cu66HEIx32EHc-v4NSyftPpiAtyY2WOJ2sNEkselB6rFZSlsOVsafiq9wbABH33nrazNG2H9Ab2WHWxl7sztFx5W0Ac4Od4Fe7m6f1w_J0-b-cX3zlJR8SWNScV0yrUiWphUhlCogfDkmNOWgc7VUGdFE5yzViks2SmohyhyqUsrVaqXYAl3-_u16_z5AiEVrQgnWSgd-CAUTq4wwJng-ohcHdFAt6KLrTSv7ffE3GPsGuFJy4A |
| CitedBy_id | crossref_primary_10_1186_s12870_022_03786_4 crossref_primary_10_3389_fimmu_2022_944286 crossref_primary_10_3389_fcell_2022_919637 crossref_primary_10_1186_s12870_025_06496_9 crossref_primary_10_3389_fphar_2025_1529525 crossref_primary_10_1007_s12035_025_04970_x crossref_primary_10_3389_fmolb_2025_1637980 crossref_primary_10_3389_fcell_2022_901207 crossref_primary_10_1007_s43032_025_01917_4 crossref_primary_10_3389_fimmu_2025_1570903 crossref_primary_10_1007_s41060_024_00534_9 crossref_primary_10_3389_fmolb_2023_1182512 crossref_primary_10_1177_09287329251322278 crossref_primary_10_31083_j_fbl2908294 |
| ContentType | Journal Article |
| Copyright | 2021 Walter de Gruyter GmbH, Berlin/Boston. |
| Copyright_xml | – notice: 2021 Walter de Gruyter GmbH, Berlin/Boston. |
| DBID | CGR CUY CVF ECM EIF NPM 7X8 |
| DOI | 10.1515/sagmb-2021-0025 |
| DatabaseName | Medline MEDLINE MEDLINE (Ovid) MEDLINE MEDLINE PubMed MEDLINE - Academic |
| DatabaseTitle | MEDLINE Medline Complete MEDLINE with Full Text PubMed MEDLINE (Ovid) MEDLINE - Academic |
| DatabaseTitleList | MEDLINE MEDLINE - Academic |
| Database_xml | – sequence: 1 dbid: NPM name: PubMed url: http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 2 dbid: 7X8 name: MEDLINE - Academic url: https://search.proquest.com/medline sourceTypes: Aggregation Database |
| DeliveryMethod | no_fulltext_linktorsrc |
| Discipline | Biology |
| EISSN | 1544-6115 |
| ExternalDocumentID | 34757703 |
| Genre | Research Support, Non-U.S. Gov't Journal Article |
| GroupedDBID | --- -~S 0R~ 123 1WD AAAEU AAAVF AACIX AAFPC AAGVJ AAILP AAKRG AALGR AAOWA AAPJK AAQCX AASQH AAWFC AAXCG AAXMT ABABW ABAOT ABAQN ABFKT ABIQR ABJNI ABLVI ABMIY ABPLS ABRDF ABUVI ABVMU ABWLS ABXMZ ABYBW ACDEB ACEFL ACGFO ACGFS ACHNZ ACONX ACPMA ACZBO ADEQT ADGQD ADGYE ADOZN AEDGQ AEGVQ AEICA AEJQW AEMOE AENEX AEQDQ AEQLX AERZL AEXIE AFBAA AFBQV AFCXV AFQUK AFYRI AGBEV AGQYU AGWTP AHCWZ AHVWV AHXUK AIAGR AIERV AJATJ AJPIC AKXKS ALMA_UNASSIGNED_HOLDINGS ALUKF ALWYM ASYPN BAKPI BBCWN BBDJO BCIFA CGR CS3 CUY CVF DASCH DBYYV DU5 EBS ECM EIF EMOBN F5P HZ~ IY9 J9A K.~ KDIRW MV1 NPM NQBSW O9- P2P QD8 SA. SLJYH T2Y UK5 WTRAM 7X8 ABDRH ACUND ACYCL ADNPR AECWL AFBDD AIWOI CKPZI DSRVY |
| ID | FETCH-LOGICAL-c381t-f3dc2db0644f0011be038381143ed7b8b60d0d724db3a2a2aad55c7efcaa999b2 |
| IEDL.DBID | 7X8 |
| ISICitedReferencesCount | 17 |
| ISICitedReferencesURI | http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000737352300003&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| ISSN | 1544-6115 |
| IngestDate | Fri Sep 05 08:43:10 EDT 2025 Wed Feb 19 02:26:48 EST 2025 |
| IsPeerReviewed | true |
| IsScholarly | true |
| Issue | 4-6 |
| Keywords | adjacency matrix graph theory parallel computing weighted gene co-expression network topological overlap matrix |
| Language | English |
| License | 2021 Walter de Gruyter GmbH, Berlin/Boston. |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-c381t-f3dc2db0644f0011be038381143ed7b8b60d0d724db3a2a2aad55c7efcaa999b2 |
| Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
| PMID | 34757703 |
| PQID | 2596022537 |
| PQPubID | 23479 |
| ParticipantIDs | proquest_miscellaneous_2596022537 pubmed_primary_34757703 |
| PublicationCentury | 2000 |
| PublicationDate | 2021-12-31 |
| PublicationDateYYYYMMDD | 2021-12-31 |
| PublicationDate_xml | – month: 12 year: 2021 text: 2021-12-31 day: 31 |
| PublicationDecade | 2020 |
| PublicationPlace | Germany |
| PublicationPlace_xml | – name: Germany |
| PublicationTitle | Statistical applications in genetics and molecular biology |
| PublicationTitleAlternate | Stat Appl Genet Mol Biol |
| PublicationYear | 2021 |
| SSID | ssj0021230 |
| Score | 2.3546762 |
| Snippet | Biomolecular networks are often assumed to be scale-free hierarchical networks. The weighted gene co-expression network analysis (WGCNA) treats gene... |
| SourceID | proquest pubmed |
| SourceType | Aggregation Database Index Database |
| StartPage | 145 |
| SubjectTerms | Algorithms Gene Regulatory Networks Software |
| Title | Optimizing weighted gene co-expression network analysis with a multi-threaded calculation of the topological overlap matrix |
| URI | https://www.ncbi.nlm.nih.gov/pubmed/34757703 https://www.proquest.com/docview/2596022537 |
| Volume | 20 |
| WOSCitedRecordID | wos000737352300003&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| hasFullText | |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV1LS8QwEA7qKnjx_VhfRPAato-kaU8i4uLFdQ8KeytpHrLgtqutuuqfd5Jm9SQIUugtIWQmM99MJvMhdBaGUmZG2XdbSUKoSilJ09AQDqaQKg2YjheObIIPBulolA19wq32ZZVzm-gMtaqkzZH3AKYn4G9YzM-nT8SyRtnbVU-hsYg6MUAZezD56PsWwVpl9yCSUQohUsh8ax9w4b1aPEwKUJEIgukgYr_jS-dn-uv_XeEGWvMIE1-0KrGJFnS5hVZazsn3bfR5C0ZiMv4Al4XfXF5UKwxapLGsiJ75utgSl219OBa-awm2GVsssKtAJA2ogFAwEkQsPQMYrgwGOImblnfBSh_bAtFHMcUTywQw20H3_au7y2viGRiIBE_eEBMrGakCYAs1FjwWOoCINoUYKtaKF2mRBCpQPKKqiEUEn1CMSa6NFAKQZxHtoqWyKvU-whCnwpxCCZA_DUSSMc7BTzBqssBQTbvodL6rOWi4vbYQpa5e6vxnX7torxVNPm1bceQx5TBPEB_8YfQhWnXydl0aj1DHwPnWx2hZvjbj-vnEqQ78B8ObLweL0QI |
| linkProvider | ProQuest |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Optimizing+weighted+gene+co-expression+network+analysis+with+a+multi-threaded+calculation+of+the+topological+overlap+matrix&rft.jtitle=Statistical+applications+in+genetics+and+molecular+biology&rft.au=Shuai%2C+Min&rft.au=He%2C+Dongmei&rft.au=Chen%2C+Xin&rft.date=2021-12-31&rft.issn=1544-6115&rft.eissn=1544-6115&rft.volume=20&rft.issue=4-6&rft.spage=145&rft_id=info:doi/10.1515%2Fsagmb-2021-0025&rft.externalDBID=NO_FULL_TEXT |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1544-6115&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1544-6115&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1544-6115&client=summon |