Standardized and accessible multi-omics bioinformatics workflows through the NMDC EDGE resource

Bibliographic details
Published in: Computational and Structural Biotechnology Journal, Vol. 23, pp. 3575-3583
Main authors: Kelliher, Julia M., Xu, Yan, Flynn, Mark C., Babinski, Michal, Canon, Shane, Cavanna, Eric, Clum, Alicia, Corilo, Yuri E., Fujimoto, Grant, Giberson, Cameron, Johnson, Leah Y.D., Li, Kaitlyn J., Li, Po-E, Li, Valerie, Lo, Chien-Chi, Lynch, Wendi, Piehowski, Paul, Prime, Kaelan, Purvine, Samuel, Rodriguez, Francisca, Roux, Simon, Shakya, Migun, Smith, Montana, Sarrafan, Setareh, Cholia, Shreyas, McCue, Lee Ann, Mungall, Chris, Hu, Bin, Eloe-Fadrosh, Emiley A., Chain, Patrick S.G.
Format: Journal Article
Language: English
Published: Netherlands, Elsevier B.V., 1 December 2024
Elsevier
Research Network of Computational and Structural Biotechnology
ISSN: 2001-0370
Description
Summary: Accessible and easy-to-use standardized bioinformatics workflows are necessary to advance microbiome research from observational studies to large-scale, data-driven approaches. Standardized multi-omics data enables comparative studies, data reuse, and applications of machine learning to model biological processes. To advance broad accessibility of standardized multi-omics bioinformatics workflows, the National Microbiome Data Collaborative (NMDC) has developed the Empowering the Development of Genomics Expertise (NMDC EDGE) resource, a user-friendly, open-source web application (https://nmdc-edge.org). Here, we describe the design and main functionality of the NMDC EDGE resource for processing metagenome, metatranscriptome, natural organic matter, and metaproteome data. The architecture relies on three main layers (web application, orchestration, and execution) to ensure flexibility and expansion to future workflows. The orchestration and execution layers leverage best practices in software containers and accommodate high-performance computing and cloud computing services. Further, we have adopted a robust user research process to collect feedback for continuous improvement of the resource. NMDC EDGE provides an accessible interface for researchers to process multi-omics microbiome data using production-quality workflows to facilitate improved data standardization and interoperability.
• NMDC EDGE is a resource for accessible, standardized microbiome multi-omics workflows.
• Layered software architecture ensures flexibility and enables updates to workflows.
• Feedback is collected through user research efforts to improve the resource.
Notes:
Contract and award numbers: AC05-76RL01830; 89233218CNA000001; AC02-05CH11231; 2138259; 2138286; 2138307; 2137603; 2138296
Report numbers: PNNL-SA-200312; LA-UR-24-26600
National Science Foundation (NSF)
USDOE Office of Science (SC), Biological and Environmental Research (BER). Biological Systems Science (BSS)
USDOE Office of Science (SC), Biological and Environmental Research (BER)
DOI: 10.1016/j.csbj.2024.09.018