Evaluating the Reproducibility of Single-Cell Gene Regulatory Network Inference Algorithms

Networks are powerful tools to represent and investigate biological systems. The development of algorithms inferring regulatory interactions from functional genomics data has been an active area of research. With the advent of single-cell RNA-seq data (scRNA-seq), numerous methods specifically desig...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:Frontiers in genetics Ročník 12; s. 617282
Hlavní autoři: Kang, Yoonjee, Thieffry, Denis, Cantini, Laura
Médium: Journal Article
Jazyk:angličtina
Vydáno: Switzerland Frontiers Media 22.03.2021
Frontiers Media S.A
Témata:
ISSN:1664-8021, 1664-8021
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:Networks are powerful tools to represent and investigate biological systems. The development of algorithms inferring regulatory interactions from functional genomics data has been an active area of research. With the advent of single-cell RNA-seq data (scRNA-seq), numerous methods specifically designed to take advantage of single-cell datasets have been proposed. However, published benchmarks on single-cell network inference are mostly based on simulated data. Once applied to real data, these benchmarks take into account only a small set of genes and only compare the inferred networks with an imposed ground-truth. Here, we benchmark six single-cell network inference methods based on their reproducibility, i.e., their ability to infer similar networks when applied to two independent datasets for the same biological condition. We tested each of these methods on real data from three biological conditions: human retina, T-cells in colorectal cancer, and human hematopoiesis. Once taking into account networks with up to 100,000 links, GENIE3 results to be the most reproducible algorithm and, together with GRNBoost2, show higher intersection with ground-truth biological interactions. These results are independent from the single-cell sequencing platform, the cell type annotation system and the number of cells constituting the dataset. Finally, GRNBoost2 and CLR show more reproducible performance once a more stringent thresholding is applied to the networks (1,000–100 links). In order to ensure the reproducibility and ease extensions of this benchmark study, we implemented all the analyses in scNET, a Jupyter notebook available at https://github.com/ComputationalSystemsBiology/scNET .
Bibliografie:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
Edited by: Marieke Lydia Kuijjer, Centre for Molecular Medicine Norway, Faculty of Medicine, University of Oslo, Norway
This article was submitted to Computational Genomics, a section of the journal Frontiers in Genetics
Reviewed by: Van Anh Huynh-Thu, University of Liège, Belgium; Rudiyanto Gunawan, University at Buffalo, United States
ISSN:1664-8021
1664-8021
DOI:10.3389/fgene.2021.617282