Flexible statistical methods for estimating and testing effects in genomic studies with multiple conditions

We introduce new statistical methods for analyzing genomic data sets that measure many effects in many conditions (for example, gene expression changes under many treatments). These new methods improve on existing methods by allowing for arbitrary correlations in effect sizes among conditions. This...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:Nature genetics Ročník 51; číslo 1; s. 187 - 195
Hlavní autoři: Urbut, Sarah M., Wang, Gao, Carbonetto, Peter, Stephens, Matthew
Médium: Journal Article
Jazyk:angličtina
Vydáno: New York Nature Publishing Group US 01.01.2019
Nature Publishing Group
Témata:
ISSN:1061-4036, 1546-1718, 1546-1718
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:We introduce new statistical methods for analyzing genomic data sets that measure many effects in many conditions (for example, gene expression changes under many treatments). These new methods improve on existing methods by allowing for arbitrary correlations in effect sizes among conditions. This flexible approach increases power, improves effect estimates and allows for more quantitative assessments of effect-size heterogeneity compared to simple shared or condition-specific assessments. We illustrate these features through an analysis of locally acting variants associated with gene expression (cis expression quantitative trait loci (eQTLs)) in 44 human tissues. Our analysis identifies more eQTLs than existing approaches, consistent with improved power. We show that although genetic effects on expression are extensively shared among tissues, effect sizes can still vary greatly among tissues. Some shared eQTLs show stronger effects in subsets of biologically related tissues (for example, brain-related tissues), or in only one tissue (for example, testis). Our methods are widely applicable, computationally tractable for many conditions and available online. Multivariate adaptive shrinkage (mash) is a method for estimating and testing multiple effects in multiple conditions. When applied to GTEx data, mash can be used to analyze sharing of eQTL effects by examining variation in effect sizes.
Bibliografie:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
Author contributions S.M.U. and M.S. conceived of the project and developed the statistical methods. S.M.U. implemented the comparisons with simulated data. S.M.U. and G.W. performed the analyses of the GTEx data, and additional analyses. S.M.U., G.W. and M.S. implemented the software, with contributions from P.C. S.M.U. and M.S. wrote the manuscript, with input from G.W. and P.C. P.C. and G.W. prepared the online code and data resources.
ISSN:1061-4036
1546-1718
1546-1718
DOI:10.1038/s41588-018-0268-8