Flexible statistical methods for estimating and testing effects in genomic studies with multiple conditions

We introduce new statistical methods for analyzing genomic data sets that measure many effects in many conditions (for example, gene expression changes under many treatments). These new methods improve on existing methods by allowing for arbitrary correlations in effect sizes among conditions. This...

Celý popis

Uložené v:
Podrobná bibliografia
Vydané v:Nature genetics Ročník 51; číslo 1; s. 187 - 195
Hlavní autori: Urbut, Sarah M., Wang, Gao, Carbonetto, Peter, Stephens, Matthew
Médium: Journal Article
Jazyk:English
Vydavateľské údaje: New York Nature Publishing Group US 01.01.2019
Nature Publishing Group
Predmet:
ISSN:1061-4036, 1546-1718, 1546-1718
On-line prístup:Získať plný text
Tagy: Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
Popis
Shrnutí:We introduce new statistical methods for analyzing genomic data sets that measure many effects in many conditions (for example, gene expression changes under many treatments). These new methods improve on existing methods by allowing for arbitrary correlations in effect sizes among conditions. This flexible approach increases power, improves effect estimates and allows for more quantitative assessments of effect-size heterogeneity compared to simple shared or condition-specific assessments. We illustrate these features through an analysis of locally acting variants associated with gene expression (cis expression quantitative trait loci (eQTLs)) in 44 human tissues. Our analysis identifies more eQTLs than existing approaches, consistent with improved power. We show that although genetic effects on expression are extensively shared among tissues, effect sizes can still vary greatly among tissues. Some shared eQTLs show stronger effects in subsets of biologically related tissues (for example, brain-related tissues), or in only one tissue (for example, testis). Our methods are widely applicable, computationally tractable for many conditions and available online. Multivariate adaptive shrinkage (mash) is a method for estimating and testing multiple effects in multiple conditions. When applied to GTEx data, mash can be used to analyze sharing of eQTL effects by examining variation in effect sizes.
Bibliografia:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
Author contributions S.M.U. and M.S. conceived of the project and developed the statistical methods. S.M.U. implemented the comparisons with simulated data. S.M.U. and G.W. performed the analyses of the GTEx data, and additional analyses. S.M.U., G.W. and M.S. implemented the software, with contributions from P.C. S.M.U. and M.S. wrote the manuscript, with input from G.W. and P.C. P.C. and G.W. prepared the online code and data resources.
ISSN:1061-4036
1546-1718
1546-1718
DOI:10.1038/s41588-018-0268-8