Statistical significance in high-dimensional linear mixed models

This paper concerns the development of an inferential framework for high-dimensional linear mixed effect models. These are suitable models, for instance, when we have repeated measurements for subjects. We consider a scenario where the number of fixed effects is large (and may be larger than ), but...

Celý popis

Uložené v:
Podrobná bibliografia
Vydané v:FODS '20 : proceedings of the 2020 ACM-IMS Foundations of Data Science Conference : October 19-20, 2020, Virtual Event, USA. ACM-IMS Foundations of Data Science Conference (2020 : Online) Ročník 2020; s. 171
Hlavní autori: Lin, Lina, Drton, Mathias, Shojaie, Ali
Médium: Journal Article
Jazyk:English
Vydavateľské údaje: United States 01.10.2020
On-line prístup:Zistit podrobnosti o prístupe
Tagy: Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
Abstract This paper concerns the development of an inferential framework for high-dimensional linear mixed effect models. These are suitable models, for instance, when we have repeated measurements for subjects. We consider a scenario where the number of fixed effects is large (and may be larger than ), but the number of random effects is small. Our framework is inspired by a recent line of work that proposes de-biasing penalized estimators to perform inference for high-dimensional linear models with fixed effects only. In particular, we demonstrate how to correct a 'naive' ridge estimator in extension of work by Bühlmann (2013) to build asymptotically valid confidence intervals for mixed effect models. We validate our theoretical results with numerical experiments, in which we show our method outperforms those that fail to account for correlation induced by the random effects. For a practical demonstration we consider a riboflavin production dataset that exhibits group structure, and show that conclusions drawn using our method are consistent with those obtained on a similar dataset without group structure.
AbstractList This paper concerns the development of an inferential framework for high-dimensional linear mixed effect models. These are suitable models, for instance, when we have repeated measurements for subjects. We consider a scenario where the number of fixed effects is large (and may be larger than ), but the number of random effects is small. Our framework is inspired by a recent line of work that proposes de-biasing penalized estimators to perform inference for high-dimensional linear models with fixed effects only. In particular, we demonstrate how to correct a 'naive' ridge estimator in extension of work by Bühlmann (2013) to build asymptotically valid confidence intervals for mixed effect models. We validate our theoretical results with numerical experiments, in which we show our method outperforms those that fail to account for correlation induced by the random effects. For a practical demonstration we consider a riboflavin production dataset that exhibits group structure, and show that conclusions drawn using our method are consistent with those obtained on a similar dataset without group structure.
This paper concerns the development of an inferential framework for high-dimensional linear mixed effect models. These are suitable models, for instance, when we have n repeated measurements for M subjects. We consider a scenario where the number of fixed effects p is large (and may be larger than M), but the number of random effects q is small. Our framework is inspired by a recent line of work that proposes de-biasing penalized estimators to perform inference for high-dimensional linear models with fixed effects only. In particular, we demonstrate how to correct a 'naive' ridge estimator in extension of work by Bühlmann (2013) to build asymptotically valid confidence intervals for mixed effect models. We validate our theoretical results with numerical experiments, in which we show our method outperforms those that fail to account for correlation induced by the random effects. For a practical demonstration we consider a riboflavin production dataset that exhibits group structure, and show that conclusions drawn using our method are consistent with those obtained on a similar dataset without group structure.This paper concerns the development of an inferential framework for high-dimensional linear mixed effect models. These are suitable models, for instance, when we have n repeated measurements for M subjects. We consider a scenario where the number of fixed effects p is large (and may be larger than M), but the number of random effects q is small. Our framework is inspired by a recent line of work that proposes de-biasing penalized estimators to perform inference for high-dimensional linear models with fixed effects only. In particular, we demonstrate how to correct a 'naive' ridge estimator in extension of work by Bühlmann (2013) to build asymptotically valid confidence intervals for mixed effect models. We validate our theoretical results with numerical experiments, in which we show our method outperforms those that fail to account for correlation induced by the random effects. For a practical demonstration we consider a riboflavin production dataset that exhibits group structure, and show that conclusions drawn using our method are consistent with those obtained on a similar dataset without group structure.
Author Drton, Mathias
Shojaie, Ali
Lin, Lina
Author_xml – sequence: 1
  givenname: Lina
  surname: Lin
  fullname: Lin, Lina
  organization: Department of Statistics, University of Washington
– sequence: 2
  givenname: Mathias
  surname: Drton
  fullname: Drton, Mathias
  organization: Department of Mathematics, Technical University of Munich
– sequence: 3
  givenname: Ali
  surname: Shojaie
  fullname: Shojaie, Ali
  organization: Department of Biostatistics, University of Washington
BackLink https://www.ncbi.nlm.nih.gov/pubmed/35497571$$D View this record in MEDLINE/PubMed
BookMark eNo1jztPwzAURj2AeBRmNpSRJSX32nl4A1UUkCoxAHPkx3VryXZKnErw74lEmc43HB3pu2QnaUjE2A1USwBR33MB2EG9nNl0HT9j57wWsq1buGAP75OafJ68UaHIfpu8m2cyVPhU7Px2V1ofKWU_pFkIPpEai-i_yRZxsBTyFTt1KmS6PnLBPtdPH6uXcvP2_Lp63JQKpeClrUBb54zTqJGTbBFRCeqAkwUFqlICWlvLShttpBatNsZJJdEBR1kJXLC7v-5-HL4OlKc--mwoBJVoOOQem7prBDZQzertUT3oSLbfjz6q8af_f42_0upUrQ
ContentType Journal Article
DBID NPM
7X8
DOI 10.1145/3412815.3416883
DatabaseName PubMed
MEDLINE - Academic
DatabaseTitle PubMed
MEDLINE - Academic
DatabaseTitleList PubMed
MEDLINE - Academic
Database_xml – sequence: 1
  dbid: NPM
  name: PubMed
  url: http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 2
  dbid: 7X8
  name: MEDLINE - Academic
  url: https://search.proquest.com/medline
  sourceTypes: Aggregation Database
DeliveryMethod no_fulltext_linktorsrc
ExternalDocumentID 35497571
Genre Journal Article
GrantInformation_xml – fundername: NIGMS NIH HHS
  grantid: R01 GM133848
– fundername: NIGMS NIH HHS
  grantid: R01 GM114029
GroupedDBID NPM
7X8
ID FETCH-LOGICAL-a2943-d01bdffcfb2b23e97222a4e813ed1a1a0a417d590bcbc9b47bccf9a92f1329042
IEDL.DBID 7X8
IngestDate Fri Jul 11 01:28:07 EDT 2025
Thu Jan 02 22:54:11 EST 2025
IsDoiOpenAccess false
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-a2943-d01bdffcfb2b23e97222a4e813ed1a1a0a417d590bcbc9b47bccf9a92f1329042
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
OpenAccessLink https://www.ncbi.nlm.nih.gov/pmc/articles/9053448
PMID 35497571
PQID 2658642610
PQPubID 23479
ParticipantIDs proquest_miscellaneous_2658642610
pubmed_primary_35497571
PublicationCentury 2000
PublicationDate 20201001
PublicationDateYYYYMMDD 2020-10-01
PublicationDate_xml – month: 10
  year: 2020
  text: 20201001
  day: 1
PublicationDecade 2020
PublicationPlace United States
PublicationPlace_xml – name: United States
PublicationTitle FODS '20 : proceedings of the 2020 ACM-IMS Foundations of Data Science Conference : October 19-20, 2020, Virtual Event, USA. ACM-IMS Foundations of Data Science Conference (2020 : Online)
PublicationTitleAlternate FODS 20 (2020)
PublicationYear 2020
Score 1.7767636
Snippet This paper concerns the development of an inferential framework for high-dimensional linear mixed effect models. These are suitable models, for instance, when...
SourceID proquest
pubmed
SourceType Aggregation Database
Index Database
StartPage 171
Title Statistical significance in high-dimensional linear mixed models
URI https://www.ncbi.nlm.nih.gov/pubmed/35497571
https://www.proquest.com/docview/2658642610
Volume 2020
hasFullText
inHoldings 1
isFullTextHit
isPrint
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV1LS8QwEA7qevDiA1_riwhe4zZp0jQnFXHx4rIHhb2VPGEPdl8o_nxn2i6eBMFLcygtYWYy8yUZvo-QG2nArYJH5hL3DCIkMIiSgkFtsClLKvEkG7EJPRqVk4kZdwduq66tcp0Tm0QdZh7PyAcCSmWBeD-7my8Yqkbh7WonobFJejlAGYxqPSk7Bh8u1QBytCi5uoWxKJEa8DcY2ZST4d5_J7JPdjsgSR9azx-QjVgfknvEjg31MrzC1gxsBEK_0mlNkZmYBWTzb5k4KCJMu6Tv068YaCOJszoib8On18dn1mkkMCuMzFnIuAsp-eSEE3k0Guq9lbHkeQzccptZyXVQJnPeeeOkdt4nY41IqDAPK_aYbNWzOp4SaqVXVsE_bHIymszE3FopQhmCL4T3fXK9NkgFMYgXC7aOs49V9WOSPjlprVrNW7KMKocNqFaan_3h63OyI3A72_TKXZBeghUYL8m2_wTTLa8a58JzNH75BpX0su8
linkProvider ProQuest
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Statistical+significance+in+high-dimensional+linear+mixed+models&rft.jtitle=FODS+%2720+%3A+proceedings+of+the+2020+ACM-IMS+Foundations+of+Data+Science+Conference+%3A+October+19-20%2C+2020%2C+Virtual+Event%2C+USA.+ACM-IMS+Foundations+of+Data+Science+Conference+%282020+%3A+Online%29&rft.au=Lin%2C+Lina&rft.au=Drton%2C+Mathias&rft.au=Shojaie%2C+Ali&rft.date=2020-10-01&rft.volume=2020&rft.spage=171&rft_id=info:doi/10.1145%2F3412815.3416883&rft.externalDBID=NO_FULL_TEXT