xtgeebcv: A command for bias-corrected sandwich variance estimation for GEE analyses of cluster randomized trials

Cluster randomized trials, where clusters (for example, schools or clinics) are randomized to comparison arms but measurements are taken on individuals, are commonly used to evaluate interventions in public health, education, and the social sciences. Analysis is often conducted on individual-level o...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:The Stata journal Jg. 20; H. 2; S. 363
Hauptverfasser: Gallis, John A, Li, Fan, Turner, Elizabeth L
Format: Journal Article
Sprache:Englisch
Veröffentlicht: United States 01.06.2020
Schlagworte:
ISSN:1536-867X
Online-Zugang:Weitere Angaben
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Abstract Cluster randomized trials, where clusters (for example, schools or clinics) are randomized to comparison arms but measurements are taken on individuals, are commonly used to evaluate interventions in public health, education, and the social sciences. Analysis is often conducted on individual-level outcomes, and such analysis methods must consider that outcomes for members of the same cluster tend to be more similar than outcomes for members of other clusters. A popular individual-level analysis technique is generalized estimating equations (GEE). However, it is common to randomize a small number of clusters (for example, 30 or fewer), and in this case, the GEE standard errors obtained from the sandwich variance estimator will be biased, leading to inflated type I errors. Some bias-corrected standard errors have been proposed and studied to account for this finite-sample bias, but none has yet been implemented in Stata. In this article, we describe several popular bias corrections to the robust sandwich variance. We then introduce our newly created command, xtgeebcv, which will allow Stata users to easily apply finite-sample corrections to standard errors obtained from GEE models. We then provide examples to demonstrate the use of xtgeebcv. Finally, we discuss suggestions about which finite-sample corrections to use in which situations and consider areas of future research that may improve xtgeebcv.
AbstractList Cluster randomized trials, where clusters (for example, schools or clinics) are randomized to comparison arms but measurements are taken on individuals, are commonly used to evaluate interventions in public health, education, and the social sciences. Analysis is often conducted on individual-level outcomes, and such analysis methods must consider that outcomes for members of the same cluster tend to be more similar than outcomes for members of other clusters. A popular individual-level analysis technique is generalized estimating equations (GEE). However, it is common to randomize a small number of clusters (for example, 30 or fewer), and in this case, the GEE standard errors obtained from the sandwich variance estimator will be biased, leading to inflated type I errors. Some bias-corrected standard errors have been proposed and studied to account for this finite-sample bias, but none has yet been implemented in Stata. In this article, we describe several popular bias corrections to the robust sandwich variance. We then introduce our newly created command, xtgeebcv, which will allow Stata users to easily apply finite-sample corrections to standard errors obtained from GEE models. We then provide examples to demonstrate the use of xtgeebcv. Finally, we discuss suggestions about which finite-sample corrections to use in which situations and consider areas of future research that may improve xtgeebcv.
Cluster randomized trials, where clusters (for example, schools or clinics) are randomized to comparison arms but measurements are taken on individuals, are commonly used to evaluate interventions in public health, education, and the social sciences. Analysis is often conducted on individual-level outcomes, and such analysis methods must consider that outcomes for members of the same cluster tend to be more similar than outcomes for members of other clusters. A popular individual-level analysis technique is generalized estimating equations (GEE). However, it is common to randomize a small number of clusters (for example, 30 or fewer), and in this case, the GEE standard errors obtained from the sandwich variance estimator will be biased, leading to inflated type I errors. Some bias-corrected standard errors have been proposed and studied to account for this finite-sample bias, but none has yet been implemented in Stata. In this article, we describe several popular bias corrections to the robust sandwich variance. We then introduce our newly created command, xtgeebcv, which will allow Stata users to easily apply finite-sample corrections to standard errors obtained from GEE models. We then provide examples to demonstrate the use of xtgeebcv. Finally, we discuss suggestions about which finite-sample corrections to use in which situations and consider areas of future research that may improve xtgeebcv.Cluster randomized trials, where clusters (for example, schools or clinics) are randomized to comparison arms but measurements are taken on individuals, are commonly used to evaluate interventions in public health, education, and the social sciences. Analysis is often conducted on individual-level outcomes, and such analysis methods must consider that outcomes for members of the same cluster tend to be more similar than outcomes for members of other clusters. A popular individual-level analysis technique is generalized estimating equations (GEE). However, it is common to randomize a small number of clusters (for example, 30 or fewer), and in this case, the GEE standard errors obtained from the sandwich variance estimator will be biased, leading to inflated type I errors. Some bias-corrected standard errors have been proposed and studied to account for this finite-sample bias, but none has yet been implemented in Stata. In this article, we describe several popular bias corrections to the robust sandwich variance. We then introduce our newly created command, xtgeebcv, which will allow Stata users to easily apply finite-sample corrections to standard errors obtained from GEE models. We then provide examples to demonstrate the use of xtgeebcv. Finally, we discuss suggestions about which finite-sample corrections to use in which situations and consider areas of future research that may improve xtgeebcv.
Author Li, Fan
Turner, Elizabeth L
Gallis, John A
Author_xml – sequence: 1
  givenname: John A
  surname: Gallis
  fullname: Gallis, John A
  organization: Department of Biostatistics and Bioinformatics, Duke University, Duke Global Health Institute, Durham, NC
– sequence: 2
  givenname: Fan
  surname: Li
  fullname: Li, Fan
  organization: Department of Biostatistics, Yale School of Public Health, New Haven, CT
– sequence: 3
  givenname: Elizabeth L
  surname: Turner
  fullname: Turner, Elizabeth L
  organization: Department of Biostatistics and Bioinformatics, Duke University, Duke Global Health Institute, Durham, NC
BackLink https://www.ncbi.nlm.nih.gov/pubmed/35330784$$D View this record in MEDLINE/PubMed
BookMark eNo1kEtPwzAQhH0oog-4c0I-cgn4kcQJt6oqBakSF5C4RX5sICixWzspLb8eF8pptZ9mRrszRSPrLCB0RcktpULc0YznRS72jJScEkJHaHJESWRvYzQN4ZOQVFDGztGYZ5wTUaQTtN337wBK7-7xHGvXddIaXDuPVSNDop33oHswOET-1egPvJO-kVYDhtA3newbZ3_1q-USSyvbQ4CAXY11O4QePPbR6LrmO2b00dmGC3RWxwGXpzlDrw_Ll8Vjsn5ePS3m60SnPO0TyUtJFa1JrmoqZGSGaZC5yDKhyroAw6mJyJQpYSZnJK51ZkqTZQUwVbIZuvnL3Xi3HeK1VdcEDW0rLbghVCxPU3Jsjkbp9Uk6qA5MtfHxM3-o_mtiP1XEa4M
CitedBy_id crossref_primary_10_1136_bjsports_2020_103003
crossref_primary_10_1371_journal_pone_0304600
crossref_primary_10_1080_19345747_2022_2100301
crossref_primary_10_3389_fgwh_2025_1546901
crossref_primary_10_1002_hsr2_70890
crossref_primary_10_1177_0962280220958735
crossref_primary_10_1016_j_jclinepi_2025_111838
crossref_primary_10_1080_07448481_2022_2086007
crossref_primary_10_1371_journal_pone_0305471
crossref_primary_10_1001_jamapediatrics_2024_3274
crossref_primary_10_1177_0962280221990415
crossref_primary_10_1186_s12905_025_03647_w
crossref_primary_10_1002_sim_9541
crossref_primary_10_1002_bimj_202000230
crossref_primary_10_1002_bimj_202200002
crossref_primary_10_1016_S2215_0366_20_30258_3
crossref_primary_10_1093_heapol_czab072
crossref_primary_10_1097_AUD_0000000000001265
crossref_primary_10_1186_s12884_020_03476_9
crossref_primary_10_1002_sim_9375
crossref_primary_10_1177_1536867X211025840
ContentType Journal Article
DBID NPM
7X8
DOI 10.1177/1536867x20931001
DatabaseName PubMed
MEDLINE - Academic
DatabaseTitle PubMed
MEDLINE - Academic
DatabaseTitleList PubMed
MEDLINE - Academic
Database_xml – sequence: 1
  dbid: NPM
  name: PubMed
  url: http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 2
  dbid: 7X8
  name: MEDLINE - Academic
  url: https://search.proquest.com/medline
  sourceTypes: Aggregation Database
DeliveryMethod no_fulltext_linktorsrc
Discipline Statistics
Mathematics
ExternalDocumentID 35330784
Genre Journal Article
GrantInformation_xml – fundername: NICHD NIH HHS
  grantid: R01 HD075875
GroupedDBID -TM
-~X
0R~
123
51Z
54M
AADUE
AAHPS
AAQXI
AARIX
AATAA
ABCCA
ABDBF
ABEHJ
ABKRH
ABPNF
ABRHV
ABTDE
ABUJY
ACDXX
ACHQT
ACJER
ACOFE
ACOXC
ACROE
ACSIQ
ACUHS
ACUIR
ADRRZ
AEEHM
AENEX
AESZF
AEWDL
AEWHI
AEXNY
AFKRG
AFMOU
AFQAA
AFUIA
AGKLV
AGNHF
AHDMH
AJUZI
ALFTD
ALMA_UNASSIGNED_HOLDINGS
ANDLU
ARTOV
BPACV
DOPDO
DV7
EAP
EBS
EJD
ESX
F5P
FHBDP
GROUPED_SAGE_PREMIER_JOURNAL_COLLECTION
H13
IAO
INS
ITC
J8X
JAG
M4V
NPM
OJV
OK1
SAFTQ
SAUOL
SCNPE
SFC
SJN
YHZ
ZPPRI
7X8
AAPII
ABIDT
ACCVC
ADEBD
AJGYC
AJHME
AJVBE
AMNSR
ID FETCH-LOGICAL-c434t-a39a1b1f06bf17a434d2cea67557b9f8ed31dd2cd9402d62031df5d9d558e2b92
IEDL.DBID 7X8
ISICitedReferencesCount 25
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000545023800006&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 1536-867X
IngestDate Sat Sep 27 20:15:19 EDT 2025
Thu Apr 03 07:07:55 EDT 2025
IsDoiOpenAccess false
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 2
Keywords st0599
generalized estimating equations
cluster randomized trials
bias-corrected variances
sandwich variance
xtgeebcv
finite-sample correction
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c434t-a39a1b1f06bf17a434d2cea67557b9f8ed31dd2cd9402d62031df5d9d558e2b92
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
OpenAccessLink http://ageconsearch.umn.edu/record/340358
PMID 35330784
PQID 2644011771
PQPubID 23479
ParticipantIDs proquest_miscellaneous_2644011771
pubmed_primary_35330784
PublicationCentury 2000
PublicationDate 2020-Jun
20200601
PublicationDateYYYYMMDD 2020-06-01
PublicationDate_xml – month: 06
  year: 2020
  text: 2020-Jun
PublicationDecade 2020
PublicationPlace United States
PublicationPlace_xml – name: United States
PublicationTitle The Stata journal
PublicationTitleAlternate Stata J
PublicationYear 2020
SSID ssj0047122
Score 2.3768313
Snippet Cluster randomized trials, where clusters (for example, schools or clinics) are randomized to comparison arms but measurements are taken on individuals, are...
SourceID proquest
pubmed
SourceType Aggregation Database
Index Database
StartPage 363
Title xtgeebcv: A command for bias-corrected sandwich variance estimation for GEE analyses of cluster randomized trials
URI https://www.ncbi.nlm.nih.gov/pubmed/35330784
https://www.proquest.com/docview/2644011771
Volume 20
WOSCitedRecordID wos000545023800006&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText
inHoldings 1
isFullTextHit
isPrint
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV3JTsMwELWAcigHlrKVTUbiGrXO5oQLqlCBA616ACm3yPHYUIkmLWlLxdczTtL2hITEJVJGzuZMxu95nDeE3HDJA2U7HnITxS1Xe8wSgLuOYFwjI4KkFEl65v1-EEXhoJpwy6tllcuYWARqyKSZI2-Zgdvol3F2N55YpmqUya5WJTQ2Sc1BKGO8mkerLALG3SKLgB-1bwU-j9ZpypaxoWlhI6M3MkS_A8xioHnY--8t7pPdCmLSTukTB2RDpQ2y01vps-YNUjcYs5RoPiSTxfRNqUTOb2mH4pVGIgWKWJYmQ5Fb0pTvkIhLaY72r6F8p3Pk18ZZqFHoKH99LNo_drtUFCInKqeZpvJjZmQYKA6HkI2G33iOokhIfkReH7ov909WVYnBkq7jTi3hhIIlTLf9RDMu0Aa2VALJhseTUAcKHAZoghDpKPg2RgrQHoTgeegJSWgfk600S9UpoUpyR0FguwrarkS4kmjlYQtoeyJkEDTJ9bJzY_R0k74Qqcpmebzu3iY5Kd9QPC4lOWLHLJLlgXv2h6PPSd02pLmYSrkgNY3PrS7Jtpxjx39eFS6E2_6g9wM6d9IB
linkProvider ProQuest
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=xtgeebcv%3A+A+command+for+bias-corrected+sandwich+variance+estimation+for+GEE+analyses+of+cluster+randomized+trials&rft.jtitle=The+Stata+journal&rft.au=Gallis%2C+John+A&rft.au=Li%2C+Fan&rft.au=Turner%2C+Elizabeth+L&rft.date=2020-06-01&rft.issn=1536-867X&rft.volume=20&rft.issue=2&rft.spage=363&rft_id=info:doi/10.1177%2F1536867x20931001&rft_id=info%3Apmid%2F35330784&rft_id=info%3Apmid%2F35330784&rft.externalDocID=35330784
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1536-867X&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1536-867X&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1536-867X&client=summon