Mixed Matrix Completion in Complex Survey Sampling under Heterogeneous Missingness

Modern surveys with large sample sizes and growing mixed-type questionnaires require robust and scalable analysis methods. In this work, we consider recovering a mixed dataframe matrix, obtained by complex survey sampling, with entries following different canonical exponential distributions and subj...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:Journal of computational and graphical statistics Ročník 33; číslo 4; s. 1320 - 1328
Hlavní autoři: Mao, Xiaojun, Wang, Hengfang, Wang, Zhonglei, Yang, Shu
Médium: Journal Article
Jazyk:angličtina
Vydáno: United States Taylor & Francis 01.10.2024
Taylor & Francis Ltd
Témata:
ISSN:1061-8600, 1537-2715
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:Modern surveys with large sample sizes and growing mixed-type questionnaires require robust and scalable analysis methods. In this work, we consider recovering a mixed dataframe matrix, obtained by complex survey sampling, with entries following different canonical exponential distributions and subject to heterogeneous missingness. To tackle this challenging task, we propose a two-stage procedure: in the first stage, we model the entry-wise missing mechanism by logistic regression, and in the second stage, we complete the target parameter matrix by maximizing a weighted log-likelihood with a low-rank constraint. We propose a fast and scalable estimation algorithm that achieves sublinear convergence, and the upper bound for the estimation error of the proposed method is rigorously derived. Experimental results support our theoretical claims, and the proposed estimator shows its merits compared to other existing methods. The proposed method is applied to analyze the National Health and Nutrition Examination Survey data. Supplementary materials for this article are available online.
Bibliografie:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
All authors are equally contributed.
ISSN:1061-8600
1537-2715
DOI:10.1080/10618600.2024.2319154