Functional Principal Component Analysis for Continuous Non‐Gaussian, Truncated, and Discrete Functional Data
| Published in: | Statistics in Medicine, Volume 43, Issue 28, pp. 5431-5445 |
|---|---|
| Main authors: | , , , |
| Format: | Journal Article |
| Language: | English |
| Publication details: | Hoboken, USA: John Wiley & Sons, Inc; 10 December 2024; Wiley Subscription Services, Inc |
| Subject: | |
| ISSN: | 0277-6715, 1097-0258 |
| Summary: | Mobile health studies often collect multiple within‐day self‐reported assessments of participants' behavior and well‐being on different scales such as physical activity (continuous scale), pain levels (truncated scale), mood states (ordinal scale), and the occurrence of daily life events (binary scale). These assessments, when indexed by time of day, can be treated and analyzed as functional data corresponding to their respective types: continuous, truncated, ordinal, and binary. Motivated by these examples, we develop a functional principal component analysis that deals with all four types of functional data in a unified manner. It employs a semiparametric Gaussian copula model, assuming a generalized latent non‐paranormal process as the underlying generating mechanism for these four types of functional data. We specify latent temporal dependence using a covariance estimated through Kendall's τ bridging method, incorporating smoothness in the bridging process. The approach is then extended with methods for handling both dense and sparse sampling designs, calculating subject‐specific latent representations of observed data, latent principal components and principal component scores. Simulation studies demonstrate the method's competitive performance under both dense and sparse sampling designs. The method is applied to data from 497 participants in the National Institute of Mental Health Family Study of Mood Spectrum Disorders to characterize differences in within‐day temporal patterns of mood in individuals with the major mood disorder subtypes, including Major Depressive Disorder and Type 1 and 2 Bipolar Disorder. Software implementation of the proposed method is provided in an R‐package. |
|---|---|
| Bibliography: | Funding: This study was supported by grant no. ZIA MH002954‐07 Motor Activity Research Consortium for Health (mMARCH) from the National Institute of Health and a postdoctoral fellowship to Dr. Dey. The views and opinions expressed herein are those of the authors and should not be construed to represent the views of any of the sponsoring organizations, agencies, or the US government. |
| DOI: | 10.1002/sim.10240 |
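
Illustrative note: the summary mentions estimating a latent covariance through a Kendall's τ bridging method. The sketch below shows only the simplest, well-known case of that idea, for continuous data under a Gaussian copula, where the latent correlation r and Kendall's τ satisfy r = sin(πτ/2). It is not the article's implementation: the bridging functions for truncated, ordinal, and binary data, the smoothing step, and the sparse-design handling are not given in this record and are not reproduced here. The function names `kendall_tau_a` and `latent_correlation_continuous`, the time grid, and the simulated data are all hypothetical.

```python
import numpy as np


def kendall_tau_a(x, y):
    """Kendall's tau-a: (concordant - discordant) pairs / all pairs."""
    dx = np.sign(x[:, None] - x[None, :])
    dy = np.sign(y[:, None] - y[None, :])
    n = len(x)
    # The sum runs over ordered pairs i != j, so each unordered pair counts twice.
    return (dx * dy).sum() / (n * (n - 1))


def latent_correlation_continuous(X):
    """Estimate the latent correlation between time points.

    X has shape (n_subjects, n_timepoints): continuous curves observed on a
    common grid. Each entry uses the continuous-case bridge
    r = sin(pi * tau / 2), which holds under a Gaussian copula.
    """
    p = X.shape[1]
    R = np.eye(p)
    for j in range(p):
        for k in range(j + 1, p):
            tau = kendall_tau_a(X[:, j], X[:, k])
            R[j, k] = R[k, j] = np.sin(np.pi * tau / 2.0)
    return R


# Simulated example (assumed setup): a latent Gaussian process observed
# through an unknown monotone transform, here exp().
rng = np.random.default_rng(0)
grid = np.linspace(0.0, 1.0, 6)
true_corr = np.exp(-np.abs(grid[:, None] - grid[None, :]) / 0.5)
Z = rng.multivariate_normal(np.zeros(len(grid)), true_corr, size=1000)
X = np.exp(Z)

R_hat = latent_correlation_continuous(X)
print(np.round(R_hat, 2))
```

Because Kendall's τ depends only on ranks, the estimate is unchanged by the monotone transform, which is what makes the rank-based approach attractive for non-Gaussian data. A matrix assembled entry by entry in this way is not guaranteed to be positive semidefinite, so a projection or smoothing step would typically precede an eigendecomposition into principal components.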