Hostname: page-component-77f85d65b8-lfk5g Total loading time: 0 Render date: 2026-03-29T08:19:16.608Z Has data issue: false hasContentIssue false

A Continuous-Time Dynamic Factor Model for Intensive Longitudinal Data Arising from Mobile Health Studies

Published online by Cambridge University Press:  16 June 2025

Madeline R. Abbott*
Affiliation:
Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA
Walter H. Dempsey
Affiliation:
Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA
Inbal Nahum-Shani
Affiliation:
Institute for Social Research, University of Michigan, Ann Arbor, MI, USA
Cho Y. Lam
Affiliation:
Department of Population Health Sciences and Huntsman Cancer Institute, University of Utah, Salt Lake City, UT, USA
David W. Wetter
Affiliation:
Department of Population Health Sciences and Huntsman Cancer Institute, University of Utah, Salt Lake City, UT, USA
Jeremy M. G. Taylor
Affiliation:
Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA
*
Corresponding author: Madeline R. Abbott; Email: mrabbott@umich.edu
Rights & Permissions [Opens in a new window]

Abstract

Intensive longitudinal data (ILD) collected in mobile health (mHealth) studies contain rich information on the dynamics of multiple outcomes measured frequently over time. Motivated by an mHealth study in which participants self-report the intensity of many emotions multiple times per day, we describe a dynamic factor model that summarizes ILD as a low-dimensional, interpretable latent process. This model consists of (i) a measurement submodel—a factor model—that summarizes the multivariate longitudinal outcome as lower-dimensional latent variables and (ii) a structural submodel—an Ornstein–Uhlenbeck (OU) stochastic process—that captures the dynamics of the multivariate latent process in continuous time. We derive a closed-form likelihood for the marginal distribution of the outcome and the computationally-simpler sparse precision matrix for the OU process. We propose a block coordinate descent algorithm for estimation and use simulation studies to show that it has good statistical properties with ILD. Then, we use our method to analyze data from the mHealth study. We summarize the dynamics of 18 emotions using models with one, two, and three time-varying latent factors, which correspond to different behavioral science theories of emotions. We demonstrate how results can be interpreted to help improve behavioral science theories of momentary emotions, latent psychological states, and their dynamics.

Information

Type
Application and Case Studies - Original
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of Psychometric Society
Figure 0

Figure 1 Responses to the EMA questions over time for one participant in the mHealth study, separated by positive and negative emotions. In this plot, a subset of three positive emotions and three negative emotions are highlighted solely for illustrative purposes; all 18 emotions are later included in the model. Note both the high correlation and volatility of these longitudinal outcomes over time.

Figure 1

Figure 2 Results from ILD simulation study. Relative bias of parameter estimates from the block coordinate descent algorithm for the three different settings in which the true OU process differs. Relative bias is calculated as (estimate - truth) / truth and is summarized across the 1,000 simulated datasets with box plots. The colored dots indicate 0 bias.

Figure 2

Figure 3 Results from ILD simulation study. Comparison of estimated standard errors (from Fisher information) and standard deviation of point estimates. The similarity of the standard error estimates and empirical standard deviation suggests that the standard errors are of appropriate size. Note that the standard error estimate for $\sigma ^2_{\epsilon _4}$ is missing for one dataset in setting 3 (see Section A.8 of the Supplementary Material for details).

Figure 3

Table 1 For datasets generated under each true model, we summarize the percent of times that the model-selection metric chose the fitted model with the indicated number of factors. When generating data from models with 2 and 3 factors, we considered two different settings: a high signal setting in which latent factors have lower correlation and a low signal setting in which latent factors have high correlation. The settings in which the fitted model has the same number of factors as the true data-generating model are emphasized with bold orange text. These results are presented for datasets on which the algorithm either converged or reached the maximum number of iterations (200) for all three models. See Section A.8 of the Supplementary Material for more details.

Figure 4

Figure 4 Point estimates and corresponding 95% confidence intervals (CI) for each of the parameter matrices in our two-factor OUF model. Intervals for OU parameters $\sigma _{11}$ and $\sigma _{22}$ are based on a parametric bootstrap. Because we assume structural zeros in the loadings matrix are known, each emotion has only a single loading. Parameter subscripts 1-18 correspond to the emotions as follows: 1 = happy, 2 = joyful, 3 = enthusiastic, 4 = active, 5 = calm, 6 = determined, 7 = grateful, 8 = proud, 9 = attentive, 10 = sad, 11 = scared, 12 = disgusted, 13 = angry, 14 = ashamed, 15 = guilty, 16 = irritable, 17 = lonely, 18 = nervous.

Figure 5

Figure 5 The top panel shows the decay in autocorrelation and cross-correlation between latent factors that represent positive affect ($\eta _1(t)$) and negative affect ($\eta _2(t)$) across increasing gap times, where time is measured in hours. Curves are calculated using OU parameters estimated from emotions measured in the mHealth study. The shaded bands indicate the 2.5th and 97.5th percentiles of a parametric bootstrap. The bottom plot summarizes the distribution of the observed gap times (in hours) between measurements for all individuals in the mHealth study.

Supplementary material: File

Abbott et al. supplementary material

Abbott et al. supplementary material
Download Abbott et al. supplementary material(File)
File 663.3 KB