Hostname: page-component-6766d58669-h8lrw Total loading time: 0 Render date: 2026-05-20T21:53:53.263Z Has data issue: false hasContentIssue false

Using machine learning with intensive longitudinal data to predict depression and suicidal ideation among medical interns over time

Published online by Cambridge University Press:  30 September 2022

Adam G. Horwitz*
Affiliation:
Department of Psychiatry, University of Michigan, Ann Arbor, MI, USA
Shane D. Kentopp
Affiliation:
Department of Psychiatry, University of Michigan, Ann Arbor, MI, USA
Jennifer Cleary
Affiliation:
Department of Psychology, University of Michigan, Ann Arbor, MI, USA
Katherine Ross
Affiliation:
Department of Psychology, University of Michigan, Ann Arbor, MI, USA
Zhenke Wu
Affiliation:
Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA
Srijan Sen
Affiliation:
Department of Psychiatry, University of Michigan, Ann Arbor, MI, USA
Ewa K. Czyz
Affiliation:
Department of Psychiatry, University of Michigan, Ann Arbor, MI, USA
*
Author for correspondence: Adam G. Horwitz, E-mail: ahor@umich.edu
Rights & Permissions [Opens in a new window]

Abstract

Background

Use of intensive longitudinal methods (e.g. ecological momentary assessment, passive sensing) and machine learning (ML) models to predict risk for depression and suicide has increased in recent years. However, these studies often vary considerably in length, ML methods used, and sources of data. The present study examined predictive accuracy for depression and suicidal ideation (SI) as a function of time, comparing different combinations of ML methods and data sources.

Methods

Participants were 2459 first-year training physicians (55.1% female; 52.5% White) who were provided with Fitbit wearable devices and assessed daily for mood. Linear [elastic net regression (ENR)] and non-linear (random forest) ML algorithms were used to predict depression and SI at the first-quarter follow-up assessment, using two sets of variables (daily mood features only, daily mood features + passive-sensing features). To assess accuracy over time, models were estimated iteratively for each of the first 92 days of internship, using data available up to that point in time.

Results

ENRs using only the daily mood features generally had the best accuracy for predicting mental health outcomes, and predictive accuracy within 1 standard error of the full 92 day models was attained by weeks 7–8. Depression at 92 days could be predicted accurately (area under the curve >0.70) after only 14 days of data collection.

Conclusions

Simpler ML methods may outperform more complex methods until passive-sensing features become better specified. For intensive longitudinal studies, there may be limited predictive value in collecting data for more than 2 months.

Information

Type
Original Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
Copyright © The Author(s), 2022. Published by Cambridge University Press
Figure 0

Fig. 1. Full sample ENR and RF performance (mood predictors only).

Figure 1

Table 1. Simple and complex ML models

Figure 2

Fig. 2. Depression model performance by rates of missingness (ENR with mood predictors only).

Figure 3

Table 2. Missingness and prediction accuracy

Figure 4

Fig. 3. SI model performance by rates of missingness (ENR with mood predictors only).