Hostname: page-component-89b8bd64d-nlwjb Total loading time: 0 Render date: 2026-05-09T13:39:29.517Z Has data issue: false hasContentIssue false

Deep learning-based automated speech detection as a marker of social functioning in late-life depression

Published online by Cambridge University Press:  16 January 2020

Bethany Little
Affiliation:
Institute of Neuroscience, Newcastle University, Newcastle upon Tyne, UK
Ossama Alshabrawy
Affiliation:
Interdisciplinary Computing and Complex BioSystems (ICOS) group, School of Computing, Newcastle University, Newcastle upon Tyne, UK Faculty of Science, Damietta University, New Damietta, Egypt
Daniel Stow
Affiliation:
Institute of Health and Society, Newcastle University, Newcastle upon Tyne, UK
I. Nicol Ferrier
Affiliation:
Institute of Neuroscience, Newcastle University, Newcastle upon Tyne, UK
Roisin McNaney
Affiliation:
Faculty of Engineering, Bristol University, Bristol, UK
Daniel G. Jackson
Affiliation:
Open Lab, School of Computing, Newcastle University, Newcastle upon Tyne, UK
Karim Ladha
Affiliation:
Open Lab, School of Computing, Newcastle University, Newcastle upon Tyne, UK
Cassim Ladha
Affiliation:
Cascom Ltd, Newcastle upon Tyne, UK
Thomas Ploetz
Affiliation:
School of Interactive Computing, Georgia Institute of Technology, Atlanta, GA, USA
Jaume Bacardit
Affiliation:
Interdisciplinary Computing and Complex BioSystems (ICOS) group, School of Computing, Newcastle University, Newcastle upon Tyne, UK
Patrick Olivier
Affiliation:
Faculty of Information Technology, Monash University, Melbourne, Australia
Peter Gallagher
Affiliation:
Institute of Neuroscience, Newcastle University, Newcastle upon Tyne, UK
John T. O'Brien*
Affiliation:
Institute of Neuroscience, Newcastle University, Newcastle upon Tyne, UK Department of Psychiatry, University of Cambridge, Cambridge, UK
*
Author for correspondence: John T. O'Brien, E-mail: john.obrien@medschl.cam.ac.uk
Rights & Permissions [Opens in a new window]

Abstract

Background

Late-life depression (LLD) is associated with poor social functioning. However, previous research uses bias-prone self-report scales to measure social functioning and a more objective measure is lacking. We tested a novel wearable device to measure speech that participants encounter as an indicator of social interaction.

Methods

Twenty nine participants with LLD and 29 age-matched controls wore a wrist-worn device continuously for seven days, which recorded their acoustic environment. Acoustic data were automatically analysed using deep learning models that had been developed and validated on an independent speech dataset. Total speech activity and the proportion of speech produced by the device wearer were both detected whilst maintaining participants' privacy. Participants underwent a neuropsychological test battery and clinical and self-report scales to measure severity of depression, general and social functioning.

Results

Compared to controls, participants with LLD showed poorer self-reported social and general functioning. Total speech activity was much lower for participants with LLD than controls, with no overlap between groups. The proportion of speech produced by the participants was smaller for LLD than controls. In LLD, both speech measures correlated with attention and psychomotor speed performance but not with depression severity or self-reported social functioning.

Conclusions

Using this device, LLD was associated with lower levels of speech than controls and speech activity was related to psychomotor retardation. We have demonstrated that speech activity measured by wearable technology differentiated LLD from controls with high precision and, in this study, provided an objective measure of an aspect of real-world social functioning in LLD.

Information

Type
Original Articles
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
Copyright © The Author(s) 2020
Figure 0

Fig. 1. The wearable device.

Figure 1

Table 1. Demographic information, clinical and social characteristics, speech measures and group comparisons

Figure 2

Fig. 2. (a) Mean proportion of speech detected in a 24-h period (averaged over 7 days) and (b) mean proportion of speech produced by the wearer themselves (out of all speech detected) for LLD and healthy controls. Dots represent individual participants and are randomly spread across the x-axis within each group. Groups differed significantly in the proportion of speech detected in 24 h, such that all participants with LLD showed lower levels of speech detected than all healthy controls (U = 0.0, z = −6.541, p < 0.001). Of all speech detected, LLD produce a smaller proportion of speech themselves, compared to healthy controls (t(32.477) = 38.562, p < 0.001).

Figure 3

Fig. 3. Mean probability of speech being detected for participants with LLD and healthy controls across a 24-h period (averaged over 7 days).

Figure 4

Fig. 4. Relationships between key variables and: (a) mean proportion of total speech detected across 24-h (averaged over 7 days); and (b) mean proportion of speech produced by the wearer (out of all speech detected), for participants with LLD (N = 29) and healthy controls (N = 29). MADRS, Montgomery-Asberg Depression Rating Scale; APS, Attention and Psychomotor Speed; DSSI, Duke Social Support Index; LSNS-R, Lubben Social Network Scale-Revised.

Supplementary material: File

Little et al. supplementary material

Little et al. supplementary material

Download Little et al. supplementary material(File)
File 2.9 MB