Hostname: page-component-89b8bd64d-shngb Total loading time: 0 Render date: 2026-05-05T14:48:49.269Z Has data issue: false hasContentIssue false

Frequency and machine learning predictors of severe depressive symptoms and suicidal ideation among university students

Published online by Cambridge University Press:  07 July 2023

Nicola Meda
Affiliation:
Department of Neuroscience, University of Padova, Padova, Italy
Susanna Pardini
Affiliation:
Department of General Psychology, University of Padova, Padova, Italy
Paolo Rigobello
Affiliation:
Department of Molecular Medicine, University of Padova, Padova, Italy
Francesco Visioli*
Affiliation:
Department of Molecular Medicine, University of Padova, Padova, Italy IMDEA-Food, CEI UAM + CSIC, Madrid, Spain
Caterina Novara
Affiliation:
Department of General Psychology, University of Padova, Padova, Italy
*
Corresponding author: Francesco Visioli; Email: francesco.visioli@unipd.it
Rights & Permissions [Opens in a new window]

Abstract

Aims

Prospective studies on the mental health of university students highlighted a major concern. Specifically, young adults in academia are affected by markedly worse mental health status than their peers or adults in other vocations. This situation predisposes to exacerbated disability-adjusted life-years.

Methods

We enroled 1,388 students at the baseline, 557 of whom completed follow-up after 6 months, incorporating their demographic information and self-report questionnaires on depressive, anxiety and obsessive–compulsive symptoms. We applied multiple regression modelling to determine associations – at baseline – between demographic factors and self-reported mental health measures and supervised machine learning algorithms to predict the risk of poorer mental health at follow-up, by leveraging the demographic and clinical information collected at baseline.

Results

Approximately one out of five students reported severe depressive symptoms and/or suicidal ideation. An association of economic worry with depression was evidenced both at baseline (when high-frequency worry odds ratio = 3.11 [1.88–5.15]) and during follow-up. The random forest algorithm exhibited high accuracy in predicting the students who maintained well-being (balanced accuracy = 0.85) or absence of suicidal ideation but low accuracy for those whose symptoms worsened (balanced accuracy = 0.49). The most important features used for prediction were the cognitive and somatic symptoms of depression. However, while the negative predictive value of worsened symptoms after 6 months of enrolment was 0.89, the positive predictive value is basically null.

Conclusions

Students’ severe mental health problems reached worrying levels, and demographic factors were poor predictors of mental health outcomes. Further research including people with lived experience will be crucial to better assess students’ mental health needs and improve the predictive outcome for those most at risk of worsening symptoms.

Information

Type
Original Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2023. Published by Cambridge University Press.
Figure 0

Table 1. Sample characteristic at baseline

Figure 1

Table 2. Regression model for severe depressive symptoms

Figure 2

Figure 1. Frequency of severe depressive, anxiety symptoms and suicidal ideation among students per field of study and gender.

OC = obsessive–compulsive (symptoms); HS = Health Sciences; STEM = Science, Technology, Engineering and Mathematics. Orange bars represent female participants and black bars male participants. In red, inside each bar: error bar; the number on top of the orange/black bar represents the absolute number (white) of participants in a specific field of study experiencing severe symptoms. Absolute numbers and frequencies are also detailed in Table 1.
Figure 3

Table 3. Regression models for severe anxiety or obsessive–compulsive symptoms

Figure 4

Table 4. Regression models for current suicidal ideation

Figure 5

Table 5. Comparison between follow-up completers and dropouts

Figure 6

Figure 2. Significant variables extracted by random forest algorithms to predict severe depressive symptoms at follow-up. (A) Multiway plot showing the relative importance of the top 10 variables used to predict severe depressive symptoms at follow-up (6 months after enrolment). The variables in red are statistically significant. A higher score in accuracy decrease or gini decrease (which is a measure of how each variable contributes to the homogeneity of the nodes and leaves) reflects the relative importance of that variable in the model (i.e., if the variable is located in the upper right corner of the plot – like cognitive symptoms – it means that removing that variable from the model significantly worsens the model prediction capability). (B) Predictions of the random forest of severe depressive symptoms depending on values of the BDI-II subscale scores at baseline. Participants could be classified into four classes: improvement from baseline, worsening from baseline or stability (either steady severe symptoms or steady well-being). For each subplot predictions range from 0 (deep blue) to 1 (red). For example, the probability of ‘still struggling’ reaches almost 1 (certainty) if the subscale scores at the beginning of the study are high.

Figure 7

Figure 3. Significant variables extracted by random forest algorithms to predict suicidal ideation at follow-up. (A) Multiway plot showing the relative importance of the top 10 variables used to predict suicidal ideation at follow-up. The variables in red are statistically significant. A higher score in the decrease in accuracy or the decrease in gini reflects the relative importance of that variable in the model. (B) Predictions of the random forest of suicidal ideation according to the values of the BDI-II subscale scores at baseline.

Supplementary material: File

Meda et al. supplementary material

Meda et al. supplementary material

Download Meda et al. supplementary material(File)
File 981.5 KB