Hostname: page-component-77f85d65b8-lfk5g Total loading time: 0 Render date: 2026-03-28T17:58:15.375Z Has data issue: false hasContentIssue false

Predicting patients who will drop out of out-patient psychotherapy using machine learning algorithms

Published online by Cambridge University Press:  18 February 2022

Björn Bennemann*
Affiliation:
Department of Clinical Psychology and Psychotherapy, University of Trier, Germany
Brian Schwartz
Affiliation:
Department of Clinical Psychology and Psychotherapy, University of Trier, Germany
Julia Giesemann
Affiliation:
Department of Clinical Psychology and Psychotherapy, University of Trier, Germany
Wolfgang Lutz
Affiliation:
Department of Clinical Psychology and Psychotherapy, University of Trier, Germany
*
Correspondence: Björn Bennemann. Email: bennemann@uni-trier.de
Rights & Permissions [Opens in a new window]

Abstract

Background

About 30% of patients drop out of cognitive–behavioural therapy (CBT), which has implications for psychiatric and psychological treatment. Findings concerning drop out remain heterogeneous.

Aims

This paper aims to compare different machine-learning algorithms using nested cross-validation, evaluate their benefit in naturalistic settings, and identify the best model as well as the most important variables.

Method

The data-set consisted of 2543 out-patients treated with CBT. Assessment took place before session one. Twenty-one algorithms and ensembles were compared. Two parameters (Brier score, area under the curve (AUC)) were used for evaluation.

Results

The best model was an ensemble that used Random Forest and nearest-neighbour modelling. During the training process, it was significantly better than generalised linear modelling (GLM) (Brier score: d = –2.93, 95% CI (−3.95, −1.90)); AUC: d = 0.59, 95% CI (0.11 to 1.06)). In the holdout sample, the ensemble was able to correctly identify 63.4% of cases of patients, whereas the GLM only identified 46.2% correctly. The most important predictors were lower education, lower scores on the Personality Style and Disorder Inventory (PSSI) compulsive scale, younger age, higher scores on the PSSI negativistic and PSSI antisocial scale as well as on the Brief Symptom Inventory (BSI) additional scale (mean of the four additional items) and BSI overall scale.

Conclusions

Machine learning improves drop-out predictions. However, not all algorithms are suited to naturalistic data-sets and binary events. Tree-based and boosted algorithms including a variable selection process seem well-suited, whereas more advanced algorithms such as neural networks do not.

Information

Type
Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
Copyright © The Author(s), 2022. Published by Cambridge University Press on behalf of the Royal College of Psychiatrists
Figure 0

Table 1 Predictors used for model generation. Predictors were routinely collected at intakea

Figure 1

Table 2 Classification of all machine learning algorithms16 that were used in this studya

Figure 2

Table 3 Mean scores of the models generated by all 45 algorithms and ensemblesa

Figure 3

Fig. 1 Distribution of the 20 outer cross-validation models generated by each algorithm and ensemble ranked from best to worst using the area under the curve.Each value was grand-mean centred; the horizontal line represents the total average of all models. The numbers on the graphs are the standard deviations.

Supplementary material: File

Bennemann et al. supplementary material

Bennemann et al. supplementary material

Download Bennemann et al. supplementary material(File)
File 301 KB

This journal is not currently accepting new eletters.

eLetters

No eLetters have been published for this article.