Hostname: page-component-89b8bd64d-46n74 Total loading time: 0 Render date: 2026-05-06T14:01:00.242Z Has data issue: false hasContentIssue false

Computational modelling of reinforcement learning and functional neuroimaging of probabilistic reversal for dissociating compulsive behaviours in gambling and cocaine use disorders

Published online by Cambridge University Press:  11 December 2023

Katharina Zühlsdorff*
Affiliation:
Department of Psychology, University of Cambridge, UK; Behavioural and Clinical Neuroscience Institute, University of Cambridge, UK; and the Alan Turing Institute, London, UK
Juan Verdejo-Román
Affiliation:
Department of Personality, Assessment and Psychological Treatment, Universidad de Granada, Spain; and Mind, Brain and Behavior Research Center, Universidad de Granada, Spain
Luke Clark
Affiliation:
Department of Psychology and Djavad Mowafaghian Centre for Brain Health, University of British Columbia, Canada
Natalia Albein-Urios
Affiliation:
Cognitive Neuroscience Unit, School of Psychology, Deakin University, Australia
Carles Soriano-Mas
Affiliation:
Department of Psychiatry, Bellvitge Biomedical Research Institute-IDIBELL, Spain; Department of Social Psychology and Quantitative Psychology, University of Barcelona, Spain; and CIBERSAM, Carlos III Health Institute, Madrid, Spain
Rudolf N. Cardinal
Affiliation:
Behavioural and Clinical Neuroscience Institute, University of Cambridge, UK; Department of Psychiatry, University of Cambridge, UK; and Liaison Psychology, Cambridgeshire and Peterborough NHS Foundation Trust, UK
Trevor W. Robbins
Affiliation:
Department of Psychology, University of Cambridge, UK; and Behavioural and Clinical Neuroscience Institute, University of Cambridge, UK
Jeffrey W. Dalley
Affiliation:
Department of Psychology, University of Cambridge, UK; Behavioural and Clinical Neuroscience Institute, University of Cambridge, UK; and Department of Psychiatry, University of Cambridge, UK
Antonio Verdejo-García
Affiliation:
School of Psychological Sciences, Monash University, Australia; and Turner Institute for Brain and Mental Health, Monash University, Australia
Jonathan W. Kanen
Affiliation:
Department of Psychology, University of Cambridge, UK; and Behavioural and Clinical Neuroscience Institute, University of Cambridge, UK
*
Correspondence: Katharina Zühlsdorff. Email: kz294@cam.ac.uk
Rights & Permissions [Opens in a new window]

Abstract

Background

Individuals with cocaine use disorder or gambling disorder demonstrate impairments in cognitive flexibility: the ability to adapt to changes in the environment. Flexibility is commonly assessed in a laboratory setting using probabilistic reversal learning, which involves reinforcement learning, the process by which feedback from the environment is used to adjust behavior.

Aims

It is poorly understood whether impairments in flexibility differ between individuals with cocaine use and gambling disorders, and how this is instantiated by the brain. We applied computational modelling methods to gain a deeper mechanistic explanation of the latent processes underlying cognitive flexibility across two disorders of compulsivity.

Method

We present a re-analysis of probabilistic reversal data from individuals with either gambling disorder (n = 18) or cocaine use disorder (n = 20) and control participants (n = 18), using a hierarchical Bayesian approach. Furthermore, we relate behavioural findings to their underlying neural substrates through an analysis of task-based functional magnetic resonanceimaging (fMRI) data.

Results

We observed lower ‘stimulus stickiness’ in gambling disorder, and report differences in tracking expected values in individuals with gambling disorder compared to controls, with greater activity during reward expected value tracking in the cingulate gyrus and amygdala. In cocaine use disorder, we observed lower responses to positive punishment prediction errors and greater activity following negative punishment prediction errors in the superior frontal gyrus compared to controls.

Conclusions

Using a computational approach, we show that individuals with gambling disorder and cocaine use disorder differed in their perseverative tendencies and in how they tracked value neurally, which has implications for psychiatric classification.

Information

Type
Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
Copyright © The Author(s), 2023. Published by Cambridge University Press on behalf of the Royal College of Psychiatrists
Figure 0

Table 1 Demographic information

Figure 1

Table 2 Model comparison summary

Figure 2

Fig. 1 Results from the hierarchical Bayesian winning reinforcement learning model, showing differences in group mean parameters. Orange indicates 0 ∉ 75% HDI. CUD, cocaine use disorder; HDI, highest density interval.

Figure 3

Fig. 2 Reward expected value tracking: differences between healthy controls and participants with gambling disorder (Montreal Neurological Institute coordinates: Y = −18 to −11). Activity was higher in the gambling disorder group in the areas indicated. Colour bar on the right-hand side represents the t-statistic.

Figure 4

Table 3 Summary of peak functional magnetic resonance imaging activity for the reward expected value controls versus gambling disorder contrast

Figure 5

Fig. 3 Punishment expected value tracking: differences between healthy controls and participants with gambling disorder (Montreal Neurological Institute coordinates: Y = −24 to −17). Activity was lower in the gambling disorder group in the areas indicated. Colour bar on the right-hand side represents the t-statistic.

Figure 6

Table 4 Summary of peak functional magnetic resonance imaging activity for the punishment expected value controls versus gambling disorder contrast

Figure 7

Fig. 4 Response to positive punishment prediction errors: differences between healthy controls and participants with CUD (Montreal Neurological Institute coordinates: X = −5, Y = 17, Z = 48). Activity was lower in the CUD group in the areas indicated. Colour bar on the right-hand side represents the t-statistic. CUD, cocaine use disorder.

Figure 8

Fig. 5 Response to negative punishment prediction errors: differences between healthy controls and participants with CUD (Montreal Neurological Institute coordinates: X = −31, Y = 30, Z = 56). Activity was higher in the CUD group in the areas indicated. Colour bar on the right-hand side represents the t-statistic. CUD, cocaine use disorder;

Figure 9

Table 5 Summary of peak functional magnetic resonance imaging activity for the positive punishment prediction error controls versus cocaine use disorder contrast

Figure 10

Table 6 Summary of peak functional magnetic resonance imaging activity for the punishment prediction error controls versus cocaine use disorder contrast

Figure 11

Fig. 6 Top: areas that have a stronger positive correlation with κstim in the gambling disorder group than in the healthy control group (MNI coordinates: X = 48, Y = 29, Z = 22). Bottom: areas that have a stronger positive correlation with κstim in the CUD group than in the healthy control group (MNI coordinates: X = 48, Y = 29, Z = 20). Colour bar on the right-hand side represents the t-statistic. CUD, cocaine use disorder; MNI, Montreal Neurological Institute.

Supplementary material: File

Zühlsdorff et al. supplementary material
Download undefined(File)
File 1.1 MB
Submit a response

eLetters

No eLetters have been published for this article.