Hostname: page-component-6766d58669-bp2c4 Total loading time: 0 Render date: 2026-05-20T12:03:49.476Z Has data issue: false hasContentIssue false

Depression is associated with reduced outcome sensitivity in a dual valence, magnitude learning task

Published online by Cambridge University Press:  14 September 2023

Erdem Pulcu*
Affiliation:
Department of Psychiatry, University of Oxford, Oxford, UK
Wanjun Lin
Affiliation:
Department of Psychiatry, University of Oxford, Oxford, UK University College London, Max Planck University College London Centre for Computational Psychiatry and Ageing Research, London, UK
Sungwon Han
Affiliation:
Nuffield Department of Medicine, University of Oxford, Oxford, UK
Michael Browning
Affiliation:
Department of Psychiatry, University of Oxford, Oxford, UK Oxford Health NHS Foundation Trust, Oxford, UK
*
Corresponding author: Erdem Pulcu; Email: pulerd@gmail.com
Rights & Permissions [Opens in a new window]

Abstract

Background

Learning from rewarded and punished choices is perturbed in depressed patients, suggesting that abnormal reinforcement learning may be a cognitive mechanism of the illness. However, previous studies have disagreed about whether this behavior is produced by alterations in the rate of learning or sensitivity to experienced outcomes. This previous work has generally assessed learning in response to binary outcomes of one valence, rather than to both rewarding and punishing continuous outcomes.

Methods

A novel drifting reward and punishment magnitude reinforcement-learning task was administered to patients with current (n = 40) and remitted depression (n = 39), and healthy volunteers (n = 40) to capture potential differences in learning behavior. Standard questionnaires were administered to measure self-reported depressive symptom severity, trait and state anxiety and level of anhedonic symptoms.

Results

Our findings demonstrate that patients with current depression adjust their learning behaviors to a lesser degree in response to trial-by-trial variations in reward and loss magnitudes than the other groups. Computational modeling revealed that this behavioral signature of current depressive state is better accounted for by reduced reward and punishment sensitivity (all p < 0.031), rather than a change in learning rate (p = 0.708). However, between-group differences were not related to self-reported symptom severity or comorbid anxiety disorders in the current depression group.

Conclusion

These findings suggest that current depression is associated with reduced outcome sensitivity rather than altered learning rate. Previous findings reported in this domain mainly from binary learning tasks seem to generalize to learning from continuous outcomes.

Information

Type
Original Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
Copyright © The Author(s), 2023. Published by Cambridge University Press
Figure 0

Figure 1. The magnitude learning task. (a) On each trial participants were presented with two abstract shapes and were asked to choose one of them. The empty bars above and below the fixation cross displayed the win and loss outcomes, represented by bar fillings in green and red colors, respectively. Here, the full length of the bars was equivalent to £1 for both wins and losses, and the length of the bar fillings from the side of the chosen shape, represented the outcome magnitudes (e.g. if 80% of the upper bar is filled with green, this would mean 80p win). Most importantly, the empty portions of the bars after the green and red fillings indicated the win and loss outcome magnitudes in the unchosen option, respectively; allowing participants to infer which shape would have been the better option on any given trial. (b) The outcomes presented to participants during the task. The outcome schedule was designed such that win and loss outcomes were decorrelated (r(79) = −0.032, p = 0.78) to require differential learning from wins and losses. The y-axis represents the magnitude of wins (in solid green) and losses (in solid red) associated with shape ‘A’ (in pence units). In total, the task consisted of 80 trials.

Figure 1

Table 1. Demographic details of the participants

Figure 2

Figure 2. (a) Influence of win and loss outcomes on participant choice behavior. A model-free approach demonstrating how outcome magnitudes from trial t-1 influence participant choice probability on trial t. Increasing outcome magnitudes had a greater effect on choice behavior (nb influence of the loss amount plotted as 1-p for ease of viewing). Bars show the mean (SEM) probability of selecting shape A across all participants in the study. A greater effect of outcome magnitude will lead to a steeper slope (θ) on this graph. (b) The effect of outcome magnitude on choice probability split by group. In this graph, the y axis (delta) is an estimate of slope (θ) from panel (A) and is computed as: ((bars 3 + 4)/2-(bars 1 + 2)/2). Bars represent the mean (SEM) of each group, separately for wins and losses. As can be seen the currently depressed group are numerically (but not statistically) less influenced by outcome magnitude than the other groups.

Figure 3

Table 2. Clinical characteristics of current (N = 40) and remitted MDD groups (N = 39)

Figure 4

Figure 3. Parameter estimates from the best-fitting reinforcement learning model that explains participant choice behavior. Groups were comparable for learning rates, whereas patients with current depression had significantly lower reward and loss sensitivity parameters. Error bars denote ± 1 SEM. All model parameters are plotted in the normal space for ease of viewing and interpretation.

Supplementary material: PDF

S0033291723002520sup001.pdf

Pulcu et al. supplementary material

Download S0033291723002520sup001.pdf(PDF)
PDF 332.3 KB