Hostname: page-component-89b8bd64d-72crv Total loading time: 0 Render date: 2026-05-06T18:17:24.553Z Has data issue: false hasContentIssue false

Effect of lysergic acid diethylamide (LSD) on reinforcement learning in humans

Published online by Cambridge University Press:  22 November 2022

Jonathan W. Kanen*
Affiliation:
Department of Psychology, University of Cambridge, Cambridge, UK Behavioural and Clinical Neuroscience Institute, University of Cambridge, Cambridge, UK
Qiang Luo
Affiliation:
National Clinical Research Center for Aging and Medicine at Huashan Hospital, State Key Laboratory of Medical Neurobiology and Ministry of Education Frontiers Center for Brain Science, Institutes of Brain Science and Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, 200433, China Center for Computational Psychiatry, Ministry of Education-Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence, Human Phenome Institute, Fudan University, Shanghai, 200032, China Shanghai Key Laboratory of Mental Health and Psychological Crisis Intervention, School of Psychology and Cognitive Science, East China Normal University, Shanghai, 200241, China
Mojtaba Rostami Kandroodi
Affiliation:
Department of Cognitive Science and Artificial Intelligence, Tilburg University, Tilburg, The Netherlands Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, The Netherlands
Rudolf N. Cardinal
Affiliation:
Behavioural and Clinical Neuroscience Institute, University of Cambridge, Cambridge, UK Department of Psychiatry, University of Cambridge, Cambridge, UK Cambridgeshire and Peterborough NHS Foundation Trust, Cambridge, UK
Trevor W. Robbins
Affiliation:
Department of Psychology, University of Cambridge, Cambridge, UK Behavioural and Clinical Neuroscience Institute, University of Cambridge, Cambridge, UK
David J. Nutt
Affiliation:
Department of Brain Sciences, Centre for Psychedelic Research, Imperial College London, London, UK
Robin L. Carhart-Harris
Affiliation:
Neuroscape Psychedelics Division, University of California San Francisco, San Francisco, California, USA
Hanneke E. M. den Ouden
Affiliation:
Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, The Netherlands
*
Author for correspondence: Jonathan W. Kanen, E-mail: jonathan.kanen@gmail.com
Rights & Permissions [Opens in a new window]

Abstract

Background

The non-selective serotonin 2A (5-HT2A) receptor agonist lysergic acid diethylamide (LSD) holds promise as a treatment for some psychiatric disorders. Psychedelic drugs such as LSD have been suggested to have therapeutic actions through their effects on learning. The behavioural effects of LSD in humans, however, remain incompletely understood. Here we examined how LSD affects probabilistic reversal learning (PRL) in healthy humans.

Methods

Healthy volunteers received intravenous LSD (75 μg in 10 mL saline) or placebo (10 mL saline) in a within-subjects design and completed a PRL task. Participants had to learn through trial and error which of three stimuli was rewarded most of the time, and these contingencies switched in a reversal phase. Computational models of reinforcement learning (RL) were fitted to the behavioural data to assess how LSD affected the updating (‘learning rates’) and deployment of value representations (‘reinforcement sensitivity’) during choice, as well as ‘stimulus stickiness’ (choice repetition irrespective of reinforcement history).

Results

Raw data measures assessing sensitivity to immediate feedback (‘win-stay’ and ‘lose-shift’ probabilities) were unaffected, whereas LSD increased the impact of the strength of initial learning on perseveration. Computational modelling revealed that the most pronounced effect of LSD was the enhancement of the reward learning rate. The punishment learning rate was also elevated. Stimulus stickiness was decreased by LSD, reflecting heightened exploration. Reinforcement sensitivity differed by phase.

Conclusions

Increased RL rates suggest LSD induced a state of heightened plasticity. These results indicate a potential mechanism through which revision of maladaptive associations could occur in the clinical application of LSD.

Information

Type
Original Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
Copyright © The Author(s), 2022. Published by Cambridge University Press
Figure 0

Fig. 1. (a) Schematic of the PRL task. Subjects chose one of three stimuli. The timeline of a trial is depicted: stimuli appear, a choice is made, the outcome is shown, a fixation cross is presented during the intertrial interval, stimuli appear for the next trial (etc.) (RT, reaction time). One stimulus delivered positive feedback (green smiling face) with a 75% probability, one with 50%, and one with 25%. The probabilistic alternative was negative feedback (red sad face). Midway through the task, the contingencies for the best and worst stimuli swapped. s, seconds. (b) Better initial learning was predictive of more perseveration on LSD and not on placebo. Shading indicates ± 1 standard error of the mean (s.e.). (c) Trial-by-trial average probability of choosing each stimulus, averaged over subjects during the placebo session. A sliding 5-trial window was used for smoothing. The vertical dotted line indicates the reversal of contingencies. R-P indicates mostly rewarded stimulus, later mostly punished. N-N indicates neutral stimulus during both acquisition and reversal. P-R indicates mostly punished stimulus, later mostly rewarded stimulus. Shading indicates ± 1 s.e. (d) Trial-by-trial average probability of choosing each stimulus, averaged over subjects during the LSD session. A sliding 5-trial window was used for smoothing. The vertical dotted line indicates the reversal of contingencies. R-P indicates mostly rewarded stimulus, later mostly punished. N-N indicates neutral stimulus during both acquisition and reversal. P-R indicates mostly punished stimulus, later mostly rewarded stimulus. Shading indicates ± 1 s.e. (e) Distributions depicting the average per-subject probability (scattered dots) of choosing each stimulus while under placebo (shown in dark blue) and LSD (light blue). The mean value for each distribution is illustrated with a single dot at the base of each distribution, and the mean values for the probability of choosing different stimuli in each condition are connected by a line. Black error bars around the mean value show ± 1 s.e. Horizontal dotted line indicates chance-level ‘stay’ behaviour (33%). The global probability of choosing each stimulus did not differ between the placebo and LSD conditions. (f) Raw data measures of feedback sensitivity were unaffected by LSD. Distributions depicting the average per-subject probability (scattered dots) of repeating a choice (staying) after receiving positive or negative feedback under placebo (dark blue) and LSD (light blue). The horizontal dotted line indicates chance-level ‘stay’ behaviour (33%).

Figure 1

Table 1. Prior distributions for model parameters

Figure 2

Table 2. Model comparison

Figure 3

Fig. 2. Effects of LSD relative to placebo on model parameters. Contrasts with the posterior 95% (or greater) HDI of the difference between means excluding zero (0 ∉ 95% HDI) are shown in red. Yellow signifies 0 ∉ 90% HDI. (a) Acquisition and reversal phases (all trials) modelled together. The third row represents a difference of differences scores: (αrewLSDαpunLSD) – (αrewplaceboαpunplacebo). (b) Isolating the acquisition phase. (c) Isolating the reversal phase.

Supplementary material: File

Kanen et al. supplementary material

Kanen et al. supplementary material

Download Kanen et al. supplementary material(File)
File 724.6 KB