Hostname: page-component-6766d58669-r8qmj Total loading time: 0 Render date: 2026-05-20T23:10:25.602Z Has data issue: false hasContentIssue false

Modelling mood updating: a proof of principle study

Published online by Cambridge University Press:  13 December 2022

James E. Clark
Affiliation:
Translational and Clinical Research Institute, Newcastle University, UK
Stuart Watson*
Affiliation:
Translational and Clinical Research Institute, Newcastle University, UK; and Cumbria, Northumberland, Tyne and Wear NHS Foundation Trust, UK
*
Correspondence: Stuart Watson. Email: stuart.watson@newcastle.ac.uk
Rights & Permissions [Opens in a new window]

Abstract

Background

Recent developments in computational psychiatry have led to the hypothesis that mood represents an expectation (prior belief) on the likely interoceptive consequences of action (i.e. emotion). This stems from ideas about how the brain navigates its external world by minimising an upper bound on surprisal (free energy) of sensory information and echoes developments in other perceptual domains.

Aims

In this paper we aim to present a simple partial observable Markov decision process that models mood updating in response to stressful or non-stressful environmental fluctuations while seeking to minimise surprisal in relation to prior beliefs about the likely interoceptive signals experienced with specific actions (attenuating or amplifying stress and pleasure signals).

Method

We examine how, by altering these prior beliefs we can model mood updating in depression, mania and anxiety.

Results

We discuss how these models provide a computational account of mood and its related psychopathology and relate it to previous research in reward processing.

Conclusions

Models such as this can provide hypotheses for experimental work and also open up the potential modelling of predicted disease trajectories in individual patients.

Information

Type
Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
Copyright © The Author(s), 2022. Published by Cambridge University Press on behalf of the Royal College of Psychiatrists
Figure 0

Fig. 1 This figure shows the relationship among different states in the model. Observable states are shown in panels with solid lines, whereas hidden states are in panels with broken lines. The environment in this case (shown in the bottom right panel) is either stressful or non-stressful. The system (in this case modelling an agent attempting to infer the emotional content of the environment) will attempt to match their own internal (emotional) states to the environment. The environment generates observations that are either pleasurable (P –the sun icon) or stressful (S – the storm-cloud). The system must then use these observations to infer the state of the environment and will do so by minimising the difference between its expectations (the ‘mood’ of the system in our model) and the environment. The top panel is, therefore, the internal state (emotion) of the system at any point – again this is either stressful (ζ) or pleasurable (ρ). Matrix (a) is the likelihood matrix and shows the probability that observations are interpreted as stressful or pleasurable under the current internal state and the policy (αγ) being followed at the time. The system can transition to a different state or maintain its current state. Whether it does this or not is a function of the policy the system is following at any given time and the state at the previous time point. These probabilities are reflected in the transition probability matrix (b). The policy (or action) of the system is to either amplify or attenuate stress signals according to the optimality function where value is equal to inverse surprisal or model evidence. This is reflected in the reward matrix (c) and highlights the fact that Bellman optimality is a special case of free-energy minimisation. The probability of an observation is conditional on the state of the system and its current policy. The system can either wait, minimise or amplify stress/pleasure signals. Our conjecture is that mood functions as a (hyper)prior distribution over the likely emotional outcomes of any given policy (action). This is best reflected in the probability values in matrix (b), and means that the most valuable policy is the one that minimises the difference between the expected and actual emotional states. This can be achieved either though attenuation or amplification of sensory signals, or by altering the system's own internal states (i.e. changing mood). We propose that a healthy system is relatively receptive to changes in the emotional content of the environment, whereas pathological mood states result in either policy failure or inappropriate policy that results in mood states resistant to environmental signals – as detailed in the main text.

Figure 1

Fig. 2 (a) This network shows how belief states about the stress content of the environment are updated in a healthy mood state. Arrows indicate transitions between belief states based on the type of signals the agent observes. The coloured edges of each node represent the probability that the environment is stressful (dark blue portion) or non-stressful (light blue portion). Text within each node represents the optimal action that the agent will take given the current belief state. (b) This figure shows a frequency density plot of the probability that an event is decided to be stressful by the agent in healthy mood updating. Note that there is roughly equal density shared between being certain an event is non-stressful and certain an event is stressful. The key to the healthy mood network, therefore, is an ability to transition between mood states and a resistance to uncertainty about outcomes of action (the region in the middle of the distribution). AmSS, amplify stress signals; AtSS, attenuate stress signals; AmPS, amplify pleasure signals; AtPS, attenuate pleasure signals.

Figure 2

Fig. 3 (a) This network shows how belief states about the stress content of the environment are updated in a depressed mood state. Arrows indicate transitions between belief states based on the type of signals the agent observes. The coloured edges of each node represent the probability that the environment is stressful (dark blue portion) or non-stressful (light blue portion). Text within each node represents the optimal action that the agent will take given the current belief state. Note that if enough stress signals are received the agent becomes stuck in a loop in which belief states are constantly expecting a stressful environment and action is aimed at maintaining this belief, despite conflicting signals. (b) This figure shows a frequency density plot of the probability that an event is inferred to be stressful by the agent in depressed mood updating. In this case the distribution is skewed to the right (in contrast to Fig. 2(b)) indicating a much greater frequency of a stressful environment. AmSS, amplify stress signals; AtSS, attenuate stress signals; AmPS, amplify pleasure signals; AtPS, attenuate pleasure signals.

Figure 3

Fig. 4 (a) Bar graph showing mean probability in each mood state, across all belief states, that the agent thinks an event is stressful. The healthy mood state shows a more balanced probability, whereas in depression and anxiety the agent is more likely to believe events are stressful. In mania, events are more likely labelled as non-stressful. (b) Bar graph showing total expected reward from most valuable policy in each mood state. Depression and mania are associated with lower rewards, although not as low as the anxiety state. This is because reward in this context is framed in terms of minimising surprisal (maximising model evidence) that relies on certainty in belief states.

Figure 4

Fig. 5 (a) This network shows how belief states about the stress content of the environment are updated in a manic mood state. Arrows indicate transitions between belief states based on the type of signals the agent observes. The coloured edges of each node represent the probability that the environment is stressful (dark blue portion) or non-stressful (light blue portion). Text within each node represents the optimal action that the agent will take given the current belief state. Note that if enough pleasure signals are received the agent becomes stuck in a loop in which belief states are constantly expecting a non-stressful environment and action is aimed at maintaining this belief, despite conflicting signals. (b) This figure shows a frequency density plot of the probability that an event is inferred to be stressful by the agent in manic mood updating. In this case the distribution is skewed to the left (in contrast to Fig. 1(b)) indicating a much greater frequency of a non-stressful environment. AmSS, amplify stress signals; AtSS, attenuate stress signals; AmPS, amplify pleasure signals; AtPS, attenuate pleasure signals.

Figure 5

Fig. 6 (a) This network shows how belief states about the stress content of the environment are updated in the anxiety mood state. Arrows indicate transitions between belief states based on the type of signals the agent observes. The coloured edges of each node represent the probability that the environment is stressful (dark blue portion) or non-stressful (light blue portion). Text within each node represents the optimal action that the agent will take given the current belief state. In this case nodes are generally much more uncertain. Note that, unlike the other models, the agent attempts to amplify belief-consistent signals under uncertainty. Eventually, if enough stress signals are received, the agent becomes stuck in a node characterised by uncertainty about a stressful environment that is maintained whichever signals are received. (b) This figure shows a frequency density plot of the probability that an event is inferred to be stressful by the agent in anxious mood updating. In this case the distribution is quite normal with a peak at an uncertain belief in a stressful outcome. We propose this inability to resolve uncertainty is central to anxiety states. AmSS, amplify stress signals; AtSS, attenuate stress signals; AmPS, amplify pleasure signals; AtPS, attenuate pleasure signals.

Supplementary material: File

Clark and Watson supplementary material

Clark et al. supplementary material
Download Clark and Watson supplementary material(File)
File 34.3 KB

This journal is not currently accepting new eletters.

eLetters

No eLetters have been published for this article.