Hostname: page-component-89b8bd64d-mmrw7 Total loading time: 0 Render date: 2026-05-06T09:50:57.776Z Has data issue: false hasContentIssue false

Input-dependent noise can explain magnitude-sensitivity in optimalvalue-based decision-making

Published online by Cambridge University Press:  01 January 2023

Angelo Pirrone*
Affiliation:
Centre for Philosophy of Natural and Social Science, London School of Economics and Political Science, London, UK
Andreagiovanni Reina*
Affiliation:
IRIDIA, Université Libre de Bruxelles, Belgium, and Department of Computer Science, University of Sheffield, Sheffield, UK
Fernand Gobet*
Affiliation:
Centre for Philosophy of Natural and Social Science, London School of Economics and Political Science, London, UK
Rights & Permissions [Opens in a new window]

Abstract

Recent work has derived the optimal policy for two-alternative value-baseddecisions, in which decision-makers compare the subjective expected reward oftwo alternatives. Under specific task assumptions — such as linearutility, linear cost of time and constant processing noise — the optimalpolicy is implemented by a diffusion process in which parallel decisionthresholds collapse over time as a function of prior knowledge about averagereward across trials. This policy predicts that the decision dynamics of eachtrial are dominated by the difference in value between alternatives and areinsensitive to the magnitude of the alternatives (i.e., their summed values).This prediction clashes with empirical evidence showing magnitude-sensitivityeven in the case of equal alternatives, and with ecologically plausible accountsof decision making. Previous work has shown that relaxing assumptions aboutlinear utility or linear time cost can give rise to optimal magnitude-sensitivepolicies. Here we question the assumption of constant processing noise, infavour of input-dependent noise. The neurally plausible assumption ofinput-dependent noise during evidence accumulation has received strong supportfrom previous experimental and modelling work. We show that includinginput-dependent noise in the evidence accumulation process results in amagnitude-sensitive optimal policy for value-based decision-making, even in thecase of a linear utility function and a linear cost of time, for both single(i.e., isolated) choices and sequences of choices in which decision-makersmaximise reward rate. Compared to explanations that rely on non-linear utilityfunctions and/or non-linear cost of time, our proposed account ofmagnitude-sensitive optimal decision-making provides a parsimonious explanationthat bridges the gap between various task assumptions and between various typesof decision making.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
The authors license this article under the terms of the Creative Commons Attribution 3.0 License.
Copyright
Copyright © The Authors [2021] This is an Open Access article, distributed under the terms of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Figure 0

Figure 1: Optimal policy for binary value-based decision-making with input-dependent noise. The policy determines when an optimal decision-maker should choose an option: decision-makers continue to accumulate evidence until a decision boundary is reached and a decision is made. In the top row, the two panels show two representative sampling trajectories for equal alternatives with low (left) and high (right) magnitude conditions. The panels below show the time course for the low magnitude condition, in (A) to (C), and for the high magnitude condition, in (D) and (E). Both trajectories and collapsing boundaries are colour-coded, representing time (top legend). With input-dependent noise, the size of the random fluctuations varies with the input magnitude, therefore the high-magnitude conditions have on average larger fluctuations that hit a decision boundary faster compared to the low-magnitude conditions (0.8 s, compared to 2 s). In the absence of input-dependent noise, low and high-magnitude conditions would be indistinguishable and reach a boundary in the same time, exhibiting magnitude-insensitivity.

Figure 1

Figure 2: Results from stochastic simulations for a single choice: input-dependent noise can explain magnitude-sensitive optimal policies. Φ quantifies the strength of the input-dependent noise. The figure shows mean reaction time as a function of the magnitude of equal alternatives (the bars are 95% confidence intervals). When Φ=0, the magnitude-insensitive optimal policy is derived (Tajima et al., 2016). This figure shows magnitude-sensitive optimal reaction times for a single choice (i.e., expected reward for each individual choice is maximised) as a function of input-dependent noise and magnitude of the stimuli.

Figure 2

Figure 3: Results from stochastic simulations for a sequence of choices: input-dependent noise can explain magnitude-sensitive optimal policies. This figure shows magnitude-sensitive optimal reaction times for a sequence of choices (i.e., total expected reward within a fixed time period is maximised) as a function of input-dependent noise and magnitude of the stimuli.