Hostname: page-component-77f85d65b8-6bnxx Total loading time: 0 Render date: 2026-03-27T21:23:41.359Z Has data issue: false hasContentIssue false

Do people exploit risk–reward structures to simplify information processing in risky choice?

Published online by Cambridge University Press:  17 January 2025

Christina Leuker*
Affiliation:
Max Planck Institute for Human Development, Berlin, Germany
Thorsten Pachur
Affiliation:
Max Planck Institute for Human Development, Berlin, Germany
Ralph Hertwig
Affiliation:
Max Planck Institute for Human Development, Berlin, Germany
Timothy J. Pleskac
Affiliation:
Max Planck Institute for Human Development, Berlin, Germany University of Kansas Max Planck Institute for Human Development, Berlin, Germany
Rights & Permissions [Opens in a new window]

Abstract

The high rewards people desire are often unlikely. Here, we investigated whether decision-makers exploit such ecological correlations between risks and rewards to simplify their information processing. In a learning phase, participants were exposed to options in which risks and rewards were negatively correlated, positively correlated, or uncorrelated. In a subsequent risky choice task, where the emphasis was on making either a ‘fast’ or the ‘best’ possible choice, participants’ eye movements were tracked. The changes in the number, distribution, and direction of eye fixations in ‘fast’ trials did not differ between the risk–reward conditions. In ‘best’ trials, however, participants in the negatively correlated condition lowered their evidence threshold, responded faster, and deviated from expected value maximization more than in the other risk–reward conditions. The results underscore how conclusions about people’s cognitive processing in risky choice can depend on risk–reward structures, an often neglected environmental property.

Information

Type
Original Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
Copyright
Copyright © The Author(s) 2019, corrected publication 2019
Figure 0

Fig. 1 Experimental setup. a Learning phase gambles were drawn from one of three risk–reward environments. Each dot represents one learning phase gamble. b Participants learned about one of these risk–reward environments incidentally, from pricing gambles. c Participants completed a choice phase where they chose among two nondominated gambles while their eye movements were tracked. The choice phase consisted of test gambles (60%) that were identical across risk–reward conditions and were used to model the data and environment gambles (40%) that were used to reinforce the previously learned risk–reward structure. Eye-tracking analyses are based on the first screen in the choice phase, before participants indicated their choice. ‘Best’ (no time pressure) and ‘fast’ choices were made in interleaved blocks with 16 choices each

Figure 1

Table 1 Overview of regression models for processing and choice. Reference group set for environment: “uncorrelated”. Models included a random effect for “participant.” Coefficients are the mean and the 95% credible intervals of the posterior distributions. Credible differences in bold

Figure 2

Fig. 2 Behavioral (a, b) and eyetracking (c, d, e) results of the choice task. Colors indicate means and 95% highest density intervals of the posterior predictive distributions. Black triangles/circles indicate means. Posterior predictive distributions are based on a model accounting for individual variation and EV differences. Posterior predictive distributions in a are based on the median EV difference in the experiment (7E$), and across EV differences in the other panels. Dashed lines in c indicate the average number of areas of interest (AOIs) that can be inspected (2.5). Dashed line in e indicates an equal distribution of gaze to both payoffs and probabilities

Figure 3

Fig. 3 Depiction of the DDM and conditions for which parameters were estimated. We set the bias parameter z to .5 because participants cannot be biased to either the higher or lower EV option before inspecting it. In the best-fitting DDM (see Table 2), the drift rate was a function of individual differences in ability/effort to detect the higher EV option (intercept), an EV coefficient βEV measuring how much the use of EV differences contributed to the drift rate, and a gaze coefficient βgaze measuring how much gaze differences or biases contribute to choice. In this model, the interaction between value and gaze, βEV×gaze, was 0

Figure 4

Table 2 Deviance information criteria (DIC) for four different formalizations of the drift-diffusion model using the gamble with the higher expected value at the threshold. Lower DIC indicates better fit. Best-fitting model (DDM 3) in bold

Figure 5

Fig. 4 Parameter estimates for the winning model (additive value and gaze). Group means and 95% highest density intervals of the posterior distributions

Supplementary material: File

Leuker et al. supplementary material

Leuker et al. supplementary material
Download Leuker et al. supplementary material(File)
File 594.2 KB