Hostname: page-component-89b8bd64d-z2ts4 Total loading time: 0 Render date: 2026-05-08T09:30:03.855Z Has data issue: false hasContentIssue false

Base rate neglect and conservatism in probabilistic reasoning: Insights from eliciting full distributions

Published online by Cambridge University Press:  01 January 2023

Piers Douglas Lionel Howe
Affiliation:
School of Psychological Sciences, University of Melbourne. Email: pdhowe@unimelb.edu.au.
Andrew Perfors
Affiliation:
School of Psychological Sciences, University of Melbourne. Email: andrew.perfors@unimelb.edu.au.
Bradley Walker
Affiliation:
School of Psychological Science, University of Western Australia. Email: bradley.walker@uwa.edu.au.
Yoshihisa Kashima
Affiliation:
School of Psychological Sciences, University of Melbourne. Email: ykashima@unimelb.edu.au.
Nicolas Fay
Affiliation:
School of Psychological Science, University of Western Australia. Email: nicolas.fay@uwa.edu.au.
Rights & Permissions [Opens in a new window]

Abstract

Bayesian statistics offers a normative description for how a person should combine their original beliefs (i.e., their priors) in light of new evidence (i.e., the likelihood). Previous research suggests that people tend to under-weight both their prior (base rate neglect) and the likelihood (conservatism), although this varies by individual and situation. Yet this work generally elicits people’s knowledge as single point estimates (e.g., x has a 5% probability of occurring) rather than as a full distribution. Here we demonstrate the utility of eliciting and fitting full distributions when studying these questions. Across three experiments, we found substantial variation in the extent to which people showed base rate neglect and conservatism, which our method allowed us to measure for the first time simultaneously at the level of the individual. While most people tended to disregard the base rate, they did so less when the prior was made explicit. Although many individuals were conservative, there was no apparent systematic relationship between base rate neglect and conservatism within each individual. We suggest that this method shows great potential for studying human probabilistic reasoning.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
The authors license this article under the terms of the Creative Commons Attribution 4.0 License.
Copyright
Copyright © The Authors [2022] This is an Open Access article, distributed under the terms of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Figure 0

Figure 1: The importance of distributional shape in Bayesian updating. The reasoners in both panels are perfect Bayesians who are estimating the probability of observing a certain outcome, such as pulling a red ball from an urn. Both have priors with the same expected value (single dots such that ) but different full prior distributions (dotted lines). The prior on the left, Beta(0.25,0.5), reflects initial beliefs that the urn has either mostly red balls or mostly blue balls (probably mostly blue). The one on the right, Beta(2,4), reflects the belief that there are slightly more blue balls. This difference in distributional shape has a strong effect on the posterior distributions that are inferred after seeing a single new data point corresponding to one red ball. Not only do the posteriors have different distributional shapes, the expected values (diamonds) are also different: the one on the left has an expected value of P(red) = 0.74 and the one on the right P(red) = 0.43. This shows that fully capturing Bayesian updating requires getting the shape of the underlying distribution right, not just accurately measuring the point estimate of the expected value of the prior.

Figure 1

Figure 2: A screenshot showing the methodology we used (in all experiments) for participants to report their probability distributions, similar to that of Goldstein and Rothschild (2014). People clicked on each bar to adjust its height. Clicking on a bar temporarily changed its colour to red. The different set of controls mentioned in the screenshot were a series of up and down buttons participants could press to adjust each slider. All probability distributions were constrained to sum to 100%.

Figure 2

Figure 3: Aggregate posterior estimates of the distribution of the probability that the bag contains a given proportion of red chips (y axis), for proportions ranging from 0% to 100% (x axis), for the two conditions in Experiment 1. Left panel. Posterior estimates after viewing five chips. The solid dark blue line reflects the aggregate posterior estimate of people in the Main condition after having seen five chips, while the dashed black line reflects the aggregate posterior estimate of those in the OnlyFive condition. Dark blue dots indicate individual estimates in the Main condition and light grey Xs indicate those in the OnlyFive condition. The aggregate posterior estimates are extremely similar in both conditions, with a mode at 80%, indicating that the posteriors are reasonable and that eliciting priors beforehand does not measurably change their behaviour. Right panel. Posterior estimates after viewing an unlimited number of chips. The solid light blue line reflects the aggregate posterior estimate of people in the Main condition after having seen unlimited chips, while the dashed black line reflects the aggregate posterior estimate of those in the OnlyUnlimited condition. Light blue dots indicate individual estimates in the Main condition and light grey Xs indicate those in the OnlyUnlimited condition. The aggregate posterior estimates are extremely similar in both conditions, with a mode at 80%. As before, this indicates that the posteriors are reasonable and that reporting their distributions multiple times does not change what the participants report.

Figure 3

Figure 4: Aggregate best-fit estimates in the Main condition in Experiment 1. The red lines depict the aggregate prior, the dark blue line (left panel) depicts the aggregate posterior after seeing five chips and the light blue line (right panel) depicts the aggregate posterior after seeing unlimited chips. In both panels, the grey line indicates the optimal Bayesian prediction (i.e., β = γ = 1) given the aggregate prior, while the black dotted line indicates the predicted posterior based on the inferred parameters β and γ. In both panels, the best fit β is zero, indicating that the aggregate posterior was best fit by completely disregarding the aggregate prior (i.e., complete base rate neglect). The best fit values for γ indicate a moderate degree of conservatism in both conditions.

Figure 4

Figure 5: Histograms showing the distribution of best-fit β and γ values across individuals after five and unlimited chips in the Main condition. The β distribution indicates that the majority of people showed a moderate or large amount of base rate neglect; their inferences were best described with β values less than one and often close to zero, indicating that the posterior distribution they reported was best explained by assuming that they disregarded their reported prior at least partially and often completely. There was a varied distribution of γ values, with about half showing conservative updating (γ<1, i.e., log(γ)<0).

Figure 5

Figure 6: Illustrative examples of individual distributions after receiving five chips (left) and unlimited chips (right). Data obtained from the Main condition in Experiment 1. In each plot, the red line is the reported prior, the dark and light blue lines are the reported posteriors after five and unlimited chips respectively, the grey line is the posterior obtained by an optimally calibrated Bayesian reasoner with that prior (β = γ = 1), and the dotted black line is the posterior obtained by the best-fit values of β and γ for that person. The grey label for each panel reports those values as well as the mean squared error of the fit (MSE, with 0 being perfect). The number in parenthesis is the participant ID.

Figure 6

Figure 7: A screenshot depicting the priors that participants saw in the two conditions of Experiment 2.

Figure 7

Figure 8: Reported distributions for the aggregate prior (red line) and the aggregate posterior in the Peaked (dark green line, left panel) and Uniform (light green line, right panel) conditions of Experiment 2. The grey line indicates the optimal Bayesian prediction given the aggregate prior, while the black dotted line indicates the prediction of the line of best fit based on the inferred parameters β and γ. The aggregate posterior is noticeably sharper in the Peaked condition, suggesting that people are using the base rate information to at least some extent. Consistent with this, β in the Peaked condition is 0.39, indicating that the aggregate posterior was best fit assuming that participants partially used the prior they were given: more than in Experiment 1, but less than an optimal Bayesian would (β=1). In both conditions, the value for γ indicates a moderate degree of conservatism on average.

Figure 8

Figure 9: Histograms showing the distribution of best-fit β and γ values across individuals in the Peaked and Uniform conditions in Experiment 2. Most people (although fewer than in Experiment 1) showed a moderate or large amount of base rate neglect and there was a varied distribution of γ values, with more people showing conservative updating in the Uniform condition. (N.B. β was not estimated in the Uniform condition as it was not defined in this condition.)

Figure 9

Figure 10: Reported prior distributions in Experiment 3. The solid red line reflects the aggregate of the priors reported in the Estimated condition, while for comparison the dashed black line reflects the prior shown to people in the Given condition. Red dots indicate individual estimates in the Estimated condition. The priors are extremely similar in both conditions, suggesting that any difference in posteriors is not due to differences in the prior.

Figure 10

Figure 11: Aggregate best-fit estimates in the Estimated (left panel) and Given (right panel) conditions of Experiment 3. For the Estimated condition, the red line depicts the reported aggregate prior and the dark purple line depicts the aggregate posterior. In the Given condition, the red line depicts the supplied prior and the light purple line depicts the aggregate posterior. The grey line indicates the optimal Bayesian prediction given the aggregate prior (left panel) or given prior (right panel), while the black dotted line indicates the prediction of the line of best fit based on the inferred parameters β and γ. The posterior distribution is unimodal in the Estimated condition but multimodal in the Given condition, suggesting that in the Given condition people are using the given prior, at least to some extent. Consistent with this, β in the Given condition is 0.32, indicating that the aggregate posterior was best fit assuming that participants partially used the prior they were given: more than in Experiment 1 and the Estimated condition where β was equal to zero, but less than an optimal Bayesian would (β=1). In both conditions, the value for γ indicates a moderate degree of conservatism, though the degree of conservatism is somewhat less in the Estimated condition.

Figure 11

Figure 12: Histograms showing the distribution of best-fit β and γ values across individuals in the Estimated and Given conditions in Experiment 3. The β distribution shows that, as in Experiment 1, most people in the Estimated condition showed a moderate or large amount of base rate neglect, but that this flipped in the Given condition. There was again a varied distribution of γ values, with many participants showing conservative updating (γ < 1, i.e., log(γ)<0).

Supplementary material: File

Howe et al. supplementary material
Download undefined(File)
File 3 MB