Skip to main content
    • Aa
    • Aa
  • Get access
    Check if you have access via personal or institutional login
  • Cited by 79
  • Cited by
    This article has been cited by the following publications. This list is generated based on data provided by CrossRef.

    Cariani, Fabrizio and Rips, Lance J. 2016. Conditionals, Context, and the Suppression Effect. Cognitive Science, p. n/a.

    Danks, David 2016. A Companion to Experimental Philosophy.

    Elqayam, Shira and Over, David E. 2016. Editorial: From Is to Ought: The Place of Normative Models in the Study of Human Thought. Frontiers in Psychology, Vol. 7,

    Glassen, Thomas and Nitsch, Verena 2016. Hierarchical Bayesian models of cognitive development. Biological Cybernetics, Vol. 110, Issue. 2-3, p. 217.

    Hlobil, Ulf 2016. Chains of Inferences and the New Paradigm in the Psychology of Reasoning. Review of Philosophy and Psychology, Vol. 7, Issue. 1, p. 1.

    Irvine, Elizabeth 2016. Model-Based Theorizing in Cognitive Neuroscience. The British Journal for the Philosophy of Science, Vol. 67, Issue. 1, p. 143.

    Kharratzadeh, Milad and Shultz, Thomas 2016. Neural implementation of probabilistic models of cognition. Cognitive Systems Research, Vol. 40, p. 99.

    Maes, Pieter-Jan 2016. Sensorimotor Grounding of Musical Embodiment and the Role of Prediction: A Review. Frontiers in Psychology, Vol. 7,

    Martin, Andrea E. 2016. Language Processing as Cue Integration: Grounding the Psychology of Language in Perception and Neurophysiology. Frontiers in Psychology, Vol. 7,

    McCormack, Teresa Bramley, Neil Frosch, Caren Patrick, Fiona and Lagnado, David 2016. Children’s use of interventions to learn causal structure. Journal of Experimental Child Psychology, Vol. 141, p. 1.

    Spence, Charles 2016. Oral referral: On the mislocalization of odours to the mouth. Food Quality and Preference, Vol. 50, p. 117.

    Tentori, Katya Chater, Nick and Crupi, Vincenzo 2016. Judging the Probability of Hypotheses Versus the Impact of Evidence: Which Form of Inductive Inference Is More Accurate and Time-Consistent?. Cognitive Science, Vol. 40, Issue. 3, p. 758.

    Trapp, Sabrina Lepsien, Jöran Kotz, Sonja A. and Bar, Moshe 2016. Prior probability modulates anticipatory activity in category-specific areas. Cognitive, Affective, & Behavioral Neuroscience, Vol. 16, Issue. 1, p. 135.

    Wyart, Valentin and Koechlin, Etienne 2016. Choice variability and suboptimality in uncertain environments. Current Opinion in Behavioral Sciences, Vol. 11, p. 109.

    Zednik, Carlos and Jäkel, Frank 2016. Bayesian reverse-engineering considered as a research strategy for cognitive science. Synthese,

    Barth, Hilary Lesser, Ellen Taggart, Jessica and Slusser, Emily 2015. Spatial estimation: a non-Bayesian alternative. Developmental Science, Vol. 18, Issue. 5, p. 853.

    Bechtel, William and Shagrir, Oron 2015. The Non-Redundant Contributions of Marr's Three Levels of Analysis for Explaining Information-Processing Mechanisms. Topics in Cognitive Science, Vol. 7, Issue. 2, p. 312.

    Colombo, Matteo and Hartmann, Stephan 2015. Bayesian Cognitive Science, Unification, and Explanation. The British Journal for the Philosophy of Science, p. axv036.

    Cooper, Richard P. and Peebles, David 2015. Beyond Single-Level Accounts: The Role of Cognitive Architectures in Cognitive Scientific Explanation. Topics in Cognitive Science, Vol. 7, Issue. 2, p. 243.

    Eliasmith, Chris and Kolbeck, Carter 2015. Marr's Attacks: On Reductionism and Vagueness. Topics in Cognitive Science, Vol. 7, Issue. 2, p. 323.


Bayesian Fundamentalism or Enlightenment? On the explanatory status and theoretical contributions of Bayesian models of cognition

  • Matt Jones (a1) and Bradley C. Love (a2)
  • DOI:
  • Published online: 25 August 2011

The prominence of Bayesian modeling of cognition has increased recently largely because of mathematical advances in specifying and deriving predictions from complex probabilistic models. Much of this research aims to demonstrate that cognitive behavior can be explained from rational principles alone, without recourse to psychological or neurological processes and representations. We note commonalities between this rational approach and other movements in psychology – namely, Behaviorism and evolutionary psychology – that set aside mechanistic explanations or make use of optimality assumptions. Through these comparisons, we identify a number of challenges that limit the rational program's potential contribution to psychological theory. Specifically, rational Bayesian models are significantly unconstrained, both because they are uninformed by a wide range of process-level data and because their assumptions about the environment are generally not grounded in empirical measurement. The psychological implications of most Bayesian models are also unclear. Bayesian inference itself is conceptually trivial, but strong assumptions are often embedded in the hypothesis sets and the approximation algorithms used to derive model predictions, without a clear delineation between psychological commitments and implementational details. Comparing multiple Bayesian models of the same task is rare, as is the realization that many Bayesian models recapitulate existing (mechanistic level) theories. Despite the expressive power of current Bayesian models, we argue they must be developed in conjunction with mechanistic considerations to offer substantive explanations of cognition. We lay out several means for such an integration, which take into account the representations on which Bayesian inference operates, as well as the algorithms and heuristics that carry it out. We argue this unification will better facilitate lasting contributions to psychological theory, avoiding the pitfalls that have plagued previous theoretical movements.

Linked references
Hide All

This list contains references from the content that can be linked to their source. For a full set of references and notes please see the PDF or HTML where available.

J. R. Anderson (1991b) The adaptive nature of human categorization. Psychological Review 98:409–29.

J. R. Anderson , D. Bothell , C. Lebiere & M. Matessa (1998) An integrated theory of list memory. Journal of Memory and Language 38:341–80.

J. R. Anderson & L. J. Schooler (1991) Reflections of the environment in memory. Psychological Science 2:396408.

C. L. Baker , R. Saxe & J. B. Tenenbaum (2009) Action understanding as inverse planning. Cognition 113:329–49.

J. M. Beck , W. J. Ma , R. Kiani , T. Hanks , A. K. Churchland , J. Roitman , M. N. Shadlen , P. E. Latham & A. Pouget (2008) Probabilistic population codes for Bayesian decision making. Neuron 60:1142–52.

D. F. Bjorklund & A. D. Pellegrini (2000) Child development and evolutionary psychology. Child Development 71:1607–708.

L. Boucher , T. J. Palmeri , G. D. Logan & J. D. Schall (2007) Inhibitory control in mind and brain: An interactive race model of countermanding saccades. Psychological Review 114:376–97.

S. D. Brown & M. Steyvers (2009) Detecting and predicting changes. Cognitive Psychology 58:4967.

D. M. Buss , M. G. Haselton , T. K. Shackelford , A. L. Bleske & J. C. Wakefield (1998) Adaptations, exaptations, and spandrels. American Psychologist 53:533–48.

A. Caramazza & J. R. Shelton (1998) Domain-specific knowledge systems in the brain: The animate-inanimate distinction. Journal of Cognitive Neuroscience 10:134.

N. Chater & C. D. Manning (2006) Probabilistic models of language processing and acquisition. Trends in Cognitive Sciences 10:335–44.

N. Chater & M. Oaksford (1999) The probability heuristics model of syllogistic reasoning. Cognitive Psychology 38:191258.

N. Chater & M. Oaksford (2008) The probabilistic mind: Prospects for a Bayesian cognitive science. In: The probabilistic mind: Prospects for rational models of cognition, ed. M. Oaksford & N. Chater , p. 331. Oxford University Press.

N. Chater , M. Oaksford , R. Nakisa & M. Redington (2003) Fast, frugal, and rational: How rational norms explain behavior. Organizational Behavior and Human Decision Processes 90:6386.

N. Chater , F. Reali & M. H. Christiansen (2009) Restrictions on biological adaptation in language evolution. Proceedings of the National Academy of Sciences USA 106:1015–20.

N. Chater , J. Tenenbaum & A. Yuille (2006) Probabilistic models of cognition: Conceptual foundations. Trends in Cognitive Sciences 10(7):287–91.

N. Chomsky (1959) A review of B. F. Skinner's Verbal Behavior. Language 35:2658.

M. W. Clearfield , E. Dineva , L. B. Smith , F. J. Diedrich & E. Thelen (2009) Cue salience and infant perseverative reaching: Tests of the dynamic field theory. Developmental Science 12:2640.

E. Colunga & L. Smith (2005) From the lexicon to expectations about kinds: A role for associative learning. Psychological Review 112(2):347–82.

C. Conati , A. Gertner , K. VanLehn & M. Druzdzel (1997) On-line student modeling for coached problem solving using Bayesian networks. In: User modeling: Proceedings of the Sixth International Conference, UM97, Berlin, 1997, pp. 231–42, ed. A. Jameson , C. Paris & C. Tasso . Springer.

G. S. Cree & K. McRae (2003) Analyzing the factors underlying the structure and computation of the meaning of chipmunk, cherry, chisel, cheese, and cello (and many other such concrete nouns). Journal of Experimental Psychology: General 132:163201.

F. Crick (1989) The recent excitement about neural networks. Nature 337:129–32.

D. Danks (2008) Rational analyses, instrumentalism, and implementations. In: The probabilistic mind: Prospects for Bayesian cognitive science, ed. M. Oaksford & N. Chater , p. 5975. Oxford University Press.

N. D. Daw , A. C. Courville & P. Dayan (2008) Semi-rational models: The case of trial order. In: The probabilistic mind: Prospects for rational models of cognition, ed. M. Oaksford & N. Chater , p. 431–52. Oxford University Press.

N. D. Daw , Y. Niv & P. Dayan (2005) Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control. Nature Neuroscience 8:1704–11.

S. Denève (2008) Bayesian spiking neurons. I: Inference. Neural Computation 20:91117.

J. T. Devlin , L. M. Gonnerman , E. S. Andersen & M. S. Seidenberg (1998) Category-specific semantic deficits in focal and widespread brain damage: A computational account. Journal of Cognitive Neuroscience 10:7794.

A. M. Dickinson (2000) The historical roots of organizational behavior management in the private sector: The 1950s–1980s. Journal of Organizational Behavior Management 20(3/4): 958.

B. B. Doll , W. J. Jacobs , A. G. Sanfey & M. J. Frank (2009) Instructional control of reinforcement learning: A behavioral and neurocomputational investigation. Brain Research 1299:7494.

A. Doucet , S. Godsill & C. Andrieu (2000) On sequential Monte Carlo sampling methods for Bayesian filtering. Statistics and Computing 10:197208.

F. W. Dyson , A. S. Eddington & C. Davidson (1920) A determination of the deflection of light by the sun's gravitational field, from observations made at the total eclipse of May 29, 1919. Philosophical Transactions of the Royal Society of London, Series A: Mathematical, Physical and Engineering Sciences 220:291333.

T. Ein-Dor , M. Mikulincer , G. Doron & P. R. Shaver (2010) The attachment paradox: How can so many of us (the insecure ones) have no adaptive advantages? Perspectives on Psychological Science 5(2):123–41.

A. Einstein (1916) Die Grundlage der allgemeinen Relativitätstheorie [The foundation of the generalized theory of relativity]. Annalen der Physik 354(7):769822.

J. L. Elman (1990) Finding structure in time. Cognitive Science 14:179211.

J. L. Elman (1993) Learning and development in neural networks: The importance of starting small. Cognition 48:7199.

J. Engelfriet & G. Rozenberg (1997) Node replacement graph grammars. In: Handbook of graph grammars and computing by graph transformation, vol. 1, ed. G. Rozenberg , p. 194. World Scientific.

W. K. Estes (1957) Theory of learning with constant, variable, or contingent probabilities of reinforcement. Psychometrika 22:113–32.

B. Fitelson (1999) The plurality of Bayesian measures of confirmation and the problem of measure sensitivity. Philosophy of Science 66:362–78.

J. A. Fodor & Z. Pylyshyn (1988) Connectionism and cognitive architecture: A critical analysis. Cognition 28: 371.

M. J. Frank , L. Seeberger & R. C. O'Reilly (2004) By carrot or by stick: Cognitive reinforcement learning in Parkinsonism. Science 306:1940–43.

W. S. Geisler & R. L. Diehl (2003) A Bayesian approach to the evolution of perceptual and cognitive systems. Cognitive Science 27:379402.

W. S. Geisler , J. S. Perry , B. J. Super & D. P. Gallogly (2001) Edge co-occurrence in natural images predicts contour grouping performance. Vision Research 41:711–24.

S. Geman , E. Bienenstock & R. Doursat (1992) Neural networks and the bias/variance dilemma. Neural Computation 4:158.

D. Gentner (1983) Structure-mapping: A theoretical framework for analogy. Cognitive Science 7:155–70.

D. Gentner , S. Brem , R. W. Ferguson , A. B. Markman , B. B. Levidow , P. Wolff & K. D. Forbus (1997) Analogical reasoning and conceptual change: A case study of Johannes Kepler. Journal of the Learning Sciences 6(1):340.

G. Gigerenzer & H. Brighton (2009) Homo heuristicus: Why biased minds make better inferences. Topics in Cognitive Science 1:107–43.

J. I. Gold & M. N. Shadlen (2001) Neural computations that underlie decisions about sensory stimuli. Trends in Cognitive Sciences 5:1016.

N. D. Goodman , J. B. Tenenbaum , J. Feldman & T. L. Griffiths (2008b) A rational analysis of rule-based concept learning. Cognitive Science 32(1):108–54.

S. J. Gould & R. Lewontin (1979) The spandrels of San Marco and the Panglossian paradigm: A critique of the adaptationist programme. Proceedings of the Royal Society of London Series B: Biological Sciences 205:581–98.

T. L. Griffiths , A. N. Sanborn , K. R. Canini & D. J. Navarro (2008b) Categorization as nonparametric Bayesian density estimation. In: The probabilistic mind: Prospects for rational models of cognition, ed. M. Oaksford & N. Chater . Oxford University Press.

T. L. Griffiths , M. Steyvers & J. B. Tenenbaum (2007) Topics in semantic representation. Psychological Review 114:211–44.

T. L. Griffiths & J. B. Tenenbaum (2006) Optimal predictions in everyday cognition. Psychological Science 17(9):767–73.

T. L. Griffiths & J. B. Tenenbaum (2009) Theory-based causal induction. Psychological Review 116:661716.

N. Guttman & H. I. Kalish (1956) Discriminability and stimulus generalization. Journal of Experimental Psychology 51:7988.

W. D. Hamilton (1964) The genetical theory of social behavior. Journal of Theoretical Biology 7:152.

W. K. Hastings (1970) Monte Carlo sampling methods using Markov chains and their applications. Biometrika 57:97109.

R. M. Hogarth & N. Karelaia (2005) Ignoring information in binary choice with continuous variables: When is less “more”? Journal of Mathematical Psychology 49:115–24.

J. Horgan (1999) The undiscovered mind: How the human brain defies replication, medication, and explanation. Psychological Science 10:470–74.

K. Hornik , M. Stinchcombe & H. White (1989) Multilayer feedforward networks are universal approximators. Neural Networks 2:359–66.

D. E. Huber , R. M. Shiffrin , K. B. Lyle & K. I. Ruys (2001) Perception and preference in short-term word priming. Psychological Review 108:149–82.

J. E. Hummel & I. Biederman (1992) Dynamic binding in a neural network for shape recognition. Psychological Review 99:480517.

E. T. Jaynes (1968) Prior probabilities. IEEE Transactions on Systems Science and Cybernetics 4:227–41.

H. Jeffreys (1946) An invariant form for the prior probability in estimation problems. Proceedings of the Royal Society of London Series A: Mathematical, Physical and Engineering Sciences 186:453–61.

M. F. Joanisse & M. S. Seidenberg (1999) Impairments in verb morphology after brain injury: A connectionist model. Proceedings of the National Academy of Sciences USA 96:7592–97.

M. F. Joanisse & M. S. Seidenberg (2003) Phonology and syntax in specific language impairment: Evidence from a connectionist model. Brain and Language 86:4056.

M. Jones & J. Zhang (2003) Which is to blame: Instrumental rationality, or common knowledge? Behavioral and Brain Sciences 26:166–67.

H. Köver , S. Bao (2010) Cortical plasticity as a mechanism for storing Bayesian priors in sensory perception. PLoS ONE 5(5):e10497.

M. D. Lee & B. W. Sarnecka (2010) A model of knower-level behavior in number-concept development. Cognitive Science 34:5167.

B. C. Love (2002) Comparing supervised and unsupervised category learning. Psychonomic Bulletin and Review 9:829–35.

G. F. Marcus (1998) Rethinking eliminative connectionism. Cognitive Psychology 37:243–82.

A. B. Markman & B. H. Ross (2003) Category use and category learning. Psychological Bulletin 129:592615.

D. Marr (1982) Vision: A computational investigation into the human representation and processing of visual information. W. H. Freeman.

J. M. McNamara & A. I. Houston (2009) Integrating function and mechanism. Trends in Ecology and Evolution 24:670–75.

E. K. Miller & J. D. Cohen (2001) An integrative theory of prefrontal cortex function. Annual Review of Neuroscience 24:167202.

G. A. Miller (2003) The cognitive revolution: A historical perspective. Trends in Cognitive Sciences 7:141–44.

D. Mortimer , J. Feldner , T. Vaughan , I. Vetter , Z. Pujic , W. J. Rosoff , K. Burrage , P. Dayan , L. J. Richards & G. J. Goodhill (2009) Bayesian model predicts the response of axons to molecular gradients. Proceedings of the National Academy of Sciences USA 106:10296–301.

M. C. Mozer , H. Pashler & H. Homaei (2008) Optimal predictions in everyday cognition: The wisdom of individuals or crowds? Cognitive Science 32:1133–47.

G. L. Murphy (1993) A rational theory of concepts. Psychology of Learning and Motivation 29:327–59.

E. L. Newport (1990) Maturational constraints on language learning. Cognitive Science 14:1128.

R. M. Nosofsky , T. J. Palmeri & S. C. Mckinley (1994) Rule-plus-exception model of classification learning. Psychological Review 104:266300.

M. Oaksford & N. Chater (1994) A rational analysis of the selection task as optimal data selection. Psychological Review 101:608–31.

M. Oaksford & N. Chater (2007) Bayesian rationality: The probabilistic approach to human reasoning. Oxford University Press.

S. Pinker & A. Prince (1988) On language and connectionism: Analysis of a parallel distributed processing model of language acquisition. Cognition 28:73193.

M. A. Pitt , I. J. Myung & S. Zhang (2002) Toward a method of selecting among computational models of cognition. Psychological Review 109:472–91.

D. C. Plaut , J. L. McClelland , M. S. Seidenberg & K. Patterson (1996) Understanding normal and impaired word reading: Computational principles in quasi-regular domains. Psychological Review 103:56115.

J. B. Pollack (1990) Recursive distributed representations. Artificial Intelligence 46:77105.

E. M. Pothos & N. Chater (2002) A simplicity principle in unsupervised human categorization. Cognitive Science 26:303–43.

M. Ramscar , D. Yarlett , M. Dye , K. Denny & K. Thorpe (2010) The effects of feature-label-order and their implications for symbolic learning. Cognitive Science 34:149.

G. Ricci & T. Levi-Civita (1900) Méthodes de calcul différentiel absolu et leurs applications [Methods of absolute differential calculus and their applications]. Mathematische Annalen 54(1–2):125201.

N. P. Rougier , D. Noelle , T. S. Braver , J. D. Cohen & R. C. O'Reilly (2005) Prefrontal cortex and the flexibility of cognitive control: Rules without symbols. Proceedings of the National Academy of Sciences USA 102:7338–43.

D. E. Rumelhart , G. E. Hinton & R. J. Williams (1986) Learning representations by back-propagating errors. Nature 323:533–36.

Y. Sakamoto , M. Jones & B. C. Love (2008) Putting the psychology back into psychological models: Mechanistic versus rational approaches. Memory and Cognition 36(6):1057–65.

A. N. Sanborn , T. L. Griffiths & D. J. Navarro (2010a) Rational approximations to rational models: Alternative algorithms for category learning. Psychological Review 117:1144–67.

A. N. Sanborn , T. L. Griffiths & R. M. Shiffrin (2010b) Uncovering mental representations with Markov chain Monte Carlo. Cognitive Psychology 60:63106.

G. E. Schwarz (1978) Estimating the dimension of a model. Annals of Statistics 6(2):461–64.

P. Shafto , C. Kemp , V. M. Mansinghka & J. B. Tenenbaum (2011) A probablistic model of cross-categorization. Cognition. 120:125.

B. F. Skinner (1957) Verbal behavior. Appleton-Century-Crofts.

B. F. Skinner (1958) Reinforcement today. American Psychologist 13:9499.

S. A. Sloman & P. M. Fernbach (2008) The value of rational analysis: An assessment of causal reasoning and learning. In: The probabilistic mind: Prospects for rational models of cognition, ed. N. Chater & M Oaksford , p. 485500. Oxford University Press.

D. L. Smith (2007) Beyond Westemarck: Can shared mothering or maternal phenotype matching account for incest avoidance? Evolutionary Psychology 5:202–22.

L. B. Smith , S. S. Jones , B. Landau , L. Gershkoff-Stowe & L. Samuelson (2002) Object name learning provides on-the-job training for attention. Psychological Science 13:1319.

P. K. Smith (1982) Does play matter? Functional and evolutionary aspects of animal and human play. Behavioral and Brain Sciences 5:139–84.

P. Smolensky (1988) On the proper treatment of connectionism. Behavioral and Brain Sciences 11:123.

D. M. Sobel , J. B. Tenenbaum & A. Gopnik (2004) Children's causal inferences from indirect evidence: Backwards blocking and Bayesian reasoning in preschoolers. Cognitive Science 28(3):303–33.

A. Soltani & X.-J. Wang (2010) Synaptic computation underlying probabilistic inference. Nature Neuroscience 13(1):112–19.

D. Sperber & L. A. Hirschfeld (2003) The cognitive foundations of cultural stability and diversity. Trends in Cognitive Sciences 8:4046.

M. Steyvers , M. D. Lee & E.-J. Wagenmakers (2009) A Bayesian analysis of human decision-making on bandit problems. Journal of Mathematical Psychology 53:168–79.

M. Steyvers , J. B. Tenenbaum , E.-J. Wagenmakers & B. Blum (2003) Inferring causal networks from observations and interventions. Cognitive Science 27:453–89.

J. B. Tenenbaum & T. L. Griffiths (2001) Generalization, similarity, and Bayesian inference. Behavioral and Brain Sciences 24(4):629–40.

J. B. Tenenbaum , T. L. Griffiths & C. Kemp (2006) Theory-based Bayesian models of inductive learning and reasoning. Trends in Cognitive Sciences 10:309–18.

P. Thagard (1989) Explanatory coherence. Behavioral and Brain Sciences 12:435502.

A. Tversky & D. Kahneman (1974) Judgment under uncertainty: Heuristics and biases. Science 185:1124–31.

J. B. Watson (1913) Psychology as the behaviorist views it. Psychological Review 20:158–77.

J. N. Wood & J. Grafman (2003) Human prefrontal cortex: Processing and representational perspectives. Nature Reviews: Neuroscience 4:129–47.

F. Xu & J. B. Tenenbaum (2007b) Word learning as Bayesian inference. Psychological Review 114(2):245–72.

T. Yamauchi & A. B. Markman (1998) Category learning by inference and classification. Journal of Memory and Language 39:124–48.

Recommend this journal

Email your librarian or administrator to recommend adding this journal to your organisation's collection.

Behavioral and Brain Sciences
  • ISSN: 0140-525X
  • EISSN: 1469-1825
  • URL: /core/journals/behavioral-and-brain-sciences
Please enter your name
Please enter a valid email address
Who would you like to send this to? *