- Print publication year: 2014
- Online publication date: June 2014

- Publisher: Cambridge University Press
- DOI: https://doi.org/10.1017/CBO9781107338579.007
- pp 195-252

Abstract. The problem of replicating the flexibility of human common-sense reasoning has captured the imagination of computer scientists since the early days of Alan Turing's foundational work on computation and the philosophy of artificial intelligence. In the intervening years, the idea of cognition as computation has emerged as a fundamental tenet of Artificial Intelligence (AI) and cognitive science. But what kind of computation is cognition?

We describe a computational formalism centered around a probabilistic Turing machine called QUERY, which captures the operation of probabilistic conditioning via conditional simulation. Through several examples and analyses, we demonstrate how the QUERY abstraction can be used to cast common-sense reasoning as probabilistic inference in a statistical model of our observations and the uncertain structure of the world that generated that experience. This formulation is a recent synthesis of several research programs in AI and cognitive science, but it also represents a surprising convergence of several of Turing's pioneering insights in AI, the foundations of computation, and statistics.

§1. Introduction. In his landmark paper Computing Machinery and Intelligence [Tur50], Alan Turing predicted that by the end of the twentieth century, “general educated opinion will have altered so much that one will be able to speak of machines thinking without expecting to be contradicted.” Even if Turing has not yet been proven right, the idea of cognition as computation has emerged as a fundamental tenet of Artificial Intelligence (AI) and cognitive science. But what kind of computation—what kind of computer program—is cognition?

[AFR11] Proceedings of the 26th annual IEEE symposium on Logic in Computer Science (LICS 2011), 20ll, pp. 107-116. , , and , Noncomputable conditional distributions,

[And90] The adaptive character of thought, Erlbaum, Hillsdale, NJ, 1990. ,

[AB13] Computability and analysis: The legacy of Alan Turing, in this volume. and ,

[BJ03] Advances in Neural Information Processing Systems 15 (NIPS 2002) ( , , and , editors), The MIT Press, Cambridge, MA, 2003, pp. 1009-1016. and , Learning graphical models with Mercer kernels,

[Bak12] Bayesian theory of mind: Modeling human reasoning about beliefs, desires, goals, and social relations, Ph.D. thesis, Massachusetts Institute of Technology, 2012. ,

[BGT07] Proceedings of the 30th annual conference of the Cognitive Science Society, 2007, pp. 1447-1452. , , and , Theory-based social goal inference,

[BST09] Action understanding as inverse planning, Cognition, vol. 113 (2009), no. 3, pp. 329-349. , , and ,

[Bar98] Bayesian statistics 6: Proceedings of the sixth Valencia international meeting ( , , , and , editors), 1998, pp. 27-52. , Information-theoretic characterization of Bayes performance and the choice ofpriors in parametric and nonparametric problems,

[Be157] Dynamic programming, Princeton University Press, Princeton, NJ, 1957. ,

[B1a97] Domain representability of metric spaces, Annals of Pure and Applied Logic, vol. 83 (1997), no. 3, pp. 225-247. ,

[Cam11] Behavioral game theory: Experiments in strategic interaction, The Roundtable Series in Behavioral Economics, Princeton University Press, 2011. ,

[Car09] The origin of concepts, Oxford University Press, New York, 2009. ,

[CSH08] Proceedings of the twenty fourth conference on Uncertainty in Artificial Intelligence (UAI 2008) (Corvalis, Oregon), AUAI Press, 2008, pp. 70-78. , , and , Complexity of inference in graphical models,

[Coo90] The computational complexity of probabilistic inference using Bayesian belief networks, Artificial Intelligence, vol. 42 (1990), no. 2–3, pp. 393-405. ,

[Cop04] The essential Turing: Seminal writings in computing, logic, philosophy, artificial intelligence, and artificial life: Plus the secrets of enigma, Oxford University Press, Oxford, 2004. (editor),

[CP96] On Alan Turing's anticipation of connectionism, Synthese, vol. 108 (1996), no. 3, pp. 361-377. and ,

[DKLR00] An optimal algorithm for Monte Carlo estimation, SIAM Journal on Computing, vol. 29 (2000), no. 5, pp. 1484-1496. , , , and ,

[DL93] Approximating probabilistic inference in Bayesian belief networks is NP-hard, Artificial Intelligence, vol. 60 (1993), no. 1, pp. 141-153. and ,

[dMSS56] Automata Studies, Annals of Mathematical Studies, no. 34, Princeton University Press, Princeton, NJ, 1956, pp. 183-212. , , , and , Computability by probabilistic machines,

[DeG05] Optimal statistical decisions, Wiley Classics Library, Wiley, 2005. ,

[DWRT10] Advances in Neural Information Processing Systems 23 (NIPS 2010) ( , , , , and , editors), 2010, pp. 532-540. , , , and , Nonparametric Bayesian policy priors for reinforcement learning,

[Eda96] 11th annual IEEE symposium on Logic in Computer Science (LICS 1996), IEEE Computer Society Press, Los Alamitos, CA, 1996, pp. 372-381. , The Scott topology induces the weak topology,

[EH98] A computational model for metric spaces, Theoretical Computer Science, vol. 193 (1998), no. 1–2, pp. 53-73. and ,

[FG12] Predicting pragmatic reasoning in language games, Science, vol. 336 (2012), no. 6084, p. 998. and ,

[Gac05] Uniform test of algorithmic randomness over a general space, Theoretical Computer Science, vol. 341 (2005), no. 1–3, pp. 91-137. ,

[GHR10] Effective symbolic dynamics, random points, statistical behavior, complexity and entropy, Information and Computation, vol. 208 (2010), no. 1, pp. 23-41. , , and ,

[Gei84] Physical limits of acuity and hyperacuity, Journal of the Optical Society of America A, vol. 1 (1984), no. 7, pp. 775-782. ,

[GG12] Proceedings of the thirty-fourth annual conference of the Cognitive Science Society (Austin, TX) ( , , and , editors), Cognitive Science Society, 2012. and , Ping pong in Church: Productive use of concepts in human probabilistic inference,

[GGLT12] Proceedings of the thirty-fourth annual conference of the Cognitive Science Society (Austin, TX) ( , , and , editors), Cognitive Science Society, 2012. , , , and , Noisy Newtons: Unifying process and dependency accounts of causal attribution,

[GT07] Introduction to statistical relational learning, The MIT Press, 2007. and ,

[GG02] Models of ecological rationality: The recognition heuristic, Psychological Review, vol. 109 (2002), no. 1, pp. 75-90. and ,

[Goo61] A causal calculus. I, The British Journal for the Philosophy of Science, vol. 11 (1961), pp. 305-318. ,

[Goo68] Corroboration, explanation, evolving probability, simplicity and a sharpened razor, The British Journalfor the Philosophy of Science, vol. 19 (1968), no. 2, pp. 123-143. ,

[Goo75] Explicativity, corroboration, and the relative odds of hypotheses, Synthese, vol. 30 (1975), no. 1, pp. 39-73. ,

[Goo79] A. M. Turing's statistical work in World War II, Biometrika, vol. 66 (1979), no. 2, pp. 393-396, Studies in the history of probability and statistics. XXXVII. ,

[Goo91] The use of statistics in forensic science ( and , editors), Ellis Horwood, Chichester, 1991. , Weight of evidence and the Bayesian likelihood ratio,

[Goo00] Turing's anticipation of empirical Bayes in connection with the cryptanalysis of the naval Enigma, Journal of Statistical Computation and Simulation, vol. 66 (2000), no. 2, pp. 101-111. ,

[GBT09] Proceedings of the 31st annual conference of the Cognitive Science Society, 2009, pp. 2759-2764. , , and , Cause and intent: Social reasoning in causal learning,

[GMRBT08] Proceedings of the twenty-fourth conference on Uncertainty in Artificial Intelligence (UAI 2008) (Corvalis, Oregon), AUAI Press, 2008, pp. 220-229. , , , , and , Church: A language for generative models,

[GS12] Proceedings of the thirty-fourth annual Conference of the Cognitive Science Society (Austin, TX) ( , , and , editors), Cognitive Science Society, 2012. and , Knowledge and implicature: Modeling language understanding as social cognition,

[GT12] The probabilistic language of thought, in preparation, 2012. and ,

[GTFG08] A rational analysis of rule-based concept learning, Cognitive Science, vol. 32 (2008), no. 1, pp. 108-154. , , , and ,

[GTO11] Church wiki, (2011), http://projects.csail.mit.edu/church/wiki/Probabilistic_Models-of-Cognition. , , and , Probabilistic models of cognition,

[GUT11] Learning a theory of causality, Psychological Review, vol. 118 (2011), no. 1, pp. 110-119. , , and ,

[Gop12] Scientific thinking in young children: Theoretical advances, empirical research, and policy implications, Science, vol. 337 (2012), no. 6102, pp. 1623-1627. ,

[GKT08] Cambridge handbook of computational cognitive modeling, Cambridge University Press, 2008. , , and , Bayesian models of cognition,

[GT05] Structure and strength in causal induction, Cognitive Psychology, vol. 51 (2005), no. 4, pp. 334-384. and ,

[GT06] Optimal predictions in everyday cognition, Psychological Science, vol. 17 (2006), no. 9, pp. 767-773. and ,

[GT09] Theory-based causal induction, Psychological Review, vol. 116 (2009), no. 4, pp. 661-716. and ,

[GSW07] Computable metrization, Mathematical Logic Quarterly, vol. 53 (2007), no. 4–5, pp. 381-395. , , and ,

[Ham12] Physical reasoning in complex scenes is sensitive to mass, Master of Engineering thesis, Massachusetts Institute of Technology, Cambridge, MA, 2012. ,

[HBT11] Proceedings of the thirty-third annual Conference of the Cognitive Science Society (Austin, TX) ( , , and , editors), Cognitive Science Society, 2011, pp. 1545-1550. , , and , Internal physics models guide probabilistic judgments about object dynamics,

[Hem02] Effective metric spaces and representations of the reals, Theoretical Computer Science, vol. 284 (2002), no. 2, pp. 347-372. ,

[Hod97] Turing: A natural philosopher, Phoenix, London, 1997. ,

[How60] Dynamic programming and Markov processes, The MIT Press, Cambridge, MA, 1960. ,

[KLC98] Planning and acting in partially observable stochastic domains, Artificial Intelligence, vol. 101 (1998), pp. 99-134. , , and ,

[KLM96] Reinforcement learning: A survey, Journal of Artificial Intelligence Research, vol. 4 (1996), pp. 237-285. , , and ,

[Kal02] Foundations of moden probability, 2nd ed., Probability and its Applications, Springer, New York, 2002. ,

[KGT08] Advances in Neural Information Processing Systems 20 (NIPS 2007), 2008. , , and , Learning and using relational theories,

[KSBT07] Advances in Neural Information Processing Systems 19 (NIPS 2006) ( , , and , editors), The MIT Press, Cambridge, MA, 2007, pp. 681-688. , , , and , Combining causal and similarity-based reasoning,

[KT08] The discovery of structurai form, Proceedings of the National Academy of Sciences, vol. 105 (2008), no. 31, pp. 10687-10692. and ,

[KY03] Bayesian models of object perception, Current Opinion in Neurobiology, vol. 13 (2003), no. 2, pp. 150-158. and ,

[LBFL93] DENDRAL: A case study of the first expert system for scientific hypothesis formation, Artificial Intelligence, vol. 61 (1993), no. 2, pp. 209-261. , , , and ,

[Luc59] Individual choice behavior, John Wiley, New York, 1959. ,

[Luc77] The choice axiom after twenty years, Journal of Mathematical Psychology, vol. 15 (1977), no. 3, pp. 215-233. ,

[Mac03] Information theory, inference, and learning algorithms, Cambridge University Press, Cambridge, UK, 2003. ,

[MHC03] On the undecidability of probabilistic planning and related stochastic optimization problems, Artificial Intelligence, vol. 147 (2003), no. 1–2, pp. 5-34. , , and ,

[Man09] Natively probabilistic computation, Ph.D. thesis, Massachusetts Institute of Technology, 2009. ,

[Man11] NIPS Philosophy and Machine Learning Workshop, (2011). , Beyond calculation: Probabilistic computing machines and universal stochastic inference,

[MJT08] Technical Report MIT-CSAIL-TR-2008-069, Massachusetts Institute of Technology, 2008. , , and , Stochastic digital circuits for probabilistic inference,

[MKTG06] Proceedings of the twenty-second conference on Uncertainty in Artificial Intelligence (UAI2006) (Arlington, Virginia), AUAI Press, 2006, pp. 324-331. , , , and , Structured priors for structure learning,

[MR13] Stochastic inference machines, in preparation. and ,

[Mar82] Vision, Freeman, San Francisco, 1982. ,

[McC68] Semantic information processing, The MIT Press, 1968, pp. 403-418. , Programs with common sense,

[MUSTT12] Proceedings of the thirty-fourth annual conference of the Cognitive Science Society (Austin, TX) ( , , and , editors), Cognitive Science Society, 2012. , , , , and , Why blame Bob? Probabilistic generative models, counterfactual reasoning, and blame attribution,

[MP43] A logical calculus of the ideas immanent in nervous activity, Bulletin of Mathematical Biology, vol. 5 (1943), no. 4, pp. 115-133. and ,

[Mon82] A survey of partially observable Markov Decision Processes: Theory, models, andalgorithms, Management Science, vol. 28 (1982), no. 1, pp. 1-16. ,

[Mug91] Inductive logic programming, New Generation Computing, vol. 8 (1991), no. 4, pp. 295-318. ,

[OC98] Rational models of cognition, Oxford University Press, Oxford, 1998. and (editors),

[OC07] Bayesian rationality: The probabilistic approach to human reasoning, Oxford University Press, New York, 2007. and (editors),

[PT87] The complexity of Markov Decision Processes, Mathematics of Operations Research, vol. 12 (1987), no. 3, pp. 441-450. and ,

[Pea88] Probabilistic reasoning in intelligent systems: Networks of plausible inference, Morgan Kaufmann, San Francisco, 1988. ,

[Pea04] Computer science handbook ( , editor), CRC Press, 2nd ed., 2004. , Graphical models for probabilistic and causal reasoning,

[Pfa79] Conditional distributions as derivatives, The Annals of Probability, vol. 7 (1979), no. 6, pp. 1046-1050. ,

[Rao88] Paradoxes in conditional probability, Journal of Multivariate Analysis, vol. 27 (1988), no. 2, pp. 434-446. ,

[Rao05] Conditional measures and applications, 2nd ed., Pure and Applied Mathematics, vol. 271, Chapman & Hall/CRC, Boca Raton, FL, 2005. ,

[RH11] A philosophical treatise of universal induction, Entropy, vol. 13 (2011), no. 6, pp. 1076-1136. and ,

[Roy11] Computability, inference and modeling in probabilistic programming, Ph.D. thesis, Massachusetts Institute of Technology, 2011. ,

[Sch07] Admissible representations for probability measures, Mathematical Logic Quarterly, vol. 53 (2007), no. 4–5, pp. 431-445. ,

[Sch12] The origins of inquiry: Inductive inference and exploration in early childhood, Trends in Cognitive Sciences, vol. 16 (2012), no. 7, pp. 382-389. ,

[She87] Toward a universal law of generalization for psychological science, Science, vol. 237 (1987), no. 4820, pp. 1317-1323. ,

[SMH+91] Probabilistic diagnosis using a reformulation of the INTERNIST-1/QMR knowledge base, Methods of Information in Medicine, vol. 30 (1991), pp. 241-255. , , , , , , and ,

[SG92] Bayesian statistics without tears: A sampling-resampling perspective, The American Statistician, vol. 46 (1992), no. 2, pp. 84-88. and ,

[Sol64] A formal theory of inductive inference: Parts I and II, Information and Control, vol. 7 (1964), no. 1, pp. 1–22 and 224-254. ,

[SG13] Reasoning about reasoning by nested conditioning: Modeling theory ofmind with probabilistic programs, submitted. and ,

[SG12] A dynamic programming algorithm for inference in recursive probabilistic programs, Second Statistical Relational AI workshop at UAI 2012 (StaRAI-12), (2012). ,

[TG01] Generalization, similarity, and Bayesian inference, Behavioral and Brain Sciences, vol. 24 (2001), no. 4, pp. 629-640. and ,

[TGK06] Theory-based Bayesian models of inductive learning and reasoning, Trends in Cognitive Sciences, vol. 10 (2006), no. 7, pp. 309-318. , , and ,

[TKGG11] How to grow a mind: Statistics, structure, andabstraction, Science, vol. 331 (2011), no. 6022, pp. 1279-1285. , , , and ,

[Teu02] Turing's connectionism: An investigation of neural network architectures, Springer-Verlag, London, 2002. ,

[Tju74] Conditional probability distributions, Lecture Notes, no. 2, Institute of Mathematical Statistics, University of Copenhagen, Copenhagen, 1974. ,

[Tju75] A constructive definition of conditional distributions, Preprint 13, Institute of Mathematical Statistics, University of Copenhagen, Copenhagen, 1975. ,

[Tju80] Probability based on Radon measures, Wiley Series in Probability and Mathematical Statistics, John Wiley & Sons Ltd., Chichester, 1980. ,

[THS06] Technical Report EDI-INF-RR-0934, University of Edinburgh, School of Informatics, 2006. , , and , Probabilistic inference for solving (PO)MDPs,

[Tur36] On computable numbers, with an application to the Entscheidungsproblem, Proceedings of the London Mathematical Society. Second Series, vol. 42 (1936), no. 1, pp. 230-265. ,

[Tur39] Systems of logic based on ordinals, Proceedings of the London Mathematical Society. Second Series, vol. 45 (1939), no. 1, pp. 161-228. ,

[Tur48] Intelligent machinery, National Physical Laboratory Report, 1948. ,

[Tur50] Computing machinery and intelligence, Mind, vol. 59 (1950), pp. 433-460. ,

[Tur52] The chemical basis of morphogenesis, Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences, vol. 237 (1952), no. 641, pp. 37-72. ,

[Tur96] Intelligent machinery, a heretical theory, Philosophia Mathematica. Philosophy of Mathematics, its Learning, andits Applications. Series III, vol. 4 (1996), no. 3, pp. 256-260, Originally a radio presentation, 1951. ,

[Tur12] The applications of probability to cryptography, c. 1941, UK National Archives, HW25/37, 2012. ,

[UBM+09] Advances in Neural Information Processing Systems 22 (NIPS2009), 2009, pp. 1874-1882. , , , , , and , Help or hinder: Bayesian models of social goal inference,

[Wat89] Learning from delayed rewards, Ph.D. thesis, King's College, University of Cambridge, 1989. ,

[WD92] Q-Learning, Machine Learning, vol. 8 (1992), pp. 279-292. and ,

[Wei93] Computability on computable metric spaces, Theoretical Computer Science, vol. 113 (1993), no. 2, pp. 191-210. ,

[Wei99] Computability on the probability measures on the Borel sets of the unit interval, Theoretical Computer Science, vol. 219 (1999), no. 1–2, pp. 421-437. ,

[Wei00] Computable analysis: An introduction, Texts in Theoretical Computer Science, An EATCS Series, Springer-Verlag, Berlin, 2000. ,

[WGRKT11] Proceedings of the twenty-second International Joint Conference on Artificial Intelligence (IJCAI) (Menlo Park, CA) ( , editor), AAAI Press, 2011. , , , , and , Bayesian policy search with policy priors,

[WGSS11] Advances in Neural Information Processing Systems 24 (NIPS 2011), 2011. , , , and , Nonstandard interpretations of probabilistic programs for efficient inference,

[WSG11] Proceedings of the fourteenth international conference on Artificial Intelligence and Statistics (AISTATS), Journal of Machine Learning Research: Workshop and Conference Proceedings, vol. 15, 2011, pp. 770-778. , , and , Lightweight implementations of probabilistic programming languages via transformational compilation,

[Yam99] Polynomial time samplable distributions, Journal of Complexity, vol. 15 (1999), no. 4, pp. 557-574. ,

[Zab95] Alan Turing and the central limit theorem, American Mathematical Monthly, vol. 102 (1995), no. 6, pp. 483-494. ,

[Zab12] Commentary on Alan M. Turing: The applications of probability to cryptography, Cryptologia, vol. 36(2012), no. 3, pp. 191-214. ,