Argiento, R., Pemantle, R., Skyrms, B., and Volkov, S. 2009. ‘Learning to Signal: Analysis of a Micro-Level Reinforcement Model.’ Stochastic Processes and their Applications, 119: 319–73.
Barrett, J. A., and Zollman, K. 2009. ‘The Role of Forgetting in the Evolution and Learning of Language.’ Journal of Experimental and Theoretical Artificial Intelligence, 21: 293–309.
Donaldson, M., Lachmann, M., and Bergstrom, C. T. 2007. ‘The Evolution of Functionally Referential Meaning in a Structured World.’ Journal of Theoretical Biology, 246: 225–33.
Erev, I., and Roth, A. 1998. ‘Predicting How People Play Games: Reinforcement Learning in Games with Unique Mixed-Strategy Equilibria.’ American Economic Review, 88: 848–81.
Estes, W. K. 1950. ‘Toward a Statistical Theory of Learning.’ Psychological Review, 57: 94–107.
Hofbauer, J., and Huttegger, S. 2008. ‘Feasibility of Communication in Binary Signaling Games.’ Journal of Theoretical Biology, 254: 843–9.
Hu, Y. 2010. ‘Essays on Random Processes with Reinforcement.’ Ph.D. thesis, St Anne's College, Oxford University.
Hu, Y., Skyrms, B., and Tarrés, P. In preparation. ‘Reinforcement Learning in Signaling Games.’
Huttegger, S., and Skyrms, B. Forthcoming. ‘Emergence of a Signaling Network with Probe and Adjust.’ In Calcott, B., Joyce, R. and Sterelney, K. (eds), Cooperation, Complexity, and Signaling. Cambridge, MA: MIT Press.
Kimbrough, S. O., and Murphy, F. H. 2009. ‘Learning to Collude Tacitly on Production Levels by Oligopolistic Agents.’ Computational Economics, 33: 47–78.
Lewis, D. K. 1969. Convention: A Philosophical Study. Cambridge, MA: Harvard University Press.
Marden, J. P., Young, H. P., Arslan, G., and Shamma, J. S. 2009. ‘Payoff-based dynamics for Multiplayer Weakly Acyclic Games.’ SIAM Journal on Control and Optimization, 48: 373–96.
Roth, A., and Erev, I. 1995. ‘Learning in Extensive Form Games: Experimental Data and Simple Dynamic Models in the Intermediate Term.’ Games and Economic Behavior, 8: 164–212.
Skyrms, B. 2010. Signals: Evolution, Learning and Information. London and New York: Oxford University Press.
Suppes, P., and Atkinson, R. C. 1960. Markov Learning Models for Multiperson Interactions. Stanford, CA: Stanford University Press.
Vulkan, N. 2000. ‘An Economist's Perspective on Probability Matching.’ Journal of Economic Surveys, 14: 101–18.
Young, H. P. 2004. Strategic Learning and its Limits. London and New York: Oxford University Press.
Young, H. P. 2009. ‘Learning by Trial and Error.’ Games and Economic Behavior, 65: 626–43.