Hostname: page-component-6766d58669-7cz98 Total loading time: 0 Render date: 2026-05-21T08:28:09.479Z Has data issue: false hasContentIssue false

Can Machines Think Like Humans: A Behavioral Evaluation of LLM Agents in Dictator Games

Published online by Cambridge University Press:  17 March 2026

Ji Ma*
Affiliation:
The University of Texas at Austin Lyndon B Johnson School of Public Affairs, USA Gradel Institute of Charity, New College, University of Oxford, UK
*
Corresponding author: Ji Ma; Email: maji@austin.utexas.edu
Rights & Permissions [Opens in a new window]

Abstract

As large language model (LLM)-based agents increasingly engage with human society, how well do we understand their prosocial behaviors? We (1) investigate how LLM agents’ prosocial behaviors can be induced by different personas and benchmarked against human behaviors and (2) introduce a social science approach to evaluate LLM agents’ decision-making. We explored how different personas and experimental framings affect these AI agents’ altruistic behavior in dictator games and compared their behaviors within the same LLM family, across various families, and with human behaviors. The findings reveal that merely assigning a human-like identity to LLMs does not produce human-like behaviors. They suggest that LLM agents’ reasoning does not consistently exhibit textual markers of human decision-making in dictator games and that their alignment with human behavior varies substantially across model architectures and prompt formulations; even worse, such dependence does not follow a clear pattern. As society increasingly integrates machine intelligence, “prosocial AI” emerges as a promising and urgent research direction in philanthropic studies.

Information

Type
Research Paper
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - NCCreative Common License - ND
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives licence (http://creativecommons.org/licenses/by-nc-nd/4.0), which permits non-commercial re-use, distribution, and reproduction in any medium, provided that no alterations are made and the original article is properly cited. The written permission of Cambridge University Press or the rights holder(s) must be obtained prior to any commercial use and/or adaptation of the article.
Copyright
© The Author(s), 2026. Published by Cambridge University Press on behalf of International Society for Third-Sector Research
Figure 0

Fig. 1. Experiment design: LLM agent in dictator game. Note: Numbers in circles indicate the order of steps. See Section A.2 of the Supplementary Material and “LLM personas” section for detailed descriptions of the variables and experimental settings.

Figure 1

Table 1. Model performance: Instruction FOLLOWING and math reasoning

Figure 2

Fig. 2. Giving rate by model family and size (SoS). Note: Vertical red dashed lines indicate giving rates at −0.5, 0, and 0.5, respectively; horizontal red dashed lines indicate 50% of total observations. The giving rate is calculated as the percentage of the amount transferred by the dictator to the recipient out of the total stake. Results of the Theory of Mind trials are in Figure D1 in the Supplementary Material.

Figure 3

Fig. 3. Predicting generosity: Demographics and LLM temperature (SoS). Note: The coefficients (showing 95% confidence intervals) are from a linear regression model using the proportion of stake transferred in the dictator game as the dependent variable. Deep colors represent larger models, and light colors represent smaller models within the same LLM family. The shaded areas indicate expected directions of impact based on human studies (Section A.2 of the Supplementary Material). Results of the Theory of Mind trials are in Figure D2 in the Supplementary Material.

Figure 4

Fig. 4. Predicting generosity: Myers–Briggs Type Indicator (SoS). Note: The coefficients (showing 95% CI) are from a linear regression model using the proportion of stake transferred in the dictator game as the dependent variable. Deep colors represent larger models, and light colors represent smaller models within the same LLM family. The shaded areas indicate expected directions of impact based on human studies (Section A.2 of the Supplementary Material). Results of the Theory of Mind trials are in Figure D3 in the Supplementary Material.

Figure 5

Fig. 5. Predicting generosity: Framing of experiment (SoS). Note: The coefficients (showing 95% CI) are from a linear regression model using the proportion of stake transferred in the dictator game as the dependent variable. Deep colors represent larger models, and light colors represent smaller models within the same LLM family. The shaded areas indicate expected directions of impact based on human studies (Section A.2 of the Supplementary Material). The “Stranger” framing is the reference group for “Friend” and “Stranger Meet.” The “Give” framing is the reference group for “Take.” Results of the Theory of Mind trials are in Figure D4 in the Supplementary Material.

Figure 6

Fig. 6. Predicting generosity: Psychological process (SoS). (a) LIWC Categories Effectively Predicting Compassion Controlling for Empathy. (b) LIWC Categories Effectively Predicting Empathy Controlling for Compassion. Note: The coefficients (showing 95% confidence intervals) are from a linear regression model using the proportion of stake transferred in the dictator game as the dependent variable. Deep colors represent larger models, and light colors represent smaller models within the same LLM family. The shaded areas indicate expected directions of impact based on human studies (Section A.2 of the Supplementary Material). LIWC categories are selected for analysis according to Yaden et al. (2024). “She/He” and “Male” categories for Compassion are excluded due to limited number of observations. LIWC = Linguistic Inquiry and Word Count (Tausczik & Pennebaker, 2010). Results of the Theory of Mind trials are in Figure D5 in the Supplementary Material.

Figure 7

Table 2. LLM agents’ alignment with humans in dictator games (sense of self)

Figure 8

Table 3. LLM agents’ alignment with humans in dictator games: Compassion (sense of self)

Figure 9

Table 4. LLM agents’ alignment with humans in dictator games: Empathy (sense of self)

Supplementary material: File

Ma supplementary material

Ma supplementary material
Download Ma supplementary material(File)
File 334.4 KB