Hostname: page-component-77f85d65b8-2tv5m Total loading time: 0 Render date: 2026-04-19T16:03:38.197Z Has data issue: false hasContentIssue false

Harnessing heterogeneity in behavioural research using computational social science

Published online by Cambridge University Press:  04 December 2023

Giuseppe A. Veltri*
Affiliation:
Department of Sociology and Social Research, University of Trento, Via Verdi 26, Trento 38121, Italy
Rights & Permissions [Opens in a new window]

Abstract

Similarly to other domains of the social sciences, behavioural science has grappled with a crisis concerning the effect sizes of research findings. Different solutions have been provided to answer this challenge. This paper will discuss analytical strategies developed in the context of computational social science, namely causal tree and forest, that will benefit behavioural scientists in harnessing heterogeneity of treatment effects in RCTs. As a mixture of theoretical and data-driven approaches, these techniques are well suited to exploit the rich information provided by large studies conducted using RCTs. We discuss the characteristics of these methods and their methodological rationale and provide simulations to illustrate their use. We simulate two scenarios of RCTs-generated data and explore the heterogeneity of treatment effects using causal tree and causal forest methods. Furthermore, we outlined a potential theoretical use of these techniques to enrich behavioural science ecological validity by introducing the notion of behavioural niche.

Information

Type
Perspective
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2023. Published by Cambridge University Press
Figure 0

Figure 1. Scenario 1: Simulation of a causal tree with a binary treatment (X), one normally distributed covariate (C1) and two non-normally distributed covariates (C2, C3). Max-depth set to 3.

Figure 1

Figure 2. Scenario 1: Scatterplot reporting ATE's heterogeneity values due to Covariate 1 estimated using a causal forest model (2000 trees).

Figure 2

Figure 3. Scenario 1: Scatterplot reporting ATE's heterogeneity values due to Covariate 2 estimated using a causal forest model (2000 trees).

Figure 3

Figure 4. Scenario 1: Scatterplot reporting ATE's heterogeneity values due to Covariate 3 estimated.

Figure 4

Figure 5. Scenario 2: Simulation of a causal tree with a continuous treatment (X), one normally distributed covariate (C1) and two non-normally distributed covariates (C2, C3). Max-depth set to 3.

Figure 5

Figure 6. Scenario 2: Scatterplot reporting ATE's heterogeneity values due to Covariate 1 estimated using a causal forest model (2000 trees).

Figure 6

Figure 7. Scenario 2: Scatterplot reporting ATE's heterogeneity values due to Covariate 2 estimated using a causal forest model (2000 trees).

Figure 7

Figure 8. Scenario 2: Scatterplot reporting ATE's heterogeneity values due to Covariate 3 estimated using a causal forest model (2000 trees).

Figure 8

Figure 9. The ecological use of heterogeneity: behavioural niches.