Hostname: page-component-77f85d65b8-pztms Total loading time: 0 Render date: 2026-04-19T03:59:20.176Z Has data issue: false hasContentIssue false

Bayesian causal discovery for policy decision making

Published online by Cambridge University Press:  04 March 2025

Catarina Moreira*
Affiliation:
Human Technology Institute, University Technology Sydney, Ultimo, NSW, Australia
Ngoc Lan Chi Nguyen
Affiliation:
School of Computer Science, The University of Sydney, Sydney, NSW, Australia
Gilad Francis
Affiliation:
Human Technology Institute, University Technology Sydney, Ultimo, NSW, Australia
Hadi Mohasel Afshar
Affiliation:
Human Technology Institute, University Technology Sydney, Ultimo, NSW, Australia
Anna Lopatnikova
Affiliation:
Human Technology Institute, University Technology Sydney, Ultimo, NSW, Australia Discipline of Business Analytics, The University of Sydney, Darlington, NSW, Australia
Sally Cripps
Affiliation:
Human Technology Institute, University Technology Sydney, Ultimo, NSW, Australia
Roman Marchant
Affiliation:
Human Technology Institute, University Technology Sydney, Ultimo, NSW, Australia
*
Corresponding author: Catarina Moreira; Email: catarina.pintomoreira@uts.edu.au

Abstract

This paper demonstrates how learning the structure of a Bayesian network, often used to predict and represent causal pathways, can be used to inform policy decision-making.

We show that Bayesian networks are a rigorous and interpretable representation of interconnected factors that affect the complex environment in which policy decisions are made. Furthermore, Bayesian structure learning differentiates between proximal or immediate factors and upstream or root causes, offering a comprehensive set of potential causal pathways leading to specific outcomes.

We show how these causal pathways can provide critical insights into the impact of a policy intervention on an outcome. Central to our approach is the integration of causal discovery within a Bayesian framework, which considers the relative likelihood of possible causal pathways rather than only the most probable pathway.

We argue this is an essential part of causal discovery in policy making because the complexity of the decision landscape inevitably means that there are many near equally probable causal pathways. While this methodology is broadly applicable across various policy domains, we demonstrate its value within the context of educational policy in Australia. Here, we identify pathways influencing educational outcomes, such as student attendance, and examine the effects of social disadvantage on these pathways. We demonstrate the methodology’s performance using synthetic data and its usefulness by applying it to real-world data. Our findings in the real example highlight the usefulness of Bayesian networks as a policy decision tool and show how data science techniques can be used for practical policy development.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - NC
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial licence (http://creativecommons.org/licenses/by-nc/4.0), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original article is properly cited. The written permission of Cambridge University Press must be obtained prior to any commercial use.
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Figure 1. (a) Bayesian network of a synthetic socioeconomic scenario. The factors of interest specify the nodes in the network, while the directed edges (arcs) indicate conditional dependence relationships between the nodes. (b) Linear regression model. The red edges indicate that the relationship between factors is found to be statistically significant using backwards variable selection.

Figure 1

Figure 2. Comparison highlighting specific limitations in the PC algorithm’s ability to capture certain key relationships within the data. (left) Graph learned using the PC algorithm. (right) The difference between the edges in the predicted PC graph and the true graph.

Figure 2

Figure 3. This figure illustrates the probability distribution of Bayesian network structures as compared to the true graph. Among the networks analysed, there are 19 equivalent classes, each representing structures that exhibit identical conditional independencies. Four of these equivalent classes are emphasized, showcasing how their structures align with or differ from the true graph, providing insight into the robustness and variability within the network interpretations.

Figure 3

Figure 4. Comparative visualisation of Bayesian network structures inferred from PMCMC simulation. The bar chart displays the probability distribution of sampled networks, with the true network denoted in red, indicating the highest probability. Adjacency matrices compare edge occurrence probabilities from sampled DAGs against the true data generation graph, highlighting variances in edge predictions. Below, the predicted graphs with the highest scores are depicted, and the true graph is shown for reference.

Figure 4

Figure 5. Comparison of True Posterior and PMCMC Approximate Posterior Distributions. This figure displays the true posterior probabilities (blue bars) and the PMCMC approximated probabilities (red squares) for a range of DAGs.

Figure 5

Figure 6. The Maximum A Posteriori (MAP) DAG, based on the LSAC dataset, features the ‘School Absence Frequent’ node emphasised with a red diamond. The edge coefficients are derived from the data according to the most probable DAG obtained with Partition MCMC. Blue edges represent positive correlations, while orange edges signify negative correlations. Ancestor nodes of ‘School Absence Frequent’ are marked with orange ellipses.

Figure 6

Table A1. OLS summarised results

Figure 7

Table D1. Expected value and standard deviation for coefficients of a logistic regression model which assumes independent and direct influence of all factors over School Absence

Figure 8

Figure E1. Top graphs, following the MAP graph (Figure 6), in the posterior distribution.

Submit a response

Comments

No Comments have been published for this article.