Hostname: page-component-89b8bd64d-j4x9h Total loading time: 0 Render date: 2026-05-13T09:49:32.588Z Has data issue: false hasContentIssue false

Automatic SDG budget tagging: Building public financial management capacity through natural language processing

Published online by Cambridge University Press:  28 September 2023

Daniele Guariso*
Affiliation:
The Alan Turing Institute, London, UK
Omar A. Guerrero
Affiliation:
The Alan Turing Institute, London, UK
Gonzalo Castañeda
Affiliation:
Centro de Investigación y Docencia Económicas (CIDE), Mexico City, Mexico
*
Corresponding author: Daniele Guariso; Email: dguariso@turing.ac.uk

Abstract

The “budgeting for SDGs”–B4SDGs–paradigm seeks to coordinate the budgeting process of the fiscal cycle with the sustainable development goals (SDGs) set by the United Nations. Integrating the goals into public financial management systems is crucial for an effective alignment of national development priorities with the objectives set in the 2030 Agenda. Within the dynamic process defined in the B4SDGs framework, the step of SDG budget tagging represents a precondition for subsequent budget diagnostics. However, developing a national SDG taxonomy requires substantial investment in terms of time, human, and administrative resources. Such costs are exacerbated in least-developed countries, which are often characterized by a constrained institutional capacity. The automation of SDG budget tagging could represent a cost-effective solution. We use well-established text analysis and machine learning techniques to explore the scope and scalability of automatic labeling budget programs within the B4SDGs framework. The results show that, while our classifiers can achieve great accuracy, they face limitations when trained with data that is not representative of the institutional setting considered. These findings imply that a national government trying to integrate SDGs into its planning and budgeting practices cannot just rely solely on artificial intelligence (AI) tools and off-the-shelf coding schemes. Our results are relevant to academics and the broader policymaker community, contributing to the debate around the strengths and weaknesses of adopting computer algorithms to assist decision-making processes.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2023. Published by Cambridge University Press
Figure 0

Table 1. Distribution of public spending over SDGs by country (2020)

Figure 1

Figure 1. Average indicator levels by SDG (in percentage).Notes: The indicators have been normalized between zero and one, and presented in percentage. A higher indicator value denotes a better outcome. The striped areas indicate that no indicators in such SDG were available for the year considered (2020). The dataset used (the 2021 Sustainable Development Report) does not contain indicators for SDG 12 (‘Responsible Consumption and Production’).

Figure 2

Table 2. Distribution of BPs over SDGs by country

Figure 3

Table 3. BP complexity and text similarity across SDGs

Figure 4

Table 4. Single-label classification

Figure 5

Table 5. Single-label classification: LinkedSDGs

Figure 6

Table 6. Single-label classification across countries

Figure 7

Table 7. Single-label classification merged data

Supplementary material: PDF

Guariso et al. supplementary material

Appendix

Download Guariso et al. supplementary material(PDF)
PDF 140.7 KB
Submit a response

Comments

No Comments have been published for this article.