Hostname: page-component-89b8bd64d-n8gtw Total loading time: 0 Render date: 2026-05-06T18:10:14.843Z Has data issue: false hasContentIssue false

VigIA: prioritizing public procurement oversight with machine learning models and risk indices

Published online by Cambridge University Press:  20 December 2024

Andrés Salazar
Affiliation:
Department of Economics, Universidad del Rosario, Bogotá, Colombia.
Juan F. Pérez*
Affiliation:
Department of Industrial Engineering, Universidad de los Andes, Bogotá, Colombia
Jorge Gallego
Affiliation:
Office of Evaluation and Oversight, Inter-American Development Bank, Washington, DC, USA
*
Corresponding author: Juan F. Pérez; Email: jf.perez33@uniandes.edu.co

Abstract

Public procurement is a fundamental aspect of public administration. Its vast size makes its oversight and control very challenging, especially in countries where resources for these activities are limited. To support decisions and operations at public procurement oversight agencies, we developed and delivered VigIA, a data-based tool with two main components: (i) machine learning models to detect inefficiencies measured as cost overruns and delivery delays, and (ii) risk indices to detect irregularities in the procurement process. These two components cover complementary aspects of the procurement process, considering both active and passive waste, and help the oversight agencies to prioritize investigations and allocate resources. We show how the models developed shed light on specific features of the contracts to be considered and how their values signal red flags. We also highlight how these values change when the analysis focuses on specific contract types or on information available for early detection. Moreover, the models and indices developed only make use of open data and target variables generated by the procurement processes themselves, making them ideal to support continuous decisions at overseeing agencies.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press
Figure 0

Figure 1. Number of contracts reported virtually by procurement mechanism across the years in Bogotá.

Figure 1

Figure 2. Distribution of contracts according to the presence of cost overruns or delivery delays.

Figure 2

Table 1. Descriptive statistics of numeric variables in the cost overruns models

Figure 3

Figure 3. Partial dependency plots of explanatory variables in the Random forest model to predict cost overruns. All variables in the $ x $ axis are standardized, and red dashed lines indicate tendency breakpoints with the value in the original scale.

Figure 4

Figure 4. Results for logistic regression and random forests to predict cost overruns.

Figure 5

Figure 5. Distribution of the contract duration by type.

Figure 6

Table 2. Random forest performance measures with additions in value as target variable and different contract types

Figure 7

Figure 6. Results for the random forests model to predict additions in value for different contract types.

Figure 8

Figure 7. Partial dependency plots of explanatory variables in the Random forest model to predict cost overruns for non-professional services contracts. Variables in the $ x $ axis are standardized, and red dashed lines indicate tendency breakpoints with the value in the original scale.

Figure 9

Figure 8. Results for the random forest models for additions in value using all variables vs pre-execution variables.

Figure 10

Table 3. Descriptive statistics of IRIC and IRICP, by contract type, for contracts signed in 2020

Figure 11

Table 4. Top entities by average IRIC for professional services contracts in 2020

Figure 12

Table 5. Top entities by average IRIC for non-professional services contracts in 2020

Supplementary material: File

Salazar et al. supplementary material

Salazar et al. supplementary material
Download Salazar et al. supplementary material(File)
File 515.5 KB
Submit a response

Comments

No Comments have been published for this article.