Hostname: page-component-89b8bd64d-4ws75 Total loading time: 0 Render date: 2026-05-06T04:54:36.427Z Has data issue: false hasContentIssue false

Introducing a global dataset on conflict forecasts and news topics

Published online by Cambridge University Press:  25 March 2024

Hannes Mueller
Affiliation:
Institut d’Anàlisi Econòmica, CSIC, Barcelona, Spain Barcelona School of Economics, Barcelona, Spain
Christopher Rauh*
Affiliation:
Faculty of Economics, University of Cambridge, Cambridge, UK PRIO, Oslo, Norway
Ben Seimon
Affiliation:
Institut d’Anàlisi Econòmica, CSIC, Barcelona, Spain Fundació d’Economía Analítica, Barcelona, Spain
*
Corresponding author: Christopher Rauh; Email: cr542@cam.ac.uk

Abstract

This article provides a structured description of openly available news topics and forecasts for armed conflict at the national and grid cell level starting January 2010. The news topics, as well as the forecasts, are updated monthly at conflictforecast.org and provide coverage for more than 170 countries and about 65,000 grid cells of size 55 × 55 km worldwide. The forecasts rely on natural language processing (NLP) and machine learning techniques to leverage a large corpus of newspaper text for predicting sudden onsets of violence in peaceful countries. Our goals are a) to support conflict prevention efforts by making our risk forecasts available to practitioners and research teams worldwide, b) to facilitate additional research that can utilize risk forecasts for causal identification, and c) to provide an overview of the news landscape.

Information

Type
Data Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press
Figure 0

Figure 1. Schematic illustration of prediction pipeline.

Figure 1

Table 1. National forecast target variables

Figure 2

Table 2. National violence and armed conflict forecast features

Figure 3

Table 3. National violence intensity forecast features

Figure 4

Figure 2. ROC-AUC 95% confidence intervals.

Figure 5

Figure 3. ROC-AUCs by year for armed conflict, 12 months ahead.

Figure 6

Figure 4. Precision recall curve for armed conflict forecast, 12 months ahead.

Figure 7

Figure 5. Distribution of average fatalities per capita.

Figure 8

Table 4. Average MSE by bin

Figure 9

Figure 6. Violence intensity performance, escalations, and de-escalations 3 months ahead.

Figure 10

Figure 7. ROC curve and precision–recall curve at subnational level.

Figure 11

Table 5. National and subnational forecast dataset.

Figure 12

Figure 8. National conflict risk across the globe at the end of August 2023.

Figure 13

Figure 9. Subnational conflict risk across the globe at the end of August 2023.

Figure 14

Table 6. Top 10 keywords of the 15 topics and suggested labels

Figure 15

Figure 10. Politics topics in the USA and the UK over time.

Figure 16

Figure 11. Military topic share in August 2023.

Submit a response

Comments

No Comments have been published for this article.