Hostname: page-component-77f85d65b8-zzw9c Total loading time: 0 Render date: 2026-03-28T07:09:45.850Z Has data issue: false hasContentIssue false

Introducing an Interpretable Deep Learning Approach to Domain-Specific Dictionary Creation: A Use Case for Conflict Prediction

Published online by Cambridge University Press:  22 March 2023

Sonja Häffner
Affiliation:
Center for Crisis Early Warning (Kompetenzzentrum Krisenfrüherkennung), Universität der Bundeswehr München, Neubiberg, Germany. E-mail: julian.walterskirchen@unibw.de
Martin Hofer
Affiliation:
Center for Crisis Early Warning (Kompetenzzentrum Krisenfrüherkennung), Universität der Bundeswehr München, Neubiberg, Germany. E-mail: julian.walterskirchen@unibw.de
Maximilian Nagl
Affiliation:
Lehrstuhl für Statistik und Risikomanagement, Universität Regensburg, Regensburg, Germany
Julian Walterskirchen*
Affiliation:
Center for Crisis Early Warning (Kompetenzzentrum Krisenfrüherkennung), Universität der Bundeswehr München, Neubiberg, Germany. E-mail: julian.walterskirchen@unibw.de
*
Corresponding author Julian Walterskirchen
Rights & Permissions [Opens in a new window]

Abstract

Recent advancements in natural language processing (NLP) methods have significantly improved their performance. However, more complex NLP models are more difficult to interpret and computationally expensive. Therefore, we propose an approach to dictionary creation that carefully balances the trade-off between complexity and interpretability. This approach combines a deep neural network architecture with techniques to improve model explainability to automatically build a domain-specific dictionary. As an illustrative use case of our approach, we create an objective dictionary that can infer conflict intensity from text data. We train the neural networks on a corpus of conflict reports and match them with conflict event data. This corpus consists of over 14,000 expert-written International Crisis Group (ICG) CrisisWatch reports between 2003 and 2021. Sensitivity analysis is used to extract the weighted words from the neural network to build the dictionary. In order to evaluate our approach, we compare our results to state-of-the-art deep learning language models, text-scaling methods, as well as standard, nonspecialized, and conflict event dictionary approaches. We are able to show that our approach outperforms other approaches while retaining interpretability.

Information

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2023. Published by Cambridge University Press on behalf of the Society for Political Methodology
Figure 0

Figure 1 Schematic overview of our advanced dictionary creation approach.

Figure 1

Figure 2 Distribution of our main data sources.

Figure 2

Figure 3 Neural network architecture.

Figure 3

Figure 4 Unprocessed CrisisWatch report for Afghanistan, September 2003. Source: https://www.crisisgroup.org/crisiswatch.

Figure 4

Figure 5 Preprocessed CrisisWatch report with dictionary words and scores highlighted.

Figure 5

Table 1 Top 10 most positive (more fatalities) and negative (fewer fatalities) terms based on feature importance for International Crisis Group reports.

Figure 6

Figure 6 Distribution of feature importance scores for all words.

Figure 7

Figure 7 Comparison between different scores and fatalities.

Figure 8

Figure 8 Correlation plot for fatalities and the different scores.

Figure 9

Table 2 Results of predicting fatalities with different approaches.

Figure 10

Figure 9 Observed versus difference (sorted by fatalities), Random Forest.

Supplementary material: Link

Häffner et al. Dataset

Link
Supplementary material: PDF

Häffner et al. supplementary material

Häffner et al. supplementary material

Download Häffner et al. supplementary material(PDF)
PDF 355.6 KB