Hostname: page-component-77f85d65b8-5ngxj Total loading time: 0 Render date: 2026-03-27T14:18:19.295Z Has data issue: false hasContentIssue false

PopBERT. Detecting Populism and Its Host Ideologies in the German Bundestag

Published online by Cambridge University Press:  01 October 2024

Lukas Erhard*
Affiliation:
Institute for Social Sciences, University of Stuttgart, Stuttgart, Germany
Sara Hanke
Affiliation:
Institute for Social Sciences, University of Stuttgart, Stuttgart, Germany
Uwe Remer
Affiliation:
Institute for Social Sciences, University of Stuttgart, Stuttgart, Germany
Agnieszka Falenska
Affiliation:
Institute for Natural Language Processing, University of Stuttgart, Stuttgart, Germany
Raphael Heiko Heiberger
Affiliation:
Institute for Social Sciences, University of Stuttgart, Stuttgart, Germany
*
Corresponding author: Lukas Erhard; Email: lukas.erhard@sowi.uni-stuttgart.de
Rights & Permissions [Opens in a new window]

Abstract

The rise of populism concerns many political scientists and practitioners, yet the detection of its underlying language remains fragmentary. This paper aims to provide a reliable, valid, and scalable approach to measure populist rhetoric. For that purpose, we created an annotated dataset based on parliamentary speeches of the German Bundestag (2013–2021). Following the ideational definition of populism, we label moralizing references to “the virtuous people” or “the corrupt elite” as core dimensions of populist language. To identify, in addition, how the thin ideology of populism is “thickened,” we annotate how populist statements are attached to left-wing or right-wing host ideologies. We then train a transformer-based model (PopBERT) as a multilabel classifier to detect and quantify each dimension. A battery of validation checks reveals that the model has a strong predictive accuracy, provides high qualitative face validity, matches party rankings of expert surveys, and detects out-of-sample text snippets correctly. PopBERT enables dynamic analyses of how German-speaking politicians and parties use populist language as a strategic device. Furthermore, the annotator-level data may also be applied in cross-domain applications or to develop related classifiers.

Information

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press on behalf of The Society for Political Methodology
Figure 0

Table 1 Number of annotated sentences in the dataset. In total, 8,795 sentences were annotated by five coders each. Column N indicates how many of these sentences were labeled by at least one coder with the respective dimension. The remaining three columns provide information about the inter-annotator agreement among the five coders for each dimension.

Figure 1

Table 2 Performance of the model on the 20% test set.

Figure 2

Table 3 Selected examples. Three sentences have been manually selected that correspond to only one of the two core dimensions (anti-elitism or people-centrism) as well as three sentences containing both dimensions simultaneously. For each of these, one sentence is predicted to be neutral, one is attached to a left-wing, and one to a right-wing host ideology.

Figure 3

Figure 1 Populist dimensions in speeches of the Bundestag, by party. Each dimension represents the average of model predictions across all sentences for each party. The values are normalized to their maximum value to highlight the proportions between the parties. Subplots with unstandardized values can be found in Appendix 4 of the Supplementary Material.

Figure 4

Figure 2 Populism by party in the 18th and 19th Bundestag using different aggregation methods. The panels depict (a) a multiplicative index of populism, (b) the Goertz Index, (c) the Bollen Index, and (d) the Sartori Index.

Figure 5

Table 4 Ranking of members in the German Bundestag using populist language, 18th and 19th electoral term. The ranking rests on a multiplicative index for anti-elitism and people-centrism. Alternative aggregation strategies are presented in Appendix 7 of the Supplementary Material.

Supplementary material: File

Erhard et al. supplementary material

Erhard et al. supplementary material
Download Erhard et al. supplementary material(File)
File 569.8 KB