Hostname: page-component-5db58dd55d-8mwbx Total loading time: 0 Render date: 2026-05-31T08:37:26.699Z Has data issue: false hasContentIssue false

Large Language Models Are Democracy Coders with Attitudes

Published online by Cambridge University Press:  30 July 2025

Nils B. Weidmann
Affiliation:
University of Konstanz , Germany
Mats Faulborn
Affiliation:
University of Konstanz , Germany
David García
Affiliation:
University of Konstanz , Germany
Rights & Permissions [Opens in a new window]

Abstract

Current political developments worldwide illustrate that research on democratic backsliding is as important as ever. A recent exchange in Political Science & Politics (February 2024) highlighted again that the measurement of democracy remains a challenge. With many democracy indicators consisting of subjective assessments rather than factual observations, trends in democracy over time could be due to human biases in the coding of these indicators rather than empirical facts. This article leverages two cutting-edge Large Language Models (LLMs) for the coding of democracy indicators from the V-Dem project. With access to huge amounts of information, these models may be able to rate the many “soft” characteristics of regimes at substantially lower costs. Whereas LLM-generated codings largely align with expert coders for many countries, we show that when these models deviate from human assessments, they do so in different but consistent ways. Some LLMs are too pessimistic and others consistently overestimate the democratic quality of these countries. Although the combination of the two LLM codings can alleviate this concern, we conclude that it is difficult to replace human coders with LLMs because the extent and direction of these attitudes is not known a priori.

Information

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of American Political Science Association
Figure 0

Figure 1 Different Types of Human Coding Tasks

Figure 1

Figure 2 Distribution of the Average Difference Across All V-Dem Questions per CountryA difference of 0 (blue line) indicates perfect correspondence with the expert coders.

Figure 2

Figure 3 Average Differences Between LLMs Ratings and Human Codings at the Level of CountriesCountries are ordered by difficulty of coding (i.e., disagreement among coders).

Figure 3

Figure 4 Difference Between LLM and Human Codings for Two Countries and Selected V-Dem Indicators

Supplementary material: File

Weidmann et al. supplementary material

Weidmann et al. supplementary material
Download Weidmann et al. supplementary material(File)
File 149 KB