Hostname: page-component-89b8bd64d-nlwjb Total loading time: 0 Render date: 2026-05-07T06:48:50.981Z Has data issue: false hasContentIssue false

Positioning Political Texts with Large Language Models by Asking and Averaging

Published online by Cambridge University Press:  27 January 2025

Gaël Le Mens*
Affiliation:
Department of Economics and Business, Universitat Pompeu Fabra, Barcelona, 08005, Spain UPF-Barcelona School of Management, Barcelona, 08008, Spain Barcelona School of Economics, Barcelona, 08005, Spain
Aina Gallego
Affiliation:
Department of Political Science, Constitutional Law and Philosophy of Law, Universitat de Barcelona, Barcelona, 08007, Spain Institut Barcelona d’Estudis Internacionals, Barcelona, Spain
*
Corresponding author: Gaël Le Mens; Email: gael.le-mens@upf.edu
Rights & Permissions [Opens in a new window]

Abstract

We use instruction-tuned large language models (LLMs) like GPT-4, Llama 3, MiXtral, or Aya to position political texts within policy and ideological spaces. We ask an LLM where a tweet or a sentence of a political text stands on the focal dimension and take the average of the LLM responses to position political actors such as US Senators, or longer texts such as UK party manifestos or EU policy speeches given in 10 different languages. The correlations between the position estimates obtained with the best LLMs and benchmarks based on text coding by experts, crowdworkers, or roll call votes exceed .90. This approach is generally more accurate than the positions obtained with supervised classifiers trained on large amounts of research data. Using instruction-tuned LLMs to position texts in policy and ideological spaces is fast, cost-efficient, reliable, and reproducible (in the case of open LLMs) even if the texts are short and written in different languages. We conclude with cautionary notes about the need for empirical validation.

Information

Type
Letter
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of The Society for Political Methodology
Figure 0

Table 1 LLMs used for the comparative analyses.

Figure 1

Table 2 Reliability of the measures based on human ratings used as benchmark for assessing the performance of position estimates produced with LLMs.

Figure 2

Figure 1 Positioning tweets published by members of the US Congress on the left-right ideological spectrum ($N=899$).

Figure 3

Figure 2 Positioning Senators of the 117th Congress on the left-right ideological spectrum based on a random sample of 100 of their tweets ($N=98$). Each dot represents a senator (‘+’: Democrats, ‘x’: Republicans).

Figure 4

Figure 3 Positioning British party manifestos on the Economic policy dimension (left to right wing scale). The numbers next to the dots indicate the years of the manifestos.

Figure 5

Figure 4 Positioning British party manifestos on the Social policy dimension (liberal to conservative scale). The numbers next to the dots indicate the years of the manifestos.

Figure 6

Figure 5 Positioning EU legislative speeches in 10 languages on the “anti-subsidy” to “pro-subsidy” dimension.

Supplementary material: File

Le Mens and Gallego supplementary material

Le Mens and Gallego supplementary material
Download Le Mens and Gallego supplementary material(File)
File 7.1 MB