Hostname: page-component-77f85d65b8-45ctf Total loading time: 0 Render date: 2026-03-27T12:50:08.179Z Has data issue: false hasContentIssue false

Measuring Media Criticism with ALC Word Embeddings

Published online by Cambridge University Press:  26 August 2025

Christopher Barrie*
Affiliation:
Department of Sociology, New York University , New York, NY 10012, USA
Neil Ketchley
Affiliation:
Department of Politics and International Relations, University of Oxford , Oxford OX1 2JD, UK
Alexandra Siegel
Affiliation:
Department of Political Science, University of Colorado , Boulder, CO 80309, USA
Mossaab Bagdouri
Affiliation:
BagTek, Tetouan 93020, Morocco
*
Corresponding author: Christopher Barrie; Email: christopher.barrie@nyu.edu
Rights & Permissions [Opens in a new window]

Abstract

The ability of news media to report on events and opinions that are critical of the executive branch of government is central to media freedom and a marker of meaningful democratization. Existing indices use scoring criteria or expert surveys to develop country year measures of media criticism. In this article, we introduce a computationally inexpensive and fully open-source method for estimating media criticism from news articles using à la carte (ALC) word embeddings. We validate our approach using Arabic-language news media published during the Arab Spring. An applied example demonstrates how our technique generates credible estimates of changes in media criticism after a democratic transition is ended by a military coup. Experiments demonstrate the method works even with sparse data. Analyses of synthetic news media demonstrate that the method extends to multiple languages. Our approach points to new possibilities in the monitoring of media freedom within authoritarian and democratizing settings.

Information

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of The Society for Political Methodology
Figure 0

Figure 1 Data analysis pipeline for measurement of media criticism in Arabic-language news media. This figure, illustrates the key steps in our methodology. (1) We collect news articles from publicly available sources across multiple countries and preprocess the text by removing stopwords, punctuation, and irrelevant content. (2) We identify mentions of political leaders and extract context words appearing within a six-word window around each mention. (3) Using pre-trained GloVe embeddings, we generate a reference embedding layer for all articles, which forms the basis for estimating media criticism. (4) We apply the ALC embedding method to construct time-specific word embeddings for each leader, allowing us to track changes in media discourse over time. (5) We compute a criticism index by projecting leader embeddings onto a semantic dimension spanning words associated with support and opposition. This pipeline enables us to estimate media criticism dynamically and at a granular temporal scale.

Figure 1

Figure 2 A: V-Dem media criticism scores over time across all countries; B: Normalized cosine similarity criticism scores over time across all countries. Lines in panel B is smoothed (LOESS) curve with span ($\alpha $) set to 0.5. Confidence intervals for the V-Dem scores in panel A are based on cross-coder aggregations calculated according to the Bayesian item response theory measurement model described in Coppedge et al. (2019).

Figure 2

Figure 3 Top panel: F-statistics over time for changepoint procedure across versions; Bottom panel: text-based criticism scores for Egypt and Tunisia and breakpoints for each version. Breakpoints displayed with slight offset for visibility.

Figure 3

Figure 4 Treatment effect of the coup (black arrow) on media criticism in Egypt using Tunisian newspapers as a counterfactual. Dashed line marks the coup.

Figure 4

Figure 5 Normalized cosine similarity criticism scores over time across eight languages using synthetic articles generated by gpt-3.5-turbo. Coloured lines represent the linear best fit over each period. The line at 0 is the point at which the LLM shifts to producing “not critical” articles.

Figure 5

Figure 6 Normalized cosine similarity criticism scores over time across eight languages using synthetic articles generated by gpt-4o. Coloured lines represent the linear best fit over each period. The line at 0 is the point at which the LLM shifts to producing “not critical” articles.

Supplementary material: File

Barrie et al. supplementary material

Barrie et al. supplementary material
Download Barrie et al. supplementary material(File)
File 1 MB
Supplementary material: Link

Barrie et al. Dataset

Link