Hostname: page-component-6766d58669-7cz98 Total loading time: 0 Render date: 2026-05-18T08:00:37.548Z Has data issue: false hasContentIssue false

Classification Accuracy as a Substantive Quantity of Interest: Measuring Polarization inWestminster Systems

Published online by Cambridge University Press:  29 January 2018

Andrew Peterson*
Affiliation:
Postdoctoral Researcher, University of Geneva, Switzerland. Email: andrew.peterson@unige.ch
Arthur Spirling
Affiliation:
Associate Professor of Politics and Data Science, New York University, USA. Email: arthur.spirling@nyu.edu
Rights & Permissions [Opens in a new window]

Abstract

Measuring the polarization of legislators and parties is a key step in understanding how politics develops over time. But in parliamentary systems—where ideological positions estimated from roll calls may not be informative—producing valid estimates is extremely challenging. We suggest a new measurement strategy that makes innovative use of the “accuracy” of machine classifiers, i.e., the number of correct predictions made as a proportion of all predictions. In our case, the “labels” are the party identifications of the members of parliament, predicted from their speeches along with some information on debate subjects. Intuitively, when the learner is able to discriminate members in the two main Westminster parties well, we claim we are in a period of “high” polarization. By contrast, when the classifier has low accuracy—and makes a relatively large number of mistakes in terms of allocating members to parties based on the data—we argue parliament is in an era of “low” polarization. This approach is fast and substantively valid, and we demonstrate its merits with simulations, and by comparing the estimates from 78 years of House of Commons speeches with qualitative and quantitative historical accounts of the same. As a headline finding, we note that contemporary British politics is approximately as polarized as it was in the mid-1960s—that is, in the middle of the “postwar consensus”. More broadly, we show that the technical performance of supervised learning algorithms can be directly informative about substantive matters in social science.

Information

Type
Letter
Copyright
Copyright © The Author(s) 2018. Published by Cambridge University Press on behalf of the Society for Political Methodology. 
Figure 0

Table 1. Definition of terms for accuracy calculation.

Figure 1

Figure 1. Classification accuracy ($y$-axis) for different levels of separation ($x$-axis) at different levels of noise.

Figure 2

Figure 2. Density plot of predicted probability conservative for different levels of noise. Note that as the fraction of noise in the data generating process increases, the mean positions of the parties are forced closer together.

Figure 3

Figure 3. Estimates of parliamentary polarization, by session. Election dates mark $x$-axis. Estimated change points are [green] vertical lines.

Figure 4

Figure 4. Mean variance by session.

Figure 5

Figure 5. Left/right (RILE) scores from the Manifesto Project. Higher scores correspond to more right wing policies. Lines are difference between the parties (solid) and lowess (broken) of the same.

Supplementary material: File

Peterson and Spirling supplementary material 1

Peterson and Spirling supplementary material

Download Peterson and Spirling supplementary material 1(File)
File 200.1 KB