Hostname: page-component-89b8bd64d-dvtzq Total loading time: 0 Render date: 2026-05-07T22:09:37.980Z Has data issue: false hasContentIssue false

What Would You Say? Estimating Causal Effects of Social Context on Political Expression

Published online by Cambridge University Press:  27 December 2024

William Small Schulz*
Affiliation:
Department of Politics, Princeton University, Princeton, NJ, USA.
Rights & Permissions [Opens in a new window]

Abstract

I develop a survey method for estimating social influence over individual political expression, by combining the content-richness of document scaling with the flexibility of survey research. I introduce the “What Would You Say?” question, which measures self-reported usage of political catchphrases in a hypothetical social context, which I manipulate in a between-subjects experiment. Using Wordsticks, an ordinal item response theory model inspired by Wordfish, I estimate each respondent’s lexical ideology and outspokenness, scaling their political lexicon in a two-dimensional space. I then identify self-censorship and preference falsification as causal effects of social context on respondents’ outspokenness and lexical ideology, respectively. This improves upon existing survey measures of political expression: it avoids conflating expressive behavior with populist attitudes, it defines preference falsification in terms of code-switching, and it moves beyond trait measures of self-censorship, to characterize relative shifts in the content of expression between different contexts. I validate the method and present experiments demonstrating its application to contemporary concerns about self-censorship and polarization, and I conclude by discussing its interpretation and future uses.

Information

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press on behalf of The Society for Political Methodology
Figure 0

Figure 1 The proposed method uses a novel survey question (a) to scale respondents (b) and phrases (c). Panel (a) shows an example of the “What Would You Say?” (see Section 2.1) question with the three-point response scale used in Studies 1, 2, and 4. Study 3 used a four-point response scale. Typically 20 phrases are shown. Panel (b) plots respondents’ latent lexical ideology $\alpha $ (x-axis) and outspokenness $\beta $ (y-axis) estimated from WWYS responses using the Wordsticks model (see Section 2.2), with points colored according to observed responses to the example phrase systemic racism, and dashed lines marking boundaries between most-probable-response regions (see Equations (2) and (3)) estimated by Wordsticks. Data from Studies 1, 2, and 4, collected in 2021–22 ($N=2,058$). Supplementary Appendix G provides similar plots for all phrases. Panel (c) provides an ideological scaling of phrases, in terms of predicted response at different points of the lexical ideology ($\alpha $) spectrum, for a respondent with $\beta = 0$. Thresholds between predicted response regions are estimated as c. Pixel values were averaged over 10,000 posterior draws, to create smooth gradients that represent estimation uncertainty. Rugplots on the x-axes similarly plot the (standard normal) posterior distribution of respondent lexical ideology $\alpha $, to illustrate respondent density along this dimension. Supplementary Appendix H provides a version of this plot showing all phrases, as well as a plot of phrases from Study 3, which used a four-point scale. Supplementary Appendix F visualizes phrases’ $\gamma $ slant parameter estimates (which determine the direction of the gradient) in isolation. Data on phrases used in Studies 1, 2, and 4 (see Section 3) collected in 2021–22, based on Wordsticks scaling model (see Section 2) including pooled data from all studies, collected in 2021–23 (see Supplementary Appendix D).

Figure 1

Table 1 Study information.

Figure 2

Figure 2 Validation of lexical ideology estimates against (a) issue ideology derived from one-dimensional scaling of 10 issue preference items, and (b) signed identity strength constructed by one-dimensional scaling of three ideological identity questions and interacting with a signed indicator for liberal/conservative identification. Panel (c) plots regression coefficients (with 95% CIs) for both of these measures predicting $\alpha $ (see Table 2, model 4). Data from Studies 2 and 3, collected in 2021 and 2023 ($N=1,701$).

Figure 3

Table 2 Alpha regressions.

Figure 4

Figure 3 Additional validations of $\alpha $ and $\beta $ estimates: Panel (a) plots lexical ideology ($\alpha $) against ideology scores from hand-labeled tweets ($p<0.01$, $R^2$=.08, data collected in Study 4, 2022, $N = 148$). Points represent users, and are shaded according to the number of tweets available from that user, ranging from 1 tweet (light grey) to 10 tweets (black). Panel (b) plots outspokenness ($\beta $) against participants’ responses to the question, “Generally speaking, how outspoken are you about politics and current events?” ($p<0.001$, $R^2$=.09, data collected in Study 3, 2023, $N = 901$).

Figure 5

Figure 4 Linear regression coefficients (and 95% CIs) for descriptive predictors and context treatment effects. Panel (a) plots the descriptive predictors of lexical ideology $\alpha $ (see Table 2, model 1, $N = 2,200$; data from Studies 1, 2, and 3, collected in 2021–23). Panel (b) plots the descriptive predictors of outspokenness $\beta $ (see Table 3, model 1, $N = 1,698$; data from Studies 2 and 3, collected in 2021–23). Panel (c) plots the effects of social context treatments: the effects of the “someone you just met” treatment (black) and the “most liberal/conservative person you talk to” treatments (blue/red) are plotted relative to the “close friend” control (black circle, at origin by construction). In this two-dimensional plot, the x-axis denotes left-right shift in lexical ideology $\alpha $ (see Table 2, model 2, $N = 1,401$), and the y-axis denotes up-down shifts in outspokenness $\beta $ (see Table 3, model 3, $N = 1,401$; Data from Studies 1 and 3, collected in 2021–23).

Figure 6

Figure 5 Study 3 treatment coefficients (and 95% CIs) by respondent ideology: Liberals (Panel (a); see model 1 of Supplementary Tables 7 and 8), Moderates (Panel (b); see model 2 of Supplementary Tables 7 and 8), and Conservatives (Panel (c); see model 3 of Supplementary Tables 7 and 8). Data from Studies 1 and 3, collected in 2021–23 (see Supplementary Appendix M).

Figure 7

Figure 6 Phrase-level effects of “most liberal/conservative person you know” treatments, subset by treatment condition and respondent ideology (see annotations at top). Barbs plotted in green (red) point upward (downward) in proportion to the increase (decrease) in phrase usage induced by the treatment in each subset of respondents. Ordinal logit models were estimated for all 40 phrases included in Study 3. False discovery was attenuated by Benjamini–Hochberg correction; phrases are only plotted if $p_{\text {corrected}}<0.05$. Within each subgroup, phrases are ordered horizontally by rank(gamma) so that left-slanted phrases appear to the left of right-slanted phrases. Data from Study 3, collected in 2023 ($N = 901$).

Figure 8

Table 3 Beta regressions.

Supplementary material: File

Spohn supplementary material

Spohn supplementary material
Download Spohn supplementary material(File)
File 23.9 MB