Hostname: page-component-6766d58669-zlvph Total loading time: 0 Render date: 2026-05-23T04:23:50.491Z Has data issue: false hasContentIssue false

Toxic Speech and Limited Demand for Content Moderation on Social Media

Published online by Cambridge University Press:  24 January 2024

FRANZISKA PRADEL*
Affiliation:
Technical University of Munich, Germany
JAN ZILINSKY*
Affiliation:
Technical University of Munich, Germany
SPYROS KOSMIDIS*
Affiliation:
University of Oxford, United Kingdom
YANNIS THEOCHARIS*
Affiliation:
Technical University of Munich, Germany
*
Corresponding author: Franziska Pradel, Postdoctoral Researcher, Chair of Digital Governance, Department of Governance, TUM School of Social Sciences and Technology, Technical University of Munich, Germany, franziska.pradel@tum.de.
Jan Zilinsky, Postdoctoral Researcher, Chair of Digital Governance, Department of Governance, TUM School of Social Sciences and Technology, Technical University of Munich, Germany, jan.zilinsky@tum.de.
Spyros Kosmidis, Associate Professor, Department of Politics and International Relations, University of Oxford, United Kingdom, spyros.kosmidis@politics.ox.ac.uk.
Yannis Theocharis, Professor, Chair of Digital Governance, Department of Governance, TUM School of Social Sciences and Technology, Technical University of Munich, Germany, yannis.theocharis@tum.de.
Rights & Permissions [Opens in a new window]

Abstract

When is speech on social media toxic enough to warrant content moderation? Platforms impose limits on what can be posted online, but also rely on users’ reports of potentially harmful content. Yet we know little about what users consider inadmissible to public discourse and what measures they wish to see implemented. Building on past work, we conceptualize three variants of toxic speech: incivility, intolerance, and violent threats. We present results from two studies with pre-registered randomized experiments (Study 1, $ N=\mathrm{5,130} $; Study 2, $ N=\mathrm{3,734} $) to examine how these variants causally affect users’ content moderation preferences. We find that while both the severity of toxicity and the target of the attack matter, the demand for content moderation of toxic speech is limited. We discuss implications for the study of toxicity and content moderation as an emerging area of research in political science with critical implications for platforms, policymakers, and democracy more broadly.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press on behalf of American Political Science Association
Figure 0

Figure 1. Experimental Treatment GroupsNote: Possible target groups are LGBTQ people, billionaires, and truck drivers with Christian bumper stickers (Study I). See the Supplementary Material for the exact wording.

Figure 1

Table 1. Participants’ Perceptions of the Underlying Toxic Language Dimension (Study I—Targeting Social Groups)

Figure 2

Figure 2. Effects of Treatments on Support for Any Form of Content Moderation (Study I, Pooled Results and Estimates Broken Down by Target Group)Note: The point estimates and the 95% confidence intervals represent average marginal effects calculated from a binary logit model. The dependent variable is set to 1 if the respondent selected any of “Permanently remove the post,” “Place a warning label on the post,” “Reduce how many people can see the post,” or “Suspend the person’s account” as their preferred action against the offending post. $ N=\mathrm{5,130} $ (pooled data). The logit results can be found in the SM and are presented in Table S9 in the SM.

Figure 3

Figure 3. Preferences for Content Moderation by Treatment and by Experiment in Study INote: The bars show support for each type of action for the pooled model (upper left), the LGBTQ study (upper right), the Christian study (bottom left), and the Billionaires study (bottom right). The actual percentages are reported in Tables S7 and S8 in the SM.

Figure 4

Figure 4. Heterogeneous Effects of Distinct Treatments on Support for Content ModerationNote: Point estimates and the corresponding 95% confidence intervals of the top graph represent differences between Democrats and Republicans in the probability to demand content moderation by study and treatment. The bottom graph shows changes in the average probability for Democrats and Republicans when the treatments are compared to the anti-target control group. Results from the logit model can be found in the SM and are presented in Table S10 in the SM while predictions and contrasts are presented in Table S11 in the SM.

Figure 5

Table 2. Participants’ Perceptions of the Underlying Toxic Language Dimension (Study II—Targeting Partisans)

Figure 6

Figure 5. Average (Top) and Heterogeneous (Bottom) Treatment Effects on Support for Any Form of Content Moderation (Study II)Note: The top panel shows average marginal effects and the 95% confidence intervals (CIs). The bottom panel splits the estimates by partisanship. Full results of the logit models can be found in Tables S16 (upper panel) and S17 (lower panel) in the SM. Table S18 in the SM reports the predicted probabilities and 95% CIs for both graphs.

Figure 7

Figure 6. Preferences for Specific Types of Content Moderation by Treatment in Study IINote: The bars show support for each type of action for the Democrat target study (left) and for the Republican target study (right). The actual percentages are reported in Tables S19 and S20 in the SM.

Supplementary material: File

Pradel et al. supplementary material

Pradel et al. supplementary material
Download Pradel et al. supplementary material(File)
File 5.7 MB
Submit a response

Comments

No Comments have been published for this article.