Hostname: page-component-6766d58669-rxg44 Total loading time: 0 Render date: 2026-05-20T00:56:51.713Z Has data issue: false hasContentIssue false

Fightin' Words: Lexical Feature Selection and Evaluation for Identifying the Content of Political Conflict

Published online by Cambridge University Press:  04 January 2017

Burt L. Monroe*
Affiliation:
Department of Political Science, Quantitative Social Science Initiative, The Pennsylvania State University
Michael P. Colaresi*
Affiliation:
Department of Political Science, Michigan State University
Kevin M. Quinn*
Affiliation:
Department of Government and Institute for Quantitative Social Science, Harvard University
*
e-mail: burtmonroe@psu.edu (corresponding author)
Rights & Permissions [Opens in a new window]

Abstract

Core share and HTML view are not available for this content. However, as you have access to this content, a full PDF is available via the 'Save PDF' action button.

Entries in the burgeoning “text-as-data” movement are often accompanied by lists or visualizations of how word (or other lexical feature) usage differs across some pair or set of documents. These are intended either to establish some target semantic concept (like the content of partisan frames) to estimate word-specific measures that feed forward into another analysis (like locating parties in ideological space) or both. We discuss a variety of techniques for selecting words that capture partisan, or other, differences in political speech and for evaluating the relative importance of those words. We introduce and emphasize several new approaches based on Bayesian shrinkage and regularization. We illustrate the relative utility of these approaches with analyses of partisan, gender, and distributive speech in the U.S. Senate.

Information

Type
Special Issue: The Statistical Analysis of Political Text
Copyright
Copyright © The Author 2009. Published by Oxford University Press on behalf of the Society for Political Methodology