Hostname: page-component-6766d58669-nf276 Total loading time: 0 Render date: 2026-05-15T12:00:29.864Z Has data issue: false hasContentIssue false

Text as Data: The Promise and Pitfalls of Automatic Content Analysis Methods for Political Texts

Published online by Cambridge University Press:  04 January 2017

Justin Grimmer*
Affiliation:
Department of Political Science, Stanford University, Encina Hall West 616 Serra Street, Stanford, CA 94305
Brandon M. Stewart
Affiliation:
Department of Government and Institute for Quantitative Social Science, Harvard University, 1737 Cambridge Street, Cambridge, MA 02138 e-mail: bstewart@fas.harvard.edu
*
e-mail: jgrimmer@stanford.edu (corresponding author)
Rights & Permissions [Opens in a new window]

Abstract

Core share and HTML view are not available for this content. However, as you have access to this content, a full PDF is available via the 'Save PDF' action button.

Politics and political conflict often occur in the written and spoken word. Scholars have long recognized this, but the massive costs of analyzing even moderately sized collections of texts have hindered their use in political science research. Here lies the promise of automated text analysis: it substantially reduces the costs of analyzing large collections of text. We provide a guide to this exciting new area of research and show how, in many instances, the methods have already obtained part of their promise. But there are pitfalls to using automated methods—they are no substitute for careful thought and close reading and require extensive and problem-specific validation. We survey a wide range of new methods, provide guidance on how to validate the output of the models, and clarify misconceptions and errors in the literature. To conclude, we argue that for automated text methods to become a standard tool for political scientists, methodologists must contribute new methods and new methods of validation.

Information

Type
Research Article
Copyright
Copyright © The Author 2013. Published by Oxford University Press on behalf of the Society for Political Methodology