On the Replicability of Data Collection Using Online News Databases

Published online by Cambridge University Press: 11 January 2023

Mikaela Karstens (The Pennsylvania State University – The Behrend College, USA)
Michael J. Soules (Naval Postgraduate School, USA)
Nick Dietrich (Ohio Wesleyan University, USA)

Abstract

News databases, such as Factiva and Nexis Uni, are vital for the construction of many commonly used datasets of political events because they provide researchers with access to thousands of diverse news sources. This article raises several issues with news databases that pose a threat to the quality and replicability of data-collection efforts. We recommend best practices for using news databases to gather event data.

Information

Type
Article
Creative Commons
Creative Commons License: CC BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2023. Published by Cambridge University Press on behalf of the American Political Science Association

Figure 1 Illustration of the Coding Process Using Newspaper Databases

Table 1 Comparison of Selected Conflict Datasets

Table 2 Comparison of Database Terms of Use for 2020

Figure 2 Variations in the Number of Factiva Search Results over Time

Figure 3 Variations in the Number of Associated Press Stories Retrieved Using the MID Search String

Supplementary material: PDF

Karstens et al. supplementary material
