Hostname: page-component-77f85d65b8-grvzd Total loading time: 0 Render date: 2026-03-29T16:24:08.346Z Has data issue: false hasContentIssue false

All that glitters is not gold: Relational events models with spurious events

Published online by Cambridge University Press:  16 September 2022

Cornelius Fritz*
Affiliation:
Department of Statistics, LMU Munich, Munich, Germany
Marius Mehrl
Affiliation:
Geschwister Scholl Institute of Political Science, LMU Munich, Munich, Germany
Paul W. Thurner
Affiliation:
Geschwister Scholl Institute of Political Science, LMU Munich, Munich, Germany
Göran Kauermann
Affiliation:
Department of Statistics, LMU Munich, Munich, Germany
*
*Corresponding author. E-mail: cornelius.fritz@stat.uni-muenchen.de
Rights & Permissions [Opens in a new window]

Abstract

As relational event models are an increasingly popular model for studying relational structures, the reliability of large-scale event data collection becomes more and more important. Automated or human-coded events often suffer from non-negligible false-discovery rates in event identification. And most sensor data are primarily based on actors’ spatial proximity for predefined time windows; hence, the observed events could relate either to a social relationship or random co-location. Both examples imply spurious events that may bias estimates and inference. We propose the Relational Event Model for Spurious Events (REMSE), an extension to existing approaches for interaction data. The model provides a flexible solution for modeling data while controlling for spurious events. Estimation of our model is carried out in an empirical Bayesian approach via data augmentation. Based on a simulation study, we investigate the properties of the estimation procedure. To demonstrate its usefulness in two distinct applications, we employ this model to combat events from the Syrian civil war and student co-location data. Results from the simulation and the applications identify the REMSE as a suitable approach to modeling relational event data in the presence of spurious events.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives licence (http://creativecommons.org/licenses/by/4.0/) which permits unrestricted re-use, distribution and reproduction in any medium, provided that no alterations are made and the original article is properly cited. The written permission of Cambridge University Press must be obtained prior to any commercial use and/or adaptation of the article.
Copyright
© The Author(s), 2022. Published by Cambridge University Press
Figure 0

Figure 1. Graphical illustrations of endogenous and exogenous covariates. Solid lines represent past interactions, while dotted lines are possible but unrealized events. Node coloring indicates the node’s value on a categorical covariate. The relative risk of the events in the second row compared to the events in the first row is $\exp \{\theta _{end}\}$ if all other covariates are fixed, where $\theta _{end}$ is the coefficient of the respective statistic of each row.

Figure 1

Figure 2. Graphical illustration of a possible path of the counting process of observed events ($N_{ab}(t)$) between actors $a$ and $b$ that encompasses spurious ($N_{ab,0}(t)$) and true events ($N_{ab,1}(t)$).

Figure 2

Table 1. Result of the simulation study for the REMSE and REM with the two data-generating processes (DG 1, DG 2). For each DG and covariate, we note the AVE (AVerage Estimate), RMSE (Root-Mean-Squared Error), and CP (Coverage Probability). We report the average Percentage of False Events (PFE) for each DG in the last row

Figure 3

Table 2. Descriptive information on the two analyzed data sets

Figure 4

Table 3. Combat events in the Syrian civil war: Estimated coefficients with confidence intervals noted in brackets in the first column, while the Z values are given in the second column. The results of the REMSE are given in the first two columns, while the coefficients of the REM are depicted in the last two columns. The last row reports the estimated average Percentage of False Events (PFE)

Figure 5

Table 4. Co-location Events in University Housing: Estimated coefficients with confidence intervals noted in brackets in the first column, while the Z values are given in the second column. The results of the REMSE are given in the first two columns, while the coefficients of the REM are depicted in the last two columns. The last row reports the estimated average Percentage of False Events (PFE)