Hostname: page-component-89b8bd64d-7zcd7 Total loading time: 0 Render date: 2026-05-09T12:39:55.522Z Has data issue: false hasContentIssue false

When can networks be inferred from observed groups?

Published online by Cambridge University Press:  12 April 2024

Zachary P. Neal*
Affiliation:
Michigan State University, East Lansing, MI, USA
*
Rights & Permissions [Opens in a new window]

Abstract

Collecting network data directly from network members can be challenging. One alternative involves inferring a network from observed groups, for example, inferring a network of scientific collaboration from researchers’ observed paper authorships. In this paper, I explore when an unobserved undirected network of interest can accurately be inferred from observed groups. The analysis uses simulations to experimentally manipulate the structure of the unobserved network to be inferred, the number of groups observed, the extent to which the observed groups correspond to cliques in the unobserved network, and the method used to draw inferences. I find that when a small number of groups are observed, an unobserved network can be accurately inferred using a simple unweighted two-mode projection, provided that each group’s membership closely corresponds to a clique in the unobserved network. In contrast, when a large number of groups are observed, an unobserved network can be accurately inferred using a statistical backbone extraction model, even if the groups’ memberships are mostly random. These findings offer guidance for researchers seeking to indirectly measure a network of interest using observations of groups.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press
Figure 0

Figure 1. Relationship between an unobserved network, observed groups, and inferred network. Accuracy may depend on (a) the structure of the unobserved network, (b) the number of observed groups, (c) the extent to which observed group correspond to cliques in the unobserved network, and (d) the method used to infer network relationships from group memberships.

Figure 1

Table 1. Summary of factorial experimental design

Figure 2

Figure 2. Accuracy of a network inferred from observed groups using an unweighted projection, by (a) the structure of the unobserved network being inferred, (b) number of groups observed, and (c) extent to which the observed groups correspond to cliques. Accuracy is measured using the correlation between the unobserved and inferred networks.

Figure 3

Table 2. Regression predicting the accuracy of an network inferred using a given approach, as a function of the unobserved network’s topology and characteristics of the observed groups

Figure 4

Figure 3. Accuracy of a network inferred from observed groups using a backbone extracted with the stochastic degree sequence model, by (a) the structure of the unobserved network being inferred, (b) number of groups observed, and (c) extent to which the observed groups correspond to cliques. Accuracy is measured using the correlation between the unobserved and inferred networks.