Hostname: page-component-89b8bd64d-rbxfs Total loading time: 0 Render date: 2026-05-08T03:40:03.517Z Has data issue: false hasContentIssue false

Micro-level network dynamics of scientific collaboration and impact: Relational hyperevent models for the analysis of coauthor networks

Published online by Cambridge University Press:  28 November 2022

Jürgen Lerner*
Affiliation:
University of Konstanz, Germany RWTH Aachen, Germany
Marian-Gabriel Hâncean
Affiliation:
University of Bucharest, Romania
*
*Corresponding author. Email: juergen.lerner@uni-konstanz.de
Rights & Permissions [Opens in a new window]

Abstract

We discuss a recently proposed family of statistical network models—relational hyperevent models (RHEMs)—for analyzing team selection and team performance in scientific coauthor networks. The underlying rationale for using RHEM in studies of coauthor networks is that scientific collaboration is intrinsically polyadic, that is, it typically involves teams of any size. Consequently, RHEM specify publication rates associated with hyperedges representing groups of scientists of any size. Going beyond previous work on RHEM for meeting data, we adapt this model family to settings in which relational hyperevents have a dedicated outcome, such as a scientific paper with a measurable impact (e.g., the received number of citations). Relational outcome can on the one hand be used to specify additional explanatory variables in RHEM since the probability of coauthoring may be influenced, for instance, by prior (shared) success of scientists. On the other hand, relational outcome can also serve as a response variable in models seeking to explain the performance of scientific teams. To tackle the latter, we propose relational hyperevent outcome models that are closely related with RHEM to the point that both model families can specify the likelihood of scientific collaboration—and the expected performance, respectively—with the same set of explanatory variables allowing to assess, for instance, whether variables leading to increased collaboration also tend to increase scientific impact. For illustration, we apply RHEM to empirical coauthor networks comprising more than 350,000 published papers by scientists working in three scientific disciplines. Our models explain scientific collaboration and impact by, among others, individual activity (preferential attachment), shared activity (familiarity), triadic closure, prior individual and shared success, and prior success disparity among the members of hyperedges.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/) which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2022. Published by Cambridge University Press
Figure 0

Figure 1. Bottom: a list of five hyperevents $e_i=(t_i,h_i)$ representing publication events at event times $t_1\lt \ldots,t_5$. Authors are denoted by letters $A,\ldots,I$; event type and relational outcome are not given in this example. left: representation of the hyperevents as a two-mode “author-paper” network. Papers (that is, events) are displayed as rectangular nodes labeled $e_1,\ldots,e_5$ and are connected to their authors by solid lines. The event nodes are ordered from top to bottom by publication time and older papers are represented by nodes with a darker shade. Right: representation of the hyperevents as a hypergraph. Hyperedges represent papers (that is, events) and are displayed as gray-shaded convex hulls enclosing their authors (gray shades of hyperedges match those of the event nodes in the two-mode network). Dashed lines represent author–author ties in the one-mode projection.

Figure 1

Figure 2. Illustrating closure effects. Bottom: a list of five hyperevents $e_i=(t_i,h_i)$ representing publication events. Left: representation of the hyperevents as a hypergraph. Hyperedges represent papers (that is, events) and are displayed as gray-shaded convex hulls enclosing their authors. Dashed lines represent author–author ties in the one-mode projection (compare Figure 1). right: representation of the identical hypergraph with two additional hyperedges (possible candidates for future events), $h=\{C,F,G\}$ and $h'=\{D,E,H\}$, represented as white convex hulls with dark borders enclosing their members. An event on $\{D,E,H\}$ would point to a closure effect, while an event on $\{C,F,G\}$ could alternatively be explained by subset repetition.

Figure 2

Figure 3. Histogram of sizes of observed hyperevents in the coauthor data. Hyperevent size is the number of authors of published papers.

Figure 3

Table 1. Descriptive statistics of the empirical coauthor networks in the three disciplines: number of papers (i.e., publication events), number of unique authors, number of links in the two-mode author-paper networks, maximum and mean of the number of authors per paper (note that the number of authors per paper has been limited to 100; see the text for additional explanation and see Figure 3 for the distribution of the number of authors per paper) and the number of papers per author

Figure 4

Table 2. Correlation of hyperedge statistics computed from the combined observations (events and sampled controls) from the three disciplines

Figure 5

Table 3. CoxPH model for publication rates associated with hyperedges, estimated parameters and standard errors (in brackets)

Figure 6

Table 4. OLS for impact (normalized number of citations) of published papers, estimated parameters and standard errors (in brackets)

Figure 7

Table A1. Effect of the number of past collaborators and subset repetition of higher order. CoxPH model for publication rates associated with hyperedges. Estimated parameters and standard errors (in brackets). The first model has been already reported in the main text. Second model includes the (weighted) number of prior collaborators and the success-weighted number of collaborators. The third model includes subset repetition and prior success up to order 10.

Figure 8

Table A2. Effect of the number of past collaborators and subset repetition of higher order. OLS for impact (normalized number of citations) of published papers. Estimated parameters and standard errors (in brackets). The first model has already been reported in the main text. The second model includes the statistics for the weighted number of past collaborators and the respective statistic weighted by success. The third model includes subset repetition and prior shared success up to order 10.

Figure 9

Table A3. Interaction of hyperevent effects with hyperedge size (i.e., number of authors of published papers). CoxPH model for publication rates associated with hyperedges. Estimated parameters and standard errors (in brackets). The first model has been already reported in the main text. The second model interacts all effects with the size of hyperedges.

Figure 10

Table A4. Interaction of hyperevent effects with hyperedge size (i.e., number of authors of published papers). OLS for impact (normalized number of citations) of published papers. Estimated parameters and standard errors (in brackets). The first model has been already reported in the main text. The second model interacts all effects with the size of hyperedges.

Figure 11

Table A5. Letting the effect of past events decay over time. CoxPH model for publication rates associated with hyperedges. Estimated parameters and standard errors (in brackets). The first model has been already reported in the main text. The second model lets the effect of past events on future events decay with a half life of 5 years.

Figure 12

Table A6. Letting the effect of past events decay over time. Estimated parameters and standard errors (in brackets). OLS for impact (normalized number of citations) of published papers. The first model has been already reported in the main text. The second model lets the effect of past events on future events decay with a half life of 5 years.