Hostname: page-component-77f85d65b8-5ngxj Total loading time: 0 Render date: 2026-03-28T15:52:01.351Z Has data issue: false hasContentIssue false

Fifty years of astrobiology: mapping researcher communities with topic-based network analyses

Published online by Cambridge University Press:  13 February 2025

Christophe Malaterre*
Affiliation:
Department of Philosophy & CIRST, Université du Québec à Montréal, 455 bd. René-Lévesque Est, Montréal, QC H3C 3P8, Canada
Francis Lareau
Affiliation:
Computer Science Department, Université du Québec à Montréal, 201 av. Président-Kennedy, Montréal, QC H2X 3Y7, Canada
*
Corresponding author: Christophe Malaterre; Email: malaterre.christophe@uqam.ca
Rights & Permissions [Opens in a new window]

Abstract

Astrobiology is often defined as the study of the origin, evolution, distribution and future of life on Earth and in the Universe and thought of as a discipline. In practice though, the delineation of astrobiology-related research and corresponding groups of researchers is far from straightforward. Here, we propose to apply text-mining methods to identify researcher communities depending on thematic similarities in their published works. After fitting a latent Dirichlet allocation topic model to the complete article corpus of three flagship journals in the field – Origins of Life and Evolution of Biospheres (1968–2020), Astrobiology (2001–2020), the International Journal of Astrobiology (2002–2020) – and computing author topic profiles, researcher communities are inferred from topic similarity networks to which community detection is applied. Such semantic social networks reveal, as we call them, ‘hidden communities of interest’ that gather researchers who publish on similar topics. The evolution of these communities is also mapped through time, bringing to light the significant shifts that the discipline underwent in the past 50 years.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
Copyright © The Author(s), 2025. Published by Cambridge University Press
Figure 0

Table 1. Topics and their top-10 words

Figure 1

Figure 1. Evolution of the number of authors and articles per time period (left-hand side y-axis), with percentages of new authors, of authors with a publication weight of 1 or less, and of multi-authored articles (right-hand side y-axis).

Figure 2

Figure 2. (a) Astrobiology author network of the first time period (1968–1979) and identified communities of interest. Nodes are authors, size proportional to number of publications, colour of the dominant topic of the community topic profile. Edge thickness proportional to correlation between author topic profiles. (b) Community topic profiles.

Figure 3

Figure 3. (a) Astrobiology author network of the second time period (1980–1989) and identified communities of interest. Nodes are authors, size proportional to number of publications, colour of the dominant topic of the community topic profile. Edge thickness proportional to correlation between author topic profiles. (b) Community topic profiles.

Figure 4

Figure 4. (a) Astrobiology author network of the third time period (1990–1999) and identified communities of interest. Nodes are authors, size proportional to number of publications, colour of the dominant topic of the community topic profile. Edge thickness proportional to correlation between author topic profiles. (b) Community topic profiles.

Figure 5

Figure 5. (a) Astrobiology author network of the fourth time period (2000–2009) and identified communities of interest. Nodes are authors, size proportional to number of publications, colour of the dominant topic of the community topic profile. Edge thickness proportional to correlation between author topic profiles. (b) Community topic profiles.

Figure 6

Figure 6. (a) Astrobiology author network of the fifth time period (2010–2020) and identified communities of interest. Nodes are authors, size proportional to number of publications, colour of the dominant topic of the community topic profile. Edge thickness proportional to correlation between author topic profiles. (b) Community topic profiles.

Figure 7

Figure 7. Community distances (pairwise Hellinger distances between topic probability distributions of communities from adjacent time periods; shorter distances coloured in shades of reds indicate a higher proximity of the communities in terms of their topic distribution profiles). (a) Between communities of the first time period (1968–1979, communities 1a–1f) and communities of the second time period (1980–1989, communities 2a–2e). (b) Between communities of the second time period (1980–1989, 2a–2e) and communities of the third time period (1990–1999, 3a–3f). (C) Between communities of the third time period (1990–1999, 3a–3f) and communities of the fourth time period (2000–2009, 4a–4g). (d) Between communities of the fourth time period (2000–2009, 4a–4g) and communities of the fifth time period (2010–2020, 5a–5g).

Supplementary material: File

Malaterre and Lareau supplementary material 1

Malaterre and Lareau supplementary material
Download Malaterre and Lareau supplementary material 1(File)
File 1.7 MB
Supplementary material: File

Malaterre and Lareau supplementary material 2

Malaterre and Lareau supplementary material
Download Malaterre and Lareau supplementary material 2(File)
File 18 KB