Hostname: page-component-77f85d65b8-2tv5m Total loading time: 0 Render date: 2026-03-28T06:50:17.477Z Has data issue: false hasContentIssue false

Looking inside the black box – hybrid epidemiology approaches to identify causal inferences

Published online by Cambridge University Press:  15 July 2025

Ahmed Elagali*
Affiliation:
Plastics and Human Health, Minderoo Foundation, Perth, WA, Australia School of Biological Sciences, The University of Western Australia, Perth, WA, Australia Florey Institute of Neuroscience and Mental Health, University of Melbourne, Parkville, VIC, Australia
Katherine Drummond
Affiliation:
Florey Institute of Neuroscience and Mental Health, University of Melbourne, Parkville, VIC, Australia
Adriano Winterton
Affiliation:
Division of Mental and Physical Health, Norwegian Institute of Public Health , Oslo, Norway
Yannick Mulders
Affiliation:
Plastics and Human Health, Minderoo Foundation, Perth, WA, Australia
Christos Symeonides
Affiliation:
Plastics and Human Health, Minderoo Foundation, Perth, WA, Australia
Bhedita Seewoo
Affiliation:
Plastics and Human Health, Minderoo Foundation, Perth, WA, Australia School of Biological Sciences, The University of Western Australia, Perth, WA, Australia
Gro Andersen
Affiliation:
Division of Mental and Physical Health, Norwegian Institute of Public Health , Oslo, Norway
Anne-Louise Ponsonby
Affiliation:
Florey Institute of Neuroscience and Mental Health, University of Melbourne, Parkville, VIC, Australia
Sarah Dunlop
Affiliation:
Plastics and Human Health, Minderoo Foundation, Perth, WA, Australia School of Biological Sciences, The University of Western Australia, Perth, WA, Australia
*
Corresponding author: Ahmed Elagali; Email: aelagali@minderoo.org
Rights & Permissions [Opens in a new window]

Abstract

Plastic chemicals are numerous and ubiquitous in modern life and pose significant risks to human health. Observational epidemiological studies have been instrumental in identifying consistent and statistically significant associations between exposure to certain chemicals and adverse health outcomes. However, these studies often fail to establish causality due to the complexity of real-world chemical mixtures, confounding factors, reverse causation, and study designs that lack measures reflecting underlying genetic and cellular mechanisms indicating causal pathways to harm. Addressing these limitations requires moving beyond traditional ‘black-box’ epidemiology, which mainly focuses on the strength of associations. We propose adopting hybrid epidemiological methodologies that incorporate genetic susceptibility and molecular mechanisms to uncover biological pathways, combined with machine learning and statistical analysis of chemical mixtures, to strengthen the causal evidence linking exposure to harm. By integrating observational multi-omics data with experimental and mechanistic models, hybrid epidemiology offers a transformative path to improve causal evidence and public health interventions. In addition, machine learning and statistical methods provide a more nuanced understanding of the health effects of exposures to plastic chemical mixtures, facilitating the identification of interactions within chemical mixtures and the influence of biological pathways. This paradigm shift is critical addressing the complex challenges of plastic exposure and protecting human health.

Information

Type
Perspective
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Figure 1. Unlike the traditional epidemiological approach (represented by the grey arrow), which examines the relationship between a single exposure and a disease outcome while many critical aspects of the biological pathway remain uncovered, hybrid epidemiology (depicted by the purple arrow) adopts a more comprehensive view. The hybrid epidemiology approach considers the broader exposome and examines how factors such as genetic predisposition, molecular mechanisms, environmental factors and lifestyle variations within individuals can modulate this effect. Hybrid epidemiology goes a step further by integrating multiple analytical approaches and techniques – including machine learning, multi-omics, advanced statistical methods and animal or cell models – to triangulate and identify the most parsimonious effect of exposure on disease aetiology.

Author comment: Looking inside the black box – hybrid epidemiology approaches to identify causal inferences — R0/PR1

Comments

No accompanying comment.

Review: Looking inside the black box – hybrid epidemiology approaches to identify causal inferences — R0/PR2

Conflict of interest statement

Reviewer declares none.

Comments

General Comments

This perspective article is well-written and makes a positive contribution to the field.

The article recommends a shift toward hybrid epidemiological approaches that incorporate genetic and molecular data to explore underlying biological mechanisms and move beyond traditional “black-box” approaches. It also recommends combing this with machine learning and statistical techniques to analyse chemical mixtures, enabling a deeper understanding of their combined health effects and the biological pathways they influence.

The authors effectively convey their messages by seamlessly guiding the reader through epidemiology, statistics, and basic sciences. The recommendations flow logically as a natural outcome of the preceding discussion.

Below are a few comments that may improve the quality of the paper, primarily focusing on making it more accessible and broadening its readership.

(1)

Pages 5 (from line 51) to page 6 (up to line 49): From “In recent years, numerous methods have been introduced….” to “ …….. omics data and mixed categorical and continuous variables—remains limited”

- This section provides a helpful overview of statistical methods used in mixture analysis. It might be beneficial to briefly expand on the background (specifically, the challenges these methods are designed to address) before describing how each method tackles them. Providing this context would be especially valuable in a ‘Perspective’ paper, as it helps engage a broader audience, including readers without advanced statistical expertise.

(2)

Page 6, lines 56-59: “This preliminary step helps refine the dataset and enhance model performance before implementing Bayesian approaches such as Bayesian Hierarchical Models (Gelman et al. 1995), BKMR, or Bayesian Additive Regression Trees (Chipman, George, and McCulloch 2010)”.

- To make it more informative for readers, it could be useful to briefly explain each approach and how it relates to the topic (for example, in what contexts each method is most suitable). This would help strengthen the connection between the methodology and its application and make the discussion more accessible.

(3)

Page 7, lines 3-25: “An alternative to this multistage analysis is the application of machine learning techniques, such as neural networks ….” to the end of the paragraph

- Machine learning is mentioned several times in the manuscript and included as a keyword. Since this section introduces machine learning for the first time, it offers an opportunity to engage the reader. Consider rewriting this part to make the introduction more engaging. It would be helpful to explain how machine learning can enhance causal inference and to provide context for the techniques mentioned, so readers can better understand their relevance and application.

(4)

Page 7, lines 39-44: “This approach, extensively employed across various disciplines, including theoretical physics, astrophysics, and mathematics (Tegmark 2015; Abbott et al. 2016; Lawlor, Tilling, and Davey Smith 2016), leverages complementary methodologies to enhance causal understanding”.

- Triangulation is a well-established concept in epidemiology, so this quoted part may be a bit redundant. Authors might consider removing it to keep the text more concise and focused.

(5)

While the term “black-box” epidemiology is conceptually engaging, it might be clearer for readers if a term like “classical” or “traditional” epidemiology were used instead.

(6)

I wonder if the phrase “establishing causality” might be a little overused here. In a practical context like this paper, it might be more helpful for readers if we focus on how credible it is to suggest that the exposure caused the outcome, rather than framing it as definitively establishing causality.

(7)

The authors make a strong case for the shift toward new approaches. It may be helpful to briefly acknowledge some of the key challenges or limitations that could arise in implementing this shift, to provide a more balanced ‘perspective’.

(8)

I suggest having a look at the figure and its various components. I’m not sure it clearly conveys the intended messages in its current form.

Minor comments

(9)

I am not sure if GDP needs to be spelled out.

Review: Looking inside the black box – hybrid epidemiology approaches to identify causal inferences — R0/PR3

Conflict of interest statement

Reviewer declares none.

Comments

This manuscript presents a perspective on the limitations of traditional observational epidemiology in studying the health impacts of plastic chemicals and proposes a shift towards hybrid epidemiological methodologies. The authors argue that relying solely on associations between exposures and outcomes (“black box” epidemiology) is insufficient to establish causality due to the complexity of chemical mixtures, confounding factors, and the absence of underlying mechanistic data. They advocate for integrating genetic susceptibility, molecular mechanisms, multiomics data, machine learning, and advanced statistical analyses to strengthen causal inference and improve public health interventions leveraging a triangulation of evidence.

This represents a crucial and timely issue in environmental health research. The proposed hybrid epidemiological framework has significant scientific value and potential impact. While this is not a completely novel concept/idea, this perspective offers a useful proposal to move beyond simple associations and delve into the underlying biological pathways altered by complex chemical mixtures.

There are however several aspects that should be better addressed to strengthen the relevance, clarity and rigour of the manuscript, as suggested in the comments below.

While the manuscript provides a general overview of the hybrid epidemiological framework, it could benefit from a more specific definition of applicability domains. For example, the authors could refer to the seminal review by Peters et al 1, and explain how the hallmarks of environmental insults can be leveraged in their integrative framework.

“These challenges emphasize the critical need for a hybrid epidemiological research approach that incorporates advancements in chemical mixture analysis, machine learning, and molecular epidemiology techniques to strengthen causal inferences.”

The need to integrate experimental testing with clinical data to provide the causal link should be better emphasised in the above passage and overall in the manuscript.

While the manuscript covers a wide range of topics, some sections could benefit from more in-depth discussion. For example, when presenting the need to integrate epidemiological associations with molecular mechanisms, the authors could better discuss recent technological advances, particularly in the areas of single cell omics and human organoid modelling, that are enabling novel and more powerful ways to link epidemiological data to molecular mechanisms (as for example discussed in 2–4).

The manuscript acknowledges some limitations of the proposed methodologies (e.g., interpretability of machine learning models), however a more in-depth discussion of potential challenges and limitations would be beneficial, in particular including issues and challenges related to data availability, data sharing, and data harmonization across heterogeneous sources. Also for single chemical vs mixtures effects there are several challenges related to the modelling of additive vs synergistic effects that should be better presented and discussed.

The recent perspective in Science about integrating exposomics into biomedicine 5 seems an interesting reference for this manuscript.

As a more theoretical thought/suggestion: a recent paper proposes to use mathematical frameworks from information theory to quantitatively understand information flow in cell biology 6. Are similar principles implied when the “complementary methodologies in theoretical physics, astrophysics, and mathematics to enhance causal understanding” are mentioned to explain the “triangulation of evidence” approach proposed here? If yes it would be interesting to discuss these converging development of analytical approaches from epidemiology to cell biology.

1. Peters, A., Nawrot, T. S. & Baccarelli, A. A. Hallmarks of environmental insults. Cell 184, 1455–1468 (2021).

2. Farbehi, N. et al. Integrating population genetics, stem cell biology and cellular genomics to study complex human diseases. Nat. Genet. 56, 758–766 (2024).

3. Cuomo, A. S. E., Nathan, A., Raychaudhuri, S., MacArthur, D. G. & Powell, J. E. Single-cell genomics meets human genetics. Nat. Rev. Genet. 24, 535–549 (2023).

4. Caporale, N. et al. Multiplexing cortical brain organoids for the longitudinal dissection of developmental traits at single-cell resolution. Nat. Methods 22, 358–370 (2025).

5. Miller, G. W. & Banbury Exposomics Consortium. Integrating exposomics into biomedicine. Science 388, 356–358 (2025).

6. Quake, S. R. The cellular dogma. Cell 187, 6421–6423 (2024).

Recommendation: Looking inside the black box – hybrid epidemiology approaches to identify causal inferences — R0/PR4

Comments

The Cambridge Prisms Plastics editorial team have worked very hard to secure appropriate available reviewers for this manuscript. It is unfortunate this has meant that the authors have had to wait so long for this review outcome. We are very happy that we are now in the position to provide this recommendation.

The manuscript is well-written. Both reviewers are complementary of the manuscript and share the opinion that the article will make a positive contribution to the field. Reviewer 2 describes this as a “a crucial and timely issue in environmental health research. The proposed hybrid epidemiological framework has significant scientific value and potential impact [offering]… a useful proposal to move beyond simple associations and delve into the underlying biological pathways altered by complex chemical mixtures.”

Both reviewers offer recommendations to strengthen some sections of the manuscript including providing more in-depth discussion and context in areas, as well as specific definitions. One reviewer provides a list of potentially helpful publications to assist the authors in deepening the theoretical analysis. This manuscript will be significantly strengthened if the authorial team systematically consider and respond to the recommendations of both reviewers.

Decision: Looking inside the black box – hybrid epidemiology approaches to identify causal inferences — R0/PR5

Comments

No accompanying comment.

Author comment: Looking inside the black box – hybrid epidemiology approaches to identify causal inferences — R1/PR6

Comments

No accompanying comment.

Recommendation: Looking inside the black box – hybrid epidemiology approaches to identify causal inferences — R1/PR7

Comments

I am wholly satisfied that the authors have comprehensively, systematically, and appropriately responded to all the reviewers’ comments and that this manuscript is now ready for publication.

Decision: Looking inside the black box – hybrid epidemiology approaches to identify causal inferences — R1/PR8

Comments

No accompanying comment.