Hostname: page-component-77f85d65b8-6c7dr Total loading time: 0 Render date: 2026-04-17T17:53:15.754Z Has data issue: false hasContentIssue false

SyROCCo: enhancing systematic reviews using machine learning

Published online by Cambridge University Press:  14 October 2024

Zheng Fang
Affiliation:
Department of Computer Science, University of Warwick and Alan Turing Institute for Data Science and AI, Coventry, UK
Miguel Arana-Catania
Affiliation:
School of Aerospace, Transport and Manufacturing, Cranfield University, Cranfield, UK
Felix-Anselm van Lier
Affiliation:
Government Outcomes Lab, Blavatnik School of Government, University of Oxford, Oxford, UK
Juliana Outes Velarde
Affiliation:
Government Outcomes Lab, Blavatnik School of Government, University of Oxford, Oxford, UK
Harry Bregazzi
Affiliation:
Government Outcomes Lab, Blavatnik School of Government, University of Oxford, Oxford, UK
Mara Airoldi
Affiliation:
Government Outcomes Lab, Blavatnik School of Government, University of Oxford, Oxford, UK
Eleanor Carter
Affiliation:
Government Outcomes Lab, Blavatnik School of Government, University of Oxford, Oxford, UK
Rob Procter*
Affiliation:
Department of Computer Science, University of Warwick and Alan Turing Institute for Data Science and AI, Coventry, UK
*
Corresponding author: Rob Procter; Email: rob.procter@warwick.ac.uk

Abstract

The sheer number of research outputs published every year makes systematic reviewing increasingly time- and resource-intensive. This paper explores the use of machine learning techniques to help navigate the systematic review process. Machine learning has previously been used to reliably “screen” articles for review – that is, identify relevant articles based on reviewers’ inclusion criteria. The application of machine learning techniques to subsequent stages of a review, however, such as data extraction and evidence mapping, is in its infancy. We, therefore, set out to develop a series of tools that would assist in the profiling and analysis of 1952 publications on the theme of “outcomes-based contracting.” Tools were developed for the following tasks: assigning publications into “policy area” categories; identifying and extracting key information for evidence mapping, such as organizations, laws, and geographical information; connecting the evidence base to an existing dataset on the same topic; and identifying subgroups of articles that may share thematic content. An interactive tool using these techniques and a public dataset with their outputs have been released. Our results demonstrate the utility of machine learning techniques to enhance evidence accessibility and analysis within the systematic review processes. These efforts show promise in potentially yielding substantial efficiencies for future systematic reviewing and for broadening their analytical scope. Beyond this, our work suggests that there may be implications for the ease with which policymakers and practitioners can access evidence. While machine learning techniques seem poised to play a significant role in bridging the gap between research and policy by offering innovative ways of gathering, accessing, and analyzing data from systematic reviews, we also highlight their current limitations and the need to exercise caution in their application, particularly given the potential for errors and biases.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press
Figure 0

Figure 1. Description of the GO Lab systematic review process and application of NLP methods. The symbol * indicates the point at which ML tools are conventionally applied within the systematic review process.

Figure 1

Figure 2. SyROCCo has three panels: visualization panel (left), information panel (top right), and article list panel (bottom right).

Figure 2

Figure 3. Three different types of interactive visualizations. World map (left). Histograms (center) and force-directed graph (right).

Figure 3

Figure 4. Information panel showing overview information about articles that meet selected criteria.

Figure 4

Figure 5. Information panel showing detailed information about a selected article.

Submit a response

Comments

No Comments have been published for this article.