Hostname: page-component-77f85d65b8-jkvpf Total loading time: 0 Render date: 2026-03-27T01:39:05.926Z Has data issue: false hasContentIssue false

Exploring the methodological quality and risk of bias in 200 systematic reviews: A comparative study of ROBIS and AMSTAR-2 tools

Published online by Cambridge University Press:  27 October 2025

Carole Lunny*
Affiliation:
Knowledge Translation Program, St. Michael’s Hospital, Unity Health Toronto, Toronto, ON, Canada Anesthesiology, Pharmacology and Therapeutics, The University of British Columbia, Vancouver, BC, Canada Precision AQ, Vancouver, BC, Canada
Nityanand Jain
Affiliation:
Independent Statistical Consultant, Chandigarh, India
Tina Nazari
Affiliation:
Department of Medical Geriatrics, School of Medicine, Tehran University of Medical Sciences , Tehran, Iran
Melodi Kosaner-Kließ
Affiliation:
Independent Health Economics and Outcomes Research Specialist, Duisburg, Germany
Lucas Santos
Affiliation:
Masters and Doctoral Programs in Physical Therapy, Universidade Cidade de São Paulo, São Paulo, Brazil
Ian Goodman
Affiliation:
Deanery of Clinical Sciences, University of Edinburgh, Edinburgh, UK
Alaa A. M. Osman
Affiliation:
Department of Clinical Pharmacy and Pharmacy Practice, Faculty of Pharmacy, University of Gezira, Wad Madani, Sudan Department of Pharmacodynamics and Biopharmacy, Faculty of Pharmacy, University of Szeged, Szeged, Hungary
Stefano Berrone
Affiliation:
Division of Dietetics, Nutrition and Biological Sciences, Physiotherapy, Podiatry and Radiography, School of Health Sciences, Queen Margaret University , Musselburgh, UK
Mohammad Najm Dadam
Affiliation:
Department of Orthopedics and Trauma Surgery, Helios Klinikum Schwelm, Schwelm, Germany
Connor T. A. Brenna
Affiliation:
Department of Anesthesiology & Pain Medicine, University of Toronto, Toronto, ON, Canada Department of Physiology, University of Toronto, Toronto, ON, Canada
Heba Hussein
Affiliation:
Department of Oral Medicine and Periodontology, Faculty of Dentistry, Cairo University , Cairo, Egypt
Gioia Dahdal
Affiliation:
Department of Translational Medicine, University of Ferrara , Ferrara, Italy
Diana Cespedes A.
Affiliation:
Department of Thoracic Surgery, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China
Nicola Ferri
Affiliation:
Department of Biomedical and Neuromotor Sciences (DIBINEM), Alma Mater Studiorum, University of Bologna, Bologna, Italy
Salmaan Kanji
Affiliation:
The Ottawa Hospital and Ottawa Health Research Institute, Ottawa, ON, Canada
Yuan Chi
Affiliation:
Beijing Health Technology Co., Ltd, Beijing, China Department of Health Research Methods, Evidence, & Impact, McMaster University , Hamilton, ON, Canada
Dawid Pieper
Affiliation:
Institute for Health Services and Health System Research, Faculty of Health Sciences Brandenburg, Brandenburg Medical School, Brandenburg, Germany Center for Health Services Research Brandenburg, Brandenburg Medical School, Brandenburg, Germany
Beverly Shea
Affiliation:
The Ottawa Health Research Institute, University of Ottawa , Ottawa, ON, Canada
Amanda Parker
Affiliation:
Knowledge Translation Program, Li Ka Shing Knowledge Institute, St. Michael’s Hospital, Toronto, ON, Canada
Dipika Neupane
Affiliation:
Knowledge Translation Program, Li Ka Shing Knowledge Institute, St. Michael’s Hospital, Toronto, ON, Canada
Paul A. Khan
Affiliation:
Knowledge Translation Program, Li Ka Shing Knowledge Institute, St. Michael’s Hospital, Toronto, ON, Canada
Daniella Rangira
Affiliation:
Li Ka Shing Knowledge Institute, St. Michael’s Hospital, Unity Health Toronto, Toronto, ON, Canada Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada
Kat Kolaski
Affiliation:
Departments of Orthopaedic Surgery and Rehabilitation, Neurology, Pediatrics, and Epidemiology and Prevention, Wake Forest University School of Medicine, Winston Salem, NC, USA
Ben Ridley
Affiliation:
IRCCS Istituto delle Scienze Neurologiche di Bologna, Bologna, Italy
Amina Berour
Affiliation:
University of Mouloud Maameri, Tizi Ouzou, Algeria.
Kevin Sun
Affiliation:
Department of Mechanical & Industrial Engineering, University of Toronto, Toronto, ON, Canada
Radin Hamidi Rad
Affiliation:
Faculty of Information, University of Toronto, Toronto, ON, Canada
Zihui Ouyang
Affiliation:
Department of Statistics, University of British Columbia, Vancouver, BC, Canada
Emma K. Reid
Affiliation:
Department of Pharmacy, Nova Scotia Health Authority, Halifax, NS, Canada
Iván Pérez-Neri
Affiliation:
Evidence Synthesis Unit, National Institute of Rehabilitation Luis Guillermo Ibarra Ibarra , Mexico City, Mexico
Sanabel O. Barakat
Affiliation:
Faculty of Dentistry, Zarqa University , Zarqa, Jordan
Silvia Bargeri
Affiliation:
Unit of Clinical Epidemiology, IRCCS Istituto Ortopedico Galeazzi, Milan, Italy
Silvia Gianola
Affiliation:
Unit of Clinical Epidemiology, IRCCS Istituto Ortopedico Galeazzi, Milan, Italy
Greta Castellini
Affiliation:
Unit of Clinical Epidemiology, IRCCS Istituto Ortopedico Galeazzi, Milan, Italy
Sera Whitelaw
Affiliation:
Faculty of Medicine and Health Sciences, McGill University , Montreal, QC, Canada
Adrienne Stevens
Affiliation:
Centre for Immunization Programs, Infectious Diseases and Vaccine Programs Branch, Public Health Agency of Canada, Ottawa, ON, Canada
Shailesh B. Kolekar
Affiliation:
Consultant Adult Respiratory Medicine, Zealand University Roskilde Hospital, Roskilde, Denmark Department of Clinical Medicine, Clinical Care, Copenhagen university Hospital , Copenhagen, Denmark Clinical Care Group 1.01, European Respiratory Society, Lausanne, Switzerland
Kristy Wong
Affiliation:
School of Pharmacy, University of Waterloo , Waterloo, ON, Canada
Paityn Major
Affiliation:
School of Nursing, McMaster University , Hamilton, ON, Canada
Ebrahim Bagheri
Affiliation:
Faculty of Information, University of Toronto, Toronto, ON, Canada
Andrea C. Tricco
Affiliation:
Li Ka Shing Knowledge Institute, St. Michael’s Hospital, Unity Health Toronto, Toronto, ON, Canada Epidemiology Division and Institute of Health Policy, Management, and Evaluation, Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada Queen’s Collaboration for Health Care Quality: A JBI Centre of Excellence, Queen’s University, Kingston, ON, Canada
*
Corresponding author: Carole Lunny; Email: carolelunny@gmail.com
Rights & Permissions [Opens in a new window]

Abstract

AMSTAR-2 (A Measurement Tool to Assess Systematic Reviews, version 2) and ROBIS are tools used to assess the methodological quality and the risk of bias in a systematic review (SR). We applied AMSTAR-2 and ROBIS to a sample of 200 published SRs. We investigated the overlap in their methodological constructs, responses by item, and overall, percentage agreement, direction of effect, and timing of assessments. AMSTAR-2 contains 16 items and ROBIS 24 items. Three items in AMSTAR-2 and nine in ROBIS did not overlap in construct. Of the 200 SRs, 73% were low or critically low quality using AMSTAR-2, and 81% had a high risk of bias using ROBIS. The median time to complete AMSTAR-2 and ROBIS was 51 and 64 minutes, respectively. When assessment times were calibrated to the number of items in each tool, each item took an average of 3.2 minutes per item for AMSTAR-2 compared to 2.7 minutes for ROBIS. Nine percent of SRs had opposing ratings (i.e., AMSTAR-2 was high quality while ROBIS was high risk). In both tools, three-quarters of items showed more than 70% agreement between raters after extensive training and piloting. AMSTAR-2 and ROBIS provide complementary rather than interchangeable assessments of systematic reviews. AMSTAR-2 may be preferable when efficiency is prioritized and methodological rigour is the focus, whereas ROBIS offers a deeper examination of potential biases and external validity. Given the widespread reliance on systematic reviews for policy and practice, selecting the appropriate appraisal tool remains crucial. Future research should explore strategies to integrate the strengths of both instruments while minimizing the burden on assessors.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of The Society for Research Synthesis Methodology
Figure 0

Table 1 Study characteristics of included systematic reviews (n = 200)

Figure 1

Table 2 Mapping AMSTAR-2 and ROBIS items

Figure 2

Figure 1 Circular bar plot showing the proportion of ROBIS items assessed positively (‘Yes’ or ‘Probably Yes’) across Cochrane and non-Cochrane systematic reviews (SRs), stratified by with or without meta-analysis (n = 200). Navy blue colour represents Cochrane SRs with MA; Teal colour represents Cochrane SRs without MA; Fusia colour represents non-Cochrane SRs with MA; and Pink respresents non-Cochrane SRs without MA. Bar height reflects the percentage of ROBIS items assessed positively (0–100% scale). ROBIS item description is provided in Table 2. Abbreviations: MA, meta-analysis; SR, systematic review.

Figure 3

Figure 2 Circular bar plot showing the proportion of AMSTAR-2 items assessed positively (‘Yes’ or ‘Probably Yes’) across Cochrane and non-Cochrane systematic reviews (SRs), stratified by with or without meta-analysis (n = 200). Navy blue colour represents Cochrane SRs with MA; Teal colour represents Cochrane SRs without MA; Fusia colour represents non-Cochrane SRs with MA; and Pink respresents non-Cochrane SRs without MA. Bar height reflects the percentage of AMSTAR-2 items assessed positively (0–100% scale). AMSTAR-2 item description is provided in Table 2. Abbreviations: MA, meta-analysis; SR, systematic review.

Figure 4

Table 3 Percentage agreement of ROBIS assessments

Figure 5

Figure 3 Total assessment time, in minutes, for (a) AMSTAR-2 when AMSTAR-2 is applied first; and (b) for ROBIS when ROBIS is applied first, depending on the number of previous assessments performed. Each dot represents an individual’s timing, with the median (dashed blue line) and its 95% confidence interval (blue shaded area), as well as the overall trendline (red line), and its standard error (red shaded area). The confidence interval for the median was calculated by bootstrapping for n = 1,000 samples.

Supplementary material: File

Lunny et al. supplementary material

Lunny et al. supplementary material
Download Lunny et al. supplementary material(File)
File 80.6 KB