Hostname: page-component-77f85d65b8-45ctf Total loading time: 0 Render date: 2026-03-29T08:11:10.380Z Has data issue: false hasContentIssue false

Calibrate your confidence in research findings: A tutorial on improving research methods and practices

Published online by Cambridge University Press:  24 April 2020

Aline da Silva Frost*
Affiliation:
Department of Psychology, University of California, Davis, California, USA
Alison Ledgerwood
Affiliation:
Department of Psychology, University of California, Davis, California, USA
*
Author for correspondence: Aline da Silva Frost, Email: asfrost@ucdavis.edu

Abstract

This article provides an accessible tutorial with concrete guidance for how to start improving research methods and practices in your lab. Following recent calls to improve research methods and practices within and beyond the borders of psychological science, resources have proliferated across book chapters, journal articles, and online media. Many researchers are interested in learning more about cutting-edge methods and practices but are unsure where to begin. In this tutorial, we describe specific tools that help researchers calibrate their confidence in a given set of findings. In Part I, we describe strategies for assessing the likely statistical power of a study, including when and how to conduct different types of power calculations, how to estimate effect sizes, and how to think about power for detecting interactions. In Part II, we provide strategies for assessing the likely type I error rate of a study, including distinguishing clearly between data-independent (“confirmatory”) and data-dependent (“exploratory”) analyses and thinking carefully about different forms and functions of preregistration.

Information

Type
Original Article
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - NCCreative Common License - ND
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives licence (http://creativecommons.org/licenses/by-nc-nd/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is unaltered and is properly cited. The written permission of Cambridge University Press must be obtained for commercial re-use or in order to create a derivative work.
Copyright
© The Author(s) 2020
Figure 0

Figure 1. Consider the case of a researcher testing 10 true effects and 10 false effects. Perhaps they will follow up or publish significant results but leave nonsignificant results in a file drawer. The statistical power and type I error rate of the studies will determine how the effects are sorted into a set of significant results (follow up!) and a set of nonsignificant results (file drawer). Notice that because power is higher in the scenario on the right (vs. left), the likelihood that any one of the significant findings reflects a true effect in the population is also higher.

Figure 1

Figure 2. Sample sizes needed to achieve 80% power in a two-condition, between-subjects study. This figure helps you organize visually the effect size intuitions.

Figure 2

Table 1. Rules of thumb for powering a 2 × 2 between-subjects Study 2 that seeks to moderate a main effect observed in Study 1

Figure 3

Table 2. Alpha thresholds for sequential analyses

Figure 4

Table 3. Common decisions to specify in a preanalysis plan

Figure 5

Table 4. Different definitions of preregistration and their intended purpose

Figure 6

Table 5. Summary of recommendations