Hostname: page-component-699b5d5946-lkkl4 Total loading time: 0 Render date: 2026-02-27T03:36:42.737Z Has data issue: false hasContentIssue false

Report tau or exp(tau) rather than tau-squared in random-effects meta-analyses

Published online by Cambridge University Press:  25 February 2026

Mark D. Chatfield*
Affiliation:
School of Public Health, The University of Queensland , Australia
Louise Marquart-Wilson
Affiliation:
School of Public Health, The University of Queensland , Australia
Annette Dobson
Affiliation:
School of Public Health, The University of Queensland , Australia
Daniel Farewell
Affiliation:
School of Medicine, Cardiff University , United Kingdom
*
Corresponding author: Mark D. Chatfield; Email: m.chatfield@uq.edu.au
Rights & Permissions [Opens in a new window]

Abstract

In random-effects meta-analysis, the between-study heterogeneity variance, $\tau ^2$, is often reported but is not easy to interpret. For meta-analyses of differences (such as mean differences, standardized mean differences, or risk differences), the standard deviation (SD), $\tau $, indicates the extent to which studies’ true effects vary about their average. For meta-analyses of (natural) log-transformed measures of effect (such as log risk ratios [RRs]), we explain how the geometric SD, $\exp (\tau )$, is helpful to understand how untransformed measures (such as RRs) vary multiplicatively about their average. We recommend that authors and software developers report $\tau $ for differences and $\exp (\tau )$ for ratios, rather than $\tau ^2$. This will facilitate the interpretation of the magnitude of heterogeneity values, for example, the interpretation of heterogeneity estimates and confidence intervals beyond simple binary statements about the presence or absence of heterogeneity.

Information

Type
Research-in-Brief
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2026. Published by Cambridge University Press on behalf of The Society for Research Synthesis Methodology

Highlights

What is already known?

In random-effects meta-analysis, the between-study heterogeneity variance, $\tau ^2$ , is often reported but is not easy to interpret. For meta-analyses of differences, the standard deviation (SD), $\tau $ , is helpful to understand the extent to which studies’ true differences vary about their average.

What is new?

For meta-analyses of ratios (such as odds ratios, risk ratios, etc.), the geometric SD, $\exp (\tau )$ , is helpful to understand the extent to which studies’ true ratios vary multiplicatively about their average.

Potential impact for RSM readers

We recommend that authors and software developers report $\tau $ for differences and $\exp (\tau )$ for ratios, rather than $\tau ^2$ . This will facilitate the interpretation of the magnitude of heterogeneity values, for example, the interpretation of heterogeneity estimates and confidence intervals beyond simple binary statements about the presence or absence of heterogeneity.

1 Introduction

In random-effects meta-analysis, the distribution of underlying true effect sizes is modeled. The extent of heterogeneity (i.e., how studies’ true effects vary about their average) is very important. For example, an estimate of heterogeneity can substantially influence the calculation of i) a confidence interval (CI) of the average effectReference Viechtbauer1 and ii) a prediction interval for the true effect of the next study.Reference Higgins, Thompson and Spiegelhalter2

Although heterogeneity values appear in various graphs, tables, and text,Reference Viechtbauer1 Reference Röver, Rindskopf and Friede4 they are often expressed in a way that does not facilitate understanding. For example, the heterogeneity variance ( $\tau ^2$ ) is reported more frequently than the standard deviation (SD, $\tau $ ).Reference Borenstein5 For meta-analyses of differences (such as mean differences, standardized mean differences [SMDs], or risk differences), $\tau $ is on the same scale and is easier to interpret.Reference Borenstein5 For meta-analyses of ratios (such as odds ratios, risk ratios [RRs], hazard ratios, incidence rate ratios, and ratios of means or response ratios), $\tau $ (the SD of log-transformed ratios) is on the logarithmic scaleReference Röver, Bender and Dias6 and is less obviously interpretable.

In this article, we explain how $\tau $ is helpful to understand the heterogeneity of differences and how $\exp (\tau )$ is helpful to understand the heterogeneity of ratios. We also explain how some values of $\tau $ itself can be meaningfully interpreted for ratios in an Appendix of the Supplementary Material. We recommend that authors and software developers replace the reporting of $\tau ^2$ with more accessible formulations of heterogeneity. This will facilitate the interpretation of the magnitude of heterogeneity values, for example, the interpretation of heterogeneity estimates and CIs beyond simple binary statements about the presence or absence of heterogeneity.

2 Random-effects models

2.1 Differences

We consider a meta-analysis model of difference measures of effect, with a focus on SMDs. Let $\theta _i$ denote the true SMD for study $i \ (i = 1, \ldots , k)$ . A random-effects model for SMDs can be described in terms of the true SMD $\theta _i$ , the observed SMD $\widehat {\theta }_i$ , and its standard error $\sigma _i$ . A popular modelReference Röver, Bender and Dias6 is

(1) $$ \begin{align} \widehat{\theta}_i &\sim \mathcal{N}(\theta_i, \sigma_i^2), \end{align} $$
(2) $$ \begin{align} \theta_i &\sim \mathcal{N}(\mu, \tau^2). \end{align} $$

The modeled distribution of true SMDs is a normal distribution with mean $\mu $ and SD $\tau $ . Therefore, the interval $[\mu - \tau , \mu + \tau ]$ covers approximately 68% (or 2/3) of the distribution and the interval $[\mu - 2\tau , \mu + 2\tau ]$ covers approximately 95% (or 19/20) of the distribution (as does the interval $[\mu - 1.96\tau , \mu + 1.96\tau ]$ ). We will refer to such intervals as 68% and 95% ranges.

2.2 Ratios

We now consider a meta-analysis model of ratio measures of effect, with a focus on RRs. Throughout, log denotes the natural logarithm.

Let $\alpha _i$ denote the true RR for study $i \ (i = 1, \ldots , k)$ . A random-effects model for RRs is typically described in terms of the true log RR ( $\theta _i = \log \alpha _i$ ), the observed log RR ( $\widehat {\theta }_i$ ), and its standard error ( $\sigma _i$ ) using (1) and (2).

We now focus on describing the modeled distribution of true RRs, $\alpha _i = \exp (\theta _i)$ . A lognormal distribution is implied, and the geometric mean (GM) and median of the distribution are $\exp (\mu )$ . A pooled or overall RR from a meta-analysis is an estimate of the $\mathrm {{GM}}$ . There are several ways to describe variation about the $\mathrm {{GM}}$ .Reference Kirkwood7, Reference Chatfield, Marquart-Wilson, Dobson and Farewell8 We explain the simplest way by using $\exp (\tau )$ below, and an alternative way using $\tau $ directly in the Appendix of the Supplementary Material.

The geometric SD of the modeled distribution of true RRs is $\exp (\tau )$ . It quantifies variation about the $\mathrm {{GM}}$ in a multiplicative manner.Reference Kirkwood7 Approximately 68% of the distribution lies in the interval $[\mathrm {LB}1, \mathrm {UB}1]$ , where

$$ \begin{align*} {\mathrm{{LB}}}1 &= \exp(\mu - \tau) = \exp(\mu) \times \exp(-\tau) = {\mathrm{{GM}}} / \exp(\tau),\\{\mathrm{UB}}1 &= \exp(\mu + \tau) = \exp(\mu) \times \exp(\tau) = {\mathrm{GM}} \times \exp(\tau). \end{align*} $$

Approximately 95% of the distribution lies in the interval $[{\mathrm {{LB}}}2, {\mathrm {{UB}}}2]$ , where

$$ \begin{align*} {\mathrm{{LB}}}2 &= \exp(\mu - 2\tau) = \exp(\mu) \times \exp(-2\tau) = {\mathrm{{GM}}} / \{ \exp(\tau) \}^2,\\{\mathrm{{UB}}}2 &= \exp(\mu + 2\tau) = \exp(\mu) \times \exp(2\tau) = {\mathrm{{GM}}} \times \{ \exp(\tau) \}^2. \end{align*} $$

2.3 Prediction intervals

Reporting a prediction interval for the true effect of the next studyReference Higgins, Thompson and Spiegelhalter2, Reference Mátrai, Kói, Sipos and Farkas9 is recommended by many.Reference Borenstein5, Reference Higgins, Thomas and Chandler10, Reference IntHout, Ioannidis, Rovers and Goeman11 Most software packages will calculate and display a prediction interval on a forest plot.

Assuming that

(3) $$ \begin{align} \theta_{k+1} &\sim \mathcal{N}(\mu, \tau^2), \end{align} $$
(4) $$ \begin{align} \hat{\mu} &\sim \mathcal{N}(\mu, {\mathrm{{SE}}}(\hat{\mu})^2), \end{align} $$
(5) $$ \begin{align} \theta_{k+1} - \hat{\mu} &\sim \mathcal{N}(0, \tau^2 + {\mathrm{{SE}}}(\hat{\mu})^2). \end{align} $$

Higgins et al.Reference Higgins, Thompson and Spiegelhalter2 proposed an approximate 95% prediction interval for $\theta _{k+1}$ is

$$\begin{align*}\hat{\mu} \pm t_{k-2} \sqrt{\left\{ \hat{\tau}^2 + \widehat{SE}(\hat{\mu})^2 \right\}}, \end{align*}$$

where $t_{k-2}$ is the 0.975 quantile of the t-distribution with $k-2$ degrees of freedom. A 95% prediction interval calculated this way will be similar to the 95% range when $\hat {\mu } \approx \mu $ , $\hat {\tau } \approx \tau $ , the number of studies in a meta-analysis is not small, and $\widehat {SE}(\hat {\mu })^2 << \widehat {\tau }^2$ .

However, a prediction interval will not convey the (often considerable) uncertainty in the estimate of $\tau $ . Therefore, a 95% CI for $\tau $ provides valuable information in addition to a prediction interval.Reference Higgins, Thompson and Spiegelhalter2

3 Examples

3.1 Differences

Roberts et al.Reference Higgins, Thompson and Spiegelhalter2, Reference Roberts, Tchanturia, Stahl, Southgate and Treasure12 performed meta-analysis on 14 studies comparing the time to complete a trail making task between people with eating disorders and healthy controls. They calculated SMDs (Cohen’s d) and considered these effect sizes negligible if $\geq -0.15$ and $<0.15$ , small if $\geq 0.15$ and $<0.40$ , medium if $\geq 0.40$ and $<0.75$ , large if $\geq 0.75$ and $<1.10$ , very large if $\geq 1.10$ and $<1.45,$ and huge if $\geq 1.45$ . We performed a random-effects meta-analysis on the data and produced a forest plot showing an estimate and 95% CI for $\tau $ (Figure 1).

Figure 1 Random-effects meta-analysis comparing the time to complete a trail making task in people with eating disorders and healthy controls.Reference Higgins, Thompson and Spiegelhalter2, Reference Roberts, Tchanturia, Stahl, Southgate and Treasure12 DerSimonian and Laird estimator of $\tau $ used. Figure produced using the R package meta.

In this example, the mean [95% CI] of the modeled distribution of true SMDs was estimated to be $\widehat {\mu }=0.36 \ [0.19, 0.53]$ . The estimated SD of that distribution was $\widehat {\tau }=0.15$ , which corresponds to a 68% range of $[\mu - 0.15, \mu + 0.15]$ and a 95% range of $[\mu - 0.30, \mu + 0.30]$ when a normal distribution is assumed. For example, if $\mu $ was 0.35, then the 68% range would be $[0.2,0.5]$ and the 95% range would be $[0.05,0.65]$ . We view this as a substantial amount of heterogeneity in this context. The 95% CIReference Viechtbauer1 for $\tau $ was [0, 0.49], indicating that a degenerate distribution (homogeneity) is possible, as is a distribution with a huge SD (if $\tau =0.49,$ then the 95% range is $[\mu - 0.98, \mu + 0.98]$ ). It is clear that there is considerable uncertainty in the SD of this distribution. These interpretations are readily apparent because an estimate and CI for $\tau $ were reported. This provides a more informative and nuanced understanding than a binary statement, such as “heterogeneity was present ( $\widehat {\tau }>0$ )” or “no evidence of heterogeneity was found ( $p=0.21$ ).”

3.2 Ratios

The bacille Calmette–Guérin (BCG) vaccine is used to prevent tuberculosis. Colditz et al.Reference Colditz, Brewer and Berkey13 performed a meta-analysis on the efficacy of the vaccine using RRs from 13 randomized trials. We performed a random-effects meta-analysis on the data and produced a forest plot showing an estimate and 95% CI for $\exp (\tau )$ (Figure 2).

Figure 2 Random-effects meta-analysis comparing the risk of tuberculosis (TB) between vaccine and control groups.Reference Colditz, Brewer and Berkey13 REML estimator of $\tau $ used. Figure produced using the R package meta with some manual editing.

The GM [95% CI] of the modeled distribution of true RRs was estimated to be $\widehat {{\mathrm {{GM}}}} = 0.49 \ [0.34, 0.70]$ . The estimated geometric SD was $\exp (\widehat {\tau })=1.75$ , which corresponds to a 68% range of $[{\mathrm {{GM}}} / 1.75, {\mathrm {{GM}}} \times 1.75]$ and a 95% range of $[{\mathrm {{GM}}} / 1.75^2, {\mathrm {{GM}}} \times 1.75^2] = [{\mathrm {{GM}}} / 3.06, {\mathrm {{GM}}} \times 3.06]$ when a lognormal distribution is assumed. For example, if the $\mathrm {{GM}}$ was 0.5, then the 68% range would be $[0.29,0.88]$ and the 95% range would be $[0.16,1.53]$ . We interpret this as considerable heterogeneity in the true RRs between trials. Repeating the process with the lower bound of the 95% CIReference Viechtbauer1 for $\exp (\tau )$ (i.e., 1.41), if the $\mathrm {{GM}}$ was 0.5, then the 68% range would be $[0.35,0.71]$ and the 95% range would be $[0.25,0.99]$ . These calculations are straightforward because an estimate and CI of $\exp (\tau )$ were reported. Clearly, there is much heterogeneity here. This is more informative than a binary statement, such as “heterogeneity was present ( $\widehat {\tau }>0$ )” or “evidence of heterogeneity was found ( $p<0.01$ ).”

4 Conclusion

For meta-analyses of differences, we recommend reporting $\tau $ rather than $\tau ^2$ . For meta-analyses of ratios, we recommend reporting $\exp (\tau )$ rather than $\tau ^2$ . This will facilitate the interpretation of the magnitude of heterogeneity estimates. Similarly, reporting CIs or credible intervals for $\tau $ or $\exp (\tau )$ will be more helpful than following the current recommendation to report intervals for $\tau ^2$ .Reference Viechtbauer1 Reference Veroniki, Jackson and Viechtbauer3

Author contributions

Conceptualization: M.D.C.; Formal analysis: M.D.C.; Supervision: L.M.-W., A.D., and D.F.; Writing—original draft: M.D.C.; Writing—review and editing: M.D.C., L.M.-W., A.D., and D.F.

Competing interest statement

The authors declare that no competing interests exist.

Data availability statement

Previously published summary data are provided in the figures and in an Excel file.

Funding statement

The authors declare that no specific funding has been received for this article.

Supplementary material

The supplementary material for this article can be found at https://doi.org/10.1017/rsm.2026.10075.

References

Viechtbauer, W. Confidence intervals for the amount of heterogeneity in meta-analysis. Stat Med. 2007;26(1):3752. https://doi.org/10.1002/sim.2514.CrossRefGoogle ScholarPubMed
Higgins, JPT, Thompson, SG, Spiegelhalter, DJ. A re-evaluation of random-effects meta-analysis. J Royal Stat Soc Ser A Stat Soc. 2009;172(1):137159. https://doi.org/10.1111/j.1467-985X.2008.00552.x.CrossRefGoogle ScholarPubMed
Veroniki, AA, Jackson, D, Viechtbauer, W, et al. Methods to estimate the between-study variance and its uncertainty in meta-analysis. Res Synth Methods. 2016;7(1):5579. https://doi.org/10.1002/jrsm.1164.CrossRefGoogle ScholarPubMed
Röver, C, Rindskopf, D, Friede, T. How trace plots help interpret meta-analysis results. Res Synth Methods. 2024;15(3):413429. https://doi.org/10.1002/jrsm.1693.CrossRefGoogle ScholarPubMed
Borenstein, M. Avoiding common mistakes in meta-analysis: understanding the distinct roles of Q, I-squared, tau-squared, and the prediction interval in reporting heterogeneity. Res Synth Methods. 2024;15(2):354368. https://doi.org/10.1002/jrsm.1678.CrossRefGoogle ScholarPubMed
Röver, C, Bender, R, Dias, S, et al. On weakly informative prior distributions for the heterogeneity parameter in Bayesian random-effects meta-analysis. Res Synth Methods. 2021;12(4):448474. https://doi.org/10.1002/jrsm.1475.CrossRefGoogle ScholarPubMed
Kirkwood, TBL. Geometric means and measures of dispersion. Biometrics. 1979;35(4):908909.Google Scholar
Chatfield, MD, Marquart-Wilson, L, Dobson, AJ, Farewell, DM. Mean relative error and standard relative deviation. Statistica Neerlandica. 2025;79(1):e70001. https://doi.org/10.1111/stan.70001.CrossRefGoogle Scholar
Mátrai, P, Kói, T, Sipos, Z, Farkas, N. Assessing the properties of the prediction interval in random-effects meta-analysis. Res Synth Methods. 2026;121. https://doi.org/10.1017/rsm.2025.10055.CrossRefGoogle Scholar
Higgins, JPT, Thomas, J, Chandler, J, et al. Cochrane Handbook for Systematic Reviews of Interventions. version 6.5 (updated August 2024). Cochrane; 2024.Google Scholar
IntHout, J, Ioannidis, JPA, Rovers, MM, Goeman, JJ. Plea for routinely presenting prediction intervals in meta-analysis. BMJ Open. 2016;6:e010247. https://doi.org/10.1136/bmjopen-2015-010247.CrossRefGoogle ScholarPubMed
Roberts, ME, Tchanturia, K, Stahl, D, Southgate, L, Treasure, J. A systematic review and meta-analysis of set-shifting ability in eating disorders. Psychol Med. 2007;37(8):10751084. https://doi.org/10.1017/S0033291707009877.CrossRefGoogle ScholarPubMed
Colditz, GA, Brewer, TF, Berkey, CS, et al. Efficacy of BCG vaccine in the prevention of tuberculosis: meta-analysis of the published literature. JAMA. 1994;271(9):698702. https://doi.org/10.1001/jama.1994.03510330076038.CrossRefGoogle ScholarPubMed
Figure 0

Figure 1 Random-effects meta-analysis comparing the time to complete a trail making task in people with eating disorders and healthy controls.2,12 DerSimonian and Laird estimator of $\tau $ used. Figure produced using the R package meta.

Figure 1

Figure 2 Random-effects meta-analysis comparing the risk of tuberculosis (TB) between vaccine and control groups.13 REML estimator of $\tau $ used. Figure produced using the R package meta with some manual editing.

Supplementary material: File

Chatfield et al. supplementary material

Chatfield et al. supplementary material
Download Chatfield et al. supplementary material(File)
File 35.1 KB