Report tau or exp(tau) rather than tau-squared in random-effects meta-analyses

Mark D. Chatfield; Louise Marquart-Wilson; Annette Dobson; Daniel Farewell

doi:10.1017/rsm.2026.10075

Report tau or exp(tau) rather than tau-squared in random-effects meta-analyses

Published online by Cambridge University Press: 25 February 2026

Mark D. Chatfield

Louise Marquart-Wilson ,

Annette Dobson and

Daniel Farewell

Show author details

Mark D. Chatfield*: Affiliation:
School of Public Health, The University of Queensland , Australia
Louise Marquart-Wilson: Affiliation:
School of Public Health, The University of Queensland , Australia
Annette Dobson: Affiliation:
School of Public Health, The University of Queensland , Australia
Daniel Farewell: Affiliation:
School of Medicine, Cardiff University , United Kingdom
*: Corresponding author: Mark D. Chatfield; Email: m.chatfield@uq.edu.au

Article contents

Abstract
Highlights
Introduction
Random-effects models
Examples
Conclusion
Author contributions
Competing interest statement
Data availability statement
Funding statement
References

Rights & Permissions

Abstract

In random-effects meta-analysis, the between-study heterogeneity variance, $\tau ^2$, is often reported but is not easy to interpret. For meta-analyses of differences (such as mean differences, standardized mean differences, or risk differences), the standard deviation (SD), $\tau $, indicates the extent to which studies’ true effects vary about their average. For meta-analyses of (natural) log-transformed measures of effect (such as log risk ratios [RRs]), we explain how the geometric SD, $\exp (\tau )$, is helpful to understand how untransformed measures (such as RRs) vary multiplicatively about their average. We recommend that authors and software developers report $\tau $ for differences and $\exp (\tau )$ for ratios, rather than $\tau ^2$. This will facilitate the interpretation of the magnitude of heterogeneity values, for example, the interpretation of heterogeneity estimates and confidence intervals beyond simple binary statements about the presence or absence of heterogeneity.

Keywords

geometric mean geometric standard deviation heterogeneity lognormal

Information

Type: Research-in-Brief
Information: Research Synthesis Methods , First View , pp. 1 - 5

DOI: https://doi.org/10.1017/rsm.2026.10075 [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright: © The Author(s), 2026. Published by Cambridge University Press on behalf of The Society for Research Synthesis Methodology

Highlights

What is already known?

In random-effects meta-analysis, the between-study heterogeneity variance, $\tau ^2$ , is often reported but is not easy to interpret. For meta-analyses of differences, the standard deviation (SD), $\tau $ , is helpful to understand the extent to which studies’ true differences vary about their average.

What is new?

For meta-analyses of ratios (such as odds ratios, risk ratios, etc.), the geometric SD, $\exp (\tau )$ , is helpful to understand the extent to which studies’ true ratios vary multiplicatively about their average.

Potential impact for RSM readers

We recommend that authors and software developers report $\tau $ for differences and $\exp (\tau )$ for ratios, rather than $\tau ^2$ . This will facilitate the interpretation of the magnitude of heterogeneity values, for example, the interpretation of heterogeneity estimates and confidence intervals beyond simple binary statements about the presence or absence of heterogeneity.

1 Introduction

In random-effects meta-analysis, the distribution of underlying true effect sizes is modeled. The extent of heterogeneity (i.e., how studies’ true effects vary about their average) is very important. For example, an estimate of heterogeneity can substantially influence the calculation of i) a confidence interval (CI) of the average effectReference Viechtbauer¹ and ii) a prediction interval for the true effect of the next study.Reference Higgins, Thompson and Spiegelhalter²

Although heterogeneity values appear in various graphs, tables, and text,Reference Viechtbauer¹^– Reference Röver, Rindskopf and Friede⁴ they are often expressed in a way that does not facilitate understanding. For example, the heterogeneity variance ( $\tau ^2$ ) is reported more frequently than the standard deviation (SD, $\tau $ ).Reference Borenstein⁵ For meta-analyses of differences (such as mean differences, standardized mean differences [SMDs], or risk differences), $\tau $ is on the same scale and is easier to interpret.Reference Borenstein⁵ For meta-analyses of ratios (such as odds ratios, risk ratios [RRs], hazard ratios, incidence rate ratios, and ratios of means or response ratios), $\tau $ (the SD of log-transformed ratios) is on the logarithmic scaleReference Röver, Bender and Dias⁶ and is less obviously interpretable.

In this article, we explain how $\tau $ is helpful to understand the heterogeneity of differences and how $\exp (\tau )$ is helpful to understand the heterogeneity of ratios. We also explain how some values of $\tau $ itself can be meaningfully interpreted for ratios in an Appendix of the Supplementary Material. We recommend that authors and software developers replace the reporting of $\tau ^2$ with more accessible formulations of heterogeneity. This will facilitate the interpretation of the magnitude of heterogeneity values, for example, the interpretation of heterogeneity estimates and CIs beyond simple binary statements about the presence or absence of heterogeneity.

2 Random-effects models

2.1 Differences

We consider a meta-analysis model of difference measures of effect, with a focus on SMDs. Let $\theta _i$ denote the true SMD for study $i \ (i = 1, \ldots , k)$ . A random-effects model for SMDs can be described in terms of the true SMD $\theta _i$ , the observed SMD $\widehat {\theta }_i$ , and its standard error $\sigma _i$ . A popular modelReference Röver, Bender and Dias⁶ is

(1)

$$ \begin{align} \widehat{\theta}_i &\sim \mathcal{N}(\theta_i, \sigma_i^2), \end{align} $$

(2)

$$ \begin{align} \theta_i &\sim \mathcal{N}(\mu, \tau^2). \end{align} $$

The modeled distribution of true SMDs is a normal distribution with mean $\mu $ and SD $\tau $ . Therefore, the interval $[\mu - \tau , \mu + \tau ]$ covers approximately 68% (or 2/3) of the distribution and the interval $[\mu - 2\tau , \mu + 2\tau ]$ covers approximately 95% (or 19/20) of the distribution (as does the interval $[\mu - 1.96\tau , \mu + 1.96\tau ]$ ). We will refer to such intervals as 68% and 95% ranges.

2.2 Ratios

We now consider a meta-analysis model of ratio measures of effect, with a focus on RRs. Throughout, log denotes the natural logarithm.

Let $\alpha _i$ denote the true RR for study $i \ (i = 1, \ldots , k)$ . A random-effects model for RRs is typically described in terms of the true log RR ( $\theta _i = \log \alpha _i$ ), the observed log RR ( $\widehat {\theta }_i$ ), and its standard error ( $\sigma _i$ ) using (1) and (2).

We now focus on describing the modeled distribution of true RRs, $\alpha _i = \exp (\theta _i)$ . A lognormal distribution is implied, and the geometric mean (GM) and median of the distribution are $\exp (\mu )$ . A pooled or overall RR from a meta-analysis is an estimate of the $\mathrm {{GM}}$ . There are several ways to describe variation about the $\mathrm {{GM}}$ .Reference Kirkwood⁷^, Reference Chatfield, Marquart-Wilson, Dobson and Farewell⁸ We explain the simplest way by using $\exp (\tau )$ below, and an alternative way using $\tau $ directly in the Appendix of the Supplementary Material.

The geometric SD of the modeled distribution of true RRs is $\exp (\tau )$ . It quantifies variation about the $\mathrm {{GM}}$ in a multiplicative manner.Reference Kirkwood⁷ Approximately 68% of the distribution lies in the interval $[\mathrm {LB}1, \mathrm {UB}1]$ , where

$$ \begin{align*} {\mathrm{{LB}}}1 &= \exp(\mu - \tau) = \exp(\mu) \times \exp(-\tau) = {\mathrm{{GM}}} / \exp(\tau),\\{\mathrm{UB}}1 &= \exp(\mu + \tau) = \exp(\mu) \times \exp(\tau) = {\mathrm{GM}} \times \exp(\tau). \end{align*} $$

Approximately 95% of the distribution lies in the interval $[{\mathrm {{LB}}}2, {\mathrm {{UB}}}2]$ , where

$$ \begin{align*} {\mathrm{{LB}}}2 &= \exp(\mu - 2\tau) = \exp(\mu) \times \exp(-2\tau) = {\mathrm{{GM}}} / \{ \exp(\tau) \}^2,\\{\mathrm{{UB}}}2 &= \exp(\mu + 2\tau) = \exp(\mu) \times \exp(2\tau) = {\mathrm{{GM}}} \times \{ \exp(\tau) \}^2. \end{align*} $$

2.3 Prediction intervals

Reporting a prediction interval for the true effect of the next studyReference Higgins, Thompson and Spiegelhalter²^, Reference Mátrai, Kói, Sipos and Farkas⁹ is recommended by many.Reference Borenstein⁵^, Reference Higgins, Thomas and Chandler¹⁰^, Reference IntHout, Ioannidis, Rovers and Goeman¹¹ Most software packages will calculate and display a prediction interval on a forest plot.

Assuming that

(3)

$$ \begin{align} \theta_{k+1} &\sim \mathcal{N}(\mu, \tau^2), \end{align} $$

(4)

$$ \begin{align} \hat{\mu} &\sim \mathcal{N}(\mu, {\mathrm{{SE}}}(\hat{\mu})^2), \end{align} $$

(5)

$$ \begin{align} \theta_{k+1} - \hat{\mu} &\sim \mathcal{N}(0, \tau^2 + {\mathrm{{SE}}}(\hat{\mu})^2). \end{align} $$

Higgins et al.Reference Higgins, Thompson and Spiegelhalter² proposed an approximate 95% prediction interval for $\theta _{k+1}$ is

$$\begin{align*}\hat{\mu} \pm t_{k-2} \sqrt{\left\{ \hat{\tau}^2 + \widehat{SE}(\hat{\mu})^2 \right\}}, \end{align*}$$

where $t_{k-2}$ is the 0.975 quantile of the t-distribution with $k-2$ degrees of freedom. A 95% prediction interval calculated this way will be similar to the 95% range when $\hat {\mu } \approx \mu $ , $\hat {\tau } \approx \tau $ , the number of studies in a meta-analysis is not small, and $\widehat {SE}(\hat {\mu })^2 << \widehat {\tau }^2$ .

However, a prediction interval will not convey the (often considerable) uncertainty in the estimate of $\tau $ . Therefore, a 95% CI for $\tau $ provides valuable information in addition to a prediction interval.Reference Higgins, Thompson and Spiegelhalter²

3 Examples

3.1 Differences

Roberts et al.Reference Higgins, Thompson and Spiegelhalter²^, Reference Roberts, Tchanturia, Stahl, Southgate and Treasure¹² performed meta-analysis on 14 studies comparing the time to complete a trail making task between people with eating disorders and healthy controls. They calculated SMDs (Cohen’s d) and considered these effect sizes negligible if $\geq -0.15$ and $<0.15$ , small if $\geq 0.15$ and $<0.40$ , medium if $\geq 0.40$ and $<0.75$ , large if $\geq 0.75$ and $<1.10$ , very large if $\geq 1.10$ and $<1.45,$ and huge if $\geq 1.45$ . We performed a random-effects meta-analysis on the data and produced a forest plot showing an estimate and 95% CI for $\tau $ (Figure 1).

Figure 1 Random-effects meta-analysis comparing the time to complete a trail making task in people with eating disorders and healthy controls.Reference Higgins, Thompson and Spiegelhalter²^, Reference Roberts, Tchanturia, Stahl, Southgate and Treasure¹² DerSimonian and Laird estimator of $\tau $ used. Figure produced using the R package meta.

In this example, the mean [95% CI] of the modeled distribution of true SMDs was estimated to be $\widehat {\mu }=0.36 \ [0.19, 0.53]$ . The estimated SD of that distribution was $\widehat {\tau }=0.15$ , which corresponds to a 68% range of $[\mu - 0.15, \mu + 0.15]$ and a 95% range of $[\mu - 0.30, \mu + 0.30]$ when a normal distribution is assumed. For example, if $\mu $ was 0.35, then the 68% range would be $[0.2,0.5]$ and the 95% range would be $[0.05,0.65]$ . We view this as a substantial amount of heterogeneity in this context. The 95% CIReference Viechtbauer¹ for $\tau $ was [0, 0.49], indicating that a degenerate distribution (homogeneity) is possible, as is a distribution with a huge SD (if $\tau =0.49,$ then the 95% range is $[\mu - 0.98, \mu + 0.98]$ ). It is clear that there is considerable uncertainty in the SD of this distribution. These interpretations are readily apparent because an estimate and CI for $\tau $ were reported. This provides a more informative and nuanced understanding than a binary statement, such as “heterogeneity was present ( $\widehat {\tau }>0$ )” or “no evidence of heterogeneity was found ( $p=0.21$ ).”

3.2 Ratios

The bacille Calmette–Guérin (BCG) vaccine is used to prevent tuberculosis. Colditz et al.Reference Colditz, Brewer and Berkey¹³ performed a meta-analysis on the efficacy of the vaccine using RRs from 13 randomized trials. We performed a random-effects meta-analysis on the data and produced a forest plot showing an estimate and 95% CI for $\exp (\tau )$ (Figure 2).

Figure 2 Random-effects meta-analysis comparing the risk of tuberculosis (TB) between vaccine and control groups.Reference Colditz, Brewer and Berkey¹³ REML estimator of $\tau $ used. Figure produced using the R package meta with some manual editing.

The GM [95% CI] of the modeled distribution of true RRs was estimated to be $\widehat {{\mathrm {{GM}}}} = 0.49 \ [0.34, 0.70]$ . The estimated geometric SD was $\exp (\widehat {\tau })=1.75$ , which corresponds to a 68% range of $[{\mathrm {{GM}}} / 1.75, {\mathrm {{GM}}} \times 1.75]$ and a 95% range of $[{\mathrm {{GM}}} / 1.75^2, {\mathrm {{GM}}} \times 1.75^2] = [{\mathrm {{GM}}} / 3.06, {\mathrm {{GM}}} \times 3.06]$ when a lognormal distribution is assumed. For example, if the $\mathrm {{GM}}$ was 0.5, then the 68% range would be $[0.29,0.88]$ and the 95% range would be $[0.16,1.53]$ . We interpret this as considerable heterogeneity in the true RRs between trials. Repeating the process with the lower bound of the 95% CIReference Viechtbauer¹ for $\exp (\tau )$ (i.e., 1.41), if the $\mathrm {{GM}}$ was 0.5, then the 68% range would be $[0.35,0.71]$ and the 95% range would be $[0.25,0.99]$ . These calculations are straightforward because an estimate and CI of $\exp (\tau )$ were reported. Clearly, there is much heterogeneity here. This is more informative than a binary statement, such as “heterogeneity was present ( $\widehat {\tau }>0$ )” or “evidence of heterogeneity was found ( $p<0.01$ ).”

4 Conclusion

For meta-analyses of differences, we recommend reporting $\tau $ rather than $\tau ^2$ . For meta-analyses of ratios, we recommend reporting $\exp (\tau )$ rather than $\tau ^2$ . This will facilitate the interpretation of the magnitude of heterogeneity estimates. Similarly, reporting CIs or credible intervals for $\tau $ or $\exp (\tau )$ will be more helpful than following the current recommendation to report intervals for $\tau ^2$ .Reference Viechtbauer¹^– Reference Veroniki, Jackson and Viechtbauer³

Author contributions

Conceptualization: M.D.C.; Formal analysis: M.D.C.; Supervision: L.M.-W., A.D., and D.F.; Writing—original draft: M.D.C.; Writing—review and editing: M.D.C., L.M.-W., A.D., and D.F.

Competing interest statement

The authors declare that no competing interests exist.

Data availability statement

Previously published summary data are provided in the figures and in an Excel file.

Funding statement

The authors declare that no specific funding has been received for this article.

Supplementary material

The supplementary material for this article can be found at https://doi.org/10.1017/rsm.2026.10075.

References

Viechtbauer, W. Confidence intervals for the amount of heterogeneity in meta-analysis. Stat Med. 2007;26(1):37–52. https://doi.org/10.1002/sim.2514.CrossRef Google Scholar PubMed

Higgins, JPT, Thompson, SG, Spiegelhalter, DJ. A re-evaluation of random-effects meta-analysis. J Royal Stat Soc Ser A Stat Soc. 2009;172(1):137–159. https://doi.org/10.1111/j.1467-985X.2008.00552.x.CrossRef Google Scholar PubMed

Veroniki, AA, Jackson, D, Viechtbauer, W, et al. Methods to estimate the between-study variance and its uncertainty in meta-analysis. Res Synth Methods. 2016;7(1):55–79. https://doi.org/10.1002/jrsm.1164.CrossRef Google Scholar PubMed

Röver, C, Rindskopf, D, Friede, T. How trace plots help interpret meta-analysis results. Res Synth Methods. 2024;15(3):413–429. https://doi.org/10.1002/jrsm.1693.CrossRef Google Scholar PubMed

Borenstein, M. Avoiding common mistakes in meta-analysis: understanding the distinct roles of Q, I-squared, tau-squared, and the prediction interval in reporting heterogeneity. Res Synth Methods. 2024;15(2):354–368. https://doi.org/10.1002/jrsm.1678.CrossRef Google Scholar PubMed

Röver, C, Bender, R, Dias, S, et al. On weakly informative prior distributions for the heterogeneity parameter in Bayesian random-effects meta-analysis. Res Synth Methods. 2021;12(4):448–474. https://doi.org/10.1002/jrsm.1475.CrossRef Google Scholar PubMed

Kirkwood, TBL. Geometric means and measures of dispersion. Biometrics. 1979;35(4):908–909.Google Scholar

Chatfield, MD, Marquart-Wilson, L, Dobson, AJ, Farewell, DM. Mean relative error and standard relative deviation. Statistica Neerlandica. 2025;79(1):e70001. https://doi.org/10.1111/stan.70001.CrossRef Google Scholar

Mátrai, P, Kói, T, Sipos, Z, Farkas, N. Assessing the properties of the prediction interval in random-effects meta-analysis. Res Synth Methods. 2026;1–21. https://doi.org/10.1017/rsm.2025.10055.CrossRef Google Scholar

Higgins, JPT, Thomas, J, Chandler, J, et al. Cochrane Handbook for Systematic Reviews of Interventions. version 6.5 (updated August 2024). Cochrane; 2024.Google Scholar

IntHout, J, Ioannidis, JPA, Rovers, MM, Goeman, JJ. Plea for routinely presenting prediction intervals in meta-analysis. BMJ Open. 2016;6:e010247. https://doi.org/10.1136/bmjopen-2015-010247.CrossRef Google Scholar PubMed

Roberts, ME, Tchanturia, K, Stahl, D, Southgate, L, Treasure, J. A systematic review and meta-analysis of set-shifting ability in eating disorders. Psychol Med. 2007;37(8):1075–1084. https://doi.org/10.1017/S0033291707009877.CrossRef Google Scholar PubMed

Colditz, GA, Brewer, TF, Berkey, CS, et al. Efficacy of BCG vaccine in the prevention of tuberculosis: meta-analysis of the published literature. JAMA. 1994;271(9):698–702. https://doi.org/10.1001/jama.1994.03510330076038.CrossRef Google Scholar PubMed

Figure 1 Random-effects meta-analysis comparing the time to complete a trail making task in people with eating disorders and healthy controls.2,12 DerSimonian and Laird estimator of $\tau $ used. Figure produced using the R package meta.

Figure 2 Random-effects meta-analysis comparing the risk of tuberculosis (TB) between vaccine and control groups.13 REML estimator of $\tau $ used. Figure produced using the R package meta with some manual editing.

Chatfield et al. supplementary material

DOI: https://doi.org/10.1017/rsm.2026.10075.sm001

File 35.1 KB

Article contents

Report tau or exp(tau) rather than tau-squared in random-effects meta-analyses

Abstract

Keywords

Information

Highlights

What is already known?

What is new?

Potential impact for RSM readers

1 Introduction

2 Random-effects models

2.1 Differences

2.2 Ratios

2.3 Prediction intervals

3 Examples

3.1 Differences

3.2 Ratios

4 Conclusion

Author contributions

Competing interest statement

Data availability statement

Funding statement

Supplementary material

References

Chatfield et al. supplementary material

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests