Meta-analytic-predictive priors based on a single study

Christian Röver; Tim Friede

doi:10.1017/rsm.2026.10081

Meta-analytic-predictive priors based on a single study

Published online by Cambridge University Press: 24 March 2026

Christian Röver

and

Tim Friede

Show author details

Christian Röver*: Affiliation:
Department of Medical Statistics, University Medical Center Göttingen , Göttingen, Germany
Tim Friede: Affiliation:
Department of Medical Statistics, University Medical Center Göttingen , Göttingen, Germany DZHK (German Center for Cardiovascular Research), Partner Site Lower Saxony, Göttingen, Germany DZKJ (German Center for Child and Adolescent Health), Göttingen, Germany
*: Corresponding author: Christian Röver; Email: christian.roever@med.uni-goettingen.de

Article contents

Abstract
Highlights
Introduction
Shrinkage estimation using two studies
Two practical applications
Discussion
Author contributions
Competing interest statement
Data availability statement
Funding statement
Footnotes
References

Rights & Permissions

Abstract

Meta-analytic-predictive (MAP) priors have been proposed as a generic approach to deriving informative prior distributions, where external empirical data are processed to learn about certain parameter distributions. The use of MAP priors is also closely related to shrinkage estimation (also sometimes referred to as dynamic borrowing). A potentially odd situation arises when the external data consist only of a single study. Conceptually, this is not a problem, it only implies that certain prior assumptions gain in importance and need to be specified with particular care. We outline this important, not uncommon special case and demonstrate its implementation and interpretation based on the normal–normal hierarchical model. The approach is illustrated using example applications in clinical medicine.

Keywords

bias allowance dynamic borrowing MAP prior power prior random-effects meta-analysis shrinkage estimation

Information

Type: Research Article
Information: Research Synthesis Methods , First View , pp. 1 - 19

DOI: https://doi.org/10.1017/rsm.2026.10081 [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Open Practices: Open data Open materials
Copyright: © The Author(s), 2026. Published by Cambridge University Press on behalf of The Society for Research Synthesis Methodology

Highlights

What is already known?

• Shrinkage estimation may be used to effectively and robustly borrow information between related data sources.
• Shrinkage estimation may alternatively be motivated via a meta-analytic-predictive (MAP) approach.

What is new?

• A MAP approach remains sensible down to the extreme case of only a single study.
• The MAP prior’s usual features are retained, in addition, there are connections to power prior and bias allowance approaches.

Potential impact for RSM readers

• MAP priors are useful for constructing empirically motivated priors based on external/historical data.
• MAP priors may serve as an additional motivation for related approaches (bias allowance models and power priors).
• Practical application is straightforward using existing software packages.

1 Introduction

The potential of clinical research is commonly limited by data sparsity issues; such problems particularly arise in the context of rare diseases, where the number of potential study subjects is small, or in pediatric indications, where ethical considerations may limit the recruitment of patients. The large variety of rare diseases still means that a sizeable proportion of the population is affected by rare diseases, posing a considerable economic burden. Even in more common indications, data sparsity problems may arise, for example, when the focus is on smaller sub-populations, or when novel treatments or standards of care emerge. In any of these cases, the careful consideration of all potentially relevant evidence available is essential.Reference Gagne, Thompson, O’Keefe and Kesselheim ¹ ^– Reference Gamalo-Siebers, Savic and Basu ³ When evidence from a single experiment, such as a clinical trial, is not sufficiently conclusive on its own, it may sometimes help to view the data in the context of related instances (similar experiments) in order to yield more confident conclusions. This idea is explicitly implemented in shrinkage estimation, where a hierarchical model is set up accounting for estimation uncertainly at the study level as well as for variability (and similarity) between studiesReference Morris and Lysy ⁴ ^, Reference Gelman and Hill ⁵ ; models of this kind are commonly also used in the context of meta-analysis.Reference Fleiss ⁶ ^, Reference Röver ⁷ The borrowing-of-information taking place between the study of primary interest and the external data may be viewed in terms of the overarching joint model as a meta-analytic-combined (MAC) approach, or, equivalently, by formulating the meta-analytic predictive (MAP) prior that explicates the information contributed by the external data to the shrinkage estimate.Reference Schmidli, Gsteiger, Roychoudhury, O’Hagan, Spiegelhalter and Neuenschwander ⁸ A particular special case is given when a single (“target”) study is supported by a single (“source”) study; such situations are not uncommon, and shrinkage estimation here has proven useful.Reference Röver and Friede ⁹ ^– Reference Lesaffre, Qi, Banbeta and van Rosmalen ¹¹ Application of a hierarchical model makes it behave dynamically in the sense that more or less information is borrowed, depending on the apparent similarity of target and source data.Reference Röver and Friede ⁹ When considering this case in terms of the implied MAP prior, the “meta-analysis” involved here is based on a single study, which may appear somewhat counterintuitive at first. It is this perceived contradiction that we aim to address here; while a meta-analysis is commonly thought of as involving larger amounts of data, we will see that a hierarchical model may essentially be fit also to a single data point, and sensible predictions may be derived. Such a smallest-possible meta-analysis does not pose a conceptual problem, and there is no reason to abandon the general concept even if the amount of historical data drops below a couple of studies. It only implies that, due to the particular sparsity of data, prior specification within the model receives special importance, a problem which, however, is common in meta-analysis of few studies in general,Reference Röver, Bender and Dias ¹² and which analogously applies for alternative (and closely related) borrowing methods, such as power priors or bias allowance models.Reference Lesaffre, Qi, Banbeta and van Rosmalen ¹¹ ^, Reference Welton, Sutton, Cooper, Abrams and Ades ¹³ Since others appear to have struggled with or shied away from the idea of a single-study meta-analysis where in fact it may have been a viable option,Reference Iglesias, Muller and Zaugg ¹⁴ ^, Reference Harari, Soltanifar, Verhoek and Heeg ¹⁵ it seems worthwhile to investigate this special case a bit closer. Closer inspection of this particular case then also highlights how the properties of this MAP prior materialize, as well as its close connection to bias allowance and power prior approaches.

The remainder of this article is structured as follows: in Section 2, the normal–normal hierarchical model (NNHM) is introduced, the meta-analysis model which then is the basis for shrinkage estimation between a pair of studies, and for the MAP prior based on a single study. The ideas will be illustrated in two practical examples in Section 3. Section 3.1 discusses an application in paediatric Alport syndrome that was originally formulated in terms of a shrinkage estimation problem. Section 3.2 introduces a trial design application in cardiology, where information from a similar past study is designated for consideration in the eventual analysis via an informative MAP prior. Section 4 then closes with a brief discussion.

2 Shrinkage estimation using two studies

2.1 The normal–normal hierarchical model

The most common model for random-effects meta-analysis is given by the NNHM. It implements sampling error as well as between-study heterogeneity using normal distributions. The data are given in terms of k estimates $y_i$ and associated standard errors $s_i$ ( $i=1,\ldots ,k$ ). Each individual study aims to quantify a parameter $\theta _i$ , so that

(2.1)

$$ \begin{align} y_i|\theta_i,s_i \;\sim\;{\mathrm{Normal}}(\theta_i, s_i^2){.} \end{align} $$

The underlying parameters $\theta _i$ are not necessarily identical for all studies, instead some amount of (between-study) heterogeneity is allowed for, expressed as

(2.2)

$$ \begin{align} \theta_i|\mu,\tau \;\sim\; {\mathrm{Normal}}(\mu,\tau^2) {.} \end{align} $$

Often the overall mean $\mu $ is the aim of the analysis, while sometimes the study-specific parameters $\theta _i$ are also of interest.Reference Röver and Friede ⁹ ^, Reference Wandel, Neuenschwander, Röver and Friede ¹⁶ The heterogeneity, while important, usually remains a nuisance parameter. In the context of “shrinkage estimation” of the $\theta _i$ , an interesting aspect is that the problem may be motivated in two ways; classically, one may think of shrinkage estimation as a joint analysis of all (k) estimates, which also returns estimates of any $\theta _i$ parameter along the way; this is also denoted as the meta-analytic-combined (MAC) approach. The problem, however, may also be factored into the evidence stemming from the ith study alone, as well as the information provided by the remaining ( $k-1$ ) estimates. Shrinkage estimation then may be interpreted as the analysis of the ith study, based on a prior distribution that results as the predictive distribution derived from a meta-analysis of the other ( $k-1$ ) studies; this prior is denoted as the meta-analytic-predictive (MAP) prior. Both MAC and MAP approaches are equivalent and yield identical shrinkage estimates.Reference Schmidli, Gsteiger, Roychoudhury, O’Hagan, Spiegelhalter and Neuenschwander ⁸

In the following, we will focus on the special case of only two studies ( $k=2$ ). For the shrinkage estimate ( $\theta _2$ ), this implies a MAP prior that is based on a single study (i.e., the data provided through $y_1$ and $s_1$ ). While this may appear odd at first, the idea readily applies also in this special case, as will be demonstrated in the following. Analysis may generally be performed based on informative or uninformative priors for $\mu $ , while a proper, informative prior is required for $\tau $ .Reference Röver ⁷ ^, Reference Röver, Bender and Dias ¹² The case of only $k=2$ studies is also closely connected to the related concepts of power prior Reference Röver and Friede ⁹ or bias allowance models.Reference Welton, Sutton, Cooper, Abrams and Ades ¹³ ^, Reference Pocock ¹⁷

2.2 Uniform prior for the overall mean effect ( $\mu $ )

Priors for the overall mean parameter ( $\mu $ ) in the NNHM may be specified as informative or as uninformative. For (more or less informative) priors, normal distributions are an obvious choice, also since these lead to analytically simple inference. Quite commonly, effect priors however are chosen as uninformative and (improper) uniform, not least due to certain analogies to frequentist meta-analysis procedures.Reference Röver ⁷ In case of an (improper, non-informative) uniform prior for $\mu $ , certain expressions turn out particularly simple, which is also why we will focus on this particular, yet insightful, common and practically relevant case in the following. As the (improper) uniform prior constitutes the limiting case of an increasingly uninformative effect prior, the following considerations may also be viewed as relating to the limiting behavior for increasingly uninformative priors (e.g., for normal priors when their variance approaches infinity). The uniform effect prior leads to a normal conditional posterior for the overall mean effect $\mu $ , with moments given by

(2.3)

$$ \begin{align} {\mathrm{E}}[\mu|y_1, s_1, \tau] \;=\; y_1 \quad \text{and} \quad {\mathrm{Var}}(\mu|y_1, s_1, \tau) \;=\; s_1^2 + \tau^2 {,} \end{align} $$

and a marginal heterogeneity likelihood that is constant (independent of $\tau $ ), so that the heterogeneity’s posterior equals its prior.Reference Röver ⁷ This seems reasonable, since (as long as the overall mean effect prior is uniform) a single observation $y_1$ does not provide information on the heterogeneity $\tau $ .

2.3 The MAP prior for the effect in a new study

The MAP prior results as the posterior predictive distribution for a “new,” second study’s (study-specific) effect $\theta _2$ given the data from the first study ( $y_1$ , $s_1$ ). In the NNHM framework, the conditional predictive distribution again is normal with mean

(2.4)

$$ \begin{align} {\mathrm{E}}[\theta_2|y_1, s_1, \tau] \;=\; {\mathrm{E}}[\mu|y_1, s_1, \tau] \;=\; y_1 \end{align} $$

and variance

(2.5)

$$ \begin{align} {\mathrm{Var}}(\theta_2|y_1, s_1, \tau) \;=\; {\mathrm{Var}}(\mu|y_1, s_1, \tau) + \tau^2 \;=\; s_1^2 + 2\tau^2 \end{align} $$

(see also (2.2) and (2.3), and the more detailed derivation in Appendix A.1). These expressions make sense in the present context: We know $\theta _1$ with accuracy given by the standard error $s_1$ , and we know that the difference between $\theta _1$ and $\theta _2$ is normally distributed with variance $2\tau ^2$ , so that the (conditional) variance expression results as a corresponding sum.Reference Röver and Friede ⁹ As pointed out above, information on heterogeneity ( $\tau $ ) so far is based on the prior only; the heterogeneity in this context generally requires a proper, informative prior (since $k<3$ ).Reference Röver ⁷ ^, Reference Röver, Bender and Dias ¹²

The eventual (marginal) predictive distribution (marginalized over the distribution of $\tau $ ) hence results as a normal scale mixture Reference Lee, McLachlan, Balakrishnan, Colton, Everitt, Piegorsch, Ruggeri and Teugels ¹⁸ ^, Reference Lindsay ¹⁹ with fixed mean (2.4) and with variance as given in (2.5), where $\tau $ is distributed according to the specified prior. In particular, one may think of the MAP prior as the first study’s point estimate $p(\theta _1|y_1,s_1)$ convolved with the prior predictive distribution $p(\theta _2|\theta _1,\tau )$ and then marginalized over $\tau $ . The MAP prior is symmetric around $y_1$ , and its (marginal) variance results from (2.5) as

(2.6)

$$ \begin{align} {\mathrm{Var}}(\theta_2|y_1, s_1) \;=\; {\mathrm{E}}[s_1^2+2\tau^2] \;=\; s_1^2 + 2\,{\mathrm{E}}[\tau^2] {,} \end{align} $$

where the expectation ${\mathrm {E}}[\tau ^2]$ depends on the assigned heterogeneity prior. For a range of common prior specifications, this expectation may be derived analytically; Table A1 in Appendix A.4 lists some popular cases. From this expression, one can see that the relative magnitudes of $s_1^2$ and ${\mathrm {E}}[\tau ^2]$ determine whether the resulting MAP prior’s variance is dominated by estimation uncertainty (regarding $\theta _1$ ) or anticipated heterogeneity ( $\tau $ ). These two variance components may in fact also be considered to reflect so-called “type A” and “type B” uncertainties relating to measurement uncertainty and background knowledge, respectively, which together sum up to form the combined standard uncertainty $u_c$ (the square root of (2.6)).Reference Kirkup ²⁰ ^, Reference van der Bles, van der Linden and Freeman ²¹

Since the MAP prior results as a normal scale mixture, it is generally heavier-tailed than a normal distribution, which has implications for the resulting operating characteristics. A heavy-tailed (MAP-) prior means that in combination with a (“shorter-tailed”) normal likelihood, the likelihood will dominate in case of a prior-data conflict.Reference O’Hagan and Pericchi ²² Such robustness properties have in fact been noted and demonstrated in the meta-analysis context, as these lead to a dynamic borrowing behavior.Reference Röver and Friede ⁹ The MAP priors’ tail behavior will also be illustrated for some examples below (Figures 4 and 5).

Another way to quantify the precision of a prior distribution is by relating it to a number of observations that would in a certain sense convey an equivalent amount of information. In the following, we will use two approaches to this effect. Firstly, a prior may be assessed in terms of a corresponding absolute effective sample size (ESS) (here: number of patients), this will be done by quoting ESSs based on the expected local-information-ratio ( ${\mathrm {ESS}_{\mathrm {ELIR}}}$ ). This measure is based on the prior density’s curvature and it ensures predictive consistency, that is, on expectation, the posterior’s ESS will be the sum of the prior’s ${\mathrm {ESS}_{\mathrm {ELIR}}}$ plus the actual sample size.Reference Neuenschwander, Weber, Schmidli and O’Hagan ²³ Secondly, the added information from including the prior in a specific analysis may be expressed in terms of the (relative) gain in ESS.Reference Röver and Friede ⁹ This is based on comparing the relative width of a confidence interval with and without considering the informative prior, and then determining by what factor the sample size would have needed to be increased to yield the same precision gain (see also Appendix A.2).

2.4 The bias allowance model connection

In the 2-study case, there is a one-to-one correspondence between the NNHM and a simple bias allowance model Reference Welton, Sutton, Cooper, Abrams and Ades ¹³ ; instead of the NNHM assumption (2.2) in combination with a uniform prior for the overall mean effect $\mu $ and heterogeneity prior $p_\star (\tau )$ as in Section 2.1, one may specify

(2.7)

$$ \begin{align} \theta_2|\alpha,\beta &= \alpha\qquad\ \ \qquad \end{align} $$

(2.8)

$$ \begin{align}\kern9pt \theta_1|\alpha,\beta &\sim {\mathrm{Normal}}(\alpha, \, \beta^2) \end{align} $$

with prior $p(\beta )=\frac {1}{\sqrt {2}}\,p_{\star }\bigl (\frac {\beta }{\sqrt {2}}\bigr )$ for the standard deviation $\beta $ .Reference Röver and Friede ⁹ This “reference model” is different in that one estimate (the reference, or “target” $y_2$ ) directly relates to $\alpha $ , while the other one (the “source” $y_1$ ) is associated with an additional offset to account for potential bias. The shrinkage estimates of $\theta _i$ , however, can be shown to be identical in both models as long as a uniform prior for the overall mean effect ( $\mu $ ) is used.Reference Röver and Friede ⁹ The reference model may be considered a variation of Pocock’s bias model or, more generally, a bias allowance model.Reference Welton, Sutton, Cooper, Abrams and Ades ¹³ ^, Reference Pocock ¹⁷ ^, Reference Neuenschwander, Schmidli, Lesaffre, Baio and Boulanger ²⁴ The $\beta $ parameter, which only differs from $\tau $ by a scaling factor of $\sqrt {2}$ , may also help motivating a heterogeneity prior, as it directly relates to the expected difference between $\theta _2$ and $\theta _1$ , without reference to a common overall mean $\mu $ .

2.5 The power prior connection

There is also a connection to a so-called power prior, which has been proposed as an approach for deliberate down-weighting of prior information. It is intended for a prior distribution that itself results as a posterior, and the power prior results from applying an exponent $a_0$ (with $0 \leq a_0 \leq 1$ ) to its likelihood contribution.Reference Neuenschwander, Schmidli, Lesaffre, Baio and Boulanger ²⁴ ^, Reference Ibrahim and Chen ²⁵ When conditioning on a fixed $\tau $ value, the (conditional) MAP prior is normal with moments given in (2.4) and (2.5); in particular, note that $\tau ^2$ acts additively on the “plain” variance ( $s_1^2$ ). In this context, a power prior with fixed exponent $a_0$ on the other hand would correspond to a ${\mathrm {Normal}}\Bigl (y_1,\,\frac {s_1^2}{a_0}\Bigr )$ distribution, where the (inverse) exponent acts multiplicatively on the variance. Both MAP and power prior then are identical if $a_0=\bigl (2\frac {\tau ^2}{s_1^2}+1\bigr )^{-1}$ .Reference Röver and Friede ⁹ ^, Reference Chen and Ibrahim ²⁶ ^, Reference Pawel, Aust, Held and Wagenmakers ²⁷ It is interesting to note that the relationship between $a_0$ and $\tau $ here depends on the ratio $\frac {\tau }{s_1}$ ; Pawel et al. (2024)Reference Pawel, Aust, Held and Wagenmakers ²⁷ point out that the exponent $a_0$ directly relates to the “relative” heterogeneity as expressed though the popular $I^2$ statistic,Reference Higgins and Thompson ²⁸ which in this case simply equals $I^2=\frac {\tau ^2}{\tau ^2+s_1^2}$ . The exponent may then directly be expressed as a function of the corresponding $I^2$ as $a_0=\frac {1-I^2}{1+I^2}$ . While a prior probability distribution for $\tau $ is readily motivated with reference to the effect scale of $y_i$ and $\theta _i$ ,Reference Röver, Bender and Dias ¹² specification of a fixed $\alpha _0$ value remains tricky, and a prior distribution may be even harder to motivate, as it would implicitly relate to the $I^2$ scale. On the other hand, through the above functional correspondence, any prior for the heterogeneity $\tau $ immediately implies a corresponding distribution for the exponent $a_0$ ; an example is illustrated in Appendix A.5.

3 Two practical applications

3.1 Pediatric Alport example

3.1.1 Application

Gross et al. (2020)Reference Gross, Tönshoff and Weber ²⁹ performed a randomized controlled trial (RCT) in Alport syndrome to investigate the effects of ramipril, an angiotensin-converting enzyme inhibitor (ACEi). Recruitment of participants to the RCT was hampered by the rare and pediatric nature of the disease, and so the analysis of the RCT had been planned with the inclusion of observational data from an open-label arm and a natural disease cohort.Reference Gross, Friede and Hilgers ³⁰ Time to disease progression was a co-primary endpoint, and from the observational data, a hazard ratio (HR) of 0.53 [0.22, 1.29] was estimated based on 70 patients. Only 20 patients entered into the RCT, and an HR of 0.51 [0.12, 2.20] was estimated. The data are also summarized in Table 1.

Table 1

Alport example data from Gross et al. (2020).Reference Gross, Tönshoff and Weber ²⁹

The analysis was then performed by jointly considering both log-HR estimates in an NNHM, anticipating a reasonable amount of heterogeneity between them (expressed through a $\text {half-Normal}(0.5)$ prior for $\tau $ ), and deriving a shrinkage estimate for the RCT effect ( $\theta _2$ ). The resulting estimate then was substantially more precise than if the RCT data were considered in isolation; the HR was estimated at 0.52 [0.19, 1.39].Reference Gross, Tönshoff and Weber ²⁹

To make the flow of information transparent, we may now derive the corresponding MAP prior reflecting the information contributed by the observational data. Figure 1 illustrates MAP-prior, likelihood, and posterior (shrinkage estimate) for the Alport example. The MAP prior here has a mean of $y_i=-0.63$ and a variance of $s_1^2 + 2\,{\mathrm {E}}[\tau ^2] = 0.45^2 + 2\times 0.5^2 = 0.84^2$ . While the observational sample size was 70 patients, the MAP prior’s effective sample size ( ${\mathrm {ESS}_{\mathrm {ELIR}}}$ ) is at only 26 patients (i.e., 37% of originally 70 actual patients).Reference Neuenschwander, Weber, Schmidli and O’Hagan ²³ The eventual shrinkage interval is only 67% as wide as the original, implying a substantial “effective gain in sample size”Reference Röver and Friede ⁹ ; such a precision increase would otherwise have required more than doubling the sample size (by the addition of 24 extra patients). So in this case, the absolute ( ${\mathrm {ESS}_{\mathrm {ELIR}}}$ ) estimate matches well the observed precision gain.

Figure 1

Illustration of MAP-prior, likelihood, and (shrinkage-) posterior for the Alport example discussed in Section 3.1.Reference Gross, Tönshoff and Weber ²⁹ The horizontal lines at the bottom indicate point estimates and corresponding 95% intervals.

3.1.2 Variations of the MAP prior

The heterogeneity ( $\tau $ ) prior was specified as a ${\text {half-Normal}}(0.5)$ distribution, which is a reasonably conservative choice for endpoints such as HRs, as it covers “reasonable” and up to “fairly high” levels of heterogeneity ( $\tau \leq 1$ ) and leaves a small prior probability for “fairly extreme” amounts ( $\tau>1$ ).Reference Röver, Bender and Dias ¹² ^, Reference Friede, Röver, Wandel and Neuenschwander ³¹ Since conclusions heavily depend on the heterogeneity prior settings, it may however be interesting to investigate the effects of a range of reasonable alternative specifications; in particular, we will consider different prior scales and different distribution families. Among the various assumptions implemented in the analysis, a “too optimistic” heterogeneity prior (favoring small heterogenity) might yield results inappropriately close to a common-effect analysis, while overly “pessimistic” or “conservative” assumptions may on the other hand eventually lead to very little borrowing of information.

Assuming that $s_1=0.451$ (as in the present example, see Table 1), we can illustrate the resulting MAP prior when varying the heterogeneity prior scale. Figure 2 shows the likelihood of the observational estimate along with the corresponding MAP priors for half-normal heterogeneity priors with scales 0.25, 0.50, and 1.00. Increasing the heterogeneity prior scale yields a MAP prior that becomes increasingly wider than the plain likelihood alone. In the present case, the effect scale was a logarithmic HR, and on the logarithmic scale, the original MAP prior (based on a ${\text {half-Normal}}(0.5)$ heterogeneity prior) covers a range of $y_1 \pm 1.72$ with 95% probability. On the exponentiated scale (see also the top axis in Figure 2), a difference of 1.72 in log-HR would correspond to a 5.6-fold larger HR. If one switched to a ${\text {half-Normal}}(0.25)$ or ${\text {half-Normal}}(1.0)$ prior instead, the range would change to $y_1\pm 1.13$ or $y_1\pm 3.18$ instead, corresponding to multiplicative factors of 3.1 or 24.0, respectively.

Figure 2

Illustration of the resulting MAP-prior for varying heterogeneity prior scales. The dashed line indicates the likelihood of the observational data alone for comparison.

Half-normal distributions are a common and obvious choice as heterogeneity priors, possible reasons may be familiarity and availability, as well as a “flat” shape near the origin and a rather short tail.Reference Röver, Bender and Dias ¹² Variations of the distribution family commonly do not alter conclusions dramatically as long as they cover a similar range, as manifested, for example, in a common prior median.

Figure 3

Illustration of MAP-prior’s dependence on the heterogeneity prior distribution family. The different heterogeneity priors shown here all share the same prior median.

Figure 4

The MAP-priors’ cumulative distribution functions corresponding to the densities shown in Figure 3.

Figure 5

The MAP-priors’ densities on a logarithmic scale (see also Figure 3). Note that the likelihood for the observational data alone follows a parabola shape here, while the corresponding MAP priors are clearly much heavier-tailed.

MAP priors corresponding to alternative specifications to a ${\text {half-Normal}}(0.5)$ for the heterogeneity are illustrated in Figure 3. A range of distribution families is used, with their scale parameters specified such that all correspond to a common prior median for $\tau $ (of $0.34$ ). These different heterogeneity prior families are also shown in Appendix A.3. The resulting MAP prior densities themselves are hard to distinguish. Differences are more noticeable when focusing on the tail behavior, for example, considering cumulative distribution functions (as shown in Figure 4) or logarithmic densities (shown in Figure 5). One can see that heavier-tailed heterogeneity priors also yield correspondingly heavier-tailed MAP-priors. The heterogeneity and the corresponding MAP priors’ properties are also summarized and compared in Table 2 in terms of prior quantiles and effective sample sizes ( ${\mathrm {ESS}_{\mathrm {ELIR}}}$ ).

Table 2

Summaries of MAP priors resulting from several settings for the heterogeneity ( $\tau $ ) prior. The half-normal(0.5) prior is contrasted with half-normal priors of differing scale, as well as with priors of differing distributional families, but with matching prior medians. Note that in the context of the present example, the MAP prior’s domain corresponds to logarithmic hazard ratios (log-HRs). Quantiles are centered at $y_1$

It is sometimes also instructive to observe the effects of variations of the prior on the resulting estimates; for example, varying the heterogeneity prior scale allows for a sensitivity (or tipping point) analysis. Such an analysis is shown in Appendix A.6; the amount of borrowing is reflected in the shrinkage interval’s width, but in the present example, inference would not change qualitatively, and a log-HR of zero always remains included.

3.2 Heart failure example

The Spirit-HF trial has been designed in order to test the efficacy of spironolactone in patients with heart failure (HF).Reference Pieske ³² Spironolactone is expected to reduce cardiovascular mortality as well as hospitalizations due to HF, and had previously been investigated in the Topcat trial.Reference Pitt, Pfeiffer and Assmann ³³ Both studies refer to the composite of (recurrent) HF hospitalization and cardiovascular death as the primary endpoint to evaluate treatment efficacy. Despite a sizeable sample size of 3,445 patients and a mean follow-up duration of more than three years, the Topcat trial failed to demonstrate statistical significance; the estimated HR was at 0.89 (0.77, 1.04) ( $p=0.14$ ).Reference Pitt, Pfeiffer and Assmann ³³

The analysis of the new Spirit-HF trial meanwhile is being planned, and may take into consideration the evidence already generated in the Topcat trial. One idea may be to derive a shrinkage estimate, anticipating some between-study heterogeneity, and dynamically borrowing information from the earlier study based on the corresponding MAP prior.Reference Röver and Friede ⁹ ^, Reference Röver and Friede ¹⁰ For the between-study heterogeneity $\tau $ , use of a ${\text {half-Normal}}(0.25)$ prior may be appropriate. The heterogeneity prior may be motivated referring to anticipated levels of heterogeneity based on general considerationsReference Röver, Bender and Dias ¹² or using empirical evidence, in particular in view of the similar study designs and the effect measure being a log-HR.Reference Lilienthal, Sturtz and Schürmann ³⁴

In the present case, analysis is based on the logarithmic HR; the HR estimated in the Topcat trial corresponds to a log-HR of $-0.117$ with a standard error of $0.077$ . The corresponding Topcat likelihood along with the resulting MAP prior is illustrated in Figure 6. The variance (squared standard error) of the Topcat study’s estimate was $s_1^2=0.077^2$ while for the assumed heterogeneity prior the expected heterogeneity variance is ${\mathrm {E}}[\tau ^2]=0.25^2$ (see also Table A1), so that the resulting MAP prior’s variance (2.6) is $s_1^2 + 2\,{\mathrm {E}}[\tau ^2] = 0.362^2$ , and the majority of the variance is due to epistemic uncertainty relating to the anticipated similarity of the Topcat and Spirit-HF parameters. The 95% prediction interval for the MAP prior is centered at the Topcat log-HR estimate and ranges from $-0.899$ to $+0.665$ , corresponding to HRs in the range [ $0.407$ , $1.945$ ]. According to the MAP prior, the probability of a beneficial treatment effect (a log-HR below zero) is 71%. The MAP prior has an ${\mathrm {ESS}_{\mathrm {ELIR}}}$ of 399, that is, only 12% of the 3,445 actual Topcat patients, and 31% of the estimated enrolment of 1,300 Spirit-HF patients.Reference Pieske ³² This means that the prior derived from the Topcat data will not enter the eventual analysis as an additional 3,445 patients (as would be the case if both study populations were pooled naïvely), but instead we expect an accuracy corresponding to a total of some $1,300+399$ patients. The Spirit-HF study’s contribution to its own shrinkage estimate may also be assessedReference Röver and Friede ¹⁰ ; assuming that both studies show the same dependence of standard error and sample size, the Spirit-HF study will account for a minimum of $61\%$ in weight to the eventual effect estimate.

Figure 6

Illustration of likelihood and corresponding MAP-prior for the heart failure example, using a ${\text {half-Normal}}(0.25)$ prior for $\tau $ . The horizontal line at the bottom indicates the 95% prediction interval.

4 Discussion

Despite the seemingly odd notion of a meta-analysis of a single study, the use of MAP priors remains completely consistent down to the extreme case of only one data point. The “usual” toolbox remains available, including common prior specifications,Reference Röver, Bender and Dias ¹² computation of ESSs,Reference Neuenschwander, Weber, Schmidli and O’Hagan ²³ robustification,Reference Schmidli, Gsteiger, Roychoudhury, O’Hagan, Spiegelhalter and Neuenschwander ⁸ as well as common meta-analysis software (e.g., the bayesmeta or RBesT R packages)Reference Röver ⁷ ^, Reference Weber, Li, Seaman, Kakizume and Schmidli ³⁵ for practical implementation. In addition, for $k=1$ , there are connections to bias allowance and power prior models (see Section 2) that may help motivating a MAP approach (or vice versa). MAP priors based on a few estimates are generally rather heavy-tailed, which will ensure robust operating characteristics.Reference Röver and Friede ⁹ ^, Reference Röver and Friede ¹⁰ ^, Reference O’Hagan and Pericchi ²² For a few data points in general, and in particular for only a single data point, the prior specification for the heterogeneity parameter $\tau $ gains in importance and needs to be particularly well-founded and convincing.Reference Röver, Bender and Dias ¹²

While only the normal model (NNHM) was discussed here, the idea also extends to other model families; for example, derivation of a MAP prior would also work for a binomial-normal model (as implemented in the RBesT package).Reference Weber, Li, Seaman, Kakizume and Schmidli ³⁵ Another related approach (with some similarity to the power prior) is given by the commensurate prior,Reference Hobbs, Carlin, Mandrekar and SD. ³⁶ which, however, does not constitute a special case of the MAP prior. Empirical MAP priors may then be utilized in different ways, either to simply motivate a reasonable sample size (or other design aspects), or to implement explicit borrowing of historical information.Reference Schmidli, Neuenschwander and Friede ³⁷ ^, Reference Muehlemann, Zhou, Mukherjee, Hossain, Roychoudhury and Russek-Cohen ³⁸

When MAP priors are used to also inform the analysis, it is important to approach the evaluation of operating characteristics from a sensible angle; the naive application of classically “frequentist” measures to judge a Bayesian procedure, in particular when informative priors are involved, will often not provide a meaningful assessment of its actual features.Reference Gneiting, Balabdaoui and Raftery ³⁹ ^– Reference Best, Ajimi, Neuenschwander, Saint-Hilary and Wandel ⁴¹

A common concern in the context of the use of historical data is that an informative prior might unduly dominate the eventual analysis; for example, in the HF example application, one might be worried that the much larger Topcat trial would swamp the data from the smaller Spirit-HF study. However, for the shrinkage estimate of interest here, the second study’s contribution is bounded by a minimum of $61\%$ within the suggested setup. This proportion would increase for a more conservative heterogeneity prior specificationReference Röver, Bender and Dias ¹² or when implementing robustification,Reference Schmidli, Gsteiger, Roychoudhury, O’Hagan, Spiegelhalter and Neuenschwander ⁸ however, such modeling decisions should probably rather be based on considerations of prior information than on deduced operating characteristics.

Besides considerations of the value of “borrowed” information for a given parameter estimate, MAP priors based on historical data may also be interesting for the design of subsequent trials, with or without the eventual use of shrinkage estimation in the final analysis. Historical information may then help determining sensible ranges for nuisance parametersReference Schmidli, Neuenschwander and Friede ³⁷ or sample sizes,Reference Lindley ⁴² ^, Reference Brutti, De Santis and Gubbiotti ⁴³ for interim decisions,Reference Schmidli, Gsteiger, Roychoudhury, O’Hagan, Spiegelhalter and Neuenschwander ⁸ ^, Reference Neuenschwander, Roychoudhuri and Schmidli ⁴⁴ or it may be used in a more comprehensive fashion to ensure a positive joint outcome.Reference Neuenschwander, Roychoudhuri and Schmidli ⁴⁴ ^, Reference Pawel, Consonni and Held ⁴⁵

Van Zwet et al. (2024) argue that the analysis of a single study should also account for heterogeneity of the treatment effect across studies. Therefore, they propose to consider analyses of individual studies also within an overarching NNHM framework similar to our approach presented here; using informative, empirically motivated priors for both $\mu $ and $\tau $ , inference may then be focused on the overall mean effect ( $\mu $ ) rather than the study-specific $\theta _1$ even in the analysis of only a single study.Reference Gelman ⁴⁶ ^, Reference van Zwet, Wiecek and Gelman ⁴⁷

Author contributions

Conceptualization: C.R. and T.F.; Methodology: C.R.; Writing original draft: C.R. Both authors approved the final submitted draft.

Competing interest statement

The authors declare that no competing interests exist.

Data availability statement

The data supporting the findings of this study are openly available at Zenodo under the URL https://doi.org/10.5281/zenodo.18633334.

Funding statement

Support from the German Centre for Cardiovascular Research (Deutsches Zentrum für Herz-Kreislauf-Forschung e.V., DZHK) is gratefully acknowledged (Grant No. 81Z0300108).

A Appendix

A.1 Posterior predictive distribution

When an informative ${\mathrm {Normal}}({\mu _{\mathrm {p}}}, {\sigma _{\mathrm {p}}}^2)$ prior is assumed for the overall mean $\mu $ , the posterior predictive distribution for a “new” study-specific mean $\theta _{k+1}$ (conditional on a given heterogeneity value) is again normal with moments

(A.1)

$$ \begin{align} {\mathrm{E}}[\theta_{k+1}|y_1, \ldots, y_k, s_1, \ldots, s_k, \tau] & = \frac{\frac{{\mu_{\mathrm{p}}}}{{\sigma_{\mathrm{p}}}^2}+\sum_{i=1}^k \frac{y_i}{s_i^2+\tau^2}}{\frac{1}{{\sigma_{\mathrm{p}}}^2}+\sum_{i=1}^k\frac{1}{s_i^2+\tau^2}} \end{align} $$

(A.2)

$$ \begin{align} {\mathrm{Var}}(\theta_{k+1}|y_1, \ldots, y_k, s_1, \ldots, s_k, \tau) & = \frac{1}{\frac{1}{{\sigma_{\mathrm{p}}}^2}+\sum_{i=1}^k\frac{1}{s_i^2+\tau^2}} + \tau^2 \end{align} $$

implying for the specific case of $k=1$ that

(A.3)

$$ \begin{align} {\mathrm{E}}[\theta_{2}|y_1, s_1, \tau] & = {\mu_{\mathrm{p}}}\frac{s_1^2+\tau^2}{{\sigma_{\mathrm{p}}}^2+s_1^2+\tau^2} + y_1 \frac{{\sigma_{\mathrm{p}}}^2}{{\sigma_{\mathrm{p}}}^2+s_1^2+\tau^2} \end{align} $$

(A.4)

$$ \begin{align} {\mathrm{Var}}(\theta_{2}|y_1, s_1, \tau) & = \frac{1}{\frac{1}{{\sigma_{\mathrm{p}}}^2} + \frac{1}{s_1^2 + \tau^2}} + \tau^2\end{align} $$

(see Röver (2020)Reference Röver ⁷ ). One can already see that in the limiting case of an increasingly vague effect prior ( ${\sigma _{\mathrm {p}}}\rightarrow \infty )$ , the prior’s influence vanishes, and the (conditional) variance increases.

Specification of an improper uniform prior for the overall mean effect $\mu $ also leads to a proper posterior; the posterior predictive moments then are of the slightly simpler form

(A.5)

$$ \begin{align} {\mathrm{E}}[\theta_{k+1}|y_1, \ldots, y_k, s_1, \ldots, s_k, \tau] & = \frac{\sum_{i=1}^k \frac{y_i}{s_i^2+\tau^2}}{\sum_{i=1}^k\frac{1}{s_i^2+\tau^2}} \end{align} $$

(A.6)

$$ \begin{align} {\mathrm{Var}}(\theta_{k+1}|y_1, \ldots, y_k, s_1, \ldots, s_k, \tau) & = \frac{1}{\sum_{i=1}^k\frac{1}{s_i^2+\tau^2}} + \tau^2 \end{align} $$

(see Röver (2020)Reference Röver ⁷ ). In the case of a single study ( $k=1$ ), these expressions then simplify to

(A.7)

$$ \begin{align} {\mathrm{E}}[\theta_{2}|y_1, s_1, \tau] \; = \; y_1 \quad \text{and} \quad {\mathrm{Var}}(\theta_{2}|y_1, s_1, \tau) \; = \; s_1^2 + 2\tau^2. \end{align} $$

Figure A1

Illustration of the several heterogeneity priors compared in Section 3.1.2 in terms of their probability density functions. All priors are scaled such that they have a common median of 0.34 (the median of a half-normal(0.5) prior; dashed line).

A.2 Gain in effective sample size

The relative gain in information from a prior may be quantified using the gain in ESS, which is based on the relative width of 95% intervals with and without consideration of the (informative) prior. First, consider the relative width q, the ratio of interval widths (or standard errors) ( $q=\frac {\text { width using informative prior}}{\text { width using vague prior}}$ ). Assuming that standard errors are proportional to the inverse of the square root of the sample size, the gain in ESS then is given by $q^{-2}-1$ . For example, if the informative prior yields an interval that is only half as wide ( $q=0.5$ ), this would otherwise have required a quadrupled sample size, or a $q^{-2}-1=300\%$ increase. If the interval is 90% as wide ( $q=0.9$ ), this corresponds to an approximate $q^{-2}-1=23\%$ increase.Reference Röver and Friede ⁹

A.3 Heterogeneity priors

Figures A1 and A2 show several prior densities and cumulative distribution functions (CDFs) discussed in Section 3.1.2 that are all scaled to a common median (of 0.34, the median of an HN(0.5) distribution).

Figure A2

Illustration of the several heterogeneity priors compared in Section 3.1.2 in terms of their cumulative distribution functions (CDFs). All priors are scaled such that they have a common median of 0.34 (dashed line).

Table A1

Expected values of $\tau ^2$ based on various common (prior) distributions for $\tau $ , depending on their scale parameter s. An asterisk ( $\ast $ ) indicates that there is no simple analytical expression. The half-Cauchy prior would be an additional option, but does not have a finite expectation (it is in fact also a special case of the half-Student-t prior, with $\nu =1$ degree of freedom)

A.4 Variance of the MAP prior

The prior predictive distribution $p(\theta _2|y_1,s_1)$ is a normal scale mixture with (constant) mean $y_1$ and (conditional) variance $s_1^2+2\tau ^2$ , where $\tau $ is distributed according to the specified heterogeneity prior. The mixture distribution’s marginal variance results as ${\mathrm {Var}}(\theta _2|y_1,s_1) = s_1^2 + 2\,{\mathrm {E}}[\tau ^2]$ and depends on the (prior) expectation of the squared heterogeneity ${\mathrm {E}}[\tau ^2]$ (see (2.6)). Table A1 summarizes expectations for $\tau ^2$ for a range of common heterogeneity priors. Note also the related Table B1 in the appendix of Röver et al. (2021)Reference Röver, Bender and Dias ¹² giving some additional details for the prior distributions shown here.

A.5 Power prior exponent’s distribution

For any fixed heterogeneity value $\tau $ , the (conditional) MAP prior is equivalent to a power prior with exponent $a_0=\bigl (2\frac {\tau ^2}{s_1^2}+1\bigr )^{-1}$ (see also Section 2.5).Reference Röver and Friede ⁹ Through this functional relationship, any prior density $p_\star (\tau )$ for the heterogeneity implies a corresponding prior for the exponent $a_0$ with probability density function

(A.8)

$$ \begin{align}\textstyle p(a_0) \;=\; \frac{s_1}{2\sqrt{2}} \frac{\sqrt{\frac{a_0}{1-a_0}}}{a^2}\; p_\star\biggl(s_1\sqrt{\frac{1-a_0}{2\,a_0}}\biggr) {.} \end{align} $$

Figure A3 illustrates such densities for an example value of $s_1=0.451$ (as in the Alport example from Section 3.1). The prior densities for $a_0$ are shown for half-normal priors with scales 0.25, 0.50, and 1.00 (corresponding to the cases also illustrated in Figure 2).

A value of $a_0\!=\!1$ for the exponent corresponds to full borrowing, while smaller values imply increasing degrees of discounting of prior information. While the ${\text {half-Normal}}(0.25)$ prior places a substantial share of prior probability near $a_0\!=\!1$ , larger prior scale parameters correspond to less a-priori expected borrowing, eventually resulting in bimodal priors for $a_0$ .

Figure A3

Illustration of prior distributions for the power prior exponent $a_0$ corresponding to certain prior distributions assumed for the heterogeneity $\tau $ (and $s_1=0.451$ ).

Note that (as elaborated in Section 2.5) the mapping between $\tau $ and $a_0$ always depends on the “source” study’s standard error ( $s_1$ ). A prior for $\tau $ may be motivated independent of $s_1$ , while specification of a value (or distribution) for $a_0$ may have odd consequences when varying the source study’s size or precision ( $s_1$ ).

Figure A4

Illustration of the effect of varying the (half-normal) heterogeneity prior’s scale on the resulting RCT shrinkage estimate from Section 3.1.

A.6 Sensitivity analysis (Alport example)

In the Alport example application from Section 3.1, varying the half-Normal heterogeneity prior’s scale parameter affects the precision of the corresponding MAP prior, and with that, the eventual amount of borrowing from the observational data. Figure A4 shows how the resulting shrinkage estimate for the RCT is affected. A smaller prior scale leads to more borrowing and hence a shorter shrinkage interval eventually approaching the common-effect estimate. A larger prior scale leads to less borrowing, with the shrinkage interval eventually approaching the interval based on the RCT data alone. Since a log-HR of zero always remains included, in this example, there is no “tipping point” for the heterogeneity’s prior scale parameter.

A.7 Example R code

A.7.1 Alport example

A.7.2 Heart failure example

Footnotes

This article was awarded Open Data and Open Materials badges for transparent practices. See the Data availability statement for details.

References

Gagne, JJ, Thompson, L, O’Keefe, K, Kesselheim, AS. Innovative research methods for studying treatments for rare diseases: methodological review. BMJ. 2014;349:g6802.10.1136/bmj.g6802CrossRef Google Scholar

Tudur Smith, C, Williamson, PR, Beresford, MW. Methodology of clinical trials for rare diseases. Best Pract Res Clin Rheumatol. 2014;28:247–262.10.1016/j.berh.2014.03.004CrossRef Google Scholar PubMed

Gamalo-Siebers, M, Savic, J, Basu, C, et al. Statistical modeling for Bayesian extrapolation of adult clinical trial information in pediatric drug evaluation. Pharm Stat. 2017;16:232–249.10.1002/pst.1807CrossRef Google Scholar PubMed

Morris, CN, Lysy, M. Shrinkage estimation in multilevel normal models. Stat Sci. 2012;7:115–134.Google Scholar

Gelman, A, Hill, J. Data Analysis Using Regression and Multilevel/Hierarchical Models. Cambridge University Press; 2007.Google Scholar

Fleiss, JL. The statistical basis of meta-analysis. Stat Methods Med Res. 1993;2:121–145.10.1177/096228029300200202CrossRef Google Scholar PubMed

Röver, C. Bayesian random-effects meta-analysis using the Bayesmeta R package. J Stat Softw. 2020;93:1–51.10.18637/jss.v093.i06CrossRef Google Scholar

Schmidli, H, Gsteiger, S, Roychoudhury, S, O’Hagan, A, Spiegelhalter, D, Neuenschwander, B. Robust meta-analytic-predictive priors in clinical trials with historical control information. Biometrics. 2014;70:1023–1032.10.1111/biom.12242CrossRef Google Scholar PubMed

Röver, C, Friede, T. Dynamically borrowing strength from another study through shrinkage estimation. Stat Methods Med Res. 2020;29:293–308.10.1177/0962280219833079CrossRef Google Scholar PubMed

Röver, C, Friede, T. Bounds for the weight of external data in shrinkage estimation. Biom J. 2021;65:1131–1143.10.1002/bimj.202000227CrossRef Google Scholar

Lesaffre, E, Qi, H, Banbeta, A, van Rosmalen, J. A review of dynamic borrowing methods with applications in pharmaceutical research. Braz J Probab Stat. 2024;38:1–31.10.1214/24-BJPS598CrossRef Google Scholar

Röver, C, Bender, R, Dias, S, et al. On weakly informative prior distributions for the heterogeneity parameter in Bayesian random-effects meta-analysis. Res Synth Methods. 2021;12:448–474.10.1002/jrsm.1475CrossRef Google Scholar PubMed

Welton, NJ, Sutton, AJ, Cooper, NJ, Abrams, KR, Ades, AE. Evidence Synthesis for Decision Making in Healthcare. Wiley; 2012.10.1002/9781119942986CrossRef Google Scholar

Iglesias, JF, Muller, O, Zaugg, S, et al. A comparison of an ultrathin-strut biodegradable polymer sirolimus-eluting stent with a durable polymer everolimus-eluting stent for patients with acute ST-segment elevation myocardial infarction undergoing primary percutaneous coronary intervention: rationale and design of the BIOSTEMI trial. EuroIntervention. 2018;18:692–699.10.4244/EIJ-D-17-00734CrossRef Google Scholar

Harari, O, Soltanifar, M, Verhoek, A, Heeg, B. Alone, together: on the benefits of Bayesian borrowing in a meta-analytic setting. Pharm Stat. 2023;22:903–920.10.1002/pst.2318CrossRef Google Scholar

Wandel, S, Neuenschwander, B, Röver, C, Friede, T. Using phase II data for the analysis of phase III studies: an application in rare diseases. Clin Trials. 2017;14:277–285.10.1177/1740774517699409CrossRef Google Scholar PubMed

Pocock, SJ. The combination of randomized and historical controls in clinical trials. J Chronic Dis. 1976;29:175–188.10.1016/0021-9681(76)90044-8CrossRef Google Scholar PubMed

Lee, SX, McLachlan, GJ. Scale mixture distribution. In: Balakrishnan, N, Colton, T, Everitt, B, Piegorsch, W, Ruggeri, F, Teugels, JL, eds. Wiley StatsRef: Statistics Reference Online. John Wiley & Sons; 2019. https://doi.org/10.1002/9781118445112.stat08201.Google Scholar

Lindsay, BG. Mixture models: theory, geometry and applications. In: Collins J, Kelly P, Donoho MG eds. NSF-CBMS Regional Conference Series in Probability and Statistics. Vol. 5. Institute of Mathematical Statistics; 1995. http://www.jstor.org/stable/4153184.Google Scholar

Kirkup, L. A guide to GUM. Eur J Phys. 2002;23:483–487.10.1088/0143-0807/23/5/305CrossRef Google Scholar

van der Bles, AM, van der Linden, S, Freeman, ALJ, et al. Communicating uncertainty about facts, numbers and science. R Soc Open Sci. 2019;6:181870.10.1098/rsos.181870CrossRef Google Scholar PubMed

O’Hagan, A, Pericchi, L. Bayesian heavy-tailed models and conflict resolution: a review. Braz J Probab Stat. 2012;26:372–401.10.1214/11-BJPS164CrossRef Google Scholar

Neuenschwander, B, Weber, S, Schmidli, H, O’Hagan, A. Predictively consistent prior effective sample sizes. Biometrics. 2020;76:578–587.10.1111/biom.13252CrossRef Google Scholar PubMed

Neuenschwander, B, Schmidli, H. Use of historical data. In: Lesaffre, E, Baio, G, Boulanger, B, eds. Bayesian Methods in Pharmaceutical Research. Chapman & Hall / CRC; 2020:111–137. Chap. 6. https://doi.org/10.1201/9781315180212.CrossRef Google Scholar

Ibrahim, JG, Chen, MH. Power prior distributions for regression models. Stat Sci. 2000;15:46–60.Google Scholar

Chen, MH, Ibrahim, JG. The relationship between the power prior and hierarchical models. Bayesian Anal. 2006;1:551–574.10.1214/06-BA118CrossRef Google Scholar

Pawel, S, Aust, F, Held, L, Wagenmakers, EJ. Power priors for replication studies. TEST. 2024;33:127–154.10.1007/s11749-023-00888-5CrossRef Google Scholar PubMed

Higgins, JPT, Thompson, SG. Quantifying heterogeneity in a meta-analysis. Stat Med. 2002;21:1539–1558.10.1002/sim.1186CrossRef Google Scholar PubMed

Gross, O, Tönshoff, B, Weber, LT, et al. A multicenter, randomized, placebo-controlled, double-blind phase 3 trial with open-arm comparison indicates safety and efficacy of nephroprotective therapy with ramipril in children with Alport’s syndrome. Kidney Int. 2020;97:1275–1286.10.1016/j.kint.2019.12.015CrossRef Google Scholar PubMed

Gross, O, Friede, T, Hilgers, R, et al. Safety and efficacy of the ACE-inhibitor ramipril in Alport syndrome: the double-blind, randomized, placebo-controlled, multicenter phase III EARLY PRO-TECT Alport trial in pediatric patients. ISRN Pediatr. 2012;2012:436046.10.5402/2012/436046CrossRef Google Scholar PubMed

Friede, T, Röver, C, Wandel, S, Neuenschwander, B. Meta-analysis of few small studies in orphan diseases. Res Synth Methods. 2017;8:79–91.10.1002/jrsm.1217CrossRef Google Scholar PubMed

Pieske, B. Spironolactone in the treatment of heart failure (SPIRIT-HF). NCT 04727073. ClinicalTrials.gov. 2021. https://www.clinicaltrials.gov/study/NCT04727073.Google Scholar

Pitt, B, Pfeiffer, MA, Assmann, SF, et al. Spironolactone for heart failure with preserved ejection fraction. N Engl J Med. 2014;370:1383–1392.10.1056/NEJMoa1313731CrossRef Google Scholar PubMed

Lilienthal, J, Sturtz, S, Schürmann, C, et al. Bayesian random-effects meta-analysis with empirical heterogeneity priors for application in health technology assessment with very few studies. Res Synth Methods. 2024;15:275–287.10.1002/jrsm.1685CrossRef Google Scholar PubMed

Weber, S, Li, Y, Seaman, JW, Kakizume, T, Schmidli, H. Applying meta-analytic-predictive priors with the R Bayesian evidence synthesis tools. J Stat Softw. 2021;100:1–32.10.18637/jss.v100.i19CrossRef Google Scholar

Hobbs, BP, Carlin, BP, Mandrekar, SJ, SD., J. Hierarchical commensurate and power prior models for adaptive incorporation of historical information in clinical trials. Biometrics. 2011;67:1047–1056.10.1111/j.1541-0420.2011.01564.xCrossRef Google Scholar PubMed

Schmidli, H, Neuenschwander, B, Friede, T. Meta-analytic-predictive use of historical variance data for the design and analysis of clinical trials. Comput Stat Data Anal. 2017;113:100–110.10.1016/j.csda.2016.08.007CrossRef Google Scholar

Muehlemann, N, Zhou, T, Mukherjee, R, Hossain, MI, Roychoudhury, S, Russek-Cohen, E. A tutorial on modern Bayesian methods in clinical trials. Ther Innov Regul Sci. 2023;57:402–416.10.1007/s43441-023-00515-3CrossRef Google Scholar PubMed

Gneiting, T, Balabdaoui, F, Raftery, AE. Probabilistic forecasts, calibration and sharpness. J Royal Stat Soc B. 2007;69:243–268.10.1111/j.1467-9868.2007.00587.xCrossRef Google Scholar

Cook, SR, Gelman, A, Rubin, DB. Validation of software for Bayesian models using posterior quantiles. J Comput Graph Stat 2006;15:675–692.10.1198/106186006X136976CrossRef Google Scholar

Best, N, Ajimi, M, Neuenschwander, B, Saint-Hilary, G, Wandel, S. Beyond the classical type I error: Bayesian metrics for Bayesian designs using informative priors. Stat Biopharm Res. 2025;17:183–196. DOI: https://doi.org/10.1080/19466315.2024.2342817.CrossRef Google Scholar

Lindley, DV. The choice of sample size. J Royal Stat Soc D. 1997;46:129–138.Google Scholar

Brutti, P, De Santis, F, Gubbiotti, S. Bayesian-frequentist sample size determination: a game of two priors. Metro. 2014;72:133–151.10.1007/s40300-014-0043-2CrossRef Google Scholar

Neuenschwander, B, Roychoudhuri, S, Schmidli, H. On the use of co-data in clinical trials. Stat Biopharm Res. 2016;8:345–354.10.1080/19466315.2016.1174149CrossRef Google Scholar

Pawel, S, Consonni, G, Held, L. Bayesian approaches to designing replication studies. Psychol Methods. 2023. Advance online publication. https://doi.org/10.1037/met0000604.CrossRef Google Scholar PubMed

Gelman, A. Meta-analysis with a single study. In: Statistical Modeling, Causal Inference, and Social Science. 2024. https://statmodeling.stat.columbia.edu/2024/11/11/meta-analysis-with-a-single-study/.Google Scholar

van Zwet, E, Wiecek, W, Gelman, A. Meta-analysis with a single study. Stat Methods Med Res. 2025;34:2302–2312. DOI: https://doi.org/10.1177/09622802251380628.CrossRef Google Scholar PubMed

Table 1 Alport example data from Gross et al. (2020).29

Figure 1 Illustration of MAP-prior, likelihood, and (shrinkage-) posterior for the Alport example discussed in Section 3.1.29 The horizontal lines at the bottom indicate point estimates and corresponding 95% intervals.

Figure 2 Illustration of the resulting MAP-prior for varying heterogeneity prior scales. The dashed line indicates the likelihood of the observational data alone for comparison.

Figure 3 Illustration of MAP-prior’s dependence on the heterogeneity prior distribution family. The different heterogeneity priors shown here all share the same prior median.

Figure 4 The MAP-priors’ cumulative distribution functions corresponding to the densities shown in Figure 3.

Figure 5 The MAP-priors’ densities on a logarithmic scale (see also Figure 3). Note that the likelihood for the observational data alone follows a parabola shape here, while the corresponding MAP priors are clearly much heavier-tailed.

Table 2 Summaries of MAP priors resulting from several settings for the heterogeneity ($\tau $) prior. The half-normal(0.5) prior is contrasted with half-normal priors of differing scale, as well as with priors of differing distributional families, but with matching prior medians. Note that in the context of the present example, the MAP prior’s domain corresponds to logarithmic hazard ratios (log-HRs). Quantiles are centered at $y_1$

Figure 6 Illustration of likelihood and corresponding MAP-prior for the heart failure example, using a ${\text {half-Normal}}(0.25)$ prior for $\tau $. The horizontal line at the bottom indicates the 95% prediction interval.

Figure A1 Illustration of the several heterogeneity priors compared in Section 3.1.2 in terms of their probability density functions. All priors are scaled such that they have a common median of 0.34 (the median of a half-normal(0.5) prior; dashed line).

Figure A2 Illustration of the several heterogeneity priors compared in Section 3.1.2 in terms of their cumulative distribution functions (CDFs). All priors are scaled such that they have a common median of 0.34 (dashed line).

Table A1 Expected values of $\tau ^2$ based on various common (prior) distributions for $\tau $, depending on their scale parameter s. An asterisk ($\ast $) indicates that there is no simple analytical expression. The half-Cauchy prior would be an additional option, but does not have a finite expectation (it is in fact also a special case of the half-Student-t prior, with $\nu =1$ degree of freedom)

Figure A3 Illustration of prior distributions for the power prior exponent $a_0$ corresponding to certain prior distributions assumed for the heterogeneity $\tau $ (and $s_1=0.451$).

Figure A4 Illustration of the effect of varying the (half-normal) heterogeneity prior’s scale on the resulting RCT shrinkage estimate from Section 3.1.

Article contents

Meta-analytic-predictive priors based on a single study

Abstract

Keywords

Information

Highlights

What is already known?

What is new?

Potential impact for RSM readers

1 Introduction

2 Shrinkage estimation using two studies

2.1 The normal–normal hierarchical model

2.2 Uniform prior for the overall mean effect ( $\mu $ )

2.3 The MAP prior for the effect in a new study

2.4 The bias allowance model connection

2.5 The power prior connection

3 Two practical applications

3.1 Pediatric Alport example

3.1.1 Application

3.1.2 Variations of the MAP prior

3.2 Heart failure example

4 Discussion

Author contributions

Competing interest statement

Data availability statement

Funding statement

A Appendix

A.1 Posterior predictive distribution

A.2 Gain in effective sample size

A.3 Heterogeneity priors

A.4 Variance of the MAP prior

A.5 Power prior exponent’s distribution

A.6 Sensitivity analysis (Alport example)

A.7 Example R code

A.7.1 Alport example

A.7.2 Heart failure example

Footnotes

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests