Hostname: page-component-77f85d65b8-lfk5g Total loading time: 0 Render date: 2026-03-28T22:34:48.877Z Has data issue: false hasContentIssue false

When models fail: An introduction to posterior predictive checks and model misspecification in gravitational-wave astronomy

Published online by Cambridge University Press:  08 June 2022

Isobel M. Romero-Shaw*
Affiliation:
Monash Astrophysics, School of Physics and Astronomy, Monash University, Clayton, VIC 3800, Australia OzGrav: The ARC Centre of Excellence for Gravitational-Wave Discovery, Clayton, VIC 3800, Australia
Eric Thrane
Affiliation:
Monash Astrophysics, School of Physics and Astronomy, Monash University, Clayton, VIC 3800, Australia OzGrav: The ARC Centre of Excellence for Gravitational-Wave Discovery, Clayton, VIC 3800, Australia
Paul D. Lasky
Affiliation:
Monash Astrophysics, School of Physics and Astronomy, Monash University, Clayton, VIC 3800, Australia OzGrav: The ARC Centre of Excellence for Gravitational-Wave Discovery, Clayton, VIC 3800, Australia
*
Corresponding author: Isobel M. Romero-Shaw, email: isobel.romero-shaw@monash.edu
Rights & Permissions [Opens in a new window]

Abstract

Bayesian inference is a powerful tool in gravitational-wave astronomy. It enables us to deduce the properties of merging compact-object binaries and to determine how these mergers are distributed as a population according to mass, spin, and redshift. As key results are increasingly derived using Bayesian inference, there is increasing scrutiny on Bayesian methods. In this review, we discuss the phenomenon of model misspecification, in which results obtained with Bayesian inference are misleading because of deficiencies in the assumed model(s). Such deficiencies can impede our inferences of the true parameters describing physical systems. They can also reduce our ability to distinguish the ‘best fitting’ model: it can be misleading to say that Model A is preferred over Model B if both models are manifestly poor descriptions of reality. Broadly speaking, there are two ways in which models fail. Firstly, models that fail to adequately describe the data (either the signal or the noise) have misspecified likelihoods. Secondly, population models—designed, for example, to describe the distribution of black hole masses—may fail to adequately describe the true population due to a misspecified prior. We recommend tests and checks that are useful for spotting misspecified models using examples inspired by gravitational-wave astronomy. We include companion python notebooks to illustrate essential concepts.

Information

Type
Review Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
© The Author(s), 2022. Published by Cambridge University Press on behalf of the Astronomical Society of Australia
Figure 0

Figure 1. Anscombe’s Quartet: a demonstration of the importance of data visualisation. While these datasets appear very different when plotted, they have identical summary statistics: mean $\bar{x} = 9, \bar{y} = 7.50$, sample variance $s^2_{x} = 9, s^2_{y} = 4.125 \pm 0.003$, $x-y$ correlation coefficient $0.816$, linear regression line $y_R = 3.00 + 0.500 x_R$, and linear regression coefficient of determination $R^2 = 0.67$. An Anscombe’s Quartet notebook is provided to demonstrate the calculation of these summary statistics for these datasets.

Figure 1

Figure 2. Forms of misspecification that we explore in this Article. Individual events can be misspecified if the model for the noise or the signal is not an adequate description of reality. The population of events may also be misspecified. This manifests itself as prior misspecification, which can impact both individual analyses (where the prior may be restricted to a limited portion of the true extent of the posterior) and population analyses (where the goal is to uncover the true distribution of the population).

Figure 2

Figure 3. The correctly specified waveform (pink) and the misspecified waveform (grey) used in Section 3.2, plotted in the time domain.

Figure 3

Figure 4. Identifying a misspecified signal model. The left-hand column shows tests performed on data containing a signal consistent with the sine-Gaussian pulse model that we test against. The right-hand column shows the same tests performed on data containing a different signal.

Figure 4

Figure 5. Identifying a misspecified noise model. The left-hand column shows tests performed with a correctly specified Gaussian noise model while the right-hand column shows the same tests with the same Gaussian model, but performed against a misspecified Student’s-t distribution.

Figure 5

Figure 6. Plots illustrating how model misspecification may manifest for an unparameterised population model, with a correctly specified dataset shown in the left-hand plots, and the misspecified case in the right-hand plots.

Figure 6

Figure 7. Model misspecification for a parameterised population model, with the correctly specified case demonstrated in the left-hand side of the lower two plots, and the misspecified case demonstrated on the right.