Hostname: page-component-89b8bd64d-r6c6k Total loading time: 0 Render date: 2026-05-07T19:30:22.421Z Has data issue: false hasContentIssue false

Bayesian evidence-driven likelihood selection for sky-averaged 21-cm signal extraction

Published online by Cambridge University Press:  24 March 2023

K. H. Scheutwinkel*
Affiliation:
Astrophysics Group, Cavendish Laboratory, J. J. Thomson Avenue, Cambridge CB3 0HE, UK Kavli Institute for Cosmology, Madingley Road, Cambridge CB3 0HA, UK
W. Handley
Affiliation:
Astrophysics Group, Cavendish Laboratory, J. J. Thomson Avenue, Cambridge CB3 0HE, UK Kavli Institute for Cosmology, Madingley Road, Cambridge CB3 0HA, UK
E. de Lera Acedo
Affiliation:
Astrophysics Group, Cavendish Laboratory, J. J. Thomson Avenue, Cambridge CB3 0HE, UK Kavli Institute for Cosmology, Madingley Road, Cambridge CB3 0HA, UK
*
Corresponding author: K. H. Scheutwinkel, email: khs40@cam.ac.uk.
Rights & Permissions [Opens in a new window]

Abstract

We demonstrate that the Bayesian evidence can be used to find a good approximation of the ground truth likelihood function of a dataset, a goal of the likelihood-free inference (LFI) paradigm. As a concrete example, we use forward modelled sky-averaged 21-cm signal antenna temperature datasets where we artificially inject noise structures of various physically motivated forms. We find that the Gaussian likelihood performs poorly when the noise distribution deviates from the Gaussian case, for example, heteroscedastic radiometric or heavy-tailed noise. For these non-Gaussian noise structures, we show that the generalised normal likelihood is on a similar Bayesian evidence scale with comparable sky-averaged 21-cm signal recovery as the ground truth likelihood function of our injected noise. We therefore propose the generalised normal likelihood function as a good approximation of the true likelihood function if the noise structure is a priori unknown.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2023. Published by Cambridge University Press on behalf of the Astronomical Society of Australia
Figure 0

Figure 1. Top: Comparison of standardised probability density functions. Bottom: Exemplary heteroscedastic radiometric noise realisation that decreases exponentially $\propto \nu^{-2.6}$ over a range $\nu$.

Figure 1

Table 1. Prior choices for the foreground spectral indices $\beta_{1:N_{\mathrm{reg}}}$, sky-averaged 21-cm signal shape $f_{0,21}$, $\sigma_{21}$, $A_{21}$ and the varying noise parameters.

Figure 2

Table 2. PolyChord initialisations with $n_{\mathrm{live}}$ the number of live points, $n_{\mathrm{prior}}$ the number of randomly drawn prior samples before execution, $n_{\mathrm{fail}}$ the failed spawn criterion, and $n_{\mathrm{repeats}}$ the number of slice sampling repeats which is proportional to the model dimension $N_{\mathrm{dim}}$. The precision_criterion is the termination criterion and do_clustering activates the clustering algorithm within a nested sampling run.

Figure 3

Figure 2. Bayesian evidence $\log \mathcal{Z}$ (first row), signal model comparison ($\Delta M = M_1 - M_2$) (second row) and likelihood comparison Bayes factor (third row) for the Gaussian and generalised normal noise datasets. The errors are in the order of $\sigma_{\log \mathcal{Z}} \approx 0.3$.

Figure 4

Figure 3. Bayesian evidence $\log \mathcal{Z}$ (first row), signal model comparison ($\Delta M = M_1 - M_2$) (second row) and likelihood comparison Bayes factor (third row) for the Cauchy and radiometric noise datasets. The errors are in the order of $\sigma_{\log \mathcal{Z}} \approx 0.3$.

Figure 5

Figure 4. Foreground subtracted residuals (beige) and sky-averaged 21-cm signal recovery (blue) for the Cauchy noise case ($\gamma = 0.025$) (top) and radiometric noise case ($\tau = 10^4$ s) (bottom) when using varying likelihood functions for inference. The upper case shows signs of biased signal mean estimates and the lower case show signs of higher parameter variance.

Figure 6

Table 3. The Bayes factor for the no-signal model $M_2$ when using the Gaussian and generalised normal likelihood functions applied on a slightly modified dataset i.e. stronger sky-averaged 21-cm signal or larger noise. The noise parameter refers to the generalised normal noise distribution of the dataset.

Figure 7

Figure 5. Marginalised posterior estimates of the sky-averaged 21-cm signal parameters when using a Gaussian likelihood (red) versus a generalised normal likelihood function (blue) for an antenna temperature dataset containing generalised normal noise with $\beta_D=1$. The black dashed lines represent the ground truth values of the parameters.

Figure 8

Table 4. Bayesian evidences $\log \mathcal{Z}$ of all models and likelihoods considered for a simulated antenna temperature dataset with varying degrees and types of noise.