Hostname: page-component-6766d58669-l4t7p Total loading time: 0 Render date: 2026-05-21T03:47:51.577Z Has data issue: false hasContentIssue false

Bias-free model fitting of correlated data in interferometry

Published online by Cambridge University Press:  09 July 2021

Régis Lachaume*
Affiliation:
Instituto de Astronomía and Centro de Astroingeniería, Facultad de Física, Pontificia Universidad Católica de Chile, casilla 306, Santiago 22, Chile and Max-Planck-Institut für Astronomie, Königstuhl 17, D-69117 Heidelberg, Germany
*
Author for correspondence: Régis Lachaume, E-mail: regis.lachaume@gmail.com
Rights & Permissions [Opens in a new window]

Abstract

In optical and infrared long-baseline interferometry, data often display significant correlated errors because of uncertain multiplicative factors such as the instrumental transfer function or the pixel-to-visibility matrix. In the context of model fitting, this situation often leads to a significant bias in the model parameters. In the most severe cases, this can can result in a fit lying outside of the range of measurement values. This is known in nuclear physics as Peelle’s Pertinent Puzzle. I show how this arises in the context of interferometry and determine that the relative bias is of the order of the square root of the correlated component of the relative uncertainty times the number of measurements. It impacts preferentially large datasets, such as those obtained in medium to high spectral resolution. I then give a conceptually simple and computationally cheap way to avoid the issue: model the data without covariances, estimate the covariance matrix by error propagation using the modelled data instead of the actual data, and perform the model fitting using the covariance matrix. I also show that a more imprecise but also unbiased result can be obtained from ignoring correlations in the model fitting.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
© The Author(s), 2021. Published by Cambridge University Press on behalf of the Astronomical Society of Australia
Figure 0

Table 1. Symbols used in this paper. Lower case bold font is used for vectors and upper case bold font for matrices.

Figure 1

Figure 1. Original Peelle problem rewritten in the context of interferometry. Top: Two raw visibility amplitudes $\nu_1$ and $\nu_2$ (points with statistic error bars of $\approx 0.6\%$) are calibrated by the transfer function $1/\tau$ (solid line with systematic error zone of 5%). Bottom: the two calibrated visibility measurements ${{\scriptstyle V}}_1$ and ${{\scriptstyle V}}_2$ (points with statistic error bars of $\approx 0.6\%$) are strongly correlated. The least squares estimate for the visibility ${{\scriptstyle V}}$ (dashed line, with statistic uncertainty error zone displayed) falls outside of the data range. The systematic error on ${{\scriptstyle V}}$, ${{\scriptstyle V}}_1$, and ${{\scriptstyle V}}_2$ is shown on the right.

Figure 2

Figure 2. Fit $\mu^{\star}$ to unresolved visibilities (${{\scriptstyle V}} = 1$), as a function of the relative uncertainty on the calibration $\varsigma_{\tau}$ and the number of measurements n. $2/n\times10^5$ simulations were made and averaged, assuming that $\varepsilon_{\nu}$ and $\varepsilon_{\tau}$ follow normal distributions. Top: fully correlated normalisation like in original Peelle’s puzzle ($\varsigma_{\nu}=0.02$ and $\varrho = 1$). Bottom: normalisation error without correlation ($\varsigma_{\nu}=0.02$ and $\varrho = 0$).

Figure 3

Figure 3. Model fitting to simulated correlated data from a four-telescope interferometer (6 baselines) with medium spectral resolution (R = 100) with 2% uncorrelated measurement error and 3% correlated normalisation error (light grey points with the measurement error bar). Top: Simulated under-resolved data ${\scriptstyle V} = 1-x^2$ (thick grey line) are fitted with linear least squares model $\mu = a-bx^2$ using the four prescriptions for the covariance matrix. Bottom: The same for well-resolved data ${{\scriptstyle V}} = \exp -3x^2$ and non-linear least squares with model $\mu = a\exp -bx^2$. (a) Under-resolved, linear least squares. (b) Well resolved, non-linear least squares.

Figure 4

Figure 4. Distribution of the fitted parameters and fit properties for the four covariance matrix prescriptions analysed in Sect. 4. $5\times10^4$ simulations of 6 groups of 100 correlated data points ${{\scriptstyle V}}$ (measurement error $\varsigma_{\nu} = 2\%$ and normalisation error $\varsigma_{\tau} = 3\%$, correlation of the latter $\varrho = 1$, normal distributions) are performed and fitted with a model using least squares minimisation. Top graph: under-resolved data follow ${{\scriptstyle V}} = 1-x^2$ and are fitted with linear least squares $\mu = a - bx^2$. Bottom graph: well resolved data follow ${{\scriptstyle V}} = \exp -3x^2$ and are fitted with non-linear least squares $\mu = a\exp -bx^2$. Reported quantities include median and 1-$\sigma$ interval of their distribution and, within brackets, the median uncertainty reported by the least squares fit. The covariance matrix prescriptions are: top row: correlations are ignored; second row: a nave covariance matrix uses the data values; third row: covariance matrix uses modelled values from fit without correlations; bottom row: covariance matrix and model are recursively computed, with the covariance matrix of the next recursion using the modelled value of the last step. (a) Under-resolved data fitted with a linear least squares model. (b) Well-resolved data fitted with a non-linear least squares model.