Hostname: page-component-89b8bd64d-nlwjb Total loading time: 0 Render date: 2026-05-06T04:22:01.285Z Has data issue: false hasContentIssue false

w-Stacking w-projection hybrid algorithm for wide-field interferometric imaging: implementation details and improvements

Published online by Cambridge University Press:  08 October 2020

Luke Pratley*
Affiliation:
Mullard Space Science Laboratory (MSSL), University College London (UCL), Holmbury St Mary, Surrey RH5 6NT, UK Dunlap Institute for Astronomy and Astrophysics, University of Toronto, 50 St George Street, Toronto, ON M5S 3H4, Canada
Melanie Johnston-Hollitt
Affiliation:
International Centre for Radio Astronomy Research (ICRAR), Curtin University, 1 Turner Ave, Bentley, 6102, WA, Australia
Jason D. McEwen
Affiliation:
Mullard Space Science Laboratory (MSSL), University College London (UCL), Holmbury St Mary, Surrey RH5 6NT, UK
*
Author for correspondence: Luke Pratley, E-mail: Luke.Pratley@gmail.com
Rights & Permissions [Opens in a new window]

Abstract

We present a detailed discussion of the implementation strategies for a recently developed w-stacking w-projection hybrid algorithm used to reconstruct wide-field interferometric images. In particular, we discuss the methodology used to deploy the algorithm efficiently on a supercomputer via use of a Message Passing Interface (MPI) k-means clustering technique to achieve efficient construction and application of non-coplanar effects. Additionally, we show that the use of conjugate symmetry can increase the w-stacking efficiency, decrease the time required to construction, and apply w-projection kernels for large data sets. We then demonstrate this implementation by imaging an interferometric observation of Fornax A from the Murchison Widefield Array (MWA). We perform an exact non-coplanar wide-field correction for 126.6 million visibilities using 50 nodes of a computing cluster. The w-projection kernel construction takes only 15 min prior to reconstruction, demonstrating that the implementation is both fast and efficient.

Information

Type
Research Article
Copyright
© The Author(s), 2020. Published by Cambridge University Press on behalf of the Astronomical Society of Australia
Figure 0

Algorithm 1. k-means w-stacking: The k-means algorithm sorts the visibilities into clusters (w-stacks) by minimizing the average w deviation, $(\bar{w} - w)^2$, within each cluster. We use bold variables to denote an array, subscript to denote the array element and superscript to denote the iteration. The algorithm returns two arrays: ${\textit{n}}$ is the array of indices that labels the w-stack for each visibility; ${\bar{w}}$ is the average w value within each w-stack. The algorithm requires a starting w-stack distribution $\bar{{\textit{w}}}^{(0)}$, which we choose to be evenly distributed between the minimum and maximum w-values. The algorithm should iterate until $\bar{{\textit{w}}}^{(t)}$ has converged, which we choose to be a relative difference of $10^{-3}$. Note p is the index of visibility, q is the index for w-stacks, and c is the place holder for the minimum deviation for the visibility at index p. The ${\rm AllSumAll}(x)$ operation is an MPI reduction of a summation followed by broadcasting the result to all compute nodes.

Figure 1

Figure 1. The number of visibilities per w-stack in log scale $\log M_k$ as a function of mean w compared between using the k-means algorithm and uniform binning for 16 stacks. The horizontal error bars show the root-mean-squared w offset for the visibilities in a w-stack; this is proportional to the phase error that needs to be corrected due to the w-term. In particular, the k-means stacks show less residual w where there are more visibilities. To generate this plot, we used the Phase I MWA configuration after applying conjugation to ensure that all $w \geq 0$. We used a coverage from 768 channels with a width of 40 kHz centred at 87.68 MHz and a pointing centre 45 degree away from zenith towards the horizon. The k-means algorithm places less bins at larger values of w where there are less visibilities and conversely increases the number of bins at small w values where there are more visibilities. This has two possible advantages that the computational load is more balanced and that the w-offsets are reduced for the majority of the visibilities.

Figure 2

Figure 2. The left column shows plots of measurement operator (${\boldsymbol{\Phi}}$) construction times, and the right column shows plots of ${\boldsymbol{\Phi}}^\dagger{\boldsymbol{\Phi}}$ application times, as a function of image size N. The top, middle, and bottom rows show the times when using $M = 10^6, 10^7, 10^8$ visibilities. The standard deviation of the w sample density is determined by the value $\sigma_w$, which we chose values of 50, 100, and 150 wavelengths. We show results before and after applying conjugate symmetry to the visibilities. We find improvements in performance for large w ranges due to an increase in w-stacking efficiency, as described in Section 4. The kernel construction time is independent of image size due to the use of adaptive quadrature; this is clear for large M in the middle and bottom rows. For $\sigma_w = 150$ wavelengths with $M = 10^8$, we found that not applying conjugation resulted in large kernel construction times of greater than 140 min (not shown). We found construction time reduces to 30 min after applying conjugation as shown in the figure.

Figure 3

Figure 3. The left figure shows plots of measurement operator (${\boldsymbol{\Phi}}$) construction times, and the right figure shows plots of ${\boldsymbol{\Phi}}^\dagger{\boldsymbol{\Phi}}$ application times, as a function of the number of visibilities M for $N = 256^2$. The standard deviation of the w sample density is determined by the value $\sigma_w$, which we chose values of 50, 100, and 150 wavelengths. We find that the kernel construction time increases linearly with M. We find that the applying the conjugate consistently reduces the time required to calculate the w-projection kernels and can reduce the time for application.

Figure 4

Figure 4. The dirty map (Top Left), residuals (Top Right), and sky model reconstruction (Bottom) of the 112 s MWA Fornax A observation centred at 184.955 MHz, using 126.6 million visibilities and an image size of $2\,048^2$ (each pixel is 45 arcsec and the field of view is approximately 25 by 25 degrees). This image was reconstructed using the MPI-distributed w-stacking-w-projection hybrid algorithm, exploiting conjugate symmetry and the k-means clustering algorithm for distribution of w-stacks presented herein, and using the radial symmetric w-projection kernels, in conjunction with the ADMM algorithm. The dynamic range of the reconstruction is 844,000. The RMS of the residuals is approximately 15 mJy beam−1 over the entire field of view. The residuals are larger at the edges of the image due to side lobes of sources outside the field of view. The axes show the distance from image centre at right ascension 3:22:41.7 and declination -37:12:30 in J2000.

Figure 5

Figure 5. Same as Figure 4 zoomed view centred on Fornax A, showing the recovered structure of the double lobed radio galaxy. The residuals have been scaled to show the details. The residuals over the zoomed region have an RMS of 1.2 mJy beam−1.