Hostname: page-component-89b8bd64d-9prln Total loading time: 0 Render date: 2026-05-07T15:40:11.478Z Has data issue: false hasContentIssue false

χ2 from redundant calibration as a tool in the detection of faint radio-frequency interference

Published online by Cambridge University Press:  02 December 2024

Theodora Kunicki*
Affiliation:
Department of Physics, Brown University, Providence, RI, USA
Jonathan C. Pober
Affiliation:
Department of Physics, Brown University, Providence, RI, USA
*
Corresponding author: Theodora Kunicki; Email: theodora_kunicki@brown.edu.
Rights & Permissions [Opens in a new window]

Abstract

Radio-frequency interference detection and flagging is one of the most difficult and urgent problems in 21 cm Epoch of Reionisation research. In this work, we present $\chi^2$ from redundant calibration as a novel method for RFI detection and flagging, demonstrating it to be complementary to current state-of-the-art flagging algorithms. Beginning with a brief overview of redundant calibration and the meaning of the $\chi^2$ metric, we demonstrate a two-step RFI flagging algorithm which uses the values of this metric to detect faint RFI. We find that roughly 27.4% of observations have RFI from digital television channel 7 detected by at least one algorithm of the three tested: 18.0% of observations are flagged by the novel $\chi^2$ algorithm, 16.5% are flagged by SSINS, and 6.8% are flagged by AOFlagger (there is significant overlap in these percentages). Of the 27.4% of observations with detected DTV channel 7 RFI, 37.1% (10.2% of the total observations) are detected by $\chi^2$ alone, and not by either SSINS or AOFlagger, demonstrating a significant population of as-yet undetected RFI. We find that $\chi^2$ is able to detect RFI events which remain undetectable to SSINS and AOFlagger, especially in the domain of long-duration, weak RFI from digital television. We also discuss the shortcomings of this approach and discuss examples of RFI which seems undetectable using $\chi^2$ while being successfully flagged by SSINS and/or AOFlagger.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press on behalf of Astronomical Society of Australia
Figure 0

Figure 1. Antenna positions in the MWA Phase II compact configuration.

Figure 1

Figure 2. The mean value of $\chi^2/\mathrm{nDoF}$ versus local sidereal time; each point represents one two-minute observation, with the standard deviation of $\chi^2/\mathrm{nDoF}$ within the observation represented as an error bar. Each color represents data from a different night, while each shape represents data from a different pointing. The $\chi^2$ means have a pronounced LST dependence. Within individual instrument pointings, there is also an upward trend. Because of the dependence on LST, a single $\chi^2/\mathrm{nDoF}$ cutoff value applied across all observations in sub-optimal for flagging RFI.

Figure 2

Figure 3. This plot represents the same data as Fig. 2, except now modified z-scores have been taken of the $\chi^2/\mathrm{nDoF}$ data. The mean of the modified z-scores of $\chi^2/\mathrm{nDoF}$ is plotted against local sidereal time, with the standard deviation of the modified z-scores of the data represented as error bars. The LST dependence of $\chi^2$ present in Fig. 2 is effectively eliminated by using modified z-score. As with Fig. 2, color denotes the day data were taken, and different point shapes represent different antenna pointings of the instrument.

Figure 3

Figure 4. Waterfalls and histograms of the modified z-scores of $\chi^2/\mathrm{nDoF}$ of 6 two-minute observations. The waterfall plots, left, show the modified z-score of $\chi^2 / \mathrm{nDoF}$ as a function of time and frequency. Elevated values are indicative of RFI. To the right of each waterfall plot is the corresponding histogram, showing the distribution of the modified z-scores of the data. The first observation is RFI-free and shows a nearly Gaussian normal distribution, whereas the subsequent examples all contain RFI and have high-z-score outliers of varying degree. The second through fifth images all represent DTV events, while the sixth is a narrow-band event at 196.175 MHz.

Figure 4

Figure 5. An example $\chi^2$ waterfall plot of an incompletely-flagged channel 7 DTV event. The top panel shows the calculated z-scores, while the bottom panel shows the derived flags in magenta. This event was flagged using the algorithm described in Section 3.4, but the channel 7 DTV band is only partially flagged for any given time.

Figure 5

Figure 6. The fraction of data flagged using $\chi^2$ across all observations as a function of frequency. The most commonly flagged RFI (in approximately 5 to 6% of the data) is channel 7 DTV, as indicated by the large feature between 181 and 188 MHz. Also visible are slight amounts of flagging in channels 6 (174–181 MHz) and 9 (195–202 MHz). Narrow-band RFI is apparent at 196.175 and 194.455 MHz. Most other narrow-band peaks correspond to MWA coarse band edges, indicated by the shaded regions and are unlikely to be actual RFI. The slight increase in flagging at the lowest frequencies is due to spurious flagging of elevated $\chi^2$ stemming from the presence of the Galactic plane in the sidelobes of the primary beam (see Fig. 7).

Figure 6

Figure 7. An example of an observation with elevated low-frequency NN $\chi^2$ values due to the presence of the Galactic plane in the sidelobes of the MWA primary beam. In the corresponding histogram, we can see that the distribution deviates from a Gaussian normal. Observations like this (all corresponding to the -3 pointing with the instrument phased to EoR0) cause the uptick in flagging at very low frequencies in Fig. 6.

Figure 7

Figure 8. A comparison of $\chi^2$, SSINS, and AOFlagger performance on flagging DTV channel 7, the most commonly-detected source of RFI in our dataset. The x-axis represents the local time of the observation in UTC, while each vertical panel represents observations from a different date. Within a panel, each observation is represented by three circles: the top row (blue circles) is our $\chi^2$ method, the middle row (orange circles) is SSINS, and the bottom row (green circles) is AOFlagger. Filled circles indicate that RFI was detected by an algorithm in the channel 7 band for that file, whereas empty circles indicate that the algorithm did not detect any RFI in that observation file. “Detected” meaning, in the case of $\chi^2$ and AOFlagger, that $ \gt 5\%$ of possibly contaminated bins were flagged, and in the case of SSINS, that the match filter identified DTV channel 7 in this file. $\chi^2$ seems well-suited to detecting long-duration DTV events which are picked up only sporadically by SSINS and AOFlagger. Meanwhile, SSINS detects isolated events which are not detected by $\chi^2$ and, often, AOFlagger.

Figure 8

Table 1. Number of files flagged in the DTV channel 7 band for $\chi^2$, SSINS, and AOFlagger.

Figure 9

Figure 9. A Venn diagram comparing the percentage of DTV events flagged by the $\chi^2$, SSINS, and AOFlagger algorithms. Most of the events flagged by AOFlagger are detected by either $\chi^2$ or SSINS, but there are a significant number which are flagged by $\chi^2$ and missed by SSINS, and vice-versa. The circles in this diagram are to scale, although overlaps are not.

Figure 10

Figure 10. The same as Fig. 8, except for the 196.175 MHz narrow-band RFI signal (as opposed to channel 7 DTV). This signal appears to be periodic with a period of around 105 min.

Figure 11

Figure A1. A single time-slice of $\chi^2 / \mathrm{nDoF}$, generated by hera_cal without downsampling in time first (top) and after downsampling in time (bottom). The result without downsampling is noisy and difficult to process to identify RFI. After downsampling, noise in the $\chi^2$ metric is significantly reduced, and the RFI is visibly elevated over the noise floor.

Figure 12

Figure C1. A typical example of an observation with RFI in DTV channel 7 which is detectable with $\chi^2$, and not with other flagging algorithms. This observation is a part of a long string of DTV detections seen by $\chi^2$ alone.

Figure 13

Figure C2. This SSINS DTV detection occurs as part of a small cluster of files flagged by SSINS alone. The event is also visible in AOFlagger, and a hint of it is visible in $\chi^2$, but it did not reach the threshold for being listed as being ‘detected’ by any algorithm besides SSINS.

Figure 14

Figure C3. This SSINS DTV detection occurs as part of the same small cluster of SSINS detections as Fig. C2. The event is also visible in AOFlagger, but it did not reach the threshold for being listed as being ‘detected’ by AOFlagger or $\chi^2$.

Figure 15

Figure C4. This SSINS DTV detection occurs isolated in time from other RFI events.

Figure 16

Figure C5. This DTV channel 7 event was detected by $\chi^2$ and AOFlagger, but missed by SSINS.

Figure 17

Figure C6. An example of DTV channel 7, detected by all three algorithms.

Figure 18

Figure C7. An example of narrow-band RFI at 196.175 MHz, detected by all three algorithms.

Figure 19

Figure C8. An example of a ‘streak’ found in SSINS, but invisible to $\chi^2$.

Figure 20

Figure C9. An unusual event, which might have been due to weather, is also nearly invisible to $\chi^2$.