Hostname: page-component-89b8bd64d-b5k59 Total loading time: 0 Render date: 2026-05-09T08:06:44.055Z Has data issue: false hasContentIssue false

Interpretable Faraday complexity classification

Published online by Cambridge University Press:  23 April 2021

M. J. Alger*
Affiliation:
Research School of Astronomy and Astrophysics, The Australian National University, Canberra, ACT 2611, Australia Data61, CSIRO, Canberra, ACT 2601, Australia
J. D. Livingston
Affiliation:
Research School of Astronomy and Astrophysics, The Australian National University, Canberra, ACT 2611, Australia
N. M. McClure-Griffiths
Affiliation:
Research School of Astronomy and Astrophysics, The Australian National University, Canberra, ACT 2611, Australia
O. I. Wong
Affiliation:
CSIRO Astronomy & Space Science, PO Box 1130, Bentley, WA 6102, Australia ICRAR-M468, University of Western Australia, Crawley, WA 6009, Australia ARC Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), Australia
C. S. Ong
Affiliation:
Data61, CSIRO, Canberra, ACT 2601, Australia Research School of Computer Science, The Australian National University, Canberra, ACT 2601, Australia
*
Author for correspondence: M. J. Alger, E-mail: matthew.alger@gmail.com
Rights & Permissions [Opens in a new window]

Abstract

Faraday complexity describes whether a spectropolarimetric observation has simple or complex magnetic structure. Quickly determining the Faraday complexity of a spectropolarimetric observation is important for processing large, polarised radio surveys. Finding simple sources lets us build rotation measure grids, and finding complex sources lets us follow these sources up with slower analysis techniques or further observations. We introduce five features that can be used to train simple, interpretable machine learning classifiers for estimating Faraday complexity. We train logistic regression and extreme gradient boosted tree classifiers on simulated polarised spectra using our features, analyse their behaviour, and demonstrate our features are effective for both simulated and real data. This is the first application of machine learning methods to real spectropolarimetry data. With 95% accuracy on simulated ASKAP data and 90% accuracy on simulated ATCA data, our method performs comparably to state-of-the-art convolutional neural networks while being simpler and easier to interpret. Logistic regression trained with our features behaves sensibly on real data and its outputs are useful for sorting polarised sources by apparent Faraday complexity.

Information

Type
Research Article
Copyright
© The Author(s), 2021. Published by Cambridge University Press on behalf of the Astronomical Society of Australia
Figure 0

Figure 1. A simple FDF and its corresponding polarised spectra: (a) groundtruth FDF F, (b) noise-free polarised spectrum P, (c) noisy observed FDF $\hat F$, (d) noisy polarised spectrum $\hat P$. Blue and orange mark real and imaginary components, respectively.

Figure 1

Figure 2. A complex FDF and its corresponding polarised spectra: (a) groundtruth FDF F, (b) noise-free polarised spectrum P, (c) noisy observed FDF $\hat F$, (d) noisy polarised spectrum $\hat P$. Blue and orange mark real and imaginary components, respectively.

Figure 2

Figure 3. An example of how an observed FDF $\hat F$ relates to our features. $\phi_w$ is the $W_2$-minimising Faraday depth, and $\phi_a$ is the $\hat F$-maximising Faraday depth (approximately equal to the Euclidean-minimising Faraday depth). The remaining two features are the $W_2$ and Euclidean distances between the depicted FDFs.

Figure 3

Table 1. Confusion matrix entries for LR and XGB on ‘ASKAP’ and ‘ATCA’ simulated datasets, and the CNN confusion matrix entries are adapted from Brown et al. (2018)

Figure 4

Figure 4. Mean prediction as a function of component depth separation and minimum component amplitude for (a) XGB and (b) LR.

Figure 5

Figure 5. Principal component analysis for simulated data (coloured dots) with observations overlaid (black-edged circles). Observations are coloured by their XGB- or LR-estimated probability of being complex, with blue indicating ‘most simple’ and pink indicating ‘most complex’.

Figure 6

Figure 6. Estimated rates of Faraday complexity for the Livingston and O’Sullivan datasets as functions of threshold. The horizontal lines indicate the rates of Faraday complexity estimated by Livingston and O’Sullivan respectively.