Hostname: page-component-89b8bd64d-ktprf Total loading time: 0 Render date: 2026-05-09T00:22:07.555Z Has data issue: false hasContentIssue false

Hydra II: Characterisation of Aegean, Caesar, ProFound, PyBDSF, and Selavy source finders

Published online by Cambridge University Press:  13 June 2023

M. M. Boyce*
Affiliation:
Department of Physics and Astronomy, University of Manitoba, 30A Sifton Road, Winnipeg, MB R3T 2N2, Canada
A. M. Hopkins
Affiliation:
Australian Astronomical Optics, Macquarie University, 105 Delhi Rd, North Ryde, NSW 2113, Australia
S. Riggi
Affiliation:
INAF, Osservatorio Astrofisico di Catania Via S. Sofia 78, Catania 95123, Italy
L. Rudnick
Affiliation:
School of Physics and Astronomy, Minnesota Institute for Astrophysics, University of Minnesota, 116 Church Street SE, Minneapolis, MN 55455, USA
M. Ramsay
Affiliation:
Department of Physics and Astronomy, University of Manitoba, 30A Sifton Road, Winnipeg, MB R3T 2N2, Canada
C. L. Hale
Affiliation:
School of Physics and Astronomy, Institute for Astronomy, University of Edinburgh, Royal Observatory, Blackford Hill, Edinburgh EH9 3HJ, UK
J. Marvil
Affiliation:
National Radio Astronomy Observatory, P.O. Box O, Socorro, NM 87801, USA
M. T. Whiting
Affiliation:
CSIRO Space & Astronomy, PO Box 76 Epping, NSW 1710, Australia
P. Venkataraman
Affiliation:
Dunlap Institute for Astronomy and Astrophysics, University of Toronto, 50 St. George Street, Toronto, ON M5S 3H4, Canada
C. P. O’Dea
Affiliation:
Department of Physics and Astronomy, University of Manitoba, 30A Sifton Road, Winnipeg, MB R3T 2N2, Canada
S. A. Baum
Affiliation:
Department of Physics and Astronomy, University of Manitoba, 30A Sifton Road, Winnipeg, MB R3T 2N2, Canada
Y. A. Gordon
Affiliation:
Department of Physics and Astronomy, University of Manitoba, 30A Sifton Road, Winnipeg, MB R3T 2N2, Canada Department of Physics, University of Wisconsin-Madison, Madison, WI 57306, USA
A. N. Vantyghem
Affiliation:
Department of Physics and Astronomy, University of Manitoba, 30A Sifton Road, Winnipeg, MB R3T 2N2, Canada
M. Dionyssiou
Affiliation:
Dunlap Institute for Astronomy and Astrophysics, University of Toronto, 50 St. George Street, Toronto, ON M5S 3H4, Canada
H. Andernach
Affiliation:
Departamento de Astronomía, DCNE, Universidad de Guanajuato, Callejón de Jalisco s/n, Guanjuato, CP 36023, GTO, Mexico
J. D. Collier
Affiliation:
Department of Astronomy, Inter-University Institute for Data Intensive Astronomy (IDIA), University of Cape Town, Private Bag X3, Rondebosch 7701, South Africa School of Science, Western Sydney University, Locked Bag 1797, Penrith, NSW 2751, Australia
J. English
Affiliation:
Department of Physics and Astronomy, University of Manitoba, 30A Sifton Road, Winnipeg, MB R3T 2N2, Canada
B. S. Koribalski
Affiliation:
School of Science, Western Sydney University, Locked Bag 1797, Penrith, NSW 2751, Australia Australia Telescope National Facility, CSIRO Astronomy and Space Science, PO Box 76, Epping, NSW 1710, Australia
D. Leahy
Affiliation:
Department of Physics and Astronomy, University of Calgary, 2500 University Dr. NW, Calgary, AB T2N 1N4, Canada
M. J. Michałowski
Affiliation:
Faculty of Physics, Astronomical Observatory Institute, Adam Mickiewicz University, ul. Słoneczna 36, 60-286 Poznań, Poland
S. Safi-Harb
Affiliation:
Department of Physics and Astronomy, University of Manitoba, 30A Sifton Road, Winnipeg, MB R3T 2N2, Canada
M. Vaccari
Affiliation:
Department of Astronomy, Inter-University Institute for Data Intensive Astronomy (IDIA), University of Cape Town, Private Bag X3, Rondebosch 7701, South Africa Department of Physics and Astronomy, Inter-University Institute for Data Intensive Astronomy (IDIA), University of the Western Cape, Robert Sobukwe Road, Bellville, Cape Town 7535, South Africa INAF - Istituto di Radioastronomia, via Gobetti 101, Bologna 40129, Italy
E. L. Alexander
Affiliation:
Jodrell Bank Centre for Astrophysics, Department of Physics and Astronomy, University of Manchester, Manchester M13 9PL, UK
M. Cowley
Affiliation:
School of Chemistry and Physics, Queensland University of Technology, Brisbane, QLD 4000, Australia Centre for Astrophysics, University of Southern Queensland, West Street, Toowoomba, QLD 4350, Australia
A. D. Kapinska
Affiliation:
National Radio Astronomy Observatory, P.O. Box O, Socorro, NM 87801, USA
A. S. G. Robotham
Affiliation:
ICRAR, M468, University of Western Australia, Crawley, WA 6009, Australia
H. Tang
Affiliation:
Department of Astronomy, Tsinghua University, Beijing 100084, China
*
Corresponding author: M. M. Boyce, Email: michelle.boyce2@umanitoba.ca
Rights & Permissions [Opens in a new window]

Abstract

We present a comparison between the performance of a selection of source finders (SFs) using a new software tool called Hydra. The companion paper, Paper I, introduced the Hydra tool and demonstrated its performance using simulated data. Here we apply Hydra to assess the performance of different source finders by analysing real observational data taken from the Evolutionary Map of the Universe (EMU) Pilot Survey. EMU is a wide-field radio continuum survey whose primary goal is to make a deep ($20\mu$Jy/beam RMS noise), intermediate angular resolution ($15^{\prime\prime}$), 1 GHz survey of the entire sky south of $+30^{\circ}$ declination, and expecting to detect and catalogue up to 40 million sources. With the main EMU survey it is highly desirable to understand the performance of radio image SF software and to identify an approach that optimises source detection capabilities. Hydra has been developed to refine this process, as well as to deliver a range of metrics and source finding data products from multiple SFs. We present the performance of the five SFs tested here in terms of their completeness and reliability statistics, their flux density and source size measurements, and an exploration of case studies to highlight finder-specific limitations.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2023. Published by Cambridge University Press on behalf of the Astronomical Society of Australia
Figure 0

Table 1. Radio surveys in the Northern (upper partition) and Southern (lower partition) hemispheres, with columns indicating the telescope (Telescope), radio survey (Survey), percentage sky-coverage (Sky), resolution ($\delta\theta_{res.}$), frequency ($\nu$), and depth ($\sigma_{rms}$).

Figure 1

Figure 1. $2^\circ\times2^\circ$ central cutout of an EMU pilot tile.

Figure 2

Table 2. SF order of magnitude PRD CPU times (rounded to one significant figure) for CMP, EXT, and EMU images. The processing was done on a 2GHz 16 core (single threaded) Intel Xeon Processor with 60G of RAM running Ubuntu 20.04.3 LTS.

Figure 3

Table 3. Typhon run statistics for $2^\circ\times2^\circ$ CMP and EXT and EMU images, with SF, image depth ($\mbox{D}= \mathcal{D},\,\mathcal{S}$), RMS parameter ($n_{rms}$ [$\sigma$]) island parameter, ($n_{island}$ [$\sigma$]), source numbers (N), residual RMS ($\mu$Jy beam$^{-1}$), and residual MADFM (median absolute deviation from the median) ($\mu$Jy beam$^{-1}$) columns.$^{\rm a}$ The CMP and EXT results are reproduced from Paper I.

Figure 4

Table 4. Hydra $\mu$-optimised Aegean, PyBDSF, and Selavy box_size and step_size input parameters,$^{\rm a}$ for CMP, EXT, and EMU $\mathcal{D}/\mathcal{S}$-images. The CMP and EXT results have been incorporated from Paper I.

Figure 5

Figure 2. SF CMP, EXT, EMU $\mathcal{D/S}$-image detection stacked plots (from the N columns of Table 3).

Figure 6

Table 5. $\mathcal{S}\;:\;\mathcal{D}$ recovery rates.

Figure 7

Table 6. $\mathcal{D/S}$ (D) residual (res.) $|\mbox{RMS}-\mbox{MADFM}|/\mbox{MADFM}$ (%), RMS ($\mu$Jy beam$^{-1}$), and MADFM ($\mu$Jy beam$^{-1}$) statistics extracted from Table 3.

Figure 8

Figure 3. Major-axis distributions for EMU sources (showing size distributions from both $\mathcal{D}$ and $\mathcal{S}$ image measurements together). NB: Size estimates between SFs are not necessarily directly comparable as they are estimated using different methods (Paper I).

Figure 9

Figure 4. CMP and EXT $\mathcal{C_D}$ (a and g), $\mathcal{R_D}$ (b and h), $\mathcal{C_S}$ (c and i), $\mathcal{R_S}$ (d and j), $\mathcal{C_{DS}}$ (e and k), and $\mathcal{R_{DS}}$ (f and l) vs. S/N. These results are reproduced from Paper I, Figures 14, 15, and 16.

Figure 10

Figure 5. EMU-image $\mathcal{C_{DS}}$vs. S/N (a) and $\mathcal{R_{DS}}$ (b) vs. S/N, with S/N expressed as $\mathcal{D}$-signal/$\mathcal{S}$-noise and $\mathcal{S}$-signal/$\mathcal{S}$-noise, respectively.

Figure 11

Figure 6. Flux density ratios ($S_{out}/S_{in}$) for CMP (left), simulated-extended (middle), and real (right) sources for Aegean (a, b, and c), Caesar (c, e, and f), ProFound (g, h, and i), PyBDSF (j, k, and l) and Selavy (m, n, and o) vs. S/N, expressed as $\mathcal{J}$-signal/$\mathcal{D}$-noise and $\mathcal{D}$-signal/$\mathcal{S}$-noise for simulated and real sources, respectively. The $1\sigma$ (solid) and $3\sigma$ (dashed) curves are RMS noise ($\sigma$) deviations from the $\mbox{flux-ratio}=1$ lines (dotted). Also shown are the detection threshold ($n_{rms}$; dot-dashed curves) and nominal $5\sigma$ threshold (dotted curves). These curves are annotated in (c).

Figure 12

Figure 7. False positives vs. S/N for CMP (a), EXT (b), and real (c) sources, with S/N expressed as $\mathcal{J}$-signal/$\mathcal{S}$-noise and $\mathcal{D}$-signal/$\mathcal{S}$-noise for simulated and real sources, respectively.

Figure 13

Table 7. $3\sigma$ scatter ($s_{3\sigma}(\hat{S}_i)$, Equation (2)) at $\hat{S}_i=3,\,5,\,10$, for SF CMP, EXT, and EMU source flux density ratios (Figure 6). Also shown are averaged $3\sigma$ scatters.

Figure 14

Figure 8. Edge Detection Infographic: Edge detection example cutouts for (a) CMP $\mathcal{D}$-image (from the $\mbox{S/N}\sim1\,600\pm110$ bin of $\mathcal{C_D}$, Figure 4a), (b) CMP $\mathcal{S}$-image (from the $\mbox{S/N}\sim42.6\pm2.9$ bin of $\mathcal{C_S}$, Figure 4c), and (c) EXT $\mathcal{D}$-image (from the $\mbox{S/N}\sim0.318\pm0.036$ bin of $\mathcal{C_D}$, Figure 4g). In examples (a) and (b), Caesar and Selavy failed to detect the injected sources at match_ids 4 and 7 270, respectively. In example (c), there is an injected source at match_id 6607 ($\mbox{S/N}\sim0.34$) which is detected by ProFound ($\mbox{S/N}\sim4.82$) and PyBDSF ($\mbox{S/N}\sim6.39$). The remaining detections, by both SFs, at match_ids 6608 and 6609 are spurious, due to noise fluctuations.

Figure 15

Figure 9. Blended Sources Infographic: CMP (a–b) and real (EMU; c–f) $\mathcal{D}$-image examples of blended sources. In example (a) (from the $\mbox{S/N}\sim19.6\pm1.3$ bin of $\mathcal{C_D}$, Figure 7a), only ProFound missed detection of the isolated source at match_id 6201. The remaining injected sources overlap to make up a single unresolved compact object, which is identified as match_id 6203 by all SFs. In example (b) (from the $\mbox{S/N}\sim0.318\pm0.036$ bin of $\mathcal{C_D}$, Figure 7a), there are two injected sources at match_ids 6665 ($\mbox{S/N}\sim0.070$) and 6666 ($\mbox{S/N}\sim0.336$). Only match_id 6666 is detected by ProFound ($\mbox{S/N}\sim7.468$) and PyBDSF ($\mbox{S/N}\sim12.446$). Examples (c) and (d) show a $\mathcal{D}$-image and $\mathcal{S}$-image cutouts, respectively, of a compact object with a diffuse tail. Only Caesar is able to resolve the $\mathcal{D}$-image (into two components; top), but not the $\mathcal{S}$-image (bottom) as the diffuse emission is washed out. Examples (e) and (f) are the corresponding a $\mathcal{D}$-residual and $\mathcal{S}$-residual image cutouts in the previous example, respectively, for Caesar. All SFs make $\mathcal{D}$ and $\mathcal{S}$ detections at match_id 742. Caesar separately detects the bright peak at match_id 743 in the $\mathcal{D}$-image, but with no corresponding $\mathcal{S}$ match. This contributes to a reduction in the inferred $\mathcal{C_{DS}}$ for Caesar at $\mbox{S/N}\sim3.4$ in Figure 8a.

Figure 16

Figure 10. Deblending Issues Infographic: CMP PyBDSF $\mathcal{D}$-residual-image (a) with Selavy $\mathcal{S}$-residual-image (b) (from clump_id 875), CMP $\mathcal{D}$-image (c) with ProFound $\mathcal{D}$-residual-image (d) (from clump_id 4672), and real (EMU) $\mathcal{D}$-image (e) with Aegean $\mathcal{D}$-residual-image (f) (from clump_id 875) cutout examples of deblending issues. In example (a), only PyBDSF makes a detection at match_id 1287. In example (b), only Selavy makes a detection at match_id 1287. In example (c–d), ProFound provides a single flux-weighted component that blends these two adjacent sources, and which is best matched to match_id 6669. This leads to the result that match_id 6668 is deemed undetected in the $\mathcal{C_D}$ statistics (Figure 4a). In example (e–f), Aegean overestimates the extent of the vary faint (diffuse) source at match_id 1698 (bottom image, middle): i.e., $(a,b,\theta)\sim(130^{\prime\prime},\,57.6^{\prime\prime},\,45.8^\circ)$, with major, minor, and position-angle components, respectively. There is no corresponding $\mathcal{S}$ match. This leads to a degradation in the $\mathcal{C_{DS}}$ S/N $\sim$ 25 bin of Figure 5a.

Figure 17

Figure 11. Caesar clump_id 4627 $\mathcal{D}$$0.916^\prime\times0.916^\prime$ residual-image cutout for simulated-compact sources. Caesar underestimates the flux density for match_id 6668 ($\sim$$0.664\,\mu$Jy, compared to $\sim$$291\,\mu$Jy for the injected source), resulting in an artifact in the calculated reliability, placing this source at an artificially low S/N. This information was extracted from the $\mathcal{R_D}$$\mbox{S/N}\sim0.0331\pm0.0036$ bin (Figure 4b).

Figure 18

Figure 12. PyBDSF clump_id 1472 $\mathcal{D}$ (top) and $\mathcal{S}$ (bottom) $1.69^\prime\times1.69^\prime$ residual-image cutouts for real (EMU) sources. PyBDSF fits the core of the source shown in Figure 10a, while leaving out the diffuse emission (i.e., it over-subtracts), in both $\mathcal{D}$ and $\mathcal{S}$ images. This contributes to the $\mathcal{C_{DS}}$ S/N $\sim$ 152 bin (Figure 4a).

Figure 19

Figure 13. Noise Spike Detection Infographic: CMP $\mathcal{D}$-image (a), EMU PyBDSF $\mathcal{D}$-residual-image (b), and EMU $\mathcal{D}$-image (/w ProFound residual-image inset) (c) cutout examples of noise spike detection. In example (a) (from the $\mbox{S/N}\sim12.4\pm1.4$ bin of $\mathcal{R_D}$, Figure 4b), the detections within this clump are anomalous, as there is no injected source. In example (b), Aegean and PyBDSF detect a $\mathcal{D}$ source (match_id 1549) here, although visually this object is consistent with a noise spike. With no corresponding $\mathcal{S}$ detection, such results contribute to the inferred $\mathcal{C_D}$ at low S/N (Figure 5a). Example (c): At first glance, match_id 9535 appears to be part of a faint ring structure, something for which ProFound is uniquely suited. The size of the emission is on the order of common EMU’s beam size ($18^{\prime\prime}$), making it consistent with either a noise spike or a faint compact source.

Figure 20

Figure 14. $3^\prime\times3^\prime$ image cutout of clump_id 2071 of the EXT $\mathcal{D}$-image. All SFs detected the injected source ($\mbox{S/N}\sim226$) at match_id 4142, except for Caesar. Only Selavy detects the adjacent fainter source, match_id 4141, where $(\mbox{S/N})_{injected}\sim19$. This information was extracted from the $\mathcal{C_D}$$\mbox{S/N}\sim220\pm25$ bin of Figure 4g.

Figure 21

Figure 15. $3^\prime\times3^\prime$ cutout of clump_id 50 of the EXT $\mathcal{D}$-image. The orange ellipses indicate injected sources, summarised in Table 8. The associated SF detections are shown in Figure 16. These sources span $0.28<\mbox{S/N}<80$ in $\mathcal{C_{D}}$ (Figure 4h).

Figure 22

Table 8. Summary of injected EXT-sources in Figure 15. The Detected column indicates if at least one SF has detected an injected source.

Figure 23

Figure 16. $2.51^\prime\times2.51^\prime$ cutout, without (top) and with (bottom) annotations, of clump_id 50 of the EXT $\mathcal{D}$-image. The corresponding injected sources are shown in Figure 15. Aegean (green), Caesar (cyan), ProFound (red), and PyBDSF (gold) make $\mathcal{D}$ and $\mathcal{S}$ detections at match_ids 110 and 114 only, except for Selavy which only finds them in the $\mathcal{S}$-image (not shown).

Figure 24

Figure 17. Oversized Components Infographic: CMP PyBDSF $\mathcal{D}$ ((a) and (c)) and $\mathcal{S}$ (b) residual-image, Caesar $\mathcal{D}$ (d) residual-image, and EXT $\mathcal{D}$-image (e) and Caesar $\mathcal{D}$-residual-image (f) cutout examples of oversized components. Example (a) (from the $\mbox{S/N}\sim12.4\pm1.4$ bin of $\mathcal{R_D}$, Figure 4b) is a spurious detection as there is no injected source. In example (b) (from the $\mbox{S/N}\sim12.4\pm1.4$ bin of $\mathcal{R_D}$, Figure 4b), the detection by PyBDSF at match_id 1225 is spurious, as there is no injected source at this location. In example (c), the large-footprint detection by PyBDSF at match_id 2796 demonstrates one of its most frequent failure modes. Its flux density is 1.84 mJy, as compared to 0.0798 mJy for the injected source. In example (d) (from the $\mbox{S/N}\sim20.7\pm1.4$ bin of $\mathcal{C_{DS}}$, Figure 4a), Caesar’s Gaussian fit at match_id 5806 extends well beyond its island. Its flux density estimate is 2.18 mJy compared to 0.14 mJy for the injected source. (For clump_ids 5804 and 5805 the injected flux densities are 0.07 mJy and 0.12 mJy, respectively.) In examples (e–f) (from the $\mbox{S/N}\sim259\pm39$ bin of $\mathcal{C_S}$, Figure 4a), Caesar overestimates the flux density at match_id 2639, with $18\,$mJy, compared to $0.074\,$mJy for the injected source. Its respective semi-major and semi-minor axes are $920^{\prime\prime}$ and $11^{\prime\prime}$, compared to $20^{\prime\prime}$ and $10^{\prime\prime}$ for the injected source.

Figure 25

Figure 18. $\mathcal{D}$ (top) and $\mathcal{S}$ (bottom) real image (EMU) cutouts of clump_id 2293 (see also Figure 19 and Table 9). The elements of the main clump (referred to as B2293) are labelled (a) through (e) for bright compact emission, and ($\chi$), ($\epsilon$), and ($\lambda$) for diffuse emission. Also shown is match_id 2667, which is identified as part of this clump, although not physically associated. While highly likely to be related to the emission from B2293, the clump labelled C2294 is separated in our analysis. Its compact nature leads the SFs to characterise it with ellipses that do not overlap with any ellipse in B2293, resulting in these being labelled as independent clumps.

Figure 26

Table 9. Cluster table information for clump_ids 2293 (upper partition) and 2294 (lower partition). The cutouts shown in Figure 19 are for the upper partition. The source component footprint parameters, a, b, and $\theta$, correspond to the major axis, minor axis, and position angle, respectively. The S/N is calculated using the noise in the $\mathcal{S}$-image, estimated using Bane. The MADFMs are computed within the source component footprints and normalised with respect to their areas for the residual images. The values for ProFound are its flux-weighted estimates (Paper I).

Figure 27

Figure 19. $4.73^\prime\times4.73^\prime$$\mathcal{D}$ and $\mathcal{S}$ real-image (a–b), and Aegean (c–d), Caesar (e–f), ProFound (g–h), PyBDSF (i–j), and Caesar (k–l) residual-image cutouts for clump_id 2293 of the B2293+C2294 bent-tail system. Only the Caesar and ProFound cutouts are annotated with match_id information (Table 9) for clarity.

Figure 28

Figure 20. Diffuse Emission Case Study Summary: Summary of bent-tail system (B2293+C2294) diffuse emission case study (see Section 3.7, Figures 18 and 19, and Table 9).

Figure 29

Figure 21. Real image $\mathcal{D}$ (top) and $\mathcal{S}$ (bottom) detection confidence charts. The stacked bars indicate agreement between SF detections. From left to right, 5 SFs agree, 4 SFs agree, etc.

Figure 30

Figure 22. Example of a diffuse VLASS source, uniquely detected by ProFound in a comparison study with PyBDSF (Boyce 2020). The source was found in a $3^{\prime}\times3^{\prime}$ region centered at J112328+064341. The cutout was extracted from a QL image tile, available at NRAO (https://science.nrao.edu), and then processed using ProFound in R Studio.

Figure 31

Table 10. Residual image statistics for the full $2^{\circ}\times 2^{\circ}$ EMU pilot sample image and for clump_id 2293 drawn from it (extracted from Tables 3 and 9, respectively). The MADFMs are normalised by cutout area, and are in units of mJy arcmins$^{-2}$ beam$^{-1}$). N is the component count.

Figure 32

Figure 23. Distribution of clumps with the lowest MADFM wrt Aegean, PyBDSF, and Selavy.