Hostname: page-component-89b8bd64d-rbxfs Total loading time: 0 Render date: 2026-05-07T17:53:22.027Z Has data issue: false hasContentIssue false

Extended radio galaxies in EMU: A comparative look at source-finding techniques

Published online by Cambridge University Press:  30 March 2026

Lachlan J. Barnes*
Affiliation:
Macquarie University , Australia
Andrew M. Hopkins
Affiliation:
Macquarie University , Australia
Yjan Gordon
Affiliation:
University of Wisconsin-Madison, USA
Nikhel Gupta
Affiliation:
Space & Astronomy, CSIRO Astronomy and Space Science, Australia
Gary Segal
Affiliation:
School of Mathematics and Physics, The University of Queensland, Australia
Heinz Andernach
Affiliation:
Depto de Astronomia, DCNE, Universidad de Guanajuato, Cjonde Jalisco, Mexico
Michael J.I. Brown
Affiliation:
School of Physics, Monash University, Australia
Duncan Farrah
Affiliation:
Department of Physics and Astronomy, University of Hawai‘i at Manoa, USA
Stanislav S. Shabala
Affiliation:
School of Natural Sciences, University of Tasmania, Australia
Sarah V. White
Affiliation:
International Centre for Radio Astronomy Research, Curtin University, Australia
O. Ivy Wong
Affiliation:
Space & Astronomy, CSIRO, Australia
*
Corresponding author: Lachlan J. Barnes; Email: lachlan.barnes3@hdr.mq.edu.au
Rights & Permissions [Opens in a new window]

Abstract

Extended radio sources present unique challenges for automated detection and classification in wide-field radio surveys. With current surveys such as the Evolutionary Map of the Universe (EMU), robust and scalable methods are essential to identify and catalogue these complex sources. We apply three automatic approaches to detect complex radio emission in EMU observations of the Galaxy And Mass Assembly (GAMA) 09 field (EMU-G09) in order to evaluate their relative strengths and limitations in preparation for large-scale application across future EMU data releases. These include DRAGNhunter, designed to detect likely DRAGNs (Double Radio sources associated with Active Galactic Nuclei) from a component catalogue; coarse-grained complexity, a metric designed to highlight regions of complex emission; and RG-CAT, a machine learning pipeline trained on radio sources identified in the EMU pilot survey. We find that together, the three methods recover nearly all extended sources in EMU-G09 but identify largely distinct, partially overlapping subsets, with only 375 sources identified by all finders. This demonstrates that a combination of complementary techniques will be required to achieve a complete census of extended radio sources in future large-scale surveys.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2026. Published by Cambridge University Press on behalf of Astronomical Society of Australia
Figure 0

Figure 1. EMU mosaic constructed from the three EMU tiles that overlap the GAMA09 region with the black box indicating the GAMA09 footprint. The EMU observations are centred on a frequency of 944 MHz, a native resolution of $11''\times13''$, and an rms sensitivity of $\sim$$25\,\mu$Jy beam$^{-1}$. These data constitute the EMU-G09 region used throughout this paper.

Figure 1

Figure 2. Distributions of angular separation of identified pairs in EMU-G09 (red bars) in bins of mean misalignment. Black lines mark the local minima, with the shaded region showing the $1\sigma$ uncertainty of their positions.

Figure 2

Figure 3. Mean misalignment and angular separation for candidate lobe pairs in EMU-G09. Black crosses show the local minima of pair separation in bins of mean misalignment, with vertical bars indicating bin size and horizontal bars the $1\sigma$ uncertainty. The black dashed and grey solid lines show the derived upper limits for likely DRAGNs in EMU and VLASS, respectively, reflecting their different observational parameters. The black dotted line represents the minimum pair separation (15$^{\prime\prime}$) used to select DRAGNs in this work.

Figure 3

Figure 4. Distributions of LAS and integrated flux density (S) for likely DRAGNs in the EMU G09 field. Density contours of the distributions of LAS and S for EMU DRAGNs (red) for the VLASS DRAGNs (grey). Contours contain 90%, 50%, and 10% of the data points. The black dotted line represents the minimum pair separation (15$^{\prime\prime}$) used to select DRAGNs in this work. The difference between these distributions likely arises from the observational differences between EMU and VLASS.

Figure 4

Figure 5. Illustrative example of the coarse-grained complexity process. An input image (left) is smoothed by a Gaussian filter to produce the smoothed image (right) which is then compressed using gzip. The byte length is then used as a proxy for apparent complexity of the radio source. The images here are $64\times64$ pixels, corresponding to an angular size of $128''\times128''$.

Figure 5

Figure 6. Source complexity as a function of the number of components fit to each Selavy island, with quartiles marked inside. Note the general trend of increasing complexity with the number of components.

Figure 6

Figure 7. Comparison of distributions of the coarse-grained complexity values for all islands (dark orange dashed line), the complexity-only GMM (orange line), and the complexity$+$S/N GMM (light orange solid bars).

Figure 7

Figure 8. Examples of the ten most (top) and least (bottom) complex sources in the significantly complex dataset. The most complex sources display extended, multi-component, or diffuse morphologies, while the least complex examples are dominated by compact, unresolved sources where apparent complexity likely arises from background noise or faint nearby emission. All cutouts are $64\times64$ pixels, corresponding to an angular size of $128''\times128''$.

Figure 8

Figure 9. Top panel: Distributions of all sources (solid bars) and subsets of different classification types (coloured steps) based on the confidence score. Bottom panel: Number of different morphology types with confidence scores predicted by Gal-DINO above and below 0.5 (light blue and dark blue bars, respectively).

Figure 9

Table 1. Comparison of RG-CAT morphological class distributions between EMU-G09 sources and EMU-PS sources (Gupta et al. 2024b), with fractions given relative to the total number of extended sources identified in each dataset.

Figure 10

Figure 10. EMU field (grey), overlaid with the sky positions of sources detected by each finder: likely DRAGNs identified by DRAGNhunter (red dots), significantly complex regions from CG-Complexity (orange crosses), and extended sources detected by RG-CAT (blue triangles). At this scale, it is evident there is clustering of sources in some regions, and others showing a lack of detections from all finders. There is also very little overlap between each source-finder.

Figure 11

Table 2. Number of sources detected in EMU-G09 field by each extended-source finder, and the corresponding surface densities.

Figure 12

Figure 11. Separation distributions for matches between source-finder catalogues: DH-CG (dark orange), DH-RG (purple), and CG-RG (green). Each distribution is bimodal, where the first peak is likely due to genuine matches and the second is likely dominated by random associations. We adopt a 15$^{\prime\prime}$ radius as the threshold for genuine matches.

Figure 13

Figure 12. Overlap ($\lt15''$) of sources detected by DRAGNhunter (red), CG-Complexity (orange), and RG-CAT (blue). Each region of the diagram represents the number of sources uniquely or jointly identified by the corresponding source-finders. Only 375 sources are common to all three, with this small overlap likely reflecting the differing selection biases of each method.

Figure 14

Figure 13. Example sources detected by each source-finding approach. Each panel contains a random selection of 21 sources identified by each of the respective finders. All cutouts are $64\times64$ pixels, corresponding to an angular size of $128''\times128''$. DH appears to predominantly identify typical double-lobed structure; CG-Complexity identifies compact sources, typical doubles and more diffuse irregular structure; RG-CAT tends to identify typical doubles as well as more complex extended structure.

Figure 15

Figure 14. Distributions of LAS and integrated flux density (S) for sources detected by each source-finding approach. Density contours for the distributions of LAS and S for DRAGNhunter sources are shown in red, high complexity islands in orange, and RG-CAT sources in blue. Contours contain 90%, 50%, and 10% of the data points. While there is substantial overlap in this parameter space, CG-Complexity includes a population of smaller-angular-size sources, whereas RG-CAT tends to detect larger and brighter systems, again likely reflecting selection biases inherent to each method.

Figure 16

Figure 15. A random selection of 16 sources detected by all source-finders. These objects typically exhibit well-defined double-lobed morphologies of moderate angular size. All cutouts are $64\times64$ pixels, corresponding to an angular size of $128''\times128''$.

Figure 17

Figure 16. Comparison of calculated complexity of sources detected by DRAGNhunter (red), CG-Complexity (orange), and RG-CAT (blue). All sources identified by each approach have comparable apparent complexity, with RG-CAT tending to pick up the higher complexity sources. This likely reflects its training on visually classified, often larger and more irregular sources, while DH tends to favour more symmetric, double-lobed morphologies.

Figure 18

Figure 17. Spectroscopic redshift distributions of DH, CG-Complexity and RG-CAT sources cross-matched with GAMA-DR4 data (red, orange, and blue solid lines, respectively). All redshifts here have a quality factor $\gt2$. The DH and RG-CAT sources exhibit similar redshift distributions, while CG-Complexity extends to systematically higher redshifts, consistent with its sensitivity to smaller or more compact emission regions that may correspond to unresolved high-redshift doubles.

Figure 19

Figure 18. Distributions of LLS and radio luminosity for sources identified by DH (red), CG-Complexity (orange), and RG-CAT (blue). Contours enclose 90%, 50%, and 10% of the data points. LLS values are calculated using LAS, which is defined differently for each finder (see Section 4.2)

Figure 20

Figure 19. Stellar mass distributions of DH, CG-Complexity and RG-CAT sources cross-matched with GAMA-DR4 data (red, orange, and blue solid lines, respectively). All three distributions are broadly similar, peaking around $10^{11.4}$ M$_\odot$.

Figure 21

Figure 20. The WISE colour distributions for the hosts of sources identified by DH (red), CG-Complexity (orange), and RG-CAT (blue). The contours contain 90%, 50%, and 10% of the data points. Despite differences in detection strategy, sources identified by all three approaches have similar infrared host populations, with many sources classified in the star-forming/ULIRG region of WISE colour space. Note, only sources with a S/N $\gt 3$ in W1, W2, and W3 are shown.

Figure 22

Table 3. The top section highlights the percentage of sources detected in AllWISE with S/N in W1, W2 and ${W}3\gt3$, and those that are unclassified (either insufficient S/N or no detection in one or more bands), relative to the total number of AllWISE matches (shown under each source-finder, respectively). The bottom section shows the percentages of WISE colour classifications (AGN, Passive, Star-Forming, U-LIRG) for the S/N $\gt3$ subset, relative to the number of sources in the S/N $\gt3$ subset (shown under each source-finder, respectively). For all percentages, we include $1\sigma$ binomial uncertainties.

Figure 23

Figure 21. Radio luminosity versus W3 luminosity for sources identified by each source-finder, colour–coded by WISE-colour classification (passive, star-forming, ULIRG, AGN). The dashed line shows the 1:1 reference, included as a visual guide only and not representing the true radio–IR correlation (better traced by the distribution of star-forming sources). Most sources lie above the line, indicating a radio excess consistent with AGN-dominated emission, even for hosts classified as star-forming. Only sources with S/N $\gt 3$ in W1, W2, and W3 are shown.

Figure 24

Figure 22. Example sources classified as star-forming or ULIRG based on their WISE colours. For each system, the left panel shows the radio emission with the detecting source-finder indicated in the top right, the middle panel shows the WISE W3 ($12\mu$m) image, and the right panel shows the corresponding SDSS composite-colour image using the g, r, i bands. Contours in both the WISE and SDSS images correspond to the radio flux. All cutouts have an angular size of $128''\times128''$. Many of the WISE star-forming classifications appear to be from a resolved spiral galaxy with the radio emission coming from star-forming regions in the spiral arms.

Figure 25

Figure A1. Sources with the lowest component pair separation. Overplotted are ellipses corresponding to the Gaussian components fit by Selavy. For each of these sources, the components are nested, therefore producing extremely small component pair separation. All cutouts are $64\times64$ pixels, corresponding to an angular size of $128''\times128''$.

Figure 26

Figure B1. Signal to noise distributions of all islands (dark orange line) and islands with zero fitted components (orange bars).

Figure 27

Figure B2. Cutouts of a random selection of 16 islands with zero fitted Gaussian components. All cutouts are $64\times64$ pixels, corresponding to an angular size of $128''\times128''$.

Figure 28

Figure C1. Distributions of complexity and flux density (S) for the entire high complexity sample (orange) and isolated point sources within the high complexity sample (dark orange). Density contours of the distributions of complexity and S for the full high complexity sample contain 90%, 50%, and 25% of the data points.

Figure 29

Figure D1. Top panel: Distribution of likelihood ratio values for DH sources crossmatched with AllWISE sources. Bottom panel: Likelihood ratio vs the calculated reliability for each cross-match.