Hostname: page-component-77f85d65b8-t6st2 Total loading time: 0 Render date: 2026-04-18T13:29:35.240Z Has data issue: false hasContentIssue false

Performance of the segment anything model in various RFI/events detection in radio astronomy

Published online by Cambridge University Press:  06 January 2025

Yanbin Yang
Affiliation:
Shanghai Astronomical Observatory, Chinese Academy of Sciences, Shanghai, China University of Chinese Academy of Sciences, Beijing, China School of Physical Science and Technology, ShanghaiTech University, Shanghai, China
Feiyu Zhao
Affiliation:
Shanghai Astronomical Observatory, Chinese Academy of Sciences, Shanghai, China University of Chinese Academy of Sciences, Beijing, China
Ruxi Liang
Affiliation:
Shanghai Astronomical Observatory, Chinese Academy of Sciences, Shanghai, China
Quan Guo*
Affiliation:
Shanghai Astronomical Observatory, Chinese Academy of Sciences, Shanghai, China Key Laboratory of Radio Astronomy and Technology, Chinese Academy of Sciences, Beijing, China
Junhua Gu*
Affiliation:
National Astronomical Observatories, Chinese Academy of Sciences, Beijing, China
Yan Huang
Affiliation:
National Astronomical Observatories, Chinese Academy of Sciences, Beijing, China
Yun Yu
Affiliation:
Shanghai Astronomical Observatory, Chinese Academy of Sciences, Shanghai, China
*
Corresponding authors: Quan Guo; Email: guoquan@shao.ac.cn; Junhua Gu; Email: jhgu@nao.cas.cn
Corresponding authors: Quan Guo; Email: guoquan@shao.ac.cn; Junhua Gu; Email: jhgu@nao.cas.cn
Rights & Permissions [Opens in a new window]

Abstract

The emerging era of big data in radio astronomy demands more efficient and higher-quality processing of observational data. While deep learning methods have been applied to tasks such as automatic radio frequency interference (RFI) detection, these methods often face limitations, including dependence on training data and poor generalisation, which are also common issues in other deep learning applications within astronomy. In this study, we investigate the use of the open-source image recognition and segmentation model, Segment Anything Model (SAM), and its optimised version, HQ-SAM, due to their impressive generalisation capabilities. We evaluate these models across various tasks, including RFI detection and solar radio burst (SRB) identification. For RFI detection, HQ-SAM (SAM) shows performance that is comparable to or even superior to the SumThreshold method, especially with large-area broadband RFI data. In the search for SRBs, HQ-SAM demonstrates strong recognition abilities for Type II and Type III bursts. Overall, with its impressive generalisation capability, SAM (HQ-SAM) can be a promising candidate for further optimisation and application in RFI and event detection tasks in radio astronomy.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of Astronomical Society of Australia
Figure 0

Figure 1. An example of transient narrowband RFI (scattered FM radio signals) from the 21CMA is shown below. The horizontal axis represents frequency, while the vertical axis represents time. The image is 200$\times$200 in size.

Figure 1

Figure 2. The raw data of the continuous narrowband RFI from the 21CMA and the detection results of three methods are shown in the following plots. The horizontal axis represents frequency, and the vertical axis represents time. (a) shows the raw data, which includes a large amount of continuous narrowband RFI. (b) shows the flagging results using HQ-SAM. (c) shows the flagging results using SAM. (d) shows the flagging results using the SumThreshold method. The white areas in the images correspond to the detected RFI.

Figure 2

Figure 3. The same as in Fig. 2 but another example of the detection results of the 21CMA data.

Figure 3

Figure 4. Classification of narrowband RFI in frequency. (a) is a common slender signal. (b) is the result of detecting (a) using HQ-SAM. (c) is RFI with fluctuating frequencies, and (d) is the detection result of (c) by HQ-SAM.

Figure 4

Figure 5. Two types of continuous RFI. (a) and (c) are respectively slender RFI and RFI with fluctuating frequencies similar to frequency modulation. Unlike the RFI in Fig. 4, which last only around 50–200 ms, these RFI last for tens of seconds or more (the entire event of RFI is not fully displayed here).

Figure 5

Figure 6. The bursts of transient RFI and continuous RFI.

Figure 6

Figure 7. transitions between straight lines and polylines during transmission for continuous RFI.

Figure 7

Figure 8. An example of broadband RFI which can contaminate frequency range of up to 20MHz.

Figure 8

Figure 9. An example of Type A mock RFI, which contains dtv RFI and narrowband RFI.

Figure 9

Figure 10. An example of Type B mock RFI. Compared to Type A, it includes an additional type of broadband RFI.

Figure 10

Table 1. Accuracy, Recall, Precision, and F1 Score of three methods for the Type A mock RFI recognition task.

Figure 11

Table 2. Accuracy, Recall, Precision, and F1 Score of three methods for the Type B mock RFI recognition task.

Figure 12

Figure 11. Several types of SRB data used in the Detection. There are Type II-IV alone and in pairs. The horizontal axis represents time, while the vertical axis represents frequency.

Figure 13

Figure 12. Examples of successful detection for Type II-IV SRBs. The right side is the raw data, and the left side is the output masks.

Figure 14

Figure 13. Examples of detection failure for Type II-IV SRBs.

Figure 15

Table 3. The ratio of successful detection by HQ-SAM in recognising various SRBs.

Figure 16

Figure B.1. The differences in the masks obtained by each methods for Fig. 2 are as follows: (a) shows the difference between HQ-SAM and the SumThreshold method; (b) shows the difference between SAM and the SumThreshold method; and (c) shows the difference between HQ-SAM and SAM.

Figure 17

Figure B.2. The same as in Fig. B.1, but comparing the results from the three methods for Fig. 3.

Figure 18

Table C.1. Accuracy, Recall, Precision, and F1 Score of SAM and HQ-SAM for the Type A mock RFI recognition task when the three methods have comparable runtime.

Figure 19

Table C.2. Accuracy, Recall, Precision, and F1 Score of SAM and HQ-SAM for the Type B mock RFI recognition task when the three methods have comparable runtime or F1 Score.

Figure 20

Table D.1. Accuracy, Recall, Precision, and F1 Score of SAM and HQ-SAM for the Type A mock RFI recognition task when the images are in pseudocolour greyscale format.

Figure 21

Table D.2. Accuracy, Recall, Precision, and F1 Score of SAM and HQ-SAM for the Type B mock RFI recognition task when the images are in pseudocolour greyscale format.

Figure 22

Figure E.1. More examples of successful detection for separat Type II-IV SRBs are shown here.

Figure 23

Figure E.2. Examples of successful detection for pairs of Type II-IV SRBs.

Figure 24

Figure E.3. More examples of failed detection for different types of SRBs are shown here.