Hostname: page-component-6766d58669-l4t7p Total loading time: 0 Render date: 2026-05-21T13:33:48.712Z Has data issue: false hasContentIssue false

Frabjous: Deep learning Fast Radio Burst morphologies

Published online by Cambridge University Press:  22 April 2026

Ajay Kumar*
Affiliation:
National Centre for Radio Astrophysics, Tata Institute of Fundamental Research , India
Ashish Mahabal
Affiliation:
Division of Physics, Mathematics and Astronomy, California Institute of Technology, Pasadena, CA, USA Center for Data Driven Discovery, California Institute of Technology, Pasadena, CA, USA
Shriharsh P. Tendulkar
Affiliation:
CIFAR Azrieli Global Scholars Program, CIFAR, Toronto, Canada
*
Corresponding author: Ajay Kumar; Email: akumar@ncra.tifr.res.in
Rights & Permissions [Opens in a new window]

Abstract

The increasing field of view of radio telescopes and improved data processing capabilities have led to a surge in the detection of Fast Radio Bursts (FRBs). The discovery rate of FRBs is already a few per day and is expected to increase rapidly with new surveys coming online. The growing number of events necessitates prioritised follow-up due to limited multi-wavelength resources, requiring rapid and automated classification. In this study, we introduce Frabjous, a deep learning framework for an automated morphology classifier with an aim towards enabling the prompt follow-up of anomalous and intriguing FRBs, and a comprehensive statistical analysis of FRB morphologies. Deep learning models require a large training set of each FRB archetype; however, publicly available data lack sufficient samples for most FRB types. In this paper, we build a simulation framework for generating realistic examples of FRBs and train a network based on a combination of simulated and real data starting with the CHIME/FRB catalog. Applying our framework to the first CHIME/FRB catalog, we achieve an overall classification accuracy of approximately 55%, well over a random multiclass classification rate of 20% with five balanced classes during training. While this falls short of desirable performance, we critically discuss the limitations of our approach and propose potential avenues for improvement. Future work should explore strategies to augment training datasets and broaden the scope of FRB morphological studies, aiming for more accurate and reliable classification results.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2026. Published by Cambridge University Press on behalf of Astronomical Society of Australia
Figure 0

Figure 1. In each row from left to right example of type I, II, III, IV, V, and VI burst morphology simulated from our framework. We have randomly chosen three bursts from simulations for each type to demonstrate the different types.

Figure 1

Table 1. Parameter ranges for different FRB morphologies. For multiple components, these ranges define each sub-burst. All parameters are sampled from uniform distributions, except for the width, which combines a log-normal distribution and a uniform distribution for widths $\geq 10$ ms.

Figure 2

Figure 2. Examples of type IV (top row) and of type V (bottom row) where there is overlap between successive components and can be easily confused for single component bursts (e.g. type II).

Figure 3

Figure 3. A schematic diagram of a typical binary classifier network. A dynamic spectrum ($256\times256$) is the input for the classifier. The first few layers are convolutional, the next layers are fully connected layers, and the final layer is one that gives the confidence of the input belonging to the classes under consideration. Figure made using NN-SVG (LeNail 2019).

Figure 4

Figure 4. A schematic diagram illustrating the workflow where input dynamic spectra is processed through multiple binary classifiers (with the first class labelled as negative and the second as positive). The outputs from these binary classifiers are then combined to infer the final classification, detailed in Sections 4.4. For clarity, only a subset of binary classifiers is shown, and dashed and solid lines indicate how outputs from individual classifiers are mapped to their respective types.

Figure 5

Figure 5. An instance of training a specific network with over 2 000 epochs for binary classification of type IV vs. type V. Left Panel: Accuracy vs. epoch for the training (blue) and validation set (orange). Right Panel: Loss (binary cross-entropy loss function) as a function of epoch.

Figure 6

Figure 6. We present pairwise binary classification confusion matrices. Each confusion matrix represents the inference on the test data (includes samples for all SNR values) from simulated dataset for an optimised model obtained by hyperparameter tuning for each of the binary classification. Light gray boxes represent the correct classifications and dark gray represent the incorrect classifications. Top number in each box represents the actual number and the bottom denotes the fraction of test samples for that particular type. Accuracy and F1-score for each case are shown on the right of each confusion matrix.

Figure 7

Table 2. Parameters and ranges used for hyperparameter tuning.

Figure 8

Figure 7. False positive rate (FPR) and false negative rate (FNR) as a function of confidence output of the classifier. The intersection of the FPR and FNR curves signifies the value of confidence at which false positives equal false negatives. This specific example is for a type I vs. II classifier.

Figure 9

Figure 8. Example classification matrix for a single burst showing the output confidence from all the optimised binary models. Each element indicates the output score from a binary classifier corresponding to the archetype denoted by the row and column. Each element’s upper value corresponds to the confidence, i.e. the output confidence of the classifier while the lower value represents the optimal thresholds determined for that particular binary classification. Diagonal elements do not have any information. The last column is the sum of confidences of the that row after subtracted from the optimal threshold (i.e. augmented confidence). Here, a type V burst is classified through the multiclass classification matrix. It beats type IV narrowly, and type II is not far behind.

Figure 10

Figure 9. Each violin plot in this figure displays distribution of the augmented confidences sum for one type as described in Figure 8. Each violin plot represents a total of 400 burst samples with a mix of SNR values. The red line indicates the approximate threshold where the augmented confidences sum is greater for the correct type compared to the other types. The number above each violin plot is the number of times the maximum value occurs for that particular type in the last column Figure 8.

Figure 11

Figure 10. (a) This figure shows the Intensity as a function of frequency channels for one particular time sample corresponding to FRB emission seen in dynamic spectra of one of the CHIME/FRB catalog. The masked channels are indicated by orange hashes. The pink hashes indicate the neighbouring region used to fill the masked channels. (b) Dynamic spectra of one of the CHIME/FRB burst (c) Dynamic spectra after using linear interpolation to fill the masked channels (d) Dynamic spectra after interpolating as described in Section 5.1 and illustrated in panel (a).

Figure 12

Figure 11. (a) Waterfall plot of one of the type II CHIME/FRB catalog bursts. (b) Model waterfall plot for the same burst shown in panel (a). (c) This matrix is as described in Figure 8 for panel (a) as the input. The last column indicates the consensus class (type II in this case).

Figure 13

Figure 12. Confusion matrix after classifying the CHIME/FRB first catalog using the multi-class framework described in Sections 4.4. In each square, the upper number corresponds to the bursts classified in that combination of true (row) and predicted (column) archetypes. The lower number is the same as a fraction of the total number of bursts of that type in the CHIME/FRB catalog – 163, 270, 62, 20, 20 samples for types I, II, III, IV, and V, respectively.

Figure 14

Figure 13. Confusion matrix for multi class classification using a single multi-class classifier. The details of the plot are the same as in Figure 12 except that each type has half the number of bursts present in CHIME/FRB catalog.

Figure 15

Table B1. SNR and width measurements for selected FRBs from CHIME/FRB bursts using boxcar and T90 definition.

Figure 16

Figure C1. Left Panel: Confusion matrix with half of CHIME/FRB test catalog for the type II, III, IV, and V after we get optimised model as described in the text. In each element, the value above is the number corresponding to predicted archetype and actual archetype. The values below is the percentage of the particular type. Middle Panel: Similar to the left panel, the confusion matrix for type I, III, IV, V. Right Panel: Similar to left panel, the confusion matrix is for type III, IV, V.

Figure 17

Figure C2. Left Panel: Same as Figure C1 but with all the CHIME/FRB catalog bursts using the binary optimised models for each pair of classification. Middle Panel: Similar to left panel, the confusion matrix for type I, III, IV, and V. Right Panel: Similar to left panel, confusion matrix for type III, IV, and V.

Figure 18

Figure D1. SHAP values for three correctly classified burst examples from the CHIME/FRB catalogue. In each row, the left column shows the preprocessed input waterfall and the next five columns show SHAP value outputs of each classifier, ordered in decreasing order of classification confidence. In each SHAP plot, the pixels in red (blue) contribute positively (negatively) to the prediction of the model. From top to bottom, the input types are type II, I, and III, respectively. SHAP values indicate that feature extraction is good for types II and III. But for type I bursts, pixels areas corresponding to the background seem to contribute substantially to the classification. We generally see that SHAP values are higher (in red) at the end of the observing band on both sides or lower at one end (in blue) for type I when they are correctly classified.

Figure 19

Figure D2. Same plots as Figure D1 but for incorrectly classified examples. From top to bottom, these are: (a) type I is misclassified as type IV. (b) type I misclassified as type IV due to scintillation seen in this case. (c) type II misclassified as type IV due to low S/N of the burst and fixating on noise.