Hostname: page-component-89b8bd64d-ksp62 Total loading time: 0 Render date: 2026-05-07T04:25:39.727Z Has data issue: false hasContentIssue false

Photometric redshift predictions with a neural network for DESI quasars

Published online by Cambridge University Press:  31 July 2025

Jeremy Moss*
Affiliation:
Victoria University of Wellington, Wellington, New Zealand
Stephen Curran
Affiliation:
Victoria University of Wellington, Wellington, New Zealand
Yvette Perrott
Affiliation:
Victoria University of Wellington, Wellington, New Zealand
*
Corresponding author: Jeremy Moss; Email: mossji@staff.vuw.ac.nz
Rights & Permissions [Opens in a new window]

Abstract

Accurate redshift measurements are essential for studying the evolution of quasi-stellar objects (QSOs) and their role in cosmic structure formation. While spectroscopic redshifts provide high precision, they are impractical for the vast number of sources detected in large-scale surveys. Photometric redshifts, derived from broadband fluxes, offer an efficient alternative, particularly when combined with machine learning techniques. In this work, we develop and evaluate a neural network model for predicting the redshifts of QSOs in the Dark Energy Spectroscopic Instrument (DESI) Early Data Release spectroscopic catalogue, using photometry from DESI, the Widefield Infrared Survey Explorer (WISE), and the Galactic Evolution Explorer (GALEX). We compare the performance of the neural network model against a k-Nearest Neighbours approach, these being the most accurate and least resource-intensive of the methods trialled herein, optimising model parameters and assessing accuracy with standard statistical metrics. Our results show that incorporating ultraviolet photometry from GALEX improves photometric redshift estimates, reducing scatter and catastrophic outliers compared to models trained only on near infrared and optical bands. The neural network achieves a correlation coefficient with spectroscopic redshift of $0.9187$ with normalised median absolute deviation of $0.197$, representing a significant improvement over other methods. Our work combines DESI, WISE, and GALEX measurements, providing robust predictions which address the difficulties in predicting photometric redshift of QSOs over a large redshift range.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of Astronomical Society of Australia
Figure 0

Figure 1. The sky distributions of the DESI EDR QSO spectroscopic catalogue (top, from Adame et al. (2023) and SDSS (bottom, from Lyke et al. 2020) samples. The histograms show the number of sources in right ascension and declination.

Figure 1

Figure 2. The distribution of magnitudes for the DESI sample as a whole compared to those with a match in SDSS (see Section 3.2). The legend in each panel shows the mean magnitude and the standard deviation. While DESI fluxes are used directly for model training, the comparison in this figure is made in magnitude space to match the SDSS format.

Figure 2

Figure 3. The distribution of redshifts for the full DESI sample and the SDSS-matched subset. The legend in each panel shows the mean redshift, standard deviation and maximum redshift.

Figure 3

Figure 4. The variation of source-frame wavelength with redshift for g, r, z, W1, W2 and GALEX bands. The coloured horizontal bands show the ‘windows’ provided by the filter ranges at the observed-frame wavelengths and the curves show those rest-frame wavelengths as a function of source redshift. The dashed red line represents the Lyman break ($\lambda = 1.216\times 10^{-7}$ m), the green square hashed region represents the Lyman forest and the shaded cyan region shows the Big Blue Bump. The black dotted line represents the Mg ii emission line ($\lambda = 2.8\times 10^{-7}$ m) and the green dotted line shows the 4000 Å break. Labels identify the bands and lines. For example, for a source at a redshift of $z=5$, the FUV line ($\lambda_{\text{rest}}=1.575\times10^{-7}$ m) has been shifted into the z-band.

Figure 4

Table 1. Filtering criteria for the DxS sample and the number of sources affected by each filter. Names are as follows (Adame et al. 2023): ZERR is the uncertainty in the spectroscopic redshift; ZWARN is a bitmask indicating if there are any known problems with the data or the spectroscopic fit; SPECTYPE is the spectral classification, which could be STAR, GALAXY or QSO.

Figure 5

Figure 5. Architecture of the neural network used in the NN algorithm. Blue boxes show densely-connected hidden layers, each with 200 neurons and activation functions, and with optimiser and learning rate (lr) indicated. The orange box indicates the loss function (MSE) used to measure the accuracy of the model’s training. Arrows indicate the downwards flow of information from one layer to the next. The red box indicates the output layer. Visualisation developed using Bird’s Neural Notation Convention (Bird 2023).

Figure 6

Table 2. Final configuration of the kNN model used in this study. These hyperparameters were selected based on grid search performance across 100 iterations, optimising for the lowest RMS error.

Figure 7

Table 3. Average performance metrics from 100 runs of the kNN and NN models, trained on the DESI dataset (g, r, z, W1, W2), where the training and test sets are randomised for each trial. Numbers in parentheses represent uncertainties in the last reported digits. Changes refer to the increase or decrease from kNN DESI to NN DESI. Positive percentage changes indicate an increase (improvement for Corr and EV, but worsening for NMAD, ME, and MAE), and vice versa.

Figure 8

Figure 6. Example plot showing prediction results after training the NN model on the DESI fluxes (g, r, z, W1, W2). The top panel illustrates the NN predictions, while the bottom panel shows the normalised residuals. In each plot, the solid red line represents the 1:1 relationship, and the dotted red lines indicate the $1\sigma$ deviation from the mean. Inset: distribution of the normalised residuals ($z_\mathrm{phot}-z_\mathrm{spec}$) plotted against redshift, with the red line indicating the line of perfect correlation.

Figure 9

Figure 7. $z_{\mathrm{DESI}}-z_{\mathrm{SDSS}}$ versus the angular separation between the DESI and SDSS coordinates.

Figure 10

Figure 8. Top: the difference in magnitudes versus difference in redshift between DESI and SDSS measurements, with m being g, r or z. Red stars show sources for which $z_{{\text{DESI}}}-z_{\text{SDSS}} \gt 0.14$. Bottom: distribution of the $|\Delta m|$ in the top row.

Figure 11

Table 4. As for Table 3, but using the SDSS magnitudes (u, g, r, i, z) only.

Figure 12

Figure 9. Comparison of neural network photometric redshift predictions for SDSS-only versus SDSS+GALEX fluxes. (a) Example plot showing prediction results after training the NN model on the SDSS magnitudes (u, g, r, i, z). The top panel illustrates the redshift predictions, while the bottom panel shows the normalised residuals. In each plot, the solid red line represents the 1:1 relationship, and the dotted red lines indicate the $1\sigma$ deviation from the mean. Inset: distribution of the normalised residuals ($z_\mathrm{phot}-z_\mathrm{spec}$) plotted against redshift, with the red line indicating the line of perfect correlation. (b) As for Figure (a) but for the DESI and GALEX fluxes g, r, z, FUV, NUV.

Figure 13

Table 5. As for Table 3, but using the DESI fluxes and GALEX NUV and FUV.

Figure 14

Table 6. Performance metrics and relative improvements computed with respect to the NN DESI baseline. Positive percentages indicate an increase (improvement for Corr and EV, but worsening for NMAD, ME, and MAE), and vice versa.

Figure 15

Table 7. Photometric redshift performance on the same DxS sample ($0.1 \lt z \leq 4.8$) using two feature sets: $ugriz+W1W2$ and $grz+W1W2$. Metrics are from the kNN model. The addition of u and i yields modest improvements, while WISE bands appear to play a key role in breaking colour–redshift degeneracies.

Figure 16

Figure 10. The points from each of the bimodal groups in Figure 6 in the $z - W1$ vs. $g - r$ space, incorporating Gaussian ellipses (black ellipses) with centres marked as crosses. The marginal histograms illustrate the distributions of $g - r$ and $z - W1$ within each group, and individual points are coloured by their $g - r$ values, with bluer values on the left and redder values on the right.

Figure 17

Figure 11. Comparison of kNN performance metrics across the DESI-only and DESI+GALEX (DxS) samples over five redshift bins. (a) The average of the performance metrics for 100 runs of the kNN model for the DESI (g, r, z, W1, W2) sample across the five redshift bins. Ideal values for each metric are represented by horizontal dashed lines. (b) As for Figure (a) but for the DxS (g, r, z, W1, W2, NUV, FUV) sample.

Figure 18

Figure 12. Spectroscopic redshifts from SDSS and DESI for the matched sources in the DxS sample. The background colour-coded scatterplot shows the 24 509 QSOs for which $z_{\text{SDSS}} \approx z_{\text{DESI}}$. The green crosses indicate sources classified as outliers which lie outside $1\sigma \sim 0.14$. The legend shows the mean and standard deviation of the residuals for the full sample. The inset displays the distribution of $\Delta z = z_{\mathrm{SDSS}} - z_{\mathrm{DESI}}$ for the outliers only.

Figure 19

Table 8. Missing values by photometric band for each dataset. All datasets use extinction-corrected magnitudes or fluxes where applicable. GALEX fluxes include both direct detections and forced photometry. Only 504 DESI sources were matched to GALEX with reliable UV fluxes.

Figure 20

Figure 13. The sum of the difference in the SDSS and DESI g, r, z magnitudes versus the difference between the predicted and closest spectroscopic redshift for the outliers in Table C1. Filled markers show the DESI $z_\mathrm{spec}$ being closest and unfilled for the SDSS. The dotted lines show the median values along each axis, from which we see a concentration at $|\Delta z| \lesssim 0.2$ and a photometric discrepancy of $\langle|\Delta m|\rangle \lesssim 0.6$ in each of the g, r, z magnitudes.

Figure 21

Figure 14. Photometric redshift predictions from the NN model (red stars) for the outliers identified in Figure 12. As in Figure 12, the background colour-coded scatterplot shows the 24 509 QSOs for which $z_{\text{SDSS}} \approx z_{\text{DESI}}$. The inset shows the distribution of $\Delta z = z_\mathrm{phot} - z_{\mathrm{DESI}}$ for these predictions. The predicted redshifts cluster more tightly around the 1:1 line, with improved performance at lower redshifts, especially $1 \lt z \lt 2 $ compared to higher redshifts.

Figure 22

Figure A1. Root Mean Squared Error (RMS) vs. Number of Nearest Neighbours (k) for the kNN, showing the average RMS error for each value of k. The error decreases sharply for small values of k and stabilises around $k=18$, indicating an optimal choice for this parameter.

Figure 23

Figure A2. Example plot showing prediction results after training the kNN model on the DESI fluxes (g, r, z, W1, W2); cf. Figure 6. The top panel illustrates the kNN predictions, while the bottom panel shows the normalised residuals. In each plot, the solid red line represents the 1:1 relationship, and the dotted red lines indicate the $1\sigma$ deviation from the mean. Inset: distribution of the normalised residuals ($z_\mathrm{phot}-z_\mathrm{spec}$), with the mean and standard deviation indicated.

Figure 24

Figure A3. As for Figure A2, but using the SDSS ugriz magnitudes as training features. Cf. Figure 9a.

Figure 25

Figure A4. As for Figure A2, but using the DESI and GALEX fluxes together as training features; cf. Figure 9b.

Figure 26

Figure A5. Feature importances from (a) kNN and (b) neural network models. Each panel shows the mean increase in MSE when omitting each flux in the DESI/GALEX sample. (a) Feature importances for the kNN model on the DESI/GALEX sample. (b) As for Figure (a) but for the neural network model.

Figure 27

Table A1. Impact of dropping individual bands on photometric redshift performance using kNN and NN models trained on the DxS dataset. Metrics shown are $\sigma_\mathrm{NMAD}$ (normalised median absolute deviation) and MAE. Lower values indicate better performance.

Figure 28

Table B1. Average performance metrics for 100 runs of both the kNN and NN models across varying redshifts, for the DESI (g, r, z, W1, W2) sample. Abbreviations are as for Table 3.

Figure 29

Table B2. Average performance metrics for 100 runs of both the kNN and NN models across varying redshifts, for the DxS (g, r, z, W1, W2, NUV, FUV) sample. Abbreviations are as for Table 3.

Figure 30

Table C1. Redshift outlier predictions for DESI quasars. A machine-readable version of this table is provided as Supplementary Material.

Figure 31

Figure C1. The redshift predictions of a subset of the 107 outliers for which $z_{\mathrm{DESI}}-z_{\mathrm{SDSS}} \gt 0.14$, with 100 runs of the NN trained on the DESI/GALEX (g, r, z, W1, W2, NUV, FUV). The filled black markers show the DESI spectroscopic redshift, the unfilled markers the SDSS spectroscopic redshift and the stars the predicted redshift. The label gives the SDSS name of the source. The data is shown in Table C1.