Hostname: page-component-5db58dd55d-m58mf Total loading time: 0 Render date: 2026-06-24T09:18:09.439Z Has data issue: false hasContentIssue false

Overlap Zoo Beta: A Catalogue of $\sim$800 Occulting Pairs in the DESI Legacy Survey using Citizen Science

Published online by Cambridge University Press:  09 March 2026

Trevor Alexander Butrum*
Affiliation:
Physics and Astronomy, University of Louisville, USA
Benne Holwerda
Affiliation:
Physics and Astronomy, University of Louisville, USA
William Keel
Affiliation:
Department of Physics and Astronomy, University of Alabama, USA
Clayton Robertson
Affiliation:
Physics and Astronomy, University of Louisville, USA
Ian Castellano
Affiliation:
Department of Physics, Geology, and Engineering Technology, Northern Kentucky University, USA
Shikha Pandey
Affiliation:
Physics and Astronomy, University of Louisville, USA
S.M. Rafee Adnan
Affiliation:
Physics and Astronomy, University of Louisville, USA
Logan Bills
Affiliation:
Physics and Astronomy, University of Louisville, USA
Divya Patel
Affiliation:
Physics and Astronomy, University of Louisville, USA
Kyle Cook
Affiliation:
Physics and Astronomy, University of Louisville, USA
Tim Hardin
Affiliation:
Physics and Astronomy, University of Louisville, USA
Alexia Palao
Affiliation:
Physics and Astronomy, University of Louisville, USA
Bryn Connelly
Affiliation:
Physics and Astronomy, University of Louisville, USA
Michael Morton
Affiliation:
Physics and Astronomy, University of Louisville, USA
*
Corresponding author: Trevor Alexander Butrum, Email: trevor.butrum@louisville.edu
Rights & Permissions [Opens in a new window]

Abstract

Overlapping galaxies, in which a foreground galaxy partially overlaps a background galaxy, offer a unique opportunity to measure dust attenuation, a key nuisance parameter in galaxy studies, empirically and in great detail by modelling the light of both the foreground and background galaxy and inferring the missing light in the overlapping region. However, the current catalogue of overlapping pairs is relatively limited in number compared to catalogues dedicated to individual galaxies. Expanding this catalogue is not only a necessity to facilitate further detailed dust studies beyond the few limited studies conducted thus far but also to improve pair-to-pair variance and support automated identification through machine learning techniques. To achieve this, we utilise galaxies classified as ‘overlapping’ from Galaxy Zoo DECaLS (GZD-1, -2, and -5), along with images from Data Release 10 (DR10) of the DESI Legacy Survey, in our individual citizen science project to classify these pairs directly using volunteers. This new catalogue will not only provide a wealth of targets for future dust studies but will also contribute to a deeper understanding of these pairs and dust as a whole.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2026. Published by Cambridge University Press on behalf of Astronomical Society of Australia
Figure 0

Figure 1. The distribution of galaxies classified as ‘overlapping’ in GZD-1, -2, and -5. We plot the GZD-1 and GZD-2 data as black dots, the GZD-5 data as red dots, and our selections of the combined data as blue dots. Our selection criteria include only galaxies classified as overlapping with votes $\gt$13 and voting fractions $\gt$0.28.

Figure 1

Figure 2. Flowchart of the workflow presented to our internal classifiers. We labelled each question with its corresponding task number. Tasks are also colour-coded based on their tier level. Each tier, coloured grey, green, blue, and purple, represents zero, one, two, three, or four branching points in the decision tree, respectively.

Figure 2

Figure 3. The distribution of user consistency $\kappa$ based on the methods outlined in Willett et al. (2013, 2017). Bottom: the cumulative distribution of consistency for all users. Top: the average number of galaxy pairs, $\langle{n}\rangle$, classified by users as a function of their consistency.

Figure 3

Table 1. Overlap Zoo Beta: User classifications.

Figure 4

Table 2. Overlap Zoo Beta: User annotations.

Figure 5

Table 3. Overlap Zoo Beta: Morphological classifications.

Figure 6

Figure 4a. Sample pairs based on the pair types described in Keel et al. (2013). PNG images were obtained from the Legacy Survey Viewer using the legacystamps Python module (Sweijen 2022) and independently verified for use in this figure. We identify pairs by their common name (e.g. NGC 450) or truncated coordinates (e.g. 1226+00).

Figure 7

Figure 4b. The remaining sample pairs following Keel et al. (2013) mnemonics.

Figure 8

Table 4. A summary of OZb classifications based on the pair types from Keel et al. (2013).

Figure 9

Table 5. A summary of the OZb classifications based on the degree of overlap between galaxy pairs.

Figure 10

Figure 5. The fraction of galaxy pairs with foreground identification correctly and incorrectly classified by users as a function of $\Delta z$. We include only pairs with valid pixel-based redshift measurements in the sample.

Figure 11

Figure 6. The fraction of galaxy pairs with foreground identification correctly and incorrectly classified, based on whether the foreground galaxy appeared dimmer ($r_f \gt r_b$) or brighter ($r_f \lt r_b$) in r-band magnitude. We include only pairs with valid pixel-based redshift measurements in the sample.

Figure 12

Figure 7. A comparison of image quality between SDSS and DESI using the galaxy pair J095303.71+064237.6. Left: The SDSS DR7 image used for classification in Keel et al. (2013). Right: The DESI Legacy Survey DR10 image used for classification in this work. Both images match in size and pixel scale for direct comparison.

Figure 13

Table 6. This table compares the agreement between the raw vote fractions for morphological types in GZD and the corresponding OZb types. A pair is considered agreeable if it has a likelihood of at least 0.5 (i.e. a majority) for the corresponding morphological type. Here, N denotes the total number of overlapping pairs with GZD, while % indicates the fraction of those pairs that are in agreement.

Figure 14

Table 7. This table compares the agreement between the redshift debiased vote fractions for morphological types in GZD and the corresponding OZb types. A pair is considered agreeable if it has a likelihood of at least 0.5 (i.e. a majority) for the corresponding morphological type. Here, N denotes the total number of overlapping pairs with GZD, while % indicates the fraction of those pairs that are in agreement.

Figure 15

Table 8. A comparison of pair counts classified in OZb with those in Keel et al. (2013).