Hostname: page-component-77f85d65b8-2tv5m Total loading time: 0 Render date: 2026-04-14T12:19:18.791Z Has data issue: false hasContentIssue false

Overlap Zoo Beta: A Catalogue of $\sim$800 Occulting Pairs in the DESI Legacy Survey using Citizen Science

Published online by Cambridge University Press:  09 March 2026

Trevor Alexander Butrum*
Affiliation:
Physics and Astronomy, University of Louisville, USA
Benne Holwerda
Affiliation:
Physics and Astronomy, University of Louisville, USA
William Keel
Affiliation:
Department of Physics and Astronomy, University of Alabama, USA
Clayton Robertson
Affiliation:
Physics and Astronomy, University of Louisville, USA
Ian Castellano
Affiliation:
Department of Physics, Geology, and Engineering Technology, Northern Kentucky University, USA
Shikha Pandey
Affiliation:
Physics and Astronomy, University of Louisville, USA
S.M. Rafee Adnan
Affiliation:
Physics and Astronomy, University of Louisville, USA
Logan Bills
Affiliation:
Physics and Astronomy, University of Louisville, USA
Divya Patel
Affiliation:
Physics and Astronomy, University of Louisville, USA
Kyle Cook
Affiliation:
Physics and Astronomy, University of Louisville, USA
Tim Hardin
Affiliation:
Physics and Astronomy, University of Louisville, USA
Alexia Palao
Affiliation:
Physics and Astronomy, University of Louisville, USA
Bryn Connelly
Affiliation:
Physics and Astronomy, University of Louisville, USA
Michael Morton
Affiliation:
Physics and Astronomy, University of Louisville, USA
*
Corresponding author: Trevor Alexander Butrum, Email: trevor.butrum@louisville.edu
Rights & Permissions [Opens in a new window]

Abstract

Overlapping galaxies, in which a foreground galaxy partially overlaps a background galaxy, offer a unique opportunity to measure dust attenuation, a key nuisance parameter in galaxy studies, empirically and in great detail by modelling the light of both the foreground and background galaxy and inferring the missing light in the overlapping region. However, the current catalogue of overlapping pairs is relatively limited in number compared to catalogues dedicated to individual galaxies. Expanding this catalogue is not only a necessity to facilitate further detailed dust studies beyond the few limited studies conducted thus far but also to improve pair-to-pair variance and support automated identification through machine learning techniques. To achieve this, we utilise galaxies classified as ‘overlapping’ from Galaxy Zoo DECaLS (GZD-1, -2, and -5), along with images from Data Release 10 (DR10) of the DESI Legacy Survey, in our individual citizen science project to classify these pairs directly using volunteers. This new catalogue will not only provide a wealth of targets for future dust studies but will also contribute to a deeper understanding of these pairs and dust as a whole.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2026. Published by Cambridge University Press on behalf of Astronomical Society of Australia

1. Introduction

Only a small fraction of the mass of the interstellar medium (ISM) in a galaxy consists of interstellar dust (Casasola et al. Reference Casasola2019; La Mura et al. Reference La Mura2025). Yet, that minute fraction plays a significant role in our astronomical observations. Dust attenuation, which encompasses both the absorption and scattering of starlight as well as the geometry of the sources and the ISM itself, hinders accurate measurements of the intrinsic properties of galaxies by influencing and reshaping the spectral energy distribution (SED) (Salim & Narayanan Reference Salim and Narayanan2020). As a result, standard and fundamental galaxy evolution parameters, such as star-formation rates (SFRs) and stellar mass ( $M_\star$ ), are subject to systematic uncertainties.

The simplest way to account for this wavelength-dependent attenuation of starlight by dust is to introduce a single attenuation curve that captures the complex blending of dust properties and source geometry (Buat et al. Reference Buat2018). However, developing a single, universal attenuation law remains an open challenge in the modern era. The differences in attenuation relations used between sight-lines in empirical studies (e.g. Calzetti, Kinney, & Storchi-Bergmann Reference Calzetti, Kinney and Storchi-Bergmann1994; Calzetti et al. Reference Calzetti2000; Battisti, Calzetti, & Chary Reference Battisti, Calzetti and Chary2016, Reference Battisti, Calzetti and Chary2017; Johnson et al. Reference Johnson2007; Wild et al. Reference Wild2011) IRX- $\beta$ studies (e.g. Meurer, Heckman, & Calzetti Reference Meurer, Heckman and Calzetti1999; Seibert et al. Reference Seibert2005; Casey et al. Reference Casey2014), and model-based studies (e.g. Meurer, Heckman, & Calzetti Reference Meurer, Heckman and Calzetti1999; Seibert et al. Reference Seibert2005; Casey et al. Reference Casey2014), and model-based studies (e.g. Burgarella, Buat, & Iglesias- Páramo 2005; Conroy, Schiminovich, & Blanton Reference Conroy, Schiminovich and Blanton2010; Leja et al. Reference Leja, Johnson, Conroy, van Dokkum and Byler2017; Salim, Boquien, & Lee Reference Salim, Boquien and Lee2018) all demonstrate that a universal law does not currently exist. Some explanations of this diversity may involve star-dust geometric effects, metallicity, or sampling biases introduced by emission lines (e.g. Calzetti et al. Reference Calzetti, Kinney and Storchi-Bergmann1994, Reference Calzetti2000; Charlot & Fall Reference Charlot and Fall2000; Gordon et al. Reference Gordon, Clayton, Misselt, Landolt and Wolff2003; Wild et al. Reference Wild2011; Reddy et al. Reference Reddy2015; Buat et al. Reference Buat2018). However, we ultimately need a deeper understanding of interstellar dust and its influence on our observational data to be able to explain the cause of these differences confidently.

Occulting pairs of galaxies, in which a foreground galaxy partially overlaps a background galaxy, are becoming increasingly valuable for studying dust attenuation and other dust properties in galaxies. These occulting systems present a unique opportunity to derive attenuation measurements by modelling the light from both galaxies and inferring the missing light in the overlapping regions, leaving only the light attenuated by dust remaining. This approach, first introduced by White & Keel (Reference White and Keel1992), demonstrated practical use in many areas of photometry (Andredakis & van der Kruit Reference Andredakis and van der Kruit1992; Berlind et al. Reference Berlind, Quillen, Pogge and Sellgren1997; Holwerda Reference Holwerda2009; Holwerda et al. Reference Holwerda2013; Keel et al. Reference Keel2023; Robertson et al. Reference Robertson2024b). However, despite the success of this technique, we are still limited to a relatively small number of occulting pairs to which we can apply it and derive attenuation and other measurements.

Researchers have made several efforts to identify and catalogue occulting pairs. Currently, the master catalogue of occulting pairs consists of $\sim$ $2\,400$ pairs suitable for dust studies. These pairs include 15 identified in Hubble Space Telescope images (HST; Keel & White Reference Keel and White2001a,b; Elmegreen, Elmegreen, & Eberwein Reference Elmegreen, Elmegreen and Eberwein2001; Holwerda Reference Holwerda2009; Holwerda et al. Reference Holwerda2013), and 2 079 identified in the Sloan Digital Sky Survey (SDSS) spectroscopic catalogue, with 86 pairs identified by Holwerda (Reference Holwerda2007) and 1 993 identified by Keel et al. (Reference Keel2013) through forum science using a Galaxy Zoo project (Lintott et al. Reference Lintott2008). More recently, 280 were identified in the Galaxy and Mass Assembly (GAMA) catalogue by Holwerda et al. (Reference Holwerda2015).

Expanding this master catalogue will undoubtedly provide numerous benefits. As the sample size of this catalogue increases: (1) more ‘ideal pairs’–a foreground face-on spiral partially overlapping a smooth background galaxy–will be available for follow-up studies, (2) the variance of the data will increase, providing a better ability to capture the true variability of pairs, thus improving the statistical robustness of measurements in the catalogue, and (3) it will pave the way for improving the accuracy of machine learning predictions.

The catalogue produced by Keel et al. (Reference Keel2013) represented a significant advance in this effort, significantly increasing the number of known occulting pairs nearly 20 times. However, it relied solely only on a single independent expert’s classifications, where volunteers helped identify potential occulting candidates, rather than providing direct classifications. Given the increasing success of citizen science projects in large-scale astronomical classifications, especially in identifying low-redshift galaxies, directly incorporating volunteer classifications into the workflow represents the next promising strategy for improving and facilitating occulting pair classification. Therefore, in this study, we introduce a method that attempts to seamlessly integrate a citizen science approach into the classification process of a Galaxy Zoo-like project, allowing for a more organised and scalable approach to classifying occulting pairs. Our foremost goal is to expand the sample size of our current catalogue of occulting pairs and to evaluate the effectiveness of our (relatively small-scale) citizen science approach in making these classifications in preparation for future higher resolution surveys such as Euclid (Euclid Collaboration et al. Reference Collaboration2025a), SPHEREx (Crill et al. Reference Crill2020), and LSST (Ivezić et al. Reference Ivezić2019).

We organise this paper as follows. Section 2 describes the project as a whole, including our selection and image generation process. Section 3 details our data reduction steps taken before assembling the catalogue. Section 4 provides an overview of the main catalogue and its results. Section 5 evaluates the effectiveness of both the project and the resulting catalogue and compares our catalogue to previous catalogues. Section 7 discusses the limitations found after concluding the project and outlines directions for future work. Finally, Section 8 concludes the paper by summarizing our main findings.

2. Project description

Our citizen science project includes a sample of galaxies identified as ‘overlapping’ by human volunteers in Galaxy Zoo DECaLS (GZD) Data Releases 1, 2, and 5 (GZD-1, GZD-2, and GZD-5; Walmsley et al. Reference Walmsley2022) and images from Data Release 10 (DR10) of the DESI Legacy Survey (Dey et al. Reference Dey2019). DR10 was chosen as the source of images presented to volunteers because it offers improved depth and uniformity relative to the earlier GZD data releases, providing better data for visual classification. In the following, we describe our selection process and the image generation process for the DR10 images presented to the classifiers. We also briefly detail the workflow and web-based interface presented to each volunteer during their classification process.

2.1. Sample selection

We selected candidate occulting pairs by applying targeted data cuts to galaxies voted by human volunteers as ‘overlapping’ in the GZD datasets. The goal of our sample selection was to include only valid overlapping pairs while minimizing false overlaps, such as star-galaxy and interacting pairs. Since GZD provides only votes and voting fractions, we restricted our selection to these two variables. However, these restrictions presented a challenge, as many star-galaxy overlaps garnered high votes and voting fractions from volunteers in GZD-1, GZD-2, and GZD-5, with many classifications of actual overlaps consisting of low votes and voting fractions. Therefore, as a conservative measure, we required that all pairs in our selection have ‘overlapping’ votes $\gt$ 13 and a voting fraction $\gt$ 0.28 (see Figure 1). We note that it is well known that this selection process can be flawed, and our sample may contain many false classifications, such as star or merging or interacting overlaps (Darg et al. Reference Darg2010; Lintott et al. Reference Lintott2011). Applying these restrictions and manually removing 54 images due to noticeable visual artifacts resulted in 2 380 occulting pairs ( $1.56\%$ of the original GZD sample) that met this criterion, which we regard as our ‘original’ sample.

Figure 1. The distribution of galaxies classified as ‘overlapping’ in GZD-1, -2, and -5. We plot the GZD-1 and GZD-2 data as black dots, the GZD-5 data as red dots, and our selections of the combined data as blue dots. Our selection criteria include only galaxies classified as overlapping with votes $\gt$ 13 and voting fractions $\gt$ 0.28.

2.2. Generating images

For classification, we generated images of the candidate occulting pairs from legacystamps (Sweijen Reference Sweijen2022), a Python module that provides JPEG cutouts of DR10 from the DESI Legacy Survey. We downloaded each grz image at a fixed native resolution of $0.262$ arcsec per pixel. The images were then selectively rescaled to fit each occulting pair into the field of view (FOV) and subsequently resized to $500 \times 500$ pixels using Lanczos interpolation (Lanczos Reference Lanczos1938) for use in Zooniverse. The names of the image files included identifiers for the astronomical object (iauname) for use in cross-referencing, along with the image scaling factor (ISF), as our workflow involves pixel measurements (see T06-09; Figure 2).

Figure 2. Flowchart of the workflow presented to our internal classifiers. We labelled each question with its corresponding task number. Tasks are also colour-coded based on their tier level. Each tier, coloured grey, green, blue, and purple, represents zero, one, two, three, or four branching points in the decision tree, respectively.

2.3. Classification workflow

Since occulting pairs are distinctly different from standard non-occulting morphological classifications, it was essential to develop a new workflow, similar to traditional Galaxy Zoo projects, but tailored to classify occulting pairs and accommodate volunteer accessibility. To implement this workflow, we utilised the Zooniverse Project Builder software, which enables any user to easily create a simple citizen science project through a user-friendly browser interface. This platform allows volunteers to classify directly through their web-based interface, simplifying and streamlining the classification process. We note that due to the small-scale nature of this project ( $\sim$ 2 400 pairs), internally selected student and public volunteers completed the classifications through the website.

We designed the questions in our customised workflow (see Figure 2) to be answered by a classifier with little astrophysical background while fitting the pair categories described by Keel et al. (Reference Keel2013). Given the citizen scientist nature and subject of this project, it was almost essential to draw inspiration from previous Galaxy Zoo projects and Keel et al. (Reference Keel2013)’s work to create the most straightforward classification questions that yielded classifiers using the fewest possible questions. However, the actual effectiveness of our workflow remains uncertain. We intend to examine the accuracy of our approach in more detail in Section 5.

As an additional aid to classifiers, we also provided a brief morphology tutorial to all users, which they were required to complete at the beginning of the workflow. Further assistance was also available on a per-task basis to support classifiers in answering these questions.

Our workflow for this project follows a hierarchical design similar to that of previous Galaxy Zoo projects. Every classification begins with the step of identifying whether the pair was a ‘spiral-spiral,’ a ‘spiral-elliptical,’ a ‘elliptical-elliptical,’ or a ‘other’ (artifact, star, multiple, or irregular) galaxy overlap. We excluded ’multiple’ pairs due to their classification difficulty and ’irregular’ pairs because of their lack of symmetry, which makes them difficult to use in dust studies. Likewise, we excluded ‘elliptical-elliptical’ overlaps from additional tasks, as it is challenging for individuals to decide on which elliptical is in the foreground or background from images lacking proper redshift measurements. Subsequent questions in the workflow depend on the previous answer(s) given by each classifier, with each classifier selecting a single answer before continuing to the next question. Classifiers also had the option to return to previous questions if needed. We note that we categorise the annotations created by tasks T06-09 as an answer.

After completing the workflow and classifying an image, we presented the user with a new image to classify. Each new image presented to classifiers was entirely random. Users continued to receive images until each image reached a retirement count of 4 classifications for each occulting pair. Although we initially set a higher limit, we decided to lower it due to the small-scale and beta nature of our citizen science project. For a larger, public project, we would opt for a higher number to ensure greater reliability of classifications. Later, we will also explore how this low classification count impacts the result. In total, the internal project ended with $\sim$ 9 500 classifications from 15 registered and 47 unregistered participants for the 2 380 images selected to be classified. We note that the unregistered participants only contributed about 4% of the total classifications, while the registered participants classified the remaining 96%.

3. Data reduction

3.1. Multiple classifications

In a small subset of cases, users submitted multiple classifications for a single object, either due to a technical issue with the website or deliberate reclassification. Regardless of the cause, to treat each vote as an independent measurement, these repeat classifications were excluded from the final dataset, leaving only the first classification for each object from each user. These repeat classifications occurred for fewer than 1% of pairs and did not have a significant impact on morphological classifications.

3.2. Tied classifications

Given the low retirement count of 4 classifications per occulting pair, the inherent difficulty of classifying these pairs, and potential issues with our workflow (see Section 7), approximately 23% of our classifications resulted in a tie (up to 56% when excluding ‘Other’ classifications). To address this issue and ensure a valid morphological classification in the catalogue, we implemented a voting scheme based on majority rule and user consistency throughout the project.

Figure 3. The distribution of user consistency $\kappa$ based on the methods outlined in Willett et al. (Reference Willett2013, Reference Willett2017). Bottom: the cumulative distribution of consistency for all users. Top: the average number of galaxy pairs, $\langle{n}\rangle$ , classified by users as a function of their consistency.

3.2.1. User consistency

We calculated the consistency of each user using methods established in previous Galaxy Zoo projects (Willett et al. Reference Willett2013, Reference Willett2017). First, we aggregated individual user votes to calculate a vote fraction for each possible response $(f_r)$ to each task in the decision tree, excluding annotation tasks. We then compared each user’s votes with these vote fractions to compute their consistency score, $\kappa$ :

\begin{align}\kappa = \frac{1}{N_r}\sum_{i=1}^{N_r}{\kappa_i}\end{align}

where $N_r$ is the total number of potential responses to a task and

\begin{align}\kappa_i =\begin{cases}f_r, & \text{if vote corresponds to this response,} \\(1 - f_r), & \text{if vote does not correspond.}\end{cases}\end{align}

Votes that agreed with the majority classification were assigned a high consistency value, while those that disagreed received a lower value. We assigned each user a mean consistency, $\tilde{\kappa}$ , by calculating the mean of the consistency values for each task. Figure 3 shows the distribution of these values, together with the average number of galaxy pairs classified by each user.

3.2.2. Tie breaking

To resolve a tied classification for a task, we selected the classification provided by the user with the highest mean consistency as the definitive classification label. We also ensured that this user’s classifications aligned with the majority before selecting their classification as the tiebreaker. This approach ensured that each occulting pair received a single, majority-based label while also leveraging user reliability to resolve uncertain pairs. Out of the 2 380 candidate pairs, 551 required tie correction to one or more tasks. Pairs with task classifications resolved through this method were included in the catalogue and are flagged in the morphological catalogue (Section 4.3) as ‘t’ to indicate that these labels do carry some additional uncertainty.

4. The catalogue

The Overlap Zoo Beta (hereafter OZb) catalogue includes three distinct sub-samples: vote counts and fractions for each task in the decision tree for each galaxy pair, pixel-based annotations for each galaxy pair, and morphological classifications based on the pair categories outlined in Keel et al. (Reference Keel2013), along with redshifts and colour magnitudes. Researchers can access the data for the three sub-samples from (insert link here). Included in this paper is a subset of each catalogue (Tables 13).

Table 1. Overlap Zoo Beta: User classifications.

Table 2. Overlap Zoo Beta: User annotations.

Table 3. Overlap Zoo Beta: Morphological classifications.

4.1. Classification dataset

Table 1 contains the classifications for the 2 380 candidate occulting galaxy pairs presented to volunteers via Zooniverse. We identify each galaxy pair by its unique J2000.00 name (iauname). The total number of classifications and the total number of votes for each response are denoted by $N_\mathrm{class}$ and $N_\mathrm{votes}$ , respectively, following the terminology outlined in Willett et al. (Reference Willett2013). For each galaxy pair, we provide two additional parameters: the raw number of votes ( $^\star\_count$ ) and the fraction of votes for said response ( $^\star\_fraction$ ). Morphological classifications for this OZb subset are excluded from this table to maintain a consistent structure with the Keel et al. (Reference Keel2013) catalogue and to remove pairs that are not genuine occulting candidates from the final morphological sample.

4.2. Annotation dataset

Table 2 lists the 1 428 pixel-based annotations (including duplicates), derived from annotation tasks T06–T09 in Figure 2. Each galaxy in this subset is, again, identified by its unique J2000.0 name. The corresponding ISF value is also included, as it is necessary for accurate use of the pixel measurements (see Section 2.2). Point tool annotation values for the foreground and background galaxy (Task T06-T07) are denoted by X ( $^\star\_X$ ) and Y ( $^\star\_Y$ ), with the associated right ascension (RA) and declination (DEC) for each measurement also included. Ellipse tool annotation values for the foreground and background galaxy (Task T08-T09) are denoted by X ( $^\star\_X$ ), Y ( $^\star\_Y$ ), rX ( $^\star\_rX$ ), rY ( $^\star\_rY$ ), and $\theta$ ( $^\star\_\theta$ ). In both cases, X and Y represent the centre coordinates of the tool, while rX, rY, and $\theta$ represent the horizontal radius, vertical radius, and orientation angle in radians of the ellipse, respectively.

By including these annotations in the OZb catalogue, we hope to expand the application of various analysis techniques across a larger sample of candidate occulting pairs. The annotations in tasks T06-T09 are not only necessary for querying (see Section 4.3.3) but also for applying various analysis techniques in and based on White & Keel’s (Reference White and Keel1992) work. This includes differential photometry techniques such as rotational subtraction and isophotal fitting and subtraction. Both approaches enable accurate estimation of key nuisance parameters, including dust extinction and attenuation. The latter differential approach was recently applied with great success to the occulting pair VV191a/b (Keel et al. Reference Keel2023; Robertson et al. Reference Robertson2024a). Our goal is to extend these techniques to more pairs in order to increase the statistical variance and robustness of the results of the current pairs, such as the remarked pair VV191a/b.

4.3. Morphological dataset

Table 3 contains the morphological classifications for 765 occulting galaxy pairs, out of the initial 2 380, that could be categorised based on the pair categories outlined in Keel et al. (Reference Keel2013). The remaining 1 615 pairs that could not be categorised were those classified as ‘Other’ in Task T00 (Figure 2), and are excluded in this subset of the catalogue. We identify each occulting pair by its unique J2000.0 coordinates, SDSS ID, and DESI ID. Using our new querying methodology (see Section 4.3.3), we obtain magnitude and spectroscopic redshift measurements from SDSS (Almeida et al. Reference Almeida2023) and DESI (Abdul-Karim et al. Reference Abdul-Karim2025), denoting the central galaxy’s values as r1 and z1, and the off-centre galaxy values as r2 and z2. While DESI does not directly provide magnitude measurements, we calculated the r-band magnitudes from the provided fluxes in nanomaggies using the standard zero-point conversion. However, we warrant caution when interpreting these magnitudes, as blending from overlapping galaxies may impact their accuracy (Keel et al. Reference Keel2013). The subset included in this paper comprises all systems shown in Figure 4, along with a few additional interesting pairs, listed in order of increasing RA.

Figure 4a. Sample pairs based on the pair types described in Keel et al. (Reference Keel2013). PNG images were obtained from the Legacy Survey Viewer using the legacystamps Python module (Sweijen Reference Sweijen2022) and independently verified for use in this figure. We identify pairs by their common name (e.g. NGC 450) or truncated coordinates (e.g. 1226+00).

Figure 4b. The remaining sample pairs following Keel et al. (Reference Keel2013) mnemonics.

4.3.1. Pair categories

Below, we briefly summarise each mnemonic designation from the categories defined in Keel et al. (Reference Keel2013), along with one additional category of our own. For each designation, we outline the specific tasks and corresponding responses used for classification. We also provide a summary of the number of pairs in each designation in Table 4.

  1. 1. F: a nearly face-on spiral in front of a smooth elliptical or S0 background system [Task T0 (A1), Task T01 (A0), and Task T02 (A3)].

  2. 2. Q: a background galaxy nearly edge-on and projected nearly radially to the foreground galaxy [Task T0 (A0, A1), Task T01 (A0, A1), Task T02 (A1), Task T08, and Task T09].

  3. 3. $\boldsymbol\Phi$ : an essentially edge-on spiral backlit by a smooth elliptical or S0 background system [Task T0 (A1), Task T01 (A0), Task T02 (A0), and Task T03 (A0)].

  4. 4. X: both galaxies are nearly edge-on with their disks essentially crossing [Task T0 (A0), Task T01 (A0), and Task T02 (A2)].

  5. 5. SE: spiral/elliptical superpositions that do not fall in one of the other categories [Task T0 (A1), Task T01 (A1), and Task T02 (A0, A2, A3)].

  6. 6. S: spiral/spiral overlaps. Useful for probing extinction and attenuation in the UV [Task T0 (A0), Task T01 (A0), and Task T02 (A0, A3)].

  7. 7. B: the background galaxy has a much smaller angular size than the foreground galaxy [Task T0 (A0, A1), Task T04 (A0), and Task T05 (A0)].

  8. 8. E: elliptical or S0 pairs [Task T0 (A3)].

  9. 9. $\boldsymbol\Delta{z}$ : pairs with a known redshift distance large enough to rule out interaction. Pairs with $\Delta{z} \gt 0.008$ fall under this category.

  10. 10. CT: pairs that volunteers could not confidently identify the foreground/background distinction [Task T0 (A0, A1) and Task T01 (A2)].

Table 4. A summary of OZb classifications based on the pair types from Keel et al. (Reference Keel2013).

4.3.2. Overlap types

In addition to these categories, we introduce further designations to describe the degree of overlap between the two galaxies in each pair, based on Task T10. These are defined as follows:

  1. 1. I: slightly overlapping galaxies. [Task T10 (A0)].

  2. 2. II: intermediately overlapping galaxies [Task T10 (A1)].

  3. 3. III: deeply overlapping galaxies [Task T10 (A2)].

  4. 4. IV: completely overlapping galaxies [Task T10 (A3)].

Note that we assume all E-type pairs are complete overlaps, as we rarely observe significant separation between two elliptical galaxies. Again, we provide a summary of the counts for each overlap type in Table 5.

Table 5. A summary of the OZb classifications based on the degree of overlap between galaxy pairs.

4.3.3. Querying

To obtain the measurements for the catalogue, we developed a new querying method tailored to occulting galaxy pairs. Unlike single, isolated galaxies in the Galaxy Zoo project, overlapping pairs require two distinct coordinate inputs and thus require manual annotation of the off-centre companion.

Since tasks T06–T07 (Figure 2) address only foreground and background classification, we first had to determine which galaxy occupied the central and off-centre position using these annotations. To do this, we converted the annotated pixel coordinates into RA and DEC to estimate the centres of both galaxies, then compared these to the provided J2000.0 coordinates. We designated the galaxy with coordinates closest to the J2000.0 reference as the central galaxy, and the other as the off-centre companion. Using these coordinates, we then queried and assigned measurements such as object IDs, spectroscopic redshifts, and r-band magnitudes from both SDSS (Almeida et al. Reference Almeida2023) and DESI (Abdul-Karim et al. Reference Abdul-Karim2025) for the central (z1 and r1) and off-centre companion (z2 and r2), prioritizing the DESI measurement when valid. To ensure that these queries returned only the measurement corresponding to the intended galaxy centre, we applied a strict separation tolerance of 0.00105 degrees ( $\sim 3.6$ arcsec) from the input coordinates. This value is much larger than the astrometric precision of DR10 ( $\sim$ $0.2$ arcsec; Dey et al. Reference Dey2019), so even in cases of minor coordinate uncertainties, the correct galaxy measurement should be reliably selected. For cross-identifications, we employed a similar query approach using SIMBAD (Wenger et al. Reference Wenger2000), albeit with less restrictive criteria relying solely on the central galaxy’s position.

Manual inspection of the queried subsample showed that our querying methodology was 98–99% effective, with only a few problematic pairs. Most of the issues stemmed from proximity that fell under our strict tolerance, overcrowded fields, or occasional query failures. In such cases, we manually cross-referenced the DESI Legacy Survey with SDSS and DESI data to assign the correct IDs and redshifts. Although this level of accuracy was sufficient for the current project, we hope to improve it further in future iterations if possible.

Of the 765 galaxies queried, 22 centre and 403 off-centre pairs were also missing spectroscopic measurements necessary for accurate foreground, background distinction. We initially considered substituting missing values with photometric redshifts. However, these estimates are often unreliable. As shown in Keel et al. (Reference Keel2013), photometric redshifts can have errors exceeding $5\sigma$ traceable to deblending problems. We chose to forgo them for these reasons.

5. Classification accuracy

To assess the accuracy of the classifications in the OZb catalogue, we currently employ three methods: (1) performing internal diagnostic tests comparing redshift difference $(\Delta{z} = |z_f -z_b|)$ and magnitudes ( $r_f$ and $r_b$ ) against the fraction of classifications where the visual classifier correctly ( $z_f \lt z_b$ ) or incorrectly ( $z_f \gt z_b$ ) identified the foreground galaxy based on annotation positioning, (2) comparing non-expert morphological classifications to those performed by an expert (Keel et al. Reference Keel2013), and (3) comparing morphological classifications with GZD for S-, X-, and E-type pairs. These approaches allow us to assess how accurately people can distinguish foreground from background galaxies, which can be difficult to visually distinguish depending on image quality and redshift separation, as well as how reliably non-experts and experts classify occulting pairs according to the morphological categories described in Section 4.3.

5.1. Overlap Zoo Beta

5.1.1. Redshift separation

Figure 5 shows the fraction of occulting galaxy pairs for which the visual classifier correctly and incorrectly annotated the foreground galaxy, plotted against the redshift difference $\Delta z$ . We restrict the sample to pairs with valid pixel-based annotations, as our querying methodology required them (see Section 4.3.3), and to pairs with two valid spectroscopic redshifts needed to calculate the redshift difference.

Figure 5. The fraction of galaxy pairs with foreground identification correctly and incorrectly classified by users as a function of $\Delta z$ . We include only pairs with valid pixel-based redshift measurements in the sample.

Since GZD includes a majority of galaxies under $z=0.2$ , the bulk of our redshift data will and does lie in the low $\Delta{z}$ range (0.0–0.013), decreasing in sample size as the $\Delta{z}$ increases (451 pairs in the 0.0–0.04 bin, 70 in 0.04–0.09, and 55 in 0.09–0.13). Within this range, we find that the ability to identify the foreground galaxy correctly improves as the redshift difference between pairs increases. However, at the lowest redshift separation, classifiers identify the foreground galaxy correctly nearly as often as they do incorrectly. Although higher-quality images could potentially make this distinction easier, visual classifications with minimal separation are likely unreliable at the current image resolution and depth. Beyond this range, our sample size decreases drastically, and we no longer observe this pattern. Large separations would likely make the distinction easier, though a bigger sample size at higher separations is necessary to confirm this.

5.1.2. Magnitudes

Figure 6 shows the fraction of occulting galaxy pairs for which the visual classifier correctly and incorrectly annotated the foreground galaxy, based on whether the foreground galaxy was brighter ( $r_f \lt r_b$ ) or dimmer ( $r_f \gt r_b$ ) in r-band magnitude. Again, we restricted the sample to pairs with valid pixel-based annotations that had two valid spectroscopic measurements.

Figure 6. The fraction of galaxy pairs with foreground identification correctly and incorrectly classified, based on whether the foreground galaxy appeared dimmer ( $r_f \gt r_b$ ) or brighter ( $r_f \lt r_b$ ) in r-band magnitude. We include only pairs with valid pixel-based redshift measurements in the sample.

Unsurprisingly, pairs with brighter foreground galaxies have their positioning identified correctly more often than those with a dimmer foreground galaxy ( $31\% \text{ vs. } 69\%$ ). Surprisingly, however, pairs with a dimmer foreground galaxy show nearly comparable rates of correct and incorrect identification but are more often incorrectly identified ( $58\% \text{ vs. } 42\%$ ). One possible explanation is that when the foreground galaxy is fainter, its features may blend with those of the background galaxy, making the distinction between the foreground and background galaxies more difficult. In contrast, a brighter foreground galaxy, especially in Spiral–Elliptical systems, likely shows more distinct features and suffers less from blending, aiding the user in distinguishing foreground from background. However, these predicted magnitudes may be affected by the aforementioned blending effects (see the beginning of Section 4.3) and could introduce some uncertainty into our comparisons.

5.2. OZb vs. expert visual classification

To verify the effectiveness of the citizen science approach in classifying occulting pairs, we compared our user-classified pairs with those assigned by an expert (Keel et al. Reference Keel2013). Of the 765 galaxy pairs in our catalogue, 189 overlap with the expert sample. Among these, 49 morphological classifications (26%) matched between our catalogue and the expert’s. While this relatively low level of agreement may initially suggest an issue, several plausible explanations exist for the mismatch and other discrepancies.

A key factor likely influencing the discrepancy is the difference in imaging depth between SDSS DR7 (Abazajian et al. Reference Abazajian2009) and the DESI Legacy Survey DR10 (Dey et al. Reference Dey2019). As a more recent survey equipped with improved instrumentation, DESI delivers images with greater depth (1.5–2 mag deeper) than SDSS. Figure 7 presents a direct comparison of the same galaxy pair, as imaged by both surveys and classified by both the expert and this work. The difference in image quality between the two surveys is substantial enough that structural features of the background galaxy, clearly resolved in DESI, are almost indistinguishable in SDSS. This has direct implications for morphological classification; in cases where the background galaxy’s structure may be subtle or faint, classifiers using SDSS DR7, regardless of expertise, would likely be unable to identify key cues necessary to assign a distinct classification. Indeed, the classifications of both works reflect this: the pair in this work was classified as ‘Other’, as there are visible signs of merging in the DESI image, whereas the expert classified it as an F-type (see Section 4.3.1), as there are no distinguishable merging signs due to the lack of visible structure in the SDSS image.

Figure 7. A comparison of image quality between SDSS and DESI using the galaxy pair J095303.71+064237.6. Left: The SDSS DR7 image used for classification in Keel et al. (Reference Keel2013). Right: The DESI Legacy Survey DR10 image used for classification in this work. Both images match in size and pixel scale for direct comparison.

Although not available for every pair, the expert catalogue also had access to spectroscopic and/or photometric redshift information to aid in distinguishing the foreground and background galaxies. In contrast, our classification process did not present redshifts directly to users, which could also explain the low agreement in pair types where positional orientation matters. Manually providing these measurements becomes impractical as our datasets scale; hence, we did not offer them directly (also see Section 4.3.3). One of our primary goals was to assess non-experts’ ability to distinguish foreground and background galaxies. Including redshifts would undermine this goal.

5.3. OZb vs. GZD-1, -2, and -5

While less robust than a direct comparison with an occulting pair catalogue, we also evaluate the accuracy of our classifications using single-galaxy morphological classifications from GZD-1, -2, and -5. However, the current iteration of our catalogue does not yet allow for a comprehensive comparative analysis across all morphological categories (see Section 7.1). Despite these limitations, comparisons with S-, X-, and E-type occulting pair morphologies remain feasible, as positional morphological knowledge of the central galaxy is not required for these types. For these comparisons, we use raw and debiased vote fractions from tasks T00 (A0 and A1) and T01 (A0) of the GZD decision tree (see Figure 4; Walmsley et al. Reference Walmsley2022), adopting an agreement threshold of 0.5; pairs with vote fractions exceeding this threshold are considered to have an agreeable classification.

Table 6 presents the agreement metrics based on the raw vote fractions for morphological answers in GZD, which correspond directly to the aforementioned morphological types of occulting pairs in our catalogue. Any agreement or disagreement in this comparison reflects a more direct view of how human observers perceive galaxy morphology from the images they are shown. Based on these comparisons, we find that, despite our relatively low classification count of 4 votes per pair, the classifications provided in our catalogue are relatively in agreement with the higher number of visual classifications in the GZD datasets. However, agreement could be improved, especially for X-types, as they show the lowest agreement between our datasets. The limited number of votes could be contributing to this lower agreement, as could the image quality (see Section 7.3), or it may be due to redshift bias, as galaxies at higher redshifts are much harder to distinguish by the key features needed to identify these types.

To test whether this redshift bias is present, we present the agreement metrics based on the de-biased vote fractions in Table 7, following the same structure as Table 6. In GZD, redshift de-biasing mitigates classification bias by estimating how each galaxy would appear if observed at a fixed low redshift. This reference redshift is set to $z=0.02$ (Walmsley et al. Reference Walmsley2022), with the full de-biasing method described in detail by Hart et al. (Reference Hart2016).

Comparing the two, we find that the agreement percentage for E-types is strongly affected, showing an approximately eightfold difference in GZD-1 and -2, and a twofold difference in GZD-5. This is consistent with the findings of Walmsley et al. (Reference Walmsley2022), who reported that the de-biasing procedure tends to over-correct the ‘Smooth or Featured’ vote fractions, effectively reversing the original observed redshift trend (see their Figure 10). For X-type pairs, agreement follows the same reversing pattern and improves slightly for the ‘featured or disk’ category, while the ‘edge-on’ agreement exhibits a different trend. Walmsley et al. (Reference Walmsley2022) did not observe a comparable change in ‘edge-on’ vote fractions with redshift de-biasing, and it is likely that a marginal fraction of our edge-on pairs may be misclassified in our sample. On the other hand, we do not observe a comparable effect for S-type pairs as seen in E-types and X-types. We observe only a minor change in agreement compared with GZD-5, suggesting that most S-types in our sample are likely readily identifiable, large, luminous systems with well-defined disks or features, that are not greatly affected by redshift bias.

Table 6. This table compares the agreement between the raw vote fractions for morphological types in GZD and the corresponding OZb types. A pair is considered agreeable if it has a likelihood of at least 0.5 (i.e. a majority) for the corresponding morphological type. Here, N denotes the total number of overlapping pairs with GZD, while % indicates the fraction of those pairs that are in agreement.

Table 7. This table compares the agreement between the redshift debiased vote fractions for morphological types in GZD and the corresponding OZb types. A pair is considered agreeable if it has a likelihood of at least 0.5 (i.e. a majority) for the corresponding morphological type. Here, N denotes the total number of overlapping pairs with GZD, while % indicates the fraction of those pairs that are in agreement.

It is important to note, however, that the redshift de-biasing procedure itself carries significant uncertainties. The corrections applied rely on assumptions and models about how a galaxy would appear at a fixed low redshift. As a result, the agreement metrics from the de-biased sample may be highly uncertain, and these comparisons with the de-biased data should be interpreted cautiously. Some of the trends we observe, particularly in E- and X-type pairs, may be amplified by these uncertainties and by our low sample size, meaning that the true underlying agreement may differ significantly from that suggested by the de-biased vote fraction comparison. The true agreement is likely between our results here.

6. Implications of the expanded catalogue

Expanding and growing the catalogue of occulting pairs is similar in nature to expanding single galaxy catalogues, albeit with one major difference: our targets are on several orders of magnitude rarer than those of single galaxies. Consequently, any growth in this catalogue, however incremental, represents a significant gain in statistical power. Increasing the number of identified pairs directly improves the robustness of our analyses across all pair types, including systems with favourable geometries for dust attenuation and similar studies.

We summarise the increase in sample size of our catalogue relative to previous occulting-pair catalogues in Table 8. The growth of the each pair category, while all benefiting from an increase in sample size, also has distinct implications for different pair types (see Section 4.3.1 for definitions). For F-type pairs, the expanded catalogue increases the likelihood of identifying pairs with the ideal geometry (e.g. VV191a/b) necessary for a detailed analysis of dust attenuation and related measurements along the outer regions of the spiral disk (see Keel et al. Reference Keel2013, Reference Keel2023; Robertson et al. Reference Robertson2024a, 2025). Some of the pairs identified in the first row of Figure 4 are potentially ideal pairs we could use for such analysis. Similarly, $\phi$ , Q, and X-type pairs also require somewhat ideal geometry for detailed analysis of vertical dust structures in the foreground or background edge-on disk galaxy, and thus benefit from the same statistical gain. S-type pairs are particularly useful for measuring and comparing optical and UV extinction (Keel & White Reference Keel and White2001b), and their scientific utility likewise increases with sample size. B-type pairs require considerable care when using these small background sources to map extinction, making larger samples particularly valuable for mitigating individual system uncertainties. Finally, E-type pairs, unlike ideal-geometry pairs, require larger samples to identify low-opacity diffuse dust components. Our catalogue represents a substantial increase over Keel et al. (Reference Keel2013) in this front (see Table 8), so our statistical power is greatly increased for these pairs.

Table 8. A comparison of pair counts classified in OZb with those in Keel et al. (Reference Keel2013).

By increasing our sample size, we are also no longer limited to studies of individual systems, such as what was done with VV191a/b (Keel et al. Reference Keel2023; Robertson et al. Reference Robertson2024a, 2025), but can instead conduct large-scale, population-level analyses of these occulting pairs. This enables a systematic investigation of the optical properties (the transmission) of the ISM across diverse galaxy types and environments. This project and its future iterations will serve as the foundation for these analyses. Work toward extending such studies across a wide range of occulting pairs is already underway, following the isophotal and rotational methodologies outlined in Holwerda et al. (Reference Holwerda, Keel, Williams, Dalcanton and de Jong2009).

7. Project limitations

Some of the discrepancies identified in Section 5 can be attributed to inherent limitations in the project itself. Following the conclusion of the Overlap Zoo beta period, we identified several key issues in our workflow, selection criteria, and image generation process that likely contributed to these discrepancies. Work is already underway to address these issues and will likely result in significant improvements in upcoming iterations.

7.1. Workflow structure

Some aspects of our workflow (Figure 2) are not ideal. For example, responses to Task T00 could better align with the structure used in previous Galaxy Zoo projects. Our current responses for this task frequently lead to misclassification of edge-on spiral galaxies as ellipticals, which may partly explain discrepancies in classifications. Task T01 may mistakenly label Spiral-Spiral systems as having an elliptical galaxy in the foreground. More importantly, it restricts tasks T06–T09 to only those pairs where users can distinguish between the foreground and background galaxies, which limits our ability to collect annotations for all valid pairs and complicates the querying process. Additionally, Task T01 does not provide information on whether the foreground or background galaxy occupies the central or off-central position. This absence currently makes direct comparison of morphological classifications with GZD-1, -2, and -5 difficult for most pairs, but possible for some (see Section 5.3). Task T02 also introduces confusion, as users sometimes misidentify Spiral-Elliptical pairs as systems with both galaxies edge-on. Lastly, Task T05 appears too subjective, as the meaning of ‘significantly smaller’ is not clearly defined, warranting caution when using these classifications. This ambiguity is also present in the expert catalogue. We have begun addressing these issues in the next iteration of the project by restructuring the decision tree and removing subjective questions, with the goal of improving consistency and clarity throughout the workflow.

The number of votes per galaxy pair in our workflow is also a significant limitation to consider. Each occulting pair candidate in our catalogue was retired after receiving just four user classifications (see Section 7.1). This number is considerably lower than in other citizen science projects such as Galaxy Zoo (Lintott et al. Reference Lintott2011; Willett et al. Reference Willett2013; Walmsley et al. Reference Walmsley2022), where each galaxy typically receives upwards of 40 classifications. Due to the relatively low number of votes, we often had to correct for ties in the classifications (see Section 3.2.2), which may introduce additional uncertainty into the final labels. We anticipate that increasing the number of classifications per image would not only improve the accuracy illustrated in Figure 5 but also reduce some discrepancies between our work and the expert catalogue. Future public iterations of the project will raise the retirement threshold, allowing more classifications per image and enhancing overall classification reliability.

7.2. Selection criteria

Our image selection criteria could also be improved to better filter out false overlaps (see Section 2.1). For instance, imposing stricter thresholds on the number of votes and vote fractions for galaxies flagged as merging could reduce misclassification of interacting or merging pairs. Unfortunately, we cannot rely on redshift information, as both galaxies in a pair do not always have measured redshifts. We would also like to impose thresholds to identify false overlaps involving stars; however, this is more challenging. At the current resolution of DESI, stars often appear as tiny, unresolved blobs, making them difficult to differentiate from smooth galaxies, meaning we can’t use the ‘star or artifact’ response in Galaxy Zoo projects to exclude these. Properly separating the two will likely require higher-resolution data from future surveys and visual classifications from additional Galaxy Zoo projects using this data.

7.3. Image generation process

For the Overlap Zoo Beta project, images were generated as JPEG cutouts from the DESI Legacy Survey. While sufficient for visual inspection, JPEG compression inherently reduces information compared to lossless PNG images. This compression may have introduced minor biases in volunteer classifications, particularly for more distant pairs where subtle morphological features can be blurred due to compression. As a result, the improvements offered by DR10 over earlier releases may have had a limited impact in the current iteration. In the next iteration, we plan to produce images following the methodology of previous Galaxy Zoo projects (e.g. Walmsley et al. Reference Walmsley2022) to avoid this issue entirely.

8. Conclusion

We have presented Overlap Zoo Beta, a beta citizen science project that tests the ability of non-experts to classify occulting galaxy pairs at the imaging depth and quality of DR10 of the DESI Legacy Survey. The project also presents a catalogue comprising three subset catalogues: (1) an initial 2 380 candidate pairs presented and classified by volunteers, (2) 1 428 total annotations derived from these candidates (including duplicates), and (3) a sample of 765 occulting galaxy pairs with classifications that could be categorised following Keel et al. (Reference Keel2013), as well as associated redshift and magnitude data. Out of the 765 identified pairs, $\sim$ 17% have known redshift differences significant enough to eliminate the possibility of interaction ( $\Delta{z} \gt 0.008$ ; Keel et al. Reference Keel2013). However, we warrant caution when using these morphological classifications, as each image received only four votes per galaxy pair from non-expert classifiers.

Future work will focus on refining our classification workflow and reclassifying these pairs using a larger number of classifications per galaxy pair, thereby improving classification reliability. We also plan to include the Keel et al. (Reference Keel2013) catalogue in the next iteration to take advantage of the improved imaging depth that the DESI DR10 provides over SDSS DR7. Looking further ahead, we aim to apply our classification workflow to data from upcoming large-scale surveys such as Euclid (Euclid Collaboration et al. Reference Collaboration2025b), LSST (Ivezić et al. Reference Ivezić2019), and the forthcoming Nancy Grace Roman Space Telescope, with support from future Galaxy Zoo projects.

Acknowledgements

The author(s) would like to thank Mike Walmsley and the Galaxy Zoo team for providing the initial dataset used to create this catalogue.

The author made use of Grammarly to assist with the drafting of this article to enhance readability and language. After using this tool/service, the authors reviewed and edited the content as needed and take full responsibility for the content of the publication.

Data availability statement

The catalogue generated in this work is available at https://doi.org/10.5281/zenodo.16884326.

References

Abazajian, K. N., et al. 2009, ApJS, 182, 543Google Scholar
Abdul-Karim, M., et al. 2025, arXiv preprint arXiv:2503.14745 Google Scholar
Almeida, A., et al. 2023, ApJS, 267, 44Google Scholar
Andredakis, Y. C., & van der Kruit, P. C. 1992, A&A, 265, 396Google Scholar
Battisti, A. J., Calzetti, D., & Chary, R.-R. 2016, arXiv e-prints arXiv:1601.00208 Google Scholar
Battisti, A. J., Calzetti, D., & Chary, R. R. 2017, ApJ, 840, 109Google Scholar
Berlind, A. A., Quillen, A. C., Pogge, R. W., & Sellgren, K. 1997, AJ, 114, 107Google Scholar
Buat, V., et al. 2018, A&A, 619, A135Google Scholar
Burgarella, D., Buat, V., & Iglesias-Páramo, J. 2005, MNRAS, 360, 1413Google Scholar
Calzetti, D., et al. 2000, ApJ, 533, 682Google Scholar
Calzetti, D., Kinney, A. L., & Storchi-Bergmann, T. 1994, ApJ, 429, 582Google Scholar
Casasola, V., et al. 2019, arXiv e-prints, arXiv:1911.09187 Google Scholar
Casey, C. M., et al. 2014, arXiv e-prints arXiv:1410.0702 Google Scholar
Charlot, S., & Fall, S. M. 2000, ApJ, 539, 718Google Scholar
Conroy, C., Schiminovich, D., & Blanton, M. R. 2010, ApJ, 718, 184Google Scholar
Crill, B. P., et al. 2020, in Society of Photo-Optical Instrumentation Engineers (SPIE) Conference Series, Vol. 11443, Space Telescopes and Instrumentation 2020: Optical, Infrared, and MillimeterWave, ed. M. Lystrup, & M. D. Perrin, 114430IGoogle Scholar
Darg, D. W., et al. 2010, MNRAS, 401, 1043Google Scholar
Dey, A., et al. 2019, AJ, 157, 168Google Scholar
Elmegreen, D. M., Elmegreen, B. G., & Eberwein, K. S. 2001, in Bulletin of the American Astronomical Society, Vol. 33, American Astronomical Society Meeting Abstracts #198, 798–+Google Scholar
Collaboration, Euclid, et al. 2025a, A&A, 697, A1 Google Scholar
Collaboration, Euclid, et al. 2025b, arXiv e-prints, arXiv:2503.15302 Google Scholar
Gordon, K. D., Clayton, G. C., Misselt, K. A., Landolt, A. U., &Wolff, M. J. 2003, ApJ, 594, 279Google Scholar
Hart, R. E., et al. 2016, MNRAS, 461, 3663Google Scholar
Holwerda, B. 2007, in HST Proposal, Vol. 16, 11257 Google Scholar
Holwerda, B. W. 2009, in HST Proposal, Vol. 18, 12134 Google Scholar
Holwerda, B. W., Keel, W. C., Williams, B., Dalcanton, J. J., & de Jong, R. S. 2009, AJ, 137, 3000Google Scholar
Holwerda, B. W., et al. 2013, AN, 334, 268Google Scholar
Holwerda, B. W., et al. 2015, MNRAS, 449, 4277Google Scholar
Ivezić, Ž., et al. 2019, ApJ, 873, 111Google Scholar
Johnson, B. D., et al. 2007, ApJS, 173, 392Google Scholar
Keel, W. C., & White, III, R. E. 2001a, AJ, 121, 1442Google Scholar
Keel, W. C., & White, III, R. E. 2001b, AJ, 122, 1369Google Scholar
Keel, W. C., et al. 2013, PASP, 125, 2Google Scholar
Keel, W. C., et al. 2023, AJ, 165, 166Google Scholar
La Mura, G., et al. 2025, AdSR, https://doi.org/10.1016/j.asr.2025.05.002 Google Scholar
Lanczos, C. 1938, JMPh, 17, 123Google Scholar
Leja, J., Johnson, B. D., Conroy, C., van Dokkum, P. G., & Byler, N. 2017, ApJ, 837, 170Google Scholar
Lintott, C., et al. 2011, MNRAS, 410, 166Google Scholar
Lintott, C. J., et al. 2008, MNRAS, 389, 1179Google Scholar
Meurer, G. R., Heckman, T. M., & Calzetti, D. 1999, ApJ, 521, 64Google Scholar
Reddy, N. A., et al. 2015, ApJ, 806, 259Google Scholar
Robertson, C., et al. 2024a, arXiv e-prints, arXiv:2403.15619 Google Scholar
Robertson, C., et al. 2024b, AJ, 167, 263Google Scholar
Robertson, C. D., et al. 2025, AJ, 170, 195Google Scholar
Salim, S., Boquien, M., & Lee, J. C. 2018, ApJ, 859, 11Google Scholar
Salim, S., & Narayanan, D. 2020, ARA&A, 58, 529Google Scholar
Seibert, M., et al. 2005, ApJ, 619, L55Google Scholar
Sweijen, F. 2022, legacystamps: Retrieve DESI Legacy Imaging Surveys cutouts, Astrophysics Source Code Library, record ascl:2204.003Google Scholar
Walmsley, M., et al. 2022, MNRAS, 509, 3966Google Scholar
Wenger, M., et al. 2000, A&AS, 143, 9Google Scholar
White, R. E., & Keel, W. C. 1992, Natur, 359, 655Google Scholar
Wild, V., et al. 2011, MNRAS, 417, 1760Google Scholar
Willett, K. W., et al. 2013, MNRAS, 435, 2835Google Scholar
Willett, K. W., et al. 2017, MNRAS, 464, 4176Google Scholar
Figure 0

Figure 1. The distribution of galaxies classified as ‘overlapping’ in GZD-1, -2, and -5. We plot the GZD-1 and GZD-2 data as black dots, the GZD-5 data as red dots, and our selections of the combined data as blue dots. Our selection criteria include only galaxies classified as overlapping with votes $\gt$13 and voting fractions $\gt$0.28.

Figure 1

Figure 2. Flowchart of the workflow presented to our internal classifiers. We labelled each question with its corresponding task number. Tasks are also colour-coded based on their tier level. Each tier, coloured grey, green, blue, and purple, represents zero, one, two, three, or four branching points in the decision tree, respectively.

Figure 2

Figure 3. The distribution of user consistency $\kappa$ based on the methods outlined in Willett et al. (2013, 2017). Bottom: the cumulative distribution of consistency for all users. Top: the average number of galaxy pairs, $\langle{n}\rangle$, classified by users as a function of their consistency.

Figure 3

Table 1. Overlap Zoo Beta: User classifications.

Figure 4

Table 2. Overlap Zoo Beta: User annotations.

Figure 5

Table 3. Overlap Zoo Beta: Morphological classifications.

Figure 6

Figure 4a. Sample pairs based on the pair types described in Keel et al. (2013). PNG images were obtained from the Legacy Survey Viewer using the legacystamps Python module (Sweijen 2022) and independently verified for use in this figure. We identify pairs by their common name (e.g. NGC 450) or truncated coordinates (e.g. 1226+00).

Figure 7

Figure 4b. The remaining sample pairs following Keel et al. (2013) mnemonics.

Figure 8

Table 4. A summary of OZb classifications based on the pair types from Keel et al. (2013).

Figure 9

Table 5. A summary of the OZb classifications based on the degree of overlap between galaxy pairs.

Figure 10

Figure 5. The fraction of galaxy pairs with foreground identification correctly and incorrectly classified by users as a function of $\Delta z$. We include only pairs with valid pixel-based redshift measurements in the sample.

Figure 11

Figure 6. The fraction of galaxy pairs with foreground identification correctly and incorrectly classified, based on whether the foreground galaxy appeared dimmer ($r_f \gt r_b$) or brighter ($r_f \lt r_b$) in r-band magnitude. We include only pairs with valid pixel-based redshift measurements in the sample.

Figure 12

Figure 7. A comparison of image quality between SDSS and DESI using the galaxy pair J095303.71+064237.6. Left: The SDSS DR7 image used for classification in Keel et al. (2013). Right: The DESI Legacy Survey DR10 image used for classification in this work. Both images match in size and pixel scale for direct comparison.

Figure 13

Table 6. This table compares the agreement between the raw vote fractions for morphological types in GZD and the corresponding OZb types. A pair is considered agreeable if it has a likelihood of at least 0.5 (i.e. a majority) for the corresponding morphological type. Here, N denotes the total number of overlapping pairs with GZD, while % indicates the fraction of those pairs that are in agreement.

Figure 14

Table 7. This table compares the agreement between the redshift debiased vote fractions for morphological types in GZD and the corresponding OZb types. A pair is considered agreeable if it has a likelihood of at least 0.5 (i.e. a majority) for the corresponding morphological type. Here, N denotes the total number of overlapping pairs with GZD, while % indicates the fraction of those pairs that are in agreement.

Figure 15

Table 8. A comparison of pair counts classified in OZb with those in Keel et al. (2013).