The GALAH survey: Data release 4

Sven Buder; Janez Kos; Xi Ella Wang; Madeleine McKenzie; Madeleine Howell; Sarah Martell; Michael R. Hayden; Daniel B. Zucker; Thomas Nordlander; Benjamin Montet; Gregor Traven; Joss Bland-Hawthorn; Gayandhi M. De Silva; Kenneth Freeman; Geraint Lewis; Karin Lind; Sanjib Sharma; Jeffrey D. Simpson; Dennis Stello; Tomaz Zwitter; Anish M. Amarsi; Joseph J. Armstrong; Kirsten Banks; Mark Beavis; Kevin-Luke Beeson; Boquan Chen; Ioana Ciucă; Gary S. Da Costa; Richard de Grijs; Bailey Martin; David Moise Nataf; Melissa Ness; Adam D. Rains; Tim Scarr; Rok Vogrinčič; Zixian Purmortal Wang; Rob A. Wittenmyer; Yi Anne Xie; The GALAH Collaboration

doi:10.1017/pasa.2025.26

The GALAH survey: Data release 4

Published online by Cambridge University Press: 27 May 2025

Janez Kos ,

Thomas Nordlander and

Benjamin Montet

...Show all authors

Show author details

Sven Buder*: Affiliation:
Research School of Astronomy and Astrophysics, Australian National University, Canberra, ACT, Australia ARC Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), Australia ACCESS-NRI, Australian National University, Canberra, ACT, Australia
Janez Kos: Affiliation:
Faculty of Mathematics & Physics, University of Ljubljana, Ljubljana, Slovenia
Xi Ella Wang: Affiliation:
Research School of Astronomy and Astrophysics, Australian National University, Canberra, ACT, Australia ARC Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), Australia
Madeleine McKenzie: Affiliation:
Research School of Astronomy and Astrophysics, Australian National University, Canberra, ACT, Australia ARC Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), Australia
Madeleine Howell: Affiliation:
ARC Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), Australia School of Physics and Astronomy, Monash University, Clayton, VIC, Australia
Sarah Martell: Affiliation:
ARC Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), Australia School of Physics, University of New South Wales, Sydney, NSW, Australia
Michael R. Hayden: Affiliation:
ARC Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), Australia Homer L. Dodge Department of Physics & Astronomy, University of Oklahoma, Norman, OK, USA School of Physics, UNSW, Sydney, NSW, Australia Sydney Institute for Astronomy, School of Physics, A28, The University of Sydney, Sydney, NSW, Australia
Daniel B. Zucker: Affiliation:
ARC Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), Australia School of Mathematical and Physical Sciences, Macquarie University, Sydney, NSW, Australia Astrophysics and Space Technologies Research Centre, Macquarie University, Sydney, NSW, Australia
Thomas Nordlander: Affiliation:
Research School of Astronomy and Astrophysics, Australian National University, Canberra, ACT, Australia ARC Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), Australia Department of Physics and Astronomy, Uppsala University, Uppsala, Sweden
Benjamin Montet: Affiliation:
School of Physics, University of New South Wales, Sydney, NSW, Australia UNSW Data Science Hub, University of New South Wales, Sydney, NSW, Australia
Gregor Traven: Affiliation:
Faculty of Mathematics & Physics, University of Ljubljana, Ljubljana, Slovenia
Joss Bland-Hawthorn: Affiliation:
ARC Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), Australia Sydney Institute for Astronomy, School of Physics, A28, The University of Sydney, Sydney, NSW, Australia
Gayandhi M. De Silva: Affiliation:
ARC Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), Australia School of Mathematical and Physical Sciences, Macquarie University, Sydney, NSW, Australia Astrophysics and Space Technologies Research Centre, Macquarie University, Sydney, NSW, Australia
Kenneth Freeman: Affiliation:
Research School of Astronomy and Astrophysics, Australian National University, Canberra, ACT, Australia ARC Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), Australia
Geraint Lewis: Affiliation:
Sydney Institute for Astronomy, School of Physics, A28, The University of Sydney, Sydney, NSW, Australia
Karin Lind: Affiliation:
Department of Astronomy, Stockholm University, AlbaNova University Centre, Stockholm, Sweden
Sanjib Sharma: Affiliation:
Space Telescope Science Institute, Baltimore, MD, USA
Jeffrey D. Simpson: Affiliation:
ARC Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), Australia School of Physics, UNSW, Sydney, NSW, Australia
Dennis Stello: Affiliation:
ARC Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), Australia School of Physics, UNSW, Sydney, NSW, Australia Sydney Institute for Astronomy, School of Physics, A28, The University of Sydney, Sydney, NSW, Australia Stellar Astrophysics Centre, Aarhus University, Aarhus C, Denmark
Tomaz Zwitter: Affiliation:
Faculty of Mathematics & Physics, University of Ljubljana, Ljubljana, Slovenia
Anish M. Amarsi: Affiliation:
Department of Physics and Astronomy, Uppsala University, Uppsala, Sweden
Joseph J. Armstrong: Affiliation:
Department of Space, Earth & Environment, Chalmers University of Technology, Gothenburg, Sweden
Kirsten Banks: Affiliation:
ARC Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), Australia School of Physics, UNSW, Sydney, NSW, Australia
Mark Beavis: Affiliation:
Centre for Astrophysics, University of Southern Queensland, Toowoomba, QLD, Australia
Kevin-Luke Beeson: Affiliation:
Faculty of Mathematics & Physics, University of Ljubljana, Ljubljana, Slovenia
Boquan Chen: Affiliation:
Research School of Astronomy and Astrophysics, Australian National University, Canberra, ACT, Australia ARC Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), Australia
Ioana Ciucă: Affiliation:
Research School of Astronomy and Astrophysics, Australian National University, Canberra, ACT, Australia ARC Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), Australia
Gary S. Da Costa: Affiliation:
Research School of Astronomy and Astrophysics, Australian National University, Canberra, ACT, Australia ARC Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), Australia
Richard de Grijs: Affiliation:
School of Mathematical and Physical Sciences, Macquarie University, Sydney, NSW, Australia Astrophysics and Space Technologies Research Centre, Macquarie University, Sydney, NSW, Australia International Space Science Institute–Beijing, Zhongguancun, Beijing, China
Bailey Martin: Affiliation:
Research School of Astronomy and Astrophysics, Australian National University, Canberra, ACT, Australia
David Moise Nataf: Affiliation:
Department of Physics & Astronomy, University of Iowa, Iowa City, IA, USA
Melissa Ness: Affiliation:
Research School of Astronomy and Astrophysics, Australian National University, Canberra, ACT, Australia ARC Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), Australia
Adam D. Rains: Affiliation:
Department of Physics and Astronomy, Uppsala University, Uppsala, Sweden
Tim Scarr: Affiliation:
Research School of Astronomy and Astrophysics, Australian National University, Canberra, ACT, Australia
Rok Vogrinčič: Affiliation:
Faculty of Mathematics & Physics, University of Ljubljana, Ljubljana, Slovenia
Zixian Purmortal Wang: Affiliation:
ARC Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), Australia Sydney Institute for Astronomy, School of Physics, A28, The University of Sydney, Sydney, NSW, Australia Department of Physics and Astronomy, University of Utah, Salt Lake City, UT, USA
Rob A. Wittenmyer: Affiliation:
Centre for Astrophysics, University of Southern Queensland, Toowoomba, QLD, Australia
Yi Anne Xie: Affiliation:
Research School of Astronomy and Astrophysics, Australian National University, Canberra, ACT, Australia ARC Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), Australia
The GALAH Collaboration: Affiliation:
https://www.galah-survey.org
*: Corresponding author: Sven Buder; Email: sven.buder@anu.edu.au

Article contents

Abstract
Introduction and workflow
Data
Synthetic spectra for 2DF-hermes
Single spectrum analysis (ALLSPEC)
Single star analysis (ALLSTAR)
Post-processing
Data release products
Caveats and future improvements
Conclusions
Facilities
Software
Linelist
Data availability
Footnotes
References

Rights & Permissions

Abstract

The stars of the Milky Way carry the chemical history of our Galaxy in their atmospheres as they journey through its vast expanse. Like barcodes, we can extract the chemical fingerprints of stars from high-resolution spectroscopy. The fourth data release (DR4) of the Galactic Archaeology with HERMES (GALAH) Survey, based on a decade of observations, provides the chemical abundances of up to 32 elements for 917 588 stars that also have exquisite astrometric data from the Gaia satellite. For the first time, these elements include life-essential nitrogen to complement carbon, and oxygen as well as more measurements of rare-earth elements critical to modern-life electronics, offering unparalleled insights into the chemical composition of the Milky Way. For this release, we use neural networks to simultaneously fit stellar parameters and abundances across the whole wavelength range, leveraging synthetic grids computed with Spectroscopy Made Easy. These grids account for atomic line formation in non-local thermodynamic equilibrium for 14 elements. In a two-iteration process, we first fit stellar labels to all 1 085 520 spectra, then co-add repeated observations and refine these labels using astrometric data from Gaia and 2MASS photometry, improving the accuracy and precision of stellar parameters and abundances. Our validation thoroughly assesses the reliability of spectroscopic measurements and highlights key caveats. GALAH DR4 represents yet another milestone in Galactic archaeology, combining detailed chemical compositions from multiple nucleosynthetic channels with kinematic information and age estimates. The resulting dataset, covering nearly a million stars, opens new avenues for understanding not only the chemical and dynamical history of the Milky Way but also the broader questions of the origin of elements and the evolution of planets, stars, and galaxies.

Keywords

Surveys the Galaxy methods: observational methods: data analysis stars: fundamental parameters stars: abundances

Information

Type: Research Article
Information: Publications of the Astronomical Society of Australia , Volume 42 , 2025 , e051

DOI: https://doi.org/10.1017/pasa.2025.26 [Opens in a new window]

NASA ADS Abstract Service [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright: © The Author(s), 2025. Published by Cambridge University Press on behalf of Astronomical Society of Australia

1. Introduction and workflow

1.1. Motivation

The history of our Milky Way galaxy is written in starlight. By capturing and analysing the light from millions of stars, which are now millions or billions of years old, we can uncover the chemical compositions embedded in their atmospheres since birth and use stars as time capsules into the past evolution of the Milky Way. The light of stars can thus guide us to explore and map our environment and Country, just as it has guided Aboriginal and Torres Strait Islander peoples and their astronomers for tens of thousands of years.

With this fourth data release (DR4) from the Galactic Archaeology with HERMES (GALAH) Survey, we are proudly publishing the next set of measurements of stellar chemical abundances for almost a third of the elements in the periodic table that are created by stars. The initial motivation for measuring so many elemental abundances was laid out by De Silva et al. (2015) and included the major motivation – chemical tagging – with the aim to trace back stars that were born together through their (expected) similar chemical compositions. The recent and ongoing efforts of GALAH and other surveys like the SDSS/APOGEE surveys (e.g. Abdurro’uf et al. Reference Abdurro’uf2022; Kollmeier et al. Reference Kollmeier2017), LAMOST (Zhao et al. Reference Zhao, Zhao, Chu, Jing and Deng2012), Gaia-ESO (Gilmore et al. Reference Gilmore2022; Hourihane et al. Reference Hourihane2023), RAVE (Steinmetz et al. Reference Steinmetz2020), and Gaia RVS (Recio-Blanco et al. Reference Recio-Blanco2023) have taught us that the chemical evolution of our Galaxy and stars is complex and it is difficult to recover stellar siblings on a large scale due to limitations in our observations, analysis methods, and intrinsic changes to chemical composition due to stellar evolution. New observations and innovations in the analysis that are presented in this data release will allow us to make significant progress towards chemical tagging.

The unique observational setup of GALAH allows us to deliver chemical abundance information for a powerful and substantial set of stars: those which have exquisite astrometric information from the revolutionary Gaia satellite (Gaia Collaboration et al. 2016) and for which we can estimate stellar ages either from empirical or theoretical models, like stellar isochrones or mass- and age-dependent relations of chemical compositions. By combining stellar ages, orbits, and chemistry, we have made major advances in the understanding of our Galaxy. In particular, the discovery of the major merger of the Milky Way with another slightly less massive galaxy between 8 and $10\,\mathrm{Gyr}$ ago (Belokurov et al. Reference Belokurov, Erkal, Evans, Koposov and Deason2018; Helmi et al. 2018) was paradigm shifting and motivated a new rush to collect more (and more diverse) information about the stars in our Milky Way.

GALAH DR4 presents two major improvements over the previous data releases. We have increased the quantity as well as quality of observations and we have implemented a hybrid spectrum synthesis approach that allows us to fit 95% of the spectrum, including broad molecular absorption features from C₂ and CN. This allows us to now infer up to 32 elements,Footnote ^a including N, with unprecedented precision for a larger number of stars. GALAH DR4 naturally continues both the observing program aimed at acquiring spectra of 1 million stars (De Silva et al. 2015), and our ongoing efforts to improve the spectrum reduction and analysis pipelines, including the novel and more accurate line modelling with non-local thermodynamics equilibrium. In GALAH DR1 and DR2 (Martell et al. Reference Martell2017; Buder et al. 2018), we developed a novel, data-driven pipeline using the interpolation and fitting code The Cannon (Ness et al. Reference Ness, Hogg, Rix, Ho and Zasowski2015). However, for DR3 (Buder et al. Reference Buder2021), we reverted to the more computationally expensive method of spectrum synthesis, applying it to a limited wavelength range to confirm the accuracy of our data-driven approach. In this data release, we are now implementing a hybrid approach. We create a training set of synthetic spectra across the full wavelength range using the same synthesis code as DR3, then train a neural network to interpolate the spectra efficiently in a high-dimensional space with up to 36 dimensions. By using neural networks, we can model the entire wavelength range, including broad molecular absorption features from C₂ and CN, rather than focusing on narrow atomic line windows. This approach allows us to simultaneously model all stellar labels – global parameters and elemental abundances. Additionally, we can infer the shape of the interstellar spectrum from the differences between observed and synthetic spectra, while also incorporating non-spectroscopic information during the optimisation process.

In the following section, we outline our workflow and provide detailed explanations of our methodology throughout this manuscript, offering insights that upcoming surveys like WEAVE (Dalton et al. Reference Dalton2014), SDSS-V (Kollmeier et al. Reference Kollmeier2017), and 4MOST (de Jong et al. Reference de Jong2019) can readily utilise.

1.2. Workflow

The workflow of GALAH DR4 is depicted in Fig. 1 and will serve as a guideline for this manuscript: We first describe the collection of data in Section 2, most notably the observation of HERMES spectra. We explain how we create synthetic stellar spectra to compare with the observed ones in Section 3. This comparison is done in two consecutive steps. In Section 4, we explain how we extract stellar labels from individual observations (without non-spectroscopic information folded into the optimisation), while Section 5 describes how we co-add repeated observations and fold in non-spectroscopic information for each star. We describe the post-processing and validation of our data in Section 6. The data products of this data release are explained in Section 7. We describe identified caveats in Section 8 and make suggestions for minimising them in the future, before concluding this manuscript in Section 9.

Figure 1. Workflow of GALAH DR4.

2. Data

The GALAH Survey uses the 3.9-m Anglo-Australian Telescope at Siding Spring Observatory on Gamilaraay Country and its Two-Degree Field positioning system (2dF) top end (Lewis et al. Reference Lewis2002). 2dF magnetically places up to 400 fibre buttons on one of two metal field plates, which can be tumbled to allow observing with one set of fibres while configuring the other. Light is delivered through the fibres to the High Efficiency and Resolution Multi-Element Spectrograph (HERMES) spectrograph (Barden et al. Reference Barden2010; Brzeski, Case, & Gers Reference Brzeski, Case and Gers2011; Heijmans et al. Reference Heijmans2012; Farrell et al. Reference Farrell2014; Sheinis et al. Reference Sheinis2015) and dispersed into four non-contiguous wavelength bands in the optical that cover $\sim 1\,000\,$ Å in the range of 4 713–4 903 (blue CCD or CCD1), 5 648–5 873 (green/CCD2), 6 478–6 737 (red/CCD3), and 7 585–7 887 Å (infrared IR/CCD4). The data used in this data release is primarily based on observations of stars with this setup, but also makes use of auxiliary photometric and astrometric information for the stars where available.

Figure 2. Overview of the distribution of stars included in this fourth GALAH data release in Galactic coordinates with the centre of the Galaxy at the origin and the Gaia DR3 all-sky colour view (Gaia Collaboration et al., 2023) as background. Shown are the targets of GALAH Phase 1 (dark blue) and Phase 2 (medium blue), the targets of the K2-HERMES follow-up along the ecliptic and TESS-HERMES in the TESS Southern Continuous Viewing Zone as well as CoRoT fields (pink). Both open and globular cluster points are shown in purple and orange, respectively. All other targets are shown in in light blue across the Southern sky.

In this Section, we describe which stars we have targeted as part of configured fields (Miszalski et al. Reference Miszalski, Shortridge, Saunders, Parker and Croom2006) and observed with the 2dF-HERMES setup (Section 2.1), including the first description of the second phase of GALAH observations (GALAH Phase 2) which has a sharper focus on main-sequence turn-off stars to estimate more precise ages. In Section 2.2, we briefly summarise the properties of the spectroscopic data and how they were reduced to one-dimensional spectra. We also point out major changes in the observations and reductions with respect to the previous (third) data release (Buder et al. Reference Buder2021). We further elaborate on the auxiliary information that was used for the analysis in Section 2.3.

2.1. Target selection and observational setup

GALAH DR4 is a combination of the main GALAH survey and additional projects to observe asteroseismic targets from the K2 (Howell et al. Reference Howell2014) and TESS (Ricker et al. Reference Ricker2015) missions, that is, K2-HERMES (Sharma et al. Reference Sharma2019) and TESS-HERMES (Sharma et al. Reference Sharma2018), as well as numerous smaller programs and public HERMES data. Additional proposals with 2dF-HERMES have contributed targeted observations of globular cluster members (PI M. McKenzie and PI M. Howell), open clusters (PI G. De Silva and PI J. Kos), young stellar associations (PI J. Kos and J. Armstrong), and halo stars (PI S. Buder) in addition to their observation through the main surveys. The column survey_name in our catalogues denotes the origin. An all-sky view of GALAH DR4 is shown in Fig. 2.

2.1.1. Target selection for GALAH Phase 1 and 2

For GALAH Phase 1 (DR1-DR3) and in the absence of a precise and volume-complete survey in the optical, we used the 2MASS photometric survey (Skrutskie et al. Reference Skrutskie2006) with its J and Ks filters as a precise and nearly volume-complete parent sample from which we selected stars based on approximated (De Silva et al. 2015) visual magnitudes

(1)

\begin{equation}V_{JK_S} = K_S+2(J-K_S+0.14)+0.382e^{((J-K_S-0.2)/0.5)}.\end{equation}

For GALAH Phase 1, a tiling pattern (with unique field_id entries) with $2\,\mathrm{deg}$ fields of view below declination $\delta \leq +10\,\mathrm{deg}$ was created for regions with Galactic latitude $\vert b \vert \geq 10\,\mathrm{deg}$ to avoid crowding and strong extinction. For each tile, a selection of 400 stars within magnitudes $9 \leq V_{JK_S} \leq 12$ for a bright magnitude cut and $12 \leq V_{JK_S} \leq 14$ for the nominal magnitude cut is randomly selected from the complete parent sample of 2MASS. Of those, typically 350 stars are actually observed with around 2/3 main-sequence and turn-off stars and 1/3 evolved stars.

For GALAH Phase 2, a stronger focus on turn-off stars was implemented with the photometric and astrometric information of Gaia data release 2 as a parent sample. For each field, we therefore first allocate fibres to stars with absolute Gaia magnitude in the range of $2 \leq M_G \leq 4$ , where

(2)

\begin{equation}M_G = G + 5 \cdot \log_{10} \left( \frac{\varpi}{100\,\mathrm{mas}} \right)\end{equation}

with apparent magnitude ( $G\;/\;\mathrm{mag} \equiv {phot\_g\_mean\_mag}$ ) and parallax measurements ( $\varpi\;/\;\mathrm{mas} \equiv {parallax}$ ) from Gaia DR2 (Gaia Collaboration et al. 2018; Evans et al. Reference Evans2018; Lindegren et al. Reference Lindegren2018). Remaining fibres are filled with targets as done with the Phase 1 selection function. This leads to a different selection function for each phase. For science cases in which selection functions matter, we thus recommend to use the survey_name (Table 1) for a clean selection of phase and selection function.

2.1.2. Observational setup

We list the observations under various sub-programs in Table 1. Except for 2 935 spectroscopic observations with the high-resolution mode of HERMES ( $R \sim 42\,000$ ) on 7, 8, 10, 11 and 12 February 2014, all observations were made in the low-resolution mode ( $R \sim 28\,000$ ) with different total exposure times chosen for different programs, but typically between 60 and 90 min. Under sufficient conditions (no clouds and seeing below $2\,\mathrm{arcsec}$ ), GALAH Phase 1 and TESS-HERMES observed 3 exposures for 6 min for bright targets ( $9 \leq V_{JK_S} \leq 12$ ) and 3 exposures for 20 min for the majority of targets ( $12 \leq V_{JK_S} \leq 14$ ).

GALAH Phase 2 extended these times to 3 exposures of 10 or 30 min, respectively, and included repeat observations of GALAH Phase 1 main targets with another 3 exposures for 15 min. K2-HERMES observations targeted stars with $13 \leq V_{JK_S}\;/\;\mathrm{mag} \leq 15$ or even $13 \leq V_{JK_S}\;/\;\mathrm{mag} \leq 15.8$ to complement the K2 Galactic Archaeology Program (Stello et al. Reference Stello2015). These fields were observed for 2 h, similar to most globular and open cluster stars. Worse seeing conditions leading to increasing full-width at half maxima or thin clouds triggered between one ( $2 \,{\lt}\, \mathrm{seeing} \leq 2.5\,\mathrm{arcsec}$ ) and 3 ( $2.5 \,{\lt}\, \mathrm{seeing} \leq 3\,\mathrm{arcsec}$ ) additional exposures. In addition to the science frames, quartz fibre flat and ThXe arc observations were taken directly before or after each set of science exposures, and bias frames were taken at the beginning or end of each observing night.

Table 1. Overview of stars observed for the programs included in GALAH DR4. Numbers of open and globular cluster observations were estimated after observations as described in Section 2.3.3. We have observed 30 globular clusters (23 with $\geq$ 5 stars) and 361 open clusters (109 with $\geq$ 5 stars).

2.2. Spectroscopic data from GALAH observations

Since the commissioning of the HERMES spectrograph in late 2013 until 6 August 2023, the GALAH collaboration and its partners have observed and successfully reduced 1 085 520 spectra of 917 588 stars. Each single observation is given a unique sobject_id YYMMDDRRR01FFF that is based on its year (YY), month (MM), and day (DD) of observations, its exposure run number (RRRR), and the used fibre (FFF). A reduced example spectrum of the asteroid Vesta (observed on 15 January 2014 during run 22 through fibre 239 with sobject_id 210115002201239) is shown in Fig. 3 and used as a reference for a Solar spectrum. The reduction process to create FITS files of reduced spectra from two-dimensional images from the cameras employs an updated and publicly available version 6 of the already well-tested reduction pipeline (Kos et al. 2017). The file extensions are listed in Table 2 and created as follows.

Science frames are corrected by removing the bias, dividing out the different gains (provided in the FITS headers) of the two readout amplifiers per CCD, flagging bad pixels, and dividing by master flat field frames, as well as removing cosmic rays and scattered light. Subsequently, apertures for each fibre trace are identified and used to extract the individual spectra.

Figure 3. Comparison of normalised observed (black) and synthetic spectra (blue) of the asteroid Vesta with solar composition as well as examples of synthetic spectra with non-solar abundances. Panels (a–d) show the observed and best-fitting synthetic spectrum as well as their absolute residual (pink) for the four wavelength channels of the HERMES spectrograph. Panel (e) shows the beginning of the blue CCD 1 (left most part of panel a) with an additional synthetic spectrum with ten times higher [C/Fe] in orange, for which the $\mathrm{C}_2$ Swan bands are prominent. Panel (f) shows the beginning of the green CCD 2 (left most part of panel b) and exemplifies with a synthetic spectrum in green that also has a ten times lower [Na/Fe] abundance (for example, in accreted stars) can still be reliably detected. Panel (g) shows the end of the red CCD 3 with a synthetic spectrum of primordial Li abundance of $\mathrm{A(Li)} = 2.75$ in red. While this abundance could be detected, the line for the Solar value $\mathrm{A(Li)} = 1.05$ is barely detectable. Panel (h) shows the end of the infrared CCD 4, which would show strong molecular absorption features of the CN molecule for $\mathrm{[N/Fe]} = +1\,\mathrm{dex}$ (purple).

Wavelength calibrations are performed via Chebyshev polynomial functions based on the up to 62, 52, 41, or 31 emission lines within the ThXe arc frames of CCDs 1-4, with wavelengths reported in air, and the spectra are interpolated onto a linearly increasing wavelength grid. Typical root mean square values for the wavelength solutions of CCDs 1-4 are 0.010, 0.015, 0.019, and $0.028\,$ Å, respectively. The starting wavelength CRVAL1 and dispersion CDELT1 are saved in the headers of each FITS file.

Finally, sky lines are subtracted and telluric features removed, before a barycentric correction is applied to create the ‘reduced’ spectra that are saved in extension 0 of the reduction pipeline FITS files and used for the subsequent analysis. Reduction pipeline spectra are normalised by an eleventh order Legendre polynomial fit and saved in extension 1 of the reduction products.

Fractional noise/uncertainties are saved in extension 2 and calculated from the square root of the sum of squared counts, sky features (extension 3), scattered light (extension 5), and crosstalk (extension 6) measurements as well as the squared readout noise.

The wavelength dependent line spread functions (LSFs) are measured from the arc calibration frames for each spectrum and CCD by fitting modified Gaussian distributions with one boxiness parameter b per CCD and full width half maxima fwhm for each wavelength point in the spectrum, that is

(3)

\begin{align} \exp \left(-0.693147 \cdot \vert 2 \cdot \boldsymbol{{x}}/{fwhm} \vert^{b}\right) \end{align}

The array $\boldsymbol{{x}}$ then includes the pixels around each wavelength step that are used to apply the convolution from higher resolution to GALAH resolution spectra. The fitted values of fwhm are saved in extension 7 with b saved in the headers.

The achieved Signal-to-Noise Ratio (SNR) per pixel of the individual exposures depends on the spectral types, reddening, and observational conditions. In particular the repeat observations of previous pointings have increased the SNR for co-added spectra with respect to GALAH DR3. This can be appreciated from Fig. 4, where we plot the cumulative distribution function for all stars of GALAH DR3 (dashed lines) and GALAH DR4 (solid lines) for the different CCDs.

2.3. Auxiliary data from Gaia, 2MASS, and literature

To support our spectroscopic analysis, we make use of astrometric and photometric information from the Gaia satellite (Gaia Collaboration et al. 2016) and 2MASS survey (Skrutskie et al. Reference Skrutskie2006), which is available for essentially all our targets. We further use the value-added catalogues, like distance estimates for field stars by Bailer-Jones et al. (Reference Bailer-Jones, Rybizki, Fouesneau, Demleitner and Andrae2021) as well as open and globular cluster membership probabilities from Cantat-Gaudin & Anders (Reference Cantat-Gaudin and Anders2020) as well as Vasiliev & Baumgardt (Reference Vasiliev and Baumgardt2021) and Baumgardt & Vasiliev (Reference Baumgardt and Vasiliev2021).

2.3.1. Gaia DR3

We crossmatch our observations to the Gaia DR3 catalogue (Gaia Collaboration et al. 2021a, 2023) using the 2MASS ID, via the nearest neighbour crossmatches provided as part of Gaia DR3 (Torra et al. Reference Torra2021). 911 754 (99.0 %) also have astrometric information (Lindegren et al. Reference Lindegren2021b) and 849 867 (93.0 %) have radial velocity estimates (Katz et al. Reference Katz2023). We apply the corrections to both photometric (Riello et al. Reference Riello2021) and astrometric (Lindegren et al. Reference Lindegren2021a) information. Where possible we prefer the photo-geometric distances over the geometric distances from Bailer-Jones et al. (Reference Bailer-Jones, Rybizki, Fouesneau, Demleitner and Andrae2021). Where neither are available, we further try to find parallaxes from van Leeuwen (Reference van Leeuwen2007). The average parallax uncertainty of the GALAH stars is $\sigma_{\varpi} / \varpi = 1.6_{-0.9}^{+2.6}\,\mathrm{\%}$ . Only $2.3\,\%$ of GALAH stars have no parallax measurementsFootnote ^b or parallax measurements beyond 20% uncertainty, for which the priors adopted by Bailer-Jones et al. (Reference Bailer-Jones, Rybizki, Fouesneau, Demleitner and Andrae2021) start to dominate distance estimates.

2.3.2. 2MASS, WISE, and extinction

In addition to the excellent infrared photometry for 99.9 % of our sources from the 2MASS survey (Skrutskie et al. Reference Skrutskie2006), 98.7 % of them have far-infrared measurements from the WISE mission (Cutri et al. Reference Cutri2014). We therefore can estimate the extinction in the $K_S$ band via the Rayleigh-Jeans colour excess (RJCE) method (Majewski, Zasowski, & Nidever Reference Majewski, Zasowski and Nidever2011) $A_{K_S} = 0.917 \cdot \left( H - W2 - 0.08 \right)$ for most stars. We confirm this estimate by estimating the extinction in $K_S$ via the extrapolation of the colour extinction of $B-V$ , that is, $A_{K_S} \sim 0.36 \cdot E(B-V)$ (Cardelli, Clayton, & Mathis Reference Cardelli, Clayton and Mathis1989). We revert to this value if it is less than half the value of the RJCE estimate, or if either of the H and W2 bands does not have an excellent quality flag ‘A’. For negative estimates by the RJCE method and very nearby stars ( $\,{\lt}\,100\,\mathrm{pc}$ ) we null the value.

2.3.3. Open and globular cluster members and distances

We identify open cluster members using the membership catalogue from Cantat-Gaudin & Anders (Reference Cantat-Gaudin and Anders2020) via crossmatch with the Gaia source_id and adjust their parallaxes and distance estimates to the average cluster values if the latter are more precise. We identify globular cluster members (with more than 70% probability) via the membership catalogue from Vasiliev & Baumgardt (Reference Vasiliev and Baumgardt2021) by crossmatching with the Gaia source_id. We then adjust the parallaxes and distances for the member stars to the mean values listed by Baumgardt & Vasiliev (Reference Baumgardt and Vasiliev2021).

3. Synthetic spectra for 2DF-hermes

The goal of our spectroscopic analysis is to estimate the optimal set of stellar properties (labels) that influence a stellar spectrum by minimising the difference between observed stellar spectra and synthetic ones. In this data release, we push the analysis further by fitting up to 32 elemental abundances and stellar parameters simultaneously across the full GALAH wavelength range with the appropriate synthetic spectra.

Table 2. Data product 1: FITS files of reduced spectra.

To make this computationally feasible, we adopt a strategy inspired by Rix et al. (Reference Rix, Ting, Conroy and Hogg2016), where we create flexible models for smaller regions of the parameter space, utilizing only a limited number of ab initio synthetic spectra (see also Ting, Conroy, & Rix Reference Ting, Conroy and Rix2016). These synthetic spectra are calculated using Spectroscopy Made Easy (sme; Valenti & Piskunov Reference Valenti and Piskunov1996; Piskunov & Valenti Reference Piskunov and Valenti2017), covering the entire wavelength range and accounting for all visible atomic and molecular lines. The spectra are generated for random selections of elemental abundances and stellar parameters within the range allowed by marcs atmospheric models (Gustafsson et al. Reference Gustafsson2008), at much higher resolution and sampling than our 2dF-HERMES spectra. From these, we select subsets of spectra corresponding to restricted regions of the parameter space defined by $T_\mathrm{eff}$ , $\log g$ , and [Fe/H]. This method is analogous to using Solar twins (see, e.g. Nissen Reference Nissen2015) or performing differential abundance analysis of globular cluster stars (e.g. Yong et al. Reference Yong2013; Monty et al. Reference Monty2023). By reducing the impact of systematic uncertainties in atomic data and atmospheric models, these approaches have proven to be highly effective (Nissen & Gustafsson Reference Nissen and Gustafsson2018).

Figure 4. Cumulative Distribution Function (CDF) of the logarithmic Signal-to-Noise Ratio (SNR) per pixel for the 4 CCDs of the HERMES spectrograph comparing GALAH DR4 (solid lines) and GALAH DR3 (dashed lines).

For each parameter subset, we train a neural network to map stellar fluxes to their corresponding stellar parameters and abundances, similar to The Payne (Ting et al. Reference Ting, Conroy, Rix and Cargile2019). These models allow us to generate synthetic spectra across the full wavelength range for any combination of elemental abundances within the restricted parameter space in under a second – compared to the minutes or hours required by traditional physics-driven spectrum synthesis approaches.

Another key motivation for creating smaller training sets is the limited flexibility of interpolation methods when dealing with the full parameter space. Spectroscopic surveys like GALAH, RAVE, and APOGEE aim to fit all types of stellar spectra simultaneously, including Sun-like stars, red clump stars, metal-poor stars, evolved stars with strong molecular bands, and hot stars with shallow and broad lines. Attempting to model this vast range with a single model leads to systematic trends, particularly in extreme cases (Casey et al. Reference Casey2016; Buder et al. 2018; Ting et al. Reference Ting, Conroy, Rix and Cargile2019). To mitigate these issues, we deliberately limit the complexity of the models by creating smaller, more focused models. For example, the model for hot stars does not need to predict the strong molecular absorption features seen in cooler stars. The potential caveats and limitations of this approach are discussed in detail in Section 8.

In the following sections, we describe our approach to dividing the parameter space into smaller bins for training (Section 3.2) and explain how we generate high-resolution synthetic spectra for this parent sample (Section 3.2). We also outline how we train neural networks to rapidly interpolate these synthetic spectra (Section 3.3).

3.1. Stellar twin training sets rather than one-fits-all

The base grid for our training set computation is the marcs grid (Gustafsson et al. Reference Gustafsson2008), which is shown with red points in Fig. 5. Following the aforementioned idea of restricting ourselves to stellar siblings, we create multiple 3-dimensional bins in $T_\mathrm{eff}$ , $\log g$ , and [Fe/H] within $\pm 1$ grid points in $T_\mathrm{eff}$ (with either $\pm 250$ or $\pm 100\,\mathrm{K}$ ), $\log g$ ( $\pm 0.5\,\mathrm{dex}$ ), and [Fe/H] ( $\pm 0.5$ or $\pm 0.25\,\mathrm{dex} $ ). An example box is shown for Solar siblings as a blue box in Fig. 5, which is centred on $T_{\text{eff}} = 5\,750\pm250\,\mathrm{K}$ , $\log g = 4.5\pm0.5\,\mathrm{dex}$ and $\mathrm{[Fe/H]} = 0.0\pm0.25\,\mathrm{dex}$ .

Figure 5. Coverage in $T_\mathrm{eff}$ and $\log g$ of the MARCS2014 grid (red) and GALAH DR3 (black, including density countours). Shown is also an example of one of the 3D bins used to create stellar sibling models with each neural network. marcs grid points $T_\mathrm{eff}$ $ \,{\lt}\, 3\,100\,\mathrm{K}$ or [Fe/H] $\,{\lt}\,-3\,\mathrm{dex}$ are neglected for GALAH DR4.

Within these bins we sample 280Footnote ^c synthetic spectra with no rotational broadening, which are later broadened with different rotational velocities $v \sin i$ to create between 1 680 and 2 240 training set spectra for each bin. For clarity, we explain the parameter and abundance sampling for an example 3D bin centred on $T_{\text{eff}} = 5\,750\pm250\,\mathrm{K}$ , $\log g = 4.5\pm0.5\,\mathrm{dex}$ and $\mathrm{[Fe/H]} = 0.0\pm0.25\,\mathrm{dex}$ (see blue box in Fig. 5.

Stellar parameters ( $T_\mathrm{eff}$ , $\log g$ , [Fe/H], $v_\mathrm{mic}$ ) and elemental abundances [X/Fe] of all 32 elements are randomly sampled within reasonable limits (see examples in Fig. 6 and Table 3) and fed into sme to create self-consistent synthetic spectra over the full HERMES wavelength range for marcs atmospheres.

Figure 6. Coverage of stellar parameters and abundances for one of the 3D bins. Shown is the example of the Solar 3D bin ( $T_\mathrm{eff}\;/\;\mathrm{K} = 5\,750$ , $\log g\;/\;\mathrm{dex} = 4.5$ , $\mathrm{[Fe/H]}\;/\;\mathrm{dex} = 0.0$ ). Panel a): $T_\mathrm{eff}$ and $\log g$ , Panel (b): [Fe/H] vs. A(Li), Panel (c): [Fe/H] vs. [O/Fe], Panel (d): [Fe/H] vs. [Mg/Fe]. While $T_\mathrm{eff}$ , $\log g$ , and [Fe/H] are sampled randomly within the 3D bin, the abundances are sampled both narrowly (blue) and broadly (purple) within limits as described in the text. Red points indicate the median label values and orange points the adjusted label values (see Table 3) to test the gradient change of spectra with individual labels.

Microturbulence velocity ( $v_\mathrm{mic}$ ) values are sampled uniformly between the upper and lower limits of the empirical relation from GALAH DR3 (Eqs. 4 and 5 from Buder et al. Reference Buder2021) and an adjusted version of the relation of Dutra-Ferreira et al. (Reference Dutra-Ferreira, Pasquini, Smiljanic, Porto de Mello and Steffen2016). The latter has been adjusted for $T_{\text{eff}}^\prime = T_{\text{eff}}/\mathrm{K} - 5\,500$ as well as $\log g^\prime = \log g/\mathrm{dex} - 4.0$ to return $v_{\text{mic}}/\mathrm{km\,s^{-1}}$ :

(4)

\begin{align}v_{\text{mic}} = \begin{array}{l}1.198 + 3.16 \times 10^{-4} \cdot T_{\text{eff}}^\prime - 0.253 \cdot \log g^\prime \\ - 2.86\times 10^{-4} \cdot T_{\text{eff}}^\prime \cdot \log g^\prime + 0.165 \cdot (\log g^\prime)^2\end{array} \end{align}

3.2. High-resolution synthetic spectra with sme

We create training sets from high-resolution stellar spectra for each smaller 3D bin region of the parameter space. We compute oversampled synthetic intensity spectra at higher resolution and sampling than the typical GALAH resolution with sme for seven equal-area angles (see Fig. 7) of the plane-parallel or spherically symmetric stellar surfaces (Gustafsson et al. Reference Gustafsson2008).

Table 3. Example of boundaries for the uniform sampling of synthetic spectrum labels (stellar parameters and elemental abundances) for the 3-dimensional bin of Solar siblings 5750_4.50_0.00.

Figure 7. Example output of sme for a solar spectrum in HERMES CCD3 (around the Balmer $\mathrm{H}_{\unicode{x03B1}}$ line). Shown are the specific intensities (sme.sint) as a function of the viewing angle $\mu = \cos \theta$ .

For each spectrum, we first run a test on all available lines in the GALAH linelist. We use the same linelist as in GALAH DR3 (Buder et al. Reference Buder2021). This linelist was adapted from the linelist of Heiter et al. (Reference Heiter2021) and implements numerous updates to line data, such as updates or corrections of $\log gf$ values in the GALAH wavelength range. The test is used to restrict the myriad of possible molecular lines to the visible ones with sme.depth above 0.001, while keeping all atomic lines for the final synthesis.

Spectra are computed at a resolution of $R = 300\,000$ on a fine wavelength grid with 60 819 pixels spread over the extended wavelengths 4 675.1–4 949.9, 5 624.1–5 900.9, 6 424.1–6 775.9, and 7 549.1–7 925.9 Å. We note that these extend significantly beyond the actual GALAH wavelength range.

We use one-dimensional (1D) marcs atmospheres from the marcs grid (Gustafsson et al. Reference Gustafsson2008, version 2014) with a trilinear interpolation for combinations of $T_\mathrm{eff}$ , $\log g$ , and [Fe/H]. We use grids of non-LTE departure coefficients from Amarsi et al. (Reference Amarsi2020b), Amarsi, Liljegren, & Nissen (Reference Amarsi, Liljegren and Nissen2022) for atomic lines of H, Li, C, N, O, Na, Mg, Al, Si, K, Ca, Mn, Fe, and Ba. For most elements, the non-LTE departure coefficient grids include isotropic and coherent scattering for lines from background atomic and ionic species (see Equation 7 of Amarsi et al. Reference Amarsi2020b) as well as Thompson and Rayleigh scattering. The calculations for C include all background species in pure absorption (Equation 6 of Amarsi et al. Reference Amarsi2020b), whereas for Fe, Thompson and Rayleigh scattering were included but all background lines were treated in pure absorption.

Our synthetic grid explicitly includes C and N abundances. C was previously included in the analysis of GALAH DR3, but limited to the atomic C line. The analysis thus neglected the molecular absorption features of C₂ and CN at the beginning of CCD1 and end of CCD4, respectively. With the new self-consistent grid, we can include these features, as they hold valuable information for both C and N, as well as several other features through the molecular equilibrium in stars (see e.g. Ting et al. Reference Ting, Conroy, Rix and Asplund2018).

To be able to test that the flux-label correlations found by our interpolation routine are limited to reasonable wavelength ranges, we also calculate one spectrum that is exactly in the middle of the parameter range and additional spectra, where we increase the value of one label at a time (e.g. increase [O/Fe] by $1\,\mathrm{dex}$ ) to test the response in the synthetic spectrum.

To save computational costs, we compute synthetic spectra with no rotational or macroturbulence broadening ( $v_{\text{mac}} = v\sin i = 0\,\mathrm{km\,s^{-1}}$ ), but save the continuum flux (sme.cmod) and the specific intensities (sme.sint) as a function of the equal-area midpoints of each equal-area annulusFootnote ^d $\mu$ (see Fig. 7). We then apply the broadening of spectra due to rotation ( $v \sin i$ ) with the flux integration code of the python-implementation PySME (Wehrhahn, Piskunov, & Ryabchikova Reference Wehrhahn, Piskunov and Ryabchikova2023) of sme. Depending on the expected rotational velocities (increasing with temperature) we sample a range of

(5)

\begin{align} v \sin i/\,\mathrm{km\,s^{-1}} \in \{ 1.5, 3, 6, 9, 12, 18, 24, 36\}.\end{align}

Note that $v \sin i = 24 \,\mathrm{km\,s^{-1}}$ is only included for bins with $T_\mathrm{eff}$ $\geq 5\,000\,\mathrm{K}$ and $v \sin i = 36 \,\mathrm{km\,s^{-1}}$ for those with $T_\mathrm{eff}$ $\geq 6\,000\,\mathrm{K}$ .

3.3. Interpolating synthetic spectra with neural networks

To allow a fast interpolation with new and different stellar labels, we use data-driven models to connect stellar fluxes at given pixels from a combination of stellar labels. This method is well established in stellar spectroscopy through the successful applications of quadratic models with The Cannon (see e.g. Ness et al. Reference Ness, Hogg, Rix, Ho and Zasowski2015; Ness et al. Reference Ness2016; Casey et al. Reference Casey2016; Casey et al. 2017; Ho et al. Reference Ho2017; Buder et al. 2018) as well as neural networks with The Payne (see e.g. Ting et al. Reference Ting, Conroy, Rix and Cargile2019; Xiang et al. Reference Xiang2019; Xiang et al. Reference Xiang2022). Because of the needed flexibility to predict synthetic spectra with 36 stellar labels for a large parameter space (for a detailed discussion of advantages of neural networks over quadratic models see Ting et al. Reference Ting, Conroy, Rix and Cargile2019), we choose neural networks to interpolate between our synthetic spectra in this data release.

In this work, we utilise the neural network architecture and training algorithms from the spectrum interpolation framework of The Payne (Ting et al. Reference Ting, Conroy, Rix and Cargile2019). While we do not implement the full functionality of The Payne, we specifically adopt its spectrum interpolation capabilities. Unlike the version originally published by Ting et al. (Reference Ting, Conroy, Rix and Cargile2019), we use the architecture of the latest available version of The Payne. This modified architecture connects k stellar labels $\boldsymbol{\ell}$ with the flux f at each wavelength pixel $\lambda$ via

(6)

\begin{equation}f_\lambda = w \cdot \mathrm{lReLU} \bigg( \tilde{w}_\lambda^i \cdot \mathrm{lReLU} \Big( w^k_{\lambda i} \ell_k + b_{\lambda i} \Big) + \tilde{b} \bigg) + \bar{f}_\lambda,\end{equation}

which encapsulates the so-called layers of a neural network with $i = 300$ neurons with weights w and biases b as well as a leaky Rectified Linear Unit (lReLU)

(7)

\begin{equation} \mathrm{lReLU} (x) = \begin{cases} x \qquad &x \geq 0 \\ 0.01 x \qquad &x \,{\lt}\, 0. \end{cases}\end{equation}

After optimising the mean absolute error loss function for $10^4$ steps, we consider the network trained with an optimised combination of three sets of weights and biases within the minimum and maximum ranges of each label. We discuss the performance and caveats of this particular neural network architecture and training setup in Section 8.3. The trained networks can then be used with new input labels to quickly create synthetic spectra for the label optimisation. Computational resources could be conserved by training neural networks exclusively on spectra from non-rotating stars and subsequently applying broadening through convolution with a center-to-limb darkening law. This method, while less accurate, could enable the fitting of broader velocity ranges and enhance neural network performance by simplifying the spectral shapes they must learn. However, shifting the broadening process from training to post-processing does not necessarily guarantee a reduction in computational costs.

4. Single spectrum analysis (ALLSPEC)

As outlined in Section 1, the workflow of GALAH DR4 includes a first analysis step of all observed spectra without including non-spectroscopic information for the optimisation. This allows us to identify shifts in radial velocity between separate spectroscopic observations of the same starFootnote ^e and a better co-adding of spectra for the allstar analysis (see Section 5). Another motivation for this step is to get a first estimate of stellar labels without potentially biased photometric and astrometric information, for example for binary stars.

The optimisation of stellar labels thus aims to minimise the absolute difference between synthetic and observed spectra, weighted by their uncertainty. Starting from a set of initial labels (Section 4.1), we create high-resolution synthetic spectra and convolve them to the resolution and wavelength grid of each observed spectrum. We remind ourselves that in GALAH DR3, we used a repeated combination of spectrum normalisation followed by stellar parameter optimisation and a subsequent fit of individual elements with fixed stellar parameters. In the analysis of GALAH DR4, we perform an on-the-fly re-normalisation of the observed spectrum for every change of the simultaneously fitted stellar parameters and elemental abundances. This allows a more accurate comparison of synthetic and observed spectra (Section 4.2) and thus a more accurate stellar label optimisation (see Section 4.3).

4.1. Initial stellar labels

Initial values of all stellar labels are needed for creating a first synthetic spectrum. For $v_\mathrm{rad}$ , $T_\mathrm{eff}$ , $\log g$ , and $v \sin i$ we use a combination of sources. Where possible, we use the previous estimates from GALAH DR3 (Buder et al. Reference Buder2021), and otherwise use estimates from the GALAH DR4 reduction pipeline (Section 2.2). Because of the limited accuracy of the latter for cool stars with $T_{\text{eff}} \,{\lt}\, 4\,000\,\mathrm{K}$ as well as the hot stars with $T_{\text{eff}} \,{\gt}\, 6\,500\,\mathrm{K}$ , we perform a consistency check with photometric information from Gaia DR3 (Gaia Collaboration et al. 2021a) and 2MASS (Skrutskie et al. Reference Skrutskie2006). For most of the aforementioned cool and hot stars, we therefore prefer the parameters from the Gaia DR3 photometric pipeline GSP-Phot (Andrae et al. Reference Andrae2023; Fouesneau et al. Reference Fouesneau2023) as initial values.

In selected cases, we further adjust the starting parameters toward reasonable limits. For example, hot stars are likely to be young and are adjusted to close to Solar metallicity. Furthermore, we recalculate the initial $v_\mathrm{mic}$ based on Equation (4) and limit rotational broadening values to $3 \leq v \sin i \leq 10\,\mathrm{km\,s^{-1}}$ for stars below $T_{\text{eff}} = 5\,500\,\mathrm{K}$ and $3 \leq v \sin i \leq 20\,\mathrm{km\,s^{-1}}$ for hotter stars. The explicit choices of starting values for $T_\mathrm{eff}$ , $\log g$ , [Fe/H], $v_\mathrm{mic}$ , and $v \sin i$ are described in our online repositoryFootnote ^f and are depicted in Fig. A1.

Based on the value of [Fe/H] we apply an offset to the ${\unicode{x03B1}}$ -process elements O, Mg, Si, Ca, and Ti. The initial value is 0.4 for $\mathrm{[Fe/H]} \,{\lt}\, -1$ , 0.0 for $\mathrm{[Fe/H]} \,{\gt}\, 0$ , and $-0.4 \cdot \mathrm{[Fe/H]}$ for $-1 \leq \mathrm{[Fe/H]} \leq 0$ . All other abundances are initialised at $\mathrm{[X/Fe]} = 0$ .

4.2. Comparison of synthetic spectra to observations

The major aim of our spectroscopic analysis is to predict the best set of stellar labels by minimising the uncertainty-weighted difference between observed and synthetic spectra. In this section, we describe several important steps to enable the pixel-level comparison of the higher resolution, oversampled synthetic spectra created with the neural networks from Section 3.3 and the observational data at actually measured resolution and sampling (presented in Section 2.2).

4.2.1. Downgrading synthetic spectra to observed resolution

Because dedicated line-spread-function measurements are available for every spectrum (see Section 2.2), we use this information to downgrade our synthetic spectrum with Gaussian kernels on an equidistant velocity grid to the measured resolution of each observation. We then interpolate the oversampled synthetic spectrum onto the observed wavelength grid.

4.2.2. On-the-fly re-normalisation of observed spectra

Measurements of the GALAH flux and flux uncertainty are reported in counts by the reduction pipeline. To compare with our synthetic spectra, which are normalised to the continuum, we fit an outlier-robust polynomial function to the ratio of observed and synthetic spectra and re-normalise our observed spectra and their uncertainties via this normalisation function.

This specific approach is similar to the internal routine of sme and has the important advantage that no continuum points have to be defined. This is advantageous because we try to cover the full parameter range of FGKM stars for which positions of continuum points – corresponding to 1 on a (pseudo-)continuum-normalised spectrum – differ significantly or for which continuum points may not even be present, or will be a strong function of $T_\mathrm{eff}$ and [Fe/H] (as is the case for M stars).

We make two additional adjustments to the reduced spectra, which come in the form of counts and uncertainty per wavelength, $f_\lambda$ and $\sigma_{f,\lambda}$ .

As we compare the observation to model spectra, we do not have to restrict ourselves to an a priori normalisation, but can take into account the residual information on the continuum in parts of the spectrum. For each model spectrum that we compare to, we therefore perform a normalisation by fitting a fourth order Chebyshev polynomial with outlier clipping to the ratio of model and observation (see Fig. 8). This allows us to both overcome previous shortcomings of the synthetic analysis in GALAH+ DR3 (Buder et al. Reference Buder2021), which had to be restricted to small wavelength segments and assumed a linear relation for those. Our new approach allows us to properly assess the structure of deep and steep molecular features that can dominate spectra of cool stars and carry significant information on $T_\mathrm{eff}$ , $v_\mathrm{rad}$ , as well as abundances (Mann et al. Reference Mann, Gaidos, Lépine and Hilton2012).

Figure 8. Example of normalisation for GALAH DR4 for a model spectrum ( $T_\mathrm{eff} = 3\,400\,\mathrm{K}$ , $\log g = 1.5$ , $\mathrm{[Fe/H]} = -1.0\,\mathrm{dexbest-fitting }$ ) that is selected during the label optimisation. Panel (a): Observed spectrum (counts). Panel (b): Ratio (blue) of observed spectrum and model spectrum as well as Chebyshev polynomial fit (orange). Panel (c): Normalised observed spectrum (black) compared to the model spectrum (blue). Residuals (red) can then be used as input for the likelihood function.

4.3. Stellar label optimisation

In up to four major loops, we optimise the radial velocities and all other stellar labels and report (a) their values, (b) their co-variances, (c) the best-fitting synthetic and re-normalised spectra along with (d) their uncertainties and (e) masks that indicate which pixels were used in the final optimisation.

Starting from the initial values, a first synthetic spectrum is computed and compared with the observation in order to assess the initial radial velocity. This is done by applying the scipy.signal.find_peaks algorithm on the normalised inverse residuals of non-shifted observed and synthetic spectra, when the latter is shifted by $v_{\text{rad}} = -1\,000..(2)..1\,000\,\mathrm{km\,s^{-1}}$ (see Fig. 9a). If no peak is found, the initial $v_\mathrm{rad}$ value is used hereafter. If more than one peak is found (see Fig. 9 with Gaia DR3 agreeing with the systemic radial velocity), the two strongest peaks are reported. For the purpose of the single star analysis, a narrower search is conducted around the highest peak with a $v_\mathrm{rad}$ shift of $-20.00..(0.04)..20.00\,\mathrm{km\,s^{-1}}$ around said peak by fitting a Gaussian function to the inverse of the residuals that were normalised with the smallest residual values (see Fig. 9c). The mean of this fit and its uncertainty are reported by the pipeline.

Figure 9. Output of the radial velocity fitting step. Panel (a) shows the initial broad search on a $v_\mathrm{rad}$ array of $-1000..(2)..1000\,\mathrm{km\,s^{-1}}$ . In the case of 2MASS J060846577815235, two peaks (yellow, dashed) are visible for this double-lined spectroscopic binary. Panel (b) shows the same plot, but overlaid with the GALAH DR4 reduction pipeline (red) and Gaia DR3 (blue, dashed) estimates for $v_\mathrm{rad}$ . Panel (c) shows the narrow window of $-20.00..(0.04)..20.00\,\mathrm{km\,s^{-1}}$ around the highest peak and its Gaussian fit (yellow). Despite their low resolution (26 KB), these on-the-fly created diagnostic images already occupy 50GB in total.

The centerpiece of our optimisation is the scipy.optimize module’s curve_fit function (Virtanen et al. Reference Virtanen2020), which we call with counts and uncertainties (our absolute sigmas) as input for a placeholder function that self-consistently re-normalises the observed spectrum. We estimate the labels via the least squares optimisation within less than $10^4$ iterations and a desired relative error (xtol) below $10^{-4}$ .

For each optimisation loop, a new, best-fitting 3D bin and neural network is identified via a grid search in the $T_\mathrm{eff}$ , $\log g$ , and [Fe/H] dimensions with sklearn.cKDtree. If the stellar labels that are being fitted have changed (for example if an element is deemed not detectable for the new 3D bin during the neural network training), the label and its value are either set to or initialised with $\mathrm{[X/Fe]} = 0$ .

While the optimisation of the neural network selection has not converged (the final parameters $T_\mathrm{eff}$ , $\log g$ , and [Fe/H] are not within the current 3D bin), the optimisation is repeated, starting with the previous best-fitting parameters as starting guesses.

We measured the time taken for the individual steps in the curve_fit function’s execution to be approximately $80\,\mathrm{ms}$ . The total fitting process for stellar labels, including input/output overheads, was timed at $89_{-29}^{+77}\,\mathrm{s}$ for the allspec module, and $125_{-33}^{+81}\,\mathrm{s}$ for the more complex allstar module.

4.3.1. Which stellar labels are optimised?

As part of the synthetic grid computations, we have sampled each label of stellar parameters and elemental abundances individually between our chosen maximum and minimum ranges (see Section 3.1). This allows us to also judge which stellar labels to fit for each given star. We choose to fit a stellar label if either of these two cases applies to said label for the GALAH wavelength range when neglecting the cores of the Balmer lines: (i) The normalised spectrum between minimum and maximum label value at any pixel exceeds the threshold of 0.007 or (ii) The normalised spectrum between the minimum and maximum value changes by more than 0.005 for at least 25% of the pixels. While the first case is constructed for atomic lines, such as Li i 6 708 Å, the second case addresses in particular molecular lines like the C_2 and CN lines. The pipeline can handle missing arms, for example in the case of readout issues of a CCD, and will fix abundances to the scaled-Solar values for elements with absorption features solely in the missing arm, for example N, O, K, and Rb for CCD4.

Figure 10. Examples of masks applied to unreliable pixels for the spectrum fitting, which is done by the minimisation of residuals (red) between observation (black) and synthesis (blue). Panel (a) shows a strong synthetic line, where no line is observed in the data. Panel (b) shows an observed line without any line being synthesised. Panel (c) shows significant disagreement between the two observed lines and the synthesis.

Initial tests of the pipeline have revealed that in cases where the initial parameter estimates deviate significantly from the final values, several elemental abundance estimates were shifted towards their boundaries, leading to a masking of their elemental abundance lines by the masking module (Section 4.3.2) at the beginning of each optimisation loop. To minimise this effect, we therefore shift the interim abundance values towards the narrow label boundaries. In practice, we limit the initial and interim abundances to 1.05.3.26 for A(Li), $\mathrm{[X/Fe]} = -0.5..1.0$ for C, N, O, Y, Ba, La, Ce, and Nd, and $\mathrm{[X/Fe]} = -0.5..0.5$ for all other elements before optimising them again. For warm and hot stars ( $T_{\text{eff}} \,{\gt}\, 6\,000\,\mathrm{K}$ ), this effect was seen to affect multiple abundances, such that we needed to implement a reset of all abundances except Li to their initial values for stars above $6\,000\,\mathrm{K}$ , which would on average be expected to be young and have a Solar-like composition.

4.3.2. Masking of unreliable wavelength regions

Not all pixels of the observed or synthetic spectra might prove useful for estimating reliable stellar labels. Observations can include bad pixels/patterns and incorrect corrections (for example of telluric or sky lines). Flux predictions of synthetic spectra are only as good as the input physics (limited for example for specific lines via uncertain oscillator strengths).

To minimise the influence of inaccurate synthetic pixel predictions, we have compared a 2dF-HERMES observation of the asteroid 4 Vesta and a high-quality Solar spectrum from Hinkle et al. (Reference Hinkle, Wallace, Valenti and Harmer2000) with the flux that would be predicted by our pipeline for a star with Solar labels ( $T_{\text{eff}} = 5\,772\,\,\mathrm{K}$ , $\log g = 4.438\,\mathrm{dex}$ , $\mathrm{[Fe/H]} = 0.00\,\mathrm{dex}$ , $v_{\text{mic}} = 1.06\,\,\mathrm{km\,s^{-1}}$ , $v \sin i = 1.6\,\,\mathrm{km\,s^{-1}}$ , $v_{\text{mac}} = 4.2\,\,\mathrm{km\,s^{-1}}$ Prša et al. Reference Prša2016; Jofré et al. Reference Jofré2017, and $\mathrm{[X/Fe]} = 0.00\,\mathrm{dex}$ for the default Solar abundance pattern for marcs by Grevesse, Asplund, & Sauval Reference Grevesse, Asplund and Sauval2007).

We have identified all lines that showed differences of the normalised flux of more than $0.1$ , lines where either a synthetic line or an observed one was completely missing, or lines that were significantly misaligned. Examples of masksFootnote ^g are shown in Fig. 10. To avoid the influence of bad spectrum regions with an observational origin, we mask pixels where the synthetic and re-normalised observed spectra differ by more than $5\sigma$ or a flux of 0.3 (0.4 before the initial optimisation). To avoid the masking of lines that are vital for our spectroscopic analysis, we have created a listFootnote ^h with segments of such lines that is mainly based on the previous element masks from GALAH DR3 (Buder et al. Reference Buder2021). The final mask of pixels to use for the optimisation then includes all vital line regions, as well as those wavelengths that do not show a too strong disagreement between observation and synthesis and are not deemed unreliable in their synthesis.

In addition to this default masking, we exclude pixels for each major iteration, for which the flux of observation and synthesis differ by more than $5 \sigma$ and 30% of the normalised flux and by more than 100% of the normalised flux for the vital line regions.

We further indirectly take into account the currently less constrained molecular data for cool stars in optical spectra, in particular towards the blue (e.g. Rains et al. Reference Rains2021; Rains et al. Reference Rains2024). For presumably cool stars (with initial $T_{\text{eff}} \,{\lt}\, 4\,100\,\mathrm{K}$ ), we therefore double the observational uncertainty of the blue arm.

5. Single star analysis (ALLSTAR)

After the allspec module (Section 4) has been used to estimate spectroscopic labels for all spectra, we use the allstar module to co-add spectra and analyse one spectrum per star while taking into account photometric and astrometric information to constrain the surface gravities.Footnote ⁱ The optimisation of stellar spectroscopic parameters with the help of non-spectroscopic information was successfully applied for GALAH DR3 (Buder et al. Reference Buder2021), using Gaia DR2 distances (Bailer-Jones et al. Reference Bailer-Jones, Rybizki, Fouesneau, Mantelet and Andrae2018) to overcome spectroscopic degeneracies. For the co-adding, we test whether the radial velocity estimates of individual exposures agree within $2\sigma$ . Below this threshold, we apply no radial velocity correction and fit a global radial velocity. Above this threshold (which is useful for single-lined spectroscopic binaries as shown in Fig. 11), we apply a radial velocity correction before co-adding.

Figure 11. Example of radial velocity evolution over modified Julian Date (vertical lines show the beginning of 2016, 2019, and 2022) for a single-lined spectroscopic binary (SB1).

To speed up computation, we use the mean results of the allspec analyses as initial stellar labels for the allstar analysis. All other methodology of the comparison of synthetic spectra to observations (Section 4.2) and label optimisation (Section 4.3) apply also to this module, with the exception of the optimisation of $\log g$ . Contrary to the allspec approach, we do not fit $\log g$ in this module, but estimate the logarithmic surface gravity $\log g$ using a combination of its definition ( $g \propto \frac{\mathcal{M}}{\mathcal{R}^2}$ ) and the Stefan-Boltzmann law relative to the Solar values:

(8)

\begin{equation}\log g = \log g_\odot + \log \frac{\mathcal{M}}{\mathcal{M_\odot}} + 4 \log \frac{T_\mathrm{eff}}{T_\mathrm{eff,\odot}} - \log \frac{L_\mathrm{bol}}{L_\mathrm{bol,\odot}} \end{equation}

While we can use our spectroscopically determined $T_\mathrm{eff}$ in Equation (8), the other values have to be estimated through models or non-spectroscopic information. The logarithmic bolometric luminosity, $L_\mathrm{bol}$ , can be estimated from the bolometric magnitude $M_\mathrm{bol}$ , such that $\log \frac{L_\mathrm{bol}}{L_\mathrm{bol,\odot}} = -0.4 \cdot \left(M_\mathrm{bol} - M_\mathrm{bol,\odot} \right)$ . The bolometric magnitude can be estimated from any given apparent magnitude, if we correct the latter for the distance modulus, bolometric correction, and extinction. Because essentially all stars in GALAH DR4 have high-quality infrared magnitudes available that suffer less from (uncertain) extinction corrections, we use $K_S$ as the magnitude to estimate our bolometric magnitudes and luminosities via

(9)

\begin{equation}M_\mathrm{bol} = K_S - 5\cdot \log \frac{D_\varpi}{10} + BC(K_S) - A(K_S). \end{equation}

While the values for $K_S$ , curated distances $D_\varpi$ (rather than raw parallaxes $\varpi$ ), and $A(K_S)$ are readily available (see Section 2.3), we need to estimate the bolometric correction from tabulated values using the routines provided by Casagrande & VandenBerg (2018):

(10)

\begin{equation}BC(K_S) = f(T_\mathrm{eff}, \log g, \mathrm{[Fe/H]})\end{equation}

We choose to assume an extinction value of $E(B-V) = 0\,\mathrm{mag}$ for this particular interpolation and post-correct the value by $A(K_S)$ based on the actual extinctions. The reason for this is that the latter values can exceed the maximum tabulated values of $E(B-V) = 0.72\,\mathrm{mag}$ of Casagrande & VandenBerg (2018).

Because of the appearance of $\log g$ in Equation (10), we iterate the calculation of $BC(K_S)$ and subsequently $\log g$ up to four times or until the latter value changes less than $0.02\,\mathrm{dex}$ between iterations. Similarly, we need to estimate the stellar masses (and ages as a byproduct) from tabulated values, that is,

(11)

\begin{equation}\mathcal{M}, \tau = f(T_\mathrm{eff}, \log g, \mathrm{[Fe/H]}, L_\mathrm{bol,\odot})\end{equation}

For this on-the-fly estimate of masses and ages we use a likelihood-weighted estimate with default uncertainties of $100\,\mathrm{K}$ , $0.25\,\mathrm{dex}$ , $0.2\,\mathrm{dex}$ , respectively, and an average uncertainty of $L_\mathrm{bol,\odot}$ from propagated uncertainties of Equation (9). We weigh the ages and masses via their likelihood of all isochrone grid points within these uncertainties of the parsec+colibri isochrones (Bressan et al. 2012; Marigo et al. Reference Marigo2017), which cover the logarithmic ages of $\log (\tau/\mathrm{Gyr}) = 8.00..(0.01)..10.18$ by default and metallicities $\mathrm{[M/H]} = -2.75..(0.25)..-0.75$ as well as $\mathrm{[M/H]} = -0.6..(0.1)..0.7$ . We exclude hot stars above $10\,000\,\mathrm{K}$ as well as extremely evolved white dwarf and extremely luminous giant stars ( $\log g \,{\gt}\, 6\,\mathrm{dex}$ or $J - K_S \,{\gt}\, 2\,\mathrm{mag}$ ) as they fall far outside our spectroscopic pipeline range. We convert between the theoretical [M/H] and our measured [Fe/H] as well as an assumed [/Fe] enhancementFootnote ^j via the correlation of Salaris & Cassisi (Reference Salaris and Cassisi2006), $\mathrm{[M/H]} = \mathrm{[Fe/H]} + \log\left(10^{{[{\unicode{x03B1}}/\textrm{Fe}]}} \cdot 0.694 + 0.306 \right)$ . For open clusters with age estimates below $1\,\mathrm{Gyr}$ as well as unevolved stars that are more luminous than expected from the oldest cool main-sequence isochrone with matching [M/H], we sample $\log (\tau/\mathrm{Gyr}) = 6.19..(0.01)..10.18$ . For globular cluster stars identified in the crossmatch with Baumgardt & Vasiliev (Reference Baumgardt and Vasiliev2021), we limit the isochrones to a minimum age of $4.5\,\mathrm{Gyr}$ .

6. Post-processing

After the allspec and allstar modules have been run for a night’s data (see Sections 4 and 5, respectively), a post-processing routine is used to estimate additional parameters from the residuals of the spectra (Section 6.1), estimate and validate accuracy and precision uncertainties (Section 6.2), and perform quality assurance tests on a global scale (flag_sp, see Section 6.3) as well as for the individual abundances of elements X (flag_X_fe, see Section 6.4).

6.1. Analysis of spectral residuals

6.1.1. Binary signatures

The residual spectrum of our best-fitting single star analysis can help us to identify a second flux contributor to the observed spectrum. In our case, there are two points in the analysis where we can identify such an influence. Firstly, the residuals are visible in the $\chi^2$ distribution as a function of radial velocity shifts (see Fig. 9). While a single star would only show one peak (saved as rv_comp_1), a binary system like 2MASS J06084657-7815235 shows a second peak ( $-70\,\mathrm{km\,s^{-1}}$ in addition to $74\,\mathrm{km\,s^{-1}}$ ) that is saved as rv_comp_2. Secondly, we perform an automatic search for reoccuring residuals as a function of radial velocity for a few selected lines. We chose the combination of strong lines in the spectra (Balmer lines, Fe lines at 4 890 and $4\,891\,$ Å, Ni at $6\,644\,$ Å) as well as those with the largest expected wavelength shift in the infrared detector (O triplet at 7 772–7 775 Å as well as Mg at 7 692 Å). If we find several peaks with a reasonably similar radial velocity, the likely $X \in {16,50,84}\text{th}$ percentiles of this radial velocity are saved in sb2_rv_X.

Figure 12. Comparison of spectroscopic and photometric $\log g$ estimates in the allspec analysis. Panel (a) shows the distribution of spectroscopic $\log g$ and $T_\mathrm{eff}$ from the allspec module. Panel (b) shows the distribution of the same $T_\mathrm{eff}$ and photometric $\log g$ . Panel (c) shows the difference of photometric $\log g$ and spectroscopic $\log g$ as a function of photometric $\log g$ . Red error bars indicate the $1\sigma$ percentiles of this difference in $0.5\,\mathrm{dex}$ bins.

Because radial velocities from the Gaia radial velocity spectrometer (Katz et al. Reference Katz2023) are reported in Gaia DR3 for 94% (774 914) of the stars observed for GALAH DR4, we can also compare against those radial velocity estimates. For 6% (50 577) of our stars, we find a difference with respect to Gaia DR3 larger than $10\,\mathrm{km\,s^{-1}}$ . For these stars, we often noticed unrealistically high $v_\mathrm{mic}$ and $v \sin i$ or negative velocities in our allspec analysis. We note that the allspec analysis was run without boundary conditions for global parameters and thus also resulted in negative velocities, which are later flagged and might indicate binarity (Section 6.3). allstar, however, was run with $v_\mathrm{mic}$ and $v \sin i$ forced to be above $0\,\mathrm{km\,s^{-1}}$ .

6.1.2. Post-correction of logg for allspec results

While we estimate logarithmic surface gravities $\log g$ solely from spectra in the allspec results, we also perform a post-processing estimate where we employ the methodology of Section 5 while fixing all other stellar parameters. The approach of only using spectroscopic information confirmed the previous conclusions of GALAH DR1-DR3 that the spectroscopic information in HERMES spectra to estimate $\log g$ is not sufficient for the majority of the parameter space for the given SNR. We show the spectroscopic $\log g$ in Fig. 12a and the photometric $\log g$ and their difference in Fig. 12b and c, respectively.

Figure 13. Example of three diffuse interstellar bands (DIBs) and interstellar K absorption for 2MASS J06453479-0102137 with an $E(B-V) = 0.84\,\mathrm{mag}$ value from Schlegel et al. (Reference Schlegel, Finkbeiner and Davis1998). Shown are the observation (black) and stellar fit (blue) as well as a Gaussian fit (red) to the residual (orange), resulting in an estimate of the equivalent width (EW) as well as radial velocity.

Figure 14. All-sky map (l,b) of GALAH DR4 equivalent width measurements of the diffuse interstellar band around 5 780 Å, with the GSPhot extinction by Andrae et al. (Reference Andrae2023) in the background.

We see an overall good agreement of both $\log g$ estimates for stars between $4\,250 \,{\lt}\, T_{\text{eff}} \,{\lt}\, 6\,500\,\mathrm{K}$ . Hotter stars show a strong dispersion of spectroscopic $\log g$ due to limited information from fewer and shallower lines. Cooler stars show a significant trend towards much lower $\log g$ for main-sequence stars and much higher $\log g$ for cool evolved stars up to an order of $\Delta \log g$ of $1\,\mathrm{dex}$ . This trend was previously seen in GALAH DR2 (Buder et al. 2018) and is believed to be caused by the onset of molecular absorption features which suppress the continuum for almost the entire HERMES wavelength range (see for example Fig. 8), thus introducing several degeneracies. In addition, we can notice a significantly lower precision of the spectroscopic $\log g$ in comparison to the excellent precision of photometric $\log g$ , for example in the red clump stars.

On closer inspection, we notice several trends in Fig. 12a. Most notably, we see noding patterns along the $T_\mathrm{eff}$ and $\log g$ grids where the allspec module switches between different neural network models. Our investigation of these noding effects is addressed in Section 8. In comparison to Fig. 12b, where a clear equal-mass binary sequence is visible just above the cool main-sequence, we do not see such a sequence in Fig. 12a. The difference between spectroscopic and photometric $\log g$ will therefore be useful to identify photometric binaries with high quality spectra with $\log g$ precisions below the single to binary system offset of up to $\Delta \log g = 0.3\,\mathrm{dex}$ , as discussed in Section 6.3. We caution, however, that the use of stellar structure models for the estimation of surface gravities can introduce systematic trends, as we discuss in Section 8.4.

6.1.3. Interstellar absorption

Because we can create synthetic stellar spectra for the full wavelength range, we can now also trace interstellar absorption in the residuals of observed spectra. By default, we try to calculate the equivalent width via Gaussian fits to the three strongest diffuse interstellar bands (DIBs; 5 780.59, 5 797.19, 6 613.66 Å) with central wavelengths identified by Vogrinčič et al. (2023) as well as for interstellar K ( $7\,698.9643\,$ Å), see Fig. 13. We report the equivalent widths eq_x, standard deviations sigma_x and radial velocities rv_x Footnote ^k for x in k_is for interstellar K and x in DIB_5780, DIB_5797, and DIB_6613 for the DIBs. The coverage of interstellar material, estimated from DIB_5780, within $D_\varpi \,{\lt}\, 5\,\mathrm{kpc}$ is shown in an all-sky map in Fig. 14, with the GSPhot extinction by Andrae et al. (Reference Andrae2023) in the background.

6.1.4. Emission estimates for the Balmer lines

The shape of the Balmer absorption lines holds valuable information on active stars as well as masses for evolved stars (Bergemann et al. Reference Bergemann2016) and possibly even information on unresolved binary systems (Sayeed et al. Reference Sayeed2024). Although the cores of these lines suffer from inaccuracies in the synthesis, the residuals of synthetic and observed lines can be used in relative analyses. We therefore perform a trapezoidal integration around the Balmer lines of each normalised spectrum at $4\,861.3230$ and $6\,562.7970\,$ Å whose values we report in ew_h_beta and ew_h_alpha. By default we integrate in a window of $\pm 0.75$ and $1.25\,$ Å for $\text{H} {\unicode{x03B2}}$ and $\text{H} {\unicode{x03B1}}$ , respectively, and increase this window to $5\,$ Å if the average observed, normalised flux within $\pm 0.5\,$ Å of the Balmer line core exceeds 1. An example of such a star is shown in Fig. 15, for which we measure a residual EW of $-1.09\,$ Å. Most emission line stars in the GALAH sample are found in the region of pre-main-sequence and hot stars (see Fig. AC6a). We conservatively only flag stars with a median normalised flux above 1 in $\text{H} {\unicode{x03B2}}$ or $\text{H} {\unicode{x03B1}}$ as emission line stars.

Figure 15. Example of a flagged emission star with clear emission in the Balmer lines (here H ${\unicode{x03B1}}$ ).

6.2. Uncertainty estimation and validation

The uncertainties that we report for our spectroscopic data analysis are based on the comparison to literature measurements (see also Beeson et al. Reference Beeson2024) to estimate accuracy uncertainties and a combined precision uncertainty estimate from adjusted covariance estimates from the fitting process and the scatter of repeat observations. Formally, we estimate the total variance of measurements as a combination of the accuracy and precision variance

(12)

\begin{align} \sigma_\mathrm{total}^2 = \sigma_\mathrm{accuracy}^2 + \sigma_\mathrm{precision}^2\end{align}

Representative values of accuracy and precision for our stellar parameters are listed in Table 4. We lay out how we estimate and validate accuracy and precision uncertainties in Sections 6.2.1 and 6.2.2, respectively.

6.2.1. Accuracy estimation and validation

Estimating the accuracy of spectroscopic measurements has always been a complicated endeavour, because there are no universal benchmark sets for all parameters across all stellar types. Subsequently, we describe the numerous comparisons that we have performed for both stellar parameters ( $T_\mathrm{eff}$ , $\log g$ , [Fe/H], $v_\mathrm{mic}$ , $v \sin i$ , and $v_\mathrm{rad}$ ) as well as the elemental abundance measurements. Consistent with GALAH DR3 (Buder et al. Reference Buder2021), and caused by the limited coverage of benchmark literature, we continue to use a single accuracy estimate for each stellar parameter and ignore the possibly large accuracy uncertainties for the individual elemental abundances. In all cases, we estimate an overall bias with respect to literature values and then combine these estimates to a globally applied zero-point correction. Where not explicitly stated otherwise, we assume that the spread of stellar parameters residuals is indicative of the accuracy of either method and estimate our accuracy by dividing the parameter spread by $\sqrt{2}$ . The applied shifts are listed in Table C1. We estimate the accuracy and bias correction for stellar parameters (including the iron abundance as a global parameter) and abundances separately.

Our primary reference source for parameter accuracy remains the Gaia FGK benchmark stars (Jofré et al. Reference Jofré2014; Jofré et al. Reference Jofré2015; Jofré et al. Reference Jofré2018; Heiter et al. Reference Heiter2015). Additionally, we use asteroseismic estimates from the K2 and TESS photometry (Zinn et al. Reference Zinn2020; Hon et al. Reference Hon2021) to compare our surface gravities and perform a validation to higher quality observations of globular cluster stars with typically lower metallicities (Carretta et al. Reference Carretta, Bragaglia, Gratton and Lucatello2009a; Carretta et al. Reference Carretta2009b; Johnson & Pilachowski Reference Johnson and Pilachowski2010). Because the overlap with APOGEE DR17 (Abdurro’uf et al. Reference Abdurro’uf2022) has increased from 41 941 stars in GALAH DR3 to 60 046 stars with 92 368 repeat observation matches in GALAH DR4, we also can assess systematic trends for a larger parameter space. For clarity, we discuss the stellar parameters separately, but show most accuracy estimates in a combined Fig. 16.

Table 4. List of accuracy and representative precision uncertainties for stellar parameters in GALAH DR4. Accuracy values are estimated from comparisons with literature references (see Section 6.2.1), whereas precision estimates are estimated from covariance uncertainties and repeat observations (Section 6.2.2). Here, we list the median precision uncertainties for stars with $SNR = 50 \pm 10$ on CCD2 (see Fig. 20).

Figure 16. Accuracy of the main stellar parameters $T_\mathrm{eff}$ , $\log g$ , [Fe/H], $v_\mathrm{mic}$ , $v \sin i$ , and $v_\mathrm{rad}$ for GALAH DR4. Each panel shows the comparison to literature (DR4 – literature) with median values as lines and contours between 16th and 84th percentiles. Comparisons are performed for the Gaia FGK Benchmark stars (red), APOGEE DR17 (blue), $\log g$ inferred from asteroseismic measurements (orange) and Gaia DR3 radial velocities (purple).

$T_\mathrm{eff}$ : The effective temperature estimates from GALAH DR4 show good agreement with the Gaia FGK benchmark stars (Fig. 16a). Specifically, we find a mean difference of $\Delta T_\mathrm{eff} = 21 \pm 92\,\mathrm{K}$ , indicating no significant bias between our temperatures and the benchmark values. Comparisons with APOGEE DR17 show an equally robust agreement, with $\Delta T_\mathrm{eff} = -8 \pm 78\,\mathrm{K}$ . This small offset and uncertainty suggest that the GALAH DR4 $T_\mathrm{eff}$ estimates are highly reliable across a wide range of at least G- and K-, but possibly also F- and M-type stars. Here, we use $1/\sqrt{2}$ of the residual spread with respect to Gaia benchmark stars as our accuracy estimate.

$\log g$ : For surface gravity, we compared our $\log g$ estimates to both the Gaia FGK benchmark stars, asteroseismic measurements from Zinn et al. (Reference Zinn2020) and Hon et al. (Reference Hon2021), and APOGEE DR17. The asteroseismic $\log g$ values are derived from $\nu_\mathrm{max}$ measurements for giant stars, and they show excellent agreement with our results, with a mean difference of $\Delta \log g = 0.026 \pm 0.078$ . Both the asteroseismic comparison as well as the Gaia benchmark star comparison ( $\Delta \log g = -0.011 \pm 0.059$ ) and APOGEE DR17 ( $\Delta \log g = 0.00 \pm 0.10$ ) agree well and show no trends across the $\log g$ range. For the low metallicity regime, we compare GALAH $\log g$ values with asteroseismically derived values from Howell et al. (Reference Howell, Campbell, Stello and De Silva2022) for the globular cluster M 4 (NGC 6121). Stars from this cluster were observed as part of a dedicated survey (PI M. Howell) aimed at spectroscopically characterising their sample of stars observed by the K2 mission (Howell et al. Reference Howell2014). Across the 75 overlapping targets, we find a $\Delta \log g = 0.056 \pm 0.128$ . The comparison between independently derived light element abundance variations and asteroseismic masses will be presented in an upcoming paper (Howell et al., in preparation.). This is a significant improvement over GALAH DR3, where significant deviations were found for luminous giant stars – whose parameter estimates in GALAH DR3 suffered from less precise and systematically biased distance and thus $\log g$ estimates. We find significant outliers, however, particularly for primary red clump stars, which were mistaken as secondary red clump stars, leading to larger deviations. We discuss this issue later in Section 8.4. Because this single group is driving the scatter in our disagreement with the asteroseismic estimates, we revert to the Gaia benchmark stars to estimate the accuracy.

[Fe/H]: The comparison of GALAH DR4 metallicities to the Gaia FGK benchmark stars initially showed the similar bias of GALAH towards more metal-poor values at the $0.049\,\mathrm{dex}$ level. The application of a zero-point correction (see Table C1) yields an excellent agreement, with $\Delta \mathrm{[Fe/H]} = 0.004 \pm 0.067$ for the benchmark stars and $\Delta \mathrm{[Fe/H]} = -0.022 \pm 0.061$ for APOGEE DR17, confirming the reliability of the GALAH DR4 metallicity estimates across a large range of metallicities. For the metal-poor regime, benchmark estimates are still rare. Luckily, a dedicated observing program – whose results are included in this data release – was performed and an overview of globular cluster Kiel diagrams is appended in Fig. AC5. We therefore only perform a comparison with globular cluster stars – often measured in 1D LTE – to get a quantitative impression of the agreement. We restrict ourselves to a few studies, namely those by Carretta et al. (Reference Carretta, Bragaglia, Gratton and Lucatello2009a,b) for NGC 104, 6121, 288, 6397, and 7099 as well as Johnson & Pilachowski (Reference Johnson and Pilachowski2010) for NGC 5139. In all cases, we find a good agreement of the metallicity distribution function for overlapping stars within the uncertainties (see Fig. 17). While this does not necessarily confirm our accuracy, it shows consistency within this uncertain parameter regime. We note however, a specific region in NGC 104, where the metallicity of the most luminous giants ( $T_\mathrm{eff} \,{\lt}\, 3\,750\,\mathrm{K}$ and $\log g \,{\lt}\, 0.5$ ) is incorrectly estimated near the Solar value. We discuss this problem in detail as a caveat in Section 8.7, since we have not been able to systematically flag these stars. More generally, we note that the strong and unexpected abundance trends with $T_\mathrm{eff}$ or $\log g$ in globular clusters from GALAH DR3 have decreased for most elements. However, we still urge users to take caution when using globular cluster abundances, and we discuss this further in Section 8.6. A custom, by hand analysis of globular cluster abundances beyond [Fe/H] will be performed in a separate study (McKenzie et al., in preparation), as these observations have been part of a dedicated observing program (PIs M. McKenzie and M. Howell). Similarly, a dedicated verification of open cluster observations (PIs J. Kos and G. De Silva) will be performed in a separate study (Kos et al. Reference Kos2025).

Figure 17. Comparison of iron abundances (16th, 50th and 84th percentiles) and overview of spectroscopic and photometric properties of globular cluster stars in GALAH DR4. Left panels show histograms of iron abundances from GALAH DR4 (blue) as well as literature estimates for the globular clusters from Giraffe (orange) and UVES (red) observations by Carretta et al. (Reference Carretta, Bragaglia, Gratton and Lucatello2009a, b) as well as observations from Johnson & Pilachowski (Reference Johnson and Pilachowski2010). Middle panels show the spectroscopic $T_\mathrm{eff}$ - $\log g$ diagrams coloured by iron abundance [Fe/H]. Right panels show the trend of GALAH DR4 [Fe/H] along the different $\log g$ values.

$v_\mathrm{mic}$ : Microturbulence velocities show a more complex pattern when compared to APOGEE DR17. We find a mean difference of $\Delta v_\mathrm{mic} = 0.23 \pm 0.39\,\mathrm{km\,s^{-1}}$ . However, the comparison reveals a linear mismatch: APOGEE DR17 tends to measure lower $v_\mathrm{mic}$ values for stars with low microturbulence and larger $v_\mathrm{mic}$ values for stars with higher microturbulence compared to GALAH DR4. This systematic trend suggests that the $v_\mathrm{mic}$ calibration between the two surveys may differ slightly, particularly at the extremes of the parameter range. We note, however, that the surveys agree much better than for GALAH DR3, where a fixed quadratic relation was used that did not allow for deviations, for example for red clump stars. Adding $v_\mathrm{mic}$ as free parameter returned a similar pattern as the empirical relation by Dutra-Ferreira et al. (Reference Dutra-Ferreira, Pasquini, Smiljanic, Porto de Mello and Steffen2016) and shows a significantly different behaviour of $v_\mathrm{mic}$ for the hottest, coolest, and red clump stars (see Fig. AA1). This mismatch of $v_\mathrm{mic}$ could have indeed driven the metallicity mismatch of metal-rich red clump stars in GALAH DR2 and DR3 (Buder et al. 2018, Reference Buder2021), since their metallicities are in agreement with other estimates now (e.g. APOGEE DR17).

$v \sin i$ : The rotational velocity estimates agree well with APOGEE DR17, with a mean difference of $\Delta v \sin i = 1.6 \pm 2.0\,\mathrm{km\,s^{-1}}$ . However, at higher rotational velocities (above approximately $24\,\mathrm{km\,s^{-1}}$ ), our neural networks start to extrapolate, leading to an upper limit in the estimates and returning significantly lower $v \sin i$ values compared to APOGEE DR17. This issue highlights the limitations of the GALAH DR4 $v \sin i$ estimates for rapidly rotating stars.

$v_\mathrm{rad}$ : For radial velocity we compared our results to both APOGEE DR17 and Gaia DR3. The comparison with APOGEE DR17 yields a small offset of $\Delta v_\mathrm{rad} = -0.09 \pm 0.39\,\mathrm{km\,s^{-1}}$ , indicating excellent agreement between the two surveys. Accounting for the much lower SNR for faint Gaia targets and unidentified binaries, we fit two Gaussian distributions to the overall difference of GALAH and Gaia radial velocities (see Fig. 18). The comparison with Gaia DR3 shows a slightly larger offset of $\Delta v_\mathrm{rad} = 0.15 \pm 0.44 \pm 1.54\,\mathrm{km\,s^{-1}}$ , which is expected due to the lower precision of the Gaia DR3 radial velocities (Katz et al. Reference Katz2023). We use the median residual of $0.15\,\mathrm{km\,s^{-1}}$ with respect to Gaia DR3 rather than the spread as our accuracy estimate.

Figure 18. Comparison of radial velocities between GALAH DR4 allspec and Gaia DR3. Panel (a) shows the difference of radial velocities as function of Gaia G magnitude. Panel (b) shows a histogram of the difference with two Gaussian distributions (with same mean) fitted to them to estimate a more robust, binary independent, radial velocity difference. Panel (c) shows the difference of radial velocities as function of radial velocity, showing the systematic scatter introduced by binaries.

Elemental abundances [X/Fe]: While there is no model-independent benchmark for absolute abundance accuracy, we continually perform comparisons with literature values to assess the consistency of our results with other studies. In GALAH DR4, we evaluate the abundance zero-points using up to five different reference estimates (see Fig. AC1): (1) the spectroscopic analysis of a Solar-composition spectrum of the asteroid Vesta (sobject_id 210115002201239), (2) abundance estimates for Solar twins corresponding to a Solar age of 4.5 Gyr (see Fig. 19), (3) abundances of Gaia FGK benchmark stars (Jofré et al. Reference Jofré2015, Reference Jofré2018), (4) stars with Solar-like metallicity $-0.1 \,{\lt}\, \mathrm{[Fe/H]} \,{\lt}\, 0.1$ within $500\,\mathrm{pc}$ of the Sun (a method introduced by Jönsson et al. Reference Jönsson2020), and (5) differences in abundance estimates for stars overlapping with the high-resolution, large-scale spectroscopic APOGEE DR17 survey (Abdurro’uf et al. Reference Abdurro’uf2022).

Figure 19. Chemical abundances [X/Fe] of Solar twin stars as a function of ages that were estimated as part of the mass and age estimation of the allstar spectrum analysis. We overplot linear fits to our age-abundance relations for Solar twins in orange and literature values from Bedell et al. (Reference Bedell2018) in red. Panels also indicate the median and standard deviation with respect to Bedell et al. (Reference Bedell2018) when assuming a correct age.

It is important to note that our abundance corrections, and consequently the Solar abundances presented in Table AC1, are determined within the framework of 1D LTE or 1D NLTE models and are not intended to represent the most accurate Solar abundances. Instead, they reflect our best effort to minimise discrepancies across different comparison cases. Given the differences in line modelling, such as those between Grevesse et al. (Reference Grevesse, Asplund and Sauval2007) (who used 3D atmospheres) and our 1D models, and possible deviations in the reference abundance from our Vesta spectrum, we refer to these adjusted values as zero-points. For certain scientific applications, adjusting these abundance zero-points might be necessary to ensure consistency with other datasets.

While we are not able to include all of our validation plots, we refer the interested reader to the publicly available code in our code repository.Footnote ^l Generally speaking, we have found that a large number of systematic trends of abundances with temperature and surface gravity has decreased with respect to GALAH DR3, as can be appreciated from dedicated validation plots (online here) – with a similar appearance as the right hand panels of Fig. 17.

Figure 20. Precision monitoring (with a median line and standard deviation shading) of stellar parameters as a function of SNR for the green CCD2 across GALAH DR4. Each panel shows the behaviour for bins of width 10 for the scatter of repeat observations of the allspec runs (blue), covariance uncertainties of allspec (orange) and allstar (red) setups as well as scatter of photometric $\log g$ from repeat observations (purple).

6.2.2. Precision estimation and validation

In addition to the accuracy uncertainty, we estimate the total uncertainty through the additional precision uncertainty (Equation 12). For this purpose, we mainly rely on the fitting uncertainties of the curve_fit function, which we rescale based on repeat observations.

While we report the raw fitting covariance matrix for each spectrum and module (see Fig. AB2 for the covariance matrices of Vesta and Arcturus), their entries are not validated for reliability and have not been adjusted to incorporate a rescaling towards the final uncertainties. For the purposes of reporting stellar parameter and abundance fitting uncertainties, we restrict ourselves to the standard deviations of each feature, that is, the square root values of the diagonal covariance matrix entries.

Figure 21. Comparison of stars with available measurements in GALAH DR4 and APOGEE DR17 for [C/Fe] and [N/Fe].

Figure 22. Comparison of stars with available measurements in GALAH DR3 (left), GALAH DR4 (middle) as well as APOGEE DR17 (right) for [Mg/Fe] (top row) and [Ni/Fe] (bottom row).

Similar to GALAH DR3, we apply a precision adjustment of the fitting uncertainty towards consistency with the scatter of repeat observations only as a function of SNR of CCD2. Contrary to GALAH DR3, we have extended this rescaling function to be fitted in bins of SNR with both a constant, linear, and exponential term with snr_px_ccd2 as the independent variable, that is, $c_1 + c2 \cdot SNR + c_3 \cdot \exp(c_4 \cdot SNR)$ . We report the fitted constants for both allspec and allstar in the online repositoryFootnote ^m for each stellar parameter and abundance.

In Figs. 20 and AC2, we then confirm that the overall trends of fitting uncertainties for allspec and allstar are consistent with the repeat observation scatter of the allspec. The latter has to be used as reference, because the allstar module uses co-added spectra of repeat observations rather than the repeat observations themselves. While this might actually overestimate the precision uncertainty of stellar parameters, we do not expect a too strong overprediction for abundances.

While the precision levels of stellar parameters have on average actually remained similar to the estimates of GALAH DR3, we see notable improvement of the precision for multiple elements, such as C, Mg, V, Cr, Co, Ni, La, Ce, Nd, and Sm. The precision of Eu, however, seems to have decreased.

Separately from this work, we performed an extensive analysis of the precision and accuracy of spectroscopic parameters from the observation of star clusters, 43 open clusters of all ages and 10 globular clusters (Kos et al. Reference Kos2025). In this work, we compare $T_\mathrm{eff}$ , $\log g$ , and stellar ages with the values obtained from cluster isochrone fitting. Ages show typical uncertainties of 10 to 50%, depending on the stellar type. $T_\mathrm{eff}$ and $\log g$ match well for stars hotter than $4\,000\,\mathrm{K}$ with a bias of $\Delta T_\mathrm{eff}=-68\,\mathrm{K}$ (GALAH – Isochrones), and $\Delta \log g = -0.03$ . For stars cooler than stars $4\,000\,\mathrm{K}$ , GALAH DR4 temperatures are overestimated by up to $250\,\mathrm{K}$ at $T_\mathrm{eff} = 3\,000\,\mathrm{K}$ and we find a complicated pattern in $\Delta \log g$ , with $\log g$ being sometimes severly overestimated for the coolest dwarf stars.

Most interesting is the analysis of elemental abundances. Assuming that stellar clusters are chemically homogeneous, we can study the precision of the reported abundances over a large range of temperatures. We find that cold stars show consistent systematic trends, that can reach values of $0.5\,\mathrm{dex}$ for some elements. Dwarf stars are most affected at temperatures $T_\mathrm{eff} \,{\lt}\, 4\,600\,\mathrm{K}$ , while giants show much smaller systematics with strong trends only at $T_\mathrm{eff} \,{\lt}\, 4\,000\,\mathrm{K}$ . The results of this cluster validation (Kos et al. Reference Kos2025), including a detrended set of elemental abundances, will be published as value-added-catalogues in DR4.

6.2.3. Uncertainties in light of GALAH DR3 and APOGEE DR17

To get a better idea of the actual improvement of accuracy and precision, we have performed more elaborate comparisons than those in Fig. 16 and only showcase a few in this manuscript with reference to the online repository. We have found the comparison of GALAH DR4 with both GALAH DR3 and APOGEE DR17 highly informative.

Because GALAH DR3 did not include N measurements and only a limited amount of C measurements, our first comparison concerns the abundances of C and N between GALAH DR4 and APOGEE DR17 in Fig. 21. We attach the comparisons for the other overlapping elements O, Na, Al, Si, K, Ca, Ti, V, Cr, Mn, Co, and Ce in Figs. AC3 and AC4. While we see a generally good agreement of the shapes, we notice biases of $-0.03\,\mathrm{dex}$ and $0.10\,\mathrm{dex}$ for C and N, respectively. These can be, however, explained by the lower precision of GALAH and might, in part, be driven by the slightly different trends of C and N towards lower metallicities. In particular, [C/Fe] decreases to sub-Solar level in APOGEE DR17 for metal-poor stars, whereas it is Solar or even enhanced in GALAH DR4. Enhanced levels would be expected for metal-poor disk stars, whereas sub-Solar levels are expected for accreted stars (Amarsi, Nissen, & Skúladóttir Reference Amarsi, Nissen and Skúladóttir2019b), warranting a future population analysis to test the accuracy of either survey.

In addition to these novel abundances, we also showcase two previously measured abundances, namely [Mg/Fe] and [Ni/Fe] in Fig. 22. The ${\unicode{x03B1}}$ -process element Mg has significant value for Galactic studies because it is predominantly produced by core-collapse supernovae (Kobayashi, Karakas, & Lugaro Reference Kobayashi, Karakas and Lugaro2020). In GALAH DR3, only the Mg i $5\,711\,$ Å line was used, whereas we now use a combination of several lines. This has led to a significant improvement in precision, as can be appreciated from the comparison of Fig. 22a and b. Even more positive, we see an improved agreement of the [Fe/H] vs. [Mg/Fe] measurements between GALAH DR4 (Fig. 22b) and APOGEE DR17 (Fig. 22c), with no abundance bias. One of the elements with the most significant precision improvement is Ni. For this element, our move to fitting the full wavelength range has increased the number of lines from two very reliable lines to several dozen lines. Albeit less reliable in their line data and possibly blended, the sheer increase in flux information used has improved the precision almost to the level of APOGEE DR17 – with no bias and a standard deviation of only $0.05\,\mathrm{dex}$ between APOGEE DR17 and GALAH DR4 (see Fig. 22c–f).

In addition to these instructive comparisons, we also return to the precision of Solar twins from Fig. 19. Here we specifically highlight the significant improvement of precision from GALAH DR3 to GALAH DR4 with respect to the linear estimate from Bedell et al. (Reference Bedell2018) for C (from 0.09 to 0.045), Si (from 0.04 to 0.023), Ca (0.07 to 0.049), Ti (0.05 to 0.031), V (0.13 to 0.050), Cr (0.06 to 0.024), Ni (0.07 to 0.033), and Y (0.12 to 0.080). This improvement of sometimes a factor of 2 is remarkable and most of our comparisons indicate that these values are representative of a precision improvement beyond the Solar twins, as shown for example for Ni in Fig. 22.

6.3. Stellar parameter flags flag_sp

We have implemented a series of post-processing routines to assess the quality of the stellar parameter determinations. These routines check for a variety of potential issues with the spectra and stellar label fitting, with each flag corresponding to a specific quality check. If any of these checks are not passed, the respective bit in the quality flag flag_sp is raised. The description of the implemented bits/flags for flag_sp and how often they were raised is listed in Table 5 and distributions in the Kiel diagram ( $T_\mathrm{eff}$ and $\log g$ ) are shown for each raised bit in Fig. AC6 for the allstar catalogue. For examples of stars with raised flags, we refer back to the emission line star (flag_sp = 1) of Fig. 15 (Section 6.1.4) and the clearly double-lined binary of Fig. 11.

Table 5. List of major quality flag flag_sp listing the bit, description and how often the flag was raised for the allstar and allspec routines. Notes: Multiple bits can be raised for each of the 1 085 520 spectra of 917 588 stars.

Because quality cuts should be applied based on the specific science case at hand, we do not make a strict recommendation for which upper limit of flag_sp should be applied. We note, however, that we have tried to implement flags that increase in concert. The first 8 bit masks (with values up to $2^9-1 = 511$ ) are therefore less problematic than those of 9 or higher ( ${flag\_sp} \leq 512$ ).

While not intended to identify binaries, we believe that both the $v \sin i$ and $v_\mathrm{mic}$ flags are informative for binaries below $T_\mathrm{eff} \,{\lt}\, 6\,000\,\mathrm{K}$ (see their elevated position in Fig. AC6f and g). We have trained the stars of this region with a lower maximum $v \sin i$ range that would be reached for a spectrum that is broadened due to binarity. This region certainly overlaps with the one of identified single-lined and double-lined binaries with flag_sp = 4 and 8, respectively (see Fig. AC6c and d). For the latter, we notice that especially cool giants are picked up by the automatic algorithm as well. This might be either due to strong extinction biasing our analysis or due to lines in the spectrum not being modelled properly and thus showing up as residual signal. While these stars are possibly flagged false-positively, we also find a remarkable amount of true binaries ( $\,{\gt}\,41\%$ in orange area of Fig. 23b), for which the Gaia DR3 radial velocity is likely the systemic radial velocity, as it is close to the mean radial velocity of both components identified in GALAH DR4. In Fig. 23, we visualise how one could use the radial velocities from GALAH DR4 and Gaia DR3 to further assess the reliability of this flag. To check if a particular bitmask flag (e.g. $2^3 = 8$ ) is raised, one can perform the check in python via

Figure 23. Comparison of radial velocity estimates of GALAH DR4 and Gaia DR3. Panel (a) shows the difference of GALAH’s primary component radial velocity with the mean Gaia DR3. Panels (b) and (c) show stars for which two components were detected in GALAH DR4 and shows the difference between each component and Gaia DR3 against the difference of mean (roughly systemic) radial velocities. The panels also include regions where actual binaries and false positive detections are expected.

flag_8_raised = (dr4[’flag_sp’] & 8) != 0

6.4. Elemental abundance flags flag_X_fe

The quality of elemental abundance measurements is also captured through flags. When an element is reliably detected in the spectrum, no flag is raised. However, if the abundance of an element is estimated as an upper limit, often due to weak spectral lines or low SNR, an upper limit flag is triggered. If no measurement of the element is possible, a flag is raised to indicate that the relevant spectral features were too weak or the SNR too low to allow for an estimate. The list of bits and flags for elemental abundances, flag_X_fe, is shown in Table 6.

By default, we recommend to only use significant detections ( ${flag\_x\_fe} = 0$ ) for an element. Because of a bug in the flagging of the [Fe/H] detection (see discussion in Section 8.8), we do not recommend to consider ${flag\_fe\_h}$ for quality cuts.

6.5. Abundance detection or upper limit

To assess whether the abundance estimates are a true detection or an upper limit for each element X, we compare a synthetic spectrum with the best-fitting parameters to a synthetic spectrum with the same parameters, except for element X, for which we use the lower limit abundance of the neural network. The residuals in units of $\sigma$ between the best-fitting spectrum and the spectrum with the lowest possible [X/Fe] or lowered [Fe/H] then allow us to identify a detection (with maximum differences beyond 3 $\sigma$ ) or upper limits, for which we raise the flag flag_x_fe by 1. Our initial test of overall detectability (Section 4.3.1) allowed us to raise the flag flag_x_fe by 2 for elements for which not even an upper limit was expected.

We further raise a flag for allspec abundances, if the element was fit above (3) or below (4) the neural network training set range. For CNO, we have identified specific regions, in particular dwarfs, for which could not verify abundances and therefore caution their use (flag 5). We have further tried to identify abundances, for which the optimisation may have failed and flagged these with flag 6 (see Section 8.7).

7. Data release products

GALAH DR4 encompasses a diverse range of data products. We describe the most important main catalogues in Section 7.1 and value-added catalogues in Section 7.2. We further explain the data products for each spectrum and star, that is, the reduced spectra (Section 7.3.1), allspec products (Section 7.3.2), and allstar products (Section 7.3.3).

The data products are provided directly on the AAO DataCentral website at https://cloud.datacentral.org.au/teamdata/GALAH/public/GALAH_DR4/. We further provide multiple ways to interact with the data release products, which are described in Section 7.4.

7.1. Main data release catalogues

1. galah_dr4_allspec_240705.fits: analysis for each spectrum (including radial velocity estimation for each spectrum) based on a single spectrum.
2. galah_dr4_allstar_240705.fits: analysis for each star based on co-added spectra of each star and using non-spectroscopic information to constrain $\log g$ .

We present the main catalogue table schema in Table A1 (see also Fig. 24), but refer the reader to the FITS headers of each catalogue for more detailed information.

One of our greatest achievements as part of this data release is the extraction of C and N abundances for giant stars from molecular absorption features. In Fig. 25, we show how stellar mass and [C/N] ratios are correlated in GALAH DR4, as is expected based on the pioneering work by Masseron & Gilmore (Reference Masseron and Gilmore2015), Martig et al. (Reference Martig2016), and Ness et al. (Reference Ness2016). Our measurements demonstrate the potential of [C/N] abundances to better separate the core-helium burning from the red giant phase (around the blue areas of Fig. 25b and c) or at least better constrain stellar masses.

7.2. Value-Added Catalogues (VAC)

We provide several value-added catalogues, namely a crossmatch catalogue to all entries of the Gaia DR3 main source catalogue and the most important entries from the 2MASS and WISE catalogues, a catalogue of stellar dynamics properties, a catalogue of 3D NLTE measurements of Li, and a catalogue with ages inferred via isochrone interpolation in a Bayesian framework.

7.2.1. VAC of crossmatches with Gaia DR3, 2MASS and WISE

The value-added catalogue of the crossmatchFootnote ⁿ with the Gaia DR3, 2MASS, and WISE catalogues as well as the distance catalogue of Bailer-Jones et al. (Reference Bailer-Jones, Rybizki, Fouesneau, Demleitner and Andrae2021) was calculated by performing an OUTER JOIN ADQL-query in the Gaia archive.

The query first performed an INNER JOIN with the 2MASS near-infrared photometry catalogueFootnote ^o via its designation and linked this match to the Gaia DR3 catalogue via the best neighbourFootnote ^p and joinedFootnote ^q catalogues of 2MASS to Gaia DR3 (Torra et al. Reference Torra2021). When cross-matching between Gaia DR3 and 2MASS, less than 1% of stars were associated with multiple possible matches. To ensure the best match, the data were sorted from brightest to faintest G-band magnitude, and only the brightest match for each sobject_id was retained.

The crossmatch to the WISE far-infrared photometry catalogueFootnote ^r (Cutri et al. Reference Cutri2014) was performed via the Gaia DR3’s best neighbour catalogueFootnote ^s (Torra et al. Reference Torra2021). The match to the distance catalogueFootnote ^t of Bailer-Jones et al. (Reference Bailer-Jones, Rybizki, Fouesneau, Demleitner and Andrae2021) via the Gaia DR3 source_id.

The catalogue also includes uncertainties in the Gaia DR3 photometric magnitudes (G, ${G_{{\rm{BP}}}}$ , ${G_{{\rm{RP}}}}$ ) that were recalculated following the recommendations from the Gaia Early Data Release 3 (EDR3) documentation (Riello et al. Reference Riello2021). The total uncertainties were computed by combining the photon flux error with an additional systematic term.

Table 6. List of elemental abundance quality flags flag_fe_h for [Fe/H] or flag_X_fe for element X.

We further corrected the Gaia DR3 parallaxes for systematic zero-point errors by applying the correction model provided by Lindegren et al. (Reference Lindegren2021a). This correction depends on several factors, including the G-band magnitude, effective wavenumber ( ${\nu _{{\rm{eff}}}}$ ) used in astrometry, pseudocolour, latitude, and the astrometric solution type. The parallax zero-points and original parallaxes are reported as plx_zpt_corr and parallax_raw, respectively.

Beyond the crossmatch with the Gaia DR3 gaia_source catalogue, multiple other crossmatches can easily be performed via the gaiadr3_source_id column. We have for example crossmatched the sources from GALAH DR4 with those from Gaia DR3’s variability catalogues (Rimoldini et al. Reference Rimoldini2023). We find 47 493 stars in GALAH DR4 that overlap with the gaiadr3.vari_classifier_result catalogue. In particular, we find 17 256 SOLAR_LIKE variables, 14 477 stars in the ${\unicode{x03B4}}$ Scuti, ${\unicode{x03B3}}$ Doradus, or SXPhoenicis category (DSCT/GDOR/SXPHE), 6 247 LPV (long-period variables), 4 074 ECL (eclipsing binaries), 3 355 RS (RS Canum Venaticorum variables), 1 096 YSO (young stellar objects), 401 RR (RR Lyrae types), and a large variety of other variables, including the white dwarf 2MASS J05005185-0930549 that was already found in GALAH data by Kawka et al. (Reference Kawka2020).

7.2.2. VAC of stellar dynamics

The value-added catalogue for stellar dynamicsFootnote ^u includes the kinematic and dynamical properties for stars in the GALAH DR4 survey. The catalogue is created with a publicly available scriptFootnote ^v as part of GALAH DR4. We define the position of the Sun in our Galactic reference frame as $R_\mathrm{GC} = 8.21\,\mathrm{kpc}$ (McMillan Reference McMillan2017), $\varphi_\mathrm{GC} = 0\,\mathrm{rad}$ , and $z_\mathrm{GC} = 25\,\mathrm{pc}$ (Bland-Hawthorn & Gerhard Reference Bland-Hawthorn and Gerhard2016). We then combine the total velocity in V of the Sun at $R_\mathrm{GC}$ based on the proper motion measurement of $6.379\pm0.024\,\mathrm{mas\,yr^{-1}}$ by (Reid & Brunthaler Reference Reid and Brunthaler2004), that is, $V_\odot = 248.27\,\mathrm{km\,s^{-1}}$ with the circular velocity of $V_\mathrm{circ} = 233.10\,\mathrm{km\,s^{-1}}$ from McMillan (Reference McMillan2017) to estimate a peculiar velocity of the Sun with respect to the local standard of rest of $15.17\,\mathrm{km\,s^{-1}}$ . For the other two components, we use the estimate by Schönrich, Binney, & Dehnen (Reference Schönrich, Binney and Dehnen2010), leading to a peculiar velocity of the Sun of $(U,V,W) = (11.1, 15.17, 7.25)\,\mathrm{km\,s^{-1}}$ .

Starting from the crossmatch of GALAH DR4 with the Gaia DR3 (see Section 7.2.1), we use the galpy.orbit module by Bovy (Reference Bovy2015) to estimate heliocentric Cartesian coordinates (X,Y,Z) and velocities (U,V,W) as well as Galactocentric cylindrical coordinates $(R, \varphi, Z)$ and velocities ( $v_R, v_\varphi, v_Z$ ). We approximate the orbit actions $J_R, J_\varphi = L_Z, J_Z$ and frequencies $\omega_i$ with the galpy.actionAngle.actionAngleStaeckel function with a focal length of the confocal coordinate system ${delta} = 0.45$ in the Milky Way potential by McMillan (Reference McMillan2017). We further use the Staeckel approximation (Binney Reference Binney2012) to calculate eccentricity, maximum orbit Galactocentric height, and apocentre/pericentre radii with galpy’s EccZmaxRperiRap (Mackereth & Bovy Reference Mackereth and Bovy2018). Our assumption of a time-invariant, axisymmetric potential further allows us to extract the orbit energy via galpy. Orbit.E.

In particular the dedicated observing programs of GALAH towards low angular momentum stars (PI S. Buder) and globular clusters (PI M. McKenzie and PI M. Howell) have increased the number of spectroscopic observations for stars on halo-like orbits. This is showcased by both the action-action diagram of angular momentum $L_Z$ versus radial action $\sqrt{J_R}$ (Fig. 26) and angular momentum $L_Z$ versus orbit energy E (Fig. 27) and visualises the potential of GALAH DR4 observations to complement Galactic dynamics studies and enable Galactic chemodynamic studies.

Figure 24. Overview of stellar parameters and elemental abundances for the allstar estimates of GALAH DR4. The top left panel shows the density distribution of stars in the Kiel diagram of $T_\mathrm{eff}$ and $\log g$ . All other panels show the logarithmic elemental abundances (for elements indicated in the top left of the panel) as a function of the logarithmic iron abundances [Fe/H]. Elements are coloured by different nucleosynthetic channels (black for big bang nucleosynthesis, blue for core-collapse supernovae, red for supernovae Type Ia, green for asymptotic giant branch star contributions and pink for the rapid neutron capture process with contributions from merging neutron stars) following the colour schema from Kobayashi et al. (Reference Kobayashi, Karakas and Lugaro2020). Percentages indicate the fraction of detections of stars for each element.

Figure 25. The ratio of [C/N] and isochrone masses in comparison panel (a), and as a function of $T_\mathrm{eff}$ and $\log g$ in panels (b) and (c), respectively.

Figure 26. Distribution of the dynamical properties of angular momentum $L_Z$ and radial action $J_R$ of stars in GALAH DR4 (black), with globular cluster members highlighted in colour. Cluster members were selected as those with more than 70 percent membership probability according to Vasiliev & Baumgardt (Reference Vasiliev and Baumgardt2021). The Sun is indicated with a red $\odot$ symbol.

Figure 27. Distribution of the dynamical properties of angular momentum $L_Z$ and orbital energy E of stars in GALAH DR4 (black), with globular cluster members highlighted in colour. Cluster members were selected as those with more than 70 percent membership probability according to Vasiliev & Baumgardt (Reference Vasiliev and Baumgardt2021). The Sun is indicated with a red $\odot$ symbol.

7.2.3. VAC of 3D NLTE lithium abundances

In this value-added catalogue,Footnote ^w we use spectrum fitting to infer 3D non-local thermodynamic equilibrium (NLTE) lithium abundances. For each spectrum, the Li line is modelled with a 3D NLTE breidablik line profile (Wang et al. Reference Wang2021). In cases where the Li line is blended with nearby lines such as Fe and CN, we model blending lines as Gaussian absorption profiles. From this model, we measure the equivalent width (EW) and errors in EW of the Li line using UltraNest (Buchner Reference Buchner2021), a Monte Carlo nested sampling algorithm. The Li abundance, A(Li), is then inferred from the measured EW using breidablik and the stellar parameters from GALAH DR4’s allstar. See Wang et al. (Reference Wang2024a) for a detailed description of the methodology.

Wang et al. (Reference Wang2024a) measured a local line width for the Li region and fit the width of the Li line separately from other lines. For this work, we use the GALAH instrumental profile convolved with the rotational velocity as the width of our blending lines and set the width of the Li line based on this convolved kernel, better constraining the line widths. Whilst we still measure a local radial velocity due to a lack of ThXe arc lines for CCD3 (see Section 8.1), we apply the GALAH radial velocity for poorly constrained Li depleted stars where we cannot measure the local radial velocity. In addition, the sampled EW posterior is now modelled using a first order boundary corrected kernel density estimator from Lewis (Reference Lewis2019), which has better convergence than histograms. Lastly, GALAH DR4 analyses stars down to 3 000 K, but the stagger model atmospheres only reach 4 000 K, therefore, we provide an additional column of 1D NLTEFootnote ^x A(Li) inferred through our measured EWs using a new interpolator. Similar to the existing EW interpolators in breidablik (Wang et al. Reference Wang2021), we train a feedforward neural network on NLTE Li abundances synthesised using the 1D marcs model atmospheres. We use a 2-layer architecture with the ReLU activation function and find best hyperparameters: $i = 900$ neurons, and $\alpha = 0.1$ L2 penalty. This model is included in the breidablik package. Using the updated methodology it takes $\sim$ 2 min per star in comparison to $0.5$ –2 h per star reported in Wang et al. (Reference Wang2024a), with the main speed up coming from fixing the Li line width.

EWs are measured for 892 223 stars (97% of GALAH DR4), with 3D NLTE A(Li) detections reported for 417 825 stars (46%) and upper limits for 474 398 stars (52%). Fig. 28 shows the mean EW over $T_\mathrm{eff}$ and $\log g$ . The Li-dip can be seen on the main-sequence turn-off at $T_\mathrm{eff}$ $\approx 6\,500$ K and $\log g$ $\approx 4.2$ and extends up the subgiant branch. There is a Li enhanced population of stars at $\log g$ $\approx 2.5$ in the red clump whilst the horizontal branch is depleted in Li. Although the secondary red clump appears to be depleted in Li, these stars have an overestimated $\log g$ driven by incorrectly inferred masses (see Section 8.4), and should be primary red clump stars. The increase of mean Li EW up the giant branch is due to a large proportion of stars with EW $\approx 100$ mÅ. Features of this figure will be studied in follow-up papers.

Figure 28. Mean EW binned in $T_\mathrm{eff}$ and $\log g$ . The Li-dip can be seen at $T_\mathrm{eff}$ $\approx 6\,500$ K and $\log g$ $\approx 4.2$ . At $\log g$ $\approx 2.5$ , red clump stars have a higher mean Li EW whilst horizontal branch stars have a lower mean Li EW compared to surrounding stars. The mean Li EW increases going up the red giant branch.

A quality flag (flag_ALi) is raised by 1 for upper limits, 2 or more to indicate other quality issues, such as stellar parameters falling outside of the model atmosphere grid (see Wang et al. Reference Wang2024a) for more details on the bitmask flag). We recommend flag_ALi < 2 when using the 3D NLTE A(Li), and flag_ALi < 4 when using Li EWs. A similar quality flag flag_ALi_1D is provided corresponding to the 1D NLTE A(Li) included in the VAC.

For convenience, we have included the most important columns of this catalogue in the allstar catalogue (see Table A1), as we recommend to use them instead of the less accurate 1D NLTE abundances estimated with an imperfect neural network interpolation, which we indicate with nn_li*.

7.2.4. Ages

A value-added-catalogue for stellar ages and masses from BSTEP (Sharma et al. Reference Sharma2018) is currently in preparation. In the meantime, users can rely on the on-the-fly age and mass estimates already provided in the allstar and allspec catalogues from the pipeline.

7.3. Data products for each spectrum and star

We provide individual data products in an orderly fashion that allow users to create links to these products based solely on the sobject_id. To download data products for individual stars we recommend creating a url string and using wget or similar commands. For bulk downloads of the advanced data products of this section, we recommend contacting the GALAH collaboration or using the bulk download interfaces of AAO DataCentral.

7.3.1. Reduced spectra

The reduced spectra of each night are provided in the observations directoryFootnote ^y and sorted into directories with four spectra – one for each of the four CCDs. These spectra are produced by the reduction pipeline (see Section 2.2) and include several extensions as outlined in Table 2, with wavelength information stored in the fits headers with starting wavelength CRVAL1 in Å and linear pixel scale CDELT1 in Å/px, and the number of pixels NAXIS1. The reduced spectra are only provided per exposure and not in a co-added manner, since the co-adding was performed as part of the allstar module (see Section 7.3.3 for co-added spectra). We note that not all files might be available for a given exposure due to the rare failure of CCD readouts.

7.3.2. Additional products of the allspec module

The allspec analysis product directoryFootnote ^z provides the files that were produced by the allspec module. These include the on-the-fly assessment of the radial velocity fit *rv.png (similar to Fig. 9), the raw fitting results *results.fits and their covariance matrices *covariances.npz (similar to the entries used to produce Fig. AB2). We also provide a combined *spectrum.fits file (concatenated over the four bands) that includes the wavelength, flux, and flux uncertainty of the velocity-corrected and re-normalised observed spectrum as well as the best-fitting model spectrum interpolated onto the same wavelength. Finally, we provide a *comparison.pdf (similar to Fig. AB1) which displays the fit results, comparison of observed and model spectrum, masked wavelength regions, and wavelengths of the most important element lines. If the module did not run to completion, for example because the SNR of the spectra was below the threshold of $\mathrm{SNR} = 10$ for any CCD to even attempt a fit, not all products are available for a spectrum.

7.3.3. Additional products of the allstar module

The allstar analysis product directoryFootnote ^aa also includes the radial velocity monitoring *rv.png, results files *results.fits, combined spectra *spectrum.fits and *comparison.pdf overview, similar to the ones described in Section 7.3.2. In addition, each directory also includes a *sobject_ids.txt file that lists all individual spectra that were co-added to create the observed spectrum and its uncertainty in *spectrum.fits.

7.4. Interactive access via AAO DataCentral

In collaboration with the AAO Data Central, a number of interactive ways are provided to explore the data of this release. Data Central provides both Simple Spectral Access and Single Object Viewer services. In addition, we recommend to download files or easily crossmatch user catalogues with the TAP server https://datacentral.org.au/vo/tap in TOPCAT (Taylor Reference Taylor2005). https://apps.datacentral.org.au/galah/spectra also provides an interactive plotting application to show normalised or un-normalised spectra of different repeat observations. As these tools are under active development, we refer to the latest documentation on both the DataCentral and the main Survey website https://www.galah-survey.org.

8. Caveats and future improvements

In this section, we attempt a detailed discussion of caveats at different steps of our analysis, while also giving suggestions for future improvements – both for GALAH and other surveys. We first discuss caveats of the spectrum reduction (Section 8.1), before extensively discussing the spectrum synthesis (Section 8.2) and spectrum interpolation (Section 8.3). We discuss possible problems arising from the use of photometric information (Section 8.4), in particular for stars that could be binaries (Section 8.5). We elaborate on caveats regarding globular clusters in Section 8.6 and the fitting iteslf in Section 8.7. Finally, we point out caveats regarding the flags in Section 8.8 as well as a bug and its correction in the reported radial velocity of interstellar K in Section 8.9. We summarise the most important caveats in Section 8.10.

8.1. Spectrum reduction

Although a significant amount of work was spent on improving the spectrum reduction, several persistent issues remain, which are summarised below.

8.1.1. Wavelength solutions

For each CCD, the reduction pipeline estimates the most suitable wavelength solution, linking pixels with actual wavelengths based on the ThXe arc lines. In GALAH DR3 (Buder et al. Reference Buder2021) we identified several issues for spectra where not enough ThXe lines could be used to constrain the wavelength solution. Improvements have been made for the new reduction version to improve the number of useful ThXe lines and restrict the flexibility of wavelength solutions to move them closer to previous results. This has helped us to decrease the number of problematic wavelength solutions towards the red end of CCD3 which includes the used absorption features of Li and Eu. We have decreased bad wavelength solutions for this CCD from initially 7.9% of the spectra to roughly 1% bad solutions, that is, similar to the other CCDs.

8.1.2. Holistic spectrum extraction

Although much work has been spent on improving telluric and sky lines in the reduction step, most reduction steps are currently run sequentially rather than in parallel. Using the information of stellar spectra when modelling the wavelength solution would certainly help to overcome the limited information in ThXe calibration spectra in the absence of laser combs (Kos et al. Reference Kos2018). Multiple steps in this direction have been taken (Saydjari et al. Reference Saydjari, Uzsoy, Zucker, Peek and Finkbeiner2023) and should be rolled out in future spectrum analysis. This would especially help to mitigate imperfect telluric and sky line removal while simultaneously improving the wavelength solution – among many other effects.

8.2. Imperfect spectrum synthesis

8.2.1. Spectrum synthesis

The GALAH survey’s success relies heavily on the ability to accurately model stellar spectra to infer accurate stellar properties. The survey has seen significant improvements in moving from the approximation of 1D LTE towards 1D NLTE (Amarsi et al. Reference Amarsi2020b). This includes the use of 1D NLTE synthesis for atomic lines using the 3D NLTE code balder (Amarsi et al. Reference Amarsi2018b), a custom version of Multi3D (Botnen & Carlsson Reference Botnen, Carlsson, Miyama, Tomisaka and Hanawa1999; Leenaarts & Carlsson Reference Leenaarts and Carlsson2009). The code employs model atoms for H (Amarsi et al. Reference Amarsi2018b), Li (Lind, Asplund, & Barklem Reference Lind, Asplund and Barklem2009a; Wang et al. Reference Wang2021), C (Amarsi et al. Reference Amarsi, Barklem, Collet, Grevesse and Asplund2019a), N (Amarsi et al. Reference Amarsi2020a), O (Amarsi et al. Reference Amarsi, Barklem, Asplund, Collet and Zatsarinny2018a), Na (Lind et al. Reference Lind, Asplund, Barklem and Belyaev2011), Mg (Osorio et al. Reference Osorio2015), Al (Nordlander & Lind Reference Nordlander and Lind2017), Si (Amarsi & Asplund Reference Amarsi and Asplund2017), K (Reggiani et al. Reference Reggiani2019), Ca (Osorio et al. Reference Osorio, Lind, Barklem, Allende Prieto and Zatsarinny2019), Mn (Bergemann et al. Reference Bergemann2019), Fe (Amarsi et al. Reference Amarsi2018b; Amarsi et al. Reference Amarsi, Liljegren and Nissen2022), and Ba (Gallagher et al. Reference Gallagher2020) over the marcs model atmosphere grid. The work by Wang et al. (Reference Wang2024a) also enables us to present measurements of Li in 3D NLTE as part of this release.

All of these advances contrast with the lack of a proper way of modelling molecular features appropriately. This could explain the significant mismatch of oxygen abundances between the optical and infrared (compare e.g. Bensby, Feltzing, & Oey Reference Bensby, Feltzing and Oey2014; Abdurro’uf et al. Reference Abdurro’uf2022). It can, however, also lead to mismatches in the GALAH wavelength range, where atomic features, such as C I, can be modelled in 1D NLTE, whereas much stronger molecular features of $\mathrm{C}_2$ and CN have to be modelled in 1D LTE and linelists of molecules, such as TiO, might be incomplete (Hoeijmakers et al. Reference Hoeijmakers2015; McKemmish et al. Reference McKemmish2019).

For our synthesis, we have employed version 580 of the IDL-based code Spectroscopy Made Easy (Valenti & Piskunov Reference Valenti and Piskunov1996; Piskunov & Valenti Reference Piskunov and Valenti2017). As part of the continuing improvement of this code, several bugs have been identified and fixed. We also note that a Python-based version of SME, pySME (Wehrhahn et al. Reference Wehrhahn, Piskunov and Ryabchikova2023), has become available. In addition, the spectrum synthesis code KORG (Wheeler et al. Reference Wheeler, Abruzzo, Casey and Ness2023; Wheeler, Casey, & Abruzzo Reference Wheeler, Casey and Abruzzo2024) has been published in Julia with a Python interface. It offers a faster alternative to SME once 1D NLTE synthesis is implemented, which is essential for applying to many NLTE-sensitive lines, such as O and K, in the GALAH wavelength range. KORG already internally adjusts the metallicity that is used to interpolate atmospheres based on the overall chemical abundances, whereas this would need to be adjusted in SME by hand, since atmospheres are interpolated with the sme.feh entry that is independent of the chemical composition sme.abund. Because we have not performed said adjustment, we note that the spectrum synthesis for chemical compositions far from scaled-Solar may have used an mismatched atmosphere in the synthesis in SME for GALAH DR4.

8.2.2. Mismatch of atmosphere and spectrum chemistry

For several of our synthetic spectra, the chosen chemical composition deviates significantly from the scaled-Solar pattern of the marcs model atmospheres, particularly for ${\unicode{x03B1}}$ -process elements such as O and Mg, as well as C and N. These elements can substantially affect opacity and energy transport, and therefore, their abundances must be adjusted to match observed spectra more accurately. For instance, ${\unicode{x03B1}}$ -enhancements in stars with non-Solar abundance patterns can shift line strengths and depths significantly (Asplund Reference Asplund2005; VandenBerg et al. 2012). Likewise, variations in C and N abundances, particularly in cooler stars, can impact molecular equilibrium, altering CO and CN molecular line strengths significantly (Tsuji Reference Tsuji1976; Smith et al. Reference Smith2013). Dedicated marcs atmospheres with modified ${\unicode{x03B1}}$ and C abundances (Mészáros et al. Reference Mészáros, Allende Prieto and Edvardsson2012; Jönsson et al. Reference Jönsson2020), such as those used in modelling APOGEE spectra (Abdurro’uf et al. Reference Abdurro’uf2022), or more flexible interpolation schemes by Westendorp Plaza, Asensio Ramos, & Allende Prieto (Reference Westendorp Plaza, Asensio Ramos and Allende Prieto2023), address this mismatch. However, the NLTE grids would also need to be expanded to cover all grid points of the extended marcs models to ensure consistency.

8.3. Spectrum interpolation with neural networks

8.3.1. Training set selection

Before the neural networks are computed, it should actually be tested what the abundance zero-points are. In the case of several elements like Na and Al they are significant, on the order of $0.2\,\mathrm{dex}$ . When this occurs, stars with actual high abundances of $0.7-0.8\,\mathrm{dex}$ , for example in old stars and especially in globular clusters (see e.g. Carretta et al. Reference Carretta2009b), are not sufficiently covered.

One of the primary challenges in creating an optimal training set for spectrum interpolation lies in the choice of parameter sampling. A common caveat is the use of randomised, uncorrelated parameter sampling, which can lead to unrealistic combinations of elemental abundances. Elements that share a similar nucleosynthesis channel often exhibit correlated behaviour, for instance, stars with high abundances of Mg are typically also enhanced in Si, Ca, and Ti, while Na and Al tend to be elevated together. Similarly, neutron-capture elements like Y and Ba often follow similar trends (e.g. Ting et al. Reference Ting, Freeman, Kobayashi, De Silva and Bland-Hawthorn2012; Kobayashi et al. Reference Kobayashi, Karakas and Lugaro2020; Buder et al. Reference Buder2021). To better capture this behaviour in the training set, the use of scaled linear functions or normalising flows could be advantageous. These approaches would help minimise the occurrence of unlikely parameter combinations and yield a more representative sample.

For stars of the thin disk population, one could for example consider sampling from a noisy age-[X/Fe] relation to model chemical evolution (see Fig. 19, Nissen Reference Nissen2015; Spina et al. Reference Spina2016; Bedell et al. Reference Bedell2018). This approach becomes more complicated when considering the thick disk, halo, and peculiar stars, where distinct nucleosynthesis histories introduce greater variability in elemental abundance trends.

8.3.2. Masking of spectra

Because the correlation between spectral features, stellar parameters, and abundances is often complex, degeneracies can arise when two stellar properties influence similar pixels of a spectrum (e.g. C and N for CN, or $T_\mathrm{eff}$ and [Fe/H] for cool dwarfs) or two stellar properties tend to act in lockstep in actual stars (e.g. Mg, Si, and Ti as ${\unicode{x03B1}}$ -process elements). In GALAH DR2 (Buder et al. 2018), we attempted to overcome these issues by specifically masking the coefficients of spectrum interpolation, that is, effectively restricting the interpolation to only change smaller parts of the spectrum for a given stellar property.

In GALAH DR4, we have relaxed this restriction again, since we have trained on random abundance combinations in the hope of being able to break correlation degeneracies. We note, however, that too little information in spectra can again cause by-chance correlations (e.g. if neutron-capture lines are always very weak and the training set is not sufficiently large). We believe that this is the cause of the decrease in precision for Eu measurements from GALAH DR3 to GALAH DR4. The Eu abundance was mainly measured only from the weak Eu $6\,645\,$ Å line in DR3, whereas the neural networks of DR4 are not restricted to this region.

8.3.3. Flexibility of neural networks in general

The decision to use a large set of neural networks, each covering a restricted region in the $T_\mathrm{eff}$ , $\log g$ , and [Fe/H] space, was motivated by the goal of reducing the complexity required of any single model. By dividing the parameter space into smaller subsets, each neural network can be specialised and therefore less flexible, which allows for more precise modelling within its specific region. This approach avoids the trade-off faced by a single, monolithic neural network, which would either lack sufficient flexibility across the entire parameter space or be computationally more expensive to train and evaluate. For this data release, we have fixed the chosen network architecture of a 2-layer perceptron with 300 neurons and specific learning rate. While we have tested other activation functions than leaky rectified linear units, namely sigmoid, tanh and exponential linear unit functions, we found the lowest root mean square errors for our chosen activation function. Given the found issues with model fluxes above 1, we also would recommend to test a sigmoid as last activation layer of the neural network to ensure that the neural network always predicts fluxes between 0 and 1, as is expected from modelled stellar spectra. We have further tested a larger number of neurons, but found the root mean square errors to stabilise around 300 neurons for our test cases. It has to be acknowledged that due to the limit of human power to properly train and test the neural networks, we have not been able to properly test all neural networks and explore more flexible architectures. For this data release, we have decided not to rerun these steps, but make the current results available to the community. In the future, the restriction to one or only a few network models is recommended. The latter could cover regions of cool dwarfs, main-sequence turn-off stars, hot stars, and giant stars with individual models – and possibly explore the split in metal-poor and solar-like regimes. This would also decrease overhead, in particular for training and loading different models as well as possible noding effects between different models.

8.3.4. Flexibility of neural networks for extreme abundances

While this approach has proved to be powerful for all elements across their abundance ranges, we have noticed sinusoidal shapes for weak Li lines (see also Wang et al. Reference Wang2021). This is likely caused by the large dynamical range of $0 \,{\lt}\, \mathrm{A(Li)} \,{\lt}\, 4$ that has to be covered by the neural network. For Li, the more sophisticated approach is to fit Gaussian lines to multiple components in the wavelength range around $6\,708\,$ Å, measure EW(Li), which are then used to infer 3D-NLTE based A(Li) abundances. This inference is preferable to our 1D-NLTE based neural network estimates, as it is independent of the network flexibility and superior to our less accurate spectrum synthesis in 1D.

While several studies have identified that the abundances of stars in the Galactic disk are often very similar (e.g. Ness et al. Reference Ness2019), the Galactic halo offers a more diverse picture. An example is 2MASS J22353100-6658174 (140707003601047), a turn-off star with extremely high s-process abundances and actually visible lines of La and Nd in addition to the usually visible Y and Ba. In this case, the fits to the La and Nd lines are significantly weaker than the observations. GALAH DR3 actually produced reasonable fits to this star with high abundances in [Y/Fe]=1.2, [Ba/Fe]=1.5, [La/Fe]=1.5, [Ce/Fe]=1.1, [Nd/Fe]=1.9, and [Sm/Fe]=1.2. A neural network that is not trained on such high abundances is likely to improperly extrapolate stellar spectra.

While we have tried to extract abundances of chemically peculiar stars, such as carbon-enhanced metal-poor stars, the significant effect of their molecular features onto the whole stellar spectrum is not to be underestimated and can in-itself pose a problem to the flexibility of neural networks.

8.3.5. Over- and underdensities at neural network edges

While the use of one neural network to interpolate the high-dimensional spectrum space is preferable, in practice, different science cases may drive the decision to use several networks. If the science case is to reach maximum precision, one neural network that is trained on the typical spectrum could be used at the expense of properly modelling peculiar spectra. If the science case is to reach maximum accuracy, only the regions with reliable line data and spectrum synthesis might be preferable. If the science case is to find peculiar stars, a larger coverage is needed to avoid the inaccurate extrapolation of stars with extreme abundances. In practice, large collaborations likely unite all of these goals, and a compromise has to be struck among the different approaches. For future analyses, a possible solution could therefore be to follow a two-step approach of first running one generic neural network for all spectra and then using optimised neural networks – or full spectrum synthesis – on smaller target samples of specific science cases.

8.3.6. Quantitative performance of neural networks

Throughout the training of our neural networks, we optimised model parameters using a mean absolute error (MAE) loss function across the spectrum pixels. The MAE remains consistently below 0.01 for all neural networks, indicating high accuracy in model predictions, particularly for turn-off and most metal-poor stars where errors are typically below 0.001 (see Figs. 29 and 30).

Figure 29. Neural network performance shown as a function of $T_\mathrm{eff}$ vs. $\log g$ with each panel showing a different range of [Fe/H]. Colours indicate the mean absolute errors of the training (large circles) and validation (small circles) for the neural networks.

Despite these low average error rates, the performance of neural networks can vary significantly across different spectral regions. Errors are minimal in continuum areas but tend to increase around strong or strongly changing absorption features, such as those of lithium, which are discussed in Section 8.3.4. The neural network architecture does not track uncertainty for each weight and bias, limiting our ability to generate perturbed models for assessing the impact of interpolation uncertainties on derived parameters and abundances. Additionally, retraining networks with varied initial conditions to evaluate prediction stability is computationally intensive.

Figure 30. Histogram of the mean absolute errors for the neural networks. These were used as loss function during the training (blue) and validation (red) on seen and unseen spectra, respectively.

To gauge the practical impact of these uncertainties, we compared the MAE against the noise levels in the GALAH spectra. Errors significantly lower than the noise levels for stars above $T_\mathrm{eff} \,{\gt}\, 5\,000\,\mathrm{K}$ suggest that the interpolation inaccuracies minimally impact our analysis. However, for cooler stars with MAE around 0.01, interpolation inaccuracies could potentially influence precise chemical abundance studies more substantially.

Despite the high degree of accuracy achieved, the limitations outlined necessitate careful interpretation of derived parameters, especially in regions with significant absorption features (see Figs. 29 and 30).

8.4. Mismatch of spectroscopic and photometric information

8.4.1. Incorrect masses driving incorrect stellar parameters

We estimate masses and ages through isochrone matching, where stellar parameters (validated against photometric estimates) are known for not being fully consistent with spectroscopic values. We believe this leads to significant mismatches especially for stars close to the red clump. In this region, a small change in spectroscopic and photometric information can imply a significant change in inferred mass (e.g. from primary to secondary red clump, with the latter being 2 or more solar masses and thus significantly more than the usual $\sim$ 1 solar mass). This issue has only become noticeable after the production runs and we have therefore decided not to rerun this particular region of the parameter space for this data release. We have extensively tested the possible reasons and identified the mismatch of isochrones and actual stellar spectroscopic parameters as the cause. We have not been able to fully resolve this issue by either including a prior based on the initial mass function to weigh against massive stars (see e.g. Sharma et al. Reference Sharma2018) and move from likelihood-weighted to posterior mass estimates. Similarly, we have not been able to resolve these effects by artificially upscaling the spectroscopic uncertainties when calculating the likelihood-weighted masses. More work needs to be done to mitigate the current inconsistencies of theoretical isochrones and spectroscopic estimates.

Another solution for this particular region of the parameter space could be the use of chemical stellar evolution through the correlation of core and thus total mass with the ratio of [C/N] after the first dredge-up (Masseron & Gilmore Reference Masseron and Gilmore2015; Martig et al. Reference Martig2016), given that GALAH spectra also contain information on both elements. This could thus be used to better constrain high masses and counteract the information from isochrone-inferred masses. For this data release, the [C/N] information could at least serve as an indicator of how trustworthy high masses for giant stars are.

8.4.2. To use or not to use non-spectroscopic information?

The implementation of non-spectroscopic information, as done in our allstar module, has the advantage of overcoming spectroscopic degeneracies (as proven for the limited information on $\log g$ in the HERMES wavelength range) as well as improving accuracy and precision also for the lowest quality spectra (because $\log g$ is no longer solely dependent on the spectrum information).

However, this approach is only useful if the non-spectroscopic information is not biased (as it would be for astrometric and photometric information in the case of unresolved binarity). While the astrometric information for almost all GALAH targets is exquisite, this may not be the case for other surveys. The significant improvement from GALAH DR3 to GALAH DR4 has most definitely benefited from the improved astrometric information of Gaia EDR3 (Gaia Collaboration et al. 2021a; Lindegren et al. Reference Lindegren2021b) and Gaia DR3 (Gaia Collaboration et al. 2023) with respect to Gaia DR2 (Gaia Collaboration et al. 2018; Lindegren et al. Reference Lindegren2018). Further improvement could be expected when also taking Gaia’s photometric information into account, in addition to our use of 2MASS photometry.

8.5. Binaries

Although not part of this release, we have created an analysis module for spectroscopic binaries. The module will be presented in a separate work (Lach et al., in preparation) with a catalogue becoming a value-added catalogue of this release. The module is motivated by the extensive study of GALAH binary star spectra by Traven et al. (Reference Traven2020) and our ability to model the full spectrum via neural networks. We show a first analysis result of the module in Fig. 31, where the module was applied to a spectroscopic binary type 2 and resulted in a significantly better fit than the single star analysis.

Figure 31. Example spectrum for a double-lined spectroscopic binary star (SB2) that is better fitted with our binary fitting algorithm.

8.6. Globular clusters

Globular clusters are well known for their light element anti-correlations (i.e. the Na-O or Mg-Al anti-correlations), though the underlying cause remains a subject of debate (for recent reviews see Bastian & Lardo Reference Bastian and Lardo2018; Gratton et al. Reference Gratton2019; Milone & Marino Reference Milone and Marino2022). It is widely accepted that one population is enhanced in elements including He, N, Na and Al, and depleted in O and C. Previous GALAH data releases have encountered issues in removing trends between abundances and stellar parameters (as discussed in Section 6.2.1), and DR4 represents a marked decrease of scatter within the Kiel diagrams of the globular clusters (see Appendix C). Despite these improvements, light element abundance anti-correlations are still not well reproduced for DR4. We attribute this to two key factors: abundance zero points (Section 6.2.1) and the masking of spectra (Section 8.3.2).

Table C1 illustrates that both Na and Al have some of the largest zero-point shifts (−0.171 and −0.185), meaning that for some clusters the full extent of the anti-correlations is not realised (particularly for the light element enhanced populations). Secondly, when inspecting the optimal synthesis for particular lines (e.g. the Na lines at $5\,682.6$ and $5\,688.2\,$ Å, or the O triplet around $7\,770\,$ Å), the fits are poorly constrained, leading to a more significant scatter in these critical elements than what has previously been reported in the literature. We expect this is related to the relaxed restrictions on the neural network. Based on the abundances in their current form, we do not recommend using these light element abundances to distinguish between the multiple populations in globular clusters.

However, the 3D NLTE Li abundances discussed in Section 7.2.3 have effectively mitigated the above issues by adopting the GALAH stellar parameters and focusing exclusively on fitting the Li line. When analysing this Li data for the globular clusters, we can effectively reproduce the Li depletion patterns reported by Lind et al. (Reference Lind, Primas, Charbonnel, Grundahl and Asplund2009b) The large sample of clusters allows for a homogeneous study of Li depletion around the RGB bump, which will be detailed in McKenzie et al. (in preparation).

Dedicated observing programs have increased the average SNR for some clusters and expanded the sample to include additional clusters, such as M 22 (PI: M. McKenzie) and M 4 (PI: M. Howell). As discussed in Section 6.2.1, M 4 will be used to spectroscopically confirm whether the stars with lower asteroseismic masses belong to the light-element-enhanced population. This confirmation will be achieved through the re-analysis of Na, O, Mg, and Al lines, following the approach used for Li, since we advise against relying on these current light element abundances for globular cluster stars.

The cluster M 22, renowned for its bimodal s-process population (Marino et al. Reference Marino2011; McKenzie et al. Reference McKenzie2022, Reference McKenzie2024), was observed as a crucial test case to evaluate the GALAH pipeline’s ability to detect s-process abundance variations. While the pipeline successfully recovers the bimodal distribution, the scatter is larger than reported in previous studies. Additionally, as noted in Section 6.2.2, the precision of Eu measurements appears to have decreased between DR3 and DR4, particularly within globular cluster populations. Therefore, we recommend against using Eu from DR4 in future globular cluster publications.

Due to their low metallicity, globular clusters are particularly susceptible to the bug in the flag_fe_h discussed in Section 8.8.1. If this condition is relaxed, we recommend that all spectra and corresponding fits be manually inspected for quality before being included in any publications. Again, we reiterate that a boutique, custom reanalysis aiming to address these caveats in the globular cluster data will be the focus of upcoming work from McKenzie et al. (in preparation).

8.7. Fit optimisation

As described in Section 4, we are using the curve_fit function of scipy.optimize (Virtanen et al. Reference Virtanen2020) to fit synthetic spectra to observed spectra, whose optimisation can get stuck in local minima. We have tried to automatically identify regions of the parameter space where the scipy.optimize.curve_fit function has become stuck. In particular for some red clump stars as well as cool giant stars with $T_\mathrm{eff} \,{\lt}\, 3\,750\,\mathrm{K}$ and $\log g \,{\lt}\, 0.5$ (see Section 6.2.1), we have been able to recover a pattern of abundances that are stuck around their initial value. However, this pattern is not consistent enough to flag stars without a significant amount of false-positives. Because of the zero point corrections, these are shifted away from the usual initial guess of $0\,\mathrm{dex}$ depending on the element (see zero-points in Table C1).

Such a fitting failure would also be expected when applying The Payne (Ting et al. Reference Ting, Conroy, Rix and Cargile2019) with its similar default setup that adopts parameter bounds for the fitting parameters and thus employs the curve_fit function with the trust region reflective (trf) method. Given the common use of curve_fit, future pipelines should test a range of approaches to avoid this issue. Firstly, instead of using trf, the Dogbox (dogbox) method, could be used. The method is potentially slower but more reliable for complex parameter spaces. It could be used to randomly check the convergence of the trf method or be applied only to regions where multiple local minima are expected.

Moving away from the curve_fit function, the leastsq,Footnote ^ab minimize or the differential_evolution function of scipy’s optimize module could be used to test options of a more expensive but more extensive optimisation. Finally, multiple randomised initial starting guesses could be applied for curve_fit, but would multiply the computing costs linearly by the number of initial guesses.

The fitting optimisation and uncertainty estimation should be performed in a more sophisticated Bayesian framework that folds in photometric, astrometric, and asteroseismic information and their uncertainties. We have indeed implemented such a framework with a likelihood estimate from spectroscopic information and prior information based on photometric, astrometric, and asteroseismic estimates for test purposes. When implementing the resulting posterior into the Markov-Chain Monte-Carlo machinery of emcee (Foreman-Mackey et al. Reference Foreman-Mackey, Hogg, Lang and Goodman2013), we have not been able to limit the computational time (when fitting all labels) to a competitive level with curve_fit and thus not implemented this approach for the analysis of a million spectra. We note, however, that a future analysis should implement this approach – either with emcee or Monte Carlo nested sampling algorithms like UltraNest (Buchner Reference Buchner2021). Furthermore, we suggest to either separate the likelihood and posterior estimation steps (see e.g. Gent et al. Reference Gent2022) or limit the optimisation to only a few major stellar labels (see e.g. Traven et al. Reference Traven2020).

8.8. Reliability of flags

We have tried to develop a quality assurance pipeline that automatically flags results and stars that may not be adequately analysed with our assumptions.

8.8.1. Bug in flag_fe_h">Bug in flag_fe_h

The quality flag for iron abundance, flag_fe_h, was computed similarly to the elemental abundances, that is, by comparing the best-fitting spectrum with a spectrum with the lowest grid value of the neural network subgrids. In the case of [Fe/H], however, this is not the appropriate reference value. For example, for a star with $\mathrm{[Fe/H]} = -0.74\,\mathrm{dex}$ , the spectrum will be compared to a reference with $\mathrm{[Fe/H]} = -0.75\,\mathrm{dex}$ , which will appear essentially identical within the spectrum uncertainties, and the code concludes there are no spectral features that are significantly different. This has affected up to 34% of stars – most with detectable iron lines – and we therefore do not recommend the use of this flag at all. In the future, such a test should be performed with respect to an actually low (undetectable) amount of iron, such as $\mathrm{[Fe/H]} = -4\,\mathrm{dex}$ .

8.8.2. Fitting machinery stuck in local minimum

As laid out in Section 8.7, we have not been able to automatically flag all estimates for which our fitting machinery has become stuck in local minima, most notably at the initial value.

8.8.3. Binary or fast rotating star?

With the increasing number of turn-off stars as part of ongoing GALAH observations, we have tried to implement a more sensitive approach to identify binaries in this region. This may, however, mean that we have also introduced more false-positive detections of stars that are only fast rotating with higher $v \sin i$ , rather than being a binary system. We therefore suggest carefully considering using or neglecting the accompanying flag in GALAH DR4 (see Table 5).

8.9. Bug of interstellar K velocity

As mentioned in Section 6.1.3, rv_k_is in v240705 is reported relative to the stellar radial velocity. To compute the barycentric radial velocity of the measured interstellar K, rv_comp_1 has to be added to rv_k_is.

8.10. Summary of caveats

In summary, the most important caveats are:

Noding in $T_\mathrm{eff}$ , $\log g$ , and [Fe/H] around edges between neural networks: Our tests when switching between neural networks indicate that this effect for $T_\mathrm{eff}$ , $\log g$ , and [Fe/H] should stay within the precision uncertainties. A more problematic effect might be that some elements could be fitted as part of one neural network based on the detectability tests that were performed at the grid centres of each neural network.
Mismatches of photometry and spectroscopy: Both imperfect isochrone and spectrum models can drive a mismatch in the estimation of spectroscopic parameters. This is most notable around the secondary red clump region and also expected for highly extincted regions.
Imperfect synthesis leading to trends in cool stars: The unreliable line data in cool stars causes increasingly inaccurate models and inferred stellar properties towards the coolest stars (see Kos et al. Reference Kos2025). The coolest giant stars ( $T_\mathrm{eff} \,{\lt}\, 3\,750\,\mathrm{K}$ and $\log g \,{\lt}\, 0.5$ ) still have unreliable parameters.
Lower precision for Eu due to missing masking of neural networks.
The radial velocity of interstellar K has to be corrected from the stellar to barycentric frame by adding rv_comp_1.

These caveats are a by-product of our ambitious goal to enhance the accuracy and precision of stellar parameters and elemental abundances, while vastly expanding the number of stars for which we report measurements. Each region of the Hertzsprung–Russell diagram brings its own set of challenges, whether in the complex physics of evolved stars or the fine-tuned data analysis required for main-sequence stars. Yet, these efforts have culminated in the remarkable success of GALAH DR4, providing an incredibly rich and robust dataset for researchers. As we continue to explore the Galaxy, this data enables new discoveries and insights, but it is essential to take measurements and peculiar findings with a grain of salt—they may sometimes reflect the complexity of data analysis rather than intrinsic stellar properties. Despite these challenges, GALAH DR4 marks a significant leap forward, opening up exciting opportunities for the community to unravel the mysteries of our Galaxy.

9. Conclusions

The GALAH survey celebrates its 10th anniversary with the release of GALAH DR4, marking a decade of transformative contributions to our understanding of the Milky Way and the elemental composition of its stars. Over the years, GALAH has been pivotal in measuring and cataloguing the chemical fingerprints of stars, which serve as cosmic barcodes that reveal their formation histories, migration patterns, and the evolutionary processes that shaped our Galaxy.

With GALAH DR4, we have achieved notable advancements in the precision and accuracy of stellar parameters and elemental abundances for nearly a million stars. This release benefits from a decade of continuous development in spectroscopic techniques, calibration processes, and the adoption of cutting-edge models like 1D NLTE and even 3D NLTE synthesis for lithium abundances. The inclusion of photometric and astrometric information from Gaia DR3 has enhanced the reliability of stellar parameters, particularly for surface gravities, helping to resolve degeneracies of spectroscopic data. The unique value of GALAH lies in its detailed mapping of elements crucial to the studies of exoplanets and life as we know it. By tracking the abundances of carbon, nitrogen, and oxygen (CNO), rock-forming elements (e.g. Mg, Si, and Fe), as well as rare heavy elements used in modern electronics (e.g. Ce, La, and Nd), GALAH has provided key insights into how the building blocks of planets, life, and technology were forged in the interiors of stars and distributed throughout the Milky Way over billions of years.

In the last decade, 321 research outputs (176 of them refereed) have mentioned GALAH in their abstract.Footnote ^ac GALAH DR3 (Buder et al. Reference Buder2021), the predecessor of this data release, was by far the most cited paper of the Monthly Notices of the Royal Astronomical Society in 2021 at the time when this manuscript was published. GALAH DR3 has made significant contributions across several major research fields. In stellar physics and evolution, GALAH has expanded our understanding of stellar structures, nucleosynthesis (Sanders, Belokurov, & Man Reference Sanders, Belokurov and Man2021; Griffith et al. Reference Griffith2022), and lithium enrichment (Martell et al. Reference Martell2021; Simpson et al. Reference Simpson2021; Bouma et al. Reference Bouma, Curtis, Hartman, Winn and Bakos2021; Sayeed et al. Reference Sayeed2024; Wang et al. Reference Wang2024a). In galactic astronomy and archaeology, GALAH has mapped the Milky Way’s chemical and kinematic properties (e.g. Bland-Hawthorn et al. Reference Bland-Hawthorn2019; Sharma et al. Reference Sharma2021; Sharma et al. Reference Sharma2022), shedding light on its formation, dynamics, and past mergers (Buder et al. Reference Buder2022). The survey has also influenced planetary formation by examining the chemical environments of exoplanet host stars (Clark et al. Reference Clark2021; Soares-Furtado et al. Reference Soares-Furtado, Cantiello, MacLeod and Ness2021; Spaargaren et al. Reference Spaargaren, Wang, Mojzsis, Ballmer and Tackley2023; Wang et al. Reference Wang, Quanz, Mahadevan and Deal2024b), while deepening our knowledge of the Galaxy’s chemical evolution and complexity (Kos et al. Reference Kos2021), especially regarding neutron-capture and r-process elements (Matsuno et al. Reference Matsuno2021; Aguado et al. Reference Aguado2021; Horta et al. Reference Horta, Ness, Rybizki, Schiavon and Buder2022; Manea et al. Reference Manea2024). Additionally, GALAH has provided insights into open clusters and star formation across the Galactic disc (e.g. Spina et al. Reference Spina2021), with broader applications in extragalactic astronomy through a refined understanding of surviving structures of galaxy mergers and streams (Myeong et al. Reference Myeong2022; Buder et al. Reference Buder2022; Manea, Hawkins, & Maas Reference Manea, Hawkins and Maas2022) through its innovative chemical tagging techniques (Buder et al. Reference Buder2022; Buder, Mijnarends, & Buck Reference Buder, Mijnarends and Buck2024). In addition to its scientific discoveries, GALAH’s influence has always been mutually beneficial with both photometric (Huang et al. Reference Huang2021), asteroseismic (Zinn et al. Reference Zinn2022), and spectroscopic analyses (Nandakumar et al. Reference Nandakumar2022; Tsantaki et al. Reference Tsantaki2022; Soubiran, Brouillet, & Casamiquela Reference Soubiran, Brouillet and Casamiquela2022). GALAH information has aided the calibration and validation of surveys (Casagrande et al. Reference Casagrande2021; Katz et al. Reference Katz2023; Frémat et al. Reference Frémat2023) as well as the improvement of stellar ages by exploring chemical abundances (Hayden et al. Reference Hayden2022; Ratcliffe et al. Reference Ratcliffe2024) and combining spectroscopic and other data (Hayden et al. Reference Hayden2022; Sahlholdt, Feltzing, & Feuillet Reference Sahlholdt, Feltzing and Feuillet2022; Queiroz et al. Reference Queiroz2023). GALAH’s extensive observations also covered a range of rare or peculiar objects, such as variable stars (Jayasinghe et al. Reference Jayasinghe2021) or metal-poor stars (Da Costa et al. Reference Da Costa2023).

The next decade holds tremendous potential for further breakthroughs as GALAH continues its mission to observe and analyse stars across the Milky Way. With a clear goal of surpassing the 1 million star milestone, GALAH not only refines its data reduction and spectral analysis techniques but also paves the way for other ambitious surveys, such as SDSS-V (Kollmeier et al. Reference Kollmeier2017), 4MOST (de Jong et al. Reference de Jong2019), and WEAVE (Dalton et al. Reference Dalton2014) at similar spectral resolution or MSE (The MSE Science Team et al. 2019) and HRMOS (Magrini et al. Reference Magrini2023) at higher spectral resolution. As a trailblazer in the field of stellar spectroscopy, GALAH’s approach has set the standard for these upcoming surveys, and its legacy will be cemented by the release of one final data set that will address the caveats and challenges discussed in this fourth data release. GALAH will undoubtedly continue to influence not only planetary, stellar and galactic astronomy but also broaden our understanding of the cosmos and the elements that shape modern life.

Acknowledgements

We acknowledge the traditional owners of the land on which the AAT and ANU stand, the Gamilaraay, the Ngunnawal and the Ngambri peoples. We pay our respects to Elders, past and present, and are proud to continue their tradition of surveying the night sky in the Southern hemisphere.

We extend our heartfelt thanks to the entire staff at Siding Spring Observatory, both past and present, for their dedicated maintenance of 2dF-HERMES and invaluable support during the decade of observations. This project would not have been possible without the collective efforts of the many individuals who have contributed their expertise, time, and hard work, including Ashley Anderson, Paula Boubel, Rob Brookfield, James Cameron, Steve Chapman, Tony Farrell, Kristin Feigert, Andy Green, Doug Grey, Dionne Haynes, Steve Lee, Chris Lidman, Angel Lopez-Sanchez, Chris McCowage, Quentin Parker, Rob Patterson, Susan Patterson, Michael Andre Phillips, Chris Ramage, Murray Riding, Mike Sharrott, Andy Sheinis, Lee Spitler, Darren Stafford, Lew Waller, Fred Watson, Duncan Wright, as well as Tayyaba Zafar, and all those who played a part in making GALAH a success.

The operation of the AAT is funded by the AAT Consortium, which includes The Australian National University (operator), The University of New South Wales, The University of Sydney, Macquarie University, Western Sydney University, The University of Melbourne, Swinburne University, Monash University, The University of Queensland, The University of Southern Queensland and The University of Tasmania, with Astronomy Australia Limited (AAL) as Consortium Manager.

This work was supported by the Australian Research Council Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), through project number CE170100013. SB acknowledges support from the Australian Research Council under grant number DE240100150. SLM, DBZ and GFL acknowledge support from the Australian Research Council through Discovery Program grant DP220102254. SLM, BTM and KB acknowledge support from the UNSW Scientia Program. JK, GT and TZ acknowledge financial support of the Slovenian Research Agency (research core funding No. P1-0188) and the European Space Agency (PRODEX Experiment Arrangements No. 4000142234 and No. 4000143450). AMA acknowledges support from the Swedish Research Council (VR 2020-03940) and from the Crafoord Foundation via the Royal Swedish Academy of Sciences (CR 2024-0015).

Facilities

AAT with 2dF-HERMES at Siding Spring Observatory: AAT observations for this data release were performed under programs 2013B/13, 2014A/25, 2015A/3, 2015A/19, 2015B/1, 2015B/19, 2016A/22, 2016B/10, 2016B/12, 2017A/14, 2017A/18, 2017B/16, 2018A/18, 2018B/15, 2019A/1, 2019A/15, 2020B/14, 2020B/23, 2022B/02, 2022B/05, 2023A/04, 2023A/08, 2023A/09, 2023B/04, and 2023B/05.

AAO Data Central: This paper includes data that has been provided by AAO Data Central (datacentral.org.au) and makes use of services and code that have been provided by AAO Data Central.

Gaia: This work has made use of data from the European Space Agency (ESA) mission Gaia (http://www.cosmos.esa.int/gaia), processed by the Gaia Data Processing and Analysis Consortium (DPAC, http://www.cosmos.esa.int/web/gaia/dpac/consortium). Funding for the DPAC has been provided by national institutions, in particular the institutions participating in the Gaia Multilateral Agreement.

Other facilities: This publication makes use of data products from the Two Micron All Sky Survey (Skrutskie et al. Reference Skrutskie2006) and the CDS VizieR catalogue access tool (Ochsenbein, Bauer, & Marcout Reference Ochsenbein, Bauer and Marcout2000). This research was supported by computational resources provided by the Australian Government through the National Computational Infrastructure (NCI) under the National Computational Merit Allocation Scheme and the ANU Merit Allocation Scheme (project y89) and HPCAI Talent Programme Scholarship (project hl99).

Software

The research for this publication was coded in python (version 3.7.4) and included its packages astropy (v. 3.2.2; Astropy Collaboration et al. 2013; Astropy Collaboration et al. 2018), astroquery (v. 0.4; Ginsburg et al. 2019), corner (v. 2.0.1; Foreman-Mackey Reference Foreman-Mackey2016), galpy (version 1.6.0; Bovy Reference Bovy2015), IPython (v. 7.8.0; Pérez & Granger Reference Pérez and Granger2007), matplotlib (v. 3.1.3; Hunter Reference Hunter2007), NumPy (v. 1.17.2; Walt et al. 2011), scipy (version 1.3.1; Virtanen et al. Reference Virtanen2020), sklearn (v. 0.21.3; Pedregosa et al. Reference Pedregosa2011), We further made use of topcat (version 4.7; Taylor Reference Taylor2005);

Linelist

Our linelist, as described in Section 3.2, makes use of the following work: References: 1982ApJ…260.395C: Cardon et al. (Reference Cardon, Smith, Scalo, Testerman and Whaling1982), 1983MNRAS.204.883B|1989A&A…208.157G: Blackwell, Menon, & Petford (Reference Blackwell, Menon and Petford1983), Grevesse et al. (Reference Grevesse, Blackwell and Petford1989), 1990JQSRT.43.207C: Chang & Tang (Reference Chang and Tang1990), 1992A&A…255.457D: Davidson et al. (Reference Davidson, Snoek, Volten and Doenszelmann1992), 1993A&AS…99.179H: Hibbert et al. (Reference Hibbert, Biemont, Godefroid and Vaeck1993),

1993PhyS…48.297N: Nahar (Reference Nahar1993), 1998PhRvA.57.1652Y: Yan, Tambasco, & Drake (Reference Yan, Tambasco and Drake1998), 1999ApJS.122.557N: Nitz et al. (Reference Nitz, Kunau, Wilson and Lentz1999), 2008JPCRD.37.709K: Kelleher & Podobedova (Reference Kelleher and Podobedova2008),

2009A&A…497.611M: Meléndez & Barbuy (Reference Meléndez and Barbuy2009),

2014ApJS.211…20W: Wood et al. (Reference Wood, Lawler, Sneden and Cowan2014), 2014ApJS.215…20L: Lawler et al. (Reference Lawler2014), 2014ApJS.215…23D: Den Hartog et al. (Reference Den Hartog2014a), 2014MNRAS.441.3127R: Ruffoni et al. (Reference Ruffoni2014),

2015ApJS.220…13L: Lawler, Sneden, & Cowan (Reference Lawler, Sneden and Cowan2015),

2015ApJS.220…13L_1982ApJ…260.395C: Lawler et al. (Reference Lawler, Sneden and Cowan2015); Cardon et al. (Reference Cardon, Smith, Scalo, Testerman and Whaling1982), 2017MNRAS.471.532P: Palmeri et al. (Reference Palmeri2017), 2017PhRvA.95e2507T: Trubko et al. (Reference Trubko, Gregoire, Holmgren and Cronin2017), BGHL: Biemont et al. (Reference Biemont, Grevesse, Hannaford and Lowe1981), BIPS: Blackwell et al. (Reference Blackwell, Ibbetson, Petford and Shallis1979), BK: Bard & Kock (Reference Bard and Kock1994), BK+BWL: Bard & Kock (Reference Bard and Kock1994); O’Brian et al. (Reference O’Brian, Wickliffe, Lawler, Whaling and Brault1991), BK+GESB82d+BWL: Bard & Kock (Reference Bard and Kock1994), Blackwell et al. (Reference Blackwell, Petford and Simmons1982b), O’Brian et al. (Reference O’Brian, Wickliffe, Lawler, Whaling and Brault1991), BKK: Bard, Kock, & Kock (Reference Bard, Kock and Kock1991), BKK+GESB82c+BWL: Bard et al. (Reference Bard, Kock and Kock1991); Blackwell et al. (Reference Blackwell, Petford, Shallis and Simmons1982a); O’Brian et al. (Reference O’Brian, Wickliffe, Lawler, Whaling and Brault1991), BLNP: Blackwell-Whitehead et al. (Reference Blackwell-Whitehead2006), BWL: O’Brian et al. (Reference O’Brian, Wickliffe, Lawler, Whaling and Brault1991),

BWL+2014MNRAS.441.3127R: O’Brian et al. (Reference O’Brian, Wickliffe, Lawler, Whaling and Brault1991); Ruffoni et al. (Reference Ruffoni2014), BWL+GESHRL14: O’Brian et al. (Reference O’Brian, Wickliffe, Lawler, Whaling and Brault1991); Den Hartog et al. (Reference Den Hartog2014a), CB: Corliss & Bozman (Reference Corliss and Bozman1962), DLSSC: Den Hartog et al. (Reference Den Hartog, Lawler, Sobeck, Sneden and Cowan2011), FMW: Fuhr, Martin, & Wiese (Reference Fuhr, Martin and Wiese1988), GARZ|BL: Garz (Reference Garz1973); O’brian & Lawler (Reference O’brian and Lawler1991), GESB82c+BWL: Blackwell et al. (Reference Blackwell, Petford, Shallis and Simmons1982a); O’Brian et al. (Reference O’Brian, Wickliffe, Lawler, Whaling and Brault1991), GESB86: Blackwell et al. (Reference Blackwell, Booth, Menon and Petford1986), GESB86+BWL: Blackwell et al. (Reference Blackwell, Booth, Menon and Petford1986); O’Brian et al. (Reference O’Brian, Wickliffe, Lawler, Whaling and Brault1991), GESMCHF: Froese Fischer, Tachiev, & Irimia (Reference Froese Fischer, Tachiev and Irimia2006), Grevesse2015: Grevesse et al. (Reference Grevesse, Scott, Asplund and Sauval2015), HLSC: Den Hartog et al. (Reference Den Hartog, Lawler, Sneden and Cowan2003), K06: Kurucz (Reference Kurucz2006), K07: Kurucz (Reference Kurucz2007), K08: Kurucz (Reference Kurucz2008), K09: Kurucz (Reference Kurucz2009), K10: Kurucz (Reference Kurucz2010), K13: Kurucz (Reference Kurucz2013), K14: Kurucz (Reference Kurucz2014), KL-astro: astrophysical, KR|1989ZPhyD.11.287C: Kock & Richter (Reference Kock and Richter1968), Carlsson et al. (Reference Carlsson, Sturesson and Svanberg1989), LBS: Lawler et al. (Reference Lawler, Bonvallet and Sneden2001a), LD: Lawler & Dakin (Reference Lawler and Dakin1989), LD-HS: Lawler et al. (Reference Lawler, Den Hartog, Sneden and Cowan2006), LGWSC: Lawler et al. (Reference Lawler, Guzman, Wood, Sneden and Cowan2013), LSCI: Lawler et al. (Reference Lawler, Sneden, Cowan, Ivans and Den Hartog2009), LWHS: Lawler et al. (Reference Lawler, Wickliffe, den Hartog and Sneden2001b), MA-astro: astrophysical, MC: Meggers, Corliss, & Scribner (Reference Meggers, Corliss and Scribner1975), MFW: Martin, Fuhr, & Wiese (Reference Martin, Fuhr and Wiese1988), MRW: May, Richter, & Wichelmann (Reference May, Richter and Wichelmann1974), NIST: Ralchenko et al. (Reference Ralchenko, Kramida and Reader2010), NWL: Nitz, Wickliffe, & Lawler (Reference Nitz, Wickliffe and Lawler1998), PQWB: Palmeri et al. (Reference Palmeri, Quinet, Wyart and Biémont2000), RU: Raassen & Uylings (Reference Raassen and Uylings1998), S: Smith (Reference Smith1988), SLS: Sobeck, Lawler, & Sneden (Reference Sobeck, Lawler and Sneden2007), SR: Smith & Raggett (Reference Smith and Raggett1981), VGH: Vaeck, Godefroid, & Hansen (Reference Vaeck, Godefroid and Hansen1988), WLSC: Wood et al. (Reference Wood, Lawler, Sneden and Cowan2013), WSL: Wickliffe, Salih, & Lawler (Reference Wickliffe, Salih and Lawler1994).

Data availability

See Section 7.

Appendix A. Initial Parameters

We append the overview of the initial and final stellar parameters of GALAH DR4 in Fig. A1. We show the density distribution of $\log g$ , [Fe/H], $v_\mathrm{mic}$ , and $v \sin i$ in each row as a function of $T_\mathrm{eff}$ .

Figure A1. Comparison of final GALAH DR4 stellar parameters (first column) against the initial parameters used in the allstar analysis (second column), estimates from the GALAH DR4 reduction pipeline (third column), Gaia DR3 (fourth column with $v_\mathrm{mic}$ based on the adjusted formula from Dutra-Ferreira et al. Reference Dutra-Ferreira, Pasquini, Smiljanic, Porto de Mello and Steffen2016), and GALAH DR3 (fifth column).

Appendix B. Data Products

We append examples of data products of GALAH DR4 that were not already shown in the main manuscript. Table B1 shows a shortened table schema of the allstar and allspec catalogues. Fig. B1 shows the automatically created fit comparison of the allstar module for Vesta (210115002201239). Fig. B2 shows examples of the covariance matrices for Vesta and Arcturus, as representative examples for main-sequence and giant stars.

Table B1. Table schema of the GALAH DR4 main catalogues. Columns that are part of allspec, but not allstar are listed below the middle line. For compactness, we have combined repetitive columns (for example with integers N). Detailed table schemas are available in the FITS headers of each catalogue file.

Figure B1. Example output of the allstar analysis for Vesta (210115002201239). The observed flux (black) is compared with the fitted model flux (red), and the residuals (purple) show the difference between the observed and modelled spectra. Important spectral lines are annotated with their corresponding elements, with element groups colour-coded for clarity. Blue-shaded regions represent the 5% of the spectrum that was masked and excluded from the fit to avoid contamination from outliers or poorly modelled lines.

Figure B2. Covariance matrices for labels for Vesta (panel a) and Arcturus (panel b).

Appendix C. Stellar Parameter and Abundance Validation

Stellar parameter and abundance zero-points of the allstar module are listed in Table C1. A complete table, including the zero-points for the allspec module can be found as FITS file in the online repository. A compromise between the different accuracy abundance indicators is shown in Fig. C1 for the allstar module. Fig. C2 shows the precision of individual abundances. Figs. C3 and C4 show the remaining comparisons with APOGEE DR17 in addition to Figs. 21 and 22. Although many of the observed globular clusters are expected to show an abundance spread, including for iron, we show a collage of globular clusters with ascending iron abundance in Fig. C5, with each panel indicating the median iron abundance per cluster as well as the spread (scatter) of the iron abundance distribution and the average measurement uncertainty. A more comprehensive analysis of the globular clusters will be presented in upcoming work (McKenzie et al., in preparation). Finally, Fig. C6 shows the distribution of flagged stars in the Kiel diagram.

Table C1. Zero point estimates and corrections applied to the allstar measurements. We used Prša et al. (Reference Prša2016) as reference for Solar parameters and Grevesse et al. (Reference Grevesse, Asplund and Sauval2007), consistent with the marcs model atmosphere composition (Gustafsson et al. Reference Gustafsson2008), as reference for Solar abundances. For reference, we also show the combined rotational and macroturbulence as well as microturbulence velocities from Jofré et al. (Reference Jofré2014). Values for Vesta indicate our uncorrected measurements for the Vesta spectrum.

Figure C1. zero-point estimates of elemental abundances for GALAH DR4. Each panel shows the comparison to literature (DR4 – literature) for Vesta (blue), Gaia FKG Benchmark Stars (orange), Stars with $\vert \mathrm{[Fe/H]} \vert \leq 0.1$ closer than $D_\varpi \,{\lt}\, 0.5\,\mathrm{kpc}$ (red), as well as stars that were also observed by APOGEE DR17 (purple).

Figure C2. Precision monitoring (with a median line and standard deviation shading) of elemental abundances as a function of SNR for the green CCD2 across for GALAH DR4. Each panel shows the behaviour for bins of width 10 for the scatter of repeat observations of the allspec runs (blue) as well as covariance uncertainties of allspec (orange) and allstar (red) setups.

Figure C3. Comparison of stars with available measurements in GALAH DR3 (left column), GALAH DR4 (middle column) and APOGEE DR17 (right) for O, Na, Al, Si, K, and Ca.

Figure C4. Continuation of Fig. C3 for Ti, V, Cr, Mn, Co, and Ce.

Figure C5. Collage of globular clusters in the $T_\mathrm{eff}$ - $\log g$ space, coloured by stellar metallicity [Fe/H]. There are only minor trends between [Fe/H] and $T_\mathrm{eff}$ , even for the horizontal branch stars in NGC 288, NGC 6656 (M22), and NGC 6121 (M4). NGC 5139 ( ${\unicode{x03C9}}$ Cen) shows a significant range in [Fe/H]. RMS scatter and median metallicity uncertainties for each cluster are given in the lower right of each panel.

Figure C6. Parameter overview of stars with raised major quality flag flag_sp for allstar. Each panel shows the logarithmic density distribution of stars in the $T_\mathrm{eff}$ and $\log g$ plane with blue colourmaps. A PARSEC isochrone with $\mathrm{[M/H]}=0$ and $\tau = 4.5\,\mathrm{Gyr}$ is overplotted in orange and the same mass binary main-sequence (shifted from the single star one by $\Delta \log g = -0.3\,\mathrm{dex}$ ) is shown in red. Panel heads denote the bit mask and its description as well as how many times the flag was raised. We neglect distributions with no flag (0), for flags which have not been raised (8,9,11), and for which no results were available (15).

Footnotes

^a Li, C, N, O, Na, Mg, Al, Si, K, Ca, Sc, Ti, V, Cr, Mn, Fe, Co, Ni, Cu, Zn, Rb, Sr, Y, Zr, Mo, Ru, Ba, La, Ce, Nd, Sm, and Eu.

^b For stars without parallaxes, we only perform an analysis without astrometric information.

^c This number is chosen to match the 28 CPUs of our computing nodes.

^d $\mu \in [0.96, 0.89, 0.8, 0.71, 0.6, 0.46, 0.27]$ .

^e While repeat observations were only done for quality assurance in GALAH Phase 1, a significant number of repeat observations was performed as part of Phase 2.

^f GALAH_DR4/spectrum_analysis/galah_dr4_initial_parameters.ipynb.

^g Example masks can be found in the GALAH DR4 repository here.

^h The list is available in the GALAH DR4 repository here.

ⁱ In line with Nissen (Reference Nissen2015), Nissen et al. (Reference Nissen2020), we refer to these non-spectroscopically constrained surface gravities as photometric ones.

^j We assume ${[{\unicode{x03B1}}/\textrm{Fe}]} = 0.4$ for $\mathrm{[Fe/H]} \,{\lt}\, -1$ , ${[{\unicode{x03B1}}/\textrm{Fe}]} = 0.0$ for $\mathrm{[Fe/H]} \,{\gt}\, 0$ and linearly interpolate between these points for $-1 \leq \mathrm{[Fe/H]} \leq 0$ .

^k In v240705, rv_comp_1 has to be added to rv_k_is due to a bug.

^l https://github.com/svenbuder/GALAH_DR4/tree/main/validation.

^m *precision_correction_factors* in https://github.com/svenbuder/GALAH_DR4/tree/main/catalogs.

ⁿ galah_dr4_vac_wise_tmass_gaiadr3.

^o gaiadr1.tmass_original_valid.

^p gaiadr3.tmass_psc_xsc_best_neighbour.

^q gaiadr3.tmass_psc_xsc_join.

^r gaiadr1.allwise_original_valid.

^s gaiadr3.allwise_best_neighbour.

^t external.gaiaedr3_distance.

^u galah_dr4_vac_dynamics.

^v Accessible in the GALAH DR4 repository here.

^w galah_dr4_vac_3dnlte_a_li.

^x Note that these 1D NLTE Li abundances are different from the 1D NLTE Li abundances published in allstar.

^y observations/YYMMDD/spectra/com/sobject_id*.fits.

^z analysis_products_single/YYMMDD/sobject_id/.

^aa analysis_products_allstar/YYMMDD/sobject_id/.

^ab leastsq is used for example by The Cannon version by Casey et al. (Reference Casey2016).

^ac A total of 1 539 astronomical research outputs mentions (1 193 refereed) mentioned GALAH throughout their manuscript.

References

Abdurro’uf, Accetta, K., Aerts, C., et al. 2022, ApJS, 259, 35Google Scholar

Aguado, D. S., et al. 2021, ApJ, 908, L8 Google Scholar

Amarsi, A. M., & Asplund, M. 2017, MNRAS, 464, 264Google Scholar

Amarsi, A. M., Barklem, P. S., Asplund, M., Collet, R., & Zatsarinny, O. 2018a, A&A, 616, A89Google Scholar

Amarsi, A. M., Barklem, P. S., Collet, R., Grevesse, N., & Asplund, M. 2019a, A&A, 624, A111Google Scholar

Amarsi, A. M., et al. 2020a, A&A, 636, A120Google Scholar

Amarsi, A. M., Liljegren, S., & Nissen, P. E. 2022, A&A, 668, A68Google Scholar

Amarsi, A. M., Nissen, P. E., & Skúladóttir, Á. 2019b, A&A, 630, A104Google Scholar

Amarsi, A. M., et al. 2018b, A&A, 615, A139Google Scholar

Amarsi, A. M., et al. 2020b, A&A, 642, A62Google Scholar

Andrae, R., et al. 2023, A&A, 674, A27Google Scholar

Asplund, M. 2005, ARA&A, 43, 481Google Scholar

Astropy Collaboration, et al. 2013, A&A, 558, A33Google Scholar

Astropy Collaboration, et al. 2018, AJ, 156, 123Google Scholar

Bailer-Jones, C. A. L., Rybizki, J., Fouesneau, M., Demleitner, M., & Andrae, R. 2021, AJ, 161, 147Google Scholar

Bailer-Jones, C. A. L., Rybizki, J., Fouesneau, M., Mantelet, G., & Andrae, R. 2018, AJ, 156, 58Google Scholar

Bard, A., Kock, A., & Kock, M. 1991, A&A, 248, 315, (BKK)Google Scholar

Bard, A., & Kock, M. 1994, A&A, 282, 1014, (BK)Google Scholar

Barden, S. C., et al. 2010, SPIE, 7735, 09Google Scholar

Bastian, N., & Lardo, C. 2018, ARA&A, 56, 83Google Scholar

Baumgardt, H., & Vasiliev, E. 2021, MNRAS, 505, 5957Google Scholar

Bedell, M., et al. 2018, ApJ, 865, 68Google Scholar

Beeson, K. L., et al. 2024, MNRAS, 529, 2483Google Scholar

Belokurov, V., Erkal, D., Evans, N. W., Koposov, S. E., & Deason, A. J. 2018, MNRAS, 478, 611Google Scholar

Bensby, T., Feltzing, S., & Oey, M. S. 2014, A&A, 562, A71Google Scholar

Bergemann, M., et al. 2016, A&A, 594, A120Google Scholar

Bergemann, M., et al. 2019, A&A, 631, A80Google Scholar

Biemont, E., Grevesse, N., Hannaford, P., & Lowe, R. M. 1981, ApJ, 248, 867, (BGHL)Google Scholar

Binney, J. 2012, MNRAS, 426, 1324Google Scholar

Blackwell, D. E., Booth, A. J., Menon, S. L. R., & Petford, A. D. 1986, MNRAS, 220, 289Google Scholar

Blackwell, D. E., Ibbetson, P. A., Petford, A. D., & Shallis, M. J. 1979, MNRAS, 186, 633, (BIPS)Google Scholar

Blackwell, D. E., Menon, S. L. R., & Petford, A. D. 1983, MNRAS, 204, 883Google Scholar

Blackwell, D. E., Petford, A. D., Shallis, M. J., & Simmons, G. J. 1982a, MNRAS, 199, 43Google Scholar

Blackwell, D. E., Petford, A. D., & Simmons, G. J. 1982b, MNRAS, 201, 595Google Scholar

Blackwell-Whitehead, R. J., et al. 2006, MNRAS, 373, 1603, (BLNP)Google Scholar

Bland-Hawthorn, J., & Gerhard, O. 2016, ARA&A, 54, 529Google Scholar

Bland-Hawthorn, J., et al. 2019, MNRAS, 486, 1167Google Scholar

Botnen, A., & Carlsson, M. 1999, in Astrophysics and Space Science Library, Vol. 240, Numerical Astrophysics, ed. Miyama, S. M., Tomisaka, K., & Hanawa, T., 379Google Scholar

Bouma, L. G., Curtis, J. L., Hartman, J. D., Winn, J. N., & Bakos, G. Á. 2021, AJ, 162, 197Google Scholar

Bovy, J. 2015, ApJS, 216, 29Google Scholar

Bressan, A., et al. 2012, MNRAS, 427, 127Google Scholar

Brzeski, J., Case, S., & Gers, L. 2011, SPIE, 8125, 04Google Scholar

Buchner, J. 2021, JOSS, 6, 3001Google Scholar

Buder, S., Mijnarends, L., & Buck, T. 2024, MNRAS, 532, 1010Google Scholar

Buder, S., et al. 2021, MNRAS, 506, 150Google Scholar

Buder, S., et al. 2022, MNRAS, 510, 2407Google Scholar

Cantat-Gaudin, T., & Anders, F. 2020, A&A, 633, A99Google Scholar

Cardelli, J. A., Clayton, G. C., & Mathis, J. S. 1989, ApJ, 345, 245Google Scholar

Cardon, B. L., Smith, P. L., Scalo, J. M., Testerman, L., & Whaling, W. 1982, ApJ, 260, 395Google Scholar

Carlsson, J., Sturesson, L., & Svanberg, S. 1989, ZPhAMC, 11, 287Google Scholar

Carretta, E., Bragaglia, A., Gratton, R., & Lucatello, S. 2009a, A&A, 505, 139Google Scholar

Carretta, E., et al. 2009b, A&A, 505, 117Google Scholar

Casagrande, L., & VandenBerg, D. A. 2018, MNRAS, 475, 5023Google Scholar

Casagrande, L., et al. 2021, MNRAS, 507, 2684Google Scholar

Casey, A. R., et al. 2016, ArXiv e-prints, arXiv:1603.03040Google Scholar

Chang, T. N., & Tang, X. 1990, J. Quant. Spec. Radiat. Transf., 43, 207Google Scholar

Clark, J. T., et al. 2021, MNRAS, 504, 4968Google Scholar

Corliss, C. H., & Bozman, W. R. 1962, NBS Monograph, Vol. 53, Experimental Transition Probabilities for Spectral Lines of Seventy Elements (US Government Printing Office), (CB)Google Scholar

Cutri, R. M., et al. 2014, VizieR Online Data Catalog, 2328Google Scholar

Da Costa, G. S., et al. 2023, MNRAS, 520, 917Google Scholar

Dalton, G., et al. 2014, SPIE, 9147, 0LGoogle Scholar

Davidson, M. D., Snoek, L. C., Volten, H., & Doenszelmann, A. 1992, A&A, 255, 457Google Scholar

de Jong, R. S., et al. 2019, Msngr, 175, 3Google Scholar

Den Hartog, E. A., Lawler, J. E., Sneden, C., & Cowan, J. J. 2003, ApJS, 148, 543, (HLSC)Google Scholar

Den Hartog, E. A., Lawler, J. E., Sobeck, J. S., Sneden, C., & Cowan, J. J. 2011, ApJS, 194, 35, (DLSSC)Google Scholar

Den Hartog, E. A., et al. 2014a, ApJS, 215, 23Google Scholar

Dutra-Ferreira, L., Pasquini, L., Smiljanic, R., Porto de Mello, G. F., & Steffen, M. 2016, A&A, 585, A75Google Scholar

Evans, D. W., et al. 2018, A&A, 616, A4Google Scholar

Farrell, T. J., et al. 2014, SPIE, 9152, 23Google Scholar

Foreman-Mackey, D. 2016, JOSS, 1, 24Google Scholar

Foreman-Mackey, D., Hogg, D. W., Lang, D., & Goodman, J. 2013, PASP, 125, 306Google Scholar

Fouesneau, M., et al. 2023, A&A, 674, A28Google Scholar

Frémat, Y., et al. 2023, A&A, 674, A8Google Scholar

Froese Fischer, C., Tachiev, G., & Irimia, A. 2006, ADNDT, 92, 607Google Scholar

Fuhr, J. R., Martin, G. A., & Wiese, W. L. 1988, JPCRD, 17, (FMW)Google Scholar

Gaia Collaboration, et al. 2021a, A&A, 649, A1Google Scholar

Gaia Collaboration, et al. 2023, A&A, 674, A1Google Scholar

Gallagher, A. J., et al. 2020, A&A, 634, A55Google Scholar

Garz, T. 1973, A&A, 26, 471, (GARZ)Google Scholar

Gent, M. R., et al. 2022, A&A, 658, A147Google Scholar

Gilmore, G., et al. 2022, A&A, 666, A120Google Scholar

Ginsburg, A., et al. 2019, AJ, 157, 98Google Scholar

Gratton, R., et al. 2019, A&A Rev., 27, 8Google Scholar

Grevesse, N., Asplund, M., & Sauval, A. J. 2007, Space Sci. Rev., 130, 105Google Scholar

Grevesse, N., Blackwell, D. E., & Petford, A. D. 1989, A&A, 208, 157Google Scholar

Grevesse, N., Scott, P., Asplund, M., & Sauval, A. J. 2015, A&A, 573, A27Google Scholar

Griffith, E. J., et al. 2022, ApJ, 931, 23Google Scholar

Gustafsson, B., et al. 2008, A&A, 486, 951Google Scholar

Hayden, M. R., et al. 2022, MNRAS, 517, 5325Google Scholar

Heijmans, J., et al. 2012, SPIE, 8446, 0WGoogle Scholar

Heiter, U., et al. 2015, A&A, 582, A49Google Scholar

Heiter, U., et al. 2021, A&A, 645, A106Google Scholar

Helmi, A., et al. 2018, Nature, 563, 85Google Scholar

Hibbert, A., Biemont, E., Godefroid, M., & Vaeck, N. 1993, A&AS, 99, 179Google Scholar

Hinkle, K., Wallace, L., Valenti, J., & Harmer, D., eds. 2000, Visible and Near Infrared Atlas of the Arcturus Spectrum, 3727-9300 Å (Astron. Soc. Pac.)Google Scholar

Ho, A. Y. Q., et al. 2017, ApJ, 836, 5Google Scholar

Hoeijmakers, H. J., et al. 2015, A&A, 575, A20Google Scholar

Hon, M., et al. 2021, ApJ, 919, 131Google Scholar

Horta, D., Ness, M. K., Rybizki, J., Schiavon, R. P., & Buder, S. 2022, MNRAS, 513, 5477Google Scholar

Hourihane, A., et al. 2023, A&A, 676, A129Google Scholar

Howell, M., Campbell, S. W., Stello, D., & De Silva, G. M. 2022, MNRAS, 515, 3184Google Scholar

Howell, S. B., et al. 2014, PASP, 126, 398Google Scholar

Huang, Y., et al. 2021, ApJ, 907, 68Google Scholar

Hunter, J. D. 2007, CSE, 9, 90Google Scholar

Jayasinghe, T., et al. 2021, MNRAS, 503, 200Google Scholar

Jofré, P., et al. 2018, RNAAS, 2, 152Google Scholar

Jofré, P., et al. 2014, A&A, 564, A133Google Scholar

Jofré, P., et al. 2015, A&A, 582, A81Google Scholar

Jofré, P., et al. 2017, A&A, 601, A38Google Scholar

Johnson, C. I., & Pilachowski, C. A. 2010, ApJ, 722, 1373Google Scholar

Jönsson, H., et al. 2020, AJ, 160, 120Google Scholar

Katz, D., et al. 2023, A&A, 674, A5Google Scholar

Kawka, A., et al. 2020, MNRAS, 495, L129Google Scholar

Kelleher, D. E., & Podobedova, L. I. 2008, JPCRD, 37, 709Google Scholar

Kobayashi, C., Karakas, A. I., & Lugaro, M. 2020, ApJ, 900, 179Google Scholar

Kock, M., & Richter, J. 1968, ZAp, 69, 180, (KR)Google Scholar

Kollmeier, J. A., et al. 2017, arXiv e-prints, arXiv:1711.03234 Google Scholar

Kos, J., et al. 2018, MNRAS, 480, 5475Google Scholar

Kos, J., et al. 2021, MNRAS, 506, 4232Google Scholar

Kos, J., et al. 2025, arXiv e-prints, arXiv:2501.06140 Google Scholar

Kurucz, R. L. 2006, Database of observed and predicted atomic transitionsGoogle Scholar

Kurucz, R. L. 2007, Database of observed and predicted atomic transitionsGoogle Scholar

Kurucz, R. L. 2008, Database of observed and predicted atomic transitionsGoogle Scholar

Kurucz, R. L. 2009, Database of observed and predicted atomic transitionsGoogle Scholar

Kurucz, R. L. 2010, Database of observed and predicted atomic transitionsGoogle Scholar

Kurucz, R. L. 2013, Database of observed and predicted atomic transitionsGoogle Scholar

Kurucz, R. L. 2014, Database of observed and predicted atomic transitionsGoogle Scholar

Lawler, J. E., Bonvallet, G., & Sneden, C. 2001a, Astrophys. J., 556, 452, (LBS)Google Scholar

Lawler, J. E., & Dakin, J. T. 1989, JOSAB, 6, 1457, (LD)Google Scholar

Lawler, J. E., Den Hartog, E. A., Sneden, C., & Cowan, J. J. 2006, ApJS, 162, 227, (LD-HS)Google Scholar

Lawler, J. E., Guzman, A., Wood, M. P., Sneden, C., & Cowan, J. J. 2013, ApJS, 205, 11Google Scholar

Lawler, J. E., Sneden, C., & Cowan, J. J. 2015, ApJS, 220, 13Google Scholar

Lawler, J. E., Sneden, C., Cowan, J. J., Ivans, I. I., & Den Hartog, E. A. 2009, ApJS, 182, 51, (LSCI)Google Scholar

Lawler, J. E., Wickliffe, M. E., den Hartog, E. A., & Sneden, C. 2001b, ApJ, 563, 1075, (LWHS)Google Scholar

Lawler, J. E., et al. 2014, ApJS, 215, 20Google Scholar

Leenaarts, J., & Carlsson, M. 2009, ASPC, 415, 87Google Scholar

Lewis, A. 2019, arXiv e-prints, arXiv:1910.13970 Google Scholar

Lewis, I. J., et al. 2002, MNRAS, 333, 279Google Scholar

Lind, K., Asplund, M., & Barklem, P. S. 2009a, A&A, 503, 541Google Scholar

Lind, K., Asplund, M., Barklem, P. S., & Belyaev, A. K. 2011, A&A, 528, A103Google Scholar

Lind, K., Primas, F., Charbonnel, C., Grundahl, F., & Asplund, M. 2009b, A&A, 503, 545Google Scholar

Lindegren, L., et al. 2018, A&A, 616, A2Google Scholar

Lindegren, L., et al. 2021a, A&A, 649, A4Google Scholar

Lindegren, L., et al. 2021b, A&A, 649, A2Google Scholar

Mackereth, J. T., & Bovy, J. 2018, PASP, 130, 114501Google Scholar

Magrini, L., et al. 2023, arXiv e-prints, arXiv:2312.08270 Google Scholar

Majewski, S. R., Zasowski, G., & Nidever, D. L. 2011, ApJ, 739, 25Google Scholar

Manea, C., Hawkins, K., & Maas, Z. G. 2022, MNRAS, 511, 2829Google Scholar

Manea, C., et al. 2024, ApJ, 972, 69Google Scholar

Mann, A. W., Gaidos, E., Lépine, S., & Hilton, E. J. 2012, ApJ, 753, 90Google Scholar

Marigo, P., et al. 2017, ApJ, 835, 77Google Scholar

Marino, A. F., et al. 2011, A&A, 532, A8Google Scholar

Martell, S. L., et al. 2017, MNRAS, 465, 3203Google Scholar

Martell, S. L., et al. 2021, MNRAS, 505, 5340Google Scholar

Martig, M., et al. 2016, MNRAS, 456, 3655Google Scholar

Martin, G., Fuhr, J., & Wiese, W. 1988, JPhChRDS, 17Google Scholar

Masseron, T., & Gilmore, G. 2015, MNRAS, 453, 1855Google Scholar

Matsuno, T., et al. 2021, A&A, 650, A110Google Scholar

May, M., Richter, J., & Wichelmann, J. 1974, A&AS, 18, 405, (MRW)Google Scholar

McKemmish, L. K., et al. 2019, MNRAS, 488, 2836Google Scholar

McKenzie, M., et al. 2022, MNRAS, 516, 3515Google Scholar

McKenzie, M., et al. 2024, MNRAS, 527, 7940Google Scholar

McMillan, P. J. 2017, MNRAS, 465, 76Google Scholar

Meggers, W. F., Corliss, C. H., & Scribner, B. F. 1975, Tables of spectral-line intensities. Part I, II_- arranged by elements. (NBS), (MC)Google Scholar

Meléndez, J., & Barbuy, B. 2009, A&A, 497, 611Google Scholar

Mészáros, S., Allende Prieto, C., Edvardsson, B., et al. 2012, AJ, 144, 120Google Scholar

Milone, A. P., & Marino, A. F. 2022, Universe, 8, 359Google Scholar

Miszalski, B., Shortridge, K., Saunders, W., Parker, Q. A., & Croom, S. M. 2006, MNRAS, 371, 1537Google Scholar

Monty, S., et al. 2023, MNRAS, 518, 965Google Scholar

Myeong, G. C., et al. 2022, ApJ, 938, 21Google Scholar

Nahar, S. N. 1993, PhS, 48, 297Google Scholar

Nandakumar, G., et al. 2022, MNRAS, 513, 232Google Scholar

Ness, M., Hogg, D. W., Rix, H.-W., Ho, A. Y. Q., & Zasowski, G. 2015, ApJ, 808, 16Google Scholar

Ness, M., et al. 2016, ApJ, 823, 114Google Scholar

Ness, M. K., et al. 2019, ApJ, 883, 177Google Scholar

Nissen, P. E. 2015, A&A, 579, A52Google Scholar

Nissen, P. E., et al. 2020, A&A, 640, A81Google Scholar

Nissen, P. E., & Gustafsson, B. 2018, A&A Rev., 26, 6Google Scholar

Nitz, D. E., Kunau, A. E., Wilson, K. L., & Lentz, L. R. 1999, ApJS, 122, 557Google Scholar

Nitz, D. E., Wickliffe, M. E., & Lawler, J. E. 1998, ApJS, 117, 313, (NWL)Google Scholar

Nordlander, T., & Lind, K. 2017, A&A, 607, A75Google Scholar

O’brian, T. R., & Lawler, J. E. 1991, PhRvA, 44, 7134, (BL)Google Scholar

O’Brian, T. R., Wickliffe, M. E., Lawler, J. E., Whaling, W., & Brault, J. W. 1991, JOSAB, 8, 1185, (BWL)Google Scholar

Ochsenbein, F., Bauer, P., & Marcout, J. 2000, A&AS, 143, 23Google Scholar

Osorio, Y., et al. 2015, A&A, 579, A53Google Scholar

Osorio, Y., Lind, K., Barklem, P. S., Allende Prieto, C., & Zatsarinny, O. 2019, A&A, 623, A103Google Scholar

Palmeri, P., et al. 2017, MNRAS, 471, 532Google Scholar

Palmeri, P., Quinet, P., Wyart, J., & Biémont, E. 2000, PhS, 61, 323, (PQWB)Google Scholar

Pedregosa, F., et al. 2011, J Mach Learn Res, 12, 2825Google Scholar

Pérez, F., & Granger, B. E. 2007, CSE, 9, 21Google Scholar

Piskunov, N., & Valenti, J. A. 2017, A&A, 597, A16Google Scholar

Prša, A., et al. 2016, AJ, 152, 41Google Scholar

Queiroz, A. B. A., et al. 2023, A&A, 673, A155Google Scholar

Raassen, A. J. J., & Uylings, P. H. M. 1998, A&A, 340, 300, (RU)Google Scholar

Rains, A. D., et al. 2021, MNRAS, 504, 5788Google Scholar

Rains, A. D., et al. 2024, MNRAS, 529, 3171Google Scholar

Ralchenko, Y., Kramida, A., Reader, J., & NIST ASD Team. 2010, NIST Atomic Spectra Database (ver. 4.0.0), [Online].Google Scholar

Ratcliffe, B., et al. 2024, MNRAS, 528, 3464Google Scholar

Recio-Blanco, A., et al. 2023, A&A, 674, A29Google Scholar

Reggiani, H., et al. 2019, A&A, 627, A177Google Scholar

Reid, M. J., & Brunthaler, A. 2004, ApJ, 616, 872Google Scholar

Ricker, G. R., et al. 2015, JATIS, 1, 014003Google Scholar

Riello, M., et al. 2021, A&A, 649, A3Google Scholar

Rimoldini, L., et al. 2023, A&A, 674, A14Google Scholar

Rix, H.-W., Ting, Y.-S., Conroy, C., & Hogg, D. W. 2016, ApJ, 826, L25Google Scholar

Ruffoni, M. P., et al. 2014, MNRAS, 441, 3127Google Scholar

Sahlholdt, C. L., Feltzing, S., & Feuillet, D. K. 2022, MNRAS, 510, 4669Google Scholar

Salaris, M., & Cassisi, S. 2006, Evolution of Stars and Stellar Populations (J. Wiley)Google Scholar

Sanders, J. L., Belokurov, V., & Man, K. T. F. 2021, MNRAS, 506, 4321Google Scholar

Saydjari, A. K., Uzsoy, A. S. M., Zucker, C., Peek, J. E. G., & Finkbeiner, D. P. 2023, ApJ, 954, 141Google Scholar

Sayeed, M., et al. 2024, ApJ, 964, 42Google Scholar

Schlegel, D. J., Finkbeiner, D. P., & Davis, M. 1998, ApJ, 500, 525Google Scholar

Schönrich, R., Binney, J., & Dehnen, W. 2010, MNRAS, 403, 1829Google Scholar

Sharma, S., et al. 2018, MNRAS, 473, 2004Google Scholar

Sharma, S., et al. 2019, MNRAS, 490, 5335Google Scholar

Sharma, S., et al. 2021, MNRAS, 506, 1761Google Scholar

Sharma, S., et al. 2022, MNRAS, 510, 734Google Scholar

Sheinis, A., et al. 2015, JATIS, 1, 035002Google Scholar

Simpson, J. D., et al. 2021, MNRAS, 507, 43Google Scholar

Skrutskie, M. F., et al. 2006, AJ, 131, 1163Google Scholar

Smith, G. 1988, JPhB, 21, 2827, (S)Google Scholar

Smith, G., & Raggett, D. S. J. 1981, JPhB, 14, 4015, (SR)Google Scholar

Smith, V. V., et al. 2013, ApJ, 765, 16Google Scholar

Soares-Furtado, M., Cantiello, M., MacLeod, M., & Ness, M. K. 2021, AJ, 162, 273Google Scholar

Sobeck, J. S., Lawler, J. E., & Sneden, C. 2007, ApJ, 667, 1267, (SLS)Google Scholar

Soubiran, C., Brouillet, N., & Casamiquela, L. 2022, A&A, 663, A4Google Scholar

Spaargaren, R. J., Wang, H. S., Mojzsis, S. J., Ballmer, M. D., & Tackley, P. J. 2023, ApJ, 948, 53Google Scholar

Spina, L., et al. 2016, A&A, 593, A125Google Scholar

Spina, L., et al. 2021, MNRAS, 503, 3279Google Scholar

Steinmetz, M., et al. 2020, AJ, 160, 82Google Scholar

Stello, D., et al. 2015, ApJ, 809, L3Google Scholar

Taylor, M. B. 2005, ASPC, 347, 2910.1016/j.lcats.2005.10.008CrossRef Google Scholar

The MSE Science Team, et al. 2019, arXiv e-prints, arXiv:1904.04907 Google Scholar

Ting, Y.-S., Conroy, C., & Rix, H.-W. 2016, ApJ, 826, 83Google Scholar

Ting, Y.-S., Conroy, C., Rix, H.-W., & Asplund, M. 2018, ApJ, 860, 159Google Scholar

Ting, Y.-S., Conroy, C., Rix, H.-W., & Cargile, P. 2019, ApJ, 879, 69Google Scholar

Ting, Y.-S., Freeman, K. C., Kobayashi, C., De Silva, G. M., & Bland-Hawthorn, J. 2012, MNRAS, 421, 1231Google Scholar

Torra, F., et al. 2021, A&A, 649, A10Google Scholar

Traven, G., et al. 2020, A&A, 638, A145Google Scholar

Trubko, R., Gregoire, M. D., Holmgren, W. F., & Cronin, A. D. 2017, Phys. Rev. A, 95, 052507Google Scholar

Tsantaki, M., et al. 2022, A&A, 659, A9510.1051/0004-6361/202141702CrossRef Google Scholar

Tsuji, T. 1976, PASJ, 28, 543Google Scholar

Vaeck, N., Godefroid, M., & Hansen, J. E. 1988, Phys. Rev. A, 38, 2830, (VGH)Google Scholar

Valenti, J. A., & Piskunov, N. 1996, A&AS, 118, 595Google Scholar

van Leeuwen, F. 2007, A&A, 474, 653Google Scholar

VandenBerg, D. A., et al. 2012, ApJ, 755, 15Google Scholar

Vasiliev, E., & Baumgardt, H. 2021, MNRAS, 505, 5978Google Scholar

Virtanen, P., et al. 2020, NM, 17, 261Google Scholar

Vogrinc̆ic̆, R., et al. 2023, MNRAS, 521, 3727Google Scholar

Walt, S. v. d., Colbert, S. C., & Varoquaux, G. 2011, CSE, 13, 22Google Scholar

Wang, E. X., et al. 2021, MNRAS, 500, 2159Google Scholar

Wang, E. X., et al. 2024a, MNRAS, 528, 5394Google Scholar

Wang, H. S., Quanz, S. P., Mahadevan, S., & Deal, M. 2024b, A&A, 688, A225Google Scholar

Wehrhahn, A., Piskunov, N., & Ryabchikova, T. 2023, A&A, 671, A171Google Scholar

Westendorp Plaza, C., Asensio Ramos, A., & Allende Prieto, C. 2023, A&A, 675, A19110.1051/0004-6361/202346372CrossRef Google Scholar

Wheeler, A. J., Abruzzo, M. W., Casey, A. R., & Ness, M. K. 2023, AJ, 165, 68Google Scholar

Wheeler, A. J., Casey, A. R., & Abruzzo, M. W. 2024, AJ, 167, 83Google Scholar

Wickliffe, M. E., Salih, S., & Lawler, J. E. 1994, J. Quant. Spec. Radiat. Transf., 51, 545, (WSL)Google Scholar

Wood, M. P., Lawler, J. E., Sneden, C., & Cowan, J. J. 2013, ApJS, 208, 27Google Scholar

Wood, M. P., Lawler, J. E., Sneden, C., & Cowan, J. J. 2014, ApJS, 211, 20Google Scholar

Xiang, M., et al. 2019, ApJS, 245, 34Google Scholar

Xiang, M., et al. 2022, A&A, 662, A66Google Scholar

Yan, Z.-C., Tambasco, M., & Drake, G. W. F. 1998, Phys. Rev. A, 57, 1652Google Scholar

Yong, D., et al. 2013, MNRAS, 434, 3542Google Scholar

Zhao, G., Zhao, Y.-H., Chu, Y.-Q., Jing, Y.-P., & Deng, L.-C. 2012, RAA, 12, 72310.1088/1674-4527/12/7/002CrossRef Google Scholar

Zinn, J. C., et al. 2020, ApJS, 251, 23Google Scholar

Zinn, J. C., et al. 2022, ApJ, 926, 191Google Scholar

Figure 1. Workflow of GALAH DR4.

Figure 2. Overview of the distribution of stars included in this fourth GALAH data release in Galactic coordinates with the centre of the Galaxy at the origin and the Gaia DR3 all-sky colour view (Gaia Collaboration et al., 2023) as background. Shown are the targets of GALAH Phase 1 (dark blue) and Phase 2 (medium blue), the targets of the K2-HERMES follow-up along the ecliptic and TESS-HERMES in the TESS Southern Continuous Viewing Zone as well as CoRoT fields (pink). Both open and globular cluster points are shown in purple and orange, respectively. All other targets are shown in in light blue across the Southern sky.

Table 2. Data product 1: FITS files of reduced spectra.

Figure 5. Coverage in $T_\mathrm{eff}$ and $\log g$ of the MARCS2014 grid (red) and GALAH DR3 (black, including density countours). Shown is also an example of one of the 3D bins used to create stellar sibling models with each neural network. marcs grid points $T_\mathrm{eff}$$ \,{\lt}\, 3\,100\,\mathrm{K}$ or [Fe/H]$\,{\lt}\,-3\,\mathrm{dex}$ are neglected for GALAH DR4.

Figure 6. Coverage of stellar parameters and abundances for one of the 3D bins. Shown is the example of the Solar 3D bin ($T_\mathrm{eff}\;/\;\mathrm{K} = 5\,750$, $\log g\;/\;\mathrm{dex} = 4.5$, $\mathrm{[Fe/H]}\;/\;\mathrm{dex} = 0.0$). Panel a): $T_\mathrm{eff}$ and $\log g$, Panel (b): [Fe/H] vs. A(Li), Panel (c): [Fe/H] vs. [O/Fe], Panel (d): [Fe/H] vs. [Mg/Fe]. While $T_\mathrm{eff}$, $\log g$, and [Fe/H] are sampled randomly within the 3D bin, the abundances are sampled both narrowly (blue) and broadly (purple) within limits as described in the text. Red points indicate the median label values and orange points the adjusted label values (see Table 3) to test the gradient change of spectra with individual labels.

Table 3. Example of boundaries for the uniform sampling of synthetic spectrum labels (stellar parameters and elemental abundances) for the 3-dimensional bin of Solar siblings 5750_4.50_0.00.

Figure 8. Example of normalisation for GALAH DR4 for a model spectrum ($T_\mathrm{eff} = 3\,400\,\mathrm{K}$, $\log g = 1.5$, $\mathrm{[Fe/H]} = -1.0\,\mathrm{dexbest-fitting }$) that is selected during the label optimisation. Panel (a): Observed spectrum (counts). Panel (b): Ratio (blue) of observed spectrum and model spectrum as well as Chebyshev polynomial fit (orange). Panel (c): Normalised observed spectrum (black) compared to the model spectrum (blue). Residuals (red) can then be used as input for the likelihood function.

Figure 9. Output of the radial velocity fitting step. Panel (a) shows the initial broad search on a $v_\mathrm{rad}$ array of $-1000..(2)..1000\,\mathrm{km\,s^{-1}}$. In the case of 2MASS J060846577815235, two peaks (yellow, dashed) are visible for this double-lined spectroscopic binary. Panel (b) shows the same plot, but overlaid with the GALAH DR4 reduction pipeline (red) and Gaia DR3 (blue, dashed) estimates for $v_\mathrm{rad}$. Panel (c) shows the narrow window of $-20.00..(0.04)..20.00\,\mathrm{km\,s^{-1}}$ around the highest peak and its Gaussian fit (yellow). Despite their low resolution (26 KB), these on-the-fly created diagnostic images already occupy 50GB in total.

Figure 11. Example of radial velocity evolution over modified Julian Date (vertical lines show the beginning of 2016, 2019, and 2022) for a single-lined spectroscopic binary (SB1).

Figure 12. Comparison of spectroscopic and photometric $\log g$ estimates in the allspec analysis. Panel (a) shows the distribution of spectroscopic $\log g$ and $T_\mathrm{eff}$ from the allspec module. Panel (b) shows the distribution of the same $T_\mathrm{eff}$ and photometric $\log g$. Panel (c) shows the difference of photometric $\log g$ and spectroscopic $\log g$ as a function of photometric $\log g$. Red error bars indicate the $1\sigma$ percentiles of this difference in $0.5\,\mathrm{dex}$ bins.

Figure 13. Example of three diffuse interstellar bands (DIBs) and interstellar K absorption for 2MASS J06453479-0102137 with an $E(B-V) = 0.84\,\mathrm{mag}$ value from Schlegel et al. (1998). Shown are the observation (black) and stellar fit (blue) as well as a Gaussian fit (red) to the residual (orange), resulting in an estimate of the equivalent width (EW) as well as radial velocity.

Figure 14. All-sky map (l,b) of GALAH DR4 equivalent width measurements of the diffuse interstellar band around 5 780 Å, with the GSPhot extinction by Andrae et al. (2023) in the background.

Figure 15. Example of a flagged emission star with clear emission in the Balmer lines (here H${\unicode{x03B1}}$).

Figure 16. Accuracy of the main stellar parameters $T_\mathrm{eff}$, $\log g$, [Fe/H], $v_\mathrm{mic}$, $v \sin i$, and $v_\mathrm{rad}$ for GALAH DR4. Each panel shows the comparison to literature (DR4 – literature) with median values as lines and contours between 16th and 84th percentiles. Comparisons are performed for the Gaia FGK Benchmark stars (red), APOGEE DR17 (blue), $\log g$ inferred from asteroseismic measurements (orange) and Gaia DR3 radial velocities (purple).

Figure 17. Comparison of iron abundances (16th, 50th and 84th percentiles) and overview of spectroscopic and photometric properties of globular cluster stars in GALAH DR4. Left panels show histograms of iron abundances from GALAH DR4 (blue) as well as literature estimates for the globular clusters from Giraffe (orange) and UVES (red) observations by Carretta et al. (2009a, b) as well as observations from Johnson & Pilachowski (2010). Middle panels show the spectroscopic $T_\mathrm{eff}$-$\log g$ diagrams coloured by iron abundance [Fe/H]. Right panels show the trend of GALAH DR4 [Fe/H] along the different $\log g$ values.

Figure 18. Comparison of radial velocities between GALAH DR4 allspec and Gaia DR3. Panel (a) shows the difference of radial velocities as function of Gaia G magnitude. Panel (b) shows a histogram of the difference with two Gaussian distributions (with same mean) fitted to them to estimate a more robust, binary independent, radial velocity difference. Panel (c) shows the difference of radial velocities as function of radial velocity, showing the systematic scatter introduced by binaries.

Figure 19. Chemical abundances [X/Fe] of Solar twin stars as a function of ages that were estimated as part of the mass and age estimation of the allstar spectrum analysis. We overplot linear fits to our age-abundance relations for Solar twins in orange and literature values from Bedell et al. (2018) in red. Panels also indicate the median and standard deviation with respect to Bedell et al. (2018) when assuming a correct age.

Figure 21. Comparison of stars with available measurements in GALAH DR4 and APOGEE DR17 for [C/Fe] and [N/Fe].

Figure 22. Comparison of stars with available measurements in GALAH DR3 (left), GALAH DR4 (middle) as well as APOGEE DR17 (right) for [Mg/Fe] (top row) and [Ni/Fe] (bottom row).

Table 5. List of major quality flag flag_sp listing the bit, description and how often the flag was raised for the allstar and allspec routines. Notes: Multiple bits can be raised for each of the 1 085 520 spectra of 917 588 stars.

Figure 23. Comparison of radial velocity estimates of GALAH DR4 and Gaia DR3. Panel (a) shows the difference of GALAH’s primary component radial velocity with the mean Gaia DR3. Panels (b) and (c) show stars for which two components were detected in GALAH DR4 and shows the difference between each component and Gaia DR3 against the difference of mean (roughly systemic) radial velocities. The panels also include regions where actual binaries and false positive detections are expected.

Table 6. List of elemental abundance quality flags flag_fe_h for [Fe/H] or flag_X_fe for element X.

Figure 24. Overview of stellar parameters and elemental abundances for the allstar estimates of GALAH DR4. The top left panel shows the density distribution of stars in the Kiel diagram of $T_\mathrm{eff}$ and $\log g$. All other panels show the logarithmic elemental abundances (for elements indicated in the top left of the panel) as a function of the logarithmic iron abundances [Fe/H]. Elements are coloured by different nucleosynthetic channels (black for big bang nucleosynthesis, blue for core-collapse supernovae, red for supernovae Type Ia, green for asymptotic giant branch star contributions and pink for the rapid neutron capture process with contributions from merging neutron stars) following the colour schema from Kobayashi et al. (2020). Percentages indicate the fraction of detections of stars for each element.

Figure 25. The ratio of [C/N] and isochrone masses in comparison panel (a), and as a function of $T_\mathrm{eff}$ and $\log g$ in panels (b) and (c), respectively.

Figure 26. Distribution of the dynamical properties of angular momentum $L_Z$ and radial action $J_R$ of stars in GALAH DR4 (black), with globular cluster members highlighted in colour. Cluster members were selected as those with more than 70 percent membership probability according to Vasiliev & Baumgardt (2021). The Sun is indicated with a red $\odot$ symbol.

Figure 27. Distribution of the dynamical properties of angular momentum $L_Z$ and orbital energy E of stars in GALAH DR4 (black), with globular cluster members highlighted in colour. Cluster members were selected as those with more than 70 percent membership probability according to Vasiliev & Baumgardt (2021). The Sun is indicated with a red $\odot$ symbol.

Figure 28. Mean EW binned in $T_\mathrm{eff}$ and $\log g$. The Li-dip can be seen at $T_\mathrm{eff}$$\approx 6\,500$ K and $\log g$$\approx 4.2$. At $\log g$$\approx 2.5$, red clump stars have a higher mean Li EW whilst horizontal branch stars have a lower mean Li EW compared to surrounding stars. The mean Li EW increases going up the red giant branch.

Figure 30. Histogram of the mean absolute errors for the neural networks. These were used as loss function during the training (blue) and validation (red) on seen and unseen spectra, respectively.

Figure 31. Example spectrum for a double-lined spectroscopic binary star (SB2) that is better fitted with our binary fitting algorithm.

Figure A1. Comparison of final GALAH DR4 stellar parameters (first column) against the initial parameters used in the allstar analysis (second column), estimates from the GALAH DR4 reduction pipeline (third column), Gaia DR3 (fourth column with $v_\mathrm{mic}$ based on the adjusted formula from Dutra-Ferreira et al. 2016), and GALAH DR3 (fifth column).

Figure B2. Covariance matrices for labels for Vesta (panel a) and Arcturus (panel b).

Table C1. Zero point estimates and corrections applied to the allstar measurements. We used Prša et al. (2016) as reference for Solar parameters and Grevesse et al. (2007), consistent with the marcs model atmosphere composition (Gustafsson et al. 2008), as reference for Solar abundances. For reference, we also show the combined rotational and macroturbulence as well as microturbulence velocities from Jofré et al. (2014). Values for Vesta indicate our uncorrected measurements for the Vesta spectrum.

Figure C1. zero-point estimates of elemental abundances for GALAH DR4. Each panel shows the comparison to literature (DR4 – literature) for Vesta (blue), Gaia FKG Benchmark Stars (orange), Stars with $\vert \mathrm{[Fe/H]} \vert \leq 0.1$ closer than $D_\varpi \,{\lt}\, 0.5\,\mathrm{kpc}$ (red), as well as stars that were also observed by APOGEE DR17 (purple).

Figure C3. Comparison of stars with available measurements in GALAH DR3 (left column), GALAH DR4 (middle column) and APOGEE DR17 (right) for O, Na, Al, Si, K, and Ca.

Figure C4. Continuation of Fig. C3 for Ti, V, Cr, Mn, Co, and Ce.

Figure C5. Collage of globular clusters in the $T_\mathrm{eff}$-$\log g$ space, coloured by stellar metallicity [Fe/H]. There are only minor trends between [Fe/H] and $T_\mathrm{eff}$, even for the horizontal branch stars in NGC 288, NGC 6656 (M22), and NGC 6121 (M4). NGC 5139 (${\unicode{x03C9}}$Cen) shows a significant range in [Fe/H]. RMS scatter and median metallicity uncertainties for each cluster are given in the lower right of each panel.

Figure C6. Parameter overview of stars with raised major quality flag flag_sp for allstar. Each panel shows the logarithmic density distribution of stars in the $T_\mathrm{eff}$ and $\log g$ plane with blue colourmaps. A PARSEC isochrone with $\mathrm{[M/H]}=0$ and $\tau = 4.5\,\mathrm{Gyr}$ is overplotted in orange and the same mass binary main-sequence (shifted from the single star one by $\Delta \log g = -0.3\,\mathrm{dex}$) is shown in red. Panel heads denote the bit mask and its description as well as how many times the flag was raised. We neglect distributions with no flag (0), for flags which have not been raised (8,9,11), and for which no results were available (15).