Hostname: page-component-89b8bd64d-ksp62 Total loading time: 0 Render date: 2026-05-07T20:06:03.826Z Has data issue: false hasContentIssue false

Galaxy 3D shape recovery using mixture density network

Published online by Cambridge University Press:  18 April 2024

Suk Yee Yong*
Affiliation:
CSIRO Space and Astronomy, Epping, NSW, Australia ARC Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), Canberra, Australia
K.E. Harborne
Affiliation:
ARC Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), Canberra, Australia International Centre for Radio Astronomy (ICRAR), M468, The University of Western Australia, Crawley, WA, Australia
Caroline Foster
Affiliation:
ARC Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), Canberra, Australia School of Physics, University of New South Wales, Sydney, NSW, Australia
Robert Bassett
Affiliation:
ARC Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), Canberra, Australia The Centre for Astrophysics & Supercomputing, Swinburne University of Technology, Hawthorn, VIC, Australia
Gregory B. Poole
Affiliation:
The Centre for Astrophysics & Supercomputing, Swinburne University of Technology, Hawthorn, VIC, Australia Astronomy Data and Computing Services (ADACS), Swinburne University of Technology, Hawthorn, VIC, Australia
Mitchell Cavanagh
Affiliation:
International Centre for Radio Astronomy (ICRAR), M468, The University of Western Australia, Crawley, WA, Australia
*
Corresponding author: Suk Yee Yong; Email: sukyee.yong@csiro.au
Rights & Permissions [Opens in a new window]

Abstract

Since the turn of the century, astronomers have been exploiting the rich information afforded by combining stellar kinematic maps and imaging in an attempt to recover the intrinsic, three-dimensional (3D) shape of a galaxy. A common intrinsic shape recovery method relies on an expected monotonic relationship between the intrinsic misalignment of the kinematic and morphological axes and the triaxiality parameter. Recent studies have, however, cast doubt about underlying assumptions relating shape and intrinsic kinematic misalignment. In this work, we aim to recover the 3D shape of individual galaxies using their projected stellar kinematic and flux distributions using a supervised machine learning approach with mixture density network (MDN). Using a mock dataset of the EAGLE hydrodynamical cosmological simulation, we train the MDN model for a carefully selected set of common kinematic and photometric parameters. Compared to previous methods, we demonstrate potential improvements achieved with the MDN model to retrieve the 3D galaxy shape along with the uncertainties, especially for prolate and triaxial systems. We make specific recommendations for recovering galaxy intrinsic shapes relevant for current and future integral field spectroscopic galaxy surveys.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press on behalf of Astronomical Society of Australia
Figure 0

Figure 1. Flowchart of the intrinsic shape determination pipeline. For each simulated galaxy, mock IFS images are constructed from which the kinematic and photometric features are extracted. Principal component analysis is applied for feature selection to choose a number of important features (those not selected are in grey). These are then fed to the mixture density network with 3 dense hidden layers of 128 nodes each. In the last layer, the MDN outputs a linear combination of Gaussian mixture parameters given by the weights $\alpha_{i}$, standard deviations $\unicode{x03C3}_{i}$, and means $\mu_{i}$ to predict the p and q distributions.

Figure 1

Figure 2. The distribution of galaxy shapes measured from the RefL0100N1504 box of the EAGLE simulation. In dark grey squares we show galaxies that have undergone a major or minor merger within the last 5 Gyr, which we class as ‘disturbed’. Light gray outline triangles show systems with significant bar structures. The histograms show the distribution of these barred and merger systems in p and q in light and dark grey respectively. Coloured circles represent galaxies that we have selected for our investigation. Each colour demonstrates the shape of the system, with spherical objects in yellow, prolate objects in orange, triaxial objects in pink and oblate objects in blue.

Figure 2

Figure 3. Demonstrating the equal sampling of $p-q$ space within the training set. Each point shows an individual EAGLE galaxy within the full sample. Coloured points show galaxies selected for the training and validation sets, while grey points demonstrate galaxies that have been left for the testing phase. The colour of each point denotes the number of times that galaxy is observed in order to keep that $p-q$ region equally sampled. The number in the corner of each $p-q$ region demonstrates the total number of observations within that region.

Figure 3

Figure 4. (Top left) Considering the raw distributions of tuneable observation properties controlled in each mock observation using a corner plot. The relationship between each property is demonstrated in purple. By ‘raw’ we mean the values modified per observation i.e. viewing angle, size of the PSF and projected distance to the object. (Bottom right) Considering the relative distributions of the important tuneable observation properties to ensure mock observations are uniform in the important ratios, as shown in the blue corner plot. These ratios, i.e. the size of the PSF relative to the size of the object, and the number of pixels within the measurement radius, are important for measuring observable kinematics to produce an unbiased training set. The approximately uniform distribution shown in blue demonstrates that our sample selection is not biasing our algorithm results.

Figure 4

Figure 5. Histograms showing how different intrinsic shapes of EAGLE galaxies within our training data populate each observable parameter. In each case, the spherical systems are shown in yellow, prolate systems in orange, triaxial systems in purple and oblate systems in blue. The overall height of the bar shows the distribution of each kinematic parameter within the full training set. The coloured regions then demonstrate the percentage of each bar that is made up of each intrinstic shape. Starred (*) axis labels have been divided into equally-sized log10 bins to more clearly delineate between the groups, though bar labels are shown as the raw values for clarity. This plot demonstrates that, in none of the single measurements can we cleanly distinguish between the intrinsic shapes directly. This justifies the machine learning approach.

Figure 5

Table 1. List of investigated galaxy parameters in column 2 and their notation in column 1. A short description in column 3 and relevant sections where each parameter is described in the text are listed in column 4. Parameters that are selected as input features to the mixture density network (MDN) indicated in column 5.

Figure 6

Figure 6. Feature selection of galaxy kinematic parameters (see Table 1 for notation) using principal component analysis. Absolute eigenvalues of the associated principal component (PC) are labelled and coloured, where a value of 1 indicates the strongest possible contribution with darker gradient.

Figure 7

Figure 7. Predicted against actual p and q for each galaxy shape using mixture density network (MDN) for the test data set. The black crosses represent the average prediction, while circles represent projections of individual galaxies colour coded by the standard deviation from the MDN output. The darker the gradient, the more certain. The prediction error is evaluated by the root mean squared error (RMSE), where lower values represent better agreement. For reference, the identity is shown as a grey dashed line. Although there is a large variation in the standard deviation of individual prediction, in most of the systems, the average of the predictions are close to the actual value.

Figure 8

Figure 8. Distributions of p and q recovered from the mixture density network model for each galaxy shape compared to the actual for the test data set.

Figure 9

Figure 9. The recovered $p-q$ shape probability density function within one standard deviation ($\unicode{x03C3}$) from the mixture density network, showing examples of ‘informative’ (left column) and ‘uninformative’ (right column) predictions for each galaxy shape. ‘Informative’ fits are those with low standard deviation of $\unicode{x03C3} \leq 0.24$, while vice versa for ‘uninformative’ fits. The predicted shape is shown on the top right of each panel. The vertical dashed line shows the actual value.

Figure 10

Figure 10. Predicted p and q for each galaxy shape using mixture density network for all results (above) and for objects with a certainty of $\unicode{x03C3} \leq 0.24$ (below). Points are coloured by their true 3D shape class. The total number of predicted objects is indicated by the number in the round bracket in each shape region, followed by the number of true objects of that shape. The negative predicted values (NPV) and positive predicted values (PPV) are shown as a function of galaxy shape within the legend of each plot. We see the expected distributions broadly approaching the true as we select more certain predictions, though uncertain oblate systems (shown in blue) contaminate every other class in both plots. The reasons for this are discussed in Section 5.

Figure 11

Table 2. The negative predicted values (NPV) and positive predicted values (PPV) for each underlying 3D shape given as percentages, where a higher value is better. In the left column of each metric, we show the values when all returned shapes are considered. In the right column of each metric, we consider only results with uncertainty less than 0.24.

Figure 12

Figure A1. Predicted against actual p and q for each galaxy shape using mixture density network (MDN) without performing feature selection for the test data set. The black crosses represent the average prediction, while circles represent projections of individual galaxies colour coded by the standard deviation from the MDN output. The darker the gradient, the more certain. The prediction error is evaluated by the root mean squared error (RMSE), where lower values represent better agreement. For reference, the identity is shown as a grey dashed line. Compared to Fig. 7, the RMSE is marginally worse for all systems except the p values for triaxial when no feature selection is performed.

Figure 13

Figure B1. Trends in positive predicted values (PPV) and negative predicted values (NPV) with varying standard deviations from MDN output ($\unicode{x03C3}$) for each galaxy shape. In each panel, the colour of the points shows the constant bin in the p or q value not on the axis, i.e. $\unicode{x03C3}(p_{\mathrm{pred}})$ in the plot of $\unicode{x03C3}(q_{\mathrm{pred}})$ and vice versa. The darker the gradient, the narrower the $\unicode{x03C3}$ and more certain. The vertical grey dashed line shows the mean of the standard deviation at $\unicode{x03C3}=0.24$.