1. Introduction
‘The Virgo Cluster survey has uncovered a new class of dwarfs that are of huge size (10 000 pc in diameter in the extreme) and of very low surface brightness of about 25 mag/arcsec
$^2$
(B) at the centre’. This is a quote from Sandage & Binggeli (Reference Sandage and Binggeli1984) in their ground-breaking study of dwarf galaxies in the Virgo cluster. They identified around two dozen of this new class of kpc-sized, low surface brightness (LSB) galaxies. They went on to say, ‘Most are in the central region of the cluster, suggesting that they are indeed members. No galaxies such as these low surface brightness, large dwarfs are known yet in the local neighbourhood’. We now know that many such galaxies are indeed cluster members, and that local analogues also exist.
For almost twenty years, these faint galaxies received little attention, and were generally studied along with other galaxies of similar stellar mass (e.g. Conselice, Gallagher & Wyse Reference Conselice2003). Based on their smooth appearance and large sizes in the Perseus cluster, Penny et al. (Reference Penny, Conselice, De Rijcke and Held2009) argued that such galaxies must be dark matter dominated to prevent them from being disrupted by the cluster potential. In some cases, the large size of these galaxies caused their explicit exclusion from catalogues of low surface brightness cluster members. For example, the study of Adami et al. (Reference Adami2006) searched the Coma cluster for ‘fainter objects… with a radius larger than 0.6’Footnote
a
… and with an R central surface brightness fainter than
$\sim24$
mag arcsec
$^{-2}$
’. Here they applied a cut on the maximum radius of an LSB galaxy in their catalogue being 2.5’, which explicitly excludes any LSB galaxies larger than
$\sim$
$1.2$
kpc in radius, such as those uncovered by Sandage & Binggeli (Reference Sandage and Binggeli1984).
Later on, the interest in these large-sized LSB dwarf galaxies was rejuvenated by the discovery of Forty-seven Milky Way-sized, Extremely Diffuse Galaxies in the Coma Cluster by van Dokkum et al. (Reference van Dokkum, Abraham, Merritt, Zhang, Geha and Conroy2015a) using the new Dragonfly Telephoto Array. It was unexpected to find that galaxies resembling those found by Sandage & Binggeli (Reference Sandage and Binggeli1984) were so widespread in the Universe, with van Dokkum et al. noting only a handful of examples from previous studies (Impey, Bothun, & Malin Reference Impey, Bothun and Malin1988; Bothun, Impey, & Malin Reference Bothun, Impey and Malin1991; Dalcanton et al. Reference Dalcanton, Spergel, Gunn, Schmidt and Schneider1997). As stated in the title of their article, these LSB galaxies have sizes (in terms of their semi-major effective radii,
$R_{e}$
) comparable to the disk of the Milky Way. They also tended to be red, roundish and featureless. A working definition of
$R_{e}$
$\ge$
1.5 kpc and central surface brightness
$\mu_{g, 0}$
$\ge$
24 mag/arcsec
$^2$
in the g-band was established to broadly select the newly discovered galaxies. Both of these criteria are somewhat arbitrary and, as such, these LSB galaxies are not distinct from the general galaxy population in this parameter space (Li et al. Reference Li2023a). These galaxies were dubbed ‘ultra-diffuse galaxies’ (UDGs), and it was speculated that they were dark matter-dominated. Some were proposed to be a ‘failed galaxy’, existing in a region of stellar mass – halo mass space where galaxies were previously not expected to reside. Spectroscopic follow-up soon occurred, confirming their membership of the Coma cluster (van Dokkum et al. Reference van Dokkum2015b) and their dark matter domination. Hubble Space Telescope (HST) imaging then revealed the presence of rich globular cluster (GC) systems in many of them (van Dokkum et al. Reference van Dokkum2017).
It has now been 10 yr since the resurgence in interest towards UDGs and LSB dwarfs. While hundreds of papers have been published on these galaxies in the last decade, there has yet to be a review. We undertake that task. In this review, our primary focus will be on the old and passive UDGs, i.e. galaxies whose stars formed some time ago and are not experiencing current star formation. Unless such galaxies are observed during a quiescent period of cyclic star formation, or accrete gas at later times and recycle (e.g. Ascencio et al. Reference Ascencio, Cooper, Zaritsky, Donnerstein, Khim and Baxter2025), they are expected to have a relatively stable effective radius and surface brightness, and thus remain a ‘settled’ UDG. On the contrary, star-forming UDGs, despite being very interesting in their own right (e.g. Leisman et al. Reference Leisman2017; Mancera Piña et al. Reference Mancera Piña2019; Kong et al. Reference Kong, Kaplinghat, Yu, Fraternali and Piña2022; Karunakaran et al. Reference Karunakaran, Motiwala, Spekkens, Zaritsky, Donnerstein and Dey2024), represent a state that may be transitory given that they often contain HI gas and possible future star formation. Our focus on old, passive UDGs implies an emphasis on UDGs located in high-density environments. As such, they tend to be red and gas-free. Some of them also have the interesting properties of hosting rich GC systems and perhaps occupying massive dark matter halos. Despite a decade of progress on UDGs, much remains to be explained. For brevity, we focus on the current state of the field and omit some of the historical context that has brought us to our current understanding.
In Section 2 we summarise UDGs’ general photometric properties, including how they are defined, where they are located and their general morphological properties. In Section 3 we summarise the proposed formation scenarios of UDGs. We focus on providing short descriptions of each, along with a summary of testable predictions for each scenario. We then review the progress made studying the internal properties of UDG kinematics and dark matter content in Section 4 and their internal stellar populations in Section 5. UDG globular cluster systems are reviewed in Section 6. In Section 7 we provide a brief comparison of UDGs to classical dwarf galaxies. We then summarise some possible future directions of UDG research in Section 8 . Conclusions are presented in Section 9.
2. General photometric properties
2.1. UDGs in a new region of parameter space
The importance of van Dokkum et al. (Reference van Dokkum, Abraham, Merritt, Zhang, Geha and Conroy2015a)’s discovery was not necessarily that large, low surface brightness dwarfs existed, but instead that they existed in such large numbers. To illustrate their discovery, Figure 1 shows galaxies detected in SDSS with redshifts between
$0.01\lt z\lt0.05$
from Simard et al. (Reference Simard, Mendel, Patton, Ellison and McConnachie2011) along with a large sample of pressure-supported systems from Brodie et al. (Reference Brodie, Romanowsky, Strader and Forbes2011) that spans everything from compact globular clusters to giant ellipticals at the centre of galaxy clusters. UDGs from van Dokkum et al. (Reference van Dokkum, Abraham, Merritt, Zhang, Geha and Conroy2015a) are also included along with a shading for the original UDG definition assuming an approximate galaxy colour of
$V=g-0.3$
. While a handful of galaxies had been detected in SDSS in this region of parameter space, the addition of UDGs greatly increases the known population of galaxies with large half-light radii and
$M_V\approx-15$
. In addition to noting their large sizes in comparison to traditional dwarf ellipticals, it is useful for the remainder of this work to note the positioning of GCs and ultra-compact dwarfs (UCDs) in the lower centre of the plot. Many UDGs host significant populations of GCs and, in some cases, even UCDs. Understanding their connection to these compact systems is the subject of significant (ongoing) work and is described here in Section 6.
Half-light radius vs. V-band magnitude for a range of (primarily elliptical) systems. Blue points are from (and updates Brodie et al. Reference Brodie, Romanowsky, Strader and Forbes2011), grey points represent galaxies in SDSS with
$0.01\lt z\lt0.05$
from Simard et al. (Reference Simard, Mendel, Patton, Ellison and McConnachie2011) and red points are the UDGs first detected in the Coma Cluster by van Dokkum et al. (Reference van Dokkum, Abraham, Merritt, Zhang, Geha and Conroy2015a). The regions approximately corresponding to globular clusters (GC), dwarf spheroidals (dSph), ultra-compact dwarfs (UCD), dwarf ellipticals (dE), normal ellipticals (E) and giant ellipticals (gE) are as labelled. The region meeting the original UDG definition is shaded red and labelled appropriately. UDGs reside in a region of parameter space previously sparsely populated by surveys such as SDSS (highlighted in red).

It is also of note that despite the initial UDG definition being coined for galaxies within the Coma cluster, which are strongly biased to being quiescent (see further Section 2.2), galaxies meeting the UDG definition have now been found in all environments. In particular, UDGs have been found in significant quantities in the field (e.g. Leisman et al. Reference Leisman2017) where they may help to explain HI detections previously thought to reside in dark halos (i.e. those that did not form stars). UDGs found in the field are biased towards being bluer and more irregular in morphology. A significant fraction of field UDGs are found to be quiescent (Prole et al. Reference Prole, van der Burg, Hilker and Davies2019b), however the observed quiescent fraction is generally larger than, and in some tension with, simulations of dwarf galaxy formation (Sales, Wetzel, & Fattahi Reference Sales, Wetzel and Fattahi2022). We provide examples of two UDGs in Figure 2 along with a dwarf elliptical to provide scale for these galaxies. In particular, AGC242019 is a good example of an irregular, HI-rich, blue, star-forming field UDG, while NGC5846_UDG1 (a.k.a. MATLAS-2019; N5846-156) is a good example of a quiescent, smooth morphology, GC-rich group UDG. Many of the compact sources visible on NGC5846_UDG1 are indeed spectroscopically confirmed GCs (Müller et al. Reference Müller2020a; Haacke et al. Reference Haacke2025). The three galaxies have been chosen for being located at approximately similar distances, to facilitate a fair visual comparison.
Dark Energy Camera Legacy Survey (DECaLS) images of three galaxies chosen for their similar distance as indicated in the top left of each cutout. The brightness of the images has been increased by a factor of 2 to aid the visibility of the UDGs. Left: A prototypical dwarf elliptical, MATLAS-49. Centre: An HI-bearing field UDG, AGC242019. Right: A quiescent, GC-rich UDG, NGC5846_UDG1 (MATLAS-2019). UDGs are noticeably larger than their dwarf elliptical (and dwarf irregular, although not shown here) counterparts. The UDG definition takes in a range of morphologies from blue, star-forming field UDGs to red, quenched group/cluster UDGs.

2.2. Alternative UDG definitions
It is worth noting that, following the initial definition of what comprises a UDG by van Dokkum et al. (Reference van Dokkum, Abraham, Merritt, Zhang, Geha and Conroy2015a), many subsequent authors have tweaked the definition for their own work. In general, alterations to the definition follow five main camps:
-
1. Switching from a central surface brightness to one measured at, or the average within
$R_{e}$
. Similarly, some simulations use the mean stellar surface density within 1
$R_{e}$
. -
2. Using a brighter surface brightness cut.
-
3. Switching the filter band from the g-band to another e.g.
$V-$
or r-band. -
4. Lowering the size cut from 1.5 kpc to e.g. 1 kpc.
-
5. Adding additional requirements, e.g. Sérsic n criteria or colour information.
In the simulation work of Van Nest et al. (Reference Van Nest, Munshi, Wright, Tremmel, Brooks, Nagai and Quinn2022), the difficulties of comparing UDG samples defined via alternative definitions were explored. In particular, it was shown that more restrictive definitions as to what comprises a UDG lead both to fewer galaxies being selected as a UDG and those UDGs having a more preferred formation pathway compared to dwarf galaxies of a similar mass that do not meet the UDG definition. More restrictive definitions can also lead to galaxy orientation playing a role in selecting UDGs. In particular, as the definition is surface brightness-based, there is an orientation dependence as to whether or not a galaxy is defined as a UDG. That is, many UDGs that are viewed face-on will be brighter in surface brightness if viewed edge-on and would not be classified as a UDG (Rong et al. Reference Rong, Mancera Piña, Tempel, Puzia and De Rijcke2020a). This bias will also present as a bias in the ellipticity (axial ratio) of UDGs to be lower (higher) than those of the general dwarf population at the same stellar mass. There is likely an environmental dependence of this bias, where UDGs are diskier in the field than they are in clusters. For a full exploration of different UDG definitions and their limitations, see Van Nest et al. (Reference Van Nest, Munshi, Wright, Tremmel, Brooks, Nagai and Quinn2022).
Furthermore, it is well known that the UDG definition is generally biased against galaxies with young (bluer) stellar populations (Li et al. Reference Li2023a,b). For many galaxies with young stellar populations, their surface brightness will be transient, and as their stars evolve and die, they will fade into the UDG regime quickly (Trujillo et al. Reference Trujillo, Roman, Filho and Sánchez Almeida2017; Bellazzini et al. Reference Bellazzini, Belokurov, Magrini, Fraternali, Testa, Beccari, Marchetti and Carini2017; Román et al. Reference Román, Jones, Montes, Verdes-Montenegro, Garrido and Sánchez2021. Put another way, these blue galaxies exhibit UDG-like stellar surface densities, despite not meeting the surface brightness definition.
Li et al. (Reference Li2023a,b) also pointed out that, due to the evolution in the size – stellar mass relationship with stellar mass, the initial UDG definition does not necessarily do a good job at selecting the largest galaxies at fixed stellar mass. Indeed, there is a clear change in the relationship of
$R_{e}$
with
$M_{V}$
in Figure 1. While we have shaded the UDG definition down to extremely low
$M_V$
, the size cut on the definition means galaxies of lower stellar mass than those initially discovered will also likely have to have extremely low surface brightnesses to make the size cut and are thus not readily observable beyond the Local Group. They propose it may be useful to study ‘ultra-puffy dwarfs’, i.e. those that are outliers in the size-stellar mass relationship Li et al. (Reference Li2023a,b). Other studies have used this approach (e.g. Lim et al. Reference Lim, Peng, Cŏté, Sales, den Brok, Blakeslee and Guhathakurta2018), and it is a common way to select ‘UDGs’ in simulations with poor reproduction of dwarf galaxy sizes in general.
Finally, it is worth noting that the initial UDG definition left a small gap in parameter space between the galaxies usually studied in SDSS that tend to be brighter than the night sky (
$\mu_{g, 0}\lt23$
mag arcsec
$^{-2}$
) and UDGs themselves (
$\mu_{g,0}\gt24$
mag arcsec
$^{-2}$
). Rather than altering the UDG definition to include these galaxies, Forbes & Gannon (Reference Forbes and Gannon2024) have suggested these galaxies be dubbed NUDGEs (nearly UDGs) as they nudge up against the UDG definition. NUDGEs also include galaxies of slightly smaller size than UDGs but with a similar surface brightness (e.g. Ferré-Mateu et al. Reference Ferré-Mateu, Gannon, Forbes, Buzzo, Romanowsky and Brodie2023). For clarity in this work, we will endeavour to refer to UDGs meeting the original UDG definition as such, and will refer to NUDGEs when discussing galaxies that are close to, but do not strictly meet, the UDG definition.
2.3. UDGs numbers vs. environment
The most basic questions to answer for UDGs are simply: How many of them are there? And where are they all? These questions are important for two main reasons. (i) If UDGs do not represent a significant fraction of the galaxy population, much of their formation could be ascribed to them being many sigma outliers in established and well-explored processes of galaxy formation. While they would still be worthy of study, their existence would perhaps not represent a large challenge for galaxy formation. (ii) Knowing where UDGs are found is key to understanding the role that their environment plays in their formation. If UDGs are found only in massive galaxy clusters, it suggests that environmental processes play a strong role in their formation and evolution. Conversely, if UDGs are only found in the field, they must form via internal processes, and they may be efficiently destroyed once they enter a dense environment.
In Figure 3 we reproduce Figure 7 of Carleton et al. (Reference Carleton2023), which shows the number of UDGs detected as a function of total environmental halo mass (a similar, recent plot can be found in Karunakaran & Zaritsky Reference Karunakaran and Zaritsky2023). As shown in Figure 3, UDGs have been detected in all environments from the field (e.g. Leisman et al. Reference Leisman2017) to the most massive clusters in the nearby Universe (e.g. Janssens et al. Reference Janssens, Abraham, Brodie, Forbes and Romanowsky2019). There still exists some debate around the exact slope of the relationship between the number of UDGs (
$N_\mathrm{UDG}$
) and the halo mass (
$M_{200}$
); however, most studies agree that its slope is approximately 1. Slopes less than unity would imply that dense environments do not promote UDG formation and likely even play a role in destroying UDGs as they fall in from the field. Slopes greater than unity imply the opposite, that the external processes operating in groups/clusters play an important role. As such, current results indicate that the environment may not play a strong role in UDG formation/destruction or, at the very least, any increase in UDG production caused by dense environments is being roughly balanced by the UDG destruction in them.
Number of UDGs vs. total environmental halo mass, reproduced from (Carleton et al. Reference Carleton2023, their Figure 7). See also Karunakaran & Zaritsky (Reference Karunakaran and Zaritsky2023) and Goto et al. (Reference Goto, Zaritsky, Karunakaran, Donnerstein and Sand2023) alternative plots of this relationship published at a similar time, but focusing on lower density environments. Data are included from the studies of Impey et al. (Reference Impey, Bothun and Malin1988), Muñoz et al. (Reference Muñoz2015), van der Burg et al. (Reference van der Burg, Muzzin and Hoekstra2016), van der Burg et al. (Reference van der Burg2017), Yagi et al. (Reference Yagi, Koda, Komiyama and Yamanoi2016), Trujillo et al. (Reference Trujillo, Roman, Filho and Sánchez Almeida2017), Román & Trujillo (Reference Román and Trujillo2017b), Mancera Piña et al. (Reference Mancera Piña, Peletier, Aguerri, Venhola, Trager and Choque Challapa2018), Janssens et al. (Reference Janssens, Abraham, Brodie, Forbes and Romanowsky2019), Bachmann et al. (Reference Bachmann, van der Burg, Fensch, Brammer and Muzzin2021) per the legend. Carleton et al. (Reference Carleton2023)’s work on El Gordo UDGs is indicated as “this work" in the legend. The dashed line represents the relationship derived in van der Burg et al. (Reference van der Burg2017) of
$N_\mathrm{UDG} \propto M_{200}^{0.93\pm0.16}$
. The near linearity of this relationship indicates that the environment may not play a strong role in the net creation/destruction of UDGs.

Perhaps the greatest problem with our current understanding of UDG formation’s relationship with the environment is simply the different UDG definitions being used by the many authors plotted in Figure 3. Having different definitions for what a UDG is when calculating their number will clearly affect the relative positions of different studies to one another and thus the slope of the relationship between
$N_\mathrm{UDG}$
and
$M_{200}$
. A further consideration is the correction to UDG counts out to the virial radius. Imaging usually covers only a fraction of the cluster, creating the need to correct the UDGs detected in the imaged area to estimate the total number in the cluster. Different studies have employed varying approaches to make this correction. Without a systematic study of UDGs in different environments using the same definition and the same corrections, it is not possible to quantify the effects these issues have on our current ability to determine the slope of the
$N_\mathrm{UDG}-M_{200}$
relation in different environments.
The answer to the question regarding what fraction of the general galaxy population are UDGs appears to be approximately 5% of all galaxies with stellar mass
$\gtrapprox 10^7\,\mathrm{M}_\odot$
across all environments (Jones et al. Reference Jones, Papastergis, Pandya, Leisman, Romanowsky, Yung, Somerville and Adams2018). For example, Li et al. (Reference Li2023a) find that
$7\pm2$
% of the satellites around systems that are analogous to the Milky Way are UDGs. A similar expectation from simulations is seen in Newton et al. (Reference Newton2023). Due to the linear slope found in Figure 3, UDGs being approximately 5% of all galaxies remains true in the most massive clusters. As a confirmation, the work of Janssens et al. (Reference Janssens, Abraham, Brodie, Forbes and Romanowsky2019) found 1351
$^{+387}_{-379}$
UDGs to reside in the Hubble Frontier Field galaxy cluster Abell 2744. With the JWST UNCOVER survey recently discovering
$\sim50\,000$
sources of near-infrared light in the vicinity of that cluster (Weaver et al. Reference Weaver2024), these UDGs again represent a significant galaxy population within the cluster (
$\sim$
3%). Thus, any works based on galaxy mass/luminosity functions must take into account UDGs. A consequence of the abundant nature of UDGs is that they outnumber L
$^\star$
galaxies of similar half-light radius (Danieli & van Dokkum Reference Danieli and van Dokkum2019).
2.4. The morphology of UDGs
UDGs exhibit a wide range of morphologies, but the red, quiescent UDGs reviewed herein tend to be largely featureless beyond hosting GCs and nuclear star clusters. There still exists some debate if they are oblate or prolate in nature (see Burkert Reference Burkert2017 and Rong et al. Reference Rong2020b). Their surface brightness profiles are typically exponential disk-like, with mean Sérsic indices of
$n\sim0.8$
(Mowla et al. Reference Mowla, van Dokkum, Merritt, Abraham, Yagi and Koda2017). In some cases, disk-like profiles are used as a selection criterion. Evidence for traditional features such as bars, arms or bulges are rare. Tidal features are seen in some cases, but the majority of UDGs appear to be relatively undisturbed. For example, in the recent visual analysis of 7 000 UDG candidates in the SMUDGes survey, some 93% were classified as ‘undisturbed UDG candidates’ (Zaritsky, Donnerstein & Khim Reference Zaritsky, Donnerstein and Khim2025). Earlier work by Mowla et al. (Reference Mowla, van Dokkum, Merritt, Abraham, Yagi and Koda2017), stacking deep images of 287 UDGs, found no evidence of tidal features out to
$\sim$
4
$R_{e}$
. This suggests that if they did form by merger/interaction, then it was sufficiently long ago so that signatures are no longer seen.
The MATLAS survey found around 1/3 of UDGs to host a nuclear star cluster (Marleau et al. Reference Marleau2021; Poulain et al. Reference Poulain2025a). For the SMUDGES survey, of slightly shallower depth, the number was more like 1/10 (Lambert et al. Reference Lambert, Khim, Zaritsky and Donnerstein2024). We emphasise that these fractions are comparable to those for the classical dwarfs in the same survey, but slightly lower than the fractions observed in high-density environments such as the Coma cluster. For the stellar masses of UDGs, it is expected that nuclear star clusters are the result of GC infall and merging via dynamical friction (Fahrion et al. Reference Fahrion2021). Janssens et al. (Reference Janssens, Abraham, Brodie, Forbes and Romanowsky2019) presented evidence from the Frontier Field clusters that UCDs may be the nuclear remnants of stripped UDGs that contained a nuclear star cluster. This concept was further supported by the diffuse envelopes seen around some Virgo cluster UCDs by Wang et al. (Reference Wang2023).
3. UDG formation scenarios and their simulations
Reproducing the observed population of dwarf galaxies in simulations within the
$\Lambda$
CDM paradigm is an extremely difficult problem due to the necessity of including sufficient physics (e.g. feedback effects/ISM physics), sufficient volume (e.g. for environmental effects) and sufficient resolution (e.g. to resolve the dwarfs) to accurately model their properties (Sales et al. Reference Sales, Wetzel and Fattahi2022). Before the discovery of UDGs there was an established problem with simulations whereby they were producing large numbers of dwarf galaxies that were generally larger than those being observed (see e.g. Sales et al. Reference Sales, Wetzel and Fattahi2022). The discovery of UDGs showed that some of these galaxies may exist. Their discovery, therefore, represented a post-diction (sic.) of simulations. Key to the simulation of UDGs, is the need to simultaneously create normal dwarfs and to demonstrate what sets UDGs apart in their formation pathway(s). Generally, the proposed formation mechanisms in simulations are split into those that occur in isolation within a halo (i.e. internal formation) and those that occur due to the influence of external halos (i.e. external formation). It is also established that galaxies may not need to be transformed into a UDG by external or internal processes, but may instead form intrinsically as a UDG, i.e. born as a UDG.
We next discuss some of the major attempts to elucidate UDG formation pathways, and include some testable predictions expected in each scenario.
3.1. Internal formation
3.1.1. High halo spin
Description: In this formation scenario, the UDG forms as a normal dwarf galaxy, although its dark matter halo has above average spin. This additional angular momentum within the dark matter halo causes it to be more diffuse, which, in turn, results in a more diffuse galaxy (i.e. the UDG). For example, Jiang et al. (Reference Jiang, Dekel, Freundlich, Romanowsky, Dutton, Macciò and Di Cintio2019a) found that a galaxy’s half-light radius is proportional to its halo spin along with its virial radius. It is of note that high halo spin may also be correlated with a bias to later halo assembly. This scenario is primarily simulated/modelled in Amorisco & Loeb (Reference Amorisco and Loeb2016), Rong et al. (Reference Rong, Guo, Gao, Liao, Xie, Puzia, Sun and Pan2017) and Liao et al. (Reference Liao2019).
Testable Predictions: As noted in Amorisco & Loeb (Reference Amorisco and Loeb2016), the halo spin distribution is not considered strongly environmentally dependent, so this formation scenario is expected to function in all environments. For instance, Liao et al. (Reference Liao2019) found that the number of UDGs follows an approximately linear power law with environmental mass, while Amorisco & Loeb (Reference Amorisco and Loeb2016) provided a prediction for the number of UDGs expected as a function of their luminosity. It is unclear if the above-average angular momentum of the dark matter halo also translates into above-average angular momentum within the galaxies. In general, galaxy halo spin has not been found to correlate strongly with galaxy angular momentum within cosmological simulations (Jiang et al. Reference Jiang2019b). However, Rong et al. (Reference Rong, Guo, Gao, Liao, Xie, Puzia, Sun and Pan2017) predicted that in the field, where environmental effects are minimal, UDGs should be on average redder than classical dwarfs. This is due to their more extended nature, causing their gas densities to drop below the threshold for star formation earlier, resulting in on average older stellar populations. Finally, Rong et al. (Reference Rong, Guo, Gao, Liao, Xie, Puzia, Sun and Pan2017) predicted that UDGs forming via this pathway should have, on average, later cluster infall times than their dwarf counterparts (median population infall times of 9 Gyr vs 5 Gyr in time since the Big Bang).
3.1.2. Supernova feedback
Description: In this formation scenario, UDGs form as a classical dwarf galaxy, but intense, repeated, bursty episodes of star formation (and thus star formation feedback) help to redistribute mass away from the centre of the halo. This will both core the dark matter halo and cause an increase in the size of the dwarf galaxy it hosts, lowering its stellar surface density and causing it to become a UDG. It is unclear what causes the stronger stellar feedback needed to puff up a UDG over the regular feedback experienced by a normal classical dwarf of similar stellar mass. This scenario is primarily simulated in Di Cintio et al. (Reference Di Cintio, Brook, Dutton, Macciò, Obreja and Dekel2017) and Chan et al. (Reference Chan, Kereš, Wetzel, Hopkins, Faucher-Giguère, El-Badry, Garrison-Kimmel and Boylan-Kolchin2018), with both simulating UDGs forming in isolation.
Testable Predictions: Due to the need for repeated episodes of star formation, and the fact that ejected gas will have re-accretion times significantly less than a Hubble time, the star formation histories of UDGs forming in this way are expected to be rather bursty. This may result in a (temporarily) quenched UDG surrounded by ejected gas. These UDGs are also expected to have cored dark matter halos. The maximum halo mass of UDGs forming via this mechanism is expected to be roughly
$10^{11}~\mathrm{M}_\odot$
(Di Cintio et al. Reference Di Cintio, Brook, Dutton, Macciò, Obreja and Dekel2017). The higher the half-light radius, the higher the fraction of young stars are expected to be found within isolated UDGs forming via this mechanism (Di Cintio et al. Reference Di Cintio, Brook, Dutton, Macciò, Obreja and Dekel2017). A possible mix of radial age and metallicity gradients from flat to negative is expected for these galaxies (Chan et al. Reference Chan, Kereš, Wetzel, Hopkins, Faucher-Giguère, El-Badry, Garrison-Kimmel and Boylan-Kolchin2018; Cardona-Barrero et al. Reference Cardona-Barrero, Di Cintio, Battaglia, Macciò and Taibi2023).
3.1.3. High gas spin
Description: In this formation scenario, the proto-UDG exhibits higher than average energy within its star-forming gas disk throughout its life. As the galaxy forms, this gas disk is therefore more extended and, due to the extra angular momentum, takes longer timescales to collapse, causing less efficient star formation. The natural effect of this is a larger, more diffuse dwarf galaxy (i.e. a UDG). It is noteworthy that it is unclear if this is true for the entire gas disk or just for the star-forming portion of the gas. This scenario is primarily simulated in Zheng et al. (Reference Zheng, Liao, Gao and Jiang2025).
Testable Predictions: A clear prediction of this scenario is that the gas within these dark matter halos should exhibit both a larger angular momentum and a greater extent than normal dwarf galaxies (Zheng et al. Reference Zheng, Liao, Gao and Jiang2025).
3.2. External formation
3.2.1. Tidal heating/puffing
Description: In this formation scenario, energy is injected into the stellar body of a regular dwarf galaxy through the gravitational potential via (likely repeated) close passages with more massive halos, increasing its size and lowering its surface brightness. It is possible this formation scenario represents an extension of the ‘galaxy harassment’ model previously proposed to turn spiral galaxies into dwarf elliptical/spheroidal population within clusters (Moore et al. Reference Moore, Katz, Lake, Dressler and Oemler1996). It is likely related to the formation scenario described below for UDGs forming via tidal stripping. It is unclear if this formation scenario is more efficient in cored dark matter halos given the differing results from Carleton et al. (Reference Carleton, Errani, Cooper, Kaplinghat, Peñarrubia and Guo2019) and Sales et al. (Reference Sales, Navarro, Peñafiel, Peng, Lim and Hernquist2020). Further, it is clear that this scenario likely applies to many UDGs that have fallen into cluster/group environments from the field (see e.g. Jiang et al. Reference Jiang, Dekel, Freundlich, Romanowsky, Dutton, Macciò and Di Cintio2019a). This scenario is primarily simulated in Carleton et al. (Reference Carleton, Errani, Cooper, Kaplinghat, Peñarrubia and Guo2019), Jiang et al. (Reference Jiang, Dekel, Freundlich, Romanowsky, Dutton, Macciò and Di Cintio2019a), Liao et al. (Reference Liao2019) and Sales et al. (Reference Sales, Navarro, Peñafiel, Peng, Lim and Hernquist2020).
Testable Predictions: This scenario requires a more massive halo in order to work, thus UDGs are unlikely to form via this scenario in the field. Due to the repeated passages required to make the largest UDGs (
$R_{e}\gt3$
kpc), these are expected to host old stellar populations (
$89\% \gt4$
Gyr; Carleton et al. Reference Carleton, Errani, Cooper, Kaplinghat, Peñarrubia and Guo2019). UDGs should show signs of dark matter loss in their outskirts, concentrating the dark matter distribution in the centre. UDG abundances should be correlated with cluster age and drop off with redshift (Carleton et al. Reference Carleton, Errani, Cooper, Kaplinghat, Peñarrubia and Guo2019). UDGs formed via tides in this manner should experience significantly earlier infall times (
$\sim$
9.5 Gyr ago) to the cluster than those that have formed as UDGs prior to cluster infall (
$\sim$
5.5 Gyr ago).
3.2.2. Tidally stripped galaxies
Description: In this formation scenario, the UDG represents the remnant of a more massive galaxy that has experienced significant tidal stripping of stars to reduce its stellar mass into the dwarf regime. More moderate tidal stripping is possible to aid in the formation of a UDG, however, other mechanisms are likely needed. This scenario is primarily simulated in Sales et al. (Reference Sales, Navarro, Peñafiel, Peng, Lim and Hernquist2020) using the disruption models of Errani et al. (Reference Errani, Peñarrubia and Tormen2015). This is also simulated as part of the work of Carleton et al. (Reference Carleton, Errani, Cooper, Kaplinghat, Peñarrubia and Guo2019).
Testable Predictions: They should be metal-rich, reflective of their origin as a more massive and now disrupted galaxy (Sales et al. Reference Sales, Navarro, Peñafiel, Peng, Lim and Hernquist2020; Benavides et al. Reference Benavides, Sales, Abadi, Vogelsberger, Marinacci and Hernquist2024). They should also be relatively dark matter-free, as any tidal interaction able to strip significant stellar mass from a galaxy will also have stripped much of their dark matter halos (Sales et al. Reference Sales, Navarro, Peñafiel, Peng, Lim and Hernquist2020). Finally, tidal features are expected for these galaxies, indicative of their tidal stripping. Carleton et al. (Reference Carleton, Errani, Cooper, Kaplinghat, Peñarrubia and Guo2019) found that S-shaped tidal features are expected to fade within
$\sim$
$0.7$
Gyr of creation and that
$\sim$
13% of their UDGs have experienced a pericentric passage within this time. Thus, approximately 13% of UDGs forming via tidal stripping should display tidal features in clusters (Carleton et al. Reference Carleton, Errani, Cooper, Kaplinghat, Peñarrubia and Guo2019).
3.2.3. Galaxy mergers
Description: In this formation scenario, the UDG forms via a major merger of two smaller dwarf galaxies. In the scenario described by Wright et al. (Reference Wright, Tremmel, Brooks, Munshi, Nagai, Sharma and Quinn2021) the UDG forms via a high redshift merger at
$z\gt1$
. These mergers temporarily redistribute star formation to the outskirts of the galaxy, causing its large size. As the galaxy ages, its surface brightness fades due to passive stellar evolution.
Testable Predictions: As the spin up caused by the merger occurs at high-z, many of the distinguishing features of this scenario are temporary and are not visible at
$z=0$
(Wright et al. Reference Wright, Tremmel, Brooks, Munshi, Nagai, Sharma and Quinn2021). However it is an expectation that
$\sim$
20% of dwarf galaxies in the stellar mass range of
$10^7$
–
$10^9\,\mathrm{M}_\odot$
form via this pathway. They do so in exclusively dwarf like halos of mass
$\lt10^{11} \mathrm{M}_\odot$
(Wright et al. Reference Wright, Tremmel, Brooks, Munshi, Nagai, Sharma and Quinn2021). In scenarios other than Wright et al. (Reference Wright, Tremmel, Brooks, Munshi, Nagai, Sharma and Quinn2021)’s where UDGs form via galaxy mergers, prolate rotation may be expected as is commonly seen in more massive galaxies formed via mergers. Finally, if the merger were wet we would expect a strong burst in their star formation history at the time of the merger.
3.2.4. Ram pressure stripped galaxies
Description: In this formation scenario, a dwarf galaxy falling into a cluster has its gas ram-pressure stripped, which quenches star formation. Passive evolution will then lower the surface brightness into the UDG regime. Further, the removal of gas from the centre of the halo will flatten the gravitational potential and may help the galaxy to expand, becoming a UDG. This scenario is primarily discussed in: Yozin & Bekki (Reference Yozin and Bekki2015), Chilingarian et al. (Reference Chilingarian, Afanasiev, Grishin, Fabricant and Moran2019), Grishin et al. (Reference Grishin, Chilingarian, Afanasiev, Fabricant, Katkov, Moran and Yagi2021) and Junais et al. (2022).
Testable Predictions: Due to the contraction of star-forming region during a starburst, as the gas is stripped, it is expected that an age gradient will form within the proto-UDG as it is stripped of gas (Grishin et al. Reference Grishin, Chilingarian, Afanasiev, Fabricant, Katkov, Moran and Yagi2021). Likewise, a strong truncation of star formation in the star formation history of the galaxy is expected. Finally, it should be noted that it may take up to 10 Gyr for the galaxy to fade into the UDG regime and thus the accretion time has to be relatively ancient for the galaxy to be observed to be a UDG today. It is unclear if there existed environments sufficiently massive 10 Gyr ago to cause similar stripping to what is seen today in clusters. Up to
$\sim$
45% of UDGs in clusters have been claimed to form via this pathway (Grishin et al. Reference Grishin, Chilingarian, Afanasiev, Fabricant, Katkov, Moran and Yagi2021).
3.3. ‘Born’ as a UDG
3.3.1. Tidally stripped stars
Description: In this formation scenario, the UDG forms from stellar material stripped from a more massive galaxy as it undergoes a tidal interaction. In this regard, it is just the large size–low surface brightness end of normal tidal dwarf galaxies (Duc Reference Duc2012). A key difference with the above Tidally Stripped Galaxy scenario is that here the galaxy does not initially form within a dark matter halo; instead, it forms from stellar material stripped out of its host dark matter halo via a tidal interaction.
Testable Predictions: Predictions that are generally applicable to tidally stripped galaxies are likely also applicable to these galaxies. Namely, as they have formed from stars stripped from a more massive galaxy, their stars should have similar ages and be more metal-rich than is generally expected for a dwarf. They should also be dark matter-free, reflective of their formation without a dark matter halo. These UDGs are likely to be found embedded within larger tidal features and may host GCs if these were stripped along with the stars. In general, significant GC populations are not expected. In general, UDGs forming via this pathway are not expected to survive for long periods of time due to their lack of a dark matter halo and formation near a more massive galaxy. According to the simulations of Moreno et al. (Reference Moreno2022), roughly one third of massive galaxies may have a dark matter-deficient satellite (not necessarily a UDG).
3.3.2. Tidally stripped gas
Description: In this formation scenario, the UDG forms in gas stripped from a more massive halo. The gas may be stripped via tidal interaction or ram pressure processes. UDGs forming in the ram pressure stripped gas of ‘jellyfish galaxies’ falling into clusters fit into this scenario. It is slightly different from the previous scenario for ‘Tidally Stripped Stars’, whereby the star formation occurs after the gas is stripped from the galaxy, rather than the stars themselves being stripped from the galaxy to form the UDG. This scenario is primarily described in Poggianti et al. (Reference Poggianti2019) and Ivleva et al. (Reference Ivleva, Remus, Valenzuela and Dolag2024).
Testable Predictions: As these UDGs form from tidally stripped gas, they are not expected to have their own dark matter halo and should have kinematics reflective of this. Likewise, as they form from gas stripped from a more massive galaxy, they should exhibit higher stellar metallicities than other dwarf galaxies of similar stellar mass. In general, it may be difficult to distinguish this scenario from the ‘Tidally Stripped Stars’ one without an indication of both the stellar age of the UDG and the age of the interaction during which its material may have been stripped.
3.3.3. Bullet dwarf
Description: This scenario represents somewhat of a combination of the above two scenarios. Namely, a string of galaxies, many of which may be UDGs, forms due to the high-velocity collision of two dwarf galaxies. It has been dubbed the ‘bullet dwarf’ scenario (Silk Reference Silk2019; van Dokkum et al. Reference van Dokkum2022) due to its resemblance to a small-scale version of the Bullet Cluster. Here, the collision separates the collisional (i.e. the gas) from the non-collisional (i.e. stars, star clusters and dark matter) elements of a dwarf galaxy halo, forming a line of new galaxy formation along the trail as the stripped gas collapses. Some of these galaxies may be UDGs. This scenario is primarily simulated/described in Silk (Reference Silk2019), Shin et al. (2020), Lee et al. (Reference Lee2021), Lee et al. (Reference Lee2024) and van Dokkum et al. (Reference van Dokkum2022).
Testable Predictions: This scenario will present as a linear trail of galaxies on the sky, assuming that they have not had their orbits perturbed since formation. At either end of the trail should be the initial galaxies from the collision, being dark matter-rich and stellar mass-poor (reflecting their loss of gas), while galaxies between them should be dark matter-poor with similar stellar populations (van Dokkum et al. Reference van Dokkum2022). They may also exhibit above-average star cluster formation (both in terms of their absolute number and their individual masses), reflective of the higher gas pressures generated by such a collision (Lee et al. Reference Lee2021). These galaxies may also exhibit stellar bodies stretching along the line of the collision (Tang et al. Reference Tang2025b), along with prolate rotation (Buzzo et al. Reference Buzzo2025b), as a memory of their formation with momentum from the collision.
3.3.4. Failed galaxies
Description: In this formation scenario, the proto-UDG forms quickly at high redshift from a single burst of high-intensity star formation in a massive dark matter halo and is rapidly quenched. This high-intensity star formation drives the efficient production of GCs, which comprise most of the proto-UDG at high redshift. Beyond environmental quenching, there is no clear mechanism to quench, and keep quenched, the proto-UDG although feedback from the formation of the GC system has been proposed (Gannon et al. Reference Gannon, Kimmig, Forbes, Brodie, Valenzuela, Remus, Pfeffer and Dolag2025). The proto-UDG then evolves passively for the remaining age of the Universe, where GCs slowly disrupt/evaporate to form/contribute to the stellar body of the galaxy and where stellar evolution causes the proto-UDG to slowly decrease in surface brightness into the UDG regime. It is dubbed a ‘failed galaxy’ as its host dark matter halo fails to form the stellar mass it is expected to given a standard stellar mass – halo mass relation. This scenario has also been dubbed a ‘pure stellar halo’ as this formation event is the epoch at which most galaxies form their stellar halo, and the UDG experiences no subsequent evolution (Peng & Lim Reference Peng and Lim2016). It is important to note that while this was initially dubbed a ‘failed Milky Way galaxy’ scenario by van Dokkum et al. (Reference van Dokkum, Abraham, Merritt, Zhang, Geha and Conroy2015a), there is now strong evidence that UDGs do not inhabit halos that are that massive (van Dokkum et al. Reference van Dokkum2019b; Gannon et al. Reference Gannon, Forbes, Romanowsky, Ferré-Mateu, Couch and Brodie2020) but are more like M33. Indeed, weak lensing analysis on stacked UDGs indicates that only a small fraction of UDGs could inhabit the relatively massive halo required for this scenario (Sifón et al. Reference Sifón, van der Burg, Hoekstra, Muzzin and Herbonnet2018). This scenario is primarily described in van Dokkum et al. (Reference van Dokkum, Abraham, Merritt, Zhang, Geha and Conroy2015a), Peng & Lim (Reference Peng and Lim2016), Beasley et al. (2016), Danieli et al. (Reference Danieli2022), Forbes & Gannon (Reference Forbes and Gannon2024), Forbes et al. (Reference Forbes, Buzzo, Ferre-Mateu, Romanowsky, Gannon, Brodie and Collins2025) and Gannon et al. (Reference Gannon, Kimmig, Forbes, Brodie, Valenzuela, Remus, Pfeffer and Dolag2025).
Testable Predictions: Due to the extreme nature of this formation scenario, it makes a number of testable predictions. Namely, the UDG should have: (1) an old age, elevated abundance of
$\alpha$
elements, flat age and metallicity gradients, and (early) single burst star formation history; (2) GC-like ages and metallicities reflective of its stellar body being significantly comprised of disrupted GCs; (3) a rich GC system reflective of its early, high-intensity star formation, and (4) a massive dark matter halo as required for it to be ‘failed’. Additional predictions may be possible depending on the mechanism of their quenching. e.g. if the ‘failed galaxies’ were environmentally quenched at high redshift, they are expected to be amongst some of the earliest dwarf galaxies to accrete onto the clusters they now observed to reside in. It is worth noting that JWST has already revealed systems with high fractions of their stars within their GC system at high redshift (Mowla et al. Reference Mowla2022, Reference Mowla2024; Vanzella et al. Reference Vanzella2022a,b, 2023; Adamo et al. Reference Adamo2024
b; Messa et al. Reference Messa, Dessauges-Zavadsky, Adamo, Richard and Claeyssens2024; Bradač et al. Reference Bradač2025; Naidu et al. Reference Naidu2025), proving possible examples of what a proto-failed galaxy-UDG may look like.
3.4. UDG formation summary
It is becoming increasingly obvious that UDGs cannot be fully described by any one of these formation scenarios alone, and that they provide a challenging ‘stress test’ to models of galaxy formation. Several processes may be contributing to the overall UDG population. For example, there are some clear cases of UDGs forming that are associated with Tidally Stripped Stars (e.g. CenA DWIII; Crnojević Reference Crnojević2017) and Tidally Stripped Gas (e.g. Hydra I-UDG32; Hartke et al. Reference Hartke2025). Furthermore, most proposed models cannot easily account for the UDGs with an elevated GC richness, beyond that of a classical dwarf. As described below in Section 5, SED and spectroscopic studies also point to the need for multiple formation pathways to explain the variety in their measured stellar populations and other characteristics (e.g. Kadowaki, Zaritsky, & Donnerstein Reference Kadowaki, Zaritsky and Donnerstein2017; Ruiz-Lara et al. Reference Ruiz-Lara2018; Ferré-Mateu et al. Reference Ferré-Mateu2018; Ferré-Mateu et al. Reference Ferré-Mateu, Gannon, Forbes, Buzzo, Romanowsky and Brodie2023; Buzzo et al. Reference Buzzo2025a). In many cases, combinations of formation scenarios, e.g. their formation in the field and subsequent quenching/tidal heating by accretion onto a dense environment (Román & Trujillo Reference Román and Trujillo2017b; Sales et al. Reference Sales, Navarro, Peñafiel, Peng, Lim and Hernquist2020), also seem likely.
Even beyond the above-discussed scenarios, it is clear that other effects likely influence the formation and evolution of UDGs. For example, the ram pressure stripping of their gas in a single passage through a dense environment may cause quiescent UDGs to appear in the field as ‘backsplash’ galaxies (Benavides et al. Reference Benavides2021). With all these different caveats in mind, we have created a reference table to summarise the basic predictions of each scenario as Figure 4. We stress that it should not be used in isolation from the detailed studies of each formation scenario, as summarised above. It is merely provided to guide future studies of UDGs. In particular, the column of ‘Halo Masses’ can classify scenarios into those thought of as ‘puffy dwarfs’ and ‘failed galaxies’. That is, UDGs forming in a halo of mass normal for a dwarf may simply be the large size end of the normal dwarf size distribution, i.e. a ‘puffy dwarf’. Alternatively, UDGs forming in more massive halos than are typical for a dwarf fit the ‘failed galaxy’ scenario. Finally, there are a number of scenarios of UDGs forming with dark matter deficiency. These tend to form from interactions with other systems.
Summary table of predicted UDG properties for each formation scenario as described in Section 3. We either write ‘-’ if the prediction is totally unclear or add a ‘?’ next to a likely property that is yet not well-established. ‘Int.’ refers to intermediate ages in first column. We provide this table as a basic guide for future UDG studies.

4. UDG internal Properties I: Clues from galaxy kinematics
Observationally, obtaining deep data to differentiate UDG formation pathways has been one of the major challenges of this decade. While deep imaging has been able to provide large samples of UDGs (e.g. the SMUDGES Survey; Zaritsky et al. Reference Zaritsky2019), many of the predictions of the formation scenarios (UDG halo masses/metallicities) require deep spectroscopy to test. From deep spectra, we can measure the galaxy’s stellar kinematics (rotational velocities, velocity dispersions), which are crucial to obtain dynamical masses. The latter, under various assumptions, may provide an estimate of their total halo mass. Good quality spectra can also be used to probe the stellar populations of UDGs. This can provide better constrained ages and metallicities compared to SED fitting or colours, while also giving star formation histories (SFHs), quenching timescales, and other chemical signatures such as elemental abundance patterns. Currently there are
$\lt$
50 UDGs with their internal properties measured with spectroscopy (Gannon et al. Reference Gannon2024a). In this Section (4; Clues from galaxy kinematics) and the following Section (5; Clues from stellar populations), we elucidate what has been learned from these efforts.
4.1. What drives UDG dark matter content – Velocity dispersions?
Given the
$L^\star$
-like, large half-light radii but low, dwarf galaxy-like stellar masses, an initial question of their study was simply which galaxy class,
$L^\star$
or dwarf, their dark matter halo resembled. Before getting too involved in the nuanced answer to this question, it is worth stating that there is very little evidence for any UDG residing in a
$L^\star$
-like dark matter halo. The weak lensing study of Sifón et al. (Reference Sifón, van der Burg, Hoekstra, Muzzin and Herbonnet2018) placed a 95% confidence interval upper limit on the halo masses of UDGs
$\log( M_{200}/\mathrm{M}_\odot)\leq11.8$
, which is less than typical of any
$L^\star$
-like galaxy (i.e.
$\log( M_{200}/\mathrm{M}_\odot)\approx12$
). Furthermore, most UDG dynamical masses are too small to be reasonably fitted by a massive,
$L^\star$
dark matter halo (Gannon et al. Reference Gannon, Forbes, Romanowsky, Ferré-Mateu, Couch and Brodie2020). For this reason, the field has reframed the original question as to whether or not UDGs reside in
$L^\star$
or dwarf-like dark matter halos into whether or not they reside in dwarf-like dark matter halos, or those that are more massive than is typical for a dwarf. This is particularly relevant when discussing the ‘failed galaxy’ scenario described above.
In order to answer this question we need to measure masses for UDGs. A common way to do this is to measure a velocity dispersion from the stars within the galaxy or from the dispersion of GC recessional velocities about the galaxy. When estimating dynamical masses for UDGs, the vast majority of studies have used the Jeans mass estimator of Wolf et al. (Reference Wolf, Martinez, Bullock, Kaplinghat, Geha, Muñoz, Simon and Avedo2010), which is reliant on the square of galaxy’s velocity dispersion along with its half-light radius.
It is thus useful to investigate UDG velocity dispersions to see if they are typical for a galaxy of their stellar mass. In Figure 5 we show UDG velocity dispersions that are measured either internally from their stellar body or from the dispersion of their GC system vs. their stellar masses. When GC velocity dispersions are inferred from
$\lt$
10 GCs, we employ a greater transparency on these data points. Non-UDGs are also shown to establish the expected relationship for galaxies. The vast majority of UDG stellar velocity dispersions lie within the scatter of those for non-UDGs.
Galaxy velocity dispersion vs. galaxy stellar mass. The non-UDGs that establish the normal relation are from the works of Chilingarian et al. (Reference Chilingarian, Cayatte, Revaz, Dodonov, Durand, Durret, Micol and Slezak2009), McConnachie (Reference McConnachie2012), Cappellari et al. (Reference Cappellari2013), Harris et al. (Reference Harris, Harris and Alessi2013), grey points. UDGs are plotted from the catalogue of Gannon et al. (Reference Gannon, Ferré-Mateu, Forbes, Brodie, Buzzo and Romanowsky2024b) with dynamical masses coming from their stellar velocity dispersions (red triangles) and from their GC system velocity dispersions (blue squares). In general, UDGs with secure velocity dispersion measurements do not appear to be dynamically hotter or colder than dwarf galaxies of a similar stellar mass.

There is an interesting population of four UDGs (Hydra-I UDG4, NGC1052-DF2, PUDG_R15 and PUDG_R16) in Figure 5 that have stellar velocity dispersions significantly below what is established for non-UDGs. The integrated velocity dispersion includes any rotation (Courteau et al. Reference Courteau2014). However, this rotational term will not include any inclination correction. As per the discussion in Section 2, the UDG definition is biased to selecting face-on galaxies, so this rotation correction may be significant in some cases. After the inclusion of this correction, these UDGs mentioned above may present more regular (higher) velocity dispersions. It is worth noting that multiple UDGs, along with a number of NUDGes, have recently been reported to exhibit varying degrees of rotation, suggesting this solution may be viable (Chilingarian et al. Reference Chilingarian, Afanasiev, Grishin, Fabricant and Moran2019; Buttitta et al. Reference Buttitta2025; Levitskiy et al. Reference Levitskiy, Forbes, Gannon, Ferré-Mateu, Romanowsky, Brodie, Couch and Haacke2025). Further, it is worth noting that a pair of NUDGes that did not meet the criteria to be a UDG used by Gannon et al. (Reference Gannon, Ferré-Mateu, Forbes, Brodie, Buzzo and Romanowsky2024b) but that have been referred to UDGs by some authors (FCC 224 and NGC1052-DF4; van Dokkum et al. Reference van Dokkum, Danieli, Abraham, Conroy and Romanowsky2019a; Shen, van Dokkum, & Danieli Reference Shen, van Dokkum and Danieli2023; Tang et al. Reference Tang2025c; Buzzo et al. Reference Buzzo2025b) also would reside in a similar region to the four UDGs plotted.
Alternatively, these UDGs may exhibit a paucity of dark matter due to an exotic formation pathway. For example, there are extensive arguments in the literature advocating for and against the need to invoke a special formation pathway (e.g. van Dokkum et al. Reference van Dokkum2018; Trujillo et al. Reference Trujillo2019; Silk Reference Silk2019; Emsellem et al. Reference Emsellem2019; Montes et al. Reference Montes, Infante-Sainz, Madrigal-Aguado, Román, Monelli, Borlaff and Trujillo2020; van Dokkum et al. Reference van Dokkum2022; Keim et al. Reference Keim2022; Keim et al. Reference Keim2025) for the UDG NGC 1052-DF2. A more thorough discussion of this galaxy, and of others that may be similar (e.g. FCC 224; Tang et al. Reference Tang2025c; Buzzo et al. Reference Buzzo2025b), is beyond the scope of this work.
It is useful to note that there are also a population of UDGs above the normal relationship in Figure 5, the vast majority of which have velocity dispersions obtained from their GC system. For non-UDGs, the velocity dispersion of their GC system usually traces the velocity dispersion of their stars. Likewise, for many UDGs, this is the case (Forbes et al. Reference Forbes, Gannon, Romanowsky, Alabi, Brodie, Couch and Ferré-Mateu2021). As such, it is difficult to assess why the distribution of GC velocity dispersions is generally different to that of the stellar velocity dispersions. Those UDGs with elevated GC velocity dispersions are too faint to measure their stellar velocity dispersions. We will therefore consider alternative possibilities. These UDGs are mostly from Toloba et al. (Reference Toloba2023), who studied UDGs in the Virgo Cluster. Being in the cluster environment, it is possible that they are undergoing tidal disruption (as is the case for one UDG being plotted, VLSB-D) and that their velocity dispersion is not indicative of their total mass. Alternatively, being in a cluster environment may make it more likely that intra-cluster GCs are mis-associated with the UDG, artificially driving up their velocity dispersions. The increase of GC-velocity dispersion with local galaxy density shown in Figure 11 of Toloba et al. (Reference Toloba2023) could be an indicator of either explanation.
Finally, it is worth noting that the spread in UDG velocity dispersions may be explainable by different dark matter halo profiles. In particular, the elevated velocity dispersions seen for some UDG GC systems could be explained by highly concentrated dark matter halos (Toloba et al. Reference Toloba2023). Alternatively, UDGs with extremely low velocity dispersions may be explainable by extremely low concentration (and/or cored) dark matter halos (Trujillo-Gomez, Kruijssen, & Reina-Campos Reference Trujillo-Gomez, Kruijssen and Reina-Campos2022). A UDG dark matter halo that presents a rising velocity profile with radius may also help to explain the elevated GC velocity dispersions. As the spectroscopically measured GCs tend to trace out to larger radii (in some cases out to
$7 R_{e}$
), a rising velocity profile with radius would naturally explain GC to stellar velocity dispersions that are above unity. If this were the case, however, we would also expect UDG stellar velocity dispersion to be larger than those of dwarf galaxies of similar stellar mass, due to their being measured in a larger aperture than those of normal dwarfs.
Despite the length of discussion devoted to the outlying UDGs in Figure 5, it is worth reiterating the key conclusion, which is that the majority of measured UDG velocity dispersions (particularly UDG stellar velocity dispersions) are normal for their stellar mass. In general, UDGs do not seem to be dynamically hotter or colder than non-UDGs of a similar stellar mass.
4.2. What drives UDG dark matter content – Luminosity?
In order to best interrogate whether UDGs are dwarf-like or not in their dark matter content, it would be useful to place UDGs on the stellar mass – halo mass relationship. It is worth pointing out that neither stellar mass, nor halo mass, is a directly observable property. Instead, it is common to use a proxy for both: Luminosity as a proxy for stellar mass; Dynamical mass as a proxy for halo mass. A version of Figure 6 has been produced by several authors (e.g. Toloba et al. Reference Toloba2018; van Dokkum et al. Reference van Dokkum2019b; Gannon et al. Reference Gannon2021). Note that luminosity is only a good proxy for stellar mass if there are reasonably similar stellar populations for the galaxies plotted, and dynamical mass is only a good proxy for total halo masses mass if there are reasonably similar dark matter halo profiles for galaxies being compared. In Figure 6 non-UDGs follow a characteristic ‘U-shape’ relationship similar to the stellar to halo mass ratio vs halo mass relationship (see e.g. Figure 2 of Wechsler & Tinker Reference Wechsler and Tinker2018). It is clear, however, that many current UDG dynamical mass measurements do not follow the same U-shape and do not reside on the same locus as non-UDGs of similar luminosity. Many UDGs are much more dark matter dominated within their half-light radius than comparable non-UDGs of similar luminosity (and dwarf-like stellar mass). This presented some of the first evidence that many UDGs may not follow stellar mass – halo mass relationships that were established without considering their existence.
Mass to light ratio vs. dynamical mass within half light radius. The non-UDGs that establish the U-shaped relation are from the works of Zaritsky et al. (Reference Zaritsky, Gonzalez and Zabludoff2006), Wolf et al. (Reference Wolf, Martinez, Bullock, Kaplinghat, Geha, Muñoz, Simon and Avedo2010), Cappellari et al. (Reference Cappellari2013), Toloba et al. (Reference Toloba2014), grey points and Forbes et al. (Reference Forbes, Read, Gieles and Collins2018b). UDGs are plotted from the catalogue of Gannon et al. (Reference Gannon, Ferré-Mateu, Forbes, Brodie, Buzzo and Romanowsky2024b) with dynamical masses coming from their stellar velocity dispersions (red triangles) and their GC system velocity dispersions (blue squares). Where GC velocity dispersions are inferred from
$\lt$
10 GCs, we employ a higher transparency to indicate these dispersions may be less reliable. UDGs exhibit a wide variation in their dark matter properties, with most of them significantly more dark matter dominated than the established U-shaped relation. This is possible evidence of UDGs having halos of total mass more than is expected given the standard stellar mass – halo mass relationship.

UDGs, by definition, have larger half-light radii than the majority of galaxies that are of similar luminosity. Naturally, this will lead to larger dynamical masses (
$M_\mathrm{dyn}\propto R_{e} \sigma^2$
) as their dynamical mass is being measured within a larger aperture, taking into account more of their dark matter halo and thus increasing the relative dark matter content of the mass measurement (van Dokkum et al. Reference van Dokkum2019b). This bias cannot, however, rectify the difference between UDGs and non-UDGs seen in Figure 6. Accounting for aperture effects will make many non-UDGs exhibit dynamical mass-to-light ratios more similar to UDGs; however, the great scatter in UDG mass-to-light ratios cannot be accounted for by a simple aperture correction to non-UDG measurements (see e.g. Figure 16 of van Dokkum et al. Reference van Dokkum2019b and related discussion).
4.3. What drives UDG dark matter content – Globular clusters?
In Figure 6, many UDGs show extreme dark matter dominance and hints of dark matter halos more massive than what are expected given previously established stellar mass – halo mass relationships. A separate early sign that they may inhabit massive dark matter halos was their extreme GC richness (e.g. Beasley et al. 2016; van Dokkum et al. Reference van Dokkum2017). Given GCs are known indicators of halo mass (e.g. Spitler & Forbes Reference Spitler and Forbes2009; Harris et al. Reference Harris, Harris and Alessi2013; Burkert & Forbes Reference Burkert and Forbes2020), a key question of UDG studies has been to establish if they too follow GC number – halo mass relationships and thus if GCs may be used to estimate UDG halo masses. We will revisit this question in full in Section 6.5, however, here we will examine the first-order evidence that UDG dynamics correlate with their GC richness. Put another way, if UDG GC counts indicate their total halo masses, then we expect a correlation between UDG GC richness and dynamical mass.
In Figure 7 we plot UDG GC counts vs. their dynamical masses. Non-UDGs that establish the relationship are from Harris et al. (Reference Harris, Harris and Alessi2013), and a best-fitting line to their relationship is included. In the same way, dynamical mass may be thought of as a proxy for halo mass when creating Figure 6. Here again, we use dynamical mass as a proxy for halo mass. It is worth noting that this assumption strictly assumes a level of self-similarity of UDG dark matter halo profiles whereby any correlation may be diluted, were some UDGs to exhibit strong cores, while others exhibit strong cusps, at the same total halo mass. It is evident from Figure 7 that UDGs, particularly those with dynamical masses measured from their stellar velocity dispersions, follow the relationship that has been established for non-UDGs. We take their following the relationship as first-order evidence that UDGs indeed follow the GC number – halo mass relationship and that many UDGs reside in massive dark matter halos. Further, it should be noted that the relationship may tighten further when considering the stellar mass contained within the GC system. Some UDGs exhibit low numbers of GCs, but a top-heavy mass function (e.g. Janssens et al. Reference Janssens2022), which may skew them off the relationship to the right in Figure 7.
GC system richness (
$N_\mathrm{GC}$
) vs. dynamical mass (
$M_\mathrm{dyn}$
). A second y-axis is included where GC-number is translated to a total halo mass estimate using the Burkert & Forbes (Reference Burkert and Forbes2020) relation. The non-UDGs that establish the relationship are from the catalogue of (Harris et al. Reference Harris, Harris and Alessi2013, grey points). A best-fitting line to these data is included in black. Per previous plots, UDGs are plotted from the catalogue of Gannon et al. (Reference Gannon, Ferré-Mateu, Forbes, Brodie, Buzzo and Romanowsky2024b) with dynamical masses coming from their stellar velocity dispersions (red triangles) and their GC system velocity dispersions (blue squares). Per previous plots UDG GC velocity dispersions coming from
$\lt$
10 tracers are plotted with a higher transparency. UDGs with well-constrained velocity dispersions largely follow the relationship that has been established for non-UDGs. It has been suggested that this is evidence that they also follow the GC number – halo mass relationship.

Our evidence that UDGs may indeed follow the GC number – halo mass relationship has two important corollaries for their formation. Firstly, if UDG total halo masses can be estimated via their GC counts, their dark matter halo profile shape can be inferred from individual dynamical mass measurements. This has been done by various authors (e.g. Beasley et al. Reference Beasley and Trujillo2016; Toloba et al. Reference Toloba2018; Gannon et al. Reference Gannon, Forbes, Romanowsky, Ferré-Mateu, Couch and Brodie2020, Reference Gannon2022; Forbes et al. Reference Forbes, Gannon, Romanowsky, Alabi, Brodie, Couch and Ferré-Mateu2021; Levitskiy et al. Reference Levitskiy, Forbes, Gannon, Ferré-Mateu, Romanowsky, Brodie, Couch and Haacke2025) who generally find cored (or equivalently low concentration) dark matter halos best recover the measured dynamical masses of (usually GC-rich) UDGs. UDGs residing within cored or possibly low-concentration dark matter halos may agree with their formation via strong stellar feedback (Toloba et al. Reference Toloba2018) or may help with their formation via tidal effects (Carleton et al. Reference Carleton, Errani, Cooper, Kaplinghat, Peñarrubia and Guo2019). As a counterpoint to the above evidence, the study of Kravtsov (Reference Kravtsov2024) built a toy model to explain UDG dynamical masses and found the opposite – that many of the most massive UDGs may have higher, not lower, concentration dark matter halos. Secondly, and simply, if UDG GC content indicates that many UDGs reside in massive dark matter halos, the most GC-rich UDGs cannot simply be an extension of the regular dwarf galaxy population, as they reside in dark matter halos that are an order of magnitude more massive.
4.4. Where are UDGs on the fundamental plane of elliptical galaxies?
In Figure 1, we established UDGs as an extension of (primarily spheroidal) stellar systems to a new region of previously unexplored parameter space, and so it is of interest where UDGs reside on the Fundamental plane of elliptical galaxies (Djorgovski & Davis Reference Djorgovski and Davis1987), which tend to be pressure supported. Rather than plotting the plane in its traditional effective radius – velocity dispersion – surface brightness form, we place UDGs in an altered form of the plane comprising dynamical mass – effective radius – luminosity in Figure 8, as it represents a simple extension of Figure 1 in the direction of dynamical mass.
An update on Figure 9 from Gannon et al. (Reference Gannon, Forbes, Brodie, Romanowsky, Couch and Ferré-Mateu2023). Left: Mass–radius–luminosity space: half-light luminosity (
$L_{1/2}$
), 3D half-light radius (
$r_{1/2}$
) and dynamical mass within the half-light radius (
$M_{1/2}$
). We de-project the plane and zoom around the locus of UDGs on the Right. From top to bottom these are the
$L_{1/2}$
–
$r_{1/2}$
,
$L_{1/2}$
–
$M_{1/2}$
and
$r_{1/2}$
–
$M_{1/2}$
projections of the plane. We establish the locus for non-UDGs using the data from Tollerud et al. (Reference Tollerud, Bullock, Graves and Wolf2011), Toloba et al. (Reference Toloba, Boselli, Peletier, Falcón-Barroso, van de Ven and Gorgas2012), McConnachie (Reference McConnachie2012), Kourkchi et al. (Reference Kourkchi2012) and Forbes et al. (Reference Forbes, Read, Gieles and Collins2018b) (grey points). Per previous plots, UDGs are plotted from the catalogue of Gannon et al. (Reference Gannon, Ferré-Mateu, Forbes, Brodie, Buzzo and Romanowsky2024b) with dynamical masses coming from their stellar velocity dispersions (red triangles) and their GC system velocity dispersions (blue squares). Within this parameter space, the primary difference between UDGs and non-UDGs is their larger sizes.

It is clear from Figure 8 that the largest difference between UDGs and other elliptical systems is simply their increased half-light radii. This is particularly apparent in the half-light radius – luminosity and dynamical mass – half-light radius de-projections of the plane. At face value the fact that UDGs are not outliers in the dynamical mass – luminosity projection of the plane may seem at odds with previous conclusions from Figures 6 to 7. Interestingly, Kadowaki et al. (Reference Kadowaki, Zaritsky and Donnerstein2021) show in their Appendix A how the large half-light radius of UDGs, and the current bias towards spectroscopy of the largest UDGs, creates a bias towards UDGs residing in massive dark matter halos.
Furthermore, Zaritsky & Behroozi (Reference Zaritsky and Behroozi2023) and Zaritsky et al. (Reference Zaritsky, Donnerstein, Dey, Karunakaran, Kadowaki, Khim, Spekkens and Zhang2023) show how a fundamental plane type argument can be used to estimate UDG halo masses from their photometric properties. Again, UDGs are found to reside in dark matter halos of greater mass than similar stellar mass dwarfs, with a primary driver of this conclusion being their large half-light radii. A key conclusion of their work is that UDGs may present at least two times and frequently greater than ten times less efficient star formation than dark matter halos of a similar mass that do not host UDGs (Zaritsky et al. Reference Zaritsky, Donnerstein, Dey, Karunakaran, Kadowaki, Khim, Spekkens and Zhang2023). This conclusion would support the scenario where some UDGs are ‘failed galaxies’ that have failed to form the stellar mass expected given their massive dark matter halos.
Indeed, much of the evidence currently gathered from UDG dynamics, their elevated mass to light ratios (Figure 6), their following the GC number – dynamical mass relationship (Figure 7) and their fundamental plane positioning (Figure 8), all support the hypothesis that a significant fraction of the UDGs studied for dynamics may follow a ‘failed galaxy’ formation pathway. We stress the importance of the caveat that the conclusion can only be reached for the (biased) sample of UDGs for which we currently have high-quality spectroscopy that enables dynamical measurements. As this sample has been shown to be generally larger and more GC-rich than the general UDG population (Gannon et al. Reference Gannon, Ferré-Mateu, Forbes, Brodie, Buzzo and Romanowsky2024b), it is likely also biased to the ‘failed galaxy’ formation pathway, in comparison to the general UDG population.
5. UDG internal Properties II: Clues from stellar populations
The study of the stellar populations of galaxies is key to understanding their formation processes. It can provide a multitude of details, e.g. the age of the stars forming the galaxy, how they formed over time (e.g. long and steady, bursty, or very quick and early star formation histories), and what was their chemical enrichment by means of the total metallicity and the relative abundances of certain elements (e.g. [Mg/Fe]). As detailed in Figure 4, these properties can help differentiate between the diverse UDG formation scenarios. Only a couple of local UDGs have been studied via resolving individual stars within the galaxy (e.g. WLM by Albers et al. Reference Albers2019, And XIX by Collins et al. Reference Collins, Williams, Tollerud, Balbinot, Gilbert and Dolphin2022, and F8D1 by Smercina et al. Reference Smercina2025). Thus, the vast majority of the stellar population information to date comes from integrated measurements of the stellar populations of UDGs.
5.1. Global stellar populations – Are UDGs the continuation of classical dwarfs?
Given the possible formation scenarios discussed in Section 3, the obvious question is whether UDGs have stellar population properties that are more similar to classical dwarfs or whether they resemble some other types of galaxies. The first constraints on the stellar populations of Coma UDGs came from broad-band colours (van Dokkum et al. Reference van Dokkum, Abraham, Merritt, Zhang, Geha and Conroy2015a), which suggested either old, metal-poor populations or somewhat younger, more metal-rich ones. Similar results were obtained by van der Burg et al. (Reference van der Burg, Muzzin and Hoekstra2016), Román & Trujillo (Reference Román and Trujillo2017a), and Prole et al. (Reference Prole, Davies, Keenan and Davies2018) for UDGs in other environments, showing some environmental differences: cluster UDGs tended to be red and quenched, while field UDGs were often bluer and still forming stars. This supported the idea that star-forming UDGs may fall into clusters, quench, and evolve into the red systems observed today (e.g. Román & Trujillo Reference Román and Trujillo2017b; Prole et al. Reference Prole, van der Burg, Hilker and Davies2019b; Román et al. Reference Román, Beasley, Ruiz-Lara and Valls-Gabaud2019)
However, given the age – metallicity degeneracy resulting from the colours alone, spectroscopy was needed to obtain more constrained results. Over the last decade a huge effort has been carried out to obtain spectra to study the stellar populations of UDGs via full-spectral-fitting or classical line indices. These studies revealed a large variety in the stellar population properties of UDGs (van Dokkum et al. Reference van Dokkum2015b; Kadowaki et al. Reference Kadowaki, Zaritsky and Donnerstein2017; Gu et al. 2018; Ferré-Mateu et al. Reference Ferré-Mateu2018; Ruiz-Lara et al. Reference Ruiz-Lara2018; Chilingarian et al. Reference Chilingarian, Afanasiev, Grishin, Fabricant and Moran2019; Ferré-Mateu et al. Reference Ferré-Mateu, Gannon, Forbes, Buzzo, Romanowsky and Brodie2023). While many of the UDGs are indeed old (
$\sim$
7–9 Gyr) and metal poor ([Z/H]
$\sim -$
1 dex), dominated by elevated [Mg/Fe] values (
$\ge$
0.4 dex), and showing very early and fast star formation histories (SFHs), others are found to be much younger (3–5 Gyr), presenting extended SFHs, and with metallicities more similar to classical dwarfs ([Z/H]
$\sim -$
0.5 dex). It is worth mentioning that these works were focused on the cluster environment, and only a few isolated or low density environment UDGs have spectroscopic results (Román et al. Reference Román, Beasley, Ruiz-Lara and Valls-Gabaud2019; Mart n-Navarro et al. Reference Martín-Navarro2019; Ferré-Mateu et al. Reference Ferré-Mateu, Gannon, Forbes, Buzzo, Romanowsky and Brodie2023), with results comparable to those in clusters. SED fitting studies have thus taken over to measure not only large, complete UDG samples, but also significantly stepping away from the cluster environment (e.g. the SMUDGes survey Zaritsky et al. Reference Zaritsky2019; Barbosa et al. Reference Barbosa2020, or the works of Buzzo et al. Reference Buzzo2022, Reference Buzzo2024, Reference Buzzo2025a, which combined the results from the MATLAS survey and other environments). These surveys now provide stellar population estimates for hundreds of UDGs, allowing comparisons across environments and with simulations.
Age – metallicity relation containing the spectroscopic samples (open circles) from Ferré-Mateu et al. (Reference Ferré-Mateu, Gannon, Forbes, Buzzo, Romanowsky and Brodie2023) with updates on their figure to include the results of Levitskiy et al. (Reference Levitskiy, Forbes, Gannon, Ferré-Mateu, Romanowsky, Brodie, Couch and Haacke2025), Buzzo et al. (Reference Buzzo2025b) and Doll et al. (Reference Doll2025). Photometric results for the SMUDGEs galaxies (Barbosa et al. Reference Barbosa2020) and those presented in Buzzo et al. (Reference Buzzo2022, Reference Buzzo2024) are also included, shown by open diamonds, creating a sample of 212 UDGs in total. The figure also shows the distributions in age and metallicity of different types of galaxies for comparison, as specified in the legend, from: Janz et al. (Reference Janz2016), Ferré-Mateu et al. (Reference Ferré-Mateu2018), Recio-Blanco (Reference Recio-Blanco2018), Naidu et al. (Reference Naidu2022), and Romero-Gómez et al. (Reference Romero-Gómez2023). Overall, UDGs span the entire age range, from very young to very old ones (1–14 Gyr), but are more limited in total metallicities to [Z/H]
$\gt-1.5$
dex. It can be seen that overall UDGs are compatible with being the envelope of the bulk of the age – metallicity distribution of dEs, but particularly for the older ones the locus is also shared with the GCs and dSphs distributions.

The age – metallicity distribution of the compilation of 212 observed UDGs is shown Figure 9. The figure is updated from Ferré-Mateu et al. (Reference Ferré-Mateu, Gannon, Forbes, Buzzo, Romanowsky and Brodie2023) to incorporate all the UDGs currently with spectroscopic measurements discussed above (filled circles), and also those with SED fitting stellar population results (open diamonds). The density contours and distributions correspond to the stellar populations of comparison galaxies such as dEs (light blue), dSphs (dark blue) and GCs (light brown; see Ferré-Mateu et al. (Reference Ferré-Mateu, Gannon, Forbes, Buzzo, Romanowsky and Brodie2023) for a full description of these comparison samples). Overall, the bulk of red, quenched UDGs span all possible ages, with a median value of
$\langle$
Age
$\rangle=7.1\pm2.7$
Gyr (see also Table 1), but are more limited in total metallicities, with an average of
$\langle$
[Z/H]
$\rangle=-1.08\pm0.34$
dex. UDGs overall seem to occupy the lower envelope of the dE distribution. Except for the younger UDGs, which mostly match dEs, the age – metallicity relation is not enough to decipher the origin of older UDGs, which show metallicities that resemble either dEs, dSphs and GCs. In particular, the oldest UDGs could also track the GCs distribution.
Average stellar population properties of UDGs from spectroscopic and photometric studies († only spectroscopic).

5.1.1. The mass – Metallicity relation of UDGs
One of the key scaling relations in stellar population studies is that of the stellar mass – metallicity relation (MZR, e.g. Gallazzi et al. Reference Gallazzi, Charlot, Brinchmann, White and Tremonti2005; Panter et al. Reference Panter, Jimenez, Heavens and Charlot2008; Kirby et al. Reference Kirby, Cohen, Guhathakurta, Cheng, Bullock and Gallazzi2013). Figure 10 shows the same UDGs and comparison galaxies as in Figure 9, together with the local MZR of both low- and high-mass galaxies (Kirby et al. Reference Kirby, Cohen, Guhathakurta, Cheng, Bullock and Gallazzi2013 and Panter et al. Reference Panter, Jimenez, Heavens and Charlot2008, respectively), and the MZR of
$z\sim$
2 galaxies from Ma et al. (Reference Ma2020). The figure shows that UDGs fill the gap between dEs and dSphs in terms of stellar mass, and overall scatter around the scaling relations at a given stellar mass, consistent with a continuation of the dwarf sequence (e.g. puffed-up dwarfs). Yet, some outliers seem to exist. Some UDGs exhibit higher-than-expected metallicities, which might be interpreted as remnants of tidally stripped more massive galaxies (e.g. Liao et al. Reference Liao2019; Sales et al. Reference Sales, Navarro, Peñafiel, Peng, Lim and Hernquist2020; Tremmel et al. Reference Tremmel, Wright, Brooks, Munshi, Nagai and Quinn2020; Zheng et al. Reference Zheng, Liao, Gao and Jiang2025). Conversely, there is a non-negligible fraction (roughly 10% of the entire sample) of galaxies with lower-than-expected metallicities that instead follow the
$z\sim 2$
MZR. Interestingly, the majority of them are GC-rich (see Buzzo et al. Reference Buzzo2025a) and show the most elevated [Mg/Fe] (see Ferré-Mateu et al. Reference Ferré-Mateu, Gannon, Forbes, Buzzo, Romanowsky and Brodie2023 and next section), thus suggesting a formation as ‘failed galaxies’ that quenched early before significant metal enrichment (Ferré-Mateu et al. Reference Ferré-Mateu, Gannon, Forbes, Buzzo, Romanowsky and Brodie2023, Buzzo et al. Reference Buzzo2025a).
The figure shows an updated version of the stellar mass – metallicity relation (MZR) from Ferré-Mateu et al. (Reference Ferré-Mateu, Gannon, Forbes, Buzzo, Romanowsky and Brodie2023). The MZRs and their intrinsic scatter of local massive and low-mass galaxies (dashed line for Panter et al. Reference Panter, Jimenez, Heavens and Charlot2008 and solid line for Simon Reference Simon2019, respectively) are shown, as well as the theoretical prediction of
$z\sim 2$
galaxies (dotted line, Ma et al. Reference Ma, Hopkins, Faucher-Giguère, Zolman, Muratov, Kereš and Quataert2016). Similar to Figure 9, it shows the spectroscopic and SED fitting sample of UDGs, and the distribution of dEs, dSphs, and GCs in coloured contours. It can be seen that the bulk of the UDGs scatter around the MZRs, with similar metallicity to classical dwarfs at their stellar mass. However, several UDGs inhabit the parameter space around the
$z\sim 2$
relation, possibly indicative of a ‘failed-galaxy’ origin.

An alternative way to elucidate the formation pathways of UDGs is to compute the deviation from the MZR,
$\delta_{\rm{MZR}}$
(e.g. Barbosa et al. Reference Barbosa2020; Buzzo et al. Reference Buzzo2024, Reference Buzzo2025a) and study whether there is any dependence of this deviation on other structural and stellar population parameters. Buzzo et al. (Reference Buzzo2025a) found that UDGs that aligned with the traditional dwarf MZR tend to be younger, exhibit extended SFHs, contain relatively few GCs, and have lower GC-to-stellar mass ratios. In contrast, UDGs falling below the dwarf MZR are generally older, have early and fast SFHs, and are rich in GCs. Notably, UDGs with the highest GC system masses relative to their stellar mass are found below the local MZR, inhabiting a higher redshift MZR locus as may be expected if they are ‘failed galaxies’. We will discuss this further in Section 7.
Finally, we wish to include a note of caution to those assembling their own MZR relations for UDGs. Although the MZR is one of the most powerful relations to understand the nature of galaxies, it also presents some intrinsic caveats. For instance, some works present [Fe/H] as total metallicity, which is not necessarily the case for low-metallicity galaxies, as the low-metallicity stars used to compute the stellar population synthesis models are
$\alpha$
-enhanced. There is the issue of the inhomogeneity and incompleteness of the different samples (see e.g. Figure 6 from Buzzo et al. Reference Buzzo2024). Lastly, different techniques can introduce systematics in the recovery of certain stellar population properties (see e.g. Buzzo et al. Reference Buzzo2022; Webb et al. Reference Webb2022; Ferré-Mateu et al. Reference Ferré-Mateu, Gannon, Forbes, Buzzo, Romanowsky and Brodie2023; Buzzo et al. Reference Buzzo2024). One avenue to explore in the future would be to combine them in spectrophotometric studies, which seem to render the most constrained results (see e.g. Webb et al. Reference Webb2022; Tang et al. Reference Tang2025b,c).
5.1.2. The elevated [Mg/Fe] values of UDGs
Another key property for understanding UDG formation is the relative abundance of
$\alpha$
-elements such as Mg compared to Fe. These elements are produced by supernovae species that operate on different timescales, making them a useful cosmic clock of star formation. High [Mg/Fe] values indicate rapid, early star formation that ended before type Ia supernovae contributed significant Fe, while lower values imply more extended star formation and greater Fe enrichment (e.g. Thomas et al. Reference Thomas, Maraston, Bender and Mendes de Oliveira2005; de La Rosa et al. Reference de La Rosa, La Barbera, Ferreras and de Carvalho2011; McDermid et al. Reference McDermid2015).
Most UDGs studied so far show elevated
$\alpha$
-enhancements (typically [Mg/Fe]
$\sim$
0.4 dex; Ferré-Mateu et al. Reference Ferré-Mateu, Gannon, Forbes, Buzzo, Romanowsky and Brodie2023). In the latter, it was shown that a correlation between the [Mg/Fe] and the formation timescales did exist and that indeed those with the highest elemental abundances where the ones that formed and quenched the earliest. We cation that this could be possibly biased by the fact that the majority of these UDGs are GC-rich and in clusters. We show the [Mg/Fe]–[Z/H] relation in Figure 11, which shows that the majority of UDGs follow the locus of dEs, but also GCs and some dSphs.
However, three objects clearly stand out in the figure, presenting extremely high values of [Mg/Fe]
$\sim$
1–1.5 dex, values not even seen in the most metal-poor stars of the Milky Way halo. These puzzling cases (DGSAT I, Mart n-Navarro et al. Reference Martín-Navarro2019; Y358, Ferré-Mateu et al. Reference Ferré-Mateu, Gannon, Forbes, Buzzo, Romanowsky and Brodie2023; and Hydra-I UDG9, Doll et al. Reference Doll2025) do not fit easily within conventional enrichment scenarios and may reflect unusual processes such as Fe depletion rather than Mg enhancement, or an extremely early, truncated star formation, i.e. ‘failed galaxies’.
Similar to Figures 9 and 10, now representing the [Mg/Fe]–metallicity relation. Only UDGs studied with high S/N spectroscopy are shown, as SED fitting cannot provide this property. While the majority of UDGs show
$\alpha$
-enhancement values compatible with dEs, GCs or dSphs, there are three clear outliers with [Mg/Fe]
$\gt1$
dex that inhabit a region of parameter space that was previously empty of galaxies.

5.2. What is the impact of local (global) environment on the stellar populations of UDGs?
At this point, it is clearly established that UDGs are a mixed-bag of objects, representing everything from being puffed up dwarf galaxies, to failed galaxies, and even possibly tidally stripped remnants of more massive galaxies. The diversity underscores that UDGs are not a homogeneous class, but rather a mixture of systems whose histories are imprinted in their ages, metallicities, and abundance ratios, amongst other key structural properties. The next logical question is to ask whether environment (either global or local) has any say in shaping the nature of UDGs.
UDGs in galaxy clusters have become a central focus in the past decade, particularly regarding their stellar populations. Understanding the role of a galaxy’s location within the cluster is therefore essential. Ferré-Mateu et al. (Reference Ferré-Mateu2018) showed that, although Coma UDGs’ stellar populations do not exhibit a strong cluster-centric dependence overall, those in the core appeared more metal-rich than systems in the outskirts previously studied by Kadowaki et al. (Reference Kadowaki, Zaritsky and Donnerstein2017) and Gu et al. (2018). However, such trends may be affected by projection effects, which has motivated the widespread use of phase-space diagrams to examine correlations between cluster location and UDG properties (e.g. Alabi et al. Reference Alabi2018; Gannon et al. Reference Gannon2022; Ferré-Mateu et al. Reference Ferré-Mateu, Gannon, Forbes, Buzzo, Romanowsky and Brodie2023; Forbes et al. Reference Forbes2023). These diagrams plot galaxy velocity relative to the cluster mean (normalised by the velocity dispersion) against projected cluster-centric distance (normalised by the virial radius). Different regions of this phase-space correspond to different infall epochs (Rhee et al. Reference Rhee, Smith, Choi, Yi, Jaffé and Candlish2017). While informative, this diagnostic is not without limitations: statistical contamination from interlopers may assign galaxies to incorrect infall regions. As such, trends inferred from these diagrams should be interpreted with caution.
If galaxies typically quench
$\sim$
1.5 Gyr after entering the cluster environment (Muzzin et al. Reference Muzzin2014), those quenched earliest should be expected to be concentrated in the cluster centre (i.e. in the ‘very early infall’ region). Observations, however, reveal a more complex picture. Ferré-Mateu et al. (Reference Ferré-Mateu, Gannon, Forbes, Buzzo, Romanowsky and Brodie2023) and Doll et al. (Reference Doll2025) report a broad range of quenching times–defined as the epoch when the galaxy has formed 90% of its stellar mass (
$t_{90}$
). This finding may be complicated by the fact that some of the UDGs could be quenched prior to infall. Overall, no strong correlations have been identified between infall time, position in the phase-space, and structural properties such as size or stellar mass. In terms of stellar populations, [Mg/Fe] shows the clearest trend: the most
$\alpha$
-enhanced UDGs are found near the cluster centre and exhibit evidence for rapid quenching (Ferré-Mateu et al. Reference Ferré-Mateu, Gannon, Forbes, Buzzo, Romanowsky and Brodie2023), consistent with results for non-UDGs of similar mass (e.g. Pasquali et al. Reference Pasquali, Smith, Gallazzi, De Lucia, Zibetti, Hirschmann and Yi2019; Smith et al. Reference Smith, Lucey, Hudson, Allanson, Bridges, Hornschemeier, Marzke and Miller2009; Gallazzi et al. Reference Gallazzi, Pasquali, Zibetti and Barbera2021). By contrast, stellar age and metallicity show little dependence on infall region, in line with trends seen in other dwarf galaxies (e.g. Romero-Gómez et al. Reference Romero-Gómez2023).
All in all, we note that while the phase-space can be a useful tool to understand the formation pathways of individual UDGs, it still needs to be aided by other indicators. Moreover, while the LEWIS sample (see Iodice et al. Reference Iodice2020b, Reference Iodice2023) is the most homogeneous in terms of cluster members, the number of the observed UDGs in that cluster is still limited to fewer than ten and covers only out to 0.4 virial radii, thus one of the next crucial steps will be to obtain larger, more complete samples of UDGs within a given cluster.
Environmental comparisons reveal clear differences in the stellar populations of UDGs. Systems in clusters are generally older, while those in the field tend to be younger (Barbosa et al. Reference Barbosa2020; Buzzo et al. Reference Buzzo2022). Metallicity also varies with environment: field UDGs are typically more metal-poor but follow the MZR more closely (Buzzo et al. Reference Buzzo2024), whereas cluster UDGs are slightly more metal-rich yet display a larger scatter around the relation (Ferré-Mateu et al. Reference Ferré-Mateu, Gannon, Forbes, Buzzo, Romanowsky and Brodie2023; Doll et al. Reference Doll2025). These contrasts align with their star formation histories (SFHs), which are usually more extended in field UDGs but shorter or quenched earlier in cluster systems (see e.g. Ferré-Mateu et al. Reference Ferré-Mateu2018; Ruiz-Lara et al. Reference Ruiz-Lara2018; Martín-Navarro et al. Reference Martín-Navarro2019; Villaume et al. 2022; Ferré-Mateu et al. Reference Ferré-Mateu, Gannon, Forbes, Buzzo, Romanowsky and Brodie2023; Levitskiy et al. Reference Levitskiy, Forbes, Gannon, Ferré-Mateu, Romanowsky, Brodie, Couch and Haacke2025).
The influence of the local environment appears to be especially important. Even within clusters, some UDGs reside in filaments or are on their first infall, conditions that are more akin to groups or the field than high density environments as galaxy clusters. UDGs in high-density regions seem to be both older (8.4 vs. 7.7 Gyr) and more metal-rich (
$-1.01$
vs.
$-1.16$
dex) than their low-density counterparts, as shown in Table 1.
Star formation histories further highlight these distinctions (Ferré-Mateu et al. Reference Ferré-Mateu, Gannon, Forbes, Buzzo, Romanowsky and Brodie2023). Left panel of Figure 12 shows that UDGs in dense environments typically show rapid formation and early quenching, with lookback times of
$t_{50}=9.5^{+1.0}_{-0.6}$
and
$t_{90}=7.4^{+0.5}_{-0.5}$
Gyr (being
$t_{50}$
and
$t_{90}$
the time it took to build 50 and 90% of the galaxy’s stellar mass, respectively). Those in low-density environments, by contrast, often sustain star formation for longer, quenching much later or in some cases, have not yet quenched, with
$t_{50}=6.2^{+1.4}_{-2.6}$
and
$t_{90}=0.7^{+1.3}_{-0.6}$
Gyr. The most elevated [Mg/Fe] values are overall compatible with the early and fast formation of high-density galaxies (see Figure 4 of Ferré-Mateu et al. Reference Ferré-Mateu, Gannon, Forbes, Buzzo, Romanowsky and Brodie2023). We note, moreover, that delayed SFHs as those in low-density environments of Figure 12 are compatible with those of low-mass local dwarfs (e.g. Weisz et al. Reference Weisz2014, Reference Weisz2019), or similar to the recovered SFHs of the Local Volume UDGs from resolved stars (e.g. Albers et al. Reference Albers2019; Collins et al. Reference Collins, Williams, Tollerud, Balbinot, Gilbert and Dolphin2022; Smercina et al. Reference Smercina2025)
Star formation histories adapted from Ferré-Mateu et al. (Reference Ferré-Mateu, Gannon, Forbes, Buzzo, Romanowsky and Brodie2023) (left, observations), Cardona-Barrero et al. (Reference Cardona-Barrero, Di Cintio, Battaglia, Macciò and Taibi2023) (middle, NIHAO, priv. comm), and Benavides et al. (Reference Benavides, Sales, Abadi, Vogelsberger, Marinacci and Hernquist2024) (right, TNG50). Each panel shows the cumulative mass fraction over cosmic time, coloured by the local environment: yellow-ish tones for low-density UDGs and orange for high-density ones (left panel for observations and middle one for NIHAO); or by star formation status: quenched in violet and star forming in purple (right panel, TNG50 UDGs). The dotted lines mark 50 and 90% of mass fraction assembled (used to compute the lookback times
$t_{50}$
and
$t_{90}$
). While quenched UDGs in TNG50, high-density observed UDGs, and NIHAO UDGs all show similar
$t_{50}$
(
$\sim$
$9.5$
Gyr), UDGs in low-density environments show much longer timescales of formation, with
$t_{50}\sim6.0$
Gyr. However, both NIHAO and low-density UDGs show the longest quenching timescales (as seen by the lookback time they formed 90%,
$t_{90}\sim 1-3$
Gyr), while quenched TNG50 and high-density UDGs have similarly faster ones (
$t_{90}\sim 7$
Gyr), quenching in only a couple of Gyr after half of their stellar mass was built.

5.3. Do simulated UDGs have similar stellar populations to observed ones?
As discussed in Section 3, the formation of UDGs has been explored through a variety of simulation frameworks, each highlighting different physical mechanisms, although there is a clear bias towards the ‘puffed up’ origin. Notably, while many simulations successfully produce UDGs, they often fail to generate some aspects of more compact classical dwarf ellipticals within the same stellar mass range–a discrepancy that underscores the ‘dwarf diversity’ challenge in galaxy formation models (Sales et al. Reference Sales, Wetzel and Fattahi2022).
In low-density environments, internal processes such as stellar feedback are expected to play a key role. The NIHAO simulations (Di Cintio et al. Reference Di Cintio, Brook, Dutton, Macciò, Obreja and Dekel2017; Jiang et al. Reference Jiang, Dekel, Freundlich, Romanowsky, Dutton, Macciò and Di Cintio2019a; Cardona-Barrero et al. Reference Cardona-Barrero, Di Cintio, Brook, Ruiz-Lara, Beasley, Falcón-Barroso and Macciò2020) show that repeated energy injection from supernovae induces potential fluctuations that expand the stellar component, producing galaxies with extended, bursty SFHs, low metallicities, and large sizes. The middle panel of Figure 12 shows the averaged SFHs of NIHAO UDGs (Cardona-Barrero, private communication). While these UDGs form relatively quickly, they quench later than most observed systems in low-density environments.
Environmental effects further shape UDG properties, particularly in clusters. The RomulusC (cluster) and Romulus25 (field) simulations (Tremmel et al. Reference Tremmel, Karcher, Governato, Volonteri, Quinn, Pontzen, Anderson and Bellovary2017, Reference Tremmel, Wright, Brooks, Munshi, Nagai and Quinn2020; Wright et al. Reference Wright, Tremmel, Brooks, Munshi, Nagai, Sharma and Quinn2021) suggest that early major mergers can increase angular momentum and expand the stellar distribution. In cluster environments, early infall enhances passive evolution, while in the field, merger-driven puffing-up dominates. These models align well with observational signatures such as low central surface brightness and intermediate stellar ages. Similarly, FIRE simulations (Chan et al. Reference Chan, Kereš, Wetzel, Hopkins, Faucher-Giguère, El-Badry, Garrison-Kimmel and Boylan-Kolchin2018) produce cluster UDGs through internal feedback, though quenching is artificially imposed to match cluster infall times. These models reproduce realistic sizes and morphologies, but the artificial quenching poses a challenge to directly compare to observations.
Illustris-TNG simulations (TNG50, Benavides, Abadi, & Sales Reference Benavides, Abadi and Sales2022; Benavides et al. Reference Benavides, Sales, Abadi, Vogelsberger, Marinacci and Hernquist2024 and TNG100, Sales et al. Reference Sales, Navarro, Peñafiel, Peng, Lim and Hernquist2020) extend this picture by distinguishing between ‘born’ UDGs (formed as diffuse systems before infall) and ‘tidal’ UDGs (initially more compact galaxies that were expanded through cluster tides and interactions). This framework reproduces the diversity of cluster UDGs, with cluster members tending to be smaller, redder, and more metal-poor, while systems in low-density environments retain more extended SFHs, although this model requires empirical rescaling of the metallicities due to known underestimation in the TNG suite (Nelson et al. Reference Nelson2018). Backsplash galaxies may naturally account for the presence of red quenched UDGs beyond the cluster virial radius (Benavides et al. Reference Benavides2021). The right panel of Figure 12 shows the averaged SFHs of both quenched UDGs and star forming ones adapted from Benavides et al. (Reference Benavides, Sales, Abadi, Vogelsberger, Marinacci and Hernquist2024). The authors compare the shape of these SFHs with the high-density and low-density ones from observations (Ferré-Mateu et al. Reference Ferré-Mateu, Gannon, Forbes, Buzzo, Romanowsky and Brodie2023), showing that quenched cluster UDGs in TNG50 have similar quenching timescales to observed high-density systems, whereas field and group UDGs display more prolonged star formation, consistent with observational trends.
Across simulations, the environmental modulation of SFHs is also reflected in the MZR. Figure 13 shows again the already discussed MZR (as in Figure 10), this time including the values from the different suites of simulations just discussed. This figure, adapted from Ferré-Mateu et al. (Reference Ferré-Mateu, Gannon, Forbes, Buzzo, Romanowsky and Brodie2023), splits both observations and simulations by their local environment (high density for cluster, low density for field, groups or infall/filaments). We first note, as previously discussed, that some of the simulations like TNG first apply a shift in their total metallicities to match the observations. After those corrections, high-density simulations broadly follow the observed relation but show less scatter than observed UDGs, even if considering the entire simulated ensemble. Notably, no high-density simulation reproduces the lowest-metallicity systems interpreted as ‘failed galaxies’. In low-density environments, NIHAO simulations can achieve such low metallicities, although these systems cannot represent failed galaxies due to their extended, bursty SFHs.
Figure adapted from Ferré-Mateu et al. (Reference Ferré-Mateu, Gannon, Forbes, Buzzo, Romanowsky and Brodie2023) to show the MZR as in Figure 10, for both observed (coloured symbols) and simulated UDGs (white, grey, and black symbols), now divided by local environment: high-density galaxies (cluster, left panel) and low-density ones (field, group, and infall/filaments, right panel). The local scaling relations at different stellar masses are shown in thick solid and dashed lines (Panter et al. Reference Panter, Jimenez, Heavens and Charlot2008 and Simon Reference Simon2019, respectively), together with their intrinsic scatter (thin solid lines). The theoretical relation for
$z\sim$
2 galaxies (Ma et al. Reference Ma, Hopkins, Faucher-Giguère, Zolman, Muratov, Kereš and Quataert2016) in shown as a dotted line. Per the legends, UDGs considered to be infalling are represented in both panels in a split coloured symbol. FIRE (white crosses; Chan et al. Reference Chan, Kereš, Wetzel, Hopkins, Faucher-Giguère, El-Badry, Garrison-Kimmel and Boylan-Kolchin2018), RomulusC (black triangles; Tremmel et al. Reference Tremmel, Wright, Brooks, Munshi, Nagai and Quinn2020), TNG100 (white diamonds for cluster UDGs, shaded grey for those identified as tidally stripped galaxies; Sales et al. Reference Sales, Navarro, Peñafiel, Peng, Lim and Hernquist2020), and TNG50 (white squares; Benavides et al. Reference Benavides, Sales, Abadi, Vogelsberger, Marinacci and Hernquist2024) are depicted in the high-density panel. Romulus25 (Wright et al. Reference Wright, Tremmel, Brooks, Munshi, Nagai, Sharma and Quinn2021), TNG50, and TNG100 also appear in the low-density category, alongside UDGs with NIHAO (white x-symbols; Di Cintio et al. Reference Di Cintio, Brook, Dutton, Macciò, Obreja and Dekel2017). While simulations produce UDGs with a relatively tight scatter in their MZR, observed UDGs are found to have a much larger scatter. In particular there is a population of low metallicity UDGs that do not appear to be well reproduced by any of the simulations, i.e. those of the ‘failed galaxy’ type.

Overall, the comparison of SFHs and MZR across simulations and observations emphasises the combined influence of internal feedback, mergers, and environment in producing the full diversity of UDGs, while highlighting specific areas, such as extremely metal-poor UDGs, where simulations remain incomplete.
5.4. Do UDGs build their stellar mass in the same way as classical dwarfs?
Spatially resolved stellar populations offer critical insight into UDG formation, complementing integrated measurements. Observations of stellar population gradients, though limited, thus provide a more detailed view of these processes.
DF44 was the first UDG with measured radial gradients (Villaume et al. 2022), benefiting from high signal-to-noise and multiple radial bins. Its age profile is essentially flat, consistent with classical dwarfs, but the metallicity profile is flat-to-rising, contrasting with both classical dwarfs and simulations of UDGs. This is, DF44 seemed to have formed inside-out rather than outside-in like the rest of classical dwarfs. Subsequent spatially resolved studies confirmed that DF44 is not unique. Other cluster UDGs (DFX1, DF07, PUDG-R21, FCC 224) and even NUDGEs (PUDG-R27 and VCC 1448), all exhibit similarly flat age gradients and flat-to-rising metallicity profiles (Ferré-Mateu et al. Reference Ferré-Mateu, Gannon, Forbes, Romanowsky, Buzzo and Brodie2025; Buzzo et al. Reference Buzzo2025b; Levitskiy et al. Reference Levitskiy, Forbes, Gannon, Ferré-Mateu, Romanowsky, Brodie, Couch and Haacke2025), and the group UDG NGC1052-DF2 shows also comparable trends (Fensch et al. Reference Fensch2019).
Figure 14 is updated from Ferré-Mateu et al. (Reference Ferré-Mateu, Gannon, Forbes, Romanowsky, Buzzo and Brodie2025)Footnote
b
to include all UDGs with spatially resolved stellar populations. It also summarises the gradients of simulated UDGs (Cardona-Barrero et al. Reference Cardona-Barrero, Di Cintio, Battaglia, Macciò and Taibi2023; Benavides et al. Reference Benavides, Sales, Abadi, Vogelsberger, Marinacci and Hernquist2024) and classical dwarfs (Graus et al. Reference Graus2019; Mercado et al. Reference Mercado2021), while presenting the observed distribution in classical dwarfs (Koleva et al. Reference Koleva, Prugniel, De Rijcke and Zeilinger2011; Sybilska et al. Reference Sybilska2017; Bidaran et al. Reference Bidaran2022; Lipka et al. Reference Lipka, Thomas, Saglia, Bender, Fabricius and Partmann2024). NIHAO UDGs display mild negative metallicity gradients, (
$\bigtriangledown$
([Z/H])
$=-$
0.125 dex/log(
$R/R_{e}$
), red dashed line), while TNG50 UDGs exhibit steeper gradients, particularly among star-forming systems (see Benavides et al. Reference Benavides, Sales, Abadi, Vogelsberger, Marinacci and Hernquist2024). Notably, the only TNG50 UDGs that show slightly flat or rising metallicity profiles are those that have experienced tidal stripping. Figure 14 illustrates that UDGs populate the tail of classical dwarf gradients, towards the flat-to-rising metallicity regime. Collectively, these patterns support a ‘Everything, everywhere, all at once’, nearly coeval, inside-out formation mode (Ferré-Mateu et al. Reference Ferré-Mateu, Gannon, Forbes, Romanowsky, Buzzo and Brodie2025).
Figure adapted from Ferré-Mateu et al. (Reference Ferré-Mateu, Gannon, Forbes, Romanowsky, Buzzo and Brodie2025) to show the age and metallicity gradients derived for UDGs in that work but also including the recent results of Buzzo et al. (Reference Buzzo2025b) and Levitskiy et al. (Reference Levitskiy, Forbes, Gannon, Ferré-Mateu, Romanowsky, Brodie, Couch and Haacke2025). Individual simulated UDGs from TNG50 (Benavides et al. Reference Benavides, Sales, Abadi, Vogelsberger, Marinacci and Hernquist2024) and the mean gradient for NIHAO ones (Cardona-Barrero et al. Reference Cardona-Barrero, Di Cintio, Battaglia, Macciò and Taibi2023) are shown as red asterisks and a red dashed line, respectively, while FIRE-2 simulated dwarfs (Graus et al. Reference Graus2019; Mercado et al. Reference Mercado2021), are shown in black asterisks. The grey band indicates the regime considered as a ‘flat’ metallicity gradient according to Mercado et al. (Reference Mercado2021). Observed classical dwarfs are shown as black diamonds (data from Koleva et al. Reference Koleva, Prugniel, De Rijcke and Zeilinger2011; Sybilska et al. Reference Sybilska2017; Bidaran et al. Reference Bidaran2022; Lipka et al. Reference Lipka, Thomas, Saglia, Bender, Fabricius and Partmann2024). This figure shows that UDGs are broadly consistent with extending the distribution of both simulated and observed dwarfs, towards flat-to-rising metallicity profiles. This suggests an inside-out formation scenario, unlike most known dwarfs that are thought to form outside-in.

It is noteworthy that most of the UDGs studied for resolved gradients are cluster members and, in particular, GC rich, with the exception of NGC1052-DF2 in a group environment. Extending these studies to field UDGs and/or GC-poor systems will be essential to test whether these trends are universal or not.
Finally, gradients in [Mg/Fe] remain largely unexplored. Currently, only DF44 has a published
$\alpha$
-element profile, showing a mildly declining gradient (Villaume et al. 2022). Incorporating [Mg/Fe] gradients across a broader sample would help clarify how UDGs assemble their
$\alpha$
elements, and further distinguish what sets them apart from the general classical dwarf population.
6. The globular cluster systems of UDGs
One of the most intriguing aspects of UDGs is their globular cluster (GC) content. Indeed, shortly after their initial discovery in the Coma cluster, they were found to host ‘extensive’ (i.e. rich) GC systems (van Dokkum et al. Reference van Dokkum2017). While many host GC systems similar in number to those of classical dwarfs (i.e. up to a dozen GCs), others reveal relatively rich GC systems (i.e. many tens, and perhaps up to a hundred). Thus, along with effective radius, UDGs have some properties that are more similar to giant galaxies, despite their dwarf-like stellar masses. As discussed below, the UDGs with rich GC systems may also have giant galaxy-like GC luminosity functions and dark matter halos.
The largest samples of UDGs with GC counts, and the benefit of being at a similar distance (which reduces systematics), are those in the Coma (distance
$\sim$
100 Mpc), Perseus (75 Mpc), Hydra-I (50 Mpc) and Fornax (20 Mpc) clusters. Good quality GC studies also exist for UDGs in the Virgo cluster (16.5 Mpc), although distance variations of a few Mpc (
$\sim$
20%) can be present. Using studies of UDGs in these clusters and in lower density environments (e.g. the MATLAS survey Marleau et al. Reference Marleau2021), along with limited spectroscopy, we address here various questions pertaining to the GC systems of UDGS.
6.1. Quantifying GC systems
Typically, the first step in measuring a GC system associated with a galaxy is to count its GCs from imaging. For UDGs beyond the Local Volume, ground-based imaging cannot resolve them (unless it is complemented with adaptive optics) and so it struggles to separate GCs from foreground stars and background galaxies even with multiple filters. HST, JWST and Euclid have provided a significant improvement in selecting GCs by partially, or fully, resolving them (GCs typically have half-light radii of 2–3 pc). Euclid has 0.1" pixels in its VIS instrument and can partially resolve GCs at Virgo-like distances (
$\sim$
16.5 Mpc). On HST, the ACS/WFC (0.05" pixels) and WFC3/UVIS (0.04" pixels) instruments can reach to
$\sim$
35 and 40 Mpc respectively. The NIRCam instrument on JWST in its highest resolution mode has a pixel scale of 0.03". Strictly speaking, imaging only gives GC candidates which require a radial velocity for confirmation.
In order to estimate a total number of GCs (N
$_\mathrm{GC}$
), corrections are needed for magnitude and radial incompleteness, and background contamination. The magnitude correction may require an assumption about the GC luminosity function (GCLF), i.e. whether to use that of luminous galaxies or one more appropriate for dwarfs. Different studies have adopted different approaches and hence variations in the number of GCs assigned to a galaxy can vary from one author to the next.
In order to compare different galaxies, it is useful to normalise N
$_\mathrm{GC}$
by galaxy luminosity or stellar mass since bigger galaxies tend to have larger GC systems. Galaxy luminosity has the advantage of being an observable (when the distance is known), but it is difficult to compare young galaxies with older, passive hosts. In principle, using galaxy stellar mass (
$M_{\ast}$
) removes the effects of stellar populations on the luminosity. This is combined with the total mass of the GC system
$M_\mathrm{GC}$
to create the ratio
$M_\mathrm{ GC}$
/
$M_{\ast}$
. This ratio, or a similar variant, is becoming more widely used than the older specific frequency (
$S_N$
) which is essentially the ratio of N
$_\mathrm{GC}$
to galaxy luminosity
$L_V$
, i.e.
$S_N = N_\mathrm{GC}$
$\times$
10
$^{0.4(M_V + 15)}$
.
To calculate
$M_\mathrm{GC}$
it is common to simply multiply
$N_\mathrm{GC}$
by the mean mass of a GC. This has the advantage of being less sensitive to the GCs with the faintest magnitudes. A mean GC mass of 2
$\times$
10
$^5$
M
$_{\odot}$
(Jordán et al. Reference Jordán2007) is often assumed, although second order variations in the ‘universal’ GCLF are known (Villegas et al. Reference Villegas2010). For example, although the GCLF in dwarf galaxies typically appears to have a Gaussian shape (Georgiev et al. Reference Georgiev, Hilker, Puzia, Goudfrooij and Baumgardt2009), some studies suggest that the mean GC mass is closer to 1
$\times$
10
$^5$
M
$_{\odot}$
for dwarf galaxies. Therefore given the low numbers of GCs in the lowest mass dwarf galaxies, it is better to sum individual GC masses rather than assume a mean value (Forbes et al. Reference Forbes, Read, Gieles and Collins2018b).
One of the key differences between UDGs and classical dwarfs is that in some cases, such as in the Coma cluster, UDGs can host a larger number of GCs, on average, for a given host galaxy stellar mass. One of the first works to indicate this was Beasley & Trujillo (Reference Beasley and Trujillo2016), who found a rich GC system in the Coma UDG DF17 from 3-filter HST imaging. Figure 15 shows GC counts for Coma cluster galaxies with data from Forbes et al. (Reference Forbes, Alabi, Romanowsky, Brodie and Arimoto2020). Here all of the GC counts have been estimated using HST imaging in a similar manner and the galaxies are located at the same distance (i.e. 100 Mpc in the Coma cluster), so it is a good dataset for a relative comparison between UDGs and classical dwarfs. Although GC-poor UDGs overlap with classical dwarfs, the GC-rich ones extend to numbers not seen in dwarfs. The addition of further data has indicated that the most GC-rich Coma UDGs are not simply puffed-up classical dwarf galaxies.
Number of GCs around Coma cluster galaxies as a function of galaxy stellar mass. Red squares show UDGs, blue circles classical dwarf galaxies and open squares non-UDG low surface brightness galaxies. There is a clear trend for Coma UDGs to have more GCs, on average, than a Coma classical dwarf galaxy of the same stellar mass. Data from Forbes et al. (Reference Forbes, Alabi, Romanowsky, Brodie and Arimoto2020).

While the Coma cluster reveals the best evidence for GC-rich UDGs, there are strong indications for an environmental dependency, with GC richness increasing with environmental density. In the field and groups, UDGs generally host small numbers of GCs (Somalwar et al. Reference Somalwar, Greene, Greco, Huang, Beaton, Goulding and Lancaster2020; Jones et al. Reference Jones2023). In the MATLAS survey of low density groups, around 2/3 of their 74 UDGs imaged hosted between zero and two GCs, and only four had more than 20 GCs. The Fornax cluster with a halo mass of
$\sim$
7
$\times$
10
$^{13}$
M
$_{\odot}$
is about 1/10 the mass of the Coma cluster (and contains around 1/10 as many UDGs). In a study of UDGs (and LSBs) in the Fornax cluster, Prole et al. (Reference Prole2019a) found all of the dozen UDGs to have fewer than 20 GCs. With a comparable halo mass to the Coma cluster, the Perseus cluster and has been studied using both HST (Janssens et al. Reference Janssens2024) and Euclid (Saifollahi et al. Reference Saifollahi2025a). In both studies several UDGs hosting up to 40–50 GCs were reported. And similar to Coma, the UDGs in Perseus have more GCs per unit stellar mass than classical dwarfs (Saifollahi et al. Reference Saifollahi2025a). More work is needed, along with comparative studies of classical dwarfs, to quantify these environmental trends.
The trend of higher GC richness for UDGs in higher density environments is similar to that found for classical dwarfs, e.g. Peng et al. (Reference Peng2008). For classical dwarfs, this may be explained by different assembly histories for cluster and field galaxies, combined with quenching due to cluster infall (Mistani et al. Reference Mistani2016). The expectations for GC richness in the various UDG formation mechanisms are summarised in Figure 4. For example, the UDG formation processes involving high spin or SN feedback of classical dwarfs would be expected to leave the number of GCs relatively unchanged. Tidal heating (Carleton et al. Reference Carleton, Guo, Munshi, Tremmel and Wright2021) may boost the
$M_\mathrm{GC}/M_{\ast}$
ratio of UDGs by the preferential loss of stars but not the absolute number. On the other hand, while mergers may add more GCs, the ratio
$M_\mathrm{GC}/M_{\ast}$
in the merger remnant will be similar to that of the progenitors since additional stars are added as well. Tidal stripping of a classical dwarf must be severe to remove GCs but certainly will not raise their absolute number. Elevating the GC count compared to classical dwarf galaxies might be possible in both the ‘bullet dwarf’ and ‘failed galaxy’ scenarios. We note that in the UDG literature, both a simple GC number count (e.g.
$N_\mathrm{GC} \gt 20$
) and
$M_\mathrm{GC}/M_{\ast}$
(e.g.
$\gt$
2%) have been used as a measure of GC richness.
6.2. Do UDGs have a ‘universal’ Gaussian GCLF?
The GCLF turnover magnitude occurs at slightly fainter magnitudes (Villegas et al. Reference Villegas2010) in dwarf compared to giant galaxies. This is likely driven by differences in the ages and metallicities of the GCs in dwarf galaxies. The effect is much weaker in blue filters (e.g. the g-band) than red filters (e.g. z-band). For example, from a sample of luminous ellipticals the Vega system turnover magnitude in the I-band is
$M_I\sim -8.5$
(Kundu & Whitmore Reference Kundu and Whitmore2001) while it is
$M_I\sim -8.2$
for a sample of dwarf ellipticals (Miller & Lotz Reference Miller and Lotz2007). A similar GCLF offset in magnitude was noted by Saifollahi et al. (Reference Saifollahi2025b) after stacking massive vs. dwarf galaxies in the Fornax cluster (although they also noted three dwarf galaxies to have brighter than normal GCLF turnovers).
Studies of the GCLF of UDGs are hindered by typically low numbers of GCs for individual galaxies and by distance, which restricts both the magnitude coverage of the GCLF and the ability to resolve individual objects with space-based telescopes. Using deep HST/ACS imaging, Mihos et al. (Reference Mihos, Durrell, Toloba, Peng, Lim and Cŏté2025) studied the rich GC system of the UDG VCC 615 in the Virgo cluster (
$N_\mathrm{GC}$
= 25). This galaxy has the advantage of a Tip of the Red Giant Branch measured distance and resolved GCs with 92% coverage of the GCLF magnitude range. They found a turnover magnitude of
$M_I$
between
$-$
8.45 and
$-$
8.68 for their sample of GCs, thus closer to that of luminous ellipticals than dwarfs.
Perhaps the best defined GCLF of an individual UDG is that of NGC 5846_UDG1. It is located in the NGC 5846 group and its GC system has been probed by Danieli et al. (Reference Danieli2022) using deep HST/WFC3 imaging. They identified GC candidates covering 94% of the GCLF and estimated a total system of 54
$\pm$
9 GCs. From an initial dozen confirmed GCs with radial velocities (Müller et al. Reference Müller2020a), this has risen recently to 20 (Haacke et al. Reference Haacke2025). Furthermore, some 30 candidates having measured FWHM sizes consistent with GCs at 26.5 Mpc (the assumed distance used by Danieli et al. Reference Danieli2022). At this assumed distance the measured turnover corresponds to
$M_V=-7.5$
(see Figure 16), i.e. consistent with that of luminous galaxies (Rejkuba Reference Rejkuba2012). Using a stricter colour selection for GC candidates, Guerra Arencibia et al. (Reference Guerra Arencibia, Montes, Golini and Trujillo2025) argue for a smaller system of 33
$\pm$
3 GCs and a closer distance of 20 Mpc. At this distance, UDG1 would lie outside of the NGC 5846 group, thus making UDG1 even more remarkable – a GC-rich field UDG.
GCLF of NGC 5846_UDG1 reproduced from Figure 4 left of Danieli et al. (Reference Danieli2022). This UDG has a GCLF that is well fit by a GCLF with a turnover magnitude of
$M_V=-7.5$
and a dispersion
$\sigma=1.1$
(i.e. similar to that of giant galaxies). Thirty GC candidates are spatially resolved by HST/WFC3 with sizes consistent with GCs at a distance of 26.5 Mpc.

In a study of 18 UDGs (and other LSB galaxies) in the Perseus cluster, Li et al. (Reference Li2025) found that the majority have a GCLF turnover consistent with that of dwarf galaxies. However, they also found two galaxies to have brighter GCLF turnovers by
$\sim$
1 mag. A similar
$\sim$
1 mag brighter than the typical dwarf galaxy GCLF was reported by Janssens et al. (Reference Janssens2022) for the field UDG DGSAT-I. By stacking the GCLF of six Coma cluster UDGs observed using HST/ACS, Saifollahi et al. (Reference Saifollahi, Zaritsky, Trujillo, Peletier, Knapen, Amorisco, Beasley and Donnerstein2022) measured a turnover that was more consistent giant galaxy GCLFs (once a correction for Vega to AB magnitudes is correctly applied). In the case of the GC-rich UDG (
$N_\mathrm{GC} = 52\pm 12$
studied by Fielder et al. (Reference Fielder, Jones, Sand, Bennet, Crnojević, Karunakaran, Mutlu-Pakdil and Spekkens2023), the GC candidates are more compatible with a dwarf galaxy GCLF.
Thus there appears to be a diversity of GCLFs in UDGs, with some revealing dwarf-like turnover magnitudes and others being more consistent with more massive galaxies. As well as these single Gaussian GCLFs (albeit of varying turnover magnitudes), there are several reported cases of double-peaked GCLFs. In particular, the brighter peak corresponds to GCs with luminosities similar to those of UCDs (i.e. up to 2–3 mag brighter than the brightest GCs). Such objects have been identified around the galaxies DF2 and DF4 in the NGC 1052 group, along with a population of normal luminosity GCs (van Dokkum et al. Reference van Dokkum, Danieli, Abraham, Conroy and Romanowsky2019a). The status of the overly luminous GCs, and their large in physical sizes, depends on the distance to the host galaxy which has been disputed by some (e.g. Trujillo et al. Reference Trujillo2019). However, when considered along with the other fainter GC candidates in these two galaxies their GCLFs are double-peaked, and will remain so regardless of their assumed distance (Shen, van Dokkum, & Danieli Reference Shen, van Dokkum and Danieli2021a).
A galaxy with many similar properties to DF2 and DF4, including the presence of some overly-luminous GCs is FCC224, i.e Tang et al. (Reference Tang2025c). In this case, the galaxy lies in the outer parts of the Fornax cluster within its velocity phase-space envelope and with an surface brightness distance distance, thus it can be confidently assigned to the Fornax cluster. Tang et al. (Reference Tang2025c) showed the GCLF of FCC224 along with that of DF2 and DF4, in comparison to the stacked GCLF of classical dwarf galaxies in the Fornax cluster. The GCs of FCC224 reveal a
$\sim$
1 mag brighter GCLF turnover (with several of the brighter GCs spectroscopically-confirmed by Buzzo et al. Reference Buzzo2025b) and a few fainter GCs, with the GCLF of the classical Fornax dwarfs lying at intermediate magnitudes.
Another key property connecting DF2 and DF4 to FCC224 is the lack of dark matter in the galaxy inner regions. While distance affects the calculation of enclosed dynamical mass via the galaxy effective radius, it does not affect the velocity dispersion which for all galaxies is remarkably low at 5–10 km s
$^{-1}$
. Whether the presence of overly-luminous GCs is associated with a DM-free UDG or not is yet to be fully tested and requires further examples. For the two UDGs with well-defined GCLFs mentioned above, NGC5846_UDG1 (see Figure 16) and VCC615, there is no evidence for a population of overly-luminous GCs.
In conclusion, for some UDGs the GCLF turnover magnitude is more similar to dwarf galaxies (i.e. galaxies that match UDGs in stellar mass). However, for others, the GCLF is in closer agreement to more massive galaxies (which might suggest a match in total halo mass). There appear to be strong examples of both dwarf-like and giant-like GCLFs for UDGs. Thus UDGs exhibit a diversity in their GCLF properties, including the presence of some overly-luminous GCs and double-peaked GCLFs.
6.3. What is the radial extent of GC systems?
Another key property of GC systems is their radial extent (usually quantified as the half-number radius) compared to that of the galaxy stars (effective radius), i.e.
$R_\mathrm{GC}/R_{e}$
. This ratio often plays a crucial role in determining the correction for radial incompleteness and hence the total GC system counts. As the fraction of accreted GCs is reduced in lower mass galaxies, the ratio
$R_\mathrm{GC}/R_{e}$
also gets smaller (Forbes Reference Forbes2017, Lim et al. Reference Lim2024). A key question is whether this ratio is different to those of classical dwarf galaxies. In a recent study of a large sample of classical dwarf galaxies, Carlsten et al. (Reference Carlsten, Greene, Beaton and Greco2022) found
$R_\mathrm{GC}/R_{e}$
to vary from 1.06, on average, for Local Volume dwarfs to 1.25 for Virgo cluster dwarfs.
From deep HST imaging of the individual UDGs, VCC 615 and NGC 5846_UDG1, Mihos et al. (Reference Mihos, Durrell, Toloba, Peng, Lim and Cŏté2025) and Danieli et al. (Reference Danieli2022) reported
$R_\mathrm{GC}/R_{e}=1.47^{+0.58}_{-0.36}$
and
$0.8\pm 0.2$
, respectively. For UDGs in the low density environments of the MATLAS survey, Marleau et al. (Reference Marleau2024) found a median value of
$R_\mathrm{GC}/R_{e}\sim1$
from HST/ACS imaging. For UDGs in the Coma cluster, Lim et al. (Reference Lim, Peng, Cŏté, Sales, den Brok, Blakeslee and Guhathakurta2018) using HST/ACS concluded that
$R_\mathrm{GC}/R_{e}$
was consistent with 1.5. A similar conclusion was reached by van Dokkum et al. (Reference van Dokkum2017) for a small sample Coma UDGs. However, using that same data, Saifollahi et al. (Reference Saifollahi, Zaritsky, Trujillo, Peletier, Knapen, Amorisco, Beasley and Donnerstein2022) derived
$R_\mathrm{GC}/R_{e}=1.09\pm0.14$
. Different approaches conducted by individual researchers have led to different results for UDGs in the Perseus cluster as well. In an HST study of the Perseus cluster, Janssens et al. (Reference Janssens2024) found a range of 1–2.5 in
$R_\mathrm{GC}/R_{e}$
, with an average value of
$\sim$
1.2 for their UDGs. Exploiting Euclid’s wide field-of-view, Saifollahi et al. (Reference Saifollahi2025a) found an average for their four stellar mass bins to be
$R_\mathrm{GC}/R_{e}\sim0.72$
. In a study of Fornax cluster UDGs Prole et al. (Reference Prole2019a) found a median of
$R_\mathrm{ GC}/R_{e}\sim1.7$
.
Saifollahi et al. (Reference Saifollahi2025a) were also able to determine relative differences between samples of UDGs and non-UDG galaxies at a constant stellar mass, with good number statistics. For the stellar mass range
$6.5\lt\log(M_{\ast}/M_{\odot})\lt8$
, for four mass bins they found
$R_\mathrm{GC}/R_{e}$
to vary from 0.52 to 1.13 for UDGs and from 0.73 to 1.20 for non-UDGs. The simple average of the four bins was 0.72 and 1.04 for UDGs and non-UDGs, respectively. Thus, in a relative sense, the GC profiles of UDGs appear to be slightly more compact than those of non-UDGs of the same stellar mass. This argues against UDGs simply being puffed-up dwarfs (Saifollahi et al. Reference Saifollahi, Zaritsky, Trujillo, Peletier, Knapen, Amorisco, Beasley and Donnerstein2022). Compared to classical dwarfs, both the galaxy effective radius (by definition) and the GC system half-number radius of UDGs are relatively large.
In summary, classical dwarf galaxies appear to have GC systems that are slightly more extended, on average, than their galaxy stars (i.e.
$R_\mathrm{GC}/R_{e}\ge 1$
). Whereas the GC systems of UDGs (see Table 2) tend to be more compact, with
$R_\mathrm{GC}/R_{e}\le 1$
. For both UDGs and classical dwarfs, a variety of GC profiles exist.
Summary of UDG globular cluster system radial extent.

6.4. What properties do
$S_N$
and M
$_\mathrm{GC}$
/M
$_{\ast}$
correlate with?
Specific frequency (
$S_N$
) and the ratio of GC system mass to galaxy stellar mass (
$M_\mathrm{GC}/M_{\ast}$
) are proxies for the efficiency of both GC formation and destruction. So it is interesting to understand what host galaxy properties they might correlate with.
Pfeffer et al. (Reference Pfeffer2024) collected GC and host galaxy axial ratios for UDGs in the Virgo, Coma and Perseus clusters along with field/group environments from the MATLAS survey. They showed that high
$S_N$
(GC-rich) UDGs tend to be rounder on average, with almost a complete lack of GC-rich UDGs with
$b/a \le 0.6$
. Given this trend was also seen in the MATLAS sample of field/group UDGs (Marleau et al. Reference Marleau2024) it is unlikely to be purely an environmental process. This trend was interpreted by Pfeffer et al. (Reference Pfeffer2024) as a reflection of relatively inefficient GC formation in rotationally-supported UDGs. More measurements of the rotation in UDGs, over a range of GC richness, should allow this prediction to be tested.
Using HST/ACS, Lim et al. (Reference Lim, Peng, Cŏté, Sales, den Brok, Blakeslee and Guhathakurta2018) examined the dependence of
$S_N$
on various properties for Coma UDGs. They found weak trends for
$S_N$
to be higher in the cluster centre and in smaller sized UDGs. There was no clear trend with galaxy surface brightness in their data. Forbes et al. (Reference Forbes, Alabi, Romanowsky, Brodie and Arimoto2020) combined the data of van Dokkum et al. (Reference van Dokkum2017), Lim et al. (Reference Lim, Peng, Cŏté, Sales, den Brok, Blakeslee and Guhathakurta2018) and Amorisco et al. (Reference Amorisco, Monachesi, Agnello and White2018) to create an enlarged sample of 85 UDGs in Coma with GC counts. They found weak trends for higher
$S_N$
in smaller sized, fainter surface brightness and bluer UDGs. We note that strong trends exist between galaxy luminosity and properties such as effective radius and colour, so trends with
$S_N$
(effectively scaling inversely with luminosity) should be treated with caution. In a recent study of UDGs and classical dwarfs in the Perseus cluster, Tang et al. (Reference Tang2025a) did not find any strong GC trends with position in the cluster for their 40 UDGs. However, they did find that lower surface brightness and larger UDGs tend to be more GC-rich.
In Figure 17 we show
$M_\mathrm{GC}/M_{\ast}$
vs surface brightness and effective radius for UDGs and non-UDGs in the Coma and Perseus clusters from the above studies. While there is a strong trend of higher ratios in lower surface brightness (lower stellar density) UDGs, there is only a very weak trend for the highest values in the smallest sized UDGs. Additionally, Marleau et al. (Reference Marleau2024) studied the
$S_N$
trends for UDGs in the MATLAS survey using HST/ACS. While they found no significant trend with colour, surface brightness or axial ratio, they did find higher
$S_N$
in larger-sized UDGs (i.e. opposite to the trends for Coma cluster UDGs mentioned above).
$M_\mathrm{GC}/M_{\ast}$
vs. average g-band surface brightness within the effective radius (
$\lt\mu_g\gt_e$
) and effective radius
$R_{e}$
. The plot shows UDGs (circles) and non-UDGs (triangles) in both Coma (blue) and Perseus (red) clusters. Vertical lines show the UDG definition. Data are from Forbes et al. (Reference Forbes, Alabi, Romanowsky, Brodie and Arimoto2020) for Coma and Tang et al. (Reference Tang2026) for Perseus. The Coma cluster reveals higher ratios than the Perseus cluster galaxies. The trend with
$R_{e}$
is rather weak but a strong trend for higher ratios in lower surface brightness galaxies is evident.

In terms of stellar populations, higher
$M_\mathrm{GC}/M_{\ast}$
ratios are associated with slightly older and more metal-poor UDGs (Forbes et al. Reference Forbes, Buzzo, Ferre-Mateu, Romanowsky, Gannon, Brodie and Collins2025). This is consistent with them forming efficiently at earlier times. However, the sample sizes are currently restricted to around a dozen UDGs with both spectroscopy and GC counts. More data are required to further test the initial findings that GC-rich UDGs have stellar populations resembling those of metal-poor GCs, as might be expected if the stars of such galaxies are dominated by disrupted GCs. Ideally this data should be spectroscopic but the colours of GCs and their host galaxy can be a useful proxy for their stellar populations.
6.5. Do UDGs follow the GC number – Halo mass relation?
It is well-known that normal galaxies follow the relation between GC number and halo mass, with a scatter of
$\sim$
0.3 dex, over many orders of magnitude, e.g. Spitler & Forbes (Reference Spitler and Forbes2009), Harris et al. (Reference Harris, Harris and Hudson2015), Harris et al. (Reference Harris, Blakeslee and Harris2017), Forbes et al. (Reference Forbes, Read, Gieles and Collins2018b), Le & Cooper (Reference Le and Cooper2025) and Zaritsky (Reference Zaritsky2022). We note that a similar near-linear relation exists between the mass of the nuclear star cluster in UDGs and the halo mass (Khim et al. Reference Khim, Zaritsky, Lambert and Donnerstein2024). Such a relation offers an alternative method for estimating the halo mass of UDGs if they host a nuclear star cluster. It also suggests a close relationship between nuclear star clusters and the GC systems of UDGs.
Although simulations exist that can reproduce the observed GC relation for normal galaxies, e.g. Li & Gnedin (Reference Li and Gnedin2014), Boylan-Kolchin (Reference Boylan-Kolchin2017), Burkert & Forbes (Reference Burkert and Forbes2020) and Valenzuela et al. (Reference Valenzuela, Moster, Remus, O’Leary and Burkert2021), they have yet to fully explore the situation for UDGs. Using Illustris TNG100, Carleton et al. (Reference Carleton, Guo, Munshi, Tremmel and Wright2021) examined the relation between GC system mass and halo mass for the UDGs in that simulation suite. They found that they systematically deviate from the observed relation and with larger scatter. We also note the work of Doppel et al. (Reference Doppel, Sales, Benavides, Toloba, Peng, Nelson and Navarro2024) in which they tagged dark matter particles as GCs within the Illustris TNG50 simulation. However, as all of their galaxies, including UDGs, have fewer than 20 GCs and lie within dwarf-like dark matter halos, the regime of GC-rich UDGs was not explored.
Other than the well-studied UDG DF44 with an independent halo mass measurement (van Dokkum et al. Reference van Dokkum2019b), it is unknown whether other UDGs also follow the GC number – halo mass relation. Recently, Forbes & Gannon (Reference Forbes and Gannon2025) revisited this issue using independent halo masses from the literature for an additional three nearby UDGs and two NUDGes (which have UDG-like sizes but slightly brighter than the UDG definition). They have GC counts ranging from a single GC associated with the WLM galaxy to the 74 of DF44. In this work it was found that UDGs do indeed follow the standard GC number – halo mass relation, as reproduced in Figure 18.
Scaling relation between the number of globular clusters and the halo mass of a galaxy reproduced from (Forbes & Gannon Reference Forbes and Gannon2025, their Figure 1). Black symbols show normal galaxies with a range of morphology and environment taken from Burkert & Forbes (Reference Burkert and Forbes2020). The dashed line shows their linear fit of 5
$\times$
10
$^9$
M
$_{\odot}$
in halo mass per GC. Coloured symbols show three UDGs and two NUDGes with independent halo mass estimates. The Milky Way and Sombrero galaxies are shown for comparison. GC uncertainties are smaller than the symbol size. The UDGs follow the relation of normal galaxies.

An interesting corollary of UDGs following the GC number – halo mass relation is that they reveal a systematic deviation in the stellar mass – halo mass relation with increasing GC count. The GC-poor UDGs are still consistent with the latter scaling relation (as might be expected if they are simply puffed-up dwarf galaxies) however the GC-rich ones are not. As found by Forbes & Gannon (Reference Forbes and Gannon2024), the GC-rich UDGs have more massive halos and/or fewer stars than expected if they were to follow the standard stellar mass – halo mass relation. Such galaxies that have failed to form as many stars as expected for their halo mass have been dubbed failed galaxies (as described Section 3.3.4).
6.6. When did the first GCs form?
Age measurements of GCs in UDGs are few and far between. The one GC associated with the WLM dwarf is very old, with an age comparable to the age of the Universe (Hodge et al. Reference Hodge, Dolphin, Smith and Mateo1999). From a high quality spectrum, Larsen et al. (Reference Larsen, Brodie, Forbes and Strader2014) found it to be consistent with an age of
$\sim$
13 Gyr and very metal-poor ([Fe/H]
$=-1.97$
dex), which matches the age – metallicity relation of the WLM galaxy stars (Leaman et al. Reference Leaman2013). The star formation history of WLM reveals a slow enrichment over cosmic time, resulting in a mean metallicity for the galaxy some 0.7 dex higher than for its GC. In the case of UDG1 in the NGC 5846 group, several GCs have reported ages in Müller et al. (Reference Müller2020b), with a mean GC age of 11.2
$^{+1.8}_{-0.8}$
Gyr (which was similar to that of the galaxy field stars within the estimated uncertainties).
Future work should include efforts to obtain ages and metallicities of GCs hosted by UDGs. The first epoch of GC formation appears to be relatively uniform irrespective of the host galaxy (e.g. see review by Forbes et al. Reference Forbes2018a). However, determining the absolute ages, and hence when the first GCs formed, has been problematic even for Milky Way GCs. Although GCs were known to be very old, it was not entirely clear if they formed during, or after, re-ionisation (
$z\sim6$
). With the advent of HST (e.g. Vanzella et al. Reference Vanzella2017), and more recently JWST observations of lensed high redshift galaxies, a population of bright, compact star clusters was revealed. Their structural properties, ages, and metallicities all suggest that they are the high redshift counterparts of today’s GCs, e.g. Forbes & Romanowsky (Reference Forbes and Romanowsky2023). Furthermore, the stars in some high redshift galaxies have revealed abundance patterns that resemble local GCs (Naidu et al. Reference Naidu2025).
A summary of these high redshift observations is given in Adamo et al. (Reference Adamo2024a) and Pfeffer et al. (Reference Pfeffer2025). Notable is the Sparkler at
$z=1.378$
(Mowla et al. Reference Mowla2022), and two systems at redshifts
$z\gt6$
, i.e. Firefly at
$z=8.304$
(Mowla et al. Reference Mowla2024) and Cosmic Gems at
$z=9.625$
(Messa et al. Reference Messa2025). The mean age derived for the oldest star clusters are
$4.20\pm 0.26$
Gyr for the 5 GCs in the Sparkler,
$102.4\pm 7.9$
Myr for the 10 Firefly clusters, and
$13\pm 2.4$
Myr for the 5 Cosmic Gems clusters (we note that this is a rapidly moving field with new analysis and new data that may revise these values, e.g. Tomasetti et al. Reference Tomasetti, Moresco, Lardo, Courbin, Jimenez, Verde, Millon and Cimatti2025). Using Ned Wright’s Cosmology Calculator (https://www.astro.ucla.edu/wright/CosmoCalc.html) the look-back time corresponding to the redshift of each galaxy for the ‘General’ Universe is an age of 13.721 Gyr, i.e. 9.086 Gyr for the Sparkler, 13.106 Gyr for the Firefly, and 13.217 Gyr for Cosmic Gems. The GC formation epoch is therefore
$13.286\pm 0.260$
,
$13.208\pm 0.008$
and
$13.230\pm 0.002$
Gyr for the Sparkler, Firefly, and Cosmic Gems respectively.
Improvements in fitting and modelling the CMDs of Milky Way GCs have brought uncertainties down from 1–2 Gyr to a fraction of that for sample of GCs. One example is the recent analysis by Valcin et al. (Reference Valcin, Jimenez, Seljak and Verde2025) of 68 Milky Way GCs with HST photometry to derive a mean age for the oldest, metal-poor GCs of 13.39 Gyr. The error budget is dominated by systematics of
$\pm$
0.23 Gyr, while the statistical uncertainty is only
$\pm$
0.10 Gyr. A summary of the ages of the oldest GCs is given in Figure 19. As such it shows that the oldest GCs formed some 400 Myr after the Big Bang but almost a Gyr before the end of re-ionisation at redshift
$z\sim6$
. There is no current evidence to suggest that the oldest GCs associated with UDGs are any different.
The first epoch of globular cluster formation. The red line shows the age of the Universe (13.72 Gyr), and the blue line shows the age corresponding to
$z = 6$
(near the end of re-ionisation). Symbols show the oldest GCs in 3 lensed systems and for a compilation of Milky Way GCs (with their systematic uncertainty). Globular clusters first formed
$\sim$
400 Myr after the Big Bang and before the end of re-ionisation.

The formation epoch of the first GCs has particular relevance for the ‘failed galaxy’ scenario. If the GCs in UDGs form early as described in Figure 19, make a significant contribution to the galaxy’s halo (Peng & Lim Reference Peng and Lim2016) via disruption, and the UDG evolves passively there after, then we would expect the field stars of a failed galaxy UDG to reveal GC-like stellar populations, i.e low metallicity and very old ages.
6.7. How efficiently are GCs formed and destroyed?
As mentioned, the number (or mass) of GCs per galaxy luminosity (or stellar mass) today is a combined measure of GC formation and destruction efficiency. The efficiency of producing gravitationally bound star clusters has been tackled in simulations (Kruijssen Reference Kruijssen2012), including high redshift galaxies (Kruijssen Reference Kruijssen2015). Assuming that star formation is an extension of that observed locally, but to higher gas fractions and pressures, then more efficient GC formation can be expected. This can be quantified as the fraction of the total luminosity of each galaxy contributed by the star clusters. As summarised by Pfeffer et al. (Reference Pfeffer2025), this ratio can reach
$\sim$
75% for some high redshift galaxies as observed by JWST. In other words, up to three quarters of the luminosity produced in these high redshift galaxies comes from their system of star clusters.
This luminosity ratio is related to the initial
$M_\mathrm{GC}/M_{\ast}$
ratio via rather uncertain mass-to-light ratios, but nevertheless one can expect mass ratios to be of a similar order. JWST has thus confirmed that GCs can form very efficiently at early epochs (a few Myr after the Big Bang). Further evidence of star cluster dominated star formation at early times comes from JWST observations of super-solar N/O abundances, which are similar to those measured in Milky Way GCs (Ji et al. Reference Ji, Belokurov, Maiolino, Monty, Isobe, Kravtsov and McClymont2025). Such vigorous star cluster formation would be expected to have an influence on the galaxy’s evolution in the form of feedback of energy, and perhaps expulsion of the gas, which may now quench subsequent star, and cluster, formation. This quenching of star formation by star clusters is one possible mechanism to explain failed galaxy UDGs (Danieli et al. Reference Danieli2022), since the galaxy ‘fails’ to form additional stars.
Over cosmic time a large fraction of the initial GC population is expected to be removed by either internal (relaxation or mass loss) or external (tidal shock) processes. As well as reducing the number of GCs, these destruction processes appear to operate in a similar manner preferentially destroying low mass clusters at early times, resulting in a final GCLF with the near-universal log-normal shape and turnover magnitude (Forbes et al. Reference Forbes2018a). However, detailed modelling of the GCLF over time and the contribution of disrupted GCs to the host galaxy’s stars remains to be investigated. As the GCs are disrupted, their stars contribute to the main body of the galaxy. This disruption of GCs over cosmic time has been modelled in classical dwarf galaxies by Moreno-Hilario et al. (Reference Moreno-Hilario, Martinez-Medina, Li and Souza2024). They found lower (or inefficient) GC disruption rates in lower mass, and lower density, galaxies. Thus, for a given initial GC formation efficiency, we might expect higher
$M_\mathrm{GC}/M_{\ast}$
ratios in lower mass dwarf galaxies and in lower surface brightness galaxies such as UDGs (see also Forbes et al. Reference Forbes, Buzzo, Ferre-Mateu, Romanowsky, Gannon, Brodie and Collins2025).
In summary, the elevated GC counts seen in UDGs may be a combination of efficient formation at early times and relatively inefficient destruction. However, detailed modelling of the GCLF over time and the contribution of disrupted GCs to the host galaxy’s stars is still required.
6.8. Do UDGs have the same stellar populations as their GC systems?
The concept of UDGs being ‘pure stellar halos’ of old metal-poor stars was originally proposed by Peng & Lim (Reference Peng and Lim2016). This may arise since GCs are disrupted over cosmic time, with their stars contributing to the field stars of the host galaxy. If few, if any, in-situ field stars formed, then the stellar mass of a galaxy could be dominated by GC-like stellar populations. Some support for this idea comes from the spectroscopic observations of NGC5846_UDG1 by Müller et al. (Reference Müller2020b). Using MUSE they found that the host galaxy stars and a dozen GCs shared the same mean old age and low metallicity. A similar situation was reported by Buzzo et al. (Reference Buzzo2025b) for the stars and half dozen GCs of FCC224. In the case of NGC1052-DF2 however, Fensch et al. (Reference Fensch2019) found the galaxy stars to have a higher metallicity than the co-added spectra of GCs. Spectroscopy of cluster UDGs found some to have GC-like stellar populations, although the majority tend to be slightly younger and more metal-rich, on average, than typical GCs (Ferré-Mateu et al. Reference Ferré-Mateu, Gannon, Forbes, Buzzo, Romanowsky and Brodie2023).
Deriving the stellar populations of individual GCs is time consuming and therefore samples are extremely limited. An alternative approach is to directly compare the relative colours of UDGs and their GCs. For old ages, colour is largely an indicator of metallicity but its sensitivity depends on the filter combination. In an HST study of six UDGs, Saifollahi et al. (Reference Saifollahi, Zaritsky, Trujillo, Peletier, Knapen, Amorisco, Beasley and Donnerstein2022) found GCs and their host UDG to have the same colour on average (some being bluer, some redder). For a sample of a dozen UDGs in the Perseus cluster with HST imaging, Janssens et al. (Reference Janssens2024) found a mean colour offset of
$0.07\pm 0.08$
, i.e. the host galaxy was redder than the GCs on average but there was no significant difference when the uncertainty was taken into account. A redder galaxy colour would suggest it is more metal-rich (e.g. enriched by additional star formation) than its metal-poor GCs. There was no obvious trend between the colour difference and the total number of GCs in each UDG. van Dokkum et al. (Reference van Dokkum2020) studied the colours of the GCs in both DF2 and DF4 in the NGC 1052 group using HST. They found the luminous GCs to have monochromatic colours indicating very similar ages and metallicities for the GCs in both galaxies. The host galaxies had slightly redder colours consistent with the higher metallicity inferred from spectroscopy (Fensch et al. Reference Fensch2019). For the field UDG DGSAT-I, Janssens et al. (Reference Janssens2022) found identical colours for the GCs and galaxy stars.
In summary, the field stars of some UDGs have the same stellar populations to their GCs (i.e. very old and metal-poor). However, in other cases the galaxy stars have an elevated metallicity (and perhaps slightly younger ages) suggesting some in-situ star formation in addition to disrupted GC stars. Future work should include more studies to measure the relative colour difference of field stars and the GC system for individual UDGs. This difference can then be correlated with UDG properties, e.g. the ratio of mass in GCs today to the stellar mass of the UDG. Failed galaxy UDGs might be expected to reveal GC-like stellar populations, whereas UDGs that are puffed-up dwarf galaxies may reveal more enriched and younger stars.
6.9. Does cluster infall affect GC richness?
Early infall of galaxies into a cluster is expected to quench subsequent star formation. Thus the earlier the infall, the higher is the expected fraction of bound mass in old GCs relative to the galaxy’s field stars (Mistani et al. Reference Mistani2016). Early infall, combined with early and efficient GC formation, in the TNG100 simulation of Carleton et al. (Reference Carleton, Guo, Munshi, Tremmel and Wright2021) are the main reasons that model UDGs have twice as many GCs as non-UDGs. There is indeed some observational evidence for this effect in classical dwarf galaxies (Peng et al. Reference Peng2008), however for UDGs, the evidence is less convincing. Lim et al. (Reference Lim, Peng, Cŏté, Sales, den Brok, Blakeslee and Guhathakurta2018) examined how specific frequency of GCs (
$S_N$
) varied with projected cluster-centric radius for Coma UDGs. For one luminosity range they found a weak trend of higher
$S_N$
at small projected radii but no clear trend for their other two luminosity ranges.
In a heterogeneous study of UDGs in the Hydra I, Perseus, Coma and Virgo clusters, Forbes et al. (Reference Forbes2023) studied their location in the infall diagram (Rhee et al. Reference Rhee, Smith, Choi, Yi, Jaffé and Candlish2017) as a function of GC richness with a division at
$N_\mathrm{GC}\ge20$
. The expected trend was not present. A similar lack of a trend was found by Tang et al. (Reference Tang2026) in their study of 40 UDGs in the Perseus cluster. Using
$M_\mathrm{GC}/M_{\ast}$
as a measure of GC richness, they found a weak tendency for GC-rich UDGs to be found in later infall regions, contrary to the expectation that they occupy early infall regions. Furthermore, the quenching timescale of GC-rich UDGs does not seem well aligned with their infall time (Ferré-Mateu et al. Reference Ferré-Mateu, Gannon, Forbes, Buzzo, Romanowsky and Brodie2023; Doll et al. Reference Doll2025) These results might suggest some other process is responsible for GC richness (e.g. quenching prior to cluster infall).
7. Are UDGs different from classical dwarf galaxies?
At this point, it is reasonable to ask whether UDGs are simply classical dwarf galaxies with lower surface brightness and larger sizes on average, or if they represent something fundamentally different. In many respects, UDGs do resemble classical dwarfs–for instance, their global stellar population properties are comparable (Section 5). Yet, important differences emerge. One striking example is the on-sky distribution of their elongation: while dwarfs show a fairly uniform b/a distribution, UDGs are biased toward larger b/a ratios. If UDGs are inherently disky in morphology, this may be due to a bias towards face-on orientations, a bias that stems from their definition as low-surface-brightness systems (see further Pfeffer et al. Reference Pfeffer2024). Alternatively, UDGs may not be inherently disky in morphology. Moreover, although their average metallicities are similar to those of classical dwarfs, some UDGs deviate significantly from the canonical MZR. Their metallicity gradients also tend to be flat or even rising, in contrast to the falling gradients usually observed in dwarfs. Furthermore, some UDGs host far more GCs–both in absolute terms and relative to stellar mass than typical dwarfs. This overabundance implies overly massive halos, inconsistent with a simple dwarf-galaxy nature. In this regard, at least a fraction of UDGs clearly diverge from the classical dwarf population.
As surveys have expanded and follow-up spectroscopic and SED-fitting studies followed suite, the diversity among UDGs became increasingly evident in their structure, stellar populations, and GC properties. With ongoing and upcoming all-sky surveys such as Euclid, and LSST, the number of detected UDGs is expected to grow into the tens of thousands. Indeed, Euclid has already transformed our view of low-mass galaxies: early results indicate that about 8% of dwarfs are new UDG candidates, corresponding to roughly 200 new systems (see e.g. Marleau et al. Reference Marleau2025a,b; Saifollahi et al. Reference Saifollahi2025a). This rapid increase makes it essential to develop methods for distinguishing between different UDG types.
One promising approach is the application of machine learning techniques, as demonstrated by Buzzo et al. (Reference Buzzo2025a). Using a sample of 88 UDGs in both field and cluster environments, they applied the KMeans clustering algorithm to classify UDGs according to their structural and stellar population properties. Figure 20 reproduces their Figure 5, presenting these properties in a polar plot. This figure reveals two clearly distinct classes of UDGs. Class A UDGs are the least massive, with lower surface brightness, elongated shapes, more compact sizes, and very poor GC systems. They are typically bluer, with younger stellar populations, lower stellar mass-to-light ratios, more extended star formation histories, and metallicities that follow the classical dwarf MZR. These features suggest that Class A corresponds to the ‘puffy dwarf’ formation channel. In contrast, Class B UDGs are more massive and of higher surface brightness, with rounder shapes, larger sizes, and rich GC systems, yielding high GC-to-stellar mass ratios. They are generally redder, with older stellar populations, higher mass-to-light ratios, early-quenched star formation histories, and metallicities that fall below the classical dwarf MZR. Such galaxies have been proposed to represent the ‘failed galaxy’ formation pathway.
Figure reproduced from Figure 5 of Buzzo et al. (Reference Buzzo2025a), which shows the result of performing the clustering algorithm of KMeans using the stellar populations and internal properties of UDGs. The figure shows how two distinctive types of UDGs are found. Given the overall properties of each group, group A seems to be more compatible with the puffed dwarf formation channel, while the class B would be more representative of ‘failed galaxies’ one.

A small caveat to this result is that the MATLAS survey has since revised its surface brightnesses (Marleau et al. Reference Marleau2021; Marleau, et al. Erratum Submitted; Poulain et al. Reference Poulain2025b). Buzzo et al. (Reference Buzzo2025a)’s classification of galaxies followed that of the MATLAS survey as published in Marleau et al. (Reference Marleau2021), and so those galaxies considered to be a ‘UDG’ by MATLAS, and hence by Buzzo et al. (Reference Buzzo2025a), were based on inaccurate surface brightnesses. (It does not affect the KMeans analysis that used the Buzzo et al. (Reference Buzzo2025a) surface brightness measurements, not those from the MATLAS survey; it merely affects the classification of a galaxy as a UDG or not). In this revision, all UDGs previously called so, remain so, however 14 new galaxies now fit the UDG regime.
We note that this distinction holds when including NUDGe objects, although it starts to dilute the trends due to a wider spread in the age and metallicities. Nonetheless, as discussed to this point and shown by Figure 20, the most influential features in separating between the two classes seem to be the deviation from the classical dwarf MZR, the number of GCs, and the galaxy shape (given by the axis ratio b/a). It will be interesting to keep updating this classification as time evolves and more UDGs are observed and followed up, for instance, adding the [Mg/Fe] ratio in the classification, which is currently lacking in many of the studies.
8. Some future directions for UDG research
8.1. UDGs and their relationship to environment
Janssens et al. (Reference Janssens, Abraham, Brodie, Forbes, Romanowsky and van Dokkum2017) discovered that UDGs and UCDs had an inverse spatial relationship in the A2744 cluster, suggesting that the disruption of nucleated UDGs left UCD remnants and contributed field stars to the intra-cluster light (ICL). Support for this process came from Wang et al. (Reference Wang2023) who followed the evolution of nucleated UDGs in the Virgo cluster. Thus evidence is accumulating that disrupted UDGs could be a contributor to the ICL (and the UCD population). From measurements of individual stars in the ICL of the Virgo cluster, Williams et al. (Reference Williams2007) found the ICL to have a range of metallicities but it was dominated by stars of low metallicity ([Z/H]
$\lt-1$
dex) and old age, similar to the stellar populations of cluster UDGs, e.g. Ferré-Mateu et al. Reference Ferré-Mateu, Gannon, Forbes, Buzzo, Romanowsky and Brodie2023). Further study of the relationship between UDGs, UCDs and the ICL is thus warranted to best understand the interplay between these different objects.
Furthermore, currently the studies of UDG number with environmental mass (e.g., Figure 3) combine different studies that use a mix of definitions as to what a UDG is. These studies also make different corrections to account for the incompleteness of imaging to the cluster’s virial radius. Clearly, this is inappropriate for accurately determining the slope of the relationship between UDG number and environmental mass. A consistent approach to studying this relationship will aid efforts to understand the rate at which UDGs are created/destroyed by tidal forces. Studying the evolution of the number of UDGs in clusters with redshift will also help to understand how tidal forces may form/destroy UDGs, as well as their accretion rates into clusters. Current studies of UDGs in clusters beyond
$\sim$
100 Mpc are limited to Janssens et al. (Reference Janssens, Abraham, Brodie, Forbes, Romanowsky and van Dokkum2017), Janssens et al. (Reference Janssens, Abraham, Brodie, Forbes and Romanowsky2019), Carleton et al. (Reference Carleton2023), Ikeda et al. (Reference Ikeda2023). With a plethora of deep HST and JWST exposures of clusters at a wide range of redshifts, along with new images expected from Euclid, it should be possible to expand these efforts.
8.2. UDG’s resolved internal properties
Recently, measurements of stellar population radial gradients in UDGs have become available in the literature (see Section 5). Such gradients tend to be relatively flat in both age and metallicity. This contrasts with the negative metallicity gradients usually seen in classical dwarfs and predicted for UDGs by simulations. A constant old age and low metallicity might however, be expected if disrupted GCs dominate the stellar populations of the host galaxy. More work is needed to expand the samples of GC-poor vs. GC-rich comparisons, and to search for other secondary parameters that might affect stellar population gradients in UDGs, such as rotation, size, and environment. Gradients of the
$\alpha$
elements are largely unconstrained observationally at this time but hold important clues regarding timescales of star formation.
Just as data regarding stellar population gradients in UDGs is sparse, so too are radial kinematic profiles. Until recently, DF44 and NGC 1052-DF2 are the only 2 UDGs with radial velocity dispersion profiles measured (van Dokkum et al. Reference van Dokkum2019b; Emsellem et al. Reference Emsellem2019). Rotation has now been detected in several UDGs (e.g., Chilingarian et al. Reference Chilingarian, Afanasiev, Grishin, Fabricant and Moran2019; Buttitta et al. Reference Buttitta2025; Levitskiy et al. Reference Levitskiy, Forbes, Gannon, Ferré-Mateu, Romanowsky, Brodie, Couch and Haacke2025), which intriguingly is frequently found to be prolate. Samples are currently small, and the radial extent probed is also limited; it is likely that peak rotation has not yet been reached. This will also allow the effects of galaxy inclination to be explored and, hence, determine how many of the UDGs with no detected rotation are truly non-rotating systems.
Furthermore, resolved kinematic profiles can be used to infer resolved mass profiles for UDGs and can be used to fit dark matter halos. In this regard, UDGs may be perfect for constraining alternative theories of dark matter. UDGs are sufficiently extended to measure the dark matter profile out to larger radii than classical dwarfs. They are also frequently found to be extremely dark matter dominated; as such, much of their kinematics is dominated by interactions within their dark, not baryonic, matter. Thus, any interactions such as the soliton expected in fuzzy dark matter (e.g., Wasserman et al. Reference Wasserman2019) or dark matter composed of ultra-light axions (e.g., Montes et al. Reference Montes2024), may be constrained by studying UDGs. Likewise, UDGs may provide constraints on non-cold dark matter, alternative gravity theories (e.g., Müller et al. Reference Müller, Famaey and Zhao2019). More work is needed in this realm.
8.3. Advancing the formation theories of GC-rich UDGs
The GC systems of UDGs have been the subject of much study, yet key questions remain. Understanding why they host up to several times more GCs than their classical dwarf galaxy counterparts may shed crucial insight into how UDGs form and evolve. Most current simulations predict standard dwarf-like GC systems, i.e. they do not predict the GC-rich UDGs nor overly massive halos (a consequence of the GC number–halo mass relation). Thus new simulations are needed that produce overly massive halo (or equivalently stellar-suppressed) UDGs. One possible approach is the ‘genetic’ modification of initial conditions as carried out by Rey et al. (Reference Rey, Pontzen, Agertz, Orkney, Read, Saintonge and Pedersen2019) for very low mass galaxies. Here quenching and late assembly suppresses star formation resulting in overly massive halo galaxies relative to the standard stellar mass–halo mass relation. Could this ‘failed galaxy’ process be responsible for the GC-rich UDGs? And, if so, what other properties are predicted for UDGs formed in this way?
The role of cluster infall on GC systems also needs to be explored further. Although GC system trends within a cluster appear weak, there are hints of cluster-to-cluster variations. Namely, GC-rich UDGs appear to be more frequent in the largest clusters like Coma, than the smallest, such as Fornax. Will these results be verified with systematic wide-field studies of GC systems out to the virial radius in several clusters?
We also see a diversity in GC system properties from their radial extent to their luminosity functions. What drives this diversity? And what is the origin of the overly-luminous GCs seen in some UDGs?
8.4. Going darker: Identifying even more extreme galaxies
A promising new approach to identify even lower surface brightness galaxies is to use their (relatively high surface brightness) globular clusters as ‘lamp posts’. This approach was demonstrated by Li et al. (Reference Li2022) who used a spatial statistical technique to identify an overdensity of GCs with no associated host galaxy light in the HST imaging of the Perseus cluster. Subsequent imaging from Euclid revealed some diffuse emission at the location of the GCs, and hence a very faint host galaxy. They estimated the ratio of the light in the GC system to that of the galaxy to be a remarkable 16–33%. For galaxies located closer than the Perseus cluster, at
$\sim$
75 Mpc, it may be possible to obtain radial velocities for the GCs. This would confirm their physical status and may allow for a system velocity dispersion to be measured (leading to an enclosed dynamical mass).
While several telescopes offer wide areas that facilitate the search for additional UDGs (e.g. VST, Euclid, Rubin), an alternative method is the detailed follow-up from a blind HI survey. Radio telescopes such as Arecibo, Parkes, ASKAP, and recently FAST, have conducted HI surveys which have, initially at least, detected a number of extragalactic HI sources that lack an obvious optical counterpart. Primarily, follow-up deep imaging is likely to find low surface brightness galaxies that are star forming at some low level, thus being different to those studied in this review. However, an interesting, outstanding question for modern galaxy formation is when such gas reservoirs fail to form a galaxy, or form a galaxy that is largely quiescent for the age of the Universe and thus is ‘dark’. Recently, Monaci et al. (submitted) used the public FAST catalogue to generate a list of
$\sim$
60 nearby dark galaxy candidates. Deeper optical imaging may reveal a very low surface brightness UDG or a place tight upper limit on the stellar mass of the galaxy. Dark galaxies with no stars but significant HI gas are expected in large numbers in the
$\Lambda$
CDM framework.
8.5. Other wavelengths and new facilities
The study of UDGs with other techniques and/or wavelengths beyond the optical/IR and radio has been limited to date. Even if stacking of many UDGs may only provide an upper limit, e.g. weak lensing (Sifón et al. Reference Sifón, van der Burg, Hoekstra, Muzzin and Herbonnet2018) or X-rays (Kovács, Bogdán, & Canning 2019), further efforts are encouraged. If UDGs reside in massive dark matter halos, they may be expected to exhibit an excess of ionised gas emitting photos in the X-ray/far UV in comparison to smaller halos. Likewise, if UDGs’ field stars are significantly composed of stars from GCs, there may be an expectation of an excess of stars in X-ray binaries in comparison to normal dwarfs. Both scenarios require deep high energy exposures to test.
It is worth noting that there are a large number of deep-wide surveys that are beginning, or are soon to begin, surveying the sky (e.g., Euclid and LSST). These surveys should present an order of magnitude more UDGs across large swathes of the sky, and build a far more complete picture of the UDG population.
Sadly, however, the future prospects are more challenging for expanding UDG spectroscopic efforts. Thirty-meter class telescopes will be needed to build a well-sampled, statistical sample of UDG spectroscopic properties to truly understand their formation. Until these telescopes are completed, innovative imaging methods on the available deep/wide data (e.g., the SED fitting) and machine learning classification approaches like KMeans are the only way to build and study statistical UDG samples. Spectroscopy will likely need to be limited to detailed studies of the most interesting cases to understand the extreme, rather than the mean, of UDGs.
9. Conclusions
In this Dawes Review, we have summarised the last decade of research into ultra-diffuse galaxies (UDGs). We have placed particular emphasis on those UDGs that are quiescent, as the initial discovery of UDGs in the Coma cluster was focused on UDGs such as these. We have summarised the general properties of UDGs (Section 2), the proposed formation scenarios for UDGs (Section 3), UDG internal kinematic properties (Section 4), UDG internal stellar population properties (Section 5) and the globular cluster (GC) systems of UDGs (Section 6). A brief discussion as to the main differences between UDGs and classical dwarf galaxies as well as the possible future of the field, was included in Sections 7 and 8. Below we list the main items of note for each section of our review:
In terms of the general properties of UDGs we highlight:
-
• The original UDG definition is based on somewhat arbitrary criteria and selects galaxies that are large and faint. Compared to the handful of galaxies found prior to 2015, a surprisingly large number of galaxies have been found to fit these criteria. The selection is biased towards galaxies that are redder and rounder (perhaps viewed more face-on). Less-strict definitions for UDGs that have been used may create a bias to select galaxies forming via more varied formation pathways.
-
• UDGs are found in all environments. Given the relationship between UDG numbers and environmental mass is approximately linear, the environment likely does not play a strong net role in the creation/destruction of UDGs. In particular, UDGs’ frequent lack of tidal features, even once stacked, provides little evidence for recent tidal interactions.
-
• UDGs currently comprise
$\sim$
5–8% of all galaxies. Their inclusion is necessary to accurately reproduce galaxy mass/luminosity functions.
In terms of the formation scenarios for UDGs we highlight:
-
• We have identified at least 11 scenarios that have been proposed to form a UDG and provided a visual summary of each (Figure 4). From the discussion of internal properties in Sections 4, 5 and 6, it is obvious that no single scenario can explain the entire population, with many UDGs clearly forming from distinct scenarios to one another. Further, while many of these scenarios are consistent with UDGs being simply the large size end of the normal dwarf galaxy distribution (i.e. puffy dwarfs) others require different dark matter halo characteristics. These latter galaxies may be thought of as dark matter halos that have failed to form their expected stellar mass (i.e. failed galaxies) and may host large numbers of GCs.
In terms of the internal kinematic properties of UDGs we highlight:
-
• Current UDG dynamical masses provide preliminary evidence that UDGs do not follow stellar mass – halo mass relationships established prior to their discovery. They indicate that UDGs likely follow the GC number – halo mass relationship (see also below and Figure 18). A corollary of this is that many UDGs likely reside in cored dark matter halos.
-
• When placed on the fundamental plane of elliptical galaxies UDGs reside in their own region of parameter space. This is primarily driven by their large half-light radii.
In terms of the internal stellar population properties of UDGs we highlight:
-
• UDGs span a wide mean age range (see Table 1), but with metallicities generally below solar ([Z/H]
$\sim-1.1$
dex). When placed in the canonical mass – metallicity (MZR) relation, the bulk of UDGs scatter around the local dwarf galaxy MZR (‘puffy dwarfs’), while a minority of very metal–poor UDGs follow instead the
$z\sim2$
MZR (‘failed galaxies’). Rare examples with the most elevated metallicities are compatible with being tidally stripped UDGs. -
• Overall, UDGs show above solar
$\alpha$
-enhancements ([Mg/Fe]
$\sim0.4$
dex). However, some extreme outliers show [Mg/Fe]
$\gt1$
dex – unprecedented in any galaxy type, raising questions about enrichment/depletion mechanisms. -
• Local environment seems to play a role on how UDGs build up their stellar mass: UDG in higher density environments are typically older, show short and early quenched SFHs, and have the most elevated [Mg/Fe]. On the contrary, UDGs in lower density environments tend to have extended SFHs, later quenching, lower metallicities and [Mg/Fe].
-
• In the few spatially resolved cases, (GC-rich) UDGs in clusters have relatively flat age profiles, but surprisingly flat-to-rising metallicity gradients. This is unlike classical dwarfs and simulated UDGs. It implies a rather co-eval, inside-out growth and assembly pathways distinct from classical dwarfs.
In terms of the GC systems of UDGs we highlight:
-
• UDGs have been shown in many environments (e.g., the Perseus/Coma clusters) to have, on average, more GCs than dwarf galaxies of a similar stellar mass. Alternatively, this presents as UDGs having a higher
$M_\mathrm{GC}/M_\star$
ratio than other galaxies. -
• UDGs have been shown to challenge the conventional wisdom of a ‘universal’ GCLF. While some UDGs indeed follow the dwarf-like GCLF, other UDGs have been shown to have a more giant-galaxy-like GCLF. Furthermore, some UDGs have been shown to have over-luminous GC systems, and/or have a GCLF that is best fit by a dual-peaked Gaussian.
-
• UDGs appear to have GC systems of slightly smaller relative radial extent (i.e.
$R_\mathrm{GC}/R_{e}$
) than dwarf galaxies of a similar stellar mass. While this trend appears relevant to the UDG population as a whole, individual UDGs may deviate with many shown to have radially extended GC systems. -
• Current evidence for the handful of UDGs with independent halo mass measurements favours their following the GC number – halo mass relationship. A corollary of this finding is that many GC-rich UDGs cannot follow the stellar mass – halo mass relationship due to their following the GC number – halo mass relationship.
We conclude our review by thanking all those researchers whose hard work, dedication and creativity have led to the past decade of progress in understanding ultra-diffuse galaxies. We eagerly anticipate a second decade of the mystery, discovery and camaraderie that is provided by the study of UDGs.
Acknowledgements
We thank the dual anonymous referees for their comments, which helped us to best frame this exciting field of research. We thank the AGATE team members A. Romanowsky, J. Brodie, Y. Tang, A. Levitskiy, M. Monaci, D. Vaz and H. Christie for their support and insight in the creation of this review. We thank Y. Tang for providing his data to include in our Figures. We thank the ARC for financial support via DP220101863 and DP250101673. A.F.M acknowledges support from RYC2021-031099-I and projects PID2021-123313NA-I00 and PID2024-162088NB-I00 of MICIN/AEI/10.13039/501100011033/FEDER,UE, NextGenerationEU/PRT.
Data availability statement
This article is a review of work that is publicly available. In particular, this work makes use of a publicly available catalogue of UDG spectroscopic properties available here. The full references for the catalogue are McConnachie (Reference McConnachie2012), van Dokkum et al. (Reference van Dokkum, Abraham, Merritt, Zhang, Geha and Conroy2015a), Beasley et al. (2016), Martin et al. (Reference Martin2016), Yagi et al. (Reference Yagi, Koda, Komiyama and Yamanoi2016), Mart nez-Delgado et al. (Reference Delgado, Neyer and Deich2016), van Dokkum et al. (Reference van Dokkum2016, Reference van Dokkum2017), Karachentsev et al. (Reference Karachentsev, Makarova, Sharina and Karachentseva2017), van Dokkum et al. (Reference van Dokkum2018), Toloba et al. (Reference Toloba2018), Gu et al. (2018), Lim et al. (Reference Lim, Peng, Cŏté, Sales, den Brok, Blakeslee and Guhathakurta2018), Ruiz-Lara et al. (Reference Ruiz-Lara2018), Alabi et al. (Reference Alabi2018), Ferré-Mateu et al. (Reference Ferré-Mateu2018), Forbes et al. (Reference Forbes, Read, Gieles and Collins2018b), Mart n-Navarro et al. (Reference Martín-Navarro2019), Chilingarian et al. (Reference Chilingarian, Afanasiev, Grishin, Fabricant and Moran2019), Fensch et al. (Reference Fensch2019), Danieli et al. (Reference Danieli, van Dokkum, Conroy, Abraham and Romanowsky2019), van Dokkum et al. (Reference van Dokkum2019b), Torrealba et al. (Reference Torrealba2019), Iodice et al. (Reference Iodice2020a), Collins et al. (Reference Collins, Tollerud, Rich, Ibata, Martin, Chapman, Gilbert and Preston2020), Müller et al. (Reference Müller2020a), Gannon et al. (Reference Gannon, Forbes, Romanowsky, Ferré-Mateu, Couch and Brodie2020), Lim et al. (Reference Lim2020), Müller et al. (Reference Müller2021), Forbes et al. (Reference Forbes, Gannon, Romanowsky, Alabi, Brodie, Couch and Ferré-Mateu2021), Shen et al. (Reference Shen2021b), Ji et al. (Reference Ji2021), Huang & Koposov (Reference Huang and Koposov2021), Gannon et al. (Reference Gannon2021), Gannon et al. (Reference Gannon2022), Mihos et al. (Reference Mihos2022), Danieli et al. (Reference Danieli2022), Villaume et al. (2022), Webb et al. (Reference Webb2022), Saifollahi et al. (Reference Saifollahi, Zaritsky, Trujillo, Peletier, Knapen, Amorisco, Beasley and Donnerstein2022), Janssens et al. (Reference Janssens2022), Gannon et al. (Reference Gannon, Forbes, Brodie, Romanowsky, Couch and Ferré-Mateu2023), Ferré-Mateu et al. (Reference Ferré-Mateu, Gannon, Forbes, Buzzo, Romanowsky and Brodie2023), Toloba et al. (Reference Toloba2023), Iodice et al. (Reference Iodice2023), Shen et al. (Reference Shen, van Dokkum and Danieli2023), Janssens et al. (Reference Janssens2024), Gannon et al. (Reference Gannon2024a), Buttitta et al. (Reference Buttitta2025), Ferré-Mateu et al. (Reference Ferré-Mateu, Gannon, Forbes, Romanowsky, Buzzo and Brodie2025), Levitskiy et al. (Reference Levitskiy, Forbes, Gannon, Ferré-Mateu, Romanowsky, Brodie, Couch and Haacke2025), Buzzo et al. (Reference Buzzo2025b).





























































