Hostname: page-component-77f85d65b8-5ngxj Total loading time: 0 Render date: 2026-03-30T03:45:00.465Z Has data issue: false hasContentIssue false

Semi-discrete unbalanced optimal transport and quantization

Published online by Cambridge University Press:  17 October 2025

David P. Bourne*
Affiliation:
Maxwell Institute for Mathematical Sciences and Department of Mathematics, Heriot-Watt University, Edinburgh, UK
Bernhard Schmitzer
Affiliation:
University of Göttingen, Göttingen, Germany
Benedikt Wirth
Affiliation:
University of Münster, Münster, Germany
*
Corresponding author: David P. Bourne; Email: d.bourne@hw.ac.uk
Rights & Permissions [Opens in a new window]

Abstract

In this paper we study the class of optimal entropy-transport problems introduced by Liero, Mielke and Savaré in Inventiones Mathematicae 211 in 2018. This class of unbalanced transport metrics allows for transport between measures of different total mass, unlike classical optimal transport where both measures must have the same total mass. In particular, we develop the theory for the important subclass of semi-discrete unbalanced transport problems, where one of the measures is diffuse (absolutely continuous with respect to the Lebesgue measure) and the other is discrete (a sum of Dirac masses). We characterize the optimal solutions and show they can be written in terms of generalized Laguerre diagrams. We use this to develop an efficient method for solving the semi-discrete unbalanced transport problem numerically. As an application, we study the unbalanced quantization problem, where one looks for the best approximation of a diffuse measure by a discrete measure with respect to an unbalanced transport metric. We prove a type of crystallization result in two dimensions – optimality of a locally triangular lattice with spatially varying density – and compute the asymptotic quantization error as the number of Dirac masses tends to infinity.

Information

Type
Papers
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Figure 1. Two (approximate) centroidal Voronoi tessellations (CVTs) of 10 points for the uniform density $\mu =1$ on a unit square. The polygons are the centroidal Voronoi cells $V_i$ and the circles are the generators $x_i$. The CVTs were computed using Lloyd’s algorithm. The CVT on the left has a lower energy $J$ than the CVT on the right. The corresponding quantizer $\nu =\sum _{i=1}^{10} m_i \delta _{x_i}$ of $\mu$ is reconstructed from the CVT by taking $m_i$ as the areas of the centroidal Voronoi cells and $x_i$ as their generators.

Figure 1

Figure 2. Semi-discrete transport between the Lebesgue measure on $\Omega =[0,L]^2$, $L=5$ and a discrete measure with $M=4$ Dirac masses of locations $(x_1,x_2,x_3,x_4)= L \cdot ((0.375,0.375),(0.75,0.35),$$(0.65,0.75),(0.25,0.8))$ and weights $(m_1,m_2,m_3,m_4)=|\Omega |\cdot (0.38,0.29,0.19,0.14)$. Top row: optimal cells $\{{C}_i(w)\}_{i=1}^M$; the residual set $R$ is represented by white; location of the discrete points $(x_1,\ldots ,x_M)$ is indicated with red dots. Bottom row: optimal marginal $\rho$ (identical colour scale in all figures; regions with $\frac {{\mathrm {d}} \rho }{{\mathrm {d}} \mu }(x)=0$ are white for emphasis) and boundaries of cells $\{{C}_i(w)\}_{i=1}^M$ (red) are shown for models (a)–(b) from Examples2.9 and 3.14. Figure (e) shows the same model as (d), only with ${c}(x,y)=[d(x,y)/2]^2$ instead of ${c}(x,y)=d(x,y)^2$; on some cells $\operatorname{spt} \rho$ is now strictly bounded away from the boundaries of ${C}_i(w)$.

Figure 2

Figure 3. One-dimensional slices of computational results from Figure 2 along $[0,L]\times \{0.375\,L\}$ with $L=5$. Left: $\phi _w$ for optimal $w \in {\mathbb{R}}^M$. For models (a), (b), and (d), $\phi _{w}$ is piecewise quadratic; for (c) the profile is determined by $c_{\mathrm {HK}}$ and $\phi _w=\infty$ on $R \neq \emptyset$. Right: Optimal density $\frac {{\mathrm {d}} \rho }{{\mathrm {d}} \mu }$, where $\frac {{\mathrm {d}} \rho }{{\mathrm {d}} \mu }=(F^\ast)'(\!-\!\phi _{w})$ on $\Omega \setminus R$ and $0$ elsewhere by (3.4b). For (a) the density is constant, for (b) it is piecewise Gaussian, for (c) it is piecewise given by $\cos (d(y,x_i))^2$ on $\Omega \setminus R$ and $0$ on $R$, and for (d) it is given by truncated paraboloids.

Figure 3

Figure 4. Semi-discrete Hellinger–Kantorovich transport on $\Omega =[0,1]^2$ (using the same values for $x_i/L$ and $m_i/|\Omega |$ as in Figure 2) for different length scales $\varepsilon$. Top row: optimal cells $\{{C}_i(w)\}_{i=1}^M$; the residual set $R$ is represented by white; location of the discrete points $(x_1,\ldots ,x_M)$ is indicated with red dots. Bottom row: optimal marginal $\rho$ (using the same colour scale for all images). For large $\varepsilon$, the behaviour is similar to that of the standard semi-discrete Wasserstein-2 distance. As $\varepsilon$ decreases, the effects of unbalanced transport become increasingly prominent.

Figure 4

Figure 5. Semi-discrete Hellinger–Kantorovich distance on $\Omega =[0,1]^2$ for different length scales $\varepsilon$, as in Figure 4, but for $M=128$. The evolution of one cell ${C}_i(w)$ for fixed $i$ is highlighted in red (top row). For large $\varepsilon$, ${C}_i(w)$ is essentially the standard Wasserstein-2 Laguerre cell, not necessarily containing $x_i$. For small $\varepsilon$, ${C}_i(w)$ becomes (a fraction of) the open ball $B_{\varepsilon \pi /2}(x_i)$.

Figure 5

Figure 6. ${\mathrm {HK}}^\varepsilon (\mu ,\nu)^2$ for different length scales $\varepsilon$ for the setup from Figure 4. Left: as $\varepsilon \searrow 0$, ${\mathrm {HK}}^\varepsilon (\mu ,\nu)^2$ tends to $\mathrm{Hell}(\mu ,\nu)^2=2$. Right: as $\varepsilon \to \infty$, $\varepsilon ^2{\mathrm {HK}}^\varepsilon (\mu ,\nu)^2$ tends to $W_2(\mu ,\nu)^2$.

Figure 6

Figure 7. Quantization energy decrease of Lloyd’s algorithm and a BFGS method versus number of iterations (left) and function evaluations (centre; for the BFGS method function evaluations differ from iterations due to additional evaluations in the stepsize control) for the example shown on the right. Right: Input density $\mu$ and optimal locations $(x_1,\ldots ,x_M)$ for $M=100$, where $\mu$ is population density in Germany 2015 (published by the Federal Statistical Office of Germany in the “Regional Atlas” http://www.destatis.de/regionalatlas). The computations use the Hellinger–Kantorovich model.

Figure 7

Figure 8. Quantization results for $\mu =(1+\exp (\!-\!\frac {|x|^2}{2(4\pi)^2})) \cdot \mathcal{L} {{\LARGE \llcorner }}\Omega$ and $\Omega =[-4\pi ,4\pi ]^2$, $M=16$ on the same models as in Figure 2. Top row: optimal locations $x_1,\ldots ,x_M$ and Voronoi cells $\{V_i(x_1,\ldots ,x_M)\}_{i=1}^M$. Bottom row: optimal marginal $\rho ={\pi _1}_{\#} \gamma$ (identical colour scale in all figures; regions with $\frac {{\mathrm {d}} \rho }{{\mathrm {d}} \mu }(x)=0$ are white for emphasis). For (a) we have $\rho =\mu$.

Figure 8

Figure 9. Quantization results for the Hellinger–Kantorovich model and different length scales, showing the optimal Laguerre cells ${C}_i(0)$ (which coincide with the optimal Voronoi cells up to the set $R$ from (2.9)) and the optimal marginals $\rho ={\pi _1}_{\#} \gamma$ (same domain and $\mu$ as in Figure 8; identical colour scale in all figures).

Figure 9

Figure 10. Quantization results for the Hellinger–Kantorovich model using different length scales and numbers of discrete points, with constant total point density $\varepsilon _M^2 M$. The optimal marginals $\rho ={\pi _1}_{\#} \gamma$ are shown (same domain and $\mu$ as in Figure 8; identical colour scale in all figures).

Figure 10

Figure 11. Top row:$B'$from Lemma4.14for Hellinger–Kantorovich transport. Middle and bottom row: input distribution$\mu$(a Gaussian and same data as in Figure 7) as well as asymptotically optimal point densities$D$for different values of$P=\lim _{M\to \infty }\varepsilon _M^2M$(colour-coding from blue for$0$to red for maximum value).