Hostname: page-component-7479d7b7d-fwgfc Total loading time: 0 Render date: 2024-07-14T15:04:33.199Z Has data issue: false hasContentIssue false

On a class of nonlocal continuity equations on graphs

Published online by Cambridge University Press:  17 May 2023

Antonio Esposito
Mathematical Institute, University of Oxford, Oxford, UK
Francesco Saverio Patacchini
IFP Energies nouvelles, Rueil-Malmaison, France
André Schlichting*
Institute for Analysis and Numerics, University of Münster, Münster, Germany
Corresponding author: André Schlichting; Email:
Rights & Permissions [Opens in a new window]


Motivated by applications in data science, we study partial differential equations on graphs. By a classical fixed-point argument, we show existence and uniqueness of solutions to a class of nonlocal continuity equations on graphs. We consider general interpolation functions, which give rise to a variety of different dynamics, for example, the nonlocal interaction dynamics coming from a solution-dependent velocity field. Our analysis reveals structural differences with the more standard Euclidean space, as some analogous properties rely on the interpolation chosen.

Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (, which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
© The Author(s), 2023. Published by Cambridge University Press


For reference, we list some of the most recurrent notation of the paper.


Let $A$ denote a generic set.

  • $\mathcal{B}(A)$ : Borel subsets of $A$ .

  • ${\mathcal{M}}(A)$ : Radon measures on $A$ .

  • ${\mathcal{M}}^+(A)$ : nonnegative Radon measures on $A$ .

  • Given $\nu \in{\mathcal{M}}({\mathbb{R}}^d)$ and letting $A\in \mathcal{B}({\mathbb{R}}^d)$ , we denote by $\nu ^+(A) \,:\!=\, \sup _{B\in \mathcal{B}(A)} \nu (B)$ and $\nu ^-(A) \,:\!=\, - \inf _{B\in \mathcal{B}(A)} \nu (B)$ the upper and lower variation measures of $\nu$ ; the total variation measure of $\nu$ is $|\nu |(A)\,:\!=\,\nu ^+(A)+\nu ^-(A)$ , and its total variation norm is $\lVert{\nu }\rVert _{{TV}} \,:\!=\, \lvert{\nu }\rvert ({\mathbb{R}}^d)$ .

  • ${\mathcal{M}}_{\textrm{TV}}(A)$ : Radon measures on $A$ with finite total variation.

  • ${\mathcal{M}}_{\textrm{TV}}^+(A)\,:\!=\,{\mathcal{M}}^+(A)\cap{\mathcal{M}}_{\textrm{TV}}(A)$ .

  • $\mathcal{P}(A)$ : Borel probability measures on $A$ .


  • ${{\mathbb{R}}}^{2d}_{\!\scriptscriptstyle \diagup } \,:\!=\, \{{(x,y)\in{\mathbb{R}}^d\times{\mathbb{R}}^d \,:\, x\ne y}\}$ is the off-diagonal of ${\mathbb{R}^{d}}\times{\mathbb{R}^{d}}$ .

  • $\mu$ sets the underlying geometry of the state space; it belongs to ${\mathcal{M}}^+({\mathbb{R}}^d)$ and is sometimes referred to as base measure.

  • $\eta$ is the edge weight function; it maps ${{\mathbb{R}}}^{2d}_{\!\scriptscriptstyle \diagup }$ to $[0,\infty )$ .

  • $G$ is the set of edges; that is, $G= \{ (x,y)\in{{\mathbb{R}}}^{2d}_{\!\scriptscriptstyle \diagup } \,:\, \eta (x,y)\gt 0\}$ .

  • $\mathcal{V}^{\textrm{as}}(G) \,:\!=\, \{v\colon G \to{\mathbb{R}} \,:\, v(x,y) = -v(y,x)\}$ denotes the set of antisymmetric vector fields on $G$ .


  • $T$ is a positive, finite final time.

  • $\mathcal{AC}_T\,:\!=\,AC([0,T];\,{\mathcal{M}}_{\textrm{TV}}({\mathbb{R}}^d))$ is the space of absolutely continuous curves with respect to $\lVert{\cdot }\rVert _{{TV}}$ from $[0,T]$ to ${\mathcal{M}}_{\textrm{TV}}({\mathbb{R}}^d)$ .

  • Given $a\in{\mathbb{R}}$ , $a_+\,:\!=\,\max \{0,a\}$ and $a_-\,:\!=\,({-}a)_+$ are its positive and negative parts, respectively.

1. Introduction

In this manuscript, we resume the analysis of partial differential equations (PDEs) on graphs started in our previous work [Reference Esposito, Patacchini, Schlichting and Slepcev13], focusing on a larger class of nonlocal continuity equations (NCE), including different dynamics. The main motivation for this study comes from data science, as graphs represent a relevant ambient space for data representation and classification [Reference Belkin and Niyogi4, Reference Trillos and Slepčev15, Reference Trillos and Slepčev16, Reference Kannan, Vempala and Vetta22, Reference Ng, Jordan and Weiss23, Reference Roith and Bungert28]. However, most of the results obtained so far in the literature are concerned with static problems rather than time-dependent ones.

In [Reference Esposito, Patacchini, Schlichting and Slepcev13], we studied the dynamics driven by nonlocal interaction energies on graphs, whose vertices are the random sample of a given underlying distribution. We interpreted the corresponding PDEs as gradient flows of the nonlocal interaction energies in the space of probability measures, equipped with a quasi-metric obtained from the dynamical transportation cost, following Benamou–Brenier [Reference Benamou and Brenier5]. In the recent papers [Reference Heinze, Pietschmann and Schmidtchen19, Reference Heinze, Pietschmann and Schmidtchen20], the analysis is extended to nonlocal cross-interaction systems on graphs with a nonlinear mobility, in the context of nonquadratic Finslerian gradient flows. In [Reference Craig, Trillos and Slepčev9], dynamics on graphs are shown to be useful for data clustering; indeed, the authors connect the mean shift algorithm with spectral clustering at discrete and continuum levels via Fokker–Planck equations on data graphs.

The study of equations on graphs represents a natural link with the discretisation of continuous PDEs, gradient flows and optimal transport-related problems. We start mentioning structure preserving numerical schemes for evolution equations of gradient flow form (see e.g., [Reference Bailo, Carrillo and Hu2, Reference Bailo, Carrillo, Murakawa and Schmidtchen3, Reference Cancès, Gallouët and Todeschi7, Reference Carrillo, Chertock and Huang8, Reference Schlichting and Seis29] and references therein); the use of upwind and similar interpolations showed also beneficial in preserving the second law of thermodynamics, that is the entropy decay. Inspired by the theory of numerical schemes for local conservation laws, in [Reference Du, Huang and LeFloch11] a new class of monotonicity-preserving nonlocal nonlinear conservation laws was proposed, in one space dimension. The latter work might be indeed interpreted as an equation on graphs, under some suitable assumptions on the kernel considered. In this regard, it may be interesting to further investigate on the extension of the present manuscript to other nonlocal conservation laws (NCL).

Another related question concerns the convergence of discrete optimal transport distances to its continuous counterpart, cf. [Reference Forkert, Maas and Portinale14, Reference Gladbach, Kopfer, Maas and Portinale17, Reference Gladbach, Kopfer, Maas and Portinale18]. Similarly, the variational convergence of discretisation for evolution problems is investigated in [Reference Hraivoronska and Tse21]. Here the discrete systems obtained can be also seen as special cases of the type of the evolution equations investigated in the current manuscript. On a different note, we mention [Reference Peletier, Rossi, Savaré and Tse26], where a direct gradient flow formulation of jump processes is recently established – the authors consider driving energy functional containing entropies. The kinetic relations used there are symmetric, hence excluding for instance the upwind interpolation, which is our main example. A characterisation of not necessarily symmetric kinetic relations was recently obtained for jump processes on finite state spaces in [Reference Peletier and Schlichting27, Section 6.5]. Furthermore, in [Reference Esposito, Gvalani, Schlichting and Schmidtchen12], the authors recently introduced a local-nonlocal gradient structure for the aggregation equation motivated by the inelastic Boltzmann equation – this might have interesting generalisations to the graph setting.

In this work, we consider continuity equations driven by a wide class of velocity fields, including those depending on the unknown itself, and prove existence and uniqueness of measure-valued, as well as $L^p$ -valued, solutions by means a fixed-point theorem. This is a slightly different concept of solution than that used in [Reference Esposito, Patacchini, Schlichting and Slepcev13], where we established a Finslerian gradient flow framework for interaction energies. As it becomes clear in the following, the geometry of the ambient space influences the analysis and requires novel considerations.

For ease of presentation, we describe the problem first on finite, undirected graphs. Let $X \,:\!=\,\{x_1, \dots, x_n\}\subset{\mathbb{R}}^d$ be the set of vertices and consider the edge weights $w_{x,y} \geq 0$ , satisfying $w_{x,y} = w_{y,x}$ for all $x,y \in X$ . For simplicity, we impose that $w_{x,x}=0$ . We consider a mass distribution $\rho \colon X \to [0, \infty )$ with $\sum _{x \in X} \rho _x =1$ . An example of Ordinary Differential Equations (ODEs) on such a graph preserving the total mass takes the form

(1.1) \begin{equation} \frac{\textrm{d} \rho _x}{\textrm{d} t} = - \frac{1}{2} \sum _{y\in X} \!\left ({j_{x,y} - j_{y,x}}\right ) w_{x,y}, \end{equation}

The time variation of the mass at a vertex $x$ is triggered by the outgoing and ingoing fluxes, described by the function $j$ . We will be interested in the situation where the flux is obtained by a vector field $v\,:\, X\times X \to{\mathbb{R}}$ , along which the mass density $\rho$ is advected. The vector field might itself depend also on the mass density in a local or nonlocal as well as linear or nonlinear way. On graphs, the fluxes and velocities $j,v\,:\,X\times X\to{\mathbb{R}}$ are defined on the edges, whereas the mass on the single vertices. For this reason, the relation between flux and velocity strongly depends on the chosen mass interpolation on vertex pairs. We consider a general interpolation function $\Phi \,:\,{\mathbb{R}}^3\to{\mathbb{R}}$ to better understand its role for the dynamics. Hence, the continuity equation in flux-form (1.1) is complemented by the constitutive equation relating the velocity to the flux

\begin{equation*} j_{x,y} = \Phi \!\left ({\frac {1}{n}\rho _x,\frac {1}{n}\rho _y,v_{x,y}}\right ). \end{equation*}

The weights $1/n$ are relevant in the discrete-to-continuum limit. In [Reference Esposito, Patacchini, Schlichting and Slepcev13], we also considered the case of graphs with infinite vertices, namely the PDEs resulting from letting $n$ to $\infty$ . Thus, we introduced a unified setup entailing both discrete and continuum interpretations.

The vertices are points in ${\mathbb{R}}^d$ , and the edges are determined by a nonnegative symmetric weight function $\eta \colon{{\mathbb{R}}}^{2d}_{\!\scriptscriptstyle \diagup } \to [0, \infty )$ , namely we define the set of edges as $G\,:\!=\,\{ (x,y)\in{{\mathbb{R}}}^{2d}_{\!\scriptscriptstyle \diagup } \,:\, \eta (x,y)\gt 0\}$ , where ${{\mathbb{R}}}^{2d}_{\!\scriptscriptstyle \diagup } = \{(x,y)\in{\mathbb{R}}^d\times{\mathbb{R}}^d \,:\, x\ne y\}$ . From the discrete setting, the set of vertices is replaced by a general measure on ${\mathbb{R}}^d$ , denoted by $\mu$ ; a discrete graph with vertices $X\,:\!=\,\{x_1, \dots, x_n\} \subset{\mathbb{R}}^d$ corresponds then to $\mu$ being the empirical measure of $X$ , that is, $\mu = \frac{1}{n} \sum _{i=1}^n \delta _{x_i}$ . This generalisation is natural in applications to machine learning, since data have the form of a point cloud randomly sampled from some measure in Euclidean space. With this notation, the PDEs we study have the form

(1.2a) \begin{align} \partial _t\rho + \overline \nabla \cdot \boldsymbol{j} = 0, \end{align}
(1.2b) \begin{align} \boldsymbol{j}&=F^\Phi (\mu ;\,\rho,v), \end{align}

where $\overline \nabla$ and $\overline \nabla \cdot$ are the nonlocal gradient and divergence, respectively (cf. Definition 2.1 below), and $F^\Phi$ is an interpolation-dependent flux.

In [Reference Esposito, Patacchini, Schlichting and Slepcev13], we considered the upwind interpolation between vertices, as it is a reasonable choice for both the dynamics and the gradient flow structure. More precisely, we fixed $\Phi (a,b,v)=av_+-bv_-$ and introduced the following NCE:

\begin{equation*} \partial _t\rho _t(x)+\int _{\mathbb {R}^{d}}\!\left (\rho _t(x)v_t(x,y)_+-\rho _t(y)v_t(x,y)_-\right )\eta (x,y)\ \textrm{d} \mu (y) = 0, \end{equation*}

for $\mu \text{-a.e. } x \in{\mathbb{R}^{d}}$ . Note that we let here $\rho \ll \mu$ for ease of presentation, although it is not necessary, and, by abuse of notation, we denote by $\rho$ both measure and density. We focused on the specific case of the nonlocal interaction equation, that is,

(NL2IE) \begin{align} \partial _t\rho _t(x) & = - \int _{\mathbb{R}^{d}} j_t(x,y) \eta (x,y)\ \textrm{d}\mu (y) \,=\!:\, - (\overline \nabla \cdot j_t) (x), \nonumber\\ j_t(x,y) &= \rho _t(x) v_t(x,y)_+ - \rho _t(y) v_t(x,y)_-, \\ v_t(x,y) &= - \left (K*\rho _t(y) - K*\rho _t(x) + P(y) - P(x)\right ). \nonumber \end{align}

The equation above is actually a particular case of a NCL, as the velocity field depends on the configuration itself. The theory of generalised Wasserstein gradient flows was shown to be useful to prove existence of weak solutions to (NL2IE) and to provide information on the underlying geometric structure of the configuration space, which is the set of probability measures with finite second-order moments. The latter, equipped with quasi-metric introduced in [Reference Esposito, Patacchini, Schlichting and Slepcev13], has Finsler structure, rather than Riemannian. Among others, open problems include the contractivity of the quasi-distance (cf. [Reference Ohta and Sturm24, Reference Ohta and Sturm25]), the stability and uniqueness of weak solutions.

Based on the above considerations, in this paper, we obtain existence and uniqueness of measure and $L^p$ solutions for the class of PDEs (1.2a) by means of a classical Banach fixed-point argument. This complements the analysis started in [Reference Esposito, Patacchini, Schlichting and Slepcev13], as it concerns general flux interpolations as well as a larger class of velocity fields. The structure of the graph influences the analysis of the equations in this setting. Indeed, some analogous properties in the Euclidean case are not easily derived, depending on the interpolation chosen. Therefore, as a by-product of our study, we provide properties of the dynamics in relation to the interpolation considered, such as positivity preservation and $L^p$ regularity. To the best of our knowledge, this is the first result in these directions.

The paper is structured as follows. We introduce preliminary notions in Section 2 to explain the setup. Section 3 is devoted to the NCE and emphasises the fundamental role of the flux interpolation. From there, we prove basic properties of the NCE, highlighting analogies with and differences from the more standard Euclidean setting. In Section 4, we prove the main result of the manuscript, namely the existence and uniqueness of measure solutions for the NCE. We include velocity fields depending on the solution itself, in which case we also refer to the NCE as a NCL. Section 5 is focused on $L^p$ solutions and positivity preservation, only proven for the upwind interpolation.

2. Setup

Nonlocal graph structure. Let us fix a measure $\mu \in{\mathcal{M}}^+({\mathbb{R}}^d)$ and a measurable function $\eta \colon{{\mathbb{R}}}^{2d}_{\!\scriptscriptstyle \diagup } \to [0,\infty )$ , and set $G \,:\!=\, \{(x,y)\in{{\mathbb{R}}}^{2d}_{\!\scriptscriptstyle \diagup } \,:\, \eta (x,y)\gt 0\}$ . We always assume the following:

\begin{equation*}\qquad\qquad\qquad\qquad\qquad \eta\;\textit{is continuous, bounded and symmetric on}\ G.\qquad\qquad\qquad\qquad\qquad (\eta)\end{equation*}

We often refer to $\mu$ as the base measure and to $\eta$ as the weight function. In this sense, $(\mu,\eta )$ defines a possibly uncountable, weighted, undirected graph. A finite graph would correspond to the base measure $\mu _n=\frac{1}{n}\sum _i\delta _{x_i}$ for a set of points $\{x_1,x_2,\dots,x_n\}$ .

Total variation distance. For two measures $\rho ^1,\rho ^2\in{\mathcal{M}}_{\textrm{TV}}({\mathbb{R}}^d)$ , we define their total variation distance by

\begin{equation*} \lVert {\rho ^1-\rho ^2}\rVert _{{TV}} = 2\sup _{A\in \mathcal {B}({\mathbb {R}}^d)} \lvert {\rho ^1[A] - \rho ^2[A]}\rvert. \end{equation*}

The factor $2$ is present only for convenience since we restrict to measures with finite and equal total variation, so that $\lVert{\rho ^1- \rho ^2}\rVert _{{TV}}=\lvert{\rho ^1-\rho ^2}\rvert ({\mathbb{R}}^d)$ . We equip the sets ${\mathcal{M}}_{\textrm{TV}}({\mathbb{R}^{d}})$ and $\mathcal{P}({\mathbb{R}^{d}})$ with the total variation distance.

Gradients and divergences. We recall here the notions of nonlocal gradient and divergence on $G$ .

Definition 2.1 (Nonlocal gradient and divergence). For any $\phi \colon{\mathbb{R}^{d}} \to{\mathbb{R}}$ , we define its nonlocal gradient $\overline \nabla \phi \colon G \to{\mathbb{R}}$ by

\begin{equation*} \overline \nabla \phi (x,y)=\phi (y)-\phi (x) \quad \mbox {for all $(x,y)\in G$}. \end{equation*}

For any Radon measure $\boldsymbol{j}\in{\mathcal{M}}(G)$ , its nonlocal divergence $\overline \nabla \cdot \boldsymbol{j} \in{\mathcal{M}}({\mathbb{R}}^d)$ is defined as the adjoint of $\overline \nabla$ with respect to $\eta$ , that is, for any $\phi \colon{\mathbb{R}^{d}} \to{\mathbb{R}}$ continuous and vanishing at infinity, there holds

\begin{equation*} \begin {aligned} \int _{\mathbb {R}^{d}} \phi \textrm {d} \overline \nabla \cdot \boldsymbol{j} &= - \frac 12\iint _G\overline \nabla \phi (x,y) \eta (x,y)\ \textrm {d}\boldsymbol{j}(x,y)\\ &= \frac {1}{2}\int _{{\mathbb {R}^{d}}} \phi (x) \int _{{\mathbb {R}^{d}}\setminus \{x\}} \eta (x,y) \left ({ \textrm {d}\boldsymbol{j}(x,y) - \textrm {d}\boldsymbol{j}(y,x)}\right ). \end {aligned} \end{equation*}

In particular, for $\boldsymbol{j}$ antisymmetric, that is, $\boldsymbol{j}\in{\mathcal{M}}(G)$ and $\textrm{d} \boldsymbol{j}(x,y) = - \textrm{d} \boldsymbol{j}(y,x)$ , denoted by $\boldsymbol{j}\in{\mathcal{M}}^{\textrm{as}}(G)$ , we have

\begin{equation*} \int _{\mathbb {R}^{d}} \phi \textrm {d} \overline \nabla \cdot \boldsymbol{j}= \iint _G \phi (x) \eta (x,y)\ \textrm {d}\boldsymbol{j}(x,y). \end{equation*}

With this notion of divergence, we can consider a NCE (cf. Definition 3.1 below) defined on a suitable subclass of absolutely continuous curves denoted by $\mathcal{AC}_T=AC([0,T];\,{\mathcal{M}}_{\textrm{TV}}({\mathbb{R}}^d))$ . More precisely, $\mathcal{AC}_T$ is the set of curves from $[0,T]$ to ${\mathcal{M}}_{\textrm{TV}}({\mathbb{R}^{d}})$ such that there exists $m\in L^1([0,T])$ with

\begin{equation*} \lVert {\rho _s-\rho _t}\rVert _{{TV}} \leq \int _s^t m(r)\ \textrm {d} {r}, \qquad \text {for all } 0\leq s\lt t\leq T. \end{equation*}

3. Nonlocal Continuity Equation (NCE)

In this section, we study the NCE on the graph defined by $(\mu,\eta )$ . First, we define the concept of measure-valued solution.

Definition 3.1. (Measure-valued solution for the NCE). A measurable pair $(\rho,\boldsymbol{j})\colon [0,T] \to{\mathcal{M}}_{\textrm{TV}}({\mathbb{R}^{d}})\times{\mathcal{M}}(G)$ is a measure-valued (or simply measure) solution to the NCE, denoted as

(NCE) \begin{align} \partial _t \rho + \overline \nabla \cdot \boldsymbol{j} = 0, \end{align}

provided that, for any $A\in \mathcal{B}({\mathbb{R}^{d}})$ , it holds that

  1. (i) $\rho \in \mathcal{AC}_T$ ;

  2. (ii) $(\boldsymbol{j}_t)_{t\in [0,T]}$ is Borel measurable and $\left ({t\mapsto \overline \nabla \cdot \boldsymbol{j}_t[A]}\right ) \in L^1([0,T])$ ;

  3. (iii) $(\rho,\boldsymbol{j})$ satisfies,

    \begin{equation*} \rho _t[A] + \int _{0}^t \overline \nabla \cdot \boldsymbol{j}_s[A] \textrm {d} s = \rho _{0}[A] \qquad \text {for a.e. $t \in [0,T]$}; \end{equation*}

in this case, we write $(\rho,\boldsymbol{j})\in CE([0,T])$ .

In the above definition, the absolute continuity of a measure solution $\rho$ is ensured by the integrability of the flux divergence. Moreover, $\rho$ does not need to be nonnegative, that is, so that $\rho _t\geq 0$ for a.e. $t\in [0,T]$ , for the definition to make sense; in fact, positivity preservation is analysed in Section 5.

3.1. Flux interpolations

We provide a class of flux interpolations generalising our work in [Reference Esposito, Patacchini, Schlichting and Slepcev13], where we only studied the upwind interpolation. We consider a minimal set of assumptions on the interpolation to achieve well-posedness.

Definition 3.2 (Admissible flux interpolation). A measurable function $\Phi \colon{\mathbb{R}}^3 \to{\mathbb{R}}$ is called an admissible flux interpolation provided that the following conditions hold:

  1. (i) $\Phi$ satisfies

    (3.1) \begin{equation} \Phi (0,0;\,v)=\Phi (a,b;\,0)=0,\quad \text{ for all } a,b,v\in{\mathbb{R}}; \end{equation}
  2. (ii) $\Phi$ is argument-wise Lipschitz in the sense that, for some $L_{\Phi }\gt 0$ , any $a,b,c,d,v,w\in{\mathbb{R}}$ , it holds

    (3.2a) \begin{align} \lvert{\Phi (a,b;\,w) - \Phi (a,b;\,v)}\rvert &\leq L_{\Phi } (|a|+|b|) \lvert{w-v}\rvert ; \end{align}
    (3.2b) \begin{align} \lvert{\Phi (a,b;\,v) - \Phi (c,d;\,v)}\rvert &\leq L_{\Phi } (\lvert{a-c}\rvert + \lvert{b-d}\rvert ) \lvert{v}\rvert ; \end{align}
  3. (iii) $\Phi$ is positively one-homogeneous in its first and second argument, that is, for all $\alpha \gt 0$ and $(a,b,w)\in{\mathbb{R}}^3$ , it holds

    \begin{equation*} \Phi (\alpha a,\alpha b;\, w) = \alpha \Phi (a,b;\,w). \end{equation*}

Example 3.3. Here follow examples of admissible flux interpolations $\Phi$ according to Definition 3.2.

  • Upwind interpolation. One important case is given by the upwind interpolation $\Phi _{\textrm{upwind}}$ defined as

    (3.3) \begin{equation} \Phi _{\textrm{upwind}}(a,b;\,w) = a w_+ - b w_- \qquad \text{for } (a,b,w)\in{\mathbb{R}}^3. \end{equation}
  • Mean multipliers. Another case is product interpolation $\Phi _{\textrm{prod}}$ , which is of the form

    \begin{equation*} \Phi _{\textrm {prod}}(a,b;\,w) = \phi (a,b) w \qquad \text {for } (a,b,w)\in {\mathbb {R}}^3, \end{equation*}
    with $\phi \colon{\mathbb{R}}^2\to{\mathbb{R}}$ any measurable function satisfying, for some $L_\Phi \gt 0$ ,
    \begin{align*} &|\phi (a,b)|\leq L_\Phi \max \{|a|,|b|\}, \\ &|\phi (a,b)-\phi (c,d)|\le L_\Phi (|a-c|+|b-d|),\\ &\phi (\alpha a, \alpha b)= \alpha \phi (a,b),\\ &\phi (a,b)=\phi (b,a), \end{align*}
    for all $\alpha \geq 0$ and $a,b,c,d\in{\mathbb{R}}$ . Common choices for $\phi$ are as below:
    • Arithmetic mean. $\phi _{\textrm{AM}}(a,b)\,:\!=\, \frac{a+b}{2}$ ;

    • Minimal mean. $\phi _{\textrm{min}}(a,b)\,:\!=\,\min \{a,b\};$

    • Maximal mean. $\phi _{\textrm{max}}(a,b)\,:\!=\,\max \{a,b\}.$

    We note that some common choices, such as the geometric mean and the logarithmic mean, do not satisfy the Lipschitz condition stated above, which is essential for the fixed-point argument we use later to establish well-posedness. This situation may be remedied by a suitable regularisation of those examples, although we do not explore this possibility in the present paper.

Definition 3.4 (Admissible flux). Let $\Phi$ be an admissible flux interpolation, and let $\rho \in{\mathcal{M}}_{\textrm{TV}}({\mathbb{R}^{d}})$ and $w\in \mathcal{V}^{\textrm{as}}(G) \,:\!=\, \{v\colon G \to{\mathbb{R}} \,:\, v(x,y) = - v(y,x)\}$ . Furthermore, take $\lambda \in{\mathcal{M}}^+({{{\mathbb{R}}^{2d}}})$ such that $\rho \otimes \mu,\mu \otimes \rho \ll \lambda$ (e.g., $\lambda = \lvert{\rho }\rvert \otimes \mu + \mu \otimes \lvert{\rho }\rvert$ ). Then, the admissible flux $F^\Phi [\mu ;\,\rho,w]\in{\mathcal{M}}(G)$ at $(\rho,w)$ is defined by

(3.4) \begin{equation} \textrm{d} F^\Phi [\mu ;\,\rho,w] = \Phi \!\left ({\frac{\textrm{d}(\rho \otimes \mu )}{\textrm{d}\lambda }, \frac{\textrm{d}(\mu \otimes \rho )}{\textrm{d}\lambda };\, w }\right )\ \textrm{d} \lambda. \end{equation}

Note that because of the one-homogeneity of $\Phi$ , the expression in (3.4) is independent of the choice of $\lambda$ . The NCE of Definition 2.1 with the notation of Definition 3.4 reads

(NCE) \begin{equation} \partial _t \rho + \overline \nabla \cdot F^\Phi [\mu ;\,\rho _t,v_t] = 0, \end{equation}

with integral form, for all $A\in \mathcal{B}({\mathbb{R}}^d)$ , given by

(3.5) \begin{equation} \rho _t[A] + \int _{0}^t \overline \nabla \cdot F^\Phi [\mu ;\,\rho _s,v_s][A] \textrm{d} s = \rho _{0}[A], \qquad \text{for a.e. $t \in [0,T]$}, \end{equation}

where $v\,:\,[0,T]\to \mathcal{V}^{\textrm{as}}(G)$ .

3.2. Basic properties

We highlight some properties of (NCE) analogous to those in Euclidean setting, though intrinsically different due to the underlying graph structure. The well-posedness is treated in Section 4, where we consider a more general scenario, in particular including (NCE).

Proposition 3.5 (Integrability, support and mass preservation for the NCE). Let $\rho _0\in{\mathcal{M}}_{\textrm{TV}}({\mathbb{R}}^d)$ and let $v\colon [0,T] \to \mathcal{V}^{\textrm{as}}(G)$ satisfy, for some $C_v\gt 0$ ,

(3.6) \begin{equation} \int _0^T \sup _{x\in{\mathbb{R}^{d}}}\int _{{\mathbb{R}^{d}}\setminus \{x\}} |v_t(x,y)| \eta (x,y)\ \textrm{d} \mu (y) \leq C_v. \end{equation}

Let also $\Phi$ be an admissible flux interpolation and $\rho \,:\, [0,T]\to{\mathcal{M}}_{\textrm{TV}}({\mathbb{R}}^d)$ be ${\mathcal{M}}_{\textrm{TV}}({\mathbb{R}}^d)$ be such that (3.5) is satisfied. Then, the following properties hold:

  • $t \mapsto \overline \nabla \cdot F^\Phi [\rho _t,v_t][A] \in L^1([0,T])$ (flux integrability);

  • $\rho \in L^\infty ([0,T];\,{\mathcal{M}}_{\textrm{TV}}({\mathbb{R}}^d))$ (time boundedness);

  • $\rho _t[{\mathbb{R}}^d]=\rho _0[{\mathbb{R}}^d]$ for all $t\in [0,T]$ (mass preservation);

  • $\rho \in \mathcal{AC}_T$ (absolute continuity);

  • if $supp\ \rho _0\subseteq supp\ \mu$ , then $supp\ \rho _t \subseteq supp\ {\mu }$ for a.e. $t\in [0,T]$ (support inclusion).

Proof. We split the proof according to each item above.

Flux integrability – For all $A\in \mathcal{B}({\mathbb{R}^{d}})$ and $t\in [0,T]$ , we have

\begin{align*} \overline \nabla \cdot F^\Phi [\mu ;\,\rho _t,v_t][A]&=-\frac{1}{2}\iint _G\overline \nabla \chi _A(x,y)\eta (x,y)\ \textrm{d} F^\Phi [\mu ;\,\rho _t,v_t](x,y) \\ &=-\frac{1}{2}\iint _G\overline \nabla \chi _A\Phi \!\left ({\frac{\textrm{d}(\rho _t\otimes \mu )}{\textrm{d}\lambda }, \frac{\textrm{d}(\mu \otimes \rho _t)}{\textrm{d}\lambda };\, v_t }\right )\eta \textrm{d} \lambda. \end{align*}

Next, using (3.1) and (3.2a) with $w=0$ , symmetry of $\eta$ , antisymmetry of $v$ , and (3.6), we estimate, for any $t\in [0,T]$ , that

(3.7) \begin{align} \int _0^t \lvert{\overline \nabla \cdot F^\Phi [\mu ;\,\rho _s,v_s][A]}\rvert\ \textrm{d} s &\le \frac{L_\Phi }{2}\! \int _0^t \iint _G |v_s| \eta \!\left (\frac{\textrm{d}{|\rho _s|\otimes \mu }}{\textrm{d}\lambda }\!+\!\frac{\mu \otimes |\rho _s|}{\textrm{d}\lambda }\right )\ \textrm{d} \lambda \textrm{d} s \nonumber \\ &\le L_\Phi \int _0^t \iint _G|v_s(x,y)|\eta (x,y)\ \textrm{d}\mu (y)\ \textrm{d}|\rho _s|(x)\ \textrm{d}{s} \nonumber \\ &\leq L_{\Phi }\int _0^t \overline{v}_s \, |\rho _s|[{\mathbb{R}^{d}}]\textrm{d} s, \end{align}

where $\overline{v}_s \,:\!=\, \sup _{x\in{\mathbb{R}}^d} \int _{{\mathbb{R}^{d}}\setminus \{x\}} |v_t(x,y)| \eta (x,y)\ \textrm{d} \mu (y)$ .

Time boundedness – For a.e. $t\in [0,T]$ , the integral form (3.5) entails

\begin{equation*} \lvert {\rho _t}\rvert [{\mathbb {R}}^d] \leq \lvert {\rho _0}\rvert [{\mathbb {R}}^d] + L_{\Phi }\int _0^t \overline {v}_s |\rho _s|[{\mathbb {R}^{d}}]\ \textrm {d} s. \end{equation*}

Then, Gronwall’s inequality provides, for a.e. $t\in [0,T]$ , the a priori bound $\lvert{\rho _t}\rvert [{\mathbb{R}}^d]\leq \lvert{\rho _0}\rvert [{\mathbb{R}}^d] e^{L_\Phi C_v} \lt \infty$ . Hence, $\rho \in L^\infty ([0,T];\,{\mathcal{M}}_{\textrm{TV}}({\mathbb{R}}^d))$ .

Mass preservation – This is a simple consequence of $\overline \nabla \chi _{{\mathbb{R}}^d} = 0$ , which yields $\overline \nabla \cdot F^\Phi [\rho _t,v_t][{\mathbb{R}}^d]=0$ for all $t\in [0,T]$ . Hence, (3.5) implies that $\rho$ is mass preserving. We also infer the integrability of the flux from (3.7).

Absolute continuity – For any $A\in \mathcal{B}({\mathbb{R}}^d)$ , we have $t\mapsto |\overline \nabla \cdot F^\Phi [\mu ;\,\rho _t,v_t][A]|$ belongs to $L^1([0,T])$ . Hence, $\rho \in \mathcal{AC}_T$ .

Support inclusion – Note that for $A=\mathbb{R}^d\setminus\textrm{supp}\ \mu$ and a.e. $t\in [0,T]$ , the solution $\rho$ satisfies

\begin{align*}\rho_t\lbrack A\rbrack=\rho_0\lbrack A\rbrack&-\frac12\int_0^t\iint_{G\cap A\times\textrm{supp}\ \mu}\Phi\!\left(\frac{\textrm d(\rho_s\otimes\mu)}{\textrm d\lambda},0;\,v_s\right)\ \textrm d\lambda(x,y)\ \textrm ds\\&+\frac12\int_0^t\iint_{G\cap\textrm{supp}\ \mu\times A}\Phi\! \left(0,\frac{\textrm d{(\mu\otimes\rho_s)}}{\textrm d\lambda};\,v_s\right)\ \textrm d\lambda(x,y)\ \textrm ds;\end{align*}

thus, we get the estimate

\begin{align*} \lvert{\rho _t}\rvert [A] &\leq \lvert{\rho _0}\rvert [A] + L_\Phi \int _0^t \lvert{\rho _s}\rvert [A] \overline{v}_s \textrm{d}{s}, \end{align*}

and, by Gronwall’s inequality, we also get $\lvert{\rho _t}\rvert [A]\leq e^{C_v L_\Phi } \lvert{\rho _0}\rvert [A]$ . We conclude by noting that $\lvert{\rho _0}\rvert [A]=0$ by assumption.

Remark 3.6. Condition (3.6) is the analogue of the weak-compressibility assumption classically used for the continuity equation $\partial _t \rho _t + \nabla \cdot \left ({v_t \rho _t}\right )=0$ , with vector field $v\,:\,[0,T]\times{\mathbb{R}}^d \to{\mathbb{R}}^d$ (see, e.g., [Reference Ambrosio1, Reference DiPerna and Lions10]). More precisely, in the Euclidean setting, the assumption in (3.6) takes the form $\nabla \cdot v \in L^1([0,T];\,L^\infty ({\mathbb{R}}^d))$ and is used to control of $\lVert{\rho }\rVert _{L^\infty ([0,T];\,L^p({\mathbb{R}}^d))}$ , for any $p\in [1,\infty )$ (cf. [Reference DiPerna and Lions10, Prop II.1.]). In our setting, the structural properties of the graph, encoded in $(\mu,\eta )$ and the flux interpolation $\Phi$ , require a refined analysis involving a careful regularisation argument when treating $L^p$ solutions; we refer the reader to Section 5, where those questions are studied for solutions possessing a density.

4. Nonlocal Conservation Law (NCL)

We focus here on the general case where the velocity field depends on the solution itself. More precisely, we provide well-posedness to (NCE) for a vector field of the form

\begin{equation*} v_t(x,y) = V_t[\rho _t](x,y) \qquad \text {for all $t\in [0,T]$}, \end{equation*}

for some $V\colon [0,T] \times{\mathcal{M}}_{\textrm{TV}}({\mathbb{R}^{d}}) \to \mathcal{V}^{\textrm{as}}(G)$ . For the reader’s convenience, we write the following straightforward generalisation of Definition 3.1 to what we refer to as NCL.

Definition 4.1 (Measure-valued solution to the NCL). Given an admissible flux interpolation $\Phi$ and a measurable map $V\colon [0,T] \times{\mathcal{M}}_{\textrm{TV}}({\mathbb{R}^{d}}) \to \mathcal{V}^{\textrm{as}}(G)$ , a curve $\rho \colon [0,T] \to{\mathcal{M}}_{\textrm{TV}}({\mathbb{R}^{d}})$ is said to be a measure-valued (or simply measure) solution to the NCL, denoted as

(NCL) \begin{equation} \partial _t \rho + \overline \nabla \cdot F^\Phi [\mu ;\,\rho,V_t(\rho )] = 0, \end{equation}

provided that, for any $A\in \mathcal{B}({\mathbb{R}^{d}})$ , it holds that

  1. (i) $\rho \in \mathcal{AC}_T$ ;

  2. (ii) $t \mapsto \overline \nabla \cdot F^\Phi [\mu ;\,\rho _t,V_t(\rho _t)][A] \in L^1([0,T])$ ;

  3. (iii) $\rho$ satisfies

    (4.1) \begin{equation} \rho _t[A] + \int _{0}^t \overline \nabla \cdot F^\Phi [\mu ;\,\rho _s,V_s(\rho _s)][A]\ \textrm{d} s = \rho _{0}[A] \qquad \text{for a.e. $t \in [0,T]$}. \end{equation}

Example 4.2. An important example of a map $V$ in Definition 4.1 is that stemming from the convolution with an interaction kernel (or potential) $K\colon{\mathbb{R}^{d}}\times{\mathbb{R}^{d}}\to{\mathbb{R}}$ , which yields the Nonlocal Nonlocal Interaction Equation (NL $^2$ IE), to which we can add an external potential $P\colon{\mathbb{R}^{d}}\to{\mathbb{R}}$ . Namely, in this case, for $\rho \colon [0,T] \to{\mathcal{M}}_{\textrm{TV}}({\mathbb{R}^{d}})$ , $t\in [0,T]$ and $(x,y)\in G$ , the vector field $V$ is given by

\begin{equation*} V_t[\rho _t](x,y) = -\overline \nabla (K*\rho _t)(x,y) - \overline \nabla P(x,y). \end{equation*}

When the interpolation is chosen to be the upwind one (3.3), we get the equation studied in the optimal transport, weak-measure setting of [Reference Esposito, Patacchini, Schlichting and Slepcev13].

Our well-posedness proof of (NCL), and thus (NCE), is based on a fixed-point argument and only applies to measures with fixed total variation, which is consistent with the mass preservation property from Proposition 3.5. For all $M\gt 0$ , we introduce the notation

\begin{align*} \mathcal{AC}_T^M = AC([0,T];\,{\mathcal{M}}_{\textrm{TV}}^M({\mathbb{R}^{d}})), \qquad{\mathcal{M}}_{\textrm{TV}}^M({\mathbb{R}^{d}}) = \{\rho \in{\mathcal{M}}_{\textrm{TV}} \,:\, \lvert{\rho}\rvert [{\mathbb{R}^{d}}] = M\}. \end{align*}

Note that, for any $\rho ^0,\rho ^1\in{\mathcal{M}}_{\textrm{TV}}^M({\mathbb{R}^{d}})$ , we have the identity

\begin{equation*} \lVert {\rho ^1-\rho ^2}\rVert _{{TV}} = 2\sup _{A\in \mathcal {B}({\mathbb {R}}^d)} \lvert {\rho ^1[A] - \rho ^2[A]}\rvert =|\rho ^1-\rho ^2|({\mathbb {R}^{d}}). \end{equation*}

Throughout this section, we fix $M\geq 0$ , $\rho ^0\in{\mathcal{M}}_{\textrm{TV}}^M({\mathbb{R}^{d}})$ and $\Phi$ an admissible flux interpolation (cf. Definition 3.2). With any $V\colon [0,T] \times{\mathcal{M}}_{\textrm{TV}}^M({\mathbb{R}^{d}}) \to \mathcal{V}^{\textrm{as}}(G)$ such that, for some $C_V\gt 0$ ,

(4.2) \begin{equation} \sup _{t\in [0,T]} \sup _{\rho \in{\mathcal{M}}_{\textrm{TV}}^M({\mathbb{R}^{d}})} \sup _{x\in{\mathbb{R}^{d}}} \int _{{\mathbb{R}^{d}}\setminus \{x\}} |V_t[\rho ](x,y)| \eta (x,y)\ \textrm{d} \mu (y)\ \textrm{d} t \leq C_V, \end{equation}

we associate the solution map $S_T^V\colon \mathcal{AC}_T^M \to \mathcal{AC}_T^M$ , defined, for $t \in [0,T]$ and $A\in \mathcal{B}({\mathbb{R}^{d}})$ , by

\begin{equation*} S_T^V(\rho )(t)[A] \,:\!=\, \rho ^0[A] - \int _0^t \overline \nabla \cdot F^\Phi [\mu ;\,\rho _s,V_s(\rho _s)][A]\ \textrm {d} s. \end{equation*}

Note that (4.2) is an $L^\infty (L^\infty )$ -type of bound for the nonlocal divergence; it is thus slightly stronger than the similar (3.6) of $L^1(L^\infty )$ -type under which we have boundedness of solutions in Proposition 3.5.

We establish well-posedness under a Lipschitz assumption on $\rho \mapsto V[\rho ]$ on the space $\mathcal{AC}_T$ , which we endow with the distance $\boldsymbol{d}_{\mathcal{AC}_T}$ defined by

\begin{equation*} \boldsymbol{d}_{\mathcal {AC}_T}(\rho,\sigma ) \!=\! \lVert {\rho -\sigma }\rVert _{L^\infty ([0,T];\,{\mathcal {M}}_{\textrm {TV}}({\mathbb {R}}^d))} \!=\! \sup _{t\in [0,T]} \lVert {\rho _t-\sigma _t}\rVert _{{TV}} \quad \text {for all $\rho,\sigma \in \mathcal {AC}_T$}. \end{equation*}

Lemma 4.3. Let $V\colon [0,T] \times{\mathcal{M}}_{\textrm{TV}}^M({\mathbb{R}^{d}}) \to \mathcal{V}^{\textrm{as}}(G)$ satisfy the uniform-compressibility assumption (4.2) for some $C_V\in (0,\infty )$ and suppose that there exists a constant $L_V\ge 0$ such that, for all $t\in [0,T]$ and all $\rho,\sigma \in{\mathcal{M}}_{\textrm{TV}}^M({\mathbb{R}^{d}})$ ,

(4.3) \begin{equation} \sup _{x\in{\mathbb{R}^{d}}}\int _{{\mathbb{R}^{d}}\setminus \{x\}} |V_t[\rho ](x,y) - V_t[\sigma ](x,y)| \eta (x,y)\ \textrm{d}\mu (y)\ \textrm{d} t \leq L_V \lVert{\rho - \sigma }\rVert _{{TV}}. \end{equation}

Then, for all $\rho,\sigma \in \mathcal{AC}_T^M$ , the contraction estimate

\begin{equation*} \boldsymbol{d}_{\mathcal {AC}_T}(S_T^V(\rho ),S_T^V(\sigma )) \leq \alpha T \boldsymbol{d}_{\mathcal {AC}_T}(\rho,\sigma ), \end{equation*}

holds for $\alpha \,:\!=\, L_{\Phi }\!\left ({M L_V + C_V}\right )$ , where $L_\Phi$ is as in (3.2).

In particular, for $T\gt 0$ such that $T\lt 1/\alpha$ , there exists a unique measure solution $\rho$ to (NCL) on $[0,T]$ such that $\rho _{0} = \rho ^0$ .

Proof. Let $\rho,\sigma \in \mathcal{AC}_T^M$ and let $t \in [0,T]$ . We rewrite, for $s\in [0,T]$ ,

(4.4) \begin{equation} \overline \nabla \cdot F^\Phi [\mu ;\,\rho _s,V_s(\rho _s)][A] - \overline \nabla \cdot F^\Phi [\mu ;\,\sigma _s,V_s(\sigma _s)][A]= I_s + II_s, \end{equation}


\begin{align*} I_s &= \frac{1}{2}\iint _{G}\overline \nabla \chi _A(x,y) \Biggl [ \Phi \!\left ({\frac{\textrm{d}(\sigma _s\otimes \mu )}{\textrm{d}\lambda },\frac{\textrm{d}(\mu \otimes \sigma _s)}{\textrm{d}\lambda }; V_s[\sigma _s]}\right ) \\ &\qquad \qquad \qquad - \Phi \!\left ({\frac{\textrm{d}(\sigma _s\otimes \mu )}{\textrm{d}\lambda },\frac{\textrm{d}(\mu \otimes \sigma _s)}{\textrm{d}\lambda };\, V_s[\rho _s]}\right )\Biggr ] \eta \textrm{d} \lambda, \\ II_s &= \frac{1}{2}\iint _{G}\overline \nabla \chi _A(x,y) \Biggl [ \Phi \!\left ({\frac{\textrm{d}(\sigma _s\otimes \mu )}{\textrm{d}\lambda },\frac{\textrm{d}(\mu \otimes \sigma _s)}{\textrm{d}\lambda };\, V_s[\rho _s]}\right )\\ &\qquad \qquad \qquad -\Phi \!\left ({\frac{\textrm{d}(\rho _s\otimes \mu )}{\textrm{d}\lambda },\frac{\textrm{d}(\mu \otimes \rho _s)}{\textrm{d}\lambda };\, V_s[\rho _s]}\right )\Biggr ] \eta \textrm{d} \lambda. \end{align*}

For the first term, we apply the Lipschitz assumptions (3.2a) on $\Phi$ and (4.3) on $V$ and use the antisymmetry of $V_t(\rho _t)$ and $V_t(\sigma _t)$ and the symmetry of $\eta$ (cf. $\eta$ ) to obtain

\begin{align*} \int _0^t \lvert{I_s}\rvert \textrm{d}{s} &\leq \frac{L_\Phi }{2}\int _0^t\iint _{G} \left |V_s[\sigma _s] - V_t[\rho _s] \right | \eta \!\left ( \textrm{d} (\lvert{\sigma _s}\rvert \otimes \mu ) + \textrm{d} (\mu \otimes \lvert{\sigma _s}\rvert ) \right )\ \textrm{d} s \\[3pt] &\leq L_\Phi \int _0^t \iint _{G} \bigl\lvert V_s[\sigma _s] - V_t[\rho _s]\bigr\rvert(x,y) \eta (x,y)\ \textrm{d} (\lvert{\sigma _s}\rvert \otimes \mu )(x,y)\ \textrm{d} s \\[3pt] &\leq L_\Phi \!\sup _{s\in [0,t]} \lvert{\sigma _s}\rvert [{\mathbb{R}}^d]\! \int _0^t \!\sup _{x\in{\mathbb{R}}^d} \int _{{\mathbb{R}}^d\setminus \{x\}} \bigl\lvert V_s[\sigma _s] \!-\! V_t[\rho _s]\bigr\rvert(x,y) \eta (x,y) \ \textrm{d} \mu (y) \ \textrm{d} s\\[3pt] &\leq L_\Phi L_V M T \, \boldsymbol{d}_{\mathcal{AC}_T}(\rho,\sigma ). \end{align*}

As for $II_s$ , we use the Lipschitz assumption (3.2b) on $\Phi$ , again the antisymmetry of $V_t(\rho _t)$ and the symmetry of $\eta$ (recall $\eta$ ), and apply the compressibility of $V$ given in (4.2) to get

\begin{align*} \int _0^t\lvert{II_s}\rvert \textrm{d} s&\le \frac{L_\Phi }{2}\int _0^t\iint _{G} \lvert{V_s[\rho _s]}\rvert (x,y) \left ( \left |\frac{\textrm{d}{(\sigma _s\otimes \mu )}}{\textrm{d}\lambda }(x,y) - \frac{\textrm{d}(\rho _s\otimes \mu )}{\textrm{d}\lambda }(x,y)\right | \right .\\[3pt]&\qquad \qquad + \left. \left |\frac{\textrm{d}(\mu \otimes \sigma _s)}{\textrm{d}\lambda }(x,y) - \frac{\textrm{d}{(\mu \otimes \rho _s)}}{\textrm{d}\lambda }(x,y)\right | \right ) \eta (x,y)\ \textrm{d} \lambda (x,y)\ \textrm{d} s\\[3pt]&\leq \frac{L_\Phi }{2}\int _0^t\iint _{G}\lvert{V_s[\rho _s]}\rvert \eta \!\left ({\textrm{d}(\lvert{\sigma _s-\rho _s}\rvert \otimes \mu )+\textrm{d}(\mu \otimes \lvert{\sigma _s-\rho _s}\rvert )}\right )\ \textrm{d} s\\[3pt]&\leq L_\Phi C_VT \boldsymbol{d}_{\mathcal{AC}_T}(\rho,\sigma ). \end{align*}

All in all, taking the suprema over Borel sets and over time in (4.4) gives

\begin{align*} \boldsymbol{d}_{\mathcal{AC}_T}\!\left ({S_T^V(\rho ), S_T^V(\sigma )}\right ) \leq L_{\Phi }\!\left ({M L_V + C_V}\right )T \boldsymbol{d}_{\mathcal{AC}_T}(\rho,\sigma )\,=\!:\, \alpha T\boldsymbol{d}_{\mathcal{AC}_T}(\rho,\sigma ). \end{align*}

The existence and uniqueness when $T\lt 1/\alpha$ is a direct consequence of the Banach fixed-point theorem in the metric space $\mathcal{AC}_{T}^M$ applied to $S_{T}^V$ .

Remark 4.4. For (NCE), one has to control only the term $II_s$ , and so the condition in (4.2) is enough to get the contraction estimate and well-posedness.

Theorem 4.5 (Well-posedness for (NCL)). Let $V\colon [0,T] \times{\mathcal{M}}_{\textrm{TV}}^M({\mathbb{R}^{d}}) \to \mathcal{V}^{\textrm{as}}(G)$ and suppose there are constants $C_V,L_V\gt 0$ so that, for all $t\in [0,T]$ and all $\rho,\sigma \in{\mathcal{M}}_{\textrm{TV}}^M({\mathbb{R}^{d}})$ ,

\begin{equation*} \begin {gathered} \sup _{x\in {\mathbb {R}^{d}}} \int _{{\mathbb {R}^{d}}\setminus \{x\}} |V_t[\rho ](x,y)| \eta (x,y) \ \textrm{d} \mu (y) \leq C_V,\\ \sup _{x\in {\mathbb {R}^{d}}}\int _{{\mathbb {R}^{d}}\setminus \{x\}} |V_t[\rho ](x,y) - V_t[\sigma ](x,y)| \eta (x,y)\ \textrm {d}\mu (y) \leq L_V \lVert {\rho -\sigma }\rVert _{TV}. \end {gathered} \end{equation*}

Then, there exists a unique measure solution $\rho$ to (NCL) such that $\rho _0 = \rho ^0$ .

Proof. Let $\alpha$ be as in Lemma 4.3 and let $a = \alpha T$ . If $a\lt 1$ , then the result is direct by applying the well-posedness from Lemma 4.3.

Suppose now $a\geq 1$ , write $k$ the integer part of $a$ and let $\tau = 1/(2\alpha )$ . Then, by Lemma 4.3, we know there exists a unique measure solution to (NCL) on $[0,\tau ]$ ; let us call this solution $\rho ^1$ and observe that $\rho ^1 \in \mathcal{AC}_{0,\tau }$ , where $\mathcal{AC}_{0,\tau }=AC([0,\tau ];\,{\mathcal{M}}_{\textrm{TV}}^M({\mathbb{R}^{d}}))$ . Again, applying Lemma 4.3 yields the existence and uniqueness of $\rho ^2 \in \mathcal{AC}_{\tau,2\tau }$ , the solution to (NCL) on $[\tau,2\tau ]$ . By proceeding iteratively, we construct a sequence of solutions

\begin{equation*} \rho ^i \in \mathcal {AC}_{(i-1)\tau,i\tau } \quad \text {for all $i \in \{1,\dots,k\}$}, \qquad \rho ^{k+1} \in \mathcal {AC}_{k\tau,T}. \end{equation*}

We now define the curve $\rho \in \mathcal{AC}_{0,T} = \mathcal{AC}_T$ by

\begin{equation*} \begin {cases} \rho _t = \rho _t^i & \text {for all $t \in [(i-1)\tau,i\tau )$ and $i\in \{1,\dots,k\}$},\\ \rho _t = \rho _t^{k+1} & \text {for all $t \in [k\tau,T]$}, \end {cases} \end{equation*}

which, by construction, is the unique measure solution to (NCL).

We now apply Theorem 4.5 to the nonlocal interaction equation studied in [Reference Esposito, Patacchini, Schlichting and Slepcev13], that is, to the velocity field $v$ as in Example 4.2, but for a more general admissible flux interpolation $\Phi$ . This provides existence and uniqueness of measure solutions to (NL2IE).

Corollary 4.6 (Well-posedness for (NL2IE)). Assume that $\eta$ satisfies

(4.5) \begin{equation} \sup _{x \in{\mathbb{R}^{d}}} \int _{{\mathbb{R}^{d}}} f(x,y) \eta (x,y)\ \textrm{d}\mu (y) \lt \infty \end{equation}

for some nonnegative measurable function $f \colon{\mathbb{R}^{d}} \times{\mathbb{R}^{d}} \to{\mathbb{R}}$ . Let $K \colon{\mathbb{R}^{d}} \times{\mathbb{R}^{d}} \to{\mathbb{R}}$ and $P \colon{\mathbb{R}^{d}} \to{\mathbb{R}}$ be such that there exist constants $L_K, L_P\gt 0$ for which

(4.6) \begin{equation} |K(y,z) - K(x,z)| \leq L_K f(x,y), \quad |P(y) - P(x)| \leq L_P f(x,y), \end{equation}

for all $x,y,z \in{\mathbb{R}^{d}}$ . Then, (NL2IE), whose velocity $V\colon [0,T]\times{\mathcal{M}}_{\textrm{TV}}^M({\mathbb{R}^{d}}) \to \mathcal{V}^{\textrm{as}}(G)$ we recall is defined for $t\in [0,T]$ and $\sigma \in{\mathcal{M}}_{\textrm{TV}}^M({\mathbb{R}^{d}})$ by

(4.7) \begin{equation} V_t[\sigma ](x,y) = -\overline \nabla K*\sigma (x,y) - \overline \nabla P(x,y) \quad for\ all\ (x,y) \in G, \end{equation}

has a unique measure solution $\rho$ such that $\rho _0 = \rho ^0$ .

Proof. We first check that, indeed, $V$ as given in (4.7) satisfies (4.2).

\begin{align*} |V_t[\rho ](x,y)| &= |\overline \nabla (K*\rho +P)(x,y)| = |K*\rho (y) + P(y) - K*\rho (x) - P(x)|\\[3pt] &\leq \int _{\mathbb{R}^{d}} \left | K(y,z) - K(x,z) \right | \textrm{d}|\rho |(z) + |P(y) - P(x)|\\[3pt] &\leq L_K \int _{\mathbb{R}^{d}} f(x,y) \textrm{d}|\rho |(z) + L_P f(x,y) = (ML_K + L_P) f(x,y); \end{align*}

hence we obtain

\begin{align*} & \sup_{t\in [0,T]}\sup_{\rho\in{\mathcal{M}_{\textrm{TV}}^M({{\mathbb{R}^{d}}})}}\sup_{x\in{{\mathbb{R}^{d}}}} \int_{{{\mathbb{R}^{d}}}\setminus\{x\}} |V_t[\rho](x,y)| \eta(x,y)\ \textrm{d}\mu(y)\\[3pt] & \leq (ML_K + L_P) \sup_{x \in {{\mathbb{R}^{d}}}} \int_{{{\mathbb{R}^{d}}}\setminus\{x\}} f(x,y) \eta(x,y)\ \textrm{d}\mu(y) <\infty, \end{align*}

which is (4.2). Then, we are only left with showing (4.3). For all $\rho,\sigma \in{\mathcal{M}}_{\textrm{TV}}^M({\mathbb{R}^{d}})$ , $t \in [0,T]$ and $(x,y) \in G$ , we have

\begin{align*} |V_t[\rho ](x,y)-V_t[\sigma ](x,y)| &= | \overline \nabla (K*\rho _t - K*\sigma _t)(x,y) |\\[3pt] &\leq \int _{\mathbb{R}^{d}} |K(y,z) - K(x,z)| \textrm{d} |\rho _t(z) - \sigma _t(z)|\\[3pt] &\leq L_K \lVert{\rho _t-\sigma _t}\rVert _{TV} f(x,y), \end{align*}

which yields (4.3) and ends the proof.

Note that choosing the function $f$ in the above corollary to be

\begin{equation*} f(x,y) = |x-y|\vee |x-y|^2 \quad \mbox {for all $x,y \in {\mathbb {R}^{d}}$} \end{equation*}

shows that [Reference Esposito, Patacchini, Schlichting and Slepcev13, Assumption (K3)], needed for the existence result on weak solutions to (NL2IE) in [Reference Esposito, Patacchini, Schlichting and Slepcev13, Theorem 3.15], is stronger than that in (4.6) on $K$ . On the other hand, the condition (4.5), resulting from this choice of $f$ , is a stronger assumption on $\eta$ near the diagonal than [Reference Esposito, Patacchini, Schlichting and Slepcev13, Assumption (A1)], again needed in Theorem [Reference Esposito, Patacchini, Schlichting and Slepcev13, Theorem 3.15]. Our well-posedness result in Corollary 4.6 thus holds for more general interaction potentials but less general weight functions than our weak existence result in Theorem [Reference Esposito, Patacchini, Schlichting and Slepcev13, Theorem 3.15]. Another interesting example of $f$ which can be chosen in Corollary 4.6 is a constant function, which only imposes $K$ to be a bounded function; in this case, the resulting condition (4.5) on $\eta$ is even more restrictive, albeit still reasonable.

Remark 4.7 (The case when $\mu$ is atomic). Let $I \subseteq{\mathbb{N}}$ be not necessarily finite. Consider $\{x_i\}_{i\in I} \subset{\mathbb{R}}^d$ , $\{m_i\}_{i\in I} \subset [0,\infty )$ and $\mu \in{\mathcal{M}}^+({\mathbb{R}^{d}})$ such that

\begin{equation*} \mu = \sum _{i\in I} m_i\delta _{x_i}. \end{equation*}

Let $V\colon [0,T] \times{\mathcal{M}}_{\textrm{TV}}^M({\mathbb{R}^{d}}) \to \mathcal{V}^{\textrm{as}}(G)$ satisfy the hypotheses of Theorem 4.5; that is, there exist there exist $C_V,L_V\gt 0$ such that, for all $t\in [0,T]$ and all $\rho,\sigma \in{\mathcal{M}}_{\textrm{TV}}^+({\mathbb{R}^{d}})$ , we have

\begin{gather*} \sup _{t\in [0,T]}\sup _{\rho \in{\mathcal{M}}_{\textrm{TV}}^M({\mathbb{R}^{d}})}\sup _{x \in{\mathbb{R}^{d}}} \sum _{\substack{j\in I\\ x_j\neq x}}^n m_j |V_t[\rho ](x,x_j)|\eta (x,x_j) \leq C_V,\\ \sup _{x \in{\mathbb{R}^{d}}} \sum _{j\in I:x_k\neq x}^n m_j |V_t[\rho ](x,x_j) - V_t[\sigma ](x,x_j)|\eta (x,x_j) \leq L_V \lVert{\rho -\sigma }\rVert _{TV}. \end{gather*}

In this case, we know from Theorem 4.5 that a unique solution $\rho$ exists on $[0,T]$ such that $\rho _0 = \rho ^0$ . If $\textrm{supp}\ \rho^0\subseteq\textrm{supp}\ \mu$ , then Proposition 3.5 entails that the solution stays supported in $\textrm{supp}\ \mu$ , in particular, $\rho _t\ll \mu$ for a.e. $t\in [0,T]$ . If moreover $\Phi$ is jointly antisymmetric, that is, $\Phi (a,b;\,-v)=-\Phi (b,a;\,v)$ for any $a,b,v\in{\mathbb{R}}$ , then (4.1) rewrites, for any $A\in \mathcal{B}({\mathbb{R}^{d}})$ and a.e. $t\in [0,T]$ , as

\begin{align*} \rho _t[A] = \rho ^0[A] -\sum _{i\neq j}\int _0^t \Phi \!\left ({r_i(t)m_j, m_i r_j(t), V_s[\rho _s](x_i,x_j)}\right )\eta (x_i,x_j)\ \textrm{d} s. \end{align*}

5. $\boldsymbol{L}^{\boldsymbol{p}}$ solutions and positivity preservation

Let $\rho ^0 \in{\mathcal{M}}_{\textrm{TV}}^M({\mathbb{R}}^d)$ be such that $\rho ^0\ll \mu$ . In this section, we consider curves in $AC([0,T];\,L^1_\mu ({\mathbb{R}^{d}}))$ and equip it with the distance

\begin{equation*} \|\rho ^1-\rho ^2\|_{L^\infty ([0,T];\,L^1_\mu ({\mathbb {R}^{d}}))}=\sup _{t\in [0,T]}\int _{\mathbb {R}^{d}}|\rho ^1(x)-\rho ^2(x)|\ \textrm{d} \mu (x) \quad \text {for all $\rho ^1,\rho ^2\in L^1_\mu ({\mathbb {R}^{d}})$}. \end{equation*}

The advantage of the $L^1_\mu$ setting is that we are able to show positivity preservation of solutions when $\Phi =\Phi _{\textrm{Upwind}}$ , as well as $L^p_\mu$ regularity with $p\in (1,\infty )$ .

In this setting, we choose $\lambda =\mu \otimes \mu$ so that the admissible flux from Definition 3.2 is given by

\begin{equation*} \textrm {d} F^\Phi [\mu ;\,\rho,w](x,y) = \Phi \!\left ({\rho (x),\rho (y);\, w(x,y)}\right )\ \textrm {d} (\mu \otimes \mu )(x,y), \end{equation*}

for any $\rho \in L^1_\mu ({\mathbb{R}^{d}})$ , $w\in \mathcal{V}^{\textrm{as}}(G)$ and $(x,y)\in G$ . Assuming that $\Phi$ is jointly antisymmetric, that is, $\Phi (a,b;\,-\!v)=-\Phi (b,a;\,v)$ for any $a,b,v\in{\mathbb{R}}$ , the nonlocal divergence of $F^\Phi [\mu ;\,\rho,v]$ is given by

\begin{equation*} \overline \nabla \cdot F[\mu ;\,\rho,v](x)=\int _{{\mathbb {R}^{d}}\setminus \{x\}}\Phi \!\left ({\rho (x),\rho (y);\, v(x,y)}\right )\eta (x,y)\ \textrm{d} \mu (y) \quad \text {for $\mu $-a.e. $x\in {\mathbb {R}^{d}}$};\, \end{equation*}

properties stated in Proposition 3.5 still hold. As in Section 4, the velocity field may depend on the configuration itself:

\begin{equation*} v_t(x,y)=V_t[\rho _t](x,y) \quad \text {for all $t\in [0,T]$ and $(x,y)\in G$}, \end{equation*}

for some $V\,:\,[0,T]\times L^1_\mu ({\mathbb{R}^{d}})\to \mathcal{V}^{\textrm{as}}(G)$ . The solution map is, for $\mu$ -a.e. $x\in{\mathbb{R}^{d}}$ , given by

(5.1) \begin{equation} \rho _t(x)=\rho ^0(x)-\int _0^t \overline \nabla \cdot F[\mu ;\,\rho _s,V_s[\rho _s]](x)\ \textrm{d} s. \end{equation}

Fix $\rho ^0\in L^1_{\mu,M}({\mathbb{R}^{d}})$ . The procedure followed in Section 4 provides a well-posedness result, where, for $M\gt 0$ fixed, we set $L^1_{\mu,M}({\mathbb{R}^{d}})\,:\!=\,\bigl\{\rho \in L^1_{\mu }({\mathbb{R}^{d}}) \,:\, \int _{\mathbb{R}^{d}}|\rho (x)|\textrm{d} \mu (x)=M\bigr\}$ :

Theorem 5.1 (Well-posedness for (NCL)). Let $V\colon [0,T] \times L^1_{\mu,M}({\mathbb{R}^{d}}) \to \mathcal{V}^{\textrm{as}}(G)$ and suppose there are constants $C_V,L_V\gt 0$ so that, for all $t\in [0,T]$ and all $\rho,\sigma \in L^1_{\mu,M}({\mathbb{R}^{d}})$ ,

\begin{equation*} \begin {gathered} \sup _{x\in {\mathbb {R}^{d}}} \int _{{\mathbb {R}^{d}}\setminus \{x\}} |V_t[\rho ](x,y)| \eta (x,y) \ \textrm{d} \mu (y) \leq C_V,\\[5pt] \sup _{x\in {\mathbb {R}^{d}}}\int _{{\mathbb {R}^{d}}\setminus \{x\}} |V_t[\rho ](x,y) - V_t[\sigma ](x,y)| \eta (x,y)\ \textrm {d}\mu (y) \leq L_V \lVert {\rho -\sigma }\rVert _{L^1_\mu ({\mathbb {R}^{d}})}. \end {gathered} \end{equation*}

Then, there exists a unique measure solution $\rho$ to (NCL) satisfying (5.1) such that $\rho _0 = \rho ^0$ .

As we now work with densities (with respect to $\mu$ ), we are able to prove positivity preservation for (NCE) in the case of the upwind flux interpolation; the proof of the result follows the strategy used in [Reference Boyer6].

Proposition 5.2 (Positivity preservation for (NCE)). Let $\rho ^0$ be nonnegative everywhere and let the assumptions in Theorem 5.1 hold. Furthermore, assume that $\Phi \equiv \Phi _{\textrm{Upwind}}$ . Then, the solution $\rho$ to (NCE) is nonnegative a.e., that is, $\rho _t(x)\ge 0$ for a.e. $t\in [0,T]$ and $\mu$ -a.e. $x\in{\mathbb{R}^{d}}$ .

Proof. As $\rho$ is absolutely continuous in time, for a.e. $t\in [0,T]$ and $\mu$ -a.e. $x\in{\mathbb{R}^{d}}$ , it holds

\begin{align*} \begin{split} \partial _t\rho _t(x)&=-\overline \nabla \cdot F^\Phi [\mu ;\,\rho _t,v_t](x)\\[5pt] &=-\int _{{\mathbb{R}^{d}}\setminus \{x\}}v_t(x,y)_+ \eta (x,y)\rho _t(x)\ \textrm{d} \mu (y)\\[5pt] &\quad +\int _{{\mathbb{R}^{d}}\setminus \{x\}}v_t(x,y)_-\eta (x,y)\rho _t(y)\ \textrm{d} \mu (y). \end{split} \end{align*}

We denote by $a,A \colon [0,T] \to \mathbb{R}$ the maps defined by

\begin{align*} a(t)\,:\!=\,\sup _{x\in{\mathbb{R}^{d}}}\int _{{\mathbb{R}^{d}}\setminus \{x\}}|v_t(x,y)_-|\eta (x,y)\ \textrm{d} \mu (y), \quad A(t)\,:\!=\,\exp \!\left ({-}\int _0^t a(s)\ \textrm{d} s\right ), \end{align*}

and we set $\tilde{\rho }_t(x)=A(t)\rho _t(x)$ for a.e. $t\in [0,T]$ and $\mu$ -a.e. $\in{\mathbb{R}^{d}}$ . In turn, by using $v_+=v+v_-$ , we obtain, for $\mu$ -a.e. $x\in{\mathbb{R}^{d}}$ ,

reordering the terms, we get

(5.2) \begin{equation} \begin{split} \partial _t \tilde \rho _t(x) &+ \int _{{\mathbb{R}^{d}}\setminus \{x\}} v_t(x,y)_- \left ({\tilde \rho _t(x) - \tilde \rho _t(y)}\right ) \eta (x,y)\ \textrm{d} \mu (y) \\ &+\tilde{\rho }_t(x)\left (a(t)+\int _{{\mathbb{R}^{d}}\setminus \{x\}} v_t(x,y)\eta (x,y)\ \textrm{d} \mu (y)\right ) = 0. \end{split} \end{equation}

In addition, note that, by definition of $a$ , we have

\begin{equation*} a(t)+\int _{{\mathbb {R}^{d}}\setminus \{x\}} v(x,y)\eta (x,y)\ \textrm{d} \mu (y)\ge 0. \end{equation*}

Let us prove that any supersolution of (5.2) is a.e. nonnegative. Indeed, if this were true, then we would have that the supersolution $\rho _t^\varepsilon \,:\!=\,\tilde{\rho }_t+\varepsilon t=a(t)\rho _t+\varepsilon t\ge 0$ $\mu$ -a.e., for any $\varepsilon \gt 0$ and a.e. $t\in [0,T]$ , and, letting $\varepsilon \to 0$ , we then would obtain $\rho _t\ge 0$ for a.e. $t\in [0,T]$ . By contradiction, we thus assume that a supersolution to (5.2), still denoted by $\tilde{\rho }$ , is such that there exists $\tau \in (0,T]$ with

(5.3) \begin{equation} \inf _{y\in{\mathbb{R}}^d}\tilde \rho _\tau (y) \lt 0. \end{equation}

Let $(\tau _k)_k\subset (0,T]$ be defined as $\tau _k = \tau + 1/k$ for all $k\gt 0$ large enough. By the time continuity of $\tilde \rho$ from $[0,T]$ to $L_\mu ^1({\mathbb{R}^{d}})$ , we know that, up to a subsequence, $\tilde \rho _{\tau _k} \to \tilde \rho _\tau$ pointwise as $k\to \infty$ . Furthermore, let $(x_n^t)_n$ be a minimising sequence for $\tilde \rho _t$ for all $t\in [0,T]$ . Then,

\begin{equation*} \tilde \rho _{\tau _k}(x_n^\tau ) \xrightarrow [k\to \infty ]{} \tilde \rho _\tau (x_n^\tau ) \xrightarrow [n\to \infty ]{} \inf _{y\in {\mathbb {R}}^d}\tilde \rho _\tau (y), \end{equation*}

and similarly, whenever $\tau \gt 0$ , for the sequence $(\tau^{\prime}_k)_k\subset (0,T]$ defined by $\tau^{\prime}_k = \tau - 1/k$ for all $k\gt 0$ large enough. Hence, the set $\Delta \subset (0,\infty )$ , given by

\begin{equation*}\Delta=\{\delta>0\,:\,\forall\ \ \ \ t\in\lbrack0,T\rbrack\cap(\tau-\delta,\tau+\delta),\ \ \ \ \ \underset{y\in\mathbb{R}^d} {\text{inf}} {\widetilde{\rho}}_t(y) \lt 0\},\end{equation*}

is nonempty, open, and $\delta _*\,:\!=\,\sup \Delta \gt 0$ . Moreover, $\delta _*\le \tau$ since, by assumption, $\tilde \rho _0\geq 0$ . Setting $\tau _* \,:\!=\, \tau - \delta _*\geq 0$ and $\tau ^* \,:\!=\, \min \{T,\tau + \delta _*\}$ , we have

\begin{equation*} \inf _{y\in {\mathbb {R}}^d} \tilde \rho _{\tau _*}(y) \geq 0 \qquad \text {and} \quad \inf _{y\in {\mathbb {R}}^d} \tilde \rho _t(y) \lt 0 \quad \text {for all $t\in (\tau _*,\tau ^*)$}. \end{equation*}

As consequence, for all $h\gt 0$ such that $\tau _*+h \lt \tau ^*$ , we have

\begin{equation*} \lim _{n\to \infty }\tilde \rho _{\tau _* +h}\left(x_n^{\tau _*+h}\right) \lt 0 \leq \lim _{n\to \infty } \tilde \rho _{\tau _*}(x_n^{\tau _*}) \leq \liminf _{n\to \infty } \tilde \rho _{\tau _*}\left(x_n^{\tau _*+h}\right), \end{equation*}

since $x_n^{\tau _*+h}$ is minimising for $\tilde{\rho }_{\tau _*+h}$ but not necessarily for $\tilde{\rho }_{\tau _*}$ , and so

(5.4) \begin{equation} \limsup _{n\to \infty } \left ( \tilde \rho _{\tau _* +h}\left(x_n^{\tau _*+h}\right) - \tilde \rho _{\tau _*}\left(x_n^{\tau _*+h}\right) \right ) \leq 0. \end{equation}

We find that, for $t_*=\tau _*+h$ ,

\begin{gather*} \limsup _{n\to \infty } \int _{\tau _*}^{\tau _*+h}\int _{{\mathbb{R}^{d}}\setminus \{x_n^{t_*}\}} v_{t_*}\!\left(x_n^{t_*},y\right)_- \left ({\tilde \rho _{t_*}\left(x_n^{t_*}\right) - \tilde \rho _{t_*}(y)}\right ) \eta \!\left(x_n^{t_*},y\right)\ \textrm{d} \mu (y)\ \textrm{d} t \leq 0,\\ \limsup _{n\to \infty } \int _{\tau _*}^{\tau _*+h}\!\tilde{\rho }_{t_*}\!\left(x_n^{t_*}\right)\left (a(t_*)+\int v_{t_*}\!\left(x_n^{t_*},y\right)\eta \!\left(x_n^{t_*},y\right)\ \textrm{d} \mu (y)\right )\ \textrm{d} t\leq 0. \end{gather*}

Integrating (5.2) between $(\tau _*,\tau _*+h)$ at $x=x_n^{\tau _*+h}$ and taking the $\liminf$ as $n\to \infty$ , we arrive at

\begin{equation*} \liminf _{n\to \infty }\left (\tilde {\rho }_{\tau _*+h}\left(x_n^{\tau _*+h}\right)- \tilde {\rho }_{\tau _*}\!\left(x_n^{\tau _*+h}\right)\right )\ge 0, \end{equation*}

which contradicts (5.4). Hence, the existence of $\tau$ such that (5.3) holds is false and every supersolution to (5.2) must be a.e. nonnegative, which concludes the proof.

We are also able to prove $L^p$ regularity of solutions for (NCE):

Proposition 5.3 ( $L^p$ regularity for (NCE)). Suppose that $\frac{\textrm{d}{\mu }}{\textrm{d}x}\in L^\infty ({\mathbb{R}}^d)$ and $\rho _0$ is nonnegative everywhere with $\rho _0\in L^p({\mathbb{R}^{d}})$ for some $p\in (1,\infty )$ . Consider any measurable pair $(\rho,v)\,:\,[0,T]\to L^1_{\mu,M}({\mathbb{R}^{d}})\times \mathcal{V}^{\textrm{as}}(G)$ satisfying (5.1), with $\Phi \equiv \Phi _{\textrm{Upwind}}$ . Assume that $\eta$ is translation invariant, that is,

(5.5) \begin{equation} {\eta (x+h,y+w)=\eta (x,y), \ for\ any\ (x,y), (x+h,y+w)\in G.} \end{equation}

Assume there exists a constant $C_v\gt 0$ such that $v\,:\,[0,T]\to \mathcal{V}^{\textrm{as}}(G)$ satisfies the following uniform translational bound:

(5.6) \begin{equation} \limsup _{\varepsilon \to 0} \int _0^T \sup _{y\in{\mathbb{R}^{d}}}\int _{{\mathbb{R}^{d}}} \sup _{h,w\in B_{\varepsilon }(0)} \left ({ v_t(x+h,y+w)_- \; \eta (x,y)}\right )^p \textrm{d} y \leq C_v. \end{equation}

Let $\rho$ be the solution to (NCE). Then, $\rho _t$ is a density with respect to the Lebesgue measure and $\rho _t\in L^1_{\mu,M}({\mathbb{R}^{d}})\cap L^p({\mathbb{R}^{d}})$ for all $t\in [0,T]$ . Furthermore, for all $t\in [0,T]$ , it holds

(5.7) \begin{equation} \sup _{t\in [0,T]} \lVert{\rho _t}\rVert _{L^p({\mathbb{R}^{d}})}^p \leq \left ({ \|\rho _0\|_{L^p({\mathbb{R}^{d}})}^p+\tilde{C}_v T}\right )\exp \!\left ({\frac{T}{q}}\right ), \end{equation}

with $\tilde{C}_v=\frac{C_v}{p}\left ({p M\lVert{\frac{\textrm{d}{\mu }}{\textrm{d}x}}\rVert _{L^\infty }}\right )^p$ .

Proof. Let $\nu$ be a standard mollifier, that is, a nonnegative and even function in $C^\infty _c({\mathbb{R}^{d}})$ (the set of smooth, compactly supported functions defined on $\mathbb{R}^{d}$ ) such that $\int _{\mathbb{R}^{d}}\nu \textrm{d} x=1$ and $supp\nu = B_1(0)\,:\!=\,\{x\in{\mathbb{R}^{d}} \,:\, \lVert{x}\rVert =1\}$ . Fix $\varepsilon \gt 0$ and write $\nu _\varepsilon = \varepsilon ^{-d} \nu ({\cdot}/\varepsilon )$ . Also, for any $z\in{\mathbb{R}}^d$ , define the translation operator $\tau ^z \colon{\mathbb{R}}^d\to{\mathbb{R}}^d$ by $\tau ^{z}(h) \,:\!=\, h - z$ . In particular, set the translated measures $\rho _t^{z}\,:\!=\,\tau ^{z}_\#\rho _t$ and $\mu ^z\,:\!=\,\tau ^z_\#\mu$ , where ${}_\#$ stands for the measure-theoretic pushforward. We use the following interplay between translation and convolution: for any $f\in C_b({\mathbb{R}^{d}})$ (the set of continuous and bounded functions defined on $\mathbb{R}^{d}$ ), we have $f*\nu _\varepsilon \in C_b^\infty ({\mathbb{R}^{d}})$ , that is, $f*\nu _\varepsilon \in C_b({\mathbb{R}^{d}})$ and $f*\nu _\varepsilon$ is smooth, and

\begin{align*} \iint _{{\mathbb{R}^{d}}\times{\mathbb{R}^{d}}} f(h)\nu _\varepsilon (z)\ \textrm{d} \rho _t^z(h)\ \textrm{d} z&=\iint _{{\mathbb{R}^{d}}\times{\mathbb{R}^{d}}} f(h-z)\nu _\varepsilon (z)\ \textrm{d} \rho _t(h)\ \textrm{d} z\\ &=\int _{\mathbb{R}^{d}} (\nu _\varepsilon *f)(h)\ \textrm{d} \rho _t(h)\\ &=\iint _{{\mathbb{R}^{d}}\times{\mathbb{R}^{d}}} \nu _\varepsilon (h-z)f(z)\ \textrm{d} z\ \textrm{d} \rho _t(h)\\ &=\iint _{{\mathbb{R}^{d}}\times{\mathbb{R}^{d}}} f(z)\nu _\varepsilon (z-h)\ \textrm{d} \rho _t(h)\ \textrm{d} z\\ &=\int _{{\mathbb{R}^{d}}} f(z)\rho _t^\varepsilon (z)\ \textrm{d} z. \end{align*}

Let $\rho ^\varepsilon =\rho *\nu _\varepsilon$ be the smoothed solution satisfying

\begin{equation*} \partial _t\rho ^\varepsilon _t+\left ({\overline \nabla \cdot F^\Phi }\right )*\nu _\varepsilon =0, \end{equation*}

where $((\overline \nabla \cdot F^\Phi )*\nu _\varepsilon )(x)=\int _{\mathbb{R}^{d}}\nu _\varepsilon (x-z)\ \textrm{d}\overline \nabla \cdot F^\Phi (z)$ for all $x\in{\mathbb{R}^{d}}$ . We can compute the time derivative of the $L^p$ norm of $\rho ^\varepsilon$ : for a.e. $t\in [0,T]$ , use (5.5) to get

\begin{align*} \frac{\textrm{d}}{\textrm{d}{t}}\int _{\mathbb{R}^{d}}|\rho ^\varepsilon _t|^p\textrm{d} x & = p \int _{\mathbb{R}^{d}}\rho ^\varepsilon _t(x)^{p-1}\partial _t\rho ^\varepsilon _t(x)\ \textrm{d} x\\ &=-p\int _{\mathbb{R}^{d}} \rho ^\varepsilon _t(x)^{p-1}\left ({\nu _\varepsilon *\overline \nabla \cdot F^\Phi }\right )(x)\ \textrm{d} x\\ &=\frac{p}{2}\iint _G\overline \nabla (\rho _t^\varepsilon )^{p-1}*\nu _\varepsilon \Phi \left ({\rho _t(x),\rho _t(y);\, v_t }\right )\eta \textrm{d} \mu (x)\ \textrm{d} \mu (y)\\ &=\frac{p}{2}\int _{\mathbb{R}^{d}} \iint _G\overline \nabla (\rho _t^\varepsilon )^{p-1}(x-z,y-z)\nu _\varepsilon (z) v_t(x,y)_+\eta (x,y)\ \textrm{d} \rho _t(x)\ \textrm{d} \mu (y)\ \textrm{d} z\\ &\quad - \frac{p}{2}\int _{\mathbb{R}^{d}} \iint _G\overline \nabla (\rho _t^\varepsilon )^{p-1}(x-z,y-z)\nu _\varepsilon (z) v_t(x,y)_-\eta (x,y)\ \textrm{d} \mu (x)\ \textrm{d} \rho _t(y)\ \textrm{d} z\\ &= - p\int _{\mathbb{R}^{d}} \iint _G\overline \nabla (\rho _t^\varepsilon )^{p-1}(x-z,y-z)\nu _\varepsilon (z) v_t(x,y)_-\eta (x,y)\ \textrm{d} \mu (x)\ \textrm{d} \rho _t(y)\ \textrm{d} z\\ &= - p\int _{\mathbb{R}^{d}} \iint _G \overline \nabla (\rho _t^\varepsilon )^{p-1}(h,w)\nu _\varepsilon (z) v_t(z + h,z + w)_-\eta (h,w)\ \textrm{d} \mu ^z(h) \ \textrm{d} \rho _t^z(w)\ \textrm{d} z\\ &\le p\int _{\mathbb{R}^{d}} \iint _G (\rho _t^\varepsilon )^{p-1}(h)\nu _\varepsilon (z) v_t(z + h,z + w)_-\eta (h,w)\ \textrm{d} \mu ^z(h) \ \textrm{d} \rho _t^z(w)\ \textrm{d} z\,= :\,I. \end{align*}

To estimate $I$ , we use the following variant of Young’s inequality: for $p\in (1,\infty )$ and $a,b\in (0,\infty )$ , there holds

(5.8) \begin{equation} a^{p-1} b \leq \frac{a^p}{q} +\frac{b^p}{p}, \quad \text{where } q= \frac{p}{p-1}. \end{equation}

Due to (5.6), for some $\varepsilon _0 \gt 0$ sufficiently small, for all $\varepsilon \in (0,\varepsilon _0)$ and a.e. $t\in [0,T]$ , the function $\overline v_t^\varepsilon \colon G \to{\mathbb{R}}$ , defined as

\begin{equation*} \overline v_t^\varepsilon (x,y) \,:\!=\, \sup _{h,w\in B_{\varepsilon }(0)} \left (v_t(x+h,y+w)\right )_-, \end{equation*}

satisfies, for some $C_v^{\varepsilon _0}\gt 0$ , the bound

\begin{equation*} \sup _{\varepsilon \in (0,\varepsilon _0)}\int _0^T \sup _{x\in {\mathbb {R}^{d}}}\int _{{\mathbb {R}^{d}}\setminus \{x\}} \left ({\overline v_t^\varepsilon (x,y) \eta (x,y)}\right )^p \textrm {d} y \leq C_v^{\varepsilon _0}. \end{equation*}

Using the notation above, Hölder’s inequality and (5.8), we get, for a.e. $t\in [0,T]$ ,

\begin{align*} I&\leq p \Big\lVert{\frac{\textrm{d}{\mu }}{\textrm{d}x}}\Big\rVert _{L^\infty } \int _{\mathbb{R}^{d}} (\rho _t^\varepsilon )^{p-1}(h) \int _{{\mathbb{R}^{d}}\setminus \{h\}} \int _{\mathbb{R}^{d}} \nu _\varepsilon (z) \overline v_t^\varepsilon (h,w) \eta (h,w)\ \textrm{d} \rho _t^z(w)\ \textrm{d} z\ \textrm{d} h \\ &\leq p \Big\lVert{\frac{\textrm{d}{\mu }}{\textrm{d}x}}\Big\rVert _{L^\infty } \left [ \left ({\int _{\mathbb{R}^{d}} \lvert{\rho _t^\varepsilon (h)}\rvert ^p \textrm{d}{h}}\right )^{\frac{p-1}{p}}\times \right .\\ & \left .\times \left ({\int _{\mathbb{R}^{d}} \left\lvert{\int _{{\mathbb{R}^{d}}\setminus \{h\}} \int _{\mathbb{R}^{d}} \nu _\varepsilon (z) \overline v_t^\varepsilon (h,w) \eta (h,w)\ \textrm{d} \rho _t^z(w)\ \textrm{d} z}\right\rvert ^p \textrm{d} h }\right )^\frac{1}{p}\right ]\\ &\leq p \Big\lVert{\frac{\textrm{d}{\mu }}{\textrm{d}x}}\Big\rVert _{L^\infty } \lVert{\rho _t^\varepsilon }\rVert _{L^p}^{p-1} \left ({ \int _{\mathbb{R}^{d}} \left\lvert{ \sup _{w\in{\mathbb{R}}^d} \overline v_t^\varepsilon (h,w) \eta (h,w) \int _{{\mathbb{R}^{d}}\setminus \{h\}} \int _{\mathbb{R}^{d}} \nu _\varepsilon (z)\ \textrm{d} \rho _t^z(w)\ \textrm{d} z}\right\rvert ^p\ \textrm{d} h}\right )^{\frac{1}{p}} \\ &\le \frac{1}{q}\Big\lVert{\rho _t^\varepsilon }\Big\rVert _{L^p}^{p}+\frac{1}{p}\left ({ p \Big\lVert{\frac{\textrm{d}{\mu }}{\textrm{d}x}}\Big\rVert _{L^\infty } \rho _0[{\mathbb{R}^{d}}]}\right )^p\sup _{w\in{\mathbb{R}}^d} \int _{\mathbb{R}^{d}} \lvert{\overline v_t^\varepsilon (h,w) \eta (h,w)}\rvert ^p \textrm{d} h. \end{align*}

In turn, we infer

\begin{equation*} \sup _{t\in [0,T]} \lVert {\rho _t^\varepsilon }\rVert _{L^p}^p \leq \left ({ \|\rho _0\|_{L^p({\mathbb {R}^{d}})}^p+\tilde {C}_vT}\right )\exp \!\left ({\frac {T}{q}}\right ), \end{equation*}

where $\tilde{C}_v=\frac{C_v^{\varepsilon _0}}{p}\left ({p \lVert{\frac{\textrm{d}{\mu }}{\textrm{d}x}}\rVert _{L^\infty }\rho _0[{\mathbb{R}^{d}}]}\right )^p$ . The above inequality ends the proof since, up to a subsequence, we deduce $\rho _t^\varepsilon \rightharpoonup \rho _t$ in $L^p({\mathbb{R}^{d}})$ for any $t\in [0,T]$ , and the stability estimate (5.7) follows by sending $\varepsilon$ to $0$ above.

Remark 5.4. It is common in applications to data science to consider $\eta (x,y)=\eta (x-y)$ instead of (5.5), which is a reformulation of translation invariance. For this reason, since the graph is translation invariant, we can think of the assumption in 5.6 as an $L^p$ control for the velocity field over connected vertices, uniformly with respect to translations.


The authors are deeply grateful to Prof. Dejan Slepčev (Carnegie Mellon University) for many enlightening discussions on the contents of the manuscript. Furthermore, they would like to thank the anonymous reviewers for their valuable comments on the article. AE was supported by the Advanced Grant Nonlocal-CPD (Nonlocal PDEs for Complex Particle Dynamics: Phase Transitions, Patterns and Synchronization) of the European Research Council Executive Agency under the European Union’s Horizon 2020 research and innovation programme (grant agreement No. 883363). A considerable part of this work was carried out while AE was a postdoc at FAU Erlangen-Nürnberg. AE gratefully acknowledge support by the German Science Foundation (DFG) through CRC TR 154 ‘Mathematical Modelling, Simulation and Optimization Using the Example of Gas Networks’. AS is supported by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany’s Excellence Strategy EXC 2044 – 390685587, Mathematics Münster: Dynamics–Geometry–Structure.

Conflict of interests



Ambrosio, L. (2004) Transport equation and Cauchy problem for $BV$ vector fields. Invent. Math. 158 (2), 227260.CrossRefGoogle Scholar
Bailo, R., Carrillo, J. A. & Hu, J. (2020) Fully discrete positivity-preserving and energy-dissipating schemes for aggregation-diffusion equations with a gradient-flow structure. Commun. Math. Sci. 18 (5), 12591303.CrossRefGoogle Scholar
Bailo, R., Carrillo, J. A., Murakawa, H. & Schmidtchen, M. (2020) Convergence of a fully discrete and energy-dissipating finite-volume scheme for aggregation-diffusion equations. Math. Models Methods Appl. Sci. 30 (13), 24872522.CrossRefGoogle Scholar
Belkin, M. & Niyogi, P. (2002) Laplacian eigenmaps for dimensionality reduction and data representation. Neural Comput. 15 (6), 13731396.CrossRefGoogle Scholar
Benamou, J.-D. & Brenier, Y. (2000) A computational fluid mechanics solution to the Monge–Kantorovich mass transfer problem. Numer. Math. 84 (3), 375393.CrossRefGoogle Scholar
Boyer, F. (2012) Analysis of the upwind finite volume method for general initial- and boundary-value transport problems. IMA J. Numer. Anal. 32 (4), 14041439.CrossRefGoogle Scholar
Cancès, C., Gallouët, T. O. & Todeschi, G. (2020) A variational finite volume scheme for Wasserstein gradient flows. Numer. Math. 146 (3), 437480.CrossRefGoogle Scholar
Carrillo, J. A., Chertock, A. & Huang, Y. (2015) A finite-volume method for nonlinear nonlocal equations with a gradient flow structure. Commun. Comput. Phys. 17 (1), 233258.CrossRefGoogle Scholar
Craig, K., Trillos, N. Garcia & Slepčev, D. (2022). Clustering Dynamics on Graphs: From Spectral Clustering to Mean Shift Through Fokker–Planck Interpolation. Springer International Publishing, Cham, pp. 105151.Google Scholar
DiPerna, R. J. & Lions, P.-L. (1989) Ordinary differential equations, transport theory and Sobolev spaces. Invent. Math. 98 (3), 511547.CrossRefGoogle Scholar
Du, Q., Huang, Z. & LeFloch, P. G. (2017) Nonlocal conservation laws. A new class of monotonicity-preserving models. SIAM J. Numer. Anal. 55 (5), 24652489.10.1137/16M1105372CrossRefGoogle Scholar
Esposito, A., Gvalani, R. S., Schlichting, A. & Schmidtchen, M. (2021) On a novel gradient flow structure for the aggregation equation. Preprint arXiv:2112.08317 .Google Scholar
Esposito, A., Patacchini, F. S., Schlichting, A. & Slepcev, D. (2021) Nonlocal-interaction equation on graphs: gradient flow structure and continuum limit. Arch. Ration. Mech. Anal. 240 (2), 699760.CrossRefGoogle Scholar
Forkert, D., Maas, J. & Portinale, L. (2022) Evolutionary $\Gamma$ -convergence of entropic gradient flow structures for Fokker–Planck equations in multiple dimensions. SIAM J. Math. Anal. 54 (4), 42974333.10.1137/21M1410968CrossRefGoogle Scholar
Gladbach, P., Kopfer, E., Maas, J. & Portinale, L. (2020) Homogenisation of one-dimensional discrete optimal transport. J. Math. Pures Appl. 139: 204234.CrossRefGoogle Scholar
Gladbach, P., Kopfer, E., Maas, J. & Portinale, L. (2021) Homogenisation of dynamical optimal transport on periodic graphs. Preprint arXiv:2110.15321 .Google Scholar
Heinze, G., Pietschmann, J.-F. & Schmidtchen, M. (2023) Nonlocal cross-interaction systems on graphs: energy landscape and dynamics. Kinet. Relat. Model.CrossRefGoogle Scholar
Heinze, G., Pietschmann, J.-F. & Schmidtchen, M. Nonlocal cross-interaction systems on graphs: nonquadratic finslerian structure and nonlinear mobilities. Preprint arXiv:2107.11289 .Google Scholar
Hraivoronska, A. & Tse, O. (2022) Diffusive limit of random walks on tessellations via generalized gradient flows. Preprint arXiv:2202.06024 .Google Scholar
Kannan, R., Vempala, S. & Vetta, A. (2004) On clusterings: good, bad and spectral. J. ACM 51 (3), 497515.CrossRefGoogle Scholar
Ng, A. Y., Jordan, M. I. & Weiss, Y. (2001). On spectral clustering: analysis and an algorithm. In Adv. Neural Inf. Process. Syst., MIT Press, Cambridge MA, pp. 849856.Google Scholar
Ohta, S.-I. & Sturm, K.-T. (2009) Heat flow on Finsler manifolds. Comm. Pure Appl. Math. 62 (10), 13861433.CrossRefGoogle Scholar
Ohta, S.-I. & Sturm, K.-T. (2012) Non-contraction of heat flow on Minkowski spaces. Arch. Ration. Mech. Anal. 204 (3), 917944.CrossRefGoogle Scholar
Peletier, M. A., Rossi, R., Savaré, G. & Tse, O. (2022) Jump processes as generalized gradient flows. Calc. Var. Partial Differ. Equ. 61 (1), 33.CrossRefGoogle Scholar
Peletier, M. A. & Schlichting, A. (2022) Cosh gradient systems and tilting. Nonlinear Anal., 231, 113094.CrossRefGoogle Scholar
Roith, T. & Bungert, L. (2023) Continuum limit of lipschitz learning on graphs. Found. Comput. Math. 23, 393–431.10.1007/s10208-022-09557-9CrossRefGoogle Scholar
Schlichting, A. & Seis, C. (2022) The Scharfetter-Gummel scheme for aggregation-diffusion equations. IMA J. Numer. Anal. 42(3), 23612402.CrossRefGoogle Scholar
Trillos, N. Garc. & Slepčev, D. (2016) Continuum limit of total variation on point clouds. Arch. Ration. Mech. Anal. 220 (1), 193241.CrossRefGoogle Scholar
Trillos, N. Garc. & Slepčev, D. (2018) A variational approach to the consistency of spectral clustering. Appl. Comput. Harmon. Anal. 45 (2), 239281.CrossRefGoogle Scholar