A phase-space approach to weighted Fourier extension inequalities

Jonathan Bennett; Susana Gutiérrez; Shohei Nakamura; Itamar Oliveira

doi:10.1017/fms.2025.10127

A phase-space approach to weighted Fourier extension inequalities

Part of: Harmonic analysis in several variables Optics, electromagnetic theory - General Integral transforms, operational calculus

Published online by Cambridge University Press: 03 November 2025

Shohei Nakamura and

Jonathan Bennett: Affiliation:
University of Birmingham , United Kingdom; E-mail: j.bennett@bham.ac.uk
Susana Gutiérrez: Affiliation:
University of Birmingham , United Kingdom; E-mail: s.gutierrez@bham.ac.uk
Shohei Nakamura: Affiliation:
University of Birmingham , United Kingdom; E-mail: s.nakamura@bham.ac.uk Department of Mathematics, Graduate School of Science, Osaka University, Toyonaka, Japan
Itamar Oliveira*: Affiliation:
University of Birmingham , United Kingdom
*: E-mail: i.oliveira@bham.ac.uk (Corresponding author)

Article contents

Abstract
Introduction
The paraboloid: a quantum mechanical viewpoint
The sphere: an optical viewpoint
General submanifolds: a geometric viewpoint
Estimating distances: the proof of Proposition
Computing Jacobians: the proof of Proposition
Surface-carried fractional integrals
Surface-carried maximal operators
Tomographic constructions
Applications to a variant of Flandrin’s conjecture
Questions
Competing interests
Funding statement
References

Abstract

The purpose of this paper is to expose and investigate natural phase-space formulations of two longstanding problems in the restriction theory of the Fourier transform. These problems, often referred to as the Stein and Mizohata–Takeuchi conjectures, assert that Fourier extension operators associated with rather general (codimension 1) submanifolds of Euclidean space may be effectively controlled by the classical X-ray transform via weighted $L^2$ inequalities. Our phase-space formulations, which have their origins in recent work of Dendrinos, Mustata and Vitturi expose close connections with a conjecture of Flandrin from time-frequency analysis, and rest on the identification of an explicit ‘geometric’ Wigner transform associated with an arbitrary (smooth strictly convex) submanifold S of $\mathbb {R}^n$. Our main results are certain natural ‘Sobolev variants’ of the Stein and Mizohata–Takeuchi conjectures and involve estimating the Sobolev norms of such Wigner transforms by geometric forms of classical bilinear fractional integrals. Our broad geometric framework allows us to explore the role of the curvature of the submanifold in these problems, and in particular we obtain bounds that are independent of any lower bound on the curvature; a feature that is uncommon in the wider restriction theory of the Fourier transform. Finally, we provide a further illustration of the effectiveness of our analysis by establishing a form of Flandrin’s conjecture in the plane with an $\varepsilon $-loss. While our perspective comes primarily from Euclidean harmonic analysis, the procedure used for constructing phase-space representations of extension operators is well-known in optics.

MSC classification

Primary: 42B10: Fourier and Fourier-Stieltjes transforms and other transforms of Fourier type

Secondary: 44A12: Radon transform 78A05: Geometric optics

Information

Type: Analysis
Information: Forum of Mathematics, Sigma , Volume 13 , 2025 , e181

DOI: https://doi.org/10.1017/fms.2025.10127 [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright: © The Author(s), 2025. Published by Cambridge University Press

1 Introduction

1.1 Background: the Stein and Mizohata–Takeuchi problems

A central objective of modern harmonic analysis is to reach an effective quantitative understanding of Fourier transforms of measures supported on submanifolds of Euclidean space, such as the sphere or paraboloid. Problems of this type are usually formulated in terms of Fourier extension operators: to a smooth codimension-1 submanifold S of $\mathbb {R}^n$ , equipped with surface measure $\mathrm {d}\sigma $ , we associate the extension operator

(1.1)

$$ \begin{align} \widehat{g\mathrm{d}\sigma}(x):=\int_S g(u)e^{-2\pi ix\cdot u}\mathrm{d}\sigma(u); \end{align} $$

here $g\in L^1(\mathrm {d}\sigma )$ and $x\in \mathbb {R}^n$ . The extension operator (1.1) is often referred to as an adjoint restriction operator, as its adjoint restricts the n-dimensional Fourier transform of a function to the submanifold S. The estimation of extension operators in various settings is known as (Fourier) restriction theory. A key instance of this is the celebrated restriction conjecture, which concerns bounds of the form $\|\widehat {g\mathrm {d}\sigma }\|_q\lesssim \|g\|_p$ . Surprisingly many problems from across mathematics call for such an understanding, from dispersive PDE to analytic number theory; see [Reference Stovall50] for a recent survey. Such connections are often quite intimate, as hopefully this paper serves to illustrate – in this case with regard to optics, or optical field propagation.

In this paper we look to estimate extension operators in the setting of $L^2$ norms with respect to general weight functions w. This setting has been the subject of some attention since the influential work of Stein and others in the 1970s in the closely related context of Bochner–Riesz summability. At its centre is a variant of a question posed by Stein in the 1978 Williamstown conference on harmonic analysis [Reference Stein47] (see [Reference Bennett, Carbery, Soria and Vargas11] for further historical context). In its global form, for a given S, this asks whether there is a constant $C<\infty $ for which

(1.2)

$$ \begin{align} \int_{\mathbb{R}^n}|\widehat{g\mathrm{d}\sigma}(x)|^2w(x)\mathrm{d}x\leq C\int_S|g(u)|^2\sup_{v\in T_uS}Xw(N(u),v)\mathrm{d}\sigma(u) \end{align} $$

for all nonnegative weight functions w. Here $N:S\rightarrow \mathbb {S}^{n-1}$ is the Gauss map, and X denotes the classical X-ray transform

(1.3)

$$ \begin{align} Xw(\omega,v):=\int_{-\infty}^\infty w(v+t\omega)\mathrm{d}t, \end{align} $$

where $\omega \in \mathbb {S}^{n-1}$ and $v\in \langle \omega \rangle ^\perp $ together parametrise the Grassmannian manifold of lines $\ell =\ell (\omega ,v):=\langle \omega \rangle +\{v\}$ in $\mathbb {R}^n$ ; here $T_uS=\langle N(u)\rangle ^\perp $ denotes the tangent space of S at the point $u\in S$ . This is a natural inequality for a number of reasons, and it is instructive to begin by considering the simple case where g is the indicator function of a small cap (the intersection of S with a small ball in $\mathbb {R}^n$ ). The key observation is that $|\widehat {g\mathrm {d}\sigma }|^2$ is then bounded below on a neighbourhood of a line segment with direction normal to S, so that the left-hand side of (1.2) computes a variant of the X-ray transform of the weight w. The inequality (1.2) therefore proposes that $|\widehat {g\mathrm {d}\sigma }|^2$ concentrates on lines, or families of lines, rather more generally. An affirmative answer to this question is easily given in the case that S is contained in a hyperplane – a fact that follows quickly from Plancherel’s theorem. More substantial results in support of (1.2) have been obtained for restricted classes of weights, notably when S is the sphere $\mathbb {S}^{n-1}$ and the weights are radial [Reference Carbery, Romera and Soria22, Reference Barceló, Ruiz and Vega5, Reference Barceló, Bennett and Carbery4]; see also [Reference Bennett, Nakamura and Shiraki15] and the references there. Inequalities of this general type, where an operator is estimated with respect to a general weight function, are often referred to as Fefferman–Stein inequalities – see [Reference Beltran6] for a recent example. We shall also be interested in the simpler Mizohata–Takeuchi inequality

(1.4)

$$ \begin{align} \int_{\mathbb{R}^n}|\widehat{g\mathrm{d}\sigma}(x)|^2w(x)\mathrm{d}x\leq C\|g\|_{L^2(\mathrm{d}\sigma)}^2\sup_{(u,v)\in TS}Xw(N(u),v), \end{align} $$

where the supremum is restricted to $u\in \mathrm {supp\,}(g)$ , as suggested by (1.2), and $TS$ denotes the tangent bundle of S. This emerged independently of (1.2) through work of Mizohata and Takeuchi on the well-posedness of Schrödinger equations in the 1980s. We refer to [Reference Barceló, Ruiz and Vega5] and the references there for further context.

Remark 1.1 (The strength of (1.2)).

The original motive for establishing (1.2), or some appropriate variant of it, is that it would allow the restriction conjecture to follow (and almost immediately) from the Kakeya maximal function conjecture, the Kakeya maximal function being a close relative of

$$ \begin{align*}\sup_{v\in T_uS}Xw(N(u),v), \end{align*} $$

at least when S is suitably curved. We refer to [Reference Bennett and Nakamura14] and the references there for further details and discussion. In the original setting proposed by Stein, this amounts to the implication of the Bochner–Riesz conjecture from the Nikodym (or Kakeya) maximal conjecture. There is a number of precedents for this sort of integro-geometric control of oscillatory integral operators – see, for example, [Reference Bennett, Carbery, Soria and Vargas11, Reference Beltran and Bennett7].

Remark 1.2 (Failure of the global inequalities (1.2) and (1.4)).

Very recently, and since earlier drafts of this paper, Cairo [Reference Cairo18] has succeeded in constructing a counterexample to (1.4) (and thus (1.2)) whenever S is not contained in a hyperplane. However, her subtle example does not exclude the possibility that the local variants

(1.5)

$$ \begin{align} \int_{B(0,R)}|\widehat{g\mathrm{d}\sigma}(x)|^2w(x)\mathrm{d}x\lesssim R^\alpha\int_S|g(u)|^2\sup_{v\in T_uS}Xw(N(u),v)\mathrm{d}\sigma(u) \end{align} $$

and

(1.6)

$$ \begin{align} \int_{B(0,R)}|\widehat{g\mathrm{d}\sigma}(x)|^2w(x)\mathrm{d}x\lesssim R^\alpha\|g\|_{L^2(\mathrm{d}\sigma)}^2\sup_{(u,v)\in TS}Xw(N(u),v) \end{align} $$

of (1.2) and (1.4) (resp.) might hold for exponents $\alpha>0$ ; here R denotes a large parameter. That (1.5) (and thus (1.6)) holds for some $\alpha>0$ is an elementary exercise, and we refer to [Reference Carbery, Iliopoulou and Wang21] for recent local results of this type. In order to be meaningful for general w the inequalities (1.2) and (1.4) may therefore be qualified with the additional assumption that w is supported in a ball of fixed radius $R\gg 1$ , accepting some growth in R in the constant factors. We clarify that such considerations are not relevant to the results presented in this paper.

Remark 1.3 (The role of curvature).

Somewhat unusually in the setting of Fourier extension estimates it appears that the above Stein and Mizohata–Takeuchi-type inequalities should not require that S has nonvanishing curvature; we have already noted that (1.2) is easily verified when S is a hyperplane. Related to this fact is the observation that (1.2) and (1.4) are dilation invariant in the sense that their validity for a given S and a given (dilation-invariant) class of weights, implies their validity for any isotropic dilate $kS$ of S, uniformly in $k>0$ ; this follows by a routine scaling argument. This scale-invariance is important in applications, as may be seen in the established setting of the sphere and radial weights – see [Reference Barceló, Ruiz and Vega5].

1.2 Phase-space formulations

Recently in the setting of quadratic submanifolds, Dendrinos, Mustata and Vitturi [Reference Dendrinos, Mustata and Vitturi24] observed that the Mizohata–Takeuchi inequality (1.4) may be reformulated in terms of the classical Wigner distribution, providing it with a natural phase-space interpretation. The purpose of this paper is to establish and explore such phase-space formulations of the Stein and Mizohata–Takeuchi inequalities for quite general (codimension-1) submanifolds, exposing the role played by the underlying geometry. The starting point is the surprising observation that a rather general Fourier extension operator (in modulus square) has a natural and explicit phase-space representation, namely,

(1.7)

$$ \begin{align} |\widehat{g\mathrm{d}\sigma}|^2=X_S^*W_S(g,g); \end{align} $$

see the forthcoming Proposition 4.8. Here $W_S(g_1,g_2):TS\rightarrow \mathbb {R}$ is a certain geometric (or S-carried) Wigner transform, and $X_S$ is the pullback of the X-ray transform by the Gauss map; concretely,

$$ \begin{align*}X_Sw(u,v):=Xw(N(u),v)\end{align*} $$

for $(u,v)\in TS$ . Such phase-space representations have their origins in quantum mechanics in the case that S is the paraboloid – a perspective that we develop in Section 2. They are also well-known in optics, particularly when S is the paraboloid or the sphere, and we develop this perspective in Section 3. As we shall see in the later sections, identifying a suitable Wigner transform $W_S$ explicitly in terms of the geometry of a general (strictly convex) submanifold S requires some careful geometric analysis. This is one of the main achievements of this paper, and it is hoped that it will also find some interesting applications beyond harmonic analysis. From the point of view of harmonic analysis, our treatment of these surface-carried Wigner transforms naturally involves controlling associated surface-carried singular integral and maximal averaging operators, which we hope will be of some independent interest.

By duality the representation (1.7) immediately gives rise to the phase-space integral formula

(1.8)

$$ \begin{align} \int_{\mathbb{R}^n}|\widehat{g\mathrm{d}\sigma}(x)|^2w(x)\mathrm{d}x=\int_{TS}W_S(g,g)(u,v)X_Sw(u,v)\mathrm{d}v\mathrm{d}\sigma(u), \end{align} $$

leading to phase-space formulations of the Stein and Mizohata–Takeuchi problems. Here the integral on the tangent bundle $TS$ is defined in the usual way, by first integrating with respect to Lebesgue measure on the tangent space $T_uS$ , and then with respect to surface measure $\mathrm {d}\sigma (u)$ on S.

Remark 1.4 (Connections with Flandrin’s conjecture).

The phase-space formulation of the Mizohata–Takeuchi problem has striking similarities with a conjecture of Flandrin [Reference Flandrin27] and its variants [Reference Lerner37] in the setting of the classical Wigner transform W. A recent form of this conjecture states that

(1.9)

$$ \begin{align} \iint_KW(g,g)\lesssim\|g\|_2^2 \end{align} $$

uniformly over all convex subsets K of phase-space; this was originally formulated by Flandrin with constant $1$ , although a counterexample to this stronger statement was constructed recently in [Reference Delourme, Duyckaerts and Lerner25]. The methods of this paper are also effective here, and we illustrate this in Section 10, establishing a form of this conjecture in the plane involving an $\varepsilon $ -loss in the measure of K and by establishing that the Flandrin-type conjecture (1.9) implies the parabolic Mizohata–Takeuchi inequality under a simple convexity assumption on the weight function w.

Remark 1.5 (Connections to maximally modulated singular integrals).

The Flandrin-type conjecture (1.9) in the plane (and thus (1.2) and (1.4)) is also intimately connected to boundedness questions for the maximally modulated bilinear Hilbert transform

$$ \begin{align*}H_*(f_1,f_2)(x):=\sup_{\lambda\in\mathbb{R}}\left|\int_{\mathbb{R}}f_1\left(x+\frac{y}{2}\right)f_2\left(x-\frac{y}{2}\right)e^{i\lambda y}\frac{dy}{y}\right|. \end{align*} $$

We refer to Section 10 for details.

Evidently the phase-space formula (1.8) goes some way to motivate the original inequalities (1.2) and (1.4). The first remark to make is that the most naive use of (1.8) is easily seen to fail for any S through the observation that the $L^1$ estimate

(1.10)

$$ \begin{align}\int_{T_uS}|W_S(g,g)(u,v)|\mathrm{d}v\lesssim |g(u)|^2, \end{align} $$

fails, despite $W_S(g,g)$ satisfying the marginal property

(1.11)

$$ \begin{align}\int_{T_u S}W_S(g,g)(u,v)\mathrm{d}v=|g(u)|^2 \end{align} $$

(possibly under some additional minor regularity assumption on S); see Section 8 for details, along with the sense in which such pointwise identities hold. Of course if $X_Sw(u,v)$ is independent of v, then the failure of (1.10) is of no consequence, and (1.2) follows quickly from an application of Fubini’s theorem and (1.11).

Our explicit phase-space representation (1.7) requires rather little of the submanifold S. The main assumption is that S is smooth and strictly convex in the sense that its shape operator is strictly positive definite at all points. On a technical level we also assume that its set of unit normals $N(S)$ is geodesically convex (i.e., the intersection of $N(S)$ with any great circle is connected), along with a mild additional differentiability hypothesis (see Remark 4.2), which we expect to be automatic from the smoothness of S.

For the purposes of our phase-space approach to the Stein (1.2) and Mizohata–Takeuchi inequalities (1.4), it will be convenient to restrict further to compact graphs. The assumption that S is a graph is a very mild assumption as the Stein and Mizohata–Takeuchi inequalities (and their variants) behave well under partitioning a manifold S into boundedly many pieces. This allows us to extend our results a posteriori to closed manifolds such as the sphere, for example. With this in mind we make the additional (technical) assumption that

(1.12)

$$ \begin{align} N(u)\cdot N(u')\geq \frac{1}{2}\;\;\text{ for all }u,u'\in S, \end{align} $$

meaning that the normals to S lie in a cone of some fixed aperture.

As indicated in Remark 1.3, it is not anticipated that the discussed Stein and Mizohata–Takeuchi inequalities have a quantitative dependence on any lower bound on the curvature of S, and our results in this paper reflect this. Identifying this feature is one of the reasons why we have insisted on making our analysis as geometric (or parametrisation-free) as possible. Curiously, while our bounds do not depend on the curvature of S in absolute terms, as we shall see, certain dilation-invariant curvature functionals naturally emerge. For example, for curves in the plane our Stein-type inequality may be controlled by the quantity

(1.13)

$$ \begin{align} \Lambda(S):=\sup_{u,u'\in S}\left(\frac{|u'-u"|K(u)}{|N(u')\wedge N(u")|}\right)^{1/2}, \end{align} $$

where $K(u)$ denotes the Gaussian curvature of S at the point u, and $u"$ is a certain point on S constructed geometrically from points $u,u'\in S$ (we refer to Section 4 for details). However, in this paper we shall formulate our main results in terms of a relatively simple curvature functional related to the quasi-conformality of the shape operator of S. This has the advantage of being effective in both the Stein and Mizohata–Takeuchi settings, and in all dimensions. To describe this it is helpful to again begin with the case $n=2$ , where we shall say that a strictly convex planar curve S has bounded curvature quotient if there exists a finite constant c such that

(1.14)

$$ \begin{align} K(u)\leq cK(u') \end{align} $$

for all $u, u'\in S$ . Let us denote by $Q(S)$ the least such c. We extend this to higher dimensions by defining $Q(S)$ to be the maximum ratio of the principal curvatures of S, namely the smallest constant c such that

(1.15)

$$ \begin{align} \lambda_j(u)\leq c\lambda_{k}(u') \end{align} $$

for all $u, u'\in S$ and $1\leq j,k\leq n-1$ , where $\lambda _j(u)$ denotes the jth principal curvature of S at the point u. Evidently $Q(kS)=Q(S)$ for all isotropic dilates $kS$ of S – a natural property in this setting as we have indicated in Remark 1.3.

Remark 1.6 (Relation to shape quasi-conformality).

The finiteness of $Q(S)$ may be interpreted as a certain rather strong quasi-conformality condition on the shape operator $\mathrm {d}N$ of S. Indeed it quickly implies that the shape operator is $Q(S)$ -quasi-conformal, that is

$$ \begin{align*}\|\mathrm{d}N_u\|^{n-1}\leq Q(S)^{n-2}K(u)\;\;\text{ for all }u\in S;\end{align*} $$

see, for example, [Reference Ahlfors2] for a treatment of quasi-conformal maps. This simply follows from the fact that the principal curvatures of S are the eigenvalues of the shape operator. Arguing very similarly we see that the finiteness of $Q(S)$ also implies the ‘long-range’ quasi-conformality condition

(1.16)

$$ \begin{align} \|\mathrm{d}N_u\|^{n-1}\leq Q(S)^{n-1}K(u')\;\;\text{ for all }u, u'\in S, \end{align} $$

which has the advantage of having content also when $n=2$ , where it reduces to (1.14). This latter condition is actually equivalent to S having bounded curvature quotient even in higher dimensions, since (1.16) $\implies $ (1.15) with $c=Q(S)^{n-1}$ .

Our main theorems are the following Sobolev variants of the Stein and Mizohata–Takeuchi inequalities (stated somewhat informally for the sake of exposition – see the forthcoming Theorems 4.11 and 4.13 for clarification):

Theorem 1.7 (Sobolev–Stein inequality).

Suppose that S is a smooth strictly convex surface with curvature quotient $Q(S)$ , and $s<\frac {n-1}{2}$ . Then there is a dimensional constant c such that

(1.17)

$$ \begin{align} \int_{\mathbb{R}^n}|\widehat{g\mathrm{d}\sigma}(x)|^2w(x)\mathrm{d}x\leq cQ(S)^{\frac{5n-8}{4}}\int_S I_{S,2s}(|g|^2,|g|^2)(u)^{1/2}\|X_Sw(u,\cdot)\|_{\dot{H}^s(T_u S)}\mathrm{d}\sigma(u), \end{align} $$

where $I_{S,s}$ is a certain bilinear fractional integral on S of order s, and $\dot {H}^s(T_u S)$ denotes the usual homogeneous $L^2$ Sobolev space on the tangent space $T_u S$ .

Theorem 1.8 (Sobolev–Mizohata–Takeuchi inequality).

Suppose that S is a smooth strictly convex surface with curvature quotient $Q(S)$ , and $s<\frac {n-1}{2}$ . Then there is a constant c, depending on at most n, s, and the diameter of S, such that

(1.18)

$$ \begin{align} \int_{\mathbb{R}^n}|\widehat{g\mathrm{d}\sigma}(x)|^2w(x)\mathrm{d}x\leq cQ(S)^{\frac{9n-12}{4}}\|g\|_{L^2(S)}^2\sup_{u\in S}\|X_Sw(u,\cdot)\|_{\dot{H}^s(T_u S)}. \end{align} $$

Remark 1.9. While the constant in (1.17) does not depend on the Sobolev index s, the restriction $s<\frac {n-1}{2}$ is imposed in order to ensure that the kernel of the fractional integral operator $I_{S,2s}$ is a locally integrable function; see the statement of Theorem 4.11 for clarification. We refer to Remark 1.12 for the optimality of the threshold $\frac {n-1}{2}$ .

Remark 1.10 (Improved constants).

It is not expected that the particular powers of $Q(S)$ featuring in Theorems 1.7 and 1.8 are best-possible, at least in dimensions $n>2$ . Moreover, and as we have already indicated, the curvature quotient $Q(S)$ does not capture all of the relevant geometry of the surface S. For example, in the relatively simple two-dimensional setting our arguments reveal that the power of $Q(S)$ in Theorem 1.7 may be replaced by the smaller quantity $\Lambda (S)$ in (1.13). It is straightforward to see that $\Lambda (S)$ may be finite when S has a point of vanishing curvature, such as in the case of the quartic curve $S=\{(t,t^4):|t|\leq 1\}$ . We refer to Section 4.4 for more on this.

Remark 1.11 (Permissibility of signed weights).

Our proofs of Theorems 1.7 and 1.8 reveal that they continue to hold for signed weights w. This marks an essential difference between these theorems and the original Stein and Mizohata–Takeuchi problems.

Remark 1.12 (The strength of Theorem 1.7).

As we clarify in Section 4, Theorems 1.7 and 1.8 (when specialised to non-negative weights w) are easily seen to be formally weaker than the global Stein and Mizohata–Takeuchi inequalities (1.2) and (1.4) respectively (as we have commented in Remark 1.2, the latter were recently shown to fail as stated in [Reference Cairo18]). This follows via a standard Sobolev embedding and, as may be expected, the range $s<\frac {n-1}{2}$ is best-possible in this respect. Despite its weakness relative to the Stein inequality, the Sobolev–Stein inequality (1.17) continues to be effective in transferring estimates for the X-ray transform to Fourier extension estimates, particularly in two dimensions. To see this, let $\theta \in \mathbb {R}$ and write

$$ \begin{align*}\|(-\Delta)^{\frac{\theta}{2}}|Eg|^2\|_2^2=\int_{\mathbb{R}^n}|Eg|^2w, \end{align*} $$

where $w=(-\Delta )^\theta |Eg|^2$ . By Theorem 1.7 (noting Remark 1.11) and the Cauchy–Schwarz inequality,

$$ \begin{align*}\|(-\Delta)^{\frac{\theta}{2}}|Eg|^2\|_2^2\lesssim\|I_{S,2s}(|g|^2,|g|^2)\|_{L^1(S)}^{\frac{1}{2}}\|(-\Delta)^{\frac{s}{2}}X_S((-\Delta)^{\theta}|Eg|^2)\|_{L^2(TS)} \end{align*} $$

whenever $s<\frac {n-1}{2}$ . By our forthcoming bounds on $I_{S,s}$ (see Section 7, and in particular (7.1)),

(1.19)

$$ \begin{align} \|I_{S,2s}(|g|^2,|g|^2)\|_{L^1(S)}^{\frac{1}{2}}\lesssim \|g\|_4^2. \end{align} $$

Next, since S is strictly convex its Gauss map is injective, and hence by a change of variables followed by the isometric property of the X-ray transform,

$$ \begin{align*}\|K(u)^{\frac{1}{2}}(-\Delta_v)^{\frac{1}{4}}X_Sw\|_{L^2(TS)}\leq\|(-\Delta_v)^{\frac{1}{4}}Xw\|_2=c_n\|w\|_{L^2(\mathbb{R}^n)}.\end{align*} $$

Therefore, provided S has everywhere nonvanishing Gaussian curvature it follows that

$$ \begin{align*} \begin{aligned} \|(-\Delta)^{\frac{s}{2}}X_S((-\Delta)^{\theta}|Eg|^2)\|_{L^2(TS)}&=\|(-\Delta)^{\frac{1}{4}}X_S((-\Delta)^{\theta+\frac{s}{2}-\frac{1}{4}}|Eg|^2)\|_{L^2(TS)}\\&\lesssim \|K(u)^{1/2}(-\Delta)^{1/4}X_S((-\Delta)^{\theta+\frac{s}{2}-\frac{1}{4}}|Eg|^2)\|_{L^2(TS)}\\&\lesssim \|(-\Delta)^{\theta+\frac{s}{2}-\frac{1}{4}}|Eg|^2\|_2. \end{aligned} \end{align*} $$

Hence

$$ \begin{align*}\|(-\Delta)^{\frac{\theta}{2}}|Eg|^2\|_2^2\lesssim\|g\|_4^2\|(-\Delta)^{\theta+\frac{s}{2}-\frac{1}{4}}|Eg|^2\|_2 \end{align*} $$

whenever $s<\frac {n-1}{2}$ . Setting $\frac {\theta }{2}=\theta +\frac {s}{2}-\frac {1}{4}$ , or equivalently $\theta =\frac {1}{2}-s$ , it follows that

$$ \begin{align*}\|(-\Delta)^{\frac{\theta}{2}}|Eg|^2\|_2\lesssim \|g\|_4^2 \end{align*} $$

whenever $\theta>1-\frac {n}{2}$ . This Sobolev-extension estimate is reminiscent of the well-known Strichartz inequalities of Ozawa and Tsutsumi [Reference Ozawa and Tsutsumi44]; see [Reference Bennett, Bez, Jeavons and Pattakos10] for some further contextual discussion. In particular, when $n=2$ this implies the classical restriction theorem for smooth compact planar curves of nonvanishing curvature, since the missing case $\theta =0$ is the missing (endpoint) $L^4$ estimate in that setting. We note that curvature only plays a role in the X-ray estimate, which is structurally consistent with Stein’s inequality (1.2). This implication via (1.17) should be compared with the passage from the Kakeya maximal conjecture to the restriction conjecture implied by Stein’s inequality (1.2) outlined in Remark 1.1. Some related arguments in the setting of the paraboloid may be found in [Reference Planchon and Vega46, Reference Vega52, Reference Beltran and Vega8].

Remark 1.13 (The strength of Theorem 1.8).

The proximity of (1.18) to (1.4) varies depending on the nature of the weight w, and evidently this relates to the tightness of the Sobolev embedding referred to in Remark 1.12. For example, in the case of the sphere (or suitable portions of it – see Theorem 3.4) and for weights of the form $w(x)=\varphi (x/R)$ , where $\varphi $ is a smooth bump function and $R\gg 1$ , Theorem 1.8 quickly leads to the inequality

$$ \begin{align*}\frac{1}{R}\int_{B(0,R)}|\widehat{g\mathrm{d}\sigma}(x)|^2\mathrm{d}x\leq C_\varepsilon R^{\varepsilon}\|g\|_2^2\end{align*} $$

for all $\varepsilon>0$ and $R\gg 1$ . Up to the $\varepsilon $ -loss this coincides with (1.4) and is the well-known Agmon–Hörmander inequality [Reference Agmon and Hörmander1]. For weights w that lack regularity one should expect the Sobolev embedding referred to in Remark 1.12 to be weak, and thus (1.18) to be considerably weaker than (1.4). Examples of such weights seem likely to include those that are known to be ‘critical’ for (1.4) in the sense that they have large mass globally, but small mass on any line, such as the weights of Cairo [Reference Cairo18], or the random weights of Carbery [Reference Carbery19] and Mulherkar [Reference Mulherkar40]; see also [Reference Guth32].

Remark 1.14. While the curvature quotient $Q(S)$ is invariant under isotropic dilations of S, our Sobolev–Mizohata–Takeuchi theorem (Theorem 1.8) is not. This stems from the fact that necessarily s is strictly less than $\tfrac {n-1}{2}$ for the implicit constant to be finite and manifests itself in the dependence on the diameter of S in the statement of Theorem 1.8. That said, it does provide a bound that is independent of any lower bound on the curvature of S.

Remark 1.15 (Relation to the wavepacket approach).

The representation (1.7) may be viewed as a certain ‘scale-free’ (and ‘quadratic’) version of the wavepacket decomposition that has proved so effective in Fourier restriction theory. There an extension operator is expressed as a superposition of wavepackets adapted to tubes in $\mathbb {R}^n$ , with the tubes corresponding to a discrete set of points in the tangent bundle of S. The distinction arises from a use of a conventional windowed Fourier transform (a linear operator) in the wavepacket decomposition, rather than a Wigner distribution – the latter being a form of windowed Fourier transform where the window is the input function g itself (a quadratic operator). We refer to [Reference Carbery, Iliopoulou and Wang21] and the references there for progress on the Stein and Mizohata–Takeuchi problems based on wavepacket analysis.

Structure of the paper. In Section 2 we consider the case when S is the paraboloid, motivating our perspective and results in classical quantum mechanical terms that date back to Wigner’s original work. In Section 3 we prove Theorems 1.7 and 1.8 when S is the sphere, interpreting our perspective from the point of view of optical field theory. In Section 4 we turn to the much more involved geometric analysis in the setting of general submanifolds, proving Theorems 1.7 and 1.8, although deferring the necessary analysis of Jacobians, distances and bilinear fractional integrals to Sections 6, 5 and 7 respectively. In Section 8 we establish the characteristic marginal properties of the geometric Wigner transforms via an analysis of the appropriate geometric maximal operators. In Section 9 we observe that the phase-space perspective presented here coincides with a certain tomographic perspective introduced in [Reference Bennett and Nakamura14] when $n=2$ , highlighting a tomographic method for constructing geometric Wigner distributions. In Section 10 we illustrate the effectiveness of our basic methods by establishing a form of Flandrin’s conjecture in the plane with an $\varepsilon $ loss. Finally, in Section 11 we pose some questions.

Notation. Throughout this paper, for nonnegative quantities $A,B$ we write $A\lesssim B$ if there exists a constant c that is independent of S such that $A\leq cB$ . The independence of the implicit constant c of various other parameters will be clear from the context. In particular, such constants will never depend on the input function g, nor the weight function w.

2 The paraboloid: a quantum mechanical viewpoint

In the particular case when S is the paraboloid, the phase-space representation (1.7) has a well-known quantum mechanical derivation going back to the original work of Wigner [Reference Wigner53]. As may be expected, this involves the classical Wigner transform, and as we shall see in this section, leads to some additional insights and simplifications in our arguments. Moreover, parametrised formulations of the Stein and Mizohata–Takeuchi inequalities (1.2) and (1.4) will emerge rather naturally from these classical considerations, permitting them some physical (or probabilistic) interpretations.

The Wigner transform is defined (see, e.g., [Reference Folland28]) for $g_1, g_2\in L^2(\mathbb {R}^d)$ by

(2.1)

$$ \begin{align} W(g_1,g_2)(x,v)=\int_{\mathbb{R}^d}g_1\left(x+\frac{y}{2}\right)\overline{g_2\left(x-\frac{y}{2}\right)}e^{-2\pi iv\cdot y}\mathrm{d}y. \end{align} $$

For a solution $u:\mathbb {R}^d\times \mathbb {R}\rightarrow \mathbb {C}$ of the Schrödinger equation

$$ \begin{align*}2\pi i\frac{\partial u}{\partial t}=\Delta_{x} u\end{align*} $$

with initial data $u_0\in L^2(\mathbb {R}^d)$ , it is a classical observation dating back to Wigner [Reference Wigner53] that

(2.2)

$$ \begin{align} f(x,v,t):=W(u(\cdot, t), u(\cdot, t))(x,v) \end{align} $$

satisfies the kinetic transport equation

$$ \begin{align*}\frac{\partial f}{\partial t}=2v\cdot\nabla_x f \end{align*} $$

from classical mechanics. Consequently

$$ \begin{align*}f(x,v,t)=f_0(x+2tv,v),\end{align*} $$

where $f_0=W(u_0,u_0):\mathbb {R}^d\times \mathbb {R}^d\rightarrow \mathbb {R}$ is the Wigner distribution of the initial data $u_0$ . We note that the function f may be reconciled with the corresponding f in the forthcoming Sections 3 and 4 using the Fourier invariance property (see 1.94 of [Reference Folland28])

(2.3)

$$ \begin{align} W(g_1,g_2)(x,v)=W(\widehat{g}_1,\widehat{g}_2)(-v,x). \end{align} $$

By the classical marginal property

(2.4)

$$ \begin{align} \int_{\mathbb{R}^d}W(g,g)(x,v)\mathrm{d}v=|g(x)|^2 \end{align} $$

of the Wigner distribution we obtain the phase-space representation

(2.5)

$$ \begin{align} |u(x,t)|^2=\int_{\mathbb{R}^d}f(x,v,t)\mathrm{d}v=\int_{\mathbb{R}^d}f_0(x+2tv,v)\mathrm{d}v=:\rho(f_0)(x,t). \end{align} $$

The operator $\rho $ , which is referred to as a velocity averaging operator in kinetic theory, is easily seen to be a certain (parametrised) adjoint space-time X-ray transform, indeed

$$ \begin{align*}\rho^*(g)(x,v)=\int_{\mathbb{R}}g(x-2tv,t)\mathrm{d}t, \end{align*} $$

which is of course an integral of the space-time function g along the line through the point $(x,0)$ with direction $(-2v,1)$ . We caution that the parameter v, being a velocity, describes the direction of this line. This differs from elsewhere in this paper where v is used as a translation (or position) parameter.

As we have indicated in the introduction, the above phase-space representation is particularly natural if one is interested in weighted $L^2$ norms of u, since by duality

(2.6)

$$ \begin{align} \int_{\mathbb{R}^d\times\mathbb{R}}|u(x,t)|^2w(x,t)\mathrm{d}x\mathrm{d}t=\int_{\mathbb{R}^d\times\mathbb{R}^d}W(u_0,u_0)(x,v)\rho^*w(x,v)\mathrm{d}x\mathrm{d}v. \end{align} $$

We refer to [Reference Dendrinos, Mustata and Vitturi24] where this identity was recently derived directly. If the initial data $u_0$ is a Gaussian then $W(u_0,u_0)$ is also a (real) Gaussian, and being nonnegative it follows that

$$ \begin{align*} \begin{aligned} \int_{\mathbb{R}^d\times\mathbb{R}}|u(x,t)|^2w(x,t)\mathrm{d}x\mathrm{d}t&\leq\int_{\mathbb{R}^d}\left(\int_{\mathbb{R}^d}W(u_0,u_0)(x,v)\mathrm{d}x\right)\sup_x\rho^*w(x,v)\mathrm{d}v\\&= \int_{\mathbb{R}^d}|\widehat{u}_0(v)|^2\sup_x\rho^*w(x,v)\mathrm{d}v, \end{aligned} \end{align*} $$

which in turn implies that

$$ \begin{align*}\int_{\mathbb{R}^d\times\mathbb{R}}|u(x,t)|^2w(x,t)\mathrm{d}x\mathrm{d}t\leq\sup_{\substack{x\in\mathbb{R}^d\\ v\in\mathrm{supp\,}(\widehat{u}_0)}}\rho^* w(x,v)\;\|u_0\|_2^2. \end{align*} $$

Here we have used the further marginal property

(2.7)

$$ \begin{align} \int_{\mathbb{R}^d}W(g,g)(x,v)\mathrm{d}x=|\widehat{g}(v)|^2 \end{align} $$

of the Wigner distribution, followed by Plancherel’s theorem. It is therefore reasonably natural to ask whether

(2.8)

$$ \begin{align} \int_{\mathbb{R}^d\times\mathbb{R}}|u(x,t)|^2w(x,t)\mathrm{d}x\mathrm{d}t\lesssim \int_{\mathbb{R}^d}|\widehat{u}_0(v)|^2\sup_x\rho^*w(x,v)\mathrm{d}v, \end{align} $$

and thus

(2.9)

$$ \begin{align} \int_{\mathbb{R}^d\times\mathbb{R}}|u(x,t)|^2w(x,t)\mathrm{d}x\mathrm{d}t\lesssim\sup_{\substack{x\in\mathbb{R}^d\\ v\in\mathrm{ supp\,}(\widehat{u}_0)}}\rho^* w(x,v)\;\|u_0\|_2^2 \end{align} $$

might hold for general $u_0$ . As we clarify shortly in Remark 2.3, the inequalities (2.8) and (2.9) are parabolic forms of the Stein (1.2) and Mizohata–Takeuchi (1.4) inequalities and as such also fail (see Remark 1.2).

Remark 2.1. As indicated in Remark 1.2, the recent counterexamples in [Reference Cairo18] leave open the possibility that for $\widehat {u}_0$ supported in the unit ball (say),

(2.10)

$$ \begin{align} \int_{|(x,t)|\leq R}|u(x,t)|^2w(x,t)\mathrm{d}x\mathrm{d}t\leq C_\varepsilon R^\varepsilon \int_{\mathbb{R}^d}|\widehat{u}_0(v)|^2\sup_x\rho^*w(x,v)\mathrm{d}v \end{align} $$

and

(2.11)

$$ \begin{align} \int_{|(x,t)|\leq R}|u(x,t)|^2w(x,t)\mathrm{d}x\mathrm{d}t\leq C_\varepsilon R^\varepsilon \sup_{\substack{x\in\mathbb{R}^d\\ v\in\mathrm{supp\,}(\widehat{u}_0)}}\rho^* w(x,v)\;\|u_0\|_2^2 \end{align} $$

might hold for each $\varepsilon>0$ and all $R\gg 1$ ; in other words, (2.8) and (2.9) under the assumption that w is supported in the space-time ball $B(0,R)$ , accepting a factor of $R^\varepsilon $ in the implicit constant on the right-hand sides for each $\varepsilon>0$ . The requirement that $\widehat {u}_0$ is supported in some fixed compact set (the unit ball here) prevents scale-invariance considerations reducing (2.10) and (2.11) to (2.8) and (2.9) respectively. We remark that these inequalities are naturally referred to as Strichartz estimates, being bounds on space-time norms.

Remark 2.2 (A quasi-probabilistic interpretation).

In the phase-space formulation of quantum mechanics the Wigner distribution $W(u_0,u_0)$ is interpreted as a (quasi-) probability distribution on position-momentum space for a quantum particle, and so the inequalities (2.8) and (2.9) for any given weight w are the assertions that

(2.12)

$$ \begin{align}\mathbb{E}_{x,v}(\rho^*w)\lesssim\mathbb{E}_{x,v}(\|\rho^*w\|_{L^\infty_x}) \end{align} $$

and

(2.13)

$$ \begin{align}\mathbb{E}_{x,v}(\rho^*w)\lesssim\|\rho^*w\|_\infty \end{align} $$

respectively; we recall that these inequalities are known to fail for general w unless we make some additional localisations (see Remark 2.1). Here the expectation is taken with respect to the quasi-probability density $W(u_0,u_0)$ , where of course $\|u_0\|_2=1$ . Note that $\mathbb {E}_{x,v}(\|\rho ^*w\|_{L^\infty _x})=\mathbb {E}_{v}(\|\rho ^*w\|_{L^\infty _x})$ by the marginal property (2.7), where $\mathbb {E}_{v}$ is taken with respect to the probability density $|\widehat {u}_{0}(v)|^{2}$ . The forthcoming Theorems 2.4–2.8 may be interpreted similarly. Evidently the subtleties in (2.12), (2.13) and all of these inequalities arise from the fact that the Wigner distribution typically takes both positive and negative values.

Remark 2.3. Although (2.8) is false in general (see Remark 2.1), for any given weight w it may be seen as an instance of (1.2) where $d=n-1$ and

(2.14)

$$ \begin{align} S=\mathbb{P}^d:=\{u=(u',u_{d+1})\in\mathbb{R}^d\times\mathbb{R}:u_{d+1}=|u'|^2\} \end{align} $$

is the paraboloid. This is a consequence of a certain change-of-measure invariance property enjoyed by the general inequality (1.2): specifically, if $\mathrm {d}\tilde {\sigma }(u)=a(u)\mathrm {d}\sigma (u)$ for some density a on S, then (1.5) quickly implies that

(2.15)

$$ \begin{align} \int_{\mathbb{R}^n}|\widehat{g\mathrm{d}\tilde{\sigma}}(x)|^2w(x)\mathrm{d}x\leq C\int_S|g(u)|^2\sup_{v\in T_uS}a(u)Xw(N(u),v)\mathrm{d}\tilde{\sigma}(u). \end{align} $$

Next we define the (affine surface) measure $\mathrm {d}\tilde {\sigma }$ on $\mathbb {P}^d$ by

(2.16)

$$ \begin{align} \int_S\Phi \mathrm{d}\tilde{\sigma}=\int_{\mathbb{R}^d}\Phi(u',|u'|^2)\mathrm{d}u', \end{align} $$

so that $a(u)=(1+4|u|^2)^{-1/2}$ . With these choices, a scalar change of variables reveals that

$$ \begin{align*}\sup_{v\in T_u S}a(u)Xw(N(u),v)=\sup_x\rho^*w(x,u'). \end{align*} $$

Finally, defining $g: S\rightarrow \mathbb {C}$ by $g(\cdot ,|\cdot |^2)=\widehat {u}_0$ , we have that $u(x,t)=\widehat {g\mathrm {d}\tilde {\sigma }}(x,t)$ , from which (2.8) follows. The change-of-measure invariance property (2.15) enjoyed by (1.2) is not inherited by the corresponding Mizohata–Takeuchi inequality (1.4), meaning that there is in principle a different Mizohata–Takeuchi inequality for each density a – namely

(2.17)

$$ \begin{align} \int_{\mathbb{R}^n}|\widehat{g\mathrm{d}\tilde{\sigma}}(x)|^2w(x)\mathrm{d}x\leq C\sup_{(u,v)\in TS}a(u)Xw(N(u),v)\|g\|^2_{L^2(\mathrm{d}\tilde{\sigma})}, \end{align} $$

where again, the supremum is restricted to $u\in \mathrm {supp\,}(g)$ . It is straightforward to verify that (2.9) coincides with (2.17) with the above choice of density a on the paraboloid. Similar change-of-measure arguments relate the paraboloid-carried Wigner distribution referred to in (1.7) to the classical Wigner distribution (2.1), reconciling (2.5) with (1.7). We clarify this in Remark 4.7 in Section 4.

Perhaps the most obvious difficulty in going beyond Gaussian initial data is that $W(u_0,u_0)$ is everywhere nonnegative if and only if $u_0$ is a Gaussian (this is known as Hudson’s theorem, see [Reference Folland28] for a treatment of this and other fundamental properties of the Wigner transform), and the inequality $\|W(u_0,u_0)\|_1\lesssim \|u_0\|_2^2$ fails for general $u_0$ (see [Reference Lerner37]). Of course the $L^p$ estimates that do hold for the Wigner distribution (see [Reference Lieb38]) yield variants of (2.9) via Hölder’s inequality, such as

(2.18)

$$ \begin{align} \int_{\mathbb{R}^d\times\mathbb{R}}|u(x,t)|^2w(x,t)\mathrm{d}x\mathrm{d}t\lesssim\|\rho^* w\|_{L^2(\mathbb{R}^d\times [-1,1]^d)}\|u_0\|_2^2, \end{align} $$

as was observed in [Reference Dendrinos, Mustata and Vitturi24] whenever $\widehat {u}_0$ is supported in the cube $[-1,1]^d$ . Here we observe that further variants arise from certain Sobolev estimates on the Wigner transform. For example, we have the following:

Theorem 2.4. For $s>d/2$ ,

(2.19)

$$ \begin{align} \int_{\mathbb{R}^d\times\mathbb{R}}|u(x,t)|^2w(x,t)\mathrm{d}x\mathrm{d}t\leq\int_{\mathbb{R}^d}\widetilde{I}_{2s}(|\widehat{u}_0|^2,|\widehat{u}_0|^2)(v)^{1/2}\|\rho^* w(\cdot,v)\|_{H_x^s}\mathrm{d}v, \end{align} $$

where

$$ \begin{align*}\widetilde{I}_s(g_1,g_2)(v):=\int_{\mathbb{R}^d}\frac{g_1\left(v+\frac{\xi}{2}\right)g_2\left(v-\frac{\xi}{2}\right)}{(1+|\xi|^2)^{s/2}}\mathrm{d}\xi \end{align*} $$

and $H_x^s$ denotes the usual inhomogeneous $L^2$ Sobolev space in the variable x.

Theorem 2.5. For $s>d/2$ ,

(2.20)

$$ \begin{align} \int_{\mathbb{R}^d\times\mathbb{R}}|u(x,t)|^2w(x,t)\mathrm{d}x\mathrm{d}t\lesssim\sup_{v\in\frac{1}{2}(\mathrm{supp\,}(\widehat{u}_0)+\mathrm{ supp\,}(\widehat{u}_0))}\|\rho^* w(\cdot,v)\|_{H_x^s}\|u_0\|_2^2, \end{align} $$

where the implicit constant depends on at most d and s.

Remark 2.6. As our arguments quickly reveal, Theorems 2.4 and 2.5 require no positivity hypothesis on the weight w. This point aside, Theorems 2.4 and 2.5 may be viewed as substitutes for (2.8) and (2.9), which are false for general weights; see Remark 2.1. This is a consequence of the elementary Sobolev embedding $H^s(\mathbb {R}^d)\subset L^\infty (\mathbb {R}^d)$ , which holds whenever $s>d/2$ . It is natural to ask whether the stronger

(2.21)

$$ \begin{align} \int_{\mathbb{R}^d\times\mathbb{R}}|u(x,t)|^2w(x,t)\mathrm{d}x\mathrm{d}t\lesssim\sup_{v\in\mathrm{supp\,}(\widehat{u}_0)}\|\rho^* w(\cdot,v)\|_{H_x^s}\|u_0\|_2^2 \end{align} $$

holds, as suggested by (2.9) for arbitrary positive weights.

Proof of Theorem 2.4.

By (2.6) and an application of the duality of $H^s$ and $H^{-s}$ we have

$$ \begin{align*}\int_{\mathbb{R}^d\times\mathbb{R}}|u(x,t)|^2w(x,t)\mathrm{d}x\mathrm{d}t\leq\int_{\mathbb{R}^d}\|W(u_0,u_0)(\cdot, v)\|_{H_x^{-s}}\|\rho^* w(\cdot,v)\|_{H_x^s}\mathrm{d}v, \end{align*} $$

and so it remains to show that

(2.22)

$$ \begin{align} \|W(u_0,u_0)(\cdot, v)\|_{H_x^{-s}}^2=\widetilde{I}_{2s}(|\widehat{u}_0|^2,|\widehat{u}_0|^2)(v). \end{align} $$

This follows from the classical Fourier invariance property (2.3), which implies

(2.23)

$$ \begin{align} \mathcal{F}_x^{-1}W(g_1,g_2)(\xi, v)=\widehat{g}_1\left(-v+\frac{\xi}{2}\right)\overline{\widehat{g}_2\left(-v-\frac{\xi}{2}\right)}, \end{align} $$

where $\mathcal {F}_x$ denotes the Fourier transform in x. The identity (2.22) now follows by Plancherel’s theorem and the definition of the inhomogeneous Sobolev norm.

Proof of Theorem 2.5.

Observe first that $\widetilde {I}_{2s}(|\widehat {u}_0|^2,|\widehat {u}_0|^2)(v)=0$ whenever

$$ \begin{align*}v\not\in\frac{1}{2}(\mathrm{ supp\,}(\widehat{u}_0)+\mathrm{supp\,}(\widehat{u}_0)).\end{align*} $$

Hence, by Theorem 2.4, it suffices to show that

(2.24)

$$ \begin{align} \|\widetilde{I}_s(g_1,g_2)\|_{L^{1/2}(\mathbb{R}^d)}\lesssim\|g_1\|_1\|g_2\|_1 \end{align} $$

whenever $s>d$ . The operator $\widetilde {I}_s$ is a variant (with singularity only at infinity) of the bilinear fractional integral operator

(2.25)

$$ \begin{align} \displaystyle I_s(g_1,g_2)(v):=\int_{\mathbb{R}^d}\frac{g_1\left(v+\frac{\xi}{2}\right)g_2\left(v-\frac{\xi}{2}\right)}{|\xi|^{s}}\mathrm{d}\xi \end{align} $$

treated by Kenig and Stein in [Reference Kenig and Stein33] and Grafakos and Kalton in [Reference Grafakos and Kalton31] (see also [Reference Grafakos30] for estimates above $L^1$ ), and the bound (2.24) follows a brief inspection of their arguments. For similar arguments, see also Section 7.

Theorems 2.4 and 2.5 cease to be natural if the initial datum $u_0$ has compact Fourier support, as they involve inhomogeneous Sobolev spaces, which respond to high frequencies of $u_0$ only. The appropriate substitutes are the following, which align with our main Theorems 1.7 and 1.8:

Theorem 2.7 (Parabolic Sobolev–Stein).

For $s<d/2$ ,

(2.26)

$$ \begin{align} \int_{\mathbb{R}^d\times\mathbb{R}}|u(x,t)|^2w(x,t)\mathrm{d}x\mathrm{d}t\leq\int_{\mathbb{R}^d}I_{2s}(|\widehat{u}_0|^2,|\widehat{u}_0|^2)(v)^{1/2}\|\rho^* w(\cdot,v)\|_{\dot{H}_x^s}\mathrm{d}v, \end{align} $$

where $I_s(g_1,g_2)$ is given by (2.25) and $\dot {H}_x^s$ denotes the usual homogeneous $L^2$ Sobolev space in the variable x.

Theorem 2.8 (Parabolic Sobolev–Mizohata–Takeuchi).

For $s<d/2$ ,

(2.27)

whenever $\mathrm {supp\,}(\widehat {u}_0)\subseteq B(0;1)$ . The implicit constant depends on at most d and s.

Remark 2.9. Theorems 2.7 and 2.8 also permit signed weights. Restricting to positive weights, Theorems 2.7 and 2.8 are also easily seen to be respectively weaker than (2.8) and (2.9) via a Sobolev embedding. Specifically, by the support hypothesis on $\widehat {u}_0$ we may find a spatial bump function $\Phi $ such that

$$ \begin{align*}\int_{\mathbb{R}^d\times\mathbb{R}}|u(x,t)|^2w(x,t)\mathrm{d}x\mathrm{d}t\leq \int_{\mathbb{R}^d\times\mathbb{R}}|u(x,t)|^2\Phi*w(x,t)\mathrm{d}x\mathrm{d}t, \end{align*} $$

and so it suffices to observe that for any $v\in \mathbb {R}^d$ ,

$$ \begin{align*} \begin{aligned} \|\rho^*(\Phi*w)(\cdot,v)\|_{\infty}\lesssim \|\rho^* w(\cdot,v)\|_{\dot{H}_x^s} \end{aligned} \end{align*} $$

whenever $s<d/2$ . This follows by Plancherel’s identity and the Cauchy–Schwarz inequality.

The proofs of Theorems 2.7 and 2.8 are very similar to those of Theorems 2.4 and 2.5 above, the essential difference being the use of homogeneous rather than inhomogeneous Sobolev norms, and matters are reduced to an $L^1\times L^1\rightarrow L^{1/2}$ bound on the bilinear operator

$$ \begin{align*}T(g_1,g_2)(v):=\int_{B(0;1)}\frac{g_1\left(v+\frac{\xi}{2}\right)g_2\left(v-\frac{\xi}{2}\right)}{|\xi|^{s}}\mathrm{d}\xi. \end{align*} $$

This is a local form of the bilinear fractional integral operator $I_s$ defined in (2.25), and again the required bound follows a brief inspection of the arguments in [Reference Kenig and Stein33].

3 The sphere: an optical viewpoint

The extension operator for the sphere

$$ \begin{align*}\widehat{g\mathrm{d}\sigma}(x):=\int_{\mathbb{S}^{n-1}}e^{-2\pi ix\cdot\omega}g(\omega)\mathrm{d}\sigma(\omega)\end{align*} $$

is of central importance in optics, providing a description of a unit-wavelength (or monochromatic) optical wave field as a superposition of plane waves; note that $\widehat {g\mathrm {d}\sigma }$ solves the Helmholtz equation $ \Delta u +u=0$ on $\mathbb {R}^n$ . Of particular physical significance is $|\widehat {g\mathrm {d}\sigma }|^2$ , sometimes referred to as the local intensity of the field; see, for example, [Reference Alonso3]. The Stein and Mizohata–Takeuchi inequalities (1.2) and (1.4), when specialised to the sphere $S=\mathbb {S}^{n-1}$ , become statements about this intensity, namely

(3.1)

$$ \begin{align} \int_{\mathbb{R}^n}|\widehat{g\mathrm{d}\sigma}(x)|^2w(x)\mathrm{d}x\leq C\int_{\mathbb{S}^{n-1}}|g(\omega)|^2\sup_{v\in\langle\omega\rangle^\perp}Xw(\omega,v)\mathrm{d}\sigma(\omega), \end{align} $$

and

(3.2)

$$ \begin{align} \int_{\mathbb{R}^n}|\widehat{g\mathrm{d}\sigma}(x)|^2w(x)\mathrm{d}x\leq C\sup_{\omega\in\mathrm{ supp\,}(g)}\|Xw(\omega,\cdot)\|_{L^\infty(\langle\omega\rangle^\perp)}\|g\|_{L^2(\mathbb{S}^{n-1})}^2 \end{align} $$

respectively. These conjectural inequalities are well-known for radial weights, as discussed in the introduction, although we recall that for general weights they should carry a further localisation hypothesis following the counterexamples of Cairo [Reference Cairo18]; see Remark 1.2 for clarification. Both (3.1) and (3.2) capture the expectation that the intensity $|\widehat {g\mathrm {d}\sigma }|^2$ concentrates on rays (lines), and as such connect physical optics to geometric optics. A good illustration of this is found in the high-frequency limiting identity

(3.3)

$$ \begin{align} \limsup_{R\rightarrow\infty}R^{n-1}\int_{\mathbb{R}^{n}}|\widehat{g\mathrm{d}\sigma}(Rx)|^2w(x)\mathrm{d}x=\frac{1}{(2\pi)^{n+1}}\int_{\mathbb{S}^{n-1}}|g(\xi)|^2\left(\int_{\mathbb{R}}w(t\xi)\mathrm{d}t\right)\mathrm{d}\sigma(\xi), \end{align} $$

established (for compactly supported w) by Agmon and Hörmander in [Reference Agmon and Hörmander1]; see [Reference Barceló, Ruiz and Vega5]. Accordingly (3.1) and (3.2) call for an optical (or spherical) analogue of the quantum-mechanical (or parabolic) phase-space perspective from Section 2. Fortunately such a perspective is well-known in modern optics (see [Reference Alonso3]) and involves the spherical Wigner transform that we define next. For $g_1,g_2\in L^2(\mathbb {S}^{n-1})$ let

(3.4)

$$ \begin{align} W_{\mathbb{S}^{n-1}}(g_1,g_2)(\omega, v)=\int_{\mathbb{S}^{n-1}}g_1(\omega')\overline{g_2(R_\omega \omega')}e^{-2\pi iv\cdot(\omega'-R_\omega\omega')}J(\omega,\omega')\mathrm{d}\sigma(\omega'). \end{align} $$

Here $\omega \in \mathbb {S}^{n-1}$ , $v\in \langle \omega \rangle ^\perp $ , and for a point $\omega '\in \mathbb {S}^{n-1}$ , the point $R_\omega \omega '$ is defined to be the unique $\omega "\in \mathbb {S}^{n-1}$ for which $\omega $ is the geodesic midpoint of $\omega '$ and $\omega "$ ; that is,

(3.5)

$$ \begin{align} R_\omega \omega'=2(\omega\cdot\omega')\omega-\omega'. \end{align} $$

The function $J(\omega ,\omega '):=2^{n-2}|\omega \cdot \omega '|^{n-2}$ (see the forthcoming Remark 4.7) is chosen so that

$$ \begin{align*}\int_{\mathbb{S}^{n-1}}\Phi(R_\omega\omega')J(\omega,\omega')\mathrm{d}\sigma(\omega)=\int_{\mathbb{S}^{n-1}}\Phi \mathrm{d}\sigma \end{align*} $$

for each $\omega '$ . This expression for J may be obtained by direct computation, noting that the map ${\omega \mapsto \omega ":=R_\omega \omega '}$ is not a bijection; it maps each component of $\mathbb {S}^{n-1}\backslash \langle \omega '\rangle ^\perp $ bijectively to $\mathbb {S}^{n-1}\backslash \{-\omega '\}$ with

(3.6)

$$ \begin{align} \mathrm{d}\sigma(\omega")=2^{n-1}|\omega\cdot\omega'|^{n-2}\mathrm{d}\sigma(\omega). \end{align} $$

The essential features of this construction are those described in [Reference Alonso3]; see also [Reference Kowalski and Ławniczak34].

Motivated by the role of the transport equation in Section 2, for $g\in L^2(\mathbb {S}^{n-1})$ we define the auxiliary function $f:\mathbb {S} ^{n-1}\times \mathbb {R}^n\rightarrow \mathbb {R}$ (not to be confused with (2.2)) by

$$ \begin{align*}f(\omega,x)=\int_{\mathbb{S}^{n-1}}g(\omega')\overline{g(R_\omega \omega')}e^{-2\pi ix\cdot(\omega'-R_\omega\omega')}J(\omega,\omega')\mathrm{d}\sigma(\omega'), \end{align*} $$

so that $W_{\mathbb {S}^{n-1}}(g,g)$ is the restriction of f to the tangent bundle $T\mathbb {S}^{n-1}:=\{(\omega ,v):\omega \in \mathbb {S}^{n-1}, v\in \langle \omega \rangle ^\perp \}$ . That f is real-valued follows from the fact that $R_\omega \circ R_\omega =I$ for each $\omega $ . Evidently f satisfies the transport equation

(3.7)

$$ \begin{align} \omega\cdot\nabla_xf=0, \end{align} $$

meaning that $f(\omega ,x)=f(\omega ,x_{\langle \omega \rangle ^\perp })=W_{\mathbb {S}^{n-1}}(g,g)(\omega ,x_{\langle \omega \rangle ^\perp })$ , where $x_{\langle \omega \rangle ^\perp }$ is the orthogonal projection of x onto $\langle \omega \rangle ^\perp $ . The functions f and $W_{\mathbb {S}^{n-1}}$ have some nice features; for example, we have the marginal identity

(3.8)

$$ \begin{align} \int_{\mathbb{S}^{n-1}}f(\omega,x)\mathrm{d}\sigma(\omega)=|\widehat{g\mathrm{d}\sigma}(x)|^2, \end{align} $$

by Fubini’s theorem and the definition of J. We note in passing that we have the additional marginal property

$$ \begin{align*}\int_{\langle\omega\rangle^\perp}W_{\mathbb{S}^{n-1}}(g,g)(\omega,v)\mathrm{d}v=\frac{1}{2}\left(|g(\omega)|^2+|g(-\omega)|^2\right), \end{align*} $$

very much as in the setting of the classical Wigner distribution. This may be obtained by fixing $\omega $ and considering the contributions to $W_{\mathbb {S}^{n-1}}(g,g)$ coming from the integrals over the hemispheres $\mathbb {S}^{n-1}_{\pm }:=\{\omega ':\pm \omega \cdot \omega '>0\}$ and using the fact that the mapping $\omega '\mapsto \omega '-R_\omega \omega '$ is a bijection from each of $\mathbb {S}^{n-1}_{\pm }$ to the unit ball of $\langle \omega \rangle ^\perp $ ; see the forthcoming proof of Theorem 3.2 for a similar argument.

These observations lead to the desired spherical analogue of (2.5):

Proposition 3.1 (Spherical phase-space representation).

$$ \begin{align*}|\widehat{g\mathrm{d}\sigma}|^2=X^*W_{\mathbb{S}^{n-1}}(g,g).\end{align*} $$

Proof. By (3.8), (3.7) and Fubini’s theorem,

$$ \begin{align*} \begin{aligned} \int_{\mathbb{R}^n}|\widehat{g\mathrm{d}\sigma}(x)|^2w(x)\mathrm{d}x&=\int_{\mathbb{R}^n}\int_{\mathbb{S}^{n-1}}f(\omega,x)\mathrm{d}\sigma(\omega)w(x)\mathrm{d}x\\ &=\int_{\mathbb{S}^{n-1}}\int_{\langle\omega\rangle^\perp}f(\omega,x_{\langle\omega\rangle^\perp})\left(\int_{\langle\omega\rangle}w(x_{\langle\omega\rangle}+x_{\langle\omega\rangle^\perp})\mathrm{d}x_{\langle\omega\rangle}\right)\mathrm{d}x_{\langle\omega\rangle^\perp}\mathrm{d}\sigma(\omega)\\ &=\int_{\mathbb{S}^{n-1}}\int_{\langle\omega\rangle^\perp}W_{\mathbb{S}^{n-1}}(g,g)(\omega,v)Xw(\omega,v)\mathrm{d}v\mathrm{d}\sigma(\omega)\\ &=\int_{\mathbb{R}^n}X^*W_{\mathbb{S}^{n-1}}(g,g)(x)w(x)\mathrm{d}x \end{aligned} \end{align*} $$

for all test functions w.

As we have already indicated, Proposition 3.1 is well-known in some form in optics (at least in low dimensions) where it provides a representation of the local intensity of an optical field as a linear superposition of light rays – a useful and explicit connection between physical and geometric optics; see Alonso [Reference Alonso3]. Proposition 3.1 may be used to prove the following spherical versions of Theorems 1.7 and 1.8:

Theorem 3.2 (Spherical Sobolev–Stein).

For $s<\frac {n-1}{2}$ , there exists a dimensional constant c such that

(3.9)

$$ \begin{align} \int_{\mathbb{R}^n}|\widehat{g\mathrm{d}\sigma}(x)|^2w(x)\mathrm{d}x\leq c\int_{\mathbb{S}^{n-1}}I_{\mathbb{S}^{n-1},2s}(|g|^2,|g|^2)(\omega)^{1/2}\|Xw(\omega,\cdot)\|_{\dot{H}^s(\langle\omega\rangle^\perp)}\mathrm{d}\sigma(\omega), \end{align} $$

where

$$ \begin{align*}I_{\mathbb{S}^{n-1},s}(g_1,g_2)(\omega):=\int_{\mathbb{S}^{n-1}}\frac{g_1(\omega')g_2(R_\omega\omega')}{|\omega'-R_\omega\omega'|^{s}}|\omega\cdot\omega'|^{n-2}\mathrm{d}\sigma(\omega'). \end{align*} $$

Remark 3.3. The hypothesis $s<\frac {n-1}{2}$ in the statement of Theorem 3.2 serves only to ensure that the kernel of the fractional integral operator $I_{\mathbb {S}^{n-1},s}$ is locally integrable, giving meaning to $I_{\mathbb {S}^{n-1},s}$ . The corresponding Sobolev-Mizohata–Takeuchi theorem that follows rests on the availability of suitable bounds on this fractional integral, and so involves a constant that also depends on s.

Theorem 3.4 (Spherical Sobolev–Mizohata–Takeuchi).

For $s<\frac {n-1}{2}$ ,

(3.10)

$$ \begin{align} \int_{\mathbb{R}^n}|\widehat{g\mathrm{d}\sigma}(x)|^2w(x)\mathrm{d}x\lesssim\sup_{\omega\in\mathrm{ supp\,}^*(g)}\|Xw(\omega,\cdot)\|_{\dot{H}^s(\langle\omega\rangle^\perp)}\|g\|_{L^2(\mathbb{S}^{n-1})}^2, \end{align} $$

where $\mathrm {supp\,}^*(g)$ is the set of all geodesic midpoints of pairs of points from $\mathrm {supp\,}(g)$ . The implicit constant depends on at most n and s.

Remark 3.5. Theorems 3.2 and 3.4 may be seen to follow from Theorems 1.7 and 1.8 respectively. This involves partitioning the sphere into suitable geodesically convex patches as alluded to in the introduction, and indeed this is how our proof below begins. This elementary step appears to require the weight w to be non-negative, despite non-negativity not being a requirement of either Theorem 1.7 or 1.8.

Proof of Theorem 3.2.

By partitioning $\mathbb {S}^{n-1}$ into boundedly many (depending only on n) geodesically convex subsets (caps), it suffices to show (3.9) under the assumption that g is supported in a cap S satisfying $\omega \cdot \omega '\geq \tfrac {1}{2}$ for all points $\omega ,\omega '\in S$ (in line with (1.12)). By Proposition 3.1 and the Cauchy–Schwarz inequality it suffices to show that

(3.11)

$$ \begin{align} \|W_{\mathbb{S}^{n-1}}(g,g)(\omega,\cdot)\|_{\dot{H}^{-s}(\langle\omega\rangle^\perp)}^2\lesssim I_{\mathbb{S}^{n-1},2s}(|g|^2,|g|^2)(\omega), \end{align} $$

for some implicit constant depending only on n. Next, for fixed $\omega \in S$ we make the change of variables $\xi =\omega '-R_\omega \omega '$ , which maps S bijectively to a subset U of $\langle \omega \rangle ^\perp $ . Defining $\omega ':U\rightarrow S$ by $\xi =\omega '(\xi )-R_\omega \omega '(\xi )$ we have

$$ \begin{align*} \begin{aligned} W_{\mathbb{S}^{n-1}}(g,g)(\omega,v)&=\int_{S}g(\omega')\overline{g(R_\omega \omega')}e^{iv\cdot(\omega'-R_\omega\omega')}J(\omega,\omega')\mathrm{d}\sigma(\omega')\\ &=\int_{U}g(\omega'(\xi))\overline{g(R_\omega \omega'(\xi))}e^{iv\cdot\xi}\frac{J(\omega,\omega'(\xi))}{\widetilde{J}(\omega,\omega'(\xi))}\mathrm{d}\xi, \end{aligned} \end{align*} $$

where $\widetilde {J}(\omega ,\omega ')=2^{n-1}\omega \cdot \omega '\sim 1$ is the Jacobian of the change of variables. Hence

(3.12)

and so by Plancherel’s theorem on $\langle \omega \rangle ^\perp $ ,

(3.13)

$$ \begin{align} \begin{aligned} \|W_{\mathbb{S}^{n-1}}(g,g)(\omega,\cdot)\|_{\dot{H}^{-s}(\langle\omega\rangle^\perp)}^2&=\int_{U}\left||\xi|^{-s}g(\omega'(\xi))\overline{g(R_\omega \omega'(\xi))}\frac{J(\omega,\omega'(\xi))}{\widetilde{J}(\omega,\omega'(\xi))}\right|^2\mathrm{d}\xi\\ &=\int_{S}|\omega'-R_\omega\omega'|^{-2s}|g(\omega')|^2|g(R_\omega \omega')|^2\frac{J(\omega,\omega')^2}{\widetilde{J}(\omega,\omega')}\mathrm{d}\sigma(\omega')\\ &\lesssim\int_{S}|\omega'-R_\omega\omega'|^{-2s}|g(\omega')|^2|g(R_\omega \omega')|^2|\omega\cdot\omega'|^{n-2}\mathrm{d}\sigma(\omega')\\ &=I_{\mathbb{S}^{n-1},2s}(|g|^2,|g|^2)(\omega), \end{aligned} \end{align} $$

The inequality (3.11) follows.

Remark 3.6. The reader may be puzzled by the retention of the specific factor $|\omega \cdot \omega '|^{n-2}$ in the third line of (3.13), and its inclusion in the definition of $I_{\mathbb {S}^{n-1},s}$ . This is significant as it is (up to a constant factor) the Jacobian $J(\omega ,\omega ')$ , which is natural as it ensures that $I_{\mathbb {S}^{n-1},s}$ is symmetric and enjoys the appropriate Lebesgue space bounds. This feature will become clearer in Section 4 in the context of more general submanifolds S.

Proof of Theorem 3.4.

Arguing as in the proof of Theorem 3.2, it suffices to establish (3.10) for g supported in a single cap S. Since $I_{\mathbb {S}^{n-1},2s}(|g|^2,|g|^2)(\omega )=0$ if $\omega \not \in \mathrm {supp\,}^*(g)$ ,

$$ \begin{align*}\int_{\mathbb{R}^n}|\widehat{g\mathrm{d}\sigma}(x)|^2w(x)\mathrm{d}x\lesssim\sup_{\omega\in\mathrm{supp\,}^*(g)}\|Xw(\omega,\cdot)\|_{\dot{H}^s(\langle\omega\rangle^\perp)}\|I_{\mathbb{S}^{n-1},2s}(|g|^2,|g|^2)\|_{L^{1/2}(S)}^{1/2}, \end{align*} $$

by Theorem 3.2. It therefore suffices to show that

is bounded from $L^1\times L^1$ into $L^{1/2}$ whenever $s<n-1$ . This will be established in Section 7, where more general surface-carried bilinear fractional integral operators are estimated.

4 General submanifolds: a geometric viewpoint

As we shall see, identifying a phase-space representation of $|\widehat {g\mathrm {d}\sigma }|^2$ that is explicit enough to establish Theorems 1.7 and 1.8 requires some careful geometric analysis, beginning with the identification of a suitable generalised Wigner distribution (or transform). We present this for general smooth submanifolds of $\mathbb {R}^n$ that are strictly convex in the sense that their shape operators $\mathrm {d}N_u$ are positive definite at all points $u\in S$ .

4.1 Surface-carried Wigner transforms

The general procedure for constructing a suitable Wigner transform on a submanifold of Euclidean space is again well-known in optics [Reference Alonso3], [Reference Petruccelli and Alonso45]; see, for example, [Reference Gneiting, Fischer and Hornberger29] for related intrinsic constructions in quantum physics. As is pointed out in [Reference Alonso3], for $n\geq 3$ matters are considerably more involved as there is some choice to be exercised.

For compactly supported function $g_1,g_2\in L^2(S)$ let

(4.1)

$$ \begin{align} W_S(g_1,g_2)(u, v)=\int_{S}g_1(u')\overline{g_2(R_u u')}e^{-2\pi iv\cdot(u'-R_uu')}J(u,u')\mathrm{d}\sigma(u'). \end{align} $$

Here $u\in S$ , $v\in T_uS$ , and we define, for $u'\neq u$ , $R_u u'$ to be the unique point $u"\in S$ with $u"\not =u'$ such that

(4.2)

$$ \begin{align} (u'-u")\cdot N(u)=0 \end{align} $$

and

(4.3)

$$ \begin{align} N(u)\wedge N(u')\wedge N(u")=0. \end{align} $$

Define $R_{u}u:=u$ for all $u\in S$ . Condition (4.2) stipulates that $u'-u"\in T_u S$ , which as we shall see, is necessary for the phase-space representation (1.7); see Figure 1. Condition (4.3), which stipulates that $N(u), N(u'), N(u")$ lie on a great circle, is where we have exercised some choice. This appears to be physically significant and is at least implicitly referred to in the optics literature; see, for example, [Reference Alonso3] (p. 346) in the context of the sphere. Moreover, the appropriateness of (4.3) is particularly apparent when S is the paraboloid, as we clarify in the forthcoming Remark 4.7. In (4.1) the function $J(u,u')$ is the reciprocal of the Jacobian of the mapping $u\mapsto R_uu'$ , so that

(4.4)

$$ \begin{align} \int_S\Phi(R_uu')J(u,u')\mathrm{d}\sigma(u)=\int_{S}\Phi \mathrm{d}\sigma \end{align} $$

for each $u'\in S$ . The required bijectivity here follows from the assumed geodesic convexity of $N(S)$ referred to in Section 1. We refer to $W_S(g_1,g_2)$ as the Wigner transform on S, and $W_S(g,g)$ as the Wigner distribution on S. As we shall see shortly, the Jacobian J is a bounded function on compact subsets of $S\times S$ , allowing $W_S(g_1,g_2)$ to be defined as a Lebesgue integral.

Figure 1 A depiction of the choice of $u"$ via the conditions (4.2) and (4.3).

The point $u"$ may seem rather difficult to identify at first sight, although it has a simple alternative description that is constructive. This is shown in Figure 2, and will play an important role in our analysis.

Figure 2 The construction of $u"$ via parallel supporting hyperplanes in $T_uS+\{u'\}$ .

Remark 4.1 (Existence of $u"$ ).

There is a technical point that we have glossed over in the above definition of $W_S$ and Figures 1 and 2. For given $u,u'\in S$ our hypotheses do not guarantee the existence of such a point $u":=R_u u'$ , unless S is closed (the boundary of a convex body in $\mathbb {R}^n$ ). One way to remedy this might be to continue S to a closed submanifold, upon which $R_u u'$ may always be defined, and observe that the resulting function $W_S(g_1,g_2)$ is independent of the choice of extension since $g_2$ is supported on S. In any event, the integral in (4.1) should be interpreted as taken over

$$ \begin{align*}\{u'\in S: (u'-u")\cdot N(u)=0 \text{ and } N(u)\wedge N(u')\wedge N(u")=0\text{ for some }u"\not=u'\}.\end{align*} $$

Naturally such domain restrictions will be apparent in our analysis of the Jacobian J in Section 6.

Remark 4.2 (Differentiability of $u"$ ).

We expect that the maps $u\mapsto R_u u'$ and $u'\mapsto R_u u'$ are differentiable away from $u=u'$ and that this should follow from (4.2) and (4.3) by a suitable application of the implicit function theorem; see Figure 2. This smoothness is of course clear when S is the sphere thanks to the explicit formula (3.5) and is assumed to be true of the submanifolds S considered here.

Remark 4.3 (Rationale for the choice of third point $u"$ ).

As is pointed out in [Reference Alonso3] and [Reference Petruccelli and Alonso45], for $n\geq 3$ there are many possible ways of defining the third point $u"$ in terms of $u'$ and u, although for the purposes of proving Theorems 1.7 and 1.8 there are a number of natural requirements that significantly constrain this choice. First of all, the choice should be ‘nondegenerate’ in the sense that the distances $|u'-u|$ and $|u'-u"|$ should be comparable (suitably uniformly in terms of the geometry of S); it should be symmetric so that the resulting Wigner distribution is real-valued (and the Wigner transform is conjugate symmetric), and it should be geometrically/physically natural, so that the Jacobian J may be expressed in terms of the Gauss map N and its derivative $\mathrm {d}N$ (the shape operator). The forthcoming Propositions 4.4 and 4.5 show that our choice of $u"$ has these features. As we shall see, the coplanarity condition (4.3) is natural as it allows the mapping $u\mapsto R_u u'$ to be transformed to a relatively simple ‘outward vector field’ on the tangent space $T_{u'}S$ . This involves parametrising S using the Gauss map followed by stereographic projection (a composition that may also be found in the theory of minimal surfaces).

It will be important for us to understand how the distances between the three points $u, u', u"$ relate to each other. This is provided by the following proposition, whose proof is deferred to Section 5. In particular it tells us that the function $\rho (u,u'):=|u'-R_uu'|$ on $S\times S$ is a quasi-distance, as we clarify in Section 8.

Proposition 4.4 (Distance estimates).

For all $u,u', u"\in S$ with $u"=R_uu'$ ,

(4.5)

$$ \begin{align} |u'-u"|\lesssim Q(S)^{1/2}|u-u'| \end{align} $$

and

(4.6)

$$ \begin{align} |u'-u"|\gtrsim \frac{1}{Q(S)}|u-u'|. \end{align} $$

We now turn from the metric properties to the measure-theoretic properties of the map $R_u$ , and a host of explicit identities satisfied by the Wigner transform $W_S$ .

To see that $W_S$ is conjugate-symmetric, which in particular implies that the Wigner distribution $W_S(g,g)$ is real-valued, already appears to require some work. For fixed $u\in S$ observe first that if $u"=R_u u'$ then $u'=R_u u"$ , and so by a change of variables,

$$ \begin{align*} \begin{aligned} W_S(g_1,g_2)(u,v)=\int_S g_1(R_u u")\overline{g_2(u")}e^{-2\pi iv\cdot(R_u u"-u")}J(u,R_u u")\Delta(u,u")\mathrm{d}\sigma(u"), \end{aligned} \end{align*} $$

where $\Delta (u,u")$ is the Jacobian of the change of variables $u'=R_u u"$ . It therefore remains to show that

$$ \begin{align*} J(u,u')\Delta(u,u")=J(u,u"), \end{align*} $$

recalling that J was defined in (4.4). Fortunately we have explicit formulae for the Jacobians J and $\Delta $ from which this quickly follows. In the following proposition we denote by $K(u)$ the Gaussian curvature of S at the point u, recalling that $K(u)$ is the determinant of the shape operator $\mathrm {d}N_u$ . Further, we denote by $P_{W}v$ the orthogonal projection of a vector $v\in \mathbb {R}^n$ onto a subspace W of $\mathbb {R}^n$ .

Proposition 4.5 (Jacobian identities).

For all $u,u', u"\in S$ with $u"=R_uu'$ ,

(4.7)

$$ \begin{align} J(u,u')=\left(\frac{|N(u')\wedge N(u")|}{|N(u)\wedge N(u')|}\right)^{n-2}\left|\frac{\langle u"-u',N(u")\rangle}{\langle P_{T_{u"}S}N(u),(\mathrm{d}N_{u"})^{-1}(P_{T_{u"}S}N(u))\rangle}\right|\frac{K(u)}{K(u")}, \end{align} $$

(4.8)

$$ \begin{align} \kern-6pt \Delta(u,u')=\left(\frac{|N(u)\wedge N(u")|}{|N(u)\wedge N(u')|}\right)^{n-1}\frac{|\langle P_{T_{u'}S}N(u),(\mathrm{d}N_{u'})^{-1}(P_{T_{u'}S}N(u))\rangle|}{|\langle P_{T_{u"}S}N(u),(\mathrm{d}N_{u"})^{-1}(P_{T_{u"}S}N(u))\rangle|}\frac{K(u')}{K(u")} \end{align} $$

and

(4.9)

$$ \begin{align} J(u,u')\Delta(u,u")=J(u,u"). \end{align} $$

We defer the proof of Proposition 4.5 to Section 6.

Remark 4.6 (Interpreting J).

The expression for J in Proposition (4.5), while seemingly rather complicated, may be understood in somewhat simple geometric terms. In particular:

(i) Matters are much simpler when $n=2$ , where we may write
$$ \begin{align*} \begin{aligned} \displaystyle J(u,u')&\displaystyle =\left|\frac{\langle u"-u',P_{T_uS}N(u")\rangle}{\langle P_{T_{u"}S}N(u),(\mathrm{d}N_{u"})^{-1}(P_{T_{u"}S}N(u))\rangle}\right|\frac{K(u)}{K(u")} \\ &\displaystyle = \frac{|u"-u'|\cdot |N(u)\wedge N(u")|}{|P_{T_{u"}S}N(u)|^{2}}K(u) \\ &\displaystyle = \frac{|u"-u'|}{|N(u)\wedge N(u")|}K(u). \end{aligned} \end{align*} $$
Here we have used the (two-dimensional) formula
$$ \begin{align*}\langle P_{T_{u"}S}N(u),(\mathrm{d}N_{u"})^{-1}(P_{T_{u"}S}N(u))\rangle=\frac{1}{K(u")}|P_{T_{u"}S}N(u)|^{2},\end{align*} $$
along with the elementary identities $|P_{T_{u"}S}N(u)|=|P_{T_{u}S}N(u")|=|N(u)\wedge N(u")|$ .
(ii) The factor
(4.10) $$ \begin{align}\langle P_{T_{u"}S}N(u),(\mathrm{d}N_{u"})^{-1}(P_{T_{u"}S}N(u))\rangle^{-1} \end{align} $$
is bounded above by $\langle P_{T_{u"}S}N(u),\mathrm {d}N_{u"}(P_{T_{u"}S}N(u))\rangle $ by the harmonic-arithmetic mean inequality. This bound is (up to a suitable normalisation factor) the directional curvature of S at the point $u"$ in the direction $P_{T_{u"}S}N(u)$ . One might therefore interpret the factor (4.10) as a certain ‘harmonic directional curvature’.
(iii) The factor
$$ \begin{align*}\frac{|N(u')\wedge N(u")|}{|N(u)\wedge N(u')|}\end{align*} $$
quantifies (in relative terms) the transversality of the tangent spaces to S at the points $u,u', u"$ , and is therefore also a manifestation of the curvature profile of S; see Figure 1.
(iv) The factor $\langle u"-u',N(u")\rangle $ is different in nature as it explicitly relates to the positions of the points $u', u"$ . It is instructive to use the fact that $u'-u"\in T_uS$ to write this as
$$ \begin{align*}\langle u"-u',P_{T_uS}N(u")\rangle=|N(u)\wedge N(u")||u"-u'|\left\langle \tfrac{\displaystyle u"-u'}{\displaystyle |u"-u'|},\tfrac{\displaystyle P_{T_uS}N(u")}{\displaystyle |P_{T_uS}N(u")|}\right\rangle.\end{align*} $$
We observe that the inner product in the final expression above quantifies the extent to which $u"$ is displaced from the line through $u'$ in the direction $P_{T_uS}N(u")$ ; see Figure 2.
(v) The Jacobian J is scale-invariant in the sense that an isotropic scaling of S leaves J unchanged. This is apparent from the definition of J but is also manifest in the formula (4.7).

Remark 4.7 (Examples).

Proposition 4.5 is easily applied to examples.

(i) If $S=\mathbb {P}^{n-1}$ , the paraboloid (2.14), then a careful calculation using Proposition 4.5 reveals that
$$ \begin{align*}J(u,u')=2^{n-1}\left(\frac{1+4|x"|^2}{1+4|x|^2}\right)^{1/2}, \end{align*} $$
where we are writing $u=(x,|x|^2)$ , $u'=(x',|x'|^2)$ , $u":=R_uu'=(x",|x"|^2)$ . As should be expected from our analysis in Section 2, the parabolic Wigner distribution $W_{\mathbb {P}^{n-1}}$ may be pulled back to the classical Wigner distribution via a suitable map $\Phi :\mathbb {R}^d\times \mathbb {R}^d\rightarrow T\mathbb {P}^d$ ; in this case $\Phi (x,v)=((x,|x|^2),P_{T_{(x,|x|^2)}\mathbb {P}^d}(v,0))$ . This uses the simple geometric fact that the coplanarity condition (4.3) transforms to a colinearity condition in parameter space. More specifically, if for a function $g:\mathbb {R}^d\rightarrow \mathbb {C}$ we let $Lg(x,|x|^2)=(1+4|x|^2)^{-\frac {1}{2}}g(x)$ , and for a function $h:T\mathbb {P}^d\rightarrow \mathbb {C}$ we let $Uh(x,v)=(1+4|x|^2)^{1/2}h(\Phi (x,v))$ , then
$$ \begin{align*}UW_{\mathbb{P}^d}(Lg,Lg)=W(g,g).\end{align*} $$
Moreover, $X_{\mathbb {P}^d}^*h=\rho (Uh)$ , allowing one to deduce the quantum-mechanical phase-space representation (2.5) from the forthcoming Proposition 4.8. We refer to [Reference Alonso3] (p. 353) for a similar remark.
(ii) If $S=\mathbb {S}^{n-1}$ , evidently $K\equiv 1$ and $N(\omega )=\omega $ , and to be consistent with Section 3 we use $\omega $ rather than u to represent a point. We may use the explicit formula (3.5) to write
$$ \begin{align*} \frac{|N(\omega')\wedge N(\omega")|}{|N(\omega)\wedge N(\omega')|}=\frac{(1-(\omega'\cdot\omega")^{2})^{\frac{1}{2}}}{(1-(\omega\cdot\omega')^{2})^{\frac{1}{2}}}=\frac{|P_{\langle\omega'\rangle^{\perp}}\omega"|}{|P_{\langle\omega\rangle^{\perp}}\omega'|}=2|\omega\cdot\omega'|. \end{align*} $$
On the other hand, since $\langle \omega "-\omega ',N(\omega ")\rangle =\langle \omega "-\omega ',P_{\langle \omega \rangle ^{\perp }}\omega "\rangle $ , projecting both sides of (3.5) to $\langle \omega \rangle ^{\perp }$ yields
$$ \begin{align*} \left|\frac{\langle \omega"-\omega',N(\omega")\rangle}{\langle P_{T_{\omega"}S}N(\omega),(\mathrm{d}N_{\omega"})^{-1}(P_{T_{\omega"}S}N(\omega))\rangle}\right|=\frac{|\langle \omega"-\omega',\omega'\rangle|}{|1-(\omega\cdot\omega")^{2}|}=\frac{|1-(\omega'\cdot\omega")|}{|1-(\omega\cdot\omega')^{2}|}=2, \end{align*} $$
since $\omega \cdot \omega " = \omega \cdot \omega '$ and $\omega '\cdot \omega "=2(\omega \cdot \omega ')^{2}-1$ . Altogether we conclude that
$$ \begin{align*}J(\omega,\omega')=2^{n-1}|\omega\cdot\omega'|^{n-2},\end{align*} $$
as appears in (3.6).

We now come to the phase-space representation of $|\widehat {g\mathrm {d}\sigma }|^2$ , and we begin by defining an auxiliary function $f:S\times \mathbb {R}^n\rightarrow \mathbb {R}$ by

$$ \begin{align*}f(u,x)=\int_{S}g(u')\overline{g(R_uu')}e^{-2\pi ix\cdot(u'-R_uu')}J(u,u')\mathrm{d}\sigma(u'), \end{align*} $$

so that $W_S(g,g)$ is the restriction of f to the tangent bundle $TS:=\{(u,v):u\in S, v\in T_uS\}$ . As in the spherical case, we continue to have the marginal identity

(4.11)

$$ \begin{align} \int_{S}f(u,x)\mathrm{d}\sigma(u)=|\widehat{g\mathrm{d}\sigma}(x)|^2 \end{align} $$

by Fubini’s theorem and the definition of J. While we shall not need to use it, it is pertinent to also note the second marginal property

(4.12)

$$ \begin{align} \int_{T_uS}W_S(g,g)(u,v)\mathrm{d}v=|g(u)|^2 \end{align} $$

here (possibly subject to an additional regularity assumption on S) referred to in the introduction; we refer to Section 8 for clarification of this, along with the sense in which it holds as a pointwise identity. Another key property is that f satisfies the transport equation

(4.13)

$$ \begin{align} N(u)\cdot\nabla_xf=0, \end{align} $$

meaning that $f(u,x)=W_S(g,g)(u,P_{T_uS}x)$ , where $P_{T_uS}:\mathbb {R}^n\rightarrow T_uS$ is the orthogonal projection onto $T_uS$ .

Proposition 4.8 (General phase-space representation).

(4.14)

$$ \begin{align} |\widehat{g\mathrm{d}\sigma}|^2=X_S^*W_S(g,g) \end{align} $$

where $X_Sw(u,v):=Xw(N(u),v)$ , the pullback of $Xw$ under the Gauss map

$$ \begin{align*}TS\ni (u,v)\mapsto (N(u),v)\in T\mathbb{S}^{d-1}. \end{align*} $$

We note that for a phase-space function $h:TS\rightarrow \mathbb {C}$ we have the explicit expression

$$ \begin{align*}X_S^*h(x)=\int_{S}h(u,P_{T_uS}x)\mathrm{d}\sigma(u).\end{align*} $$

Proof of Proposition 4.8.

By (4.11), (4.13) and Fubini’s theorem,

$$ \begin{align*} \begin{aligned} \int_{\mathbb{R}^n}|\widehat{g\mathrm{d}\sigma}(x)|^2w(x)\mathrm{d}x&=\int_{\mathbb{R}^n}\int_{S}f(u,x)\mathrm{d}\sigma(u)w(x)\mathrm{d}x\\ &=\int_{S}\int_{T_uS}f(u,v)\left(\int_{(T_uS)^\perp}w(v+z)\mathrm{d}z\right)\mathrm{d}v\mathrm{d}\sigma(u)\\ &=\int_S\int_{T_uS}W_S(g,g)(u,v)Xw(N(u),v)\mathrm{d}v\mathrm{d}\sigma(u)\\ &=\int_{\mathbb{R}^n}X_S^*W(g,g)(x)w(x)\mathrm{d}x \end{aligned} \end{align*} $$

for all test functions w.

Remark 4.9 (A polarised form).

The polarised form

$$ \begin{align*}\widehat{g_1\mathrm{d}\sigma}\:\overline{\widehat{g_2\mathrm{d}\sigma}}=X_S^*W_S(g_1,g_2) \end{align*} $$

of (4.14) may be established similarly, and indeed may be deduced directly from (4.14).

Remark 4.10. There is a point of contact here with [Reference Bennett, Nakamura and Shiraki15], where among other things it is shown that the classical Radon transform fails to distinguish $|\widehat {g\mathrm {d}\sigma }|^2$ from $X_S^*\nu $ for a large class of distributions $\nu $ on $TS$ , provided a suitable transversality condition is satisfied. Perhaps unsurprisingly, $W_S(g,g)$ is easily seen to be an example of such a distribution.

We are now ready to state or main theorems (Theorems 1.7 and 1.8) in full.

Theorem 4.11 (L ² Sobolev–Stein inequality).

Suppose that S is a smooth strictly convex surface with curvature quotient $Q(S)$ , and $s<\frac {n-1}{2}$ . Then there is a dimensional constant c such that

(4.15)

$$ \begin{align} \int_{\mathbb{R}^n}|\widehat{g\mathrm{d}\sigma}(x)|^2w(x)\mathrm{d}x\leq c Q(S)^{\frac{5n-8}{4}}\int_S I_{S,2s}(|g|^2,|g|^2)(u)^{1/2}\|X_Sw(u,\cdot)\|_{\dot{H}^s(T_u S)}\mathrm{d}\sigma(u), \end{align} $$

where

(4.16)

$$ \begin{align} I_{S,s}(g_1,g_2)(u):=\int_S\frac{g_1(u')g_2(R_u u')}{|u'-R_u u'|^s}J(u,u')\mathrm{d}\sigma(u'). \end{align} $$

Remark 4.12. The S-carried fractional integral $I_{S,s}$ is natural for a number of reasons relating to the presence of the Jacobian factor J. In particular, it is symmetric thanks to (4.9) (a property that is analogous to the conjugate symmetry of the Wigner transform $W_S$ ), and as we shall see in Section 7, its Lebesgue space bounds do not depend on any lower bound on the curvature of S. The restriction $s<\frac {n-1}{2}$ ensures that the kernel of $I_{S,s}$ is locally integrable.

Theorem 4.13 (L ² Sobolev–Mizohata–Takeuchi inequality).

Suppose that S is a smooth strictly convex surface with curvature quotient $Q(S)$ , and $s<\frac {n-1}{2}$ . Then there exists a constant c, depending on at most n, s and the diameter of S, such that

(4.17)

$$ \begin{align} \int_{\mathbb{R}^n}|\widehat{g\mathrm{d}\sigma}(x)|^2w(x)\mathrm{d}x\leq cQ(S)^{\frac{9n-12}{4}}\sup_{u\in\mathrm{ supp\,}^*(g)}\|X_Sw(u,\cdot)\|_{\dot{H}^s(T_u S)}\|g\|_{L^2(S)}^2, \end{align} $$

where $ \mathrm {supp\,}^*(g):=\{u\in S: R_u u'\in \mathrm {supp\,}(g)\;\text { for some }\; u'\in \mathrm {supp\,}(g)\}$ .

Remark 4.14. We remark that $\mathrm {supp\,}(g)\subseteq \mathrm {supp\,}^*(g)$ , and often this containment is strict. When S is the sphere, $\mathrm { supp\,}^*(g)$ is the ‘support midpoint set’, consisting of all geodesic midpoints of pairs of points from the support of g. Hence $\mathrm {supp\,}^*(g)\subseteq \mathrm {cvx\,}\mathrm {supp\,}(g)$ in this case, where $\mathrm {cvx\,}$ forms the geodesic convex hull. More generally, $\mathrm { supp\,}^*(g)\subseteq N^{-1}\mathrm {cvx\,}(N(\mathrm {supp\,}(g)))$ , so that

$$ \begin{align*}\int_{\mathbb{R}^n}|\widehat{g\mathrm{d}\sigma}(x)|^2w(x)\mathrm{d}x\leq cQ(S)^{\frac{9n-12}{4}}\sup_{\omega\in\mathrm{cvx\,}(N(\mathrm{supp\,}(g)))}\|Xw(\omega,\cdot)\|_{\dot{H}^s(T_u S)}\|g\|_{L^2(S)}^2. \end{align*} $$

Remark 4.15. While we expect that the power of $Q(S)$ in the statement of Theorem 4.11 is sharp when $n=2$ , it seems unlikely that it is in higher dimensions. The power of $Q(S)$ in the statement of Theorem 4.13 is of course larger still, incurring extra factors from the bounds on the bilinear fractional integrals $I_{S,s}$ in Section 7.

4.2 Proof of the Sobolev–Stein inequality (Theorem 1.7)

In this section we prove Theorem 1.7, or more specifically, Theorem 4.11. We begin with an application of Proposition 4.8 and the Cauchy–Schwarz inequality to write

(4.18)

$$ \begin{align} \int_{\mathbb{R}^n}|\widehat{g\mathrm{d}\sigma}|^2w\leq \int_S \|W_S(g,g)(u,\cdot)\|_{\dot{H}^{-s}(T_u S)}\|X_Sw(u,\cdot)\|_{\dot{H}^s(T_u S)}\mathrm{d}\sigma(u) \end{align} $$

for any $s\in \mathbb {R}$ . In order to estimate the Sobolev norm of the Wigner distribution above we fix $u\in S$ and make the change of variables

(4.19)

$$ \begin{align} \xi=u'-R_u u'. \end{align} $$

Since S is the graph of a strictly convex function, the map $u'\mapsto \xi $ is a bijection from S to a subset U of $T_uS$ . To see this it suffices to establish injectivity, and hence we look to show that $u'-R_uu'\not =\tilde {u}'-R_u\tilde {u}'$ for $u'\neq \tilde {u}'$ . We may assume that $u'-R_uu'$ and $\tilde {u}'-R_u\tilde {u}'$ are parallel, as otherwise the desired conclusion is immediate. Observe that $u'-\tilde {u}'\not \in T_uS$ , as otherwise strict convexity of the level sets of S (sections of S by translates of $T_uS$ ) would force $u'=\tilde {u}'$ or $R_uu'=\tilde {u}'$ ; see Figure 2. Since $u'-\tilde {u}'\not \in T_uS$ and S is the graph of a function, the level sets of S through $u'$ and $\tilde {u}'$ respectively, when projected onto $T_uS$ , are both enclosed by the supporting hyperplanes depicted in Figure 2; this may require interchanging the roles of $u'$ and $\tilde {u}'$ , as we may. Since $u'-R_uu'$ and $\tilde {u}'-R_u\tilde {u}'$ are parallel, it follows that $|\tilde {u}'-R_u\tilde {u}'|<|u'-R_uu'|$ , and thus $u'-R_uu'\not =\tilde {u}'-R_u\tilde {u}'$ . As a result of this bijectivity,

$$ \begin{align*}\|W_{S}(g,g)(u,\cdot)\|_{H^{-s}(T_u S)}^2=\int_{T_u S}\left|\int_U g(u'(\xi))\overline{g(R_u u'(\xi))}|\xi|^{-s}e^{-2\pi iv\cdot\xi}J(u,u'(\xi))\frac{\mathrm{d}\xi}{\widetilde{J}(u,u'(\xi))}\right|^2\mathrm{d}v, \end{align*} $$

where $\widetilde {J}(u,u')$ is the Jacobian of the map $u'\mapsto \xi $ . Hence by Plancherel’s theorem on $T_u S$ ,

$$ \begin{align*} \begin{aligned} \|W_{S}(g,g)(u,\cdot)\|_{H^{-s}(T_u S)}^2&=\int_{U}\left|g(u'(\xi))\overline{g(R_u u'(\xi))}|\xi|^{-s}\frac{J(u,u'(\xi))}{\widetilde{J}(u,u'(\xi))}\right|^2\mathrm{d}\xi\\ &=\int_S\frac{|g(u')|^2|g(R_u u')|^2}{|u'-R_u u'|^{2s}}\frac{J(u,u')^2}{\widetilde{J}(u,u')}\mathrm{d}\sigma(u'). \end{aligned} \end{align*} $$

In order to complete the proof of Theorem 4.11 it therefore suffices to prove that

(4.20)

$$ \begin{align} \frac{J(u,u')}{\widetilde{J}(u,u')}\lesssim Q(S)^{\frac{5n-8}{2}} \end{align} $$

with implicit constant depending only on the dimension. We do this in two steps.

Step 1: Bounding $\widetilde {J}(u,u')$

The goal here is to obtain a suitable lower bound for $\widetilde {J}(u,u')$ .

Proposition 4.16. We have that

(4.21)

$$ \begin{align} \widetilde{J}(u,u')\geq (1+\Delta(u,u')^{2})^{\frac{1}{2}} \end{align} $$

for all $u,u'\in S$ .

Proof. Let $u\in S$ be fixed. The Jacobian $\widetilde {J}$ of the change of variables

$$ \begin{align*} \xi(u')=u'-R_{u}u', \end{align*} $$

may be expressed as

(4.22)

$$ \begin{align} \widetilde{J}(u,u')=\frac{|(\mathrm{d}\xi)_{u'}(v_{1})\wedge\cdots\wedge (\mathrm{d}\xi)_{u'}(v_{n-1})|}{|v_{1}\wedge\cdots\wedge v_{n-1}|}, \end{align} $$

where $v_{1},\ldots ,v_{n-1}$ is a basis for $T_{u'}S$ . We remark that

$$ \begin{align*}(\mathrm{d}\xi)_{u'}(v_{1})\wedge\cdots\wedge (\mathrm{d}\xi)_{u'}(v_{n-1})\in\Lambda^{n-1}(T_{u}S)\quad\mathrm{and}\quad v_{1}\wedge\cdots\wedge v_{n-1}\in\Lambda^{n-1}(T_{u'}S),\end{align*} $$

and we identify the exterior algebras $\Lambda ^{n-1}(T_{u'}S)$ and $\Lambda ^{n-1}(T_{u}S)$ with subspaces of $\Lambda ^{n-1}(\mathbb {R}^{n})$ via the natural embedding induced by the inclusions $T_{u'}S\subset \mathbb {R}^{n}$ and $T_{u}S\subset \mathbb {R}^{n}$ , respectively.

It will be convenient to fix $u'$ and express (4.22) in terms of unit velocities of trajectories along smooth curves in S emanating from $u'$ . In what follows $c:I\rightarrow S$ will denote the arc-length parametrisation of such a curve, where I is an open interval containing $0$ such that $c(0)=u'$ . If $\mathcal {C}$ denotes the set of all such mappings c, then evidently

$$ \begin{align*} T_{u'}S = \langle\left\{\dot{c}(0): c\in\mathcal{C}\right\}\rangle. \end{align*} $$

By the strict convexity of S, the $(n-1)$ -dimensional spaces $T_{u'}S$ and $T_{u}S$ intersect in an $(n-2)$ -dimensional subspace $\mathcal {H}$ . We then pick curves $c_{1},\ldots ,c_{n-2}\in \mathcal {C}$ such that

$$ \begin{align*}\mathcal{H}=\langle \dot{c}_{1}(0),\ldots,\dot{c}_{n-2}(0)\rangle,\end{align*} $$

and the set $\{\dot {c}_{i}(0)\}_{1\leq i\leq n-2}$ is orthonormal. To obtain an orthonormal basis for $T_{u'}S$ , we simply take any other curve $c_{n-1}\in \mathcal {C}$ such that $\dot {c}_{n-1}(0)\in \mathcal {H}^{\perp }\cap T_{u'}S$ . There is one more degree of freedom in choosing $c_{n-1}$ , and we assume without loss of generality that $\dot {c}_{n-1}(0)\cdot N(u)\geq 0$ . This gives

$$ \begin{align*}T_{u'}S=\langle \dot{c}_{1}(0),\ldots,\dot{c}_{n-2}(0),\dot{c}_{n-1}(0)\rangle.\end{align*} $$

Since

$$ \begin{align*} (\mathrm{d}\xi)_{u'}(\dot{c}_{i}(0))= (\xi\circ c_{i})'(0) = \dot{c}_{i}(0)- (\mathrm{d}R_{u})_{u'}(\dot{c}_{i}(0)),\quad 1\leq i\leq n-1, \end{align*} $$

then, since $|\dot {c}_{1}(0)\wedge \cdots \wedge \dot {c}_{n-1}(0)|=1$ by orthonormality of the chosen basis of $T_{u'}S$ ,

(4.23)

$$ \begin{align} \begin{aligned} \displaystyle\widetilde{J}(u,u')&\displaystyle =|(\mathrm{d}\xi)_{u'}(\dot{c}_{1}(0))\wedge\cdots\wedge (\mathrm{d}\xi)_{u'}(\dot{c}_{n-1}(0))| \\ &\displaystyle= |(\dot{c}_{1}(0)- (\mathrm{d}R_{u})_{u'}(\dot{c}_{1}(0)))\wedge\cdots\wedge (\dot{c}_{n-1}(0)- (\mathrm{d}R_{u})_{u'}(\dot{c}_{n-1}(0)))| \\ &\displaystyle= |W_{1}-W_{2}|, \end{aligned} \end{align} $$

where

$$ \begin{align*} \begin{aligned} \displaystyle W_{1}&\displaystyle := (\dot{c}_{1}(0)- (\mathrm{d}R_{u})_{u'}(\dot{c}_{1}(0)))\wedge\cdots\wedge (\dot{c}_{n-2}(0)- (\mathrm{d}R_{u})_{u'}(\dot{c}_{n-2}(0)))\wedge \dot{c}_{n-1}(0), \\ \displaystyle W_{2}&\displaystyle := (\dot{c}_{1}(0)- (\mathrm{d}R_{u})_{u'}(\dot{c}_{1}(0)))\wedge\cdots\wedge (\dot{c}_{n-2}(0)- (\mathrm{d}R_{u})_{u'}(\dot{c}_{n-2}(0)))\wedge (\mathrm{d}R_{u})_{u'}(\dot{c}_{n-1}(0)). \end{aligned} \end{align*} $$

The next claim collects a few useful facts about the action of $(\mathrm {d}R_{u})_{u'}$ on $\mathcal {H}$ .

Claim 4.17. The following hold:

1. The subspace $\mathcal {H}=T_{u'}S\cap T_{u}S$ generated by the set of vectors $\{\dot {c}_{1}(0),\ldots ,\dot {c}_{n-2}(0)\}$ is invariant under the map $(\mathrm {d}R_{u})_{u'}$ . Moreover, $\left .(\mathrm {d}R_{u})_{u'}\right |{}_{\mathcal {H}}:\mathcal {H}\longrightarrow \mathcal {H}$ is an isomorphism. Equivalently,
(4.24) $$ \begin{align} \mathcal{H}=\langle \dot{c}_{1}(0),\ldots,\dot{c}_{n-2}(0)\rangle=\langle (\mathrm{d}R_{u})_{u'}(\dot{c}_{1}(0)),\ldots,(\mathrm{d}R_{u})_{u'}(\dot{c}_{n-2}(0))\rangle. \end{align} $$
2. Let $M_{u,u'}:=\left .(\mathrm {d}R_{u})_{u'}\right |{}_{\mathcal {H}}: \mathcal {H}\longrightarrow \mathcal {H}$ denote the restriction of $(\mathrm {d}R_{u})_{u'}$ to the invariant subspace $\mathcal {H}$ . Then $I-M_{u,u'}:\mathcal {H}\rightarrow \mathcal {H}$ satisfies
(4.25) $$ \begin{align} \det{(I-M_{u,u'})} \geq 1. \end{align} $$

Proof. Let $\omega := N(u)$ . Notice that the coplanarity condition (4.3) implies that

(4.26)

$$ \begin{align} v_{1}:=\frac{P_{\langle\omega\rangle^{\perp}}N(u')}{|P_{\langle\omega\rangle^{\perp}}N(u')|}=-\frac{P_{\langle\omega\rangle^{\perp}}N(u")}{|P_{\langle\omega\rangle^{\perp}}N(u")|}=:-v_{2}. \end{align} $$

On the other hand, $v_{1}$ and $v_{2}$ are the outward normal vectors (in $T_{u}S$ ) of the convex submanifold

(4.27)

$$ \begin{align} \mathcal{S}_{u,u'}:=S\cap (T_{u}S + u') \end{align} $$

at $u'$ and $u"$ respectively, hence

$$ \begin{align*} T_{u'}\mathcal{S}_{u,u'} = T_{u"}\mathcal{S}_{u,u'}, \end{align*} $$

from which (4.24) follows; see Figure 2. Observe also that on $\mathcal {S}_{u,u'}$ we have

(4.28)

$$ \begin{align} R_{u}u' = \widetilde{N}^{-1}(-\widetilde{N}(u')), \end{align} $$

where $\widetilde {N}:\mathcal {S}_{u,u'}\rightarrow \mathbb {S}^{n-2}$ is the Gauss map of $\mathcal {S}_{u,u'}\subset u'+T_{u}S$ . Computing derivatives, $\left .(\mathrm {d}R_{u})_{u'}\right |{}_{\mathcal {H}}: \mathcal {H}\longrightarrow \mathcal {H}$ satisfies

$$ \begin{align*} (\mathrm{d}R_{u})_{u'} = -\mathrm{d}\widetilde{N}^{-1}_{-\widetilde{N}(u')}\circ\mathrm{d}\widetilde{N}_{u'}. \end{align*} $$

Finally, since $\mathrm {d}\widetilde {N}^{-1}_{-\widetilde {N}(u')}$ and $\mathrm {d}\widetilde {N}_{u'}$ are positive definite (recall that our assumptions on S imply positive definiteness of $\mathrm {d}N_{u}$ for all $u\in S$ , hence the same holds for $\mathrm {d}\widetilde {N}_{u}$ ) the product $\mathrm {d}\widetilde {N}^{-1}_{-\widetilde {N}(u')}\circ \mathrm {d}\widetilde {N}_{u'}$ has positive eigenvalues, therefore

$$ \begin{align*} \det{(I-M_{u,u'})}=\det{(I+\mathrm{d}\widetilde{N}^{-1}_{-\widetilde{N}(u')}\circ\mathrm{d}\widetilde{N}_{u'})}\geq 1.\\[-41pt] \end{align*} $$

The next claim contains three key identities involving $W_{1}$ and $W_{2}$ .

Claim 4.18. The following identities hold:

$$ \begin{align*} \begin{aligned} \displaystyle \langle W_{1},W_{1}\rangle &\displaystyle = \det{(I-M_{u,u'})}^{2}, \\ \displaystyle\langle W_{1},W_{2}\rangle &=\displaystyle \det{(I-M_{u,u'})}^{2}\langle (\mathrm{d}R_{u})_{u'}(\dot{c}_{n-1}(0)),\dot{c}_{n-1}(0)\rangle \\ \displaystyle \langle W_{2},W_{2}\rangle&\displaystyle = \Delta(u,u')^{2}\det{(M_{u,u'}^{-1}-I)}^{2}. \end{aligned} \end{align*} $$

Proof. Let $\textbf {0}_{1\times (n-2)}$ be the $1\times (n-2)$ zero row and let $\textbf {X}_{u,u'}$ be the $(n-2)\times (n-2)$ matrix whose $(i,j)$ entry is given by

$$ \begin{align*} (\textbf{X}_{u,u'})_{i,j}:= \langle \dot{c}_{i}(0)- (\mathrm{d}R_{u})_{u'}(\dot{c}_{i}(0)),\dot{c}_{j}(0)- (\mathrm{d}R_{u})_{u'}(\dot{c}_{j}(0))\rangle. \end{align*} $$

Observe that

$$ \begin{align*} \begin{aligned} \displaystyle\langle W_{1},W_{1}\rangle&\displaystyle=\det{\begin{pmatrix} \textbf{X}_{u,u'} & \textbf{0}_{1\times (n-2)}^{\top} \\ \textbf{0}_{1\times (n-2)} & 1 \end{pmatrix}} =\displaystyle\det{(\textbf{X}_{u,u'})}=\displaystyle \det{(I-M_{u,u'})}^{2}, \end{aligned} \end{align*} $$

where we used the facts that $\mathcal {H}$ is invariant under $(\mathrm {d}R_{u})_{u'}$ (as verified in Claim 4.17) and that $\dot {c}_{n-1}(0)$ is orthogonal to $\mathcal {H}$ . Now let $\textbf {Y}_{u,u'}$ be the $(n-2)\times (n-2)$ matrix whose $(i,j)$ entry is given by

$$ \begin{align*} (\textbf{Y}_{u,u'})_{i,j}:= \langle (\mathrm{d}R_{u})_{u'}^{-1}(\dot{c}_{i}(0))-\dot{c}_{i}(0),(\mathrm{d}R_{u})_{u'}^{-1}(\dot{c}_{j}(0))-\dot{c}_{j}(0)\rangle. \end{align*} $$

Analogously,

$$ \begin{align*} \begin{aligned} \displaystyle\langle W_{2},W_{2}\rangle &\displaystyle = |(\dot{c}_{1}(0)- (\mathrm{d}R_{u})_{u'}(\dot{c}_{1}(0)))\wedge\cdots\wedge (\dot{c}_{n-2}(0)- (\mathrm{d}R_{u})_{u'}(\dot{c}_{n-2}(0)))\wedge (\mathrm{d}R_{u})_{u'}(\dot{c}_{n-1}(0))|^{2} \\ &\displaystyle = \Delta(u,u')^{2}\left|\left(\bigwedge_{j=1}^{n-2}(\mathrm{d}R_{u})_{u'}^{-1}[\dot{c}_{j}(0)- (\mathrm{d}R_{u})_{u'}(\dot{c}_{j}(0))]\right)\wedge \dot{c}_{n-1}(0)\right|^{2} \\ &\displaystyle = \Delta(u,u')^{2}\left|\left(\bigwedge_{j=1}^{n-2}[(\mathrm{d}R_{u})_{u'}^{-1}-I](\dot{c}_{j}(0))\right)\wedge \dot{c}_{n-1}(0)\right|^{2} \\ &=\Delta(u,u')^{2}\det(\textbf{Y}_{u,u'}) \\ &= \Delta(u,u')^{2}\det{(M_{u,u'}^{-1}-I)}^{2}. \end{aligned} \end{align*} $$

Finally,

$$ \begin{align*} \begin{aligned} \displaystyle\langle W_{1},W_{2}\rangle &\displaystyle= \det{\begin{pmatrix} \textbf{X}_{u,u'} & A_{(n-2)\times 1} \\ \textbf{0}_{1\times (n-2)} & \langle (\mathrm{d}R_{u})_{u'}(\dot{c}_{n-1}(0)),\dot{c}_{n-1}(0)\rangle \end{pmatrix}} \\ &=\displaystyle\det{(I-M_{u,u'})}^{2}\langle (\mathrm{d}R_{u})_{u'}(\dot{c}_{n-1}(0)),\dot{c}_{n-1}(0)\rangle, \end{aligned} \end{align*} $$

where $A_{(n-2)\times 1}$ is a $(n-2)\times 1$ column that does not feature in the final expression.

Expanding $|W_{1}-W_{2}|^{2}$ using the standard scalar product on the exterior algebra $\Lambda ^{n-1}(\mathbb {R}^{n})$ ,

(4.29)

$$ \begin{align} \begin{aligned} |W_{1}-W_{2}|^{2} &= \langle W_{1}, W_{1}\rangle - 2\langle W_{1}, W_{2}\rangle + \langle W_{2}, W_{2}\rangle \\&= \det{(I-M_{u,u'})}^{2} - 2\det{(I-M_{u,u'})}^{2}\langle (\mathrm{d}R_{u})_{u'}(\dot{c}_{n-1}(0)),\dot{c}_{n-1}(0)\rangle\\& \quad + \Delta(u,u')^{2}\det{(M_{u,u'}^{-1}-I)}^{2}, \end{aligned} \end{align} $$

thanks to Claim 4.18. We continue with the following key observation:

Claim 4.19. Under (1.12), it holds that

$$ \begin{align*} \langle (\mathrm{d}R_{u})_{u'}(\dot{c}_{n-1}(0)),\dot{c}_{n-1}(0)\rangle < 0. \end{align*} $$

Proof. Recall that $\dot {c}_{n-1}(0)\cdot N(u)> 0$ by assumption, therefore differentiating at $t=0$ the identity

$$ \begin{align*}\langle R_{u}(c_{n-1}(t)) - c_{n-1}(t), N(u)\rangle =0\end{align*} $$

gives $\langle (\mathrm {d}R_{u})_{u'}(\dot {c}_{n-1}(0)), N(u)\rangle>0$ . Next, observe that $N(u'), N(u), N(u")$ and $\dot {c}_{n-1}(0)$ are in $\mathcal {H}^{\perp }$ , the (two-dimensional) orthogonal complement of $\mathcal {H}$ in $\mathbb {R}^{n}$ . Since $N(p)\cdot N(q)\geq \frac {1}{2}$ for all $p,q\in S$ by assumption, the angles $\alpha _{1}$ (between $N(u')$ and $N(u)$ ) and $\alpha _{2}$ (between $N(u)$ and $N(u")$ ) are such that $0<\alpha _{1}+\alpha _{2}<\frac {\pi }{2}$ . Since $N(u)\in \mathcal {H}^{\perp }$ , we have by the self-adjointness of the projection operator $P_{\mathcal {H}^{\perp }}$ ,

$$ \begin{align*}0<\langle (\mathrm{d}R_{u})_{u'}(\dot{c}_{n-1}(0)), N(u) \rangle = \langle (\mathrm{d}R_{u})_{u'}(\dot{c}_{n-1}(0)), P_{\mathcal{H}^{\perp}}N(u)\rangle = \langle P_{\mathcal{H}^{\perp}}[(\mathrm{d}R_{u})_{u'}(\dot{c}_{n-1}(0))], N(u)\rangle ,\end{align*} $$

which implies that $P_{\mathcal {H}^{\perp }}[(\mathrm {d}R_{u})_{u'}(\dot {c}_{n-1}(0))]$ is in the upper-half space of $\mathcal {H}^{\perp }$ (here we are assuming without loss of generality that $N(u)=e_{2}$ , the second canonical vector of $\mathcal {H}^{\perp }\cong \mathbb {R}^{2}$ ). On the other hand, $(\mathrm {d}R_{u})_{u'}(\dot {c}_{n-1}(0))\in T_{u"}S$ , hence $\langle P_{\mathcal {H}^{\perp }}[(\mathrm {d}R_{u})_{u'}(\dot {c}_{n-1}(0))], N(u") \rangle =0$ , that is, the angle between $N(u")$ and $P_{\mathcal {H}^{\perp }}[(\mathrm {d}R_{u})_{u'}(\dot {c}_{n-1}(0))]$ is $\frac {\pi }{2}$ . Since $\theta :=\frac {\pi }{2}-(\alpha _{1}+\alpha _{2})$ is strictly positive, the angle $\gamma :=\frac {\pi }{2}+\theta $ between $P_{\mathcal {H}^{\perp }}[(\mathrm {d}R_{u})_{u'}(\dot {c}_{n-1}(0))]$ and $\dot {c}_{n-1}(0)$ is strictly larger than $\frac {\pi }{2}$ (see Figure 3), which implies that

$$ \begin{align*}\langle P_{\mathcal{H}^{\perp}}[(\mathrm{d}R_{u})_{u'}(\dot{c}_{n-1}(0))], \dot{c}_{n-1}(0)\rangle < 0.\end{align*} $$

Figure 3 A graphical representation of the proof of Claim 4.19.

Finally, again by the self-adjointness of $P_{\mathcal {H}^{\perp }}$ ,

$$ \begin{align*} \begin{aligned} \displaystyle \langle (\mathrm{d}R_{u})_{u'}(\dot{c}_{n-1}(0)), \dot{c}_{n-1}(0)\rangle &\displaystyle= \langle (\mathrm{d}R_{u})_{u'}(\dot{c}_{n-1}(0)), P_{\mathcal{H}^{\perp}}[\dot{c}_{n-1}(0)]\rangle \\ &\displaystyle= \langle P_{\mathcal{H}^{\perp}}[(\mathrm{d}R_{u})_{u'}(\dot{c}_{n-1}(0))], \dot{c}_{n-1}(0)\rangle \\ &\displaystyle < 0, \end{aligned} \end{align*} $$

which concludes the proof of the claim.

Returning to (4.29),

(4.30)

$$ \begin{align} \begin{aligned} |W_{1}-W_{2}|^{2} &\displaystyle= \det{(I-M_{u,u'})}^{2} - 2\det{(I-M_{u,u'})}^{2}\langle (\mathrm{d}R_{u})_{u'}(\dot{c}_{n-1}(0)),\dot{c}_{n-1}(0)\rangle\\& \quad + \Delta(u,u')^{2}\det{(M_{u,u'}^{-1}-I)}^{2}\\&\geq 1+\Delta(u,u')^{2}, \end{aligned} \end{align} $$

by (4.25) and Claim 4.19, which concludes the proof of Proposition 4.16.

Step 2: Bounding $J/\widetilde {J}$

Let $\lambda _{1}(p)\leq \lambda _{2}(p)\leq \cdots \leq \lambda _{n-1}(p)$ be the eigenvalues of the shape operator $\mathrm {d}N$ at p. Since $u"-u'\in \langle N(u)\rangle ^{\perp }$ ,

$$ \begin{align*} \begin{aligned} \displaystyle\left|\frac{\langle u"-u',N(u")\rangle}{\langle P_{T_{u"}S}N(u),(\mathrm{d}N_{u"})^{-1}(P_{T_{u"}S}N(u))\rangle}\right|&\displaystyle = \left|\frac{\langle u"-u',N(u")-N(u)\rangle}{\langle P_{T_{u"}S}N(u),(\mathrm{d}N_{u"})^{-1}(P_{T_{u"}S}N(u))\rangle}\right| \\ &\leq\displaystyle \lambda_{n-1}(u") \frac{|\langle u"-u',N(u")-N(u)\rangle|}{|P_{T_{u"}S}N(u)|^{2}}. \end{aligned} \end{align*} $$

Using the fact that $|P_{T_{u"}S}N(u)|=|N(u")\wedge N(u)|\approx |N(u")-N(u)|$ , which follows from (1.12), we have

$$ \begin{align*} \begin{aligned} \displaystyle J(u,u')&\displaystyle =\left(\frac{|N(u')\wedge N(u")|}{|N(u)\wedge N(u')|}\right)^{n-2}\left|\frac{\langle u"-u',N(u")\rangle}{\langle P_{T_{u"}S}N(u),(\mathrm{d}N_{u"})^{-1}(P_{T_{u"}S}N(u))\rangle}\right|\frac{K(u)}{K(u")} \\ &\displaystyle \lesssim\left(\frac{|N(u')-N(u")|}{|N(u)- N(u')|}\right)^{n-2}\frac{|\langle u"-u',N(u")-N(u)\rangle|}{|P_{T_{u"}S}N(u)|^{2}}\frac{\prod_{j=1}^{n-1}\lambda_{j}(u)}{\prod_{j=1}^{n-1}\lambda_{j}(u")} \lambda_{n-1}(u") \\ &\lesssim\displaystyle\left(\frac{|u'-u"|}{|u-u'|}\right)^{n-2}\left(\frac{\sup_{p}\lambda_{n-1}(p)}{\inf_{p}\lambda_{1}(p)}\right)^{n-2} \frac{|u"-u'|\cdot|N(u")-N(u)|}{|N(u")-N(u)|^{2}}\frac{\prod_{j=1}^{n-2}\lambda_{j}(u)}{\prod_{j=1}^{n-2}\lambda_{j}(u")}\lambda_{n-1}(u) \\ &\lesssim\displaystyle\left(\frac{|u'-u"|}{|u-u'|}\right)^{n-2}\frac{|u"-u'|}{|N(u")-N(u)|} Q(S)^{2(n-2)} \sup_{p}\lambda_{n-1}(p). \end{aligned} \end{align*} $$

Hence by (4.5) and the fact that $\widetilde {J}(u,u')\geq 1$ (see Proposition 4.16),

(4.31)

$$ \begin{align} \displaystyle\frac{J(u,u')}{\widetilde{J}(u,u')}\lesssim Q(S)^{\frac{5(n-2)}{2}} \frac{|u"-u'|}{|N(u")-N(u)|} \sup_{p}\lambda_{n-1}(p). \end{align} $$

On the other hand, using the fact that $\widetilde {J}(u,u')\geq \Delta (u,u')$ , which also follows from Proposition 4.16,

$$ \begin{align*} \begin{aligned} \displaystyle \frac{J(u,u')}{\widetilde{J}(u,u')}\leq\displaystyle \frac{J(u,u')}{\Delta(u,u')}=J(u,u")&\lesssim\displaystyle\left(\frac{|u'-u"|}{|u-u"|}\right)^{n-2}\frac{|u"-u'|}{|N(u')-N(u)|} Q(S)^{2(n-2)} \sup_{p}\lambda_{n-1}(p)\\ &\lesssim\displaystyle Q(S)^{\frac{5(n-2)}{2}}\frac{|u"-u'|}{|N(u')-N(u)|} \sup_{p}\lambda_{n-1}(p), \end{aligned} \end{align*} $$

by the distance estimate (4.5) and by (4.9). Consequently,

(4.32)

$$ \begin{align} \begin{aligned} \displaystyle\frac{J(u,u')}{\widetilde{J}(u,u')}&\displaystyle\lesssim |u"-u'| Q(S)^{\frac{5(n-2)}{2}}\frac{1}{\max\{|N(u")-N(u)|,|N(u')-N(u)|\}}\sup_{p}\lambda_{n-1}(p) \\ &\displaystyle\lesssim |u"-u'| Q(S)^{\frac{5(n-2)}{2}}\frac{1}{|N(u")-N(u)|+|N(u')-N(u)|} \sup_{p}\lambda_{n-1}(p) \\ &\displaystyle\lesssim \frac{|u"-u'|}{|N(u")-N(u')|} Q(S)^{\frac{5(n-2)}{2}} \sup_{p}\lambda_{n-1}(p) \\ &\displaystyle\lesssim \frac{1}{\inf_{p}\lambda_{1}(p)} Q(S)^{\frac{5(n-2)}{2}} \sup_{p}\lambda_{n-1}(p) \\ &\displaystyle\lesssim Q(S)^{\frac{5n-8}{2}}, \end{aligned} \end{align} $$

by the mean-value inequality applied to the Gauss map N. This implies (4.20), completing the proof of Theorem 1.7 (Theorem 4.11).

4.3 Proof of the Sobolev–Mizohata–Takeuchi inequality (Theorem 1.8)

In this section we prove Theorem 1.8, or more specifically, Theorem 4.13. We begin by observing that if $u\not \in \mathrm {supp\,}^*(g)$ and $u'\in S$ , then either $u'\not \in \mathrm {supp\,}(g)$ or $R_u u'\not \in \mathrm {supp\,}(g)$ , meaning that ${I_{S,s}(|g|^2, |g|^2)(u)=0}$ . Consequently, by Theorem 4.11,

$$ \begin{align*}\int_{\mathbb{R}^n}|\widehat{g\mathrm{d}\sigma}|^2w\leq c Q(S)^{\frac{5n-8}{4}} \sup_{u\in\mathrm{supp\,}^*(g)}\|X_Sw(u,\cdot)\|_{\dot{H}^s(T_u S)}\int_S I_{S,s}(|g|^2,|g|^2)(u)^{1/2}\mathrm{d}\sigma(u), \end{align*} $$

and so we are reduced to proving a suitable $L^1(S)\times L^1(S)\rightarrow L^{1/2}(S)$ estimate on the bilinear operator

(4.33)

$$ \begin{align} I_{S,s}(g_1,g_2)(u):=\int_{S}\frac{g_1(u')g_2(R_u u')}{|u'-R_u u'|^{s}}J(u,u')\mathrm{d}\sigma(u') \end{align} $$

whenever $s<n-1$ . This follows by a direct application of the forthcoming Theorem 7.2.

4.4 Improved Sobolev–Stein constants in the plane

Our proof of Theorem 1.7 identifies $\|J/\widetilde {J}\|_\infty ^{1/2}$ as the naturally occurring dilation-invariant functional on the surface S, rather than the power of the curvature quotient $Q(S)$ that we use to bound it. In two dimensions our expression for J, being relatively simple, permits the bound $\|J/\widetilde {J}\|_\infty ^{1/2}\lesssim \Lambda (S)$ , where $\Lambda (S)$ is defined in (1.13). To see this we argue as in (4.32), using Propositions 4.5 and 4.16 to write

$$ \begin{align*} \begin{aligned} \frac{J(u,u')}{\widetilde{J}(u,u')}\leq \min\{J(u,u'),J(u,u")\}=|u'-u"|K(u)\min\left\{\frac{1}{|N(u)\wedge N(u")|}, \frac{1}{|N(u)\wedge N(u')|}\right\} \lesssim \Lambda(S). \end{aligned} \end{align*} $$

The two-dimensional case of Theorem 1.7 may then be strengthened to the following:

Theorem 4.20 (Improved Sobolev–Stein in the plane).

Suppose that $s<\frac {1}{2}$ . There is an absolute constant c such that

$$ \begin{align*}\int_{\mathbb{R}^2}|\widehat{g\mathrm{d}\sigma}(x)|^2w(x)\mathrm{d}x\leq c\Lambda(S)\int_S I_{S,2s}(|g|^2,|g|^2)(u)^{1/2}\|X_Sw(u,\cdot)\|_{\dot{H}^s(T_u S)}\mathrm{d}\sigma(u). \end{align*} $$

A similar, although potentially rather more complicated statement is possible in higher dimensions, and is left to the interested reader.

5 Estimating distances: the proof of Proposition 4.4

We begin with (4.5), and the elementary observation that if $\pi $ is 2-plane that is normal to S at a point u, then by (1.12), it must be close to normal at all points of intersection with S. More specifically, for $\widetilde {u}\in S$ we have

$$ \begin{align*}|P_\pi N(\widetilde{u})|\geq|P_{(T_uS)^\perp}N(\widetilde{u})|=N(u)\cdot N(\widetilde{u})\geq 1/2.\end{align*} $$

It follows by Meusnier’s theorem that for such a $\pi $ , the curvature of the curve $S\cap \pi $ at a point is comparable to a normal curvature of S at that same point. This allows us to transfer the curvature quotient of S to such curves, and we shall appeal to this momentarily.

Now let $\pi '$ and $\pi "$ be the normal 2-planes at the point u that pass through the points $u'$ and $u"$ respectively. Let x be the orthogonal projection of u onto the plane $T_uS+\{u'\}$ , and note that $\{u,u',x\}$ and $\{u,u",x\}$ are the vertices of right-angled triangles in the 2-planes $\pi '$ and $\pi "$ respectively. Next observe that by the triangle inequality and Pythagoras’ theorem, it is enough to show that

(5.1)

$$ \begin{align} |x-u"|\lesssim Q(S)^{1/2}|x-u'|. \end{align} $$

To see this we write S as a graph over $T_uS+\{u'\}$ as follows: let $\phi _u:T_uS+\{u'\}\rightarrow \mathbb {R}$ be such that $x'\mapsto x'+\phi _u(x')N(u)$ is a bijective map from a subset $U\subset T_uS$ into S; see Figure 1. That this is possible, and indeed that $\phi _u$ is uniquely defined, follows from (1.12) (a point that is elaborated in [Reference Bennett, Nakamura and Shiraki15]). Notice that

(5.2)

$$ \begin{align} \phi_u(u')=0, \phi_u(x)=|x-u|\;\;\text{ and }\;\;\nabla\phi_u(x)=0, \end{align} $$

by construction. Assuming that $N(u)=e_n$ , as we may, the graph condition (1.12) implies that the normal vector $(\nabla \phi _u, -1)$ lies in some fixed (proper) vertical cone, and so in particular we also have

(5.3)

$$ \begin{align} |\nabla\phi_u|\lesssim 1. \end{align} $$

We now apply Taylor’s theorem on the line segment $[x,u']$ , along with (5.2), to obtain

$$ \begin{align*}|x-u|=\phi_u(x)-\phi_u(u')=\frac{1}{2}k'(u,u')|x-u'|^2, \end{align*} $$

where $k'(u,u')$ is a quantity comparable to some normal curvature of S at some point. Here we have used (5.3) along with our initial observation via Meusnier’s theorem. By symmetry a similar statement may be made with $u"$ in place of $u'$ , from which we deduce that

$$ \begin{align*}k'(u,u')|x-u'|^2=k"(u,u")|x-u"|^2.\end{align*} $$

The inequality (5.1) now follows from the definition of $Q(S)$ and taking square roots.

Turning to (4.6), we fix u and exploit the properties of the map $H:=H_{\omega }= N^{-1}\circ \Phi _{\omega }$ from Section 6. By the mean value theorem and Claim 6.4,

$$ \begin{align*} \begin{aligned} \displaystyle |u-u'| {\kern-1pt}={\kern-1pt} |H(0)-H(x')| &\displaystyle{\kern-1pt}\leq{\kern-1pt} \sup_{\theta}\|\mathrm{d}H_{\theta}\|\cdot |x'| \displaystyle{\kern-1pt}\leq{\kern-1pt} \sup_{\theta}\|\mathrm{d}H_{\theta}\|\cdot \frac{|(1-\widetilde{\eta}(x'))x'|}{|1-\widetilde{\eta}(x')|} \displaystyle=\sup_{\theta}\|\mathrm{d}H_{\theta}\|\cdot \frac{|x'-x"|}{|1-\widetilde{\eta}(x')|}, \end{aligned} \end{align*} $$

where $x"$ is such that $H(x")=u"$ . Consequently,

$$ \begin{align*} \begin{aligned} \displaystyle |u-u'| &\displaystyle\leq\sup_{\theta}\|\mathrm{d}H_{\theta}\|\cdot \frac{|H^{-1}(H(x'))-H^{-1}(H(x"))|}{|1-\widetilde{\eta}(x')|} \\ &\displaystyle\leq\sup_{\theta}\|\mathrm{d}H_{\theta}\|\cdot \sup_{\widetilde{\theta}}\|\mathrm{d}H^{-1}_{\widetilde{\theta}}\|\cdot \frac{|H(x')-H(x")|}{|1-\widetilde{\eta}(x')|} \\ &\displaystyle=\sup_{\theta}\|\mathrm{d}H_{\theta}\|\cdot \sup_{\widetilde{\theta}}\|\mathrm{d}H^{-1}_{\widetilde{\theta}}\|\cdot \frac{|u'-u"|}{|1-\widetilde{\eta}(x')|}, \end{aligned} \end{align*} $$

and therefore

$$ \begin{align*} |u'-u"|\geq \frac{|1-\widetilde{\eta}(x')|}{\sup_{\theta}\|\mathrm{d}H_{\theta}\|\cdot \sup_{\widetilde{\theta}}\|\mathrm{d}H^{-1}_{\widetilde{\theta}}\|}\cdot |u-u'|. \end{align*} $$

We also have, for a fixed $\theta $ ,

$$ \begin{align*} \|\mathrm{d}H_{\theta}\|\leq \|\mathrm{d}N^{-1}_{\Phi(\theta)}\|\cdot \|\mathrm{d}\Phi_{\theta}\|\leq\frac{1}{\inf_{p\in S}\lambda_1(p)}\cdot \|\mathrm{d}\Phi_{\theta}\|_{L^{\infty}_{\theta}}, \end{align*} $$

where $\inf _{p\in S}\lambda _1(p)$ is the infimum over $p\in S$ of the smallest eigenvalue $\lambda _1(p)$ of the shape operator $\mathrm {d}N_{p}$ . Similarly,

$$ \begin{align*} \|\mathrm{d}H^{-1}_{\widetilde{\theta}}\|\leq \|\mathrm{d}\Phi^{-1}_{\widetilde{\theta}}\|\cdot \|\mathrm{d}N_{\Phi(\widetilde{\theta})}\|\leq\|\mathrm{d}\Phi^{-1}_{\widetilde{\theta}}\|_{L^{\infty}_{\widetilde{\theta}}}\cdot\sup_{p}\lambda_{n-1}(p), \end{align*} $$

where $\sup _{p\in S}\lambda _{n-1}(p)$ is the supremum over $p\in S$ of the largest eigenvalue $\lambda _{n-1}(p)$ . Consequently,

(5.4)

$$ \begin{align} |u'-u"|\gtrsim |1-\widetilde{\eta}(x')|\cdot \frac{\inf_{p\in S}\lambda_1(p)}{\sup_{p\in S}\lambda_{n-1}(p)}\cdot |u-u'|\gtrsim \frac{1}{Q(S)}\cdot |u-u'|, \end{align} $$

since $\widetilde {\eta }<0$ by the strict convexity of S.

6 Computing Jacobians: the proof of Proposition 4.5

In this section we provide detailed proofs of (4.7), (4.8) and (4.9). The key idea is that the maps $u\mapsto R_u u'$ and $u'\mapsto R_uu'$ may be transformed into outward vector fields on Euclidean spaces (specifically $T_{u'}S$ and $T_uS$ respectively) by conjugating them with a composition of the Gauss map and a suitable stereographic projection. The derivatives of such vector fields have only two eigenspaces, allowing the computation of their Jacobians to be reduced to the identification of just two eigenvalues, one of which has multiplicity $n-2$ (see the forthcoming Lemma 6.2). This is manifested in the factor raised to the power $n-2$ in the formula (4.7) for J. We begin by recalling and introducing the notation and geometric objects that will feature in our computations of J and $\Delta $ .

• $N:S\rightarrow \mathbb {S}^{n-1}$ is the Gauss map, $\mathrm {d}N_{u}:T_{u}S\rightarrow T_{N(u)}\mathbb {S}^{n-1}$ is the shape operator (recall that $T_{u}S= T_{N(u)}\mathbb {S}^{n-1}$ ), and $K(u)=\det (\mathrm {d}N_{u})$ is the Gaussian curvature at $u\in S$ .
• The formulas of this section will be written in terms of the parameters u, $u'$ and $u"=R_{u}u'$ , which are points on S. We will denote their images via the Gauss map by $\omega $ , $\omega ^{\prime }$ and $\omega ^{\prime \prime }$ , respectively.
• For a fixed $\omega ^{\prime }\in \mathbb {S}^{n-1}$ , $\Phi _{\omega ^{\prime }}:\langle \omega ^{\prime }\rangle ^{\perp }\rightarrow \mathbb {S}^{n-1}$ denotes the inverse of the stereographic projection map with respect to $-\omega ^{\prime }$ . Explicitly
(6.1) $$ \begin{align} \Phi_{\omega^{\prime}}(x)=\left(\frac{2x}{1+|x|^{2}},\frac{1-|x|^{2}}{1+|x|^{2}}\right) \end{align} $$
via the identification $\mathbb {R}^n=\langle \omega '\rangle ^\perp \times \langle \omega '\rangle $ . If $\omega =\Phi _{\omega ^{\prime }}(x)$ , it follows that
(6.2) $$ \begin{align} x=\frac{\omega-\langle\omega,\omega^{\prime}\rangle\omega^{\prime}}{1+\langle\omega,\omega^{\prime}\rangle}. \end{align} $$
The differential $(\mathrm {d}\Phi _{\omega ^{\prime }})_{x}:\langle \omega ^{\prime }\rangle ^{\perp }\rightarrow \langle \omega \rangle ^{\perp }$ satisfies
(6.3) $$ \begin{align} (\mathrm{d}\Phi_{\omega^{\prime}})_{x}(x)=\langle\omega,\omega^{\prime}\rangle\omega-\omega^{\prime}. \end{align} $$
The determinants of $(\mathrm {d}\Phi _{\omega ^{\prime }})_{x}$ and its inverse are, respectively,
(6.4) $$ \begin{align} \det((\mathrm{d}\Phi_{\omega^{\prime}})_{x})=\left(\frac{2}{1+|x|^{2}}\right)^{n-1}=(1+\langle\omega,\omega^{\prime}\rangle)^{n-1} \end{align} $$
and
(6.5) $$ \begin{align} \det((\mathrm{d}\Phi_{\omega^{\prime}}^{-1})_{\omega})=\left(\frac{1}{1+\langle\omega,\omega^{\prime}\rangle}\right)^{n-1}. \end{align} $$
We refer the reader to Chapter 4 of [Reference Lieb and Loss39] for further discussion on the properties of these maps.
• For $\omega $ fixed, set
$$ \begin{align*} H_{\omega}=N^{-1}\circ\Phi_{\omega}. \end{align*} $$
$H_{\omega }$ will play a crucial role in this section. As we shall see, it allows us to reduce the computations of J and $\Delta $ to certain Euclidean analogues with simple spectral structure (outward vector fields, as discussed above and alluded to in Remark 4.3).

We are now ready to prove (4.7), (4.8) and (4.9).

6.1 Computing J

For fixed $\omega '$ we define the map $\Psi _{\omega ^{\prime }}:N(S)\rightarrow \mathbb {S}^{n-1}$ by

(6.6)

$$ \begin{align} \Psi_{\omega^{\prime}}(\omega)=N(R_{N^{-1}(\omega)}N^{-1}(\omega^{\prime})). \end{align} $$

Strictly speaking the domain of $\Psi _{\omega ^{\prime }}$ depends on $\omega '$ , as we allude to in Remark 4.1. The parameter $\omega \in \mathbb {S}^{n-1}$ will be a variable and we will use $x\in \langle \omega ^{\prime }\rangle ^{\perp }$ to represent its preimage by the map $\Phi _{\omega ^{\prime }}$ . Explicitly,

By (6.6) and the definition of $J(u,u')$ , along with the fact that the Gaussian curvature $K(u)$ is the determinant of the shape operator $\mathrm {d}N_u$ , we have

(6.7)

$$ \begin{align} J(u,u')=\left|\det{\left(\mathrm{d}\Psi_{N(u')}(N(u))\right)}\right|\frac{K(u)}{K(u")}. \end{align} $$

The next step is to reduce the computation of the Jacobian determinant $\det {\left (\mathrm {d}\Psi _{N(u')}(N(u))\right )}$ to one of a much simpler outward vector field $\varphi $ on the tangent space at $u'$ (see Lemma 6.2 below). This will be achieved by combining properties of the inverse stereographic projection map $\Phi _{\omega ^{\prime }}$ with the geometric condition (4.3). To this end we define the map $\varphi :\langle \omega ^{\prime }\rangle ^{\perp }\rightarrow \langle \omega ^{\prime }\rangle ^{\perp }$ by

$$ \begin{align*}\varphi(x):=\Phi_{\omega^{\prime}}^{-1}\circ\Psi_{\omega^{\prime}}\circ\Phi_{\omega^{\prime}}(x).\end{align*} $$

Claim 6.1. The vector field $\varphi :\langle \omega '\rangle ^\perp \rightarrow \langle \omega '\rangle ^\perp $ is given by

(6.8)

$$ \begin{align} \varphi(x) = \eta(x)x, \end{align} $$

where

(6.9)

$$ \begin{align} \eta(x) = \frac{\langle x,H_{\omega^{\prime}}^{-1}(R_{H_{\omega^{\prime}}(x)}H_{\omega^{\prime}}(0))\rangle}{|x|^{2}}=\frac{\langle x,\Phi_{\omega^{\prime}}^{-1}(\omega^{\prime\prime})\rangle}{|x|^{2}}. \end{align} $$

Proof of Claim 6.1.

By definition of the map $R_{(\cdot )}u'$ , the normals $\omega $ , $\omega ^{\prime }$ and $\omega ^{\prime \prime }$ are coplanar; therefore, they lie on a great circle. This implies that

$$ \begin{align*}\varphi(x)=\mu(x)x\end{align*} $$

for some $\mu (x)$ , which we conclude to be equal to $\eta (x)$ by taking scalar products with x on both sides of the equation above.

By the chain rule,

$$ \begin{align*}\det(\mathrm{d}\varphi(x))=\det((\mathrm{d}\Phi_{\omega^{\prime}}^{-1})(\omega^{\prime\prime}))\det((\mathrm{d}\Psi_{\omega^{\prime}})(\omega))\det((\mathrm{d}\Phi_{\omega^{\prime}})(x)),\end{align*} $$

hence

$$ \begin{align*} \det((\mathrm{d}\Psi_{\omega^{\prime}})(\omega))=\frac{\det(\mathrm{d}\varphi(x))}{\det((\mathrm{d}\Phi_{\omega^{\prime}}^{-1})(\omega^{\prime\prime}))\det((\mathrm{d}\Phi_{\omega^{\prime}})(x))}. \end{align*} $$

This implies, by (6.7),

(6.10)

$$ \begin{align} J(u,u')=\frac{|\det(\mathrm{d}\varphi(x))|}{|\det((\mathrm{d}\Phi_{\omega^{\prime}}^{-1})(\omega^{\prime\prime}))||\det((\mathrm{d}\Phi_{\omega^{\prime}})(x))|}\frac{K(u)}{K(u")}. \end{align} $$

We are now in a position to invoke the following elementary lemma, whose proof is left to the reader:

Lemma 6.2 (Differential structure of an outward vector field).

Let $\eta :\mathbb {R}^{d}\rightarrow \mathbb {R}$ be a $C^{1}$ function and let $\varphi :\mathbb {R}^{d}\rightarrow \mathbb {R}^{d}$ be given by

(6.11)

$$ \begin{align} \varphi(x)=\eta(x)x. \end{align} $$

The linear map

$$ \begin{align*}\mathrm{d}\varphi(x)=x(\nabla\eta(x))^{\top}+\eta(x) I_{d}\end{align*} $$

has eigenvalues $\lambda _{1}(x)=\eta (x)$ and $\lambda _{2}(x)=\langle \nabla \eta (x),x\rangle + \eta (x)$ of multiplicity $(d-1)$ and $1$ , respectively. The eigenspaces associated to these eigenvalues are

$$ \begin{align*} \begin{aligned} \displaystyle E_{\lambda_{1}(x)}&\displaystyle :=\langle\nabla\eta(x)\rangle^{\perp}, \\ \displaystyle E_{\lambda_{2}(x)}&\displaystyle :=\langle x\rangle. \end{aligned} \end{align*} $$

In particular,

(6.12)

$$ \begin{align} \det(\mathrm{d}\varphi(x))=[\eta(x)]^{d-1}(\langle\nabla\eta(x),x\rangle + \eta(x)). \end{align} $$

The parameter $u'\in S$ is fixed in this subsection; therefore, $\omega ^{\prime }$ will also be fixed, and we write $H_{\omega ^{\prime }}=H$ to simplify notation. Let us use (6.12) to compute $\det (\mathrm {d}\varphi (x))$ . The eigenvalue $\lambda _{1}(x)$ of $\mathrm {d}\varphi (x)$ is

$$ \begin{align*} \lambda_{1}(x)=\eta(x)=\frac{\langle x,H^{-1}(R_{H(x)}H(0))\rangle}{|x|^{2}}, \end{align*} $$

hence, by (6.12), all there is left to do is to compute the eigenvalue $\lambda _{2}(x)$ of $\mathrm {d}\varphi (x)$ . By definition of the map $R_{(\cdot )}u'$ , the vector $R_{u}u'-u'$ is in the tangent space of S at u. In short,

$$ \begin{align*} \langle R_{u}u'-u', N(u)\rangle = 0. \end{align*} $$

Equivalently,

(6.13)

$$ \begin{align} \langle H(\eta(x)x)-H(0), N(H(x))\rangle = 0. \end{align} $$

Differentiating both sides of (6.13) with respect to x,

$$ \begin{align*} \displaystyle 0 = \mathrm{d}(N\circ H)_{x}^{\top}\left(H(\eta(x)x)-H(0)\right) + \left(x\cdot\nabla\eta(x)^{\top}+\eta(x)I_{n-1}\right)^{\top}\mathrm{d}H_{\eta(x) x}^{\top}\left(N\circ H(x)\right). \end{align*} $$

Taking scalar products on both sides with x and using that $N\circ H=\Phi _{\omega '}$ , we have

$$ \begin{align*} \displaystyle 0 = \langle H(\eta(x)x)-H(0),(\mathrm{d}\Phi_{\omega^{\prime}})_{x}(x)\rangle + \langle\mathrm{d}H_{\eta(x) x}^{\top}\left(\Phi_{\omega^{\prime}}(x)\right),\left(x\cdot\nabla\eta(x)^{\top}+\eta(x)I_{n-1}\right)(x)\rangle. \end{align*} $$

By Lemma 6.2,

$$ \begin{align*}\left(x\cdot\nabla\eta(x)^{\top}+\eta(x)I_{n-1}\right)(x)=(\langle\nabla\eta(x),x\rangle+\eta(x)) x,\end{align*} $$

and hence

$$ \begin{align*} \lambda_2(x)=\langle\nabla\eta(x),x\rangle+\eta(x) = -\frac{\langle H(\eta(x)x)-H(0),(\mathrm{d}\Phi_{\omega^{\prime}})_{x}(x)\rangle}{\langle\mathrm{d}H_{\eta(x) x}^{\top}\left(\Phi_{\omega^{\prime}}(x)\right),x\rangle}. \end{align*} $$

By Lemma 6.2 again,

$$ \begin{align*} \det(\mathrm{d}\varphi(x))=-[\eta(x)]^{n-2}\frac{\langle H(\eta(x)x)-H(0),(\mathrm{d}\Phi_{\omega^{\prime}})_{x}(x)\rangle}{\langle\mathrm{d}H_{\eta(x) x}^{\top}\left(\Phi_{\omega^{\prime}}(x)\right),x\rangle}. \end{align*} $$

By (6.10),

(6.14)

$$ \begin{align} J(u,u')=|\eta(x)|^{n-2}\frac{1}{|\langle\mathrm{d}H_{\eta(x) x}^{\top}\left(\Phi_{\omega^{\prime}}(x)\right),x\rangle|}\frac{|\langle H(\eta(x)x)-H(0),(\mathrm{d}\Phi_{\omega^{\prime}})_{x}(x)\rangle|}{|\det((\mathrm{d}\Phi_{\omega^{\prime}}^{-1})(\omega^{\prime\prime}))||\det((\mathrm{d}\Phi_{\omega^{\prime}})(x))|}\frac{K(u)}{K(u")}. \end{align} $$

To proceed, we need to understand each factor in the formula above, which is the content of the next claim.

Claim 6.3. The following identities hold:

(6.15)

$$ \begin{align} |\eta(x)|=\left|\frac{\langle\omega,\omega^{\prime\prime}\rangle-\langle\omega,\omega^{\prime}\rangle\langle\omega^{\prime},\omega^{\prime\prime}\rangle}{(1+\langle\omega^{\prime\prime},\omega^{\prime}\rangle)(1-\langle\omega,\omega^{\prime}\rangle)}\right|; \end{align} $$

(6.16)

$$ \begin{align} \langle H(\eta(x)x)-H(0),(\mathrm{d}\Phi_{\omega^{\prime}})_{x}(x)\rangle = -\langle u"-u',\omega'\rangle; \end{align} $$

(6.17)

$$ \begin{align} \langle\mathrm{d}H_{\eta(x) x}^{\top}\left(\Phi_{\omega^{\prime}}(x)\right),x\rangle = \frac{1}{\eta(x)}\langle\omega,\mathrm{d}N^{-1}_{\omega^{\prime\prime}}(\langle\omega^{\prime\prime},\omega^{\prime}\rangle\omega^{\prime\prime}-\omega^{\prime})\rangle. \end{align} $$

Let us assume Claim 6.3 for the moment and complete the proof of the proposition. By the claim, (6.4) and (6.5),

(6.18)

$$ \begin{align} \displaystyle J(u,u')&=\displaystyle\left|\frac{\langle\omega,\omega^{\prime\prime}\rangle-\langle\omega,\omega^{\prime}\rangle\langle\omega^{\prime},\omega^{\prime\prime}\rangle}{(1+\langle\omega^{\prime\prime},\omega^{\prime}\rangle)(1-\langle\omega,\omega^{\prime}\rangle)}\right|{}^{n-1}\frac{|\langle u"-u',\omega'\rangle|}{|\langle\omega,\mathrm{d}N^{-1}_{\omega^{\prime\prime}}(\langle\omega^{\prime\prime},\omega^{\prime}\rangle\omega^{\prime\prime}-\omega^{\prime})\rangle|}\frac{|1+\langle\omega^{\prime\prime},\omega^{\prime}\rangle|^{n-1}}{|1+\langle\omega,\omega^{\prime}\rangle|^{n-1}}\frac{K(u)}{K(u")} \nonumber\\&=\displaystyle\left|\frac{\langle\omega,\omega^{\prime\prime}\rangle-\langle\omega,\omega^{\prime}\rangle\langle\omega^{\prime},\omega^{\prime\prime}\rangle}{1-\langle\omega,\omega^{\prime}\rangle^{2}}\right|{}^{n-1}\frac{|\langle u"-u',\omega'\rangle|}{|\langle\omega,\mathrm{d}N^{-1}_{\omega^{\prime\prime}}(\langle\omega^{\prime\prime},\omega^{\prime}\rangle\omega^{\prime\prime}-\omega^{\prime})\rangle|}\frac{K(u)}{K(u")}. \nonumber\\&=\displaystyle\left(\frac{(1-\langle\omega',\omega"\rangle^{2})^{\frac{1}{2}}}{(1-\langle\omega,\omega^{\prime}\rangle^{2})^{\frac{1}{2}}}\right)^{n-1}\frac{|\langle u"-u',\omega'\rangle|}{|\langle P_{T_{u"}S}N(u),(\mathrm{d}N_{u"})^{-1}(\langle\omega^{\prime\prime},\omega^{\prime}\rangle\omega^{\prime\prime}-\omega^{\prime})\rangle|}\frac{K(u)}{K(u")}, \end{align} $$

where we used the facts that $\langle \omega ,v\rangle =\langle P_{T_{u"}S}N(u),v\rangle $ for every $v\in T_{u"}S$ , and that three coplanar vectors $\omega ,\omega '$ and $\omega "$ on the sphere satisfy

$$ \begin{align*} \langle\omega,\omega^{\prime\prime}\rangle-\langle\omega,\omega^{\prime}\rangle\langle\omega^{\prime},\omega^{\prime\prime}\rangle = (1-\langle\omega',\omega"\rangle^{2})^{\frac{1}{2}}(1-\langle\omega,\omega'\rangle^{2})^{\frac{1}{2}}. \end{align*} $$

We exploit the coplanarity of $\omega ,\omega '$ and $\omega "$ twice more. First, it implies the existence of $a,b\in \mathbb {R}$ such that

(6.19)

$$ \begin{align} \omega"=a\omega+b\omega'. \end{align} $$

Consequently,

$$ \begin{align*} \frac{|\langle u"-u',\omega^{\prime\prime}\rangle|}{|\langle u"-u',\omega^{\prime}\rangle|}=\frac{|\langle u"-u',a\omega+b\omega'\rangle|}{|\langle u"-u',\omega^{\prime}\rangle|}=|b|, \end{align*} $$

since $u"-u'$ is perpendicular to $N(u)=\omega $ . On the other hand, projecting both sides of (6.19) to $\langle \omega \rangle ^{\perp }$ gives

$$ \begin{align*} P_{\langle\omega\rangle^{\perp}}\omega" = b P_{\langle\omega\rangle^{\perp}}\omega'\Longrightarrow |b|=\frac{|P_{\langle\omega\rangle^{\perp}}\omega"|}{|P_{\langle\omega\rangle^{\perp}}\omega'|}, \end{align*} $$

which in turn implies

(6.20)

$$ \begin{align} \frac{|\langle u"-u',\omega^{\prime\prime}\rangle|}{|\langle u"-u',\omega^{\prime}\rangle|}=\frac{|P_{\langle\omega\rangle^{\perp}}\omega"|}{|P_{\langle\omega\rangle^{\perp}}\omega'|} \Longrightarrow |\langle u"-u',\omega^{\prime}\rangle|=\frac{(1-\langle\omega,\omega^{\prime}\rangle^{2})^{\frac{1}{2}}}{(1-\langle\omega,\omega"\rangle^{2})^{\frac{1}{2}}}|\langle u"-u',N(u")\rangle|. \end{align} $$

Second, the fact that $\omega ,\omega '$ and $\omega "$ are coplanar also gives us that $P_{\langle \omega "\rangle ^{\perp }}\omega '$ and $P_{\langle \omega "\rangle ^{\perp }}\omega $ are parallel, therefore

(6.21)

$$ \begin{align} \langle\omega^{\prime\prime},\omega^{\prime}\rangle\omega^{\prime\prime}-\omega^{\prime}=P_{\langle\omega"\rangle^{\perp}}\omega'=\frac{|P_{\langle\omega"\rangle^{\perp}}\omega'|}{|P_{\langle\omega"\rangle^{\perp}}\omega|}P_{\langle\omega"\rangle^{\perp}}\omega =\frac{(1-\langle\omega',\omega"\rangle^{2})^{\frac{1}{2}}}{(1-\langle\omega,\omega"\rangle^{2})^{\frac{1}{2}}}P_{T_{u"}S}N(u). \end{align} $$

Likewise, or by symmetry,

(6.22)

$$ \begin{align} \langle\omega^{\prime\prime},\omega^{\prime}\rangle\omega^{\prime}-\omega^{\prime\prime}=P_{\langle\omega'\rangle^{\perp}}\omega"=\frac{|P_{\langle\omega'\rangle^{\perp}}\omega"|}{|P_{\langle\omega'\rangle^{\perp}}\omega|}P_{\langle\omega'\rangle^{\perp}}\omega=\frac{(1-\langle\omega',\omega"\rangle^{2})^{\frac{1}{2}}}{(1-\langle\omega,\omega'\rangle^{2})^{\frac{1}{2}}}P_{T_{u'}S}N(u). \end{align} $$

Using (6.20) and (6.21) in (6.18) gives (4.7). We now move to the final part of the argument.

Proof of Claim 6.3.

By (6.9) and (6.2),

$$ \begin{align*} \begin{aligned} \displaystyle|\eta(x)|&\displaystyle=\left|\left\langle \frac{\omega-\langle\omega,\omega^{\prime}\rangle\omega^{\prime}}{1+\langle\omega,\omega^{\prime}\rangle},\frac{\omega^{\prime\prime}-\langle\omega^{\prime\prime},\omega^{\prime}\rangle\omega^{\prime}}{1+\langle\omega^{\prime\prime},\omega^{\prime}\rangle}\right\rangle\right|\frac{|1+\langle\omega,\omega^{\prime}\rangle|^{2}}{|\omega-\langle\omega,\omega^{\prime}\rangle\omega^{\prime}|^{2}} \\ &\displaystyle= \left|\frac{\langle\omega,\omega^{\prime\prime}\rangle-\langle\omega,\omega^{\prime}\rangle\langle\omega^{\prime},\omega^{\prime\prime}\rangle}{(1+\langle\omega^{\prime\prime},\omega^{\prime}\rangle)(1-\langle\omega,\omega^{\prime}\rangle)}\right|, \end{aligned} \end{align*} $$

which verifies (6.15). To establish (6.16), we simply observe that $H(\eta (x)x)-H(0)=u"-u'$ , and this together with (6.3) implies that

$$ \begin{align*} \langle H(\eta(x)x)-H(0),(\mathrm{d}\Phi_{\omega^{\prime}})_{x}(x)\rangle = \langle u"-u',\langle\omega,\omega^{\prime}\rangle\omega-\omega^{\prime}\rangle = -\langle u"-u',\omega'\rangle, \end{align*} $$

since $u"-u'$ is perpendicular to $\omega $ by definition of $u"$ . Finally, notice that $\Phi _{\omega ^{\prime }}(\eta (x)x)=\omega ^{\prime \prime }$ and that a direct computation gives

(6.23)

$$ \begin{align} (\mathrm{d}\Phi_{\omega^{\prime}})_{\eta(x)x}(x)=\frac{1}{\eta(x)}\left(\langle\omega^{\prime\prime},\omega^{\prime}\rangle\omega^{\prime\prime}-\omega^{\prime}\right). \end{align} $$

Therefore by definition of H, the chain rule and (6.23), we have

$$ \begin{align*} \begin{aligned} \displaystyle \langle\mathrm{d}H_{\eta(x) x}^{\top}\left(\Phi_{\omega^{\prime}}(x)\right),x\rangle &\displaystyle = \langle\omega,\mathrm{d}H_{\eta(x) x}(x)\rangle \\ &\displaystyle= \langle\omega,\mathrm{d}N^{-1}_{\Phi_{\omega^{\prime}}(\eta(x)x)}\circ(\mathrm{d}\Phi_{\omega^{\prime}})_{\eta(x)x}(x)\rangle\\ &\displaystyle= \frac{1}{\eta(x)}\langle\omega,\mathrm{d}N^{-1}_{\omega^{\prime\prime}}(\langle\omega^{\prime\prime},\omega^{\prime}\rangle\omega^{\prime\prime}-\omega^{\prime})\rangle, \end{aligned} \end{align*} $$

which concludes the proof of Claim 6.3.

6.2 Computing $\Delta $

Arguing as in Section 6.1, for fixed $\omega $ we define the map $\widetilde {\Psi }_{\omega }:N(S)\rightarrow \mathbb {S}^{n-1}$ by

(6.24)

$$ \begin{align} \widetilde{\Psi}_{\omega}(\omega')=N(R_{N^{-1}(\omega)}N^{-1}(\omega^{\prime})). \end{align} $$

Recalling from Section 4 that $\Delta (u,u")$ is the Jacobian of the change of variables $u'=R_u u"$ , it follows that

(6.25)

$$ \begin{align} \Delta(u,u')=\left|\det{\left(\mathrm{d}\widetilde{\Psi}_{N(u)}(N(u'))\right)}\right|\frac{K(u')}{K(u")}. \end{align} $$

Recall that $\omega ^{\prime }\in \mathbb {S}^{n-1}$ is a variable now. We will use $x'\in \langle \omega \rangle ^{\perp }$ to represent its preimage by the map $\Phi _{\omega }$ :

Once more we reduce the computation of $\det {\left (\mathrm {d}\widetilde {\Psi }_{N(u)}(N(u'))\right )}$ to an application of Lemma 6.2. Define $\widetilde {\varphi }:\langle \omega \rangle ^{\perp }\rightarrow \langle \omega \rangle ^{\perp }$ by

$$ \begin{align*}\widetilde{\varphi}(x'):=\Phi_{\omega}^{-1}\circ\widetilde{\Psi}_{\omega}\circ\Phi_{\omega}(x').\end{align*} $$

Claim 6.4. $\widetilde {\varphi }$ is given by

(6.26)

$$ \begin{align} \widetilde{\varphi}(x') = \widetilde{\eta}(x')x', \end{align} $$

where

(6.27)

$$ \begin{align} \widetilde{\eta}(x') = \frac{\langle x',H_{\omega}^{-1}(R_{H_{\omega}(0)}H_{\omega}(x'))\rangle}{|x'|^{2}}=\frac{\langle x',\Phi_{\omega}^{-1}(\omega^{\prime\prime})\rangle}{|x'|^{2}}. \end{align} $$

The proof of Claim 6.4 is similar to the one of Claim 6.1. By the chain rule,

$$ \begin{align*}\det(\mathrm{d}\widetilde{\varphi}(x'))=\det((\mathrm{d}\Phi_{\omega}^{-1})(\omega^{\prime\prime}))\det((\mathrm{d}\Psi_{\omega})(\omega^{\prime}))\det((\mathrm{d}\Phi_{\omega})(x')),\end{align*} $$

hence

$$ \begin{align*} \det((\mathrm{d}\widetilde{\Psi}_{\omega})(\omega^{\prime}))=\frac{\det(\mathrm{d}\widetilde{\varphi}(x'))}{\det((\mathrm{d}\Phi_{\omega}^{-1})(\omega^{\prime\prime}))\det((\mathrm{d}\Phi_{\omega})(x'))}. \end{align*} $$

This implies, by (6.25), that

(6.28)

$$ \begin{align} \Delta(u,u')=\frac{|\det(\mathrm{d}\widetilde{\varphi}(x'))|}{|\det((\mathrm{d}\Phi_{\omega}^{-1})(\omega^{\prime\prime}))||\det((\mathrm{d}\Phi_{\omega})(x'))|}\frac{K(u')}{K(u")}. \end{align} $$

The parameter $u\in S$ is fixed in this subsection (and therefore so is $\omega \in \mathbb {S}^{n-1}$ ), so we lighten notation by writing $H_{\omega }=H$ . We may again compute $\det (\mathrm {d}\widetilde {\varphi }(x'))$ using Lemma 6.2. The eigenvalue $\widetilde {\lambda }_{1}(x')$ of $\mathrm {d}\widetilde {\varphi }(x')$ is

$$ \begin{align*} \widetilde{\lambda}_{1}(x')=\widetilde{\eta}(x')=\frac{\langle x',H^{-1}(R_{H(0)}H(x'))\rangle}{|x'|^{2}}, \end{align*} $$

hence we just have to compute the eigenvalue $\widetilde {\lambda }_{2}(x')$ of $\mathrm {d}\widetilde {\varphi }(x')$ and use (6.12). Recall that

$$ \begin{align*} \langle R_{u}u'-u', N(u)\rangle = 0. \end{align*} $$

Equivalently,

(6.29)

$$ \begin{align} \langle H(\widetilde{\eta}(x')x')-H(x'), N(H(0))\rangle = 0. \end{align} $$

Differentiating both sides of (6.29) with respect to $x'$ and taking scalar products with $x'$ as well gives

$$ \begin{align*} \langle (\mathrm{d}H_{\widetilde{\varphi}(x')}\circ\mathrm{d}\varphi_{x'})^{\top}(\Phi_{\omega}(0)),x'\rangle = \langle\omega,\mathrm{d}H_{x'}(x')\rangle, \end{align*} $$

which in turn implies that

$$ \begin{align*} \langle (\mathrm{d}H_{\widetilde{\varphi}(x')})^{\top}(\omega),(\mathrm{d}\widetilde{\varphi}_{x'})(x')\rangle =\langle(\mathrm{d}H_{x'})^{\top}\omega,(x')\rangle= \langle\omega,\mathrm{d}H_{x'}(x')\rangle. \end{align*} $$

Using the fact that $x'$ is an eigenvector of $\mathrm {d}\widetilde {\varphi }(x')$ with eigenvalue $\widetilde {\lambda }_{2}(x')$ and that $H=N^{-1}\circ \Phi _{\omega }$ yields

$$ \begin{align*} \begin{aligned} \displaystyle\widetilde{\lambda}_{2}(x')&\displaystyle =\frac{\langle\omega,\mathrm{d}N^{-1}_{\omega'}\circ(\mathrm{d}\Phi_{\omega})_{x'}(x')\rangle}{\langle\omega,\mathrm{d}N^{-1}_{\omega"}\circ(\mathrm{d}\Phi_{\omega})_{\widetilde{\eta}(x')x'}(x')\rangle} =\displaystyle \widetilde{\eta}(x')\frac{\langle\omega,\mathrm{d}N^{-1}_{\omega'}(\langle\omega,\omega'\rangle\omega'-\omega)\rangle}{\langle\omega,\mathrm{d}N^{-1}_{\omega"}(\langle\omega",\omega\rangle\omega"-\omega)\rangle}. \end{aligned} \end{align*} $$

By Lemma 6.2 once more,

$$ \begin{align*} \det(\mathrm{d}\widetilde{\varphi}(x'))=[\widetilde{\eta}(x')]^{n-1}\frac{\langle\omega,\mathrm{d}N^{-1}_{\omega'}(\langle\omega,\omega'\rangle\omega'-\omega)\rangle}{\langle\omega,\mathrm{d}N^{-1}_{\omega"}(\langle\omega",\omega\rangle\omega"-\omega)\rangle}. \end{align*} $$

By (6.28),

$$ \begin{align*} \Delta(u,u')=\frac{1}{|\det((\mathrm{d}\Phi_{\omega}^{-1})(\omega^{\prime\prime}))||\det((\mathrm{d}\Phi_{\omega})(x'))|}|\widetilde{\eta}(x')|^{n-1}\frac{|\langle\omega,\mathrm{d}N^{-1}_{\omega'}(\langle\omega,\omega'\rangle\omega'-\omega)\rangle|}{|\langle\omega,\mathrm{d}N^{-1}_{\omega"}(\langle\omega",\omega\rangle\omega"-\omega)\rangle|}\frac{K(u')}{K(u")}. \end{align*} $$

By (6.27),

(6.30)

$$ \begin{align} \begin{aligned} \displaystyle|\widetilde{\eta}(x')|&\displaystyle=\left|\left\langle \frac{\omega'-\langle\omega,\omega^{\prime}\rangle\omega}{1+\langle\omega,\omega^{\prime}\rangle},\frac{\omega^{\prime\prime}-\langle\omega^{\prime\prime},\omega\rangle\omega}{1+\langle\omega^{\prime\prime},\omega\rangle}\right\rangle\right|\frac{|1+\langle\omega,\omega^{\prime}\rangle|^{2}}{|\omega'-\langle\omega,\omega^{\prime}\rangle\omega|^{2}} \\ &\displaystyle= \left|\frac{\langle\omega',\omega^{\prime\prime}\rangle-\langle\omega,\omega^{\prime}\rangle\langle\omega,\omega^{\prime\prime}\rangle}{(1+\langle\omega^{\prime\prime},\omega\rangle)(1-\langle\omega,\omega^{\prime}\rangle)}\right|. \end{aligned} \end{align} $$

By (6.4), (6.5), and (6.30),

$$ \begin{align*} \begin{aligned} \displaystyle\Delta(u,u')&=\displaystyle \left|\frac{\langle\omega',\omega^{\prime\prime}\rangle-\langle\omega,\omega^{\prime}\rangle\langle\omega,\omega^{\prime\prime}\rangle}{(1+\langle\omega^{\prime\prime},\omega\rangle)\cdot(1-\langle\omega,\omega^{\prime}\rangle)}\right|{}^{n-1}\left|\frac{1+\langle\omega",\omega\rangle}{1+\langle\omega',\omega\rangle}\right|{}^{n-1}\frac{|\langle\omega,\mathrm{d}N^{-1}_{\omega'}(\langle\omega,\omega'\rangle\omega'-\omega)\rangle|}{|\langle\omega,\mathrm{d}N^{-1}_{\omega"}(\langle\omega",\omega\rangle\omega"-\omega)\rangle|}\frac{K(u')}{K(u")} \\ &\displaystyle= \left(\frac{|N(u)\wedge N(u")|}{|N(u)\wedge N(u')|}\right)^{n-1}\frac{|\langle P_{T_{u'}S}N(u),(\mathrm{d}N_{u'})^{-1}(P_{T_{u'}S}N(u))\rangle|}{|\langle P_{T_{u"}S}N(u),(\mathrm{d}N_{u"})^{-1}(P_{T_{u"}S}N(u))\rangle|}\frac{K(u')}{K(u")}, \end{aligned} \end{align*} $$

by (6.21), (6.22) and by similar geometric considerations to those in Section 6.1. This establishes (4.8).

6.3 Relating J and $\Delta $

Here we establish (4.9), the ‘switching property’ of $\Delta $ . By (4.7) we have

$$ \begin{align*} \begin{aligned} \displaystyle\frac{J(u,u")}{J(u,u')} &\displaystyle=\left(\frac{|N(u)\wedge N(u')|}{|N(u)\wedge N(u")|}\right)^{n-2}\left|\frac{\langle P_{T_{u"}S}N(u),(\mathrm{d}N_{u"})^{-1}(P_{T_{u"}S}N(u))\rangle}{\langle P_{T_{u'}S}N(u),(\mathrm{d}N_{u'})^{-1}(P_{T_{u'}S}N(u))\rangle}\right|\frac{|\langle u"-u',N(u')\rangle|}{|\langle u"-u',N(u")\rangle|}\frac{K(u")}{K(u')}. \end{aligned} \end{align*} $$

Using the coplanarity condition (4.3), an elementary argument similar to that leading to (6.20) reveals that

$$ \begin{align*} \frac{|\langle u"-u',N(u')\rangle|}{|\langle u"-u',N(u")\rangle|}=\frac{|P_{\langle\omega\rangle^{\perp}}N(u')|}{|P_{\langle\omega\rangle^{\perp}}N(u")|} =\frac{(1-\langle\omega,\omega^{\prime}\rangle^{2})^{\frac{1}{2}}}{(1-\langle\omega,\omega"\rangle^{2})^{\frac{1}{2}}}=\frac{|N(u)\wedge N(u')|}{|N(u)\wedge N(u")|}, \end{align*} $$

from which (4.9) follows.

7 Surface-carried fractional integrals

In this section we establish Lebesgue space bounds on the bilinear fractional integrals

$$ \begin{align*}I_{S,s} (g_1,g_2)(u) := \int_S \frac{g_1(u') g_2( R_uu' )}{ |u' - R_uu'|^{s} } J(u,u') \mathrm{d}\sigma(u') \end{align*} $$

arising in Section 4.

Remark 7.1 (Relation to classical fractional integral operators).

This is a surface-carried variant of the bilinear fractional integral operator

$$ \begin{align*}I_s(f_1,f_2)(x):=\int_{\mathbb{R}^d}\frac{f_1\left(x+\frac{y}{2}\right)f_2\left(x-\frac{y}{2}\right)}{|y|^s}dy \end{align*} $$

that naturally arises when S is the paraboloid (see Section 2), and has been studied by several authors; we refer to [Reference Grafakos30] and [Reference Kenig and Stein33].

As indicated in Section 4, the presence of the factor J in the kernel implies that this operator is symmetric – that is, $I_{S,s}(g_1,g_2)=I_{S,s}(g_2,g_1)$ . It is also natural for geometric reasons, allowing for bounds that are independent of any lower bounds on the curvature of S. For example, we have

(7.1)

$$ \begin{align} \begin{aligned} \|I_{S,s}(f_1,f_2)\|_1&=\int_S\int_S \frac{f_1(u')f_2(R_u u')}{|u'-R_u u'|^s}J(u,u')\mathrm{d}\sigma(u)\mathrm{d}\sigma(u')\\&=\int_S\int_S\frac{f_1(u')f_2(u")}{|u'-u"|^s}\mathrm{d}\sigma(u')\mathrm{d}\sigma(u")\\ &\leq C_s\|f_1\|_2\|f_2\|_2, \end{aligned} \end{align} $$

where

$$ \begin{align*}C_s:=\sup_{u\in S}\int_S\frac{\mathrm{d}\sigma(u')}{|u-u'|^s}.\end{align*} $$

Evidently $C_s$ does not depend on any lower bound on the curvature of S. More generally we have the following:

Theorem 7.2. Let $0<s<n-1$ , $q\in [\frac 12,1]$ , and S be as above. Then

$$ \begin{align*}\|I_{S,s}(g_1,g_2)\|_{L^q(S)} \lesssim Q(S)^{2(n-1)}\|g_1\|_{L^{2q}(S)} \|g_2\|_{L^{2q}(S)}, \end{align*} $$

where the implicit constant depends on $n,s,q$ and the diameter of S.

In order to prove Theorem 7.2 we adapt the argument of Kenig and Stein [Reference Kenig and Stein33] from the Euclidean setting.

Proof of Theorem 7.2.

For each dyadic scale $\lambda \lesssim \mathrm {diam\,}(S)$ we decompose S into a collection $\Theta _\lambda $ of $\lambda $ -caps $\theta $ , noting that $|\theta | \sim \lambda ^{n-1}$ for such a cap. Performing a dyadic decomposition and using the embedding $\ell ^q \subset \ell ^1$ , for $q\le 1$ , we have that (recall that $u"=R_{u}u'$ )

$$ \begin{align*} \int_S I_{S,s}(g_1,g_2)^q \mathrm{d}\sigma(u) &\lesssim \sum_{0<\lambda \lesssim\mathrm{diam\,}(S)} \lambda^{-qs} \int_S \Bigg( \int_{u'\in S: |u'-u"|\sim \lambda}g_1(u') g_2(u" ) J(u,u') \mathrm{d}\sigma(u') \Bigg)^{q} \mathrm{d}\sigma(u). \end{align*} $$

Next, we fix an arbitrary dyadic scale $\lambda $ and decompose

$$ \begin{align*} \int_S \Bigg( \int_{u'\in S: |u'-u"|\sim \lambda}&g_1(u') g_2( u" ) J(u,u') \mathrm{d}\sigma(u') \Bigg)^q \mathrm{d}\sigma(u)\\ &= \sum_{\theta\in\Theta_\lambda} \int_\theta \Bigg( \int_{u'\in S: |u'-u"|\sim \lambda}g_1(u') g_2( u" ) J(u,u') \mathrm{d}\sigma(u') \Bigg)^q \mathrm{d}\sigma(u)\\ &\lesssim \lambda^{(n-1)(1-q)} \sum_{\theta\in\Theta_\lambda}\Bigg( \int_\theta \int_{u'\in S: |u'-u"|\sim \lambda}g_1(u') g_2( u" ) J(u,u') \mathrm{d}\sigma(u') \mathrm{d}\sigma(u)\Bigg)^q. \end{align*} $$

Here we used that $0<q\le 1$ once more. Recall that $|u-u'|\lesssim Q(S) |u' - u"|$ for all $u,u'\in S$ by Proposition 4.4. Thus if $u \in \theta \in \Theta _\lambda $ and $|u' - u"| \sim \lambda $ , then $|u-u'|\lesssim Q(S)\lambda $ which means that $u' \in \theta ^*$ , where $\theta ^*$ is an $O(Q(S))$ dilate of $\theta $ . Similarly, $u" \in \theta ^*$ . Consequently,

for $p\ge 1$ , where we have used that

$$ \begin{align*}\int_S\int_S f(u)g(u") J(u,u') \mathrm{d}\sigma(u)\mathrm{d}\sigma(u') = \|f\|_{L^1(S)}\|g\|_{L^1(S)}.\end{align*} $$

Since $q\ge \frac 12$ and $p=2q$ , we obtain that

$$ \begin{align*} \begin{aligned} \displaystyle\int_S \Bigg( \int_{u'\in S: |u'-u"|\sim \lambda}&g_1(u') g_2( u" ) J(u,u') \mathrm{d}\sigma(u') \Bigg)^q \mathrm{d}\sigma(u) \\ &\displaystyle\lesssim \lambda^{(n-1)(1-q)} (Q(S)\lambda)^{(n-1)\frac{2q}{p'}} Q(S)^{n-1}\|g_1 \|_{L^p(S)}^{q} \|g_2 \|_{L^p(S)}^{q} \\ &\displaystyle=\lambda^{(n-1)(1-q)+(n-1)\frac{2q}{p'}} Q(S)^{2q(n-1)}\|g_1 \|_{L^p(S)}^{q} \|g_2 \|_{L^p(S)}^{q}, \end{aligned} \end{align*} $$

since the set of dilated caps $\{\theta ^{\ast }:\theta \in \Theta _{\lambda }\}$ covers S with a $Q(S)^{n-1}$ overlap factor. The geometric series converges as long as $-qs + (n-1)( 1- q + \frac {2q}{p'} )>0$ . Since $p=2q$ , this is equivalent to $s<n-1$ .

8 Surface-carried maximal operators

Recall from Section 4 that the geometric Wigner distribution $W_S(g,g)$ possesses the marginal properties (4.11) and (4.12). In the (superficially) more general polarised form these are the identities

(8.1)

$$ \begin{align} \int_{S}W_S(g_1,g_2)(u,P_{T_uS}x)\mathrm{d}\sigma(u)=\widehat{g_1\mathrm{d}\sigma}(x)\overline{\widehat{g_2\mathrm{d}\sigma}(x)} \end{align} $$

and

(8.2)

$$ \begin{align} \int_{T_uS}W_S(g_1,g_2)(u,v)\mathrm{d}v=g_1(u)\overline{g_2(u)} \end{align} $$

respectively. While (8.1) is an elementary consequence of Fubini’s theorem and the definition of the Jacobian J, the property (8.2) appears to be a little more delicate in general. In particular, for $g_1,g_2$ merely in $L^2$ , the integral in identity (8.2) should be interpreted as a suitable pointwise limit – see the forthcoming Proposition 8.4. As may be expected, a maximal analogue of the bilinear fractional integral operator $I_{S,s}$ of Section 7 naturally arises in our analysis. For locally integrable functions $f_1,f_2:S\rightarrow \mathbb {R}_+$ and $0<\delta <1$ we define the ‘averaging’ operator

$$ \begin{align*}A_{S,\delta}(f_1,f_2)(u)=\delta^{-(n-1)}\int_{|u'-R_uu'|<\delta}f_1(u')f_2(R_uu')J(u,u')\mathrm{d}\sigma(u'), \end{align*} $$

and maximal operator

$$ \begin{align*}M_S(f_1,f_2)(u)=\sup_{0<\delta<1}A_{S,\delta}(f_1,f_2)(u). \end{align*} $$

Remark 8.1 (Relation to classical maximal operators).

The operator $M_S$ is a surface-carried variant of the classical bi(sub)-linear Hardy–Littlewood maximal operator

$$ \begin{align*}M(f_1,f_2)(x)=\sup_{\delta>0}\frac{1}{|B(0,\delta)|}\int_{B(0,\delta)}f_1\left(x+\frac{y}{2}\right)f_2\left(x-\frac{y}{2}\right)\mathrm{d}y \end{align*} $$

on a Euclidean space.

We shall need the following estimate:

Theorem 8.2. If S is smooth, strictly convex and has finite curvature quotient $Q(S)$ , then

(8.3)

$$ \begin{align} M_{S}:L^2(S)\times L^2(S)\rightarrow L^{1,\infty}(S). \end{align} $$

Proof. We begin by using the Cauchy–Schwarz inequality to write

$$ \begin{align*} \begin{aligned} A_{S,\delta}(f_1,f_2)(u)&\leq\left(\delta^{-(n-1)}\int_{|u'-R_uu'|<\delta}f_1(u')^2J(u,u')\mathrm{d}\sigma(u')\right)^{1/2}\\&\;\;\;\;\;\;\times\left(\delta^{-(n-1)}\int_{|u'-R_uu'|<\delta}f_2(R_uu')^2J(u,u')\mathrm{d}\sigma(u')\right)^{1/2}. \end{aligned} \end{align*} $$

Making the change of variables $R_uu'=u"$ in the second factor above, using Proposition 4.5, and the fact that $R_uu"=u'$ , we see that

$$ \begin{align*}\begin{aligned} \int_{|u'-R_uu'|<\delta}f_2(R_uu')^2J(u,u')\mathrm{d}\sigma(u')&=\int_{|u"-R_uu"|<\delta}f_2(u")^2J(u,u')\Delta(u,u")\mathrm{d}\sigma(u")\\&=\int_{|u"-R_uu"|<\delta}f_2(u")^2J(u,u")\mathrm{d}\sigma(u")\\&=\int_{|u'-R_uu'|<\delta}f_2(u')^2J(u,u')\mathrm{d}\sigma(u'). \end{aligned} \end{align*} $$

Thus,

$$ \begin{align*}M_S(f_1,f_2)(u)\leq M_S^1(f_1^2)(u)^{1/2}M_S^1(f_2^2)(u)^{1/2},\end{align*} $$

where

$$ \begin{align*}M_S^1(f)(u):=\sup_{0<\delta<1}\delta^{-(n-1)}\int_{|u'-R_uu'|<\delta}f(u')J(u,u')\mathrm{d}\sigma(u').\end{align*} $$

Hence

$$ \begin{align*} \begin{aligned} \lambda\left|\left\{u\in S:M_S(f_1,f_2)(u)>\lambda\right\}\right|&\leq\lambda\left|\left\{u\in S:M_S^1(f_1^2)(u)M_S^1(f_2^2)(u)>\lambda^2\right\}\right|\\&\leq \lambda\left|\left\{u\in S:M_S^1(f_1^2)(u)>\varepsilon\lambda\right\}\right|\\ &\quad +\lambda\left|\left\{u\in S:M_S^1(f_2^2)(u)>\varepsilon^{-1}\lambda\right\}\right| \end{aligned} \end{align*} $$

for all $\varepsilon>0$ . We claim that the sublinear operator $M_S^1$ is of weak-type (1,1), and assuming this momentarily we have

$$ \begin{align*} \begin{aligned} \lambda\left|\left\{u\in S:M_S(f_1,f_2)(u)>\lambda\right\}\right|&\lesssim \varepsilon^{-1}\|f_1\|_2^2+\varepsilon\|f_2\|_2^2 \end{aligned} \end{align*} $$

uniformly in $\varepsilon $ . Optimising in $\varepsilon $ now yields the claimed weak-type bound on the bi-sublinear operator $M_S$ . A similar argument in a Euclidean context may be found in [Reference Grafakos30].

It remains to establish that $M_S^1:L^1(S)\rightarrow L^{1,\infty }$ , and we do this by applying the well-known abstract form of the classical Hardy–Littlewood maximal theorem presented in [Reference Stein48]. To this end we let $B_\delta (u)=\{u'\in S: \rho (u,u')<\delta \}$ , the ball in S centred at u with respect to the function $\rho (u,u'):=|u'-R_uu'|$ . By Proposition 4.4 it follows that $\rho $ is a quasi-distance, as defined in [Reference Stein48] (p. 10). Specifically, we may quickly verify that (i) $\rho (x,y)=0\iff x=y$ , (ii) $\rho (x,y)\leq c\rho (y,x)$ , and (iii) $\rho (x,y)\leq c(\rho (x,z)+\rho (y,z))$ , for some positive constant c depending on $Q(S)$ . By the change of variables (4.19) and an application of Proposition 4.16,

(8.4)

$$ \begin{align} |B_\delta(u)|=\int_{|\xi|\leq \delta}\widetilde{J}(u,u'(\xi))^{-1}\mathrm{d}\xi\leq \delta^{n-1}, \end{align} $$

so that

$$ \begin{align*}M_S^1f(u)\leq\sup_{0<\delta<1}\frac{1}{|B_\delta(u)|}\int_{B_\delta(u)}f(u')J(u,u')\mathrm{d}\sigma(u').\end{align*} $$

Arguing as in the proof of (4.31), we have

$$ \begin{align*} \displaystyle J(u,u')\lesssim Q(S)^{\frac{5(n-2)}{2}} \frac{|u"-u'|}{|N(u")-N(u)|} \sup_{p}\lambda_{n-1}(p), \end{align*} $$

which by a further use of Proposition 4.4 and the mean value theorem applied to the Gauss map shows that $J(u,u')$ is, up to a dimensional constant, bounded from above by a power of $Q(S)$ . Consequently,

$$ \begin{align*}M_S^1f(u)\lesssim \sup_{0<\delta<1}\frac{1}{|B_\delta(u)|}\int_{B_\delta(u)}f(u')\mathrm{d}\sigma(u'),\end{align*} $$

where the implicit constant is permitted to depend on $Q(S)$ . It remains to show that the surface measure on S is doubling with respect to the family of balls $B_\delta (u)$ , as we may then apply the abstract Hardy–Littlewood maximal theorem of [Reference Stein48] (see p. 37). By (8.4) it suffices to show that $|B_\delta (u)|\geq cQ(S)^{n-1}\delta ^{n-1}$ , for some dimensional constant c. However, this follows from Proposition 4.4 since

$$ \begin{align*}B_\delta(u)\supseteq\{u'\in S: |u'-u|\lesssim Q(S)\delta\}.\\[-37pt] \end{align*} $$

Remark 8.3 ( $L^p$ estimates for $M_S$ ).

A minor modification of the arguments in the proof of Theorem 8.2 (a use of Hölder’s inequality in place of the Cauchy–Schwarz inequality) shows that $M_S:L^{p_1}(S)\times L^{p_2}(S)\rightarrow L^q(S)$ whenever $p_1, p_2, q>1$ and $\tfrac {1}{p_1}+\tfrac {1}{p_2}=\tfrac {1}{q}$ . Implicitly, and as in the statement of Theorem 8.2, the bounds here depend on the dimension and $Q(S)$ .

Equipped with the above maximal theorem, we may now clarify the marginal property (8.2). While we expect that (8.2) (suitably interpreted) holds for all of the submanifolds S that we consider in this paper, our approach seems to require the additional assumption that

(8.5)

$$ \begin{align} \lim_{u'\rightarrow u}(\mathrm{d}R_u)_{u'}\;\;\text{ exists.} \end{align} $$

We note that (8.5) requires some interpretation since for each $u'\not =u$ , the map $(\mathrm {d}R_u)_{u'}:T_{u'}S\rightarrow T_{u"}S$ , and the limit should be interpreted as a linear transformation of $T_u S$ . One way to do this is to parametrise S by $T_uS$ , upon which the map $R_u$ may be parametrised by a map $y_u$ on the fixed domain $T_uS$ . We clarify this technical point in the arguments that follow. The local statement (8.5) appears to be an extremely mild assumption. It is straightforward to verify for parabolic S, and since a smooth strictly convex surface is locally parabolic (by Taylor’s theorem), one might reasonably expect it to be verifiable in general.

Proposition 8.4. Let S be smooth and strictly convex. Suppose $\chi $ is a Schwartz function on $T_uS$ with $\chi (0)=1$ , and $\chi _r(v)=\chi (v/r)$ for each $r>0$ . Then for compactly supported $g_1,g_2\in L^2(S)$ ,

$$ \begin{align*}\int_{T_uS}W_S(g_1,g_2)(u,v)\chi_r(v)\mathrm{d}v\rightarrow g_1(u)\overline{g_2(u)} \end{align*} $$

as $r\rightarrow \infty $ for almost every $u\in S$ . Moreover, if $g_1,g_2$ are continuous, then this convergence holds at all points u.

Before we turn to the proof of Proposition 8.4, we state a lemma whose (somewhat technical) proof we leave to the end of the section.

Lemma 8.5. If the limit (8.5) exists then for each $u\in S$ ,

$$ \begin{align*}\lim_{u'\rightarrow u}\widetilde{J}(u,u')=2^{n-1}\end{align*} $$

and

$$ \begin{align*}\lim_{\substack{u'\rightarrow u\\u'-u"\in\langle\omega\rangle}}J(u,u')=2^{n-1}\end{align*} $$

for each $\omega \in T_uS\backslash \{0\}$ .

Proof of Proposition 8.4.

We begin by writing

(8.6)

$$ \begin{align} \int_{T_uS}W_S(g_1,g_2)(u,v)\chi_r(v)\mathrm{d}v&=\int_{T_uS}\int_S g_1(u')\overline{g_2(R_uu')}e^{-2\pi i v\cdot(u'-R_uu')}J(u,u')\mathrm{d}\sigma(u')\chi_r(v)\mathrm{d}v \nonumber\\&=\int_S g_1(u')\overline{g_2(R_uu')}\widehat{\chi}_r(u'-R_uu')J(u,u')\mathrm{d}\sigma(u') \nonumber\\&=:\mathcal{A}_{S,r}(g_1,g_2)(u). \end{align} $$

Since $\widehat {\chi }$ is a bump function, it follows that $\mathcal {M}_S(g_1,g_2)(u)\lesssim M_S(|g_1|,|g_2|)(u)$ where

$$ \begin{align*}\mathcal{M}_S(g_1,g_2)(u):=\sup_{r>1}|\mathcal{A}_{S,r}(g_1,g_2)(u)|. \end{align*} $$

Consequently

(8.7)

$$ \begin{align} \mathcal{M}_S:L^2(S)\times L^2(S)\rightarrow L^{1,\infty}(S), \end{align} $$

by Theorem 8.2. Proposition 8.4 requires us to show that

(8.8)

$$ \begin{align} \mathcal{A}_{S,r}(g_1,g_2)(u)\rightarrow g_1(u)\overline{g_2(u)}\;\text{ for almost every }\;u\in S. \end{align} $$

The first step, which uses a minor variant of a standard argument in the setting of sublinear maximal operators (see, e.g., [Reference Stein and Weiss49]), is to use the maximal estimate (8.7) to reduce to the case of continuous $g_1,g_2$ . We leave this classical exercise to the reader. Suppose now that $g_1,g_2$ are continuous functions. It suffices to show that

(8.9)

$$ \begin{align} \mathcal{A}_{S,r}(1,1)(u):=\int_S \widehat{\chi}_r(u'-R_uu')J(u,u')\mathrm{d}\sigma(u')\rightarrow 1. \end{align} $$

Invoking the change of variables (4.19) and using polar coordinates in $T_uS$ we have

$$ \begin{align*} \begin{aligned} \mathcal{A}_{S,r}(1,1)(u)&=\int_{T_uS}\widehat{\chi}_r(\xi)\frac{J(u,u'(\xi))}{\widetilde{J}(u,u'(\xi))}\mathrm{d}\xi\\ &=\int_0^\infty \int_{\mathbb{S}^{n-2}(T_uS)}r^{n-1}\widehat{\chi}(rt\omega)\frac{J(u,u'(t\omega))}{\widetilde{J}(u,u'(t\omega))}\mathrm{d}\sigma(\omega)t^{n-2}\mathrm{d}t\\ &=\int_0^\infty \int_{\mathbb{S}^{n-2}(T_uS)}\widehat{\chi}(s\omega)\frac{J(u,u'(r^{-1}s\omega))}{\widetilde{J}(u,u'(r^{-1}s\omega))}\mathrm{d}\sigma(\omega)\mathrm{d}s, \end{aligned} \end{align*} $$

where $\mathbb {S}^{n-2}(T_uS)$ denotes the unit sphere in $T_uS$ . The limit (8.9) now follows by Lemma 8.5 since $u'(r^{-1}s\omega )\rightarrow u$ as $r\rightarrow \infty $ , while $u'(r^{-1}s\omega )-R_uu'(r^{-1}s\omega )=r^{-1}s\omega \in \langle \omega \rangle $ .

It remains to prove Lemma 8.5.

Proof of Lemma 8.5.

We begin by clarifying the hypothesis (8.5), and showing that this limit must actually equal $-I$ , where I denotes the identity on $T_uS$ . This reflects a crucial ‘limiting symmetry’ of the configuration of points $u,u',u"$ as $u'\rightarrow u$ . By translation and rotation invariance we may suppose that $u=0$ and $S=\{(x',\phi (x')):x'\in X\}$ , for some smooth real-valued function $\phi $ on a subset X of $T_uS$ satisfying $\nabla \phi (0)=0$ and $\mathrm {Hess\,}(\phi )(x')>_{pd}0$ for all $x'$ . The map $R:=R_u$ then takes the form $R(x',\phi (x'))=(x",\phi (x"))$ , for some unique $x"\in T_uS$ satisfying

(8.10)

$$ \begin{align} \phi(x")=\phi(x') \end{align} $$

and

(8.11)

$$ \begin{align} \frac{\nabla\phi(x")}{|\nabla\phi(x")|}=-\frac{\nabla\phi(x')}{|\nabla\phi(x')|}. \end{align} $$

Observe that (8.10) follows by (4.2) and (8.11) is a consequence of (4.28). Writing $x"=y(x')$ allows us to interpret (8.5) as the existence of the limit $\mathrm {d}y_0:=\lim _{x'\rightarrow 0}\mathrm {d}y_{x'}:T_{u}S\rightarrow T_{u}S$ . In order to show that $\mathrm {d}y_0=-I$ , we fix $v\in T_{u}S$ and let $x_{k}'\rightarrow 0$ be a sequence in $T_{u}S$ satisfying

$$ \begin{align*}\frac{\nabla\phi(y(x_{k}'))}{|\nabla\phi(y(x_{k}'))|}=v\end{align*} $$

for all k. This sequence exists as the Gauss maps $\widetilde {N}$ of the sections $\mathcal {S}_{u,u'}$ (see (4.27)) are bijections. Differentiating (8.10) at the points of this sequence, we have

$$ \begin{align*} \mathrm{d}y(x_{k}')^{\top}(\nabla\phi(y(x_{k}')))=\nabla\phi(x'). \end{align*} $$

Using (8.11),

$$ \begin{align*} \mathrm{d}y(x_{k}')^{\top}\left(\frac{\nabla\phi(x_{k}')}{|\nabla\phi(x_{k}')|}\right)=-\frac{|\nabla\phi(x_{k}')|}{|\nabla\phi(y(x_{k}'))|}\frac{\nabla\phi(x_{k}')}{|\nabla\phi(x_{k}')|}, \end{align*} $$

which implies

(8.12)

$$ \begin{align} \mathrm{d}y(x_{k}')^{\top}(v)=-\frac{|\nabla\phi(x_{k}')|}{|\nabla\phi(y(x_{k}'))|}v \end{align} $$

for all $k\in \mathbb {N}$ . By the mean-value inequality,

$$ \begin{align*} \frac{|\nabla\phi(x_{k}')|}{|\nabla\phi(y(x_{k}'))|}\leq\frac{\sup\|\mathrm{Hess\,}\phi\|_{\infty}}{\inf\|\mathrm{ Hess\,}\phi\|_{\infty}}\frac{|x_{k}'|}{|y(x_{k}')|}\leq \frac{\sup\|\mathrm{Hess\,}\phi\|_{\infty}}{\inf\|\mathrm{ Hess\,}\phi\|_{\infty}}\frac{1}{\|\mathrm{d}y(c_{k})\|_{\infty}} \end{align*} $$

for some $c_{k}$ with $c_{k}\rightarrow 0$ . On the other hand, $y(y(x_{k}'))=x_{k}'$ (recall that $R_{u}(R_{u}u')=u'$ ), hence $\mathrm {d}y(y(x_{k}'))\circ \mathrm {d}y(x_{k}')=I$ , which gives $\mathrm {d}y_0^{2}=I$ , therefore $\|\mathrm {d}y(c_{k})\|_{\infty }$ does not approach $0$ and the sequence

$$ \begin{align*}\frac{|\nabla\phi(x_{k}')|}{|\nabla\phi(y(x_{k}'))|}\end{align*} $$

is bounded. By passing to a subsequence and by taking limits, we conclude from (8.12) that

$$ \begin{align*} \mathrm{d}y_0^{\top}(v)=-Lv \end{align*} $$

for some positive real number L and for all $v\in T_{u}S$ . On the other hand, since $\mathrm {d}y_0^{2}=I$ , the only possible eigenvalues of $\mathrm {d}y_0$ are $\pm 1$ , hence $\mathrm {d}y_0=-I$ . Finally, taking the limit as $u'\rightarrow u$ in the first identity of (4.30) gives

(8.13)

$$ \begin{align} \lim_{u'\rightarrow u}\widetilde{J}(u,u')=2^{n-1}. \end{align} $$

Turning to the limiting identity for J, we first establish some bounds relating to the limiting arrangements of the points $u, u', u"$ and their normals $N(u), N(u'), N(u")$ , beginning with

(8.14)

$$ \begin{align} u'+u"-2u=o(|u-u'|). \end{align} $$

To see this (recalling that we are supposing $u=0$ ) observe that $u'+u"=(x'+y(x'), 2\phi (x'))$ , and since $\phi (x')=O(|x'|^2)$ , it remains to show that $h(x'):=x'+y(x')=o(|x'|)$ . By the mean value theorem, it suffices to observe that $\mathrm {d}h_{x'}=I+\mathrm {d}y_{x'}=o(1)$ as $x'\rightarrow 0$ , since $\mathrm {d}y_{x'}\rightarrow -I$ . A similar, albeit lengthier argument reveals that

(8.15)

$$ \begin{align} N(u')+N(u")-2N(u)=o(|u-u'|). \end{align} $$

Recalling the formula for $J(u,u')$ , we observe first that the factor

$$ \begin{align*}\frac{|N(u')\wedge N(u")|}{|N(u')\wedge N(u)|}=\frac{2|N(u')\wedge N(u)|+o(|u'-u|)}{|N(u')\wedge N(u)|}\rightarrow 2 \end{align*} $$

as $u'\rightarrow u$ . Here we are also using (1.12), which tells us that $|N(u')\wedge N(u)|\sim |u'-u|$ . It remains to show that for each unit vector $\omega \in T_uS$ ,

(8.16)

$$ \begin{align} \left|\frac{\langle u"-u',N(u")\rangle}{\langle P_{T_{u"}S}N(u),(\mathrm{d}N_{u"})^{-1}(P_{T_{u"}S}N(u))\rangle}\right|\rightarrow 2 \end{align} $$

as $u'\rightarrow u$ with $u'-u"\in \langle \omega \rangle $ . Noting that $\langle u"-u', N(u")\rangle =\langle u"-u', P_{T_uS} N(u")\rangle $ , by (8.14) we are reduced to showing that

$$ \begin{align*}\lim_{\substack{u'\to u\\u'-u"\in\langle\omega\rangle}} \frac{ \langle u"-u, P_{T_uS} N(u") \rangle }{ \langle P_{T_{u"}S} N(u), (\mathrm{d}N_{u"})^{-1} P_{T_{u"}S} N(u) \rangle } =1. \end{align*} $$

By symmetry, we may replace $u"$ by $u'$ here, so that the objective is to show that

(8.17)

$$ \begin{align} \lim_{\substack{u'\to u\\u'-u"\in\langle\omega\rangle}} \frac{ \langle u'-u, P_{T_uS} N(u') \rangle }{ \langle P_{T_{u'}S} N(u), (\mathrm{d}N_{u'})^{-1} P_{T_{u'}S} N(u) \rangle } =1. \end{align} $$

To this end we Taylor expand $N(u')$ about $0$ via the parametrisation $u'=(x',\phi (x')) =: \Phi (x')$ to obtain

$$ \begin{align*} N(u') &= N\circ \Phi(x') = N\circ \Phi (0) + \mathrm{d}(N\circ \Phi)_0 x' + O(|x'|^2)\\ &= N(u) + (\mathrm{d}N)_u\circ (\mathrm{d}\Phi)_0 x' + O(|x'|^2) \\ &= N(u) + (\mathrm{d}N)_u x' + O(|x'|^2), \end{align*} $$

where we have used that $(\mathrm {d}\Phi )_{x'} = \begin {pmatrix} \mathrm {id}_{\mathbb {R}^{n-1}} & \mathbf {0} \\ \nabla _{n-1} \phi (x') & 0\end {pmatrix}$ and $\nabla \phi (0)=0$ . Thus, in view of the fact that $|x'|= O(|u'-u|)$ we have

$$ \begin{align*}x' = (\mathrm{d}N)_u^{-1} \big( N(u') - N(u) + O(|u'-u|^2) \big). \end{align*} $$

The numerator of (8.17) now becomes

$$ \begin{align*} \langle u'-u, P_{T_uS} N(u') \rangle &= \langle P_{T_uS}(u'-u), P_{T_uS} N(u') \rangle \\ &= \langle x', P_{T_uS} N(u') \rangle \\ &= \big\langle (\mathrm{d}N)_u^{-1} \big( N(u') - N(u) + O(|u'-u|^2) \big), P_{T_uS} N(u') \big\rangle. \end{align*} $$

Note that

$$ \begin{align*}P_{T_uS} N(u') = N(u')-N(u) + O(|u'-u|^2), \end{align*} $$

and so

$$ \begin{align*}\langle u'-u, P_{T_uS} N(u') \rangle = \big\langle (\mathrm{d}N)_u^{-1} \big( N(u') - N(u) + O(|u'-u|^2) \big), \big( N(u') - N(u) + O(|u'-u|^2) \big) \big\rangle. \end{align*} $$

This is now similar to the denominator of (8.17). In fact,

$$ \begin{align*}P_{T_{u'}S} N(u) = - (N(u')-N(u)) + O(|u'-u|^2), \end{align*} $$

and so

$$ \begin{align*} &\frac{ \langle u'-u, P_{T_uS} N(u') \rangle }{ \langle P_{T_{u'}S} N(u), (\mathrm{d}N_{u'})^{-1} P_{T_{u'}S} N(u) \rangle }\\ &\;\;\;\;\;\;\;\;= \frac{ \big\langle (\mathrm{d}N)_u^{-1} \big( N(u') - N(u) + O(|u'-u|^2) \big), \big( N(u') - N(u) + O(|u'-u|^2) \big) \big\rangle }{ \big\langle (\mathrm{d}N)_{u'}^{-1} \big( N(u') - N(u) + O(|u'-u|^2) \big), \big( N(u') - N(u) + O(|u'-u|^2) \big) \big\rangle }. \end{align*} $$

Further, from (8.15) we have

$$ \begin{align*}N(u') - N(u) = \frac12( N(u') - N(u") ) + o(|u-u'|), \end{align*} $$

and hence,

$$ \begin{align*} &\frac{ \langle u'-u, P_{T_uS} N(u') \rangle }{ \langle P_{T_{u'}S} N(u), (\mathrm{d}N_{u'})^{-1} P_{T_{u'}S} N(u) \rangle }\\ &\;\;\;\;\;\;\;\;= \frac{ \big\langle (\mathrm{d}N)_u^{-1} \big( N(u') - N(u") + o(|u'-u|) \big), \big( N(u') - N(u") + o(|u'-u|) \big) \big\rangle }{ \big\langle (\mathrm{d}N)_{u'}^{-1} \big( N(u') - N(u") + o(|u'-u|) \big), \big( N(u') - N(u") + o(|u'-u|) \big) \big\rangle }. \end{align*} $$

Consequently, if

$$ \begin{align*}\lim_{\substack{u'\to u\\ u'-u"\in\langle\omega\rangle}} \frac{N(u') - N(u")}{|N(u') - N(u")|}\end{align*} $$

exists, then (8.17) follows. Here we have also appealed to the fact that

$$ \begin{align*}|N(u')-N(u")| = |(\mathrm{d}N)_u(u'-u") + O(|u-u'|^2)| \gtrsim |u-u'|. \end{align*} $$

Arguing similarly using Taylor’s theorem, we also have

$$ \begin{align*}N(u")-N(u) = (\mathrm{d}N)_u x" + O(|x"|^2), \end{align*} $$

from which it follows that

$$ \begin{align*}N(u') - N(u") = (\mathrm{d}N)_u x' -(\mathrm{d}N)_u x" + O(|x'|^2) + O(|x"|^2) = (\mathrm{d}N)_u ( u'-u" ) + O(|u-u'|^2), \end{align*} $$

and so

$$ \begin{align*}\frac{N(u') - N(u")}{|N(u') - N(u")|} = \frac{(\mathrm{d}N)_u ( u'-u" ) + O(|u-u'|^2)}{|(\mathrm{d}N)_u ( u'-u" )| + O(|u-u'|^2)} =\frac{(\mathrm{d}N)_u ( \omega ) + O(|u-u'|)}{|(\mathrm{d}N)_u ( \omega )| + O(|u-u'|)}, \end{align*} $$

which converges (to $(\mathrm {d}N)_u\omega /|(\mathrm {d}N)_u\omega |$ ) as $u'\rightarrow u$ with $u'-u"\in \langle \omega \rangle $ , as required.

9 Tomographic constructions

In this section we show that the explicit geometric Wigner distributions from Section 4 may be constructed tomographically from the corresponding extension operators, at least when $n=2$ . This is motivated by the tomographic approach to weighted extension inequalities developed in [Reference Bennett and Nakamura14, Reference Bennett, Nakamura and Shiraki15]. For the submanifolds S considered in Section 4, we saw that the natural tomographic transform is the S-parametrised X-ray transform $X_Sw(u,v):=Xw(N(u),v)$ . Here X denotes the standard X-ray transform and N the Gauss map of S. We remark that if the Gauss map is bijective, such as when S is strictly convex and closed, the operator $X_S$ is easily seen to inherit the inversion formula

$$ \begin{align*}c_nX_S^*K(-\Delta_v)^{1/2}X_S\psi=\psi \end{align*} $$

from the classical inversion formula for X; here $\psi $ is a suitably well-behaved function and $K(u)$ is the Gaussian curvature of S at a point u (acting multiplicatively). This suggests the following:

Proposition 9.1. Let $\Phi $ be a smooth bump function on $\mathbb {R}^2$ such that $\Phi (0)=1$ , and let $\Phi _\lambda (x)=\Phi (x/\lambda )$ for each $\lambda>0$ . If S is a strictly convex smooth curve in the plane then

$$ \begin{align*}\lim_{\lambda\rightarrow\infty}K(u)(-\Delta_v)^{1/2}X_S(\Phi_\lambda|\widehat{g\mathrm{d}\sigma}|^2)(u,v)=W_S(g,g)(u,v) \end{align*} $$

for all compactly supported smooth functions g on S.

Remark 9.2 (Phase-space tomographic methods in optics).

This spatial tomographic construction, which in the particular case of the circle is somewhat implicit in [Reference Bennett, Nakamura and Shiraki15], appears to be quite different from the phase-space tomographic constructions of Wigner distributions that have proved effective in optics. There it is observed that the phase-space X-ray transform applied to the Wigner distribution (referred to as the Radon–Wigner transform) identifies its marginal distributions in all directions, and that these marginals are natural hybrids of the coordinate marginals, involving the fractional Fourier transform. The Wigner distribution is then (re)constructed by an application of the classical (left) inverse X-ray transform; see, for example, [Reference Bertrand and Bertrand17, Reference Alonso3].

Remark 9.3. The cut-off $\Phi _\lambda $ is included in the statement of Proposition 9.1 as $X_S(|\widehat {g\mathrm {d}\sigma }|^2)$ is not in general defined for $g\in L^2(S)$ (unless there is a suitable transversality property satisfied – see [Reference Bennett, Nakamura and Shiraki15]). This may already be seen when $S=\mathbb {S}^1$ and $g\equiv 1$ , as then $|\widehat {g\mathrm {d}\sigma }(x)|^2$ is comparable to $(1+|x|)^{-1}$ on sufficiently large portions of $\mathbb {R}^2$ .

Proof of Proposition 9.1.

A routine (distributional) argument, using the well-known fact that

$$ \begin{align*}\mathcal{F}_v(Xf)(\omega,\xi)=\widehat{f}(\xi),\;\;\xi\in\langle\omega\rangle^\perp,\end{align*} $$

reveals that

(9.1)

$$ \begin{align} \begin{aligned} (-\Delta_v)^{1/2}X_S( \Phi_\lambda|\widehat{g\mathrm{d}\sigma}|^2)(u,v)& =\int_{T_u S}e^{2\pi i\xi\cdot v}|\xi|\widehat{\Phi}_\lambda\ast (g\mathrm{d}\sigma)*(\widetilde{g\mathrm{d}\sigma})(\xi)\mathrm{d}\xi. \end{aligned} \end{align} $$

In order to take the limit as $\lambda \to \infty $ it suffices, by the dominated convergence theorem, to show that

(9.2)

$$ \begin{align} \sup_{\lambda\geq 1}|\xi| |\widehat{\Phi}_\lambda|\ast (g\mathrm{d}\sigma)*(\widetilde{g\mathrm{d}\sigma})(\xi)\lesssim(1+|\xi|)^{-N} \end{align} $$

for some sufficiently large $N\in \mathbb {N}$ . This may be seen by first appealing to the strict convexity of S, along with the assumed properties of g, to show that for some ball $B\subset \mathbb {R}^2$ ; see [Reference Silva43, Section 2] for the appropriate detailed computations. The estimate (9.2) then follows using the rapid decay of $\widehat {\Phi }$ . Taking this limit, it follows that

$$ \begin{align*} \lim_{\lambda\to\infty}(-\Delta_v)^{1/2}&X_S( \Phi_\lambda|\widehat{g\mathrm{d}\sigma}|^2)(u,v) \\&=\int_{T_u S}e^{2\pi i\xi\cdot v}|\xi| (g\mathrm{d}\sigma)*(\widetilde{g\mathrm{d}\sigma})(\xi)\mathrm{d}\xi \\&=\int_S\int_Sg(u')\overline{g(u")}e^{2\pi i(u'-u")\cdot v}|u'-u"|\delta((u'-u")\cdot N(u))\mathrm{d}\sigma(u")\mathrm{d}\sigma(u'). \end{align*} $$

Now, for fixed $u,u'$ the function $u"\mapsto (u'-u")\cdot N(u)$ vanishes if and only if either $u"=u'$ or $u"=R_u u'$ , as defined in Section 4, and so it remains to establish the formula

(9.3)

$$ \begin{align} \int_S|u'-u"|\delta((u'-u")\cdot N(u))\mathrm{d}\sigma(u")=\frac{|u'-R_u u'|}{|N(u)\wedge N(R_u u')|} \end{align} $$

whenever $u'\not = u$ ; see Remark 4.6. Making the change of variables $u"'=u"-R_u u'$ (we stress that $u"$ is the variable of integration in (9.3) rather than a simplified notation for $R_{u}u'$ ), and using $\mathcal {H}^1$ to denote 1-dimensional Hausdorff measure in the plane, we have that

$$ \begin{align*}\begin{aligned} \int_S|u'-u"|\delta((u'-u")&\cdot N(u))\mathrm{d}\sigma(u")\\&=\int_{S-\{R_u u'\}}|u'-R_u u'-u"'|\delta(u"'\cdot N(u))\mathrm{d}\mathcal{H}^1(u"')\\ &=|u'-R_u u'|\lim_{\varepsilon\rightarrow 0}\frac{1}{2\varepsilon}\mathcal{H}^1\left(\{u"'\in (S-\{R_u u'\}):|u"'\cdot N(u)|<\varepsilon\}\right), \end{aligned} \end{align*} $$

from which (9.3) follows from the smoothness of S by elementary geometric considerations.

Remark 9.4 (Stein’s inequality as a lower bound on the X-ray transform).

Stein’s inequality (1.2) may of course be interpreted as a certain lower bound on the X-ray transform $X_S$ . Here we make some contextual remarks relating to this in the setting of the paraboloid, where the corresponding inequality (2.8) takes the form

(9.4)

$$ \begin{align} \begin{aligned} \int_{\mathbb{R}^d\times\mathbb{R}}|u(x,t)|^2w(x,t)\mathrm{d}x\mathrm{d}t\lesssim \int_{\mathbb{R}^d}\|\rho^*w(\cdot,v)\|_{L^\infty(\mathbb{R}^d)}|\widehat{u}_0(v)|^2\mathrm{d}v, \end{aligned} \end{align} $$

recalling the caveat in Remark 2.1. Somewhat similar-looking lower bounds may be obtained from the adjoint Loomis–Whitney inequality introduced in [Reference Bennett and Tao16]. Arguing as in [Reference Bennett and Tao16, Section 8] it follows that

(9.5)

$$ \begin{align} C(|\widehat{u}_0|^2)\|w\|_{L^p_{x,t}}\leq\left(\int_{\mathbb{R}^d}\|\rho^*w(\cdot,v)\|_{L^q(\mathbb{R}^d)}^r|\widehat{u}_0(v)|^2\mathrm{d}v\right)^{1/r} \end{align} $$

whenever $w\geq 0$ , $0<p,q\leq 1$ , $r>0$ and $\tfrac {1}{d+1}\left (\tfrac {1}{q}-1\right )=\tfrac {1}{d}\left (\tfrac {1}{p}-1\right )$ . Here

$$ \begin{align*}C(|\widehat{u}_0|^2):=\left(\int_{(\mathbb{R}^d)^{d+1}}\left|\det\left(\begin{array}{ccc} 1& \cdots & 1\\ 2v_1 & \cdots & 2v_{d+1}\\ \end{array}\right)\right|{}^{\frac{(d+1)r}{d}\left(\frac{1}{p}-1\right)}|\widehat{u}_0(v_1)|^2\cdots|\widehat{u}_0(v_{d+1})|^2\mathrm{d}v\right)^{\frac{1}{(d+1)r}}. \end{align*} $$

Of course (9.5), while superficially similar, is numerologically very different from (9.4), and also phenomenologically: $L^p$ norms below $L^1$ reflect spread rather than concentration. In particular, raising (9.5) to the rth power, setting $r=q$ and taking a limit as $p\rightarrow 0$ one obtains

(9.6)

$$ \begin{align} \begin{aligned} \Bigg(\int_{(\mathbb{R}^d)^{d+1}}\left|\det\left(\begin{array}{ccc} 1& \cdots & 1\\ 2v_1 & \cdots & 2v_{d+1}\\ \end{array}\right)\right|&|\widehat{u}_0(v_1)|^2\cdots|\widehat{u}_0(v_{d+1})|^2\mathrm{d}v\Bigg)^{\frac{1}{d+1}}|\mathrm{supp\,} w|^{\frac{d}{d+1}}\\&\leq \int_{\mathbb{R}^d}|\mathrm{supp\,} \rho^*w(\cdot,v)||\widehat{u}_0(v)|^2\mathrm{d}v. \end{aligned} \end{align} $$

It was observed in [Reference Bennett and Iliopoulou13] (see also [Reference Bennett, Nakamura and Shiraki15]) that the left-hand side of (9.6) (and the expression $C(|\widehat {u}_0|^2)$ in general) has a space-time formulation in terms of u, emphasising further the parallels with (9.4). The factor $|\mathrm {supp\,}\rho ^*w(\cdot ,v)|$ is a measure of the ‘visibility’ of w in the space-time direction $(-2v,1)$ , making (9.6) a certain visibility version of (9.4). Similar remarks may be made for more general surfaces S and are left to the interested reader.

10 Applications to a variant of Flandrin’s conjecture

The phase-space integral formula (2.6) exposes a formal similarity between the parabolic Mizohata–Takeuchi inequality (2.9) (or its local substitute (2.11)) and a variant of a conjecture of Flandrin [Reference Flandrin27] from time-frequency analysis. This conjecture, which was formulated in [Reference Delourme, Duyckaerts and Lerner25], states that

(10.1)

$$ \begin{align} \iint_{K}W(u_0,u_0)(x,v)\mathrm{d}x\mathrm{d}v\lesssim\|u_0\|_2^2 \end{align} $$

uniformly over all convex subsets K of $\mathbb {R}^d\times \mathbb {R}^d$ . This is a weakened form of the original conjecture that was made with constant $1$ , following a recent counterexample in [Reference Delourme, Duyckaerts and Lerner25]; we refer to [Reference Lerner37] for further discussion, along with a number of supporting results.

In this section we show that the basic methods of this paper are effective towards (10.1) by establishing a version of it in the plane involving an arbitrarily small loss in terms of the Lebesgue measure of K. We then show how (10.1) implies the parabolic Mizohata–Takeuchi inequality (2.9) for a special class of weights.

Theorem 10.1. For each $\varepsilon>0$ there exists a constant $C_\varepsilon <\infty $ such that

(10.2)

$$ \begin{align} \iint_K W(u_0,u_0)(x,v)\mathrm{d}x\mathrm{d}v\leq C_\varepsilon |K|^{\varepsilon}\|u_0\|_2^2 \end{align} $$

for all convex subsets K of $\mathbb {R}^2$ .

Proof. Arguing as in Section 2, and indeed Sections 3 and 4, by the Cauchy–Schwarz inequality and the duality of the homogeneous Sobolev spaces $\dot {H}^s$ and $\dot {H}^{-s}$ , we have

(10.3)

for each $s<\frac {1}{2}$ , where $\pi _2(K)\subseteq \mathbb {R}$ is the projection of K onto the v-axis. We now compute both of these Sobolev norms explicitly.

To compute the $\dot {H}^s_x$ norm, we fix v and observe that by the convexity of K,

almost everywhere for some real numbers $a, b$ . Since

with finite constant $c_s$ since $s<\frac {1}{2}$ . Here $\mathrm {diam\,}_1(K)$ is the diameter of K in the first coordinate direction.

To compute the $\dot {H}^{-s}_x$ norm we argue as in Section 2, and indeed Sections 3 and 4, to write

$$ \begin{align*}\|W(u_0,u_0)(\cdot, v)\|_{\dot{H}^{-s}_x}=I_{2s}(|\widehat{u}_0|^2,|\widehat{u}_0|^2)(v)^{1/2}, \end{align*} $$

where $ I_s $ is given by (2.25). We estimate this term further by applying the weak-type estimate

(10.4)

$$ \begin{align} \| I_{s}(g,g)\|_{L^{q,\infty}(\mathbb{R})} \lesssim \| g\|_{L^1(\mathbb{R})}^2, \end{align} $$

from [Reference Kenig and Stein33] (see also [Reference Grafakos and Kalton31]), which holds whenever $s\in (0,1)$ and $\frac 1q = 1+s$ . In particular, given $\varepsilon>0$ and writing $s_\varepsilon = \frac {1}2-\varepsilon $ , we have

$$ \begin{align*}\| I_{2s_\varepsilon} (g,g)^{1/2} \|_{L^{q_\varepsilon,\infty}(\mathbb{R})} = \| I_{2s_\varepsilon} (g,g) \|_{ L^{q_\varepsilon /2 , \infty} (\mathbb{R}) }^{1/2} \le C_\varepsilon \|g\|_1,\quad {q_\varepsilon}:= \frac{1}{1 - \varepsilon}. \end{align*} $$

With this in mind, we apply the Lorentz–Hölder inequality in (10.3) to write

where $\pi _2(K)$ is the projection of K onto the v-axis. Consequently,

It remains to observe that

$$ \begin{align*}\frac{1-2s_\varepsilon}{2} = \frac1{q_\varepsilon'} = \varepsilon, \end{align*} $$

and appeal to the fact that $\mathrm {diam\,}_1(K)$ is comparable to the average diameter $|K|/|\pi _2(K)|$ uniformly over all convex bodies K by an application of Brunn’s theorem.

Remark 10.2 (Higher dimensions).

Our proof of Theorem 10.1 does not extend to higher dimensions, at least readily. This may already be seen if K is the Euclidean unit ball in $\mathbb {R}^{2d}$ , since its d-dimensional sections, also being Euclidean balls, fail to belong to $\dot {H}^s$ whenever $s\geq 1/2$ ; see [Reference Stein48]. Evidently, a routine extension of our argument would require such control for all $s<d/2$ . For further discussion of Sobolev norms of indicator functions we refer to [Reference Faraco and Rogers26].

Remark 10.3 (Inequalities of Flandrin type for surface-carried Wigner distributions).

Our proof of Theorem 10.1 reveals that the convexity hypothesis on K may be weakened to the requirement that the sections $\{x\in \mathbb {R}:(x,v)\in K\}$ are intervals for each $v\in \mathbb {R}$ , provided we replace the measure of K with the diameter of K in (10.2). As such our argument should extend to Flandrin-type inequalities of the form

$$ \begin{align*}\iint_K W_S(g,g)\lesssim \|g\|_{L^2(S)}^2 \end{align*} $$

for the surface-carried Wigner distributions $W_S$ of Section 4, on the assumption that $K\subseteq TS$ is such that $\{v\in T_uS:(u,v)\in K\}$ is an interval for each $u\in S$ . This would require a weak-type addition to Theorem 7.2, analogous to Theorem 1(b) in [Reference Kenig and Stein33], and would introduce some dependence on the curvature quotient $Q(S)$ .

We conclude this section by establishing a simple direct connection between the parabolic Mizohata–Takeuchi inequality (2.9) and the Flandrin-type inequality (10.1), although with one caveat: that the support condition on the right-hand side of the parabolic Mizohata–Takeuchi inequality is dropped.

Proposition 10.4. If the Flandrin-type conjecture (10.1) is true, then the undirected Mizohata–Takeuchi inequality

(10.5)

$$ \begin{align} \int_{\mathbb{R}^d\times\mathbb{R}}|u(x,t)|^2w(x,t)\mathrm{d}x\mathrm{d}t\lesssim \|\rho^*w\|_\infty\|u_0\|_2^2 \end{align} $$

holds for space-time weight functions w that are concave in the spatial variable.

Proof. We begin by observing that if w is a concave function in the spatial variable, then $\rho ^*w$ is a concave function. This is immediate since whenever $(x_\lambda ,v_\lambda )=\lambda (x_1,v_1)+(1-\lambda )(x_2,v_2)\in \mathbb {R}^d\times \mathbb {R}^d$ ,

$$ \begin{align*} \begin{aligned} \rho^*w(x_\lambda,v_\lambda)&=\int_{\mathbb{R}}w(\lambda(x_1-2tv_1)+(1-\lambda)(x_2-2tv_2),t)\mathrm{d}t\\&\geq \int_{\mathbb{R}}\left(\lambda w(x_1-2tv_1,t)+(1-\lambda)w(x_2-2tv_2,t)\right)\mathrm{d}t\\ &= \lambda\rho^*w(x_1,v_1)+(1-\lambda)\rho^*w(x_2,v_2) \end{aligned} \end{align*} $$

for all $0<\lambda <1$ . Applying the layer-cake representation,

(10.6)

where $K(s)=\{(x,v)\in \mathbb {R}^d\times \mathbb {R}^d: \rho ^*w(x,v)\geq s\}$ , it follows from the convexity of $K(s)$ for each s, Fubini’s theorem and the conjectural inequality (10.1) that

$$ \begin{align*} \begin{aligned} \int_{\mathbb{R}^d\times\mathbb{R}}|u(x,t)|^2w(x,t)\mathrm{d}x\mathrm{d}t&= \int_{\mathbb{R}^d\times\mathbb{R}^d}W(u_0,u_0)(x,v)\rho^*w(x,v)\mathrm{d}x\mathrm{d}v\\&=\int_0^{\|\rho^*w\|_\infty}\left(\iint_{K(s)}W(u_0,u_0)(x,v)\mathrm{d}x\mathrm{d}v\right)\mathrm{d}s\\ &\lesssim\|\rho^* w\|_\infty\|u_0\|_2^2. \end{aligned}\\[-37pt] \end{align*} $$

Remark 10.5. If instead of applying the conjectural (10.1) one applies the established (10.2) in the proof of Proposition 10.4, an application of Chebyshev’s inequality reveals that

$$ \begin{align*} \begin{aligned} \int_{\mathbb{R}^d\times\mathbb{R}}|u(x,t)|^2w(x,t)\mathrm{d}x\mathrm{d}t&\leq C_\varepsilon\|u_0\|_2^2\int_0^{\|\rho^*w\|_\infty}|K(s)|^\varepsilon\mathrm{d}s \leq \frac{C_\varepsilon}{1-p\varepsilon} \|\rho^*w\|_p^{p\varepsilon}\|\rho^*w\|_\infty^{1-p\varepsilon}\|u_0\|_2^2 \end{aligned} \end{align*} $$

for $0<p<1/\varepsilon $ . This might be interpreted as a certain $\varepsilon $ -loss form of (10.5). We thank one of the reviewers for suggesting such an observation.

Remark 10.6 (Connections with maximally modulated singular integrals).

Our proof of Theorem 10.1 hints at a connection between the Flandrin-type conjecture (10.1) and another natural question in modern harmonic analysis. Specifically, for subsets K of $\mathbb {R}\times \mathbb {R}$ whose vertical sections are intervals (and hence for convex K), a routine calculation reveals that

(10.7)

$$ \begin{align} \iint_K W(u_0,u_0)\lesssim\|H_*(u_0,u_0)\|_{L^1(\mathbb{R})}, \end{align} $$

where

$$ \begin{align*}H_*(f_1,f_2)(x):=\sup_{\lambda\in\mathbb{R}}\left|\int_{\mathbb{R}}f_1\left(x+\frac{y}{2}\right)f_2\left(x-\frac{y}{2}\right)e^{i\lambda y}\frac{\mathrm{d}y}{y}\right| \end{align*} $$

is the maximally modulated bilinear Hilbert transform. The Flandrin-type conjecture (10.1) would therefore follow from the bound

(10.8)

$$ \begin{align} \|H_*(f_1,f_2)\|_{L^1(\mathbb{R})}\lesssim\|f_1\|_{L^2(\mathbb{R})}\|f_2\|_{L^2(\mathbb{R})}. \end{align} $$

The operator $H_*$ is a natural (bi-sublinear) analogue of the classical Carleson maximal operator. Tools from time-frequency analysis have proved very effective in the study of various related maximally modulated singular integral operators (such as in [Reference Muscalu, Tao and Thiele42] and [Reference Li and Muscalu41]) following the celebrated work of Lacey and Thiele [Reference Lacey and Thiele35, Reference Lacey and Thiele36] on the boundedness properties of the bilinear Hilbert transform. However, as far as we are aware, no nontrivial bounds for the operator $H_{*}$ are known. We note that the bound (10.8) was established for certain ‘non-resonant perturbations’ of $H_*$ in [Reference Benea, Bernicot, Lie and Vitturi9].

11 Questions

Here we collect a number of questions, some concrete and some more speculative.

Question 11.1 (Strengthening the parabolic Sobolev–Mizohata–Takeuchi inequality).

For nonnegative weights w, can one strengthen (2.27) to

$$ \begin{align*}\int_{\mathbb{R}^d\times\mathbb{R}}|u(x,t)|^2w(x,t)\mathrm{d}x\mathrm{d}t\lesssim\sup_{v\in\mathrm{ supp\,}(\widehat{u}_0)}\|\rho^* w(\cdot,v)\|_{\dot{H}_x^s}\|u_0\|_2^2,\end{align*} $$

as suggested by (2.9)?

Question 11.2 (Tomographic constructions of Wigner distributions in higher dimensions).

In Section 9 we saw that geometric Wigner distributions may be constructed tomographically from $|\widehat {g\mathrm {d}\sigma }|^2$ when $n=2$ using the X-ray transform. Might there be a similar tomographic construction of a Wigner distribution that functions in all dimensions, perhaps involving the Radon transform?

Question 11.3 (Fractional Stein and Mizohata–Takeuchi inequalities).

Are there interesting fractional forms of (2.8) or (2.9) suggested by considering an oblique phase-space marginal of the Wigner distribution in place of (2.4)? See Remark 9.2 on phase-space tomography.

Question 11.4 (A Flandrin-type inequality with an $\varepsilon $ -loss in higher dimensions).

May the statement of Theorem 10.1 be extended to dimensions $d>1$ ?

Acknowledgements

The fourth author thanks the American Institute of Mathematics for supporting the AIM Fourier restriction community, in which he was introduced to the Mizohata-Takeuchi conjecture. He also thanks the Basque Center for Applied Mathematics (BCAM) for the invitation to deliver a series of lectures on the subject of this paper and for their kind hospitality. We thank Cristina Benea, José Ca $\tilde {\text {n}}$ izo, Tony Carbery, Mark Dennis, Michele Ferrante, Veronique Fischer, Kerr Maxwell, Søren Mikkelsen, Detlef Müller, Camil Muscalu, Mateus Sousa, Amy Tierney and Gennady Uraltsev for a number of helpful discussions. In particular, we thank Marco Vitturi for drawing our attention to the role of the classical Wigner transform in [Reference Dendrinos, Mustata and Vitturi24], which served as an important source of inspiration for this work. Finally, we thank the reviewers for their many helpful comments and suggestions.

Competing interests

The author has no competing interests to declare.

Funding statement

The first and fourth authors are supported by EPSRC Grant EP/W032880/1. The third author is supported by JSPS Overseas Research Fellowship and JSPS Kakenhi grant numbers 19K03546, 19H01796 and 21K13806.

References

Agmon, S. and Hörmander, L., ‘Asymptotic properties of solutions in differential equations with simple characteristics’, J. Anal. Math. 30 (1976), 1–38.10.1007/BF02786703CrossRef Google Scholar

Ahlfors, L. V., Lectures on Quasiconformal Mappings, second edn, University Lecture Series, vol. 38 (American Mathematical Society, Providence, RI, 2006).Google Scholar

Alonso, M. A., ‘Wigner functions in optics: describing beams as ray bundles and pulses as particle ensembles’, Adv. Opt. Photon. 3(4) (2011), 272–365.10.1364/AOP.3.000272CrossRef Google Scholar

Barceló, J. A., Bennett, J. and Carbery, A., ‘A note on localised weighted estimates for the extension operator’, J. Aust. Math. Soc. 84 (2008), 289–299.10.1017/S1446788708000694CrossRef Google Scholar

Barceló, J. A., Ruiz, A. and Vega, L., ‘Weighted estimates for the Helmholtz equation and consequences’, J. Funct. Anal. 150 (1997), 356–382.10.1006/jfan.1997.3131CrossRef Google Scholar

Beltran, D., ‘A Fefferman–Stein inequality for the Carleson operator’, Rev. Mat. Iberoam. 34 (2018), 221–244.10.4171/rmi/984CrossRef Google Scholar

Beltran, D. and Bennett, J., ‘Subdyadic square functions and applications to weighted harmonic analysis’, Adv. Math. 307 (2017), 72–99.10.1016/j.aim.2016.11.018CrossRef Google Scholar

Beltran, D. and Vega, L., ‘Bilinear identities involving the k-plane transform and Fourier extension operators’, Proc. Roy. Soc. Edinburgh Sect. A 150 (2020), 3349–3377.10.1017/prm.2019.74CrossRef Google Scholar

Benea, C., Bernicot, F., Lie, V. and Vitturi, M., ‘The non-resonant bilinear Hilbert–Carleson operator’, Adv. Math. 458 (2024), 109939 (136 pp.).10.1016/j.aim.2024.109939CrossRef Google Scholar

Bennett, J., Bez, N., Jeavons, C. and Pattakos, N., ‘On sharp bilinear Strichartz estimates of Ozawa–Tsutsumi type’, J. Math. Soc. Japan 69(2) (2017), 1–18.10.2969/jmsj/06920459CrossRef Google Scholar

Bennett, J., Carbery, A., Soria, F. and Vargas, A., ‘A Stein conjecture for the circle’, Math. Ann. 336 (2006), 671–695.10.1007/s00208-006-0019-5CrossRef Google Scholar

Bennett, J., Carbery, A. and Tao, T., ‘On the multilinear restriction and Kakeya conjectures’, Acta Math. 196 (2006), 261–302.10.1007/s11511-006-0006-4CrossRef Google Scholar

Bennett, J. and Iliopoulou, M., ‘A multilinear extension identity on

${\mathbb{R}}^n$ ’, Math. Res. Lett. 25 (2018), 1089–1108.10.4310/MRL.2018.v25.n4.a2CrossRef Google Scholar

Bennett, J. and Nakamura, S., ‘Tomography bounds for the Fourier extension operator and applications’, Math. Ann. 380 (2021), 119–159.10.1007/s00208-020-02131-0CrossRef Google Scholar

Bennett, J., Nakamura, S. and Shiraki, S., ‘Tomographic Fourier extension identities for submanifolds of

${\mathbb{R}}^n$ ’, Sel. Math. New Ser. 30 (2024), 80.10.1007/s00029-024-00970-2CrossRef Google Scholar

Bennett, J. and Tao, T., ‘Adjoint Brascamp–Lieb inequalities’, Proc. Lond. Math. Soc. 129 (2024), e12633.10.1112/plms.12633CrossRef Google Scholar

Bertrand, J. and Bertrand, P., ‘A tomographic approach to Wigner’s function’, Found. Phys. 17 (1987), 397–405.10.1007/BF00733376CrossRef Google Scholar

Cairo, H. M., ‘A counterexample to the Mizohata–Takeuchi conjecture’, arXiv:2502.06137.Google Scholar

Carbery, A., ‘Large sets with limited tube occupancy’, J. Lond. Math. Soc. 79 (2009), 529–543.10.1112/jlms/jdn086CrossRef Google Scholar

Carbery, A., Hanninen, T. and Valdimarsson, S., ‘Disentanglement, multilinear duality and factorisation for non-positive operators’, Anal. PDE 16(2) (2023), 511–543.10.2140/apde.2023.16.511CrossRef Google Scholar

Carbery, A., Iliopoulou, M. and Wang, H., ‘Some sharp inequalities of Mizohata–Takeuchi-type’, to appear in Rev. Mat. Iberoam.Google Scholar

Carbery, A., Romera, E. and Soria, F., ‘Radial weights and mixed norm estimates for the disc multiplier’, J. Funct. Anal. 109 (1992), 52–75.10.1016/0022-1236(92)90011-7CrossRef Google Scholar

Carbery, A. and Soria, F., ‘Pointwise Fourier inversion and localisation in

${\mathbb{R}}^n$ ’, J. Fourier Anal. Appl. 3 (1997), 847–858.10.1007/BF02656490CrossRef Google Scholar

Dendrinos, S., Mustata, A. and Vitturi, M., ‘A restricted 2-plane transform related to Fourier restriction for surfaces of codimension 2’, Anal. PDE 18(2) (2025), 475–526.10.2140/apde.2025.18.475CrossRef Google Scholar

Delourme, B., Duyckaerts, T. and Lerner, N., ‘On integrals over a convex set of the Wigner distribution’, J. Fourier Anal. Appl. 26 (2020).10.1007/s00041-019-09722-9CrossRef Google Scholar

Faraco, D. and Rogers, K., ‘The Sobolev norm of characteristic functions with applications to the Calderón inverse problem’, Q. J. Math. 64 (2013), 133–147.10.1093/qmath/har039CrossRef Google Scholar

Flandrin, P., ‘Maximum signal energy concentration in a time–frequency domain’, Proc. IEEE Int. Conf. Acoust. 4 (1988), 2176–2179.Google Scholar

Folland, G. B., Harmonic Analysis in Phase Space, Ann. Math. Stud., vol. 122 (Princeton University Press, Princeton, NJ, 1989).10.1515/9781400882427CrossRef Google Scholar

Gneiting, C., Fischer, T. and Hornberger, K., ‘Quantum phase-space representation for curved configuration spaces’, Phys. Rev. A 88 (2013), 062117; Erratum Phys. Rev. A 106 (2022), 069904.10.1103/PhysRevA.88.062117CrossRef Google Scholar

Grafakos, L., ‘On multilinear fractional integrals’, Stud. Math. 102 (1992), 49–56.10.4064/sm-102-1-49-56CrossRef Google Scholar

Grafakos, L. and Kalton, N., ‘Some remarks on multilinear maps and interpolation’, Math. Ann. 319 (2001), 151–180.10.1007/PL00004426CrossRef Google Scholar

Guth, L., ‘An enemy scenario in restriction theory’, Joint talk for AIM Research Community ‘Fourier restriction conjecture and related problems’ and HAPPY network (2022). Available at https://www.youtube.com/watch?v=x-DET83UjFg.Google Scholar

Kenig, C. and Stein, E. M., ‘Multilinear estimates and fractional integration’, Math. Res. Lett. 6 (1999), 1–15.10.4310/MRL.1999.v6.n1.a1CrossRef Google Scholar

Kowalski, K. and Ławniczak, K., ‘Wigner function for quantum mechanics on a sphere’, Ann. Phys. 457 (2023).10.1016/j.aop.2023.169428CrossRef Google Scholar

Lacey, M. and Thiele, C., ‘

${L}^p$ estimates on the bilinear Hilbert transform for

$2<p<\infty$ ’, Ann. Math. 146 (1997), 693–724.10.2307/2952458CrossRef Google Scholar

Lacey, M. and Thiele, C., ‘On Calderón’s conjecture’, Ann. Math. 149 (1999), 475–496.10.2307/120971CrossRef Google Scholar

Lerner, N., ‘Integrating the Wigner distribution on subsets of the phase space’, Mem. Eur. Math. Soc. 12 (2024).10.4171/mems/12CrossRef Google Scholar

Lieb, E., ‘Integral bounds for radar ambiguity functions and Wigner distributions’, J. Math. Phys. 31 (1990), 594–599.10.1063/1.528894CrossRef Google Scholar

Lieb, E. H. and Loss, M., Analysis, Graduate Studies in Mathematics 14 (American Mathematical Society, Providence, RI, 2001).Google Scholar

Mulherkar, S., ‘Random constructions for sharp estimates of Mizohata–Takeuchi type’, arXiv:2506.05624.Google Scholar

Li, X. and Muscalu, C., ‘Generalizations of the Carleson–Hunt theorem I. The classical singularity case’, Am. J. Math. 129 (2007), 983–1018.10.1353/ajm.2007.0026CrossRef Google Scholar

Muscalu, C., Tao, T. and Thiele, C., ‘The Bi-Carleson operator’, Geom. Funct. Anal. 16(1) (2006), 230–277.10.1007/s00039-006-0553-zCrossRef Google Scholar

Silva, D. Oliveira e, ‘Extremizers for Fourier restriction inequalities: convex arcs’, J. Anal. Math. 124 (2014), 337–385.10.1007/s11854-014-0035-4CrossRef Google Scholar

Ozawa, T. and Tsutsumi, Y., ‘Space–time estimates for null gauge forms and nonlinear Schrödinger equations’, Differ. Integral Equ. 11 (1998), 201–222.Google Scholar

Petruccelli, J.C. and Alonso, M.A., ‘The Wigner function in optics’, in The Optics Encyclopedia (Wiley-VCH, Weinheim, 2015).Google Scholar

Planchon, F. and Vega, L., ‘Bilinear virial identities and applications’, Ann. Sci. Éc. Norm. Supér. 42 (2009), 263–292.Google Scholar

Stein, E. M., ‘Some problems in harmonic analysis’, Proc. Sympos. Pure Math. (American Mathematical Society, Providence, RI, 1978), 3–20.Google Scholar

Stein, E. M., Harmonic Analysis: Real-Variable Methods, Orthogonality, and Oscillatory Integrals (Princeton University Press, Princeton, NJ, 1993).Google Scholar

Stein, E. M. and Weiss, G., An Introduction to Fourier Analysis on Euclidean Spaces (Princeton University Press, Princeton, NJ, 1971).Google Scholar

Stovall, B., ‘Waves, spheres, and tubes. A selection of Fourier restriction problems, methods, and applications’, Not. Am. Math. Soc. 66 (2019), 1013–1022.Google Scholar

Tao, T., ‘A sharp bilinear restriction estimate for paraboloids’, Geom. Funct. Anal. 13 (2003), 1359–1384.10.1007/s00039-003-0449-0CrossRef Google Scholar

Vega, L., ‘Bilinear virial identities and oscillatory integrals’, in Harmonic Analysis and Partial Differential Equations, Contemp. Math. 505 (American Mathematical Society, Providence, RI, 2010), 219–232.10.1090/conm/505/09925CrossRef Google Scholar

Wigner, E., ‘On the quantum correction for thermodynamic equilibrium’, Phys. Rev. 40 (1932), 749–759.10.1103/PhysRev.40.749CrossRef Google Scholar

Figure 1 A depiction of the choice of $u"$ via the conditions (4.2) and (4.3).

Figure 2 The construction of $u"$ via parallel supporting hyperplanes in $T_uS+\{u'\}$.

Figure 3 A graphical representation of the proof of Claim 4.19.

Article contents

A phase-space approach to weighted Fourier extension inequalities

Abstract

MSC classification

Information

1 Introduction

1.1 Background: the Stein and Mizohata–Takeuchi problems

Remark 1.1 (The strength of (1.2)).

Remark 1.2 (Failure of the global inequalities (1.2) and (1.4)).

Remark 1.3 (The role of curvature).

1.2 Phase-space formulations

Remark 1.4 (Connections with Flandrin’s conjecture).

Remark 1.5 (Connections to maximally modulated singular integrals).

Remark 1.6 (Relation to shape quasi-conformality).

Theorem 1.7 (Sobolev–Stein inequality).

Theorem 1.8 (Sobolev–Mizohata–Takeuchi inequality).

Remark 1.10 (Improved constants).

Remark 1.11 (Permissibility of signed weights).

Remark 1.12 (The strength of Theorem 1.7).

Remark 1.13 (The strength of Theorem 1.8).

Remark 1.15 (Relation to the wavepacket approach).

2 The paraboloid: a quantum mechanical viewpoint

Remark 2.2 (A quasi-probabilistic interpretation).

Proof of Theorem 2.4.

Proof of Theorem 2.5.

Theorem 2.7 (Parabolic Sobolev–Stein).

Theorem 2.8 (Parabolic Sobolev–Mizohata–Takeuchi).

3 The sphere: an optical viewpoint

Proposition 3.1 (Spherical phase-space representation).

Theorem 3.2 (Spherical Sobolev–Stein).

Theorem 3.4 (Spherical Sobolev–Mizohata–Takeuchi).

Proof of Theorem 3.2.

Proof of Theorem 3.4.

4 General submanifolds: a geometric viewpoint

4.1 Surface-carried Wigner transforms

Remark 4.1 (Existence of $u"$ ).

Remark 4.2 (Differentiability of $u"$ ).

Remark 4.3 (Rationale for the choice of third point $u"$ ).

Proposition 4.4 (Distance estimates).

Proposition 4.5 (Jacobian identities).

Remark 4.6 (Interpreting J).

Remark 4.7 (Examples).

Proposition 4.8 (General phase-space representation).

Proof of Proposition 4.8.

Remark 4.9 (A polarised form).

Theorem 4.11 (L 2 Sobolev–Stein inequality).

Theorem 4.13 (L 2 Sobolev–Mizohata–Takeuchi inequality).

4.2 Proof of the Sobolev–Stein inequality (Theorem 1.7)

Step 1: Bounding $\widetilde {J}(u,u')$

Step 2: Bounding $J/\widetilde {J}$

4.3 Proof of the Sobolev–Mizohata–Takeuchi inequality (Theorem 1.8)

4.4 Improved Sobolev–Stein constants in the plane

Theorem 4.20 (Improved Sobolev–Stein in the plane).

5 Estimating distances: the proof of Proposition 4.4

6 Computing Jacobians: the proof of Proposition 4.5

6.1 Computing J

Proof of Claim 6.1.

Lemma 6.2 (Differential structure of an outward vector field).

Proof of Claim 6.3.

6.2 Computing $\Delta $

6.3 Relating J and $\Delta $

7 Surface-carried fractional integrals

Remark 7.1 (Relation to classical fractional integral operators).

Proof of Theorem 7.2.

8 Surface-carried maximal operators

Remark 8.1 (Relation to classical maximal operators).

Remark 8.3 ( $L^p$ estimates for $M_S$ ).

Proof of Proposition 8.4.

Proof of Lemma 8.5.

9 Tomographic constructions

Remark 9.2 (Phase-space tomographic methods in optics).

Proof of Proposition 9.1.

Remark 9.4 (Stein’s inequality as a lower bound on the X-ray transform).

10 Applications to a variant of Flandrin’s conjecture

Remark 10.2 (Higher dimensions).

Remark 10.3 (Inequalities of Flandrin type for surface-carried Wigner distributions).

Remark 10.6 (Connections with maximally modulated singular integrals).

11 Questions

Question 11.1 (Strengthening the parabolic Sobolev–Mizohata–Takeuchi inequality).

Question 11.2 (Tomographic constructions of Wigner distributions in higher dimensions).

Theorem 4.11 (L ² Sobolev–Stein inequality).

Theorem 4.13 (L ² Sobolev–Mizohata–Takeuchi inequality).