1. Introduction
The sum–product phenomenon was introduced by Erdős and Szemerédi in [Reference Erdős and SzemerédiES83]:
Let
$1 \lt a_1 \lt \cdots \lt a_n$
be a sequence of integers. Consider the integers of the form

It is tempting to conjecture that for every
$\epsilon \gt 0$
there is an
$n_0$
, so that for every
$n\geq n_0$
, there are more than
$n^{2-\varepsilon }$
distinct integers of the form (1).
In contemporary notation, we are interested in the sizes of the set of sums and products, defined for a subset
$A$
of integers, or more generally a commutative ring, as

As a general heuristic, the conjecture suggests that either
$A+A$
or
$A\cdot A$
is significantly larger then the original set, unless
$A$
is close to a subring. In the case of the integers, the latter cannot occur as there are no non-trivial finite subrings. The interested reader may consult [Reference Tao and VuTV06] for a rather thorough treatment of sumsets and related questions, including some prior work on the sum–product problem.
Erdős and Szemerédi continue with the following statement: `Perhaps our conjectures remain true if the
${a}$
’s are real or complex numbers.’
Erdős and Szemerédi ultimately prove

where the exponent
$c$
can be seen to be equal to
$\frac {1}{31}$
, see [Reference NathansonNat97], and conjecturally, any
$c\lt 1$
is admissible, at the cost of the implicit constant.
The sum–product phenomenon has been extensively studied in the last few decades, the current records as of writing being [Reference Rudnev and StevensRS22] for real numbers, and [Reference Mohammadi and StevensMS23] (also, see [Reference Rudnev and ShkredovRS22]) for sufficiently small sets in finite fields.
While the sum–product problem was originally posed for finite sets of integers, a number of techniques involving combinatorial and convex geometry have been the predominant tools in the area for a number of years, and these techniques work just as well for finite sets of reals. Indeed, see [Reference ElekesEle97], [Reference SolymosiSol09] and [Reference Hanson, Roche-Newton and RudnevHRR22] for techniques that helped establish our current understanding of the problem over the reals. However, one aspect of the problem that is understood in only the arithmetic setting (over
$\mathbb {Z}$
or perhaps
$\mathbb {Q}$
) is the nature of sets with few distinct products. Indeed, in this setting, unique factorization and
$p$
-adic analysis have allowed for progress which has not been matched by real-variable methods. Results leveraging the techniques we have in mind begin with [Reference ChangCha03] and the subsequent [Reference Bourgain and ChangBC04], and have been elaborated upon in [Reference Hanson, Roche-Newton and ZhelezovHRZ19], [Reference Hanson, Roche-Newton and ZhelezovHRZ20] and [Reference Pálvőlgyi and ZhelezovPZ21]. In particular, one has a much better understanding of sets
$A$
for which
$A\cdot A$
is very small when
$A$
consists of integers. Perhaps motivated by these sorts of results, and the fact that
$\{1,\ldots ,N\}$
is a near extremal example for the sum–product problem, E. Szemerédi asked the fourth-listed author whether sum–product estimates for
$A$
are improved when
$A$
consists of integers satisfyingFootnote
1
$\omega (a)\leq k$
, and even with the very limiting constraint, say,
$k=10$
. This is a natural question to ask, as if something like the initial segment
$A=\{1,\ldots ,N\}$
were in fact the worst case, then one would have
$\omega (a)\leq \log \log |A|(1+o(1))$
, on average, see, for instance, [Reference Montgomery and VaughanMV07].
Even when
$k=1$
, this problem already hints at some subtle behaviour. Indeed when
$A$
consists of the primes up to
$N$
, of which there are approximately
$N/\log N$
by the Prime Number Theorem. One of course has
$|A\cdot A|\gg |A|^2$
, but
$|A+A|\ll |A|\log |A|$
; more dramatic still is when
$A$
is an arithmetic progression composed of primes, whose existence is the content of the Green–Tao Theorem proved in [Reference Green and TaoGT08], where one has
$|A+A|= 2|A|-1$
. So the constraint
$k=1$
still allows for
$A$
to be as structured as is possible so far as addition is concerned. On the other hand, one could also take
$A=\{p,p^2,\ldots ,p^N\}$
for any prime
$p$
, recall that
$\omega$
ignores multiplicity, and so
$|A+A|\gg |A|^2$
but
$|A\cdot A|= 2|A|-1$
, and now the multiplicative structure of
$A$
is maximal.
It is really in the case
$k=2$
, however, that the problem ‘shows its teeth’. The following example, which we know from [Reference Balog and WooleyBW17] is what we shall refer to as a Balog–Wooley set. It merely consists of the product of a geometric progression and an arithmetic progression:

Balog and Wooley chose
$B=\{1,\ldots ,2n^2\}$
and
$\Gamma =\{1,2,4,\ldots ,2^{n-1}\}$
. (Note that to avoid collisions one can replace the prime
$2$
generating
$\Gamma$
by some
$p\gt 2n^2$
.) More generally, one can consider approximate Balog–Wooley sets where
$\Gamma$
and
$B$
are replaced by approximations to geometric and arithmetic progressions, and one can certainly choose
$r$
, as well as all members of
$B$
to be primes, as described in the
$k=1$
case. These examples gain their structure from imposing the constraints

with
$K_\Gamma$
and
$K_B$
basically as small as desired. This will force the product set to be small, since

At the same time, these sets also satisfy

where
$E_+(X,Y)$
denotes the additive energy

owing merely to the diagonal solutions

In particular,
$A$
can satisfy
$|A\cdot A|\ll |A|^{\frac {5}{3}-o(1)}$
and
$E_+(A,A)\gg |A|^{\frac {7}{3}}$
by taking
$|\Gamma |^2=|B|$
, as Balog and Wooley did in their work. They further conjectured that the exponent
$\frac {7}{3}-o(1)$
was the best possible should one wish to decompose
$A$
into pieces, one with small additive energy and the other with small multiplicative energy. If one used only additive energy to predict
$|A+A|$
, say by the standard Cauchy–Schwarz estimate,

then one could only deduce
$|A+A|\gg |A|^{\frac {5}{3}}$
. From this example, we observe that even if each member of
$A$
has but two prime factors, one can do no better than the exponent
$\frac {5}{3}$
in the statement

losing another
$o(1)$
to the exponent
$\frac {5}{3}$
if the number of prime factors increases to roughly
$\log \log |A|$
.
In fact, it is this statement that we prove, up to terms growing slower than any power of
$|A|$
and under the few prime factors constraint. Of course, the additive energy of
$A$
may not be an accurate predictor for
$|A+A|$
, and indeed, Balog–Wooley sets do not violate the Erdős–Szemerédi conjecture, which is still a conjecture, after all.
Our main theorem is the following.
Theorem 1.1.
Let
$\varepsilon$
be a real number with
$0\lt \varepsilon \lt 1/6$
and let
$k\in {\mathbb {N}}$
be positive integer. Suppose
$A$
is a sufficiently large finite set of integers satisfying
$\omega (a)\leq k$
for each
$a\in A$
and such that
$(\log |A|)^{1-6\varepsilon }\geq k$
. Then, there is a subset
$\tilde A\subseteq A$
with

and

We remark that the proof of Theorem 1.1 also admits the statement that for each positive
$\epsilon \lt 1/6$
and sufficiently large
$A$
, either
$|A\cdot A|\geq |A|^{5/3}$
or
$E(\tilde A)\leq |\tilde A|^{7/3}\exp (\log |\tilde A|)^{1-\epsilon }).$
In particular, we get the following sum–product estimate.
Corollary 1.2.
Let
$A$
be a finite set of integers and let
$\varepsilon$
and
$k$
be parameters such that the conditions of Theorem
1.1
are met. Then,

As remarked above, the exponent
$\frac {5}{3}$
is no coincidence, as our approach begins by showing that Balog–Wooley sets are essentially the worst possible scenario. It is because of this approach, namely an attack on the additive energy of
$A$
, that we fail to prove full quadratic growth.
We also note that most of the sum–product estimates that we are aware of juxtapose energy with cardinality of ‘the opposite set’. This is explicit in the title of the paper by Solymosi [Reference SolymosiSol09], which proves the inequality

with the multiplicative energy
$E_\times (A,A)$
being defined analogously to additive energy, with multiplication in place of addition. This, in particular, implies that if
$|A+A|\leq K_+ |A|$
, with a sufficiently small additive doubling constant
$K_+$
, then
$E^\times (A,A)$
barely exceeds its trivial lower bound
$|A|^2$
and hence
$|A\cdot A|$
is almost
$|A|^2$
, proving a case of the Erdős–Szemerédi conjecture for reals (and even complex) numbers at the endpoint
$K_+\approx 1$
, see also [Reference Konyagin and RudnevKR13].
On the other end, with
$K_\times =|A\cdot A|/|A|$
being the multiplicative doubling constant, Pálvőlgyi and the fourth author [Reference Pálvőlgyi and ZhelezovPZ21] prove that for finite sets
$A\subset {\mathbb {Z}}$
and
$\varepsilon \in (0,1/2)$
, there is a subset
$\tilde A\subseteq A$
such that
$|\tilde A|\geq |A|^{1-\varepsilon }$
and

This theorem sheds new light on, and improved the results of, Bourgain and Chang [Reference Bourgain and ChangBC04]. It also settles the Erdős–Szemerédi conjecture in the endpoint case
$K_\times \approx 1$
.Footnote
2
Here one gets an additive energy estimate via the product set, which works as a good predictor for
$|A+A|$
. However, away from the endpoints, and in view of the Balog–Wooley example, an immediate application of the Cauchy–Schwarz inequality to pass from an upper bound on additive energy to a lower bound for sumset is too costly.
As far as the results in this article are concerned, their fountainhead is the following elementary lemma, which illustrates the strength of the few-prime-factors hypothesis.
Lemma 1.3.
Let
$A$
and
$B$
be finite sets of integers such that
$\omega (a)\leq k$
for
$a\in A$
and
$\omega (b)\leq l$
for
$b\in B$
. Then any element
$q\in A\cdot B$
admits at most
$2^{k+l}$
solutions to
$q=ab$
with
$a\in A$
,
$b\in B$
and
$\mathrm {g.c.d.}(a,b)=1$
.
Proof.
Indeed,
$\omega (q)\leq k+l$
and factorization
$q=ab$
with
$\mathrm {g.c.d.}(a,b)=1$
amounts to choosing some subset of the primes dividing
$q$
to go into, say
$a$
. There are at most
$2^{\omega (q)}$
such subsets.
1.1 Structure of the proof
The proof of Theorem 1.1 involves three major steps. First, we establish an approximate structure theorem, Theorem 4.2 of § 2, for sets of integers with few prime factors. It is this theorem that tells us that (generalized) Balog–Wooley sets are essentially the worst-possible. Lemma 1.3 will combine with the structure theorem to provide uniform control on the fibres of the resulting decomposition. In its essence, the structure theorem is basic combinatorics, relying on Lemma 1.3.
The second major ingredient is a Littlewood–Paley-type theorem which allows us to estimate the additive energy of Balog–Wooley-type sets quite efficiently. Our Theorem 5.1 strengthens considerably the elementary but important Lemma 6 (see also Lemma 3.3 below) from Chang [Reference ChangCha03]. Chang’s lemma was also the basis for developments in the papers [Reference Bourgain and ChangBC04, Reference Hanson, Roche-Newton and ZhelezovHRZ19, Reference Hanson, Roche-Newton and ZhelezovHRZ20, Reference Pálvőlgyi and ZhelezovPZ21] following [Reference ChangCha03]. The statement in question was strong enough to meet the objectives of the above-mentioned papers, which dealt with the endpoint case of the small product set. A stronger and by far less trivial result is needed for us to move away from the endpoint.
Theorem 5.1 falls into the realm of Burkholder’s Littlewood–Paley theory for martingales [Reference BurkholderBur66]. The approach is a natural one, as randomization (i.e. martingale transforms) provides a path to iterated Littlewood–Paley decomposition, and Chang’s Lemma is a martingale-difference method if one chooses to view it as such. During the course of this project, we discovered, by way of a wonderful book [Reference PisierPis16], in which Gundy and Varopoulos had previously observed the sort of result needed [Reference Gundy and VaropoulosGV76]. Hence, instead of presenting the whole proof we confine ourselves with a sketch, fetching some facts from the literature as black boxes. We suspect our application of the martingale techniques, which appears as Corollary 5.2 in § 3, could have further applications to the area.
The final component of the proof is a bound for additive energy averaged over dilates from a low-rank group. It is easy to construct an elementary proof of the desired estimate when the rank is 1, see Theorem 6.1 of § 4. But we need to handle a higher rank, and this forces us to appeal to much heavier machinery. This is done in Lemma 2.1 of [Reference Roche-Newton and ZhelezovRZ15], which relies on a rather strong version of the Subspace Theorem from transcendence theory. Some combinatorial applications of the Subspace Theorem were discussed in [Reference Schwartz and SolymosiSS14]. Interestingly, here it fits almost perfectly, in particular its quantitative bound, which for some desired applications may be too ample, is almost in line with the applicability limits of the of the first two steps of our proof, only worsening by
$\epsilon$
the power of
$\log |A|$
, our bound for the rank, alias the number of prime factors that we can handle.
Hence, our approach combines basic combinatorics and elementary number theory with modern tools from harmonic analysis and transcendence theory, which happen to perfectly fit into our considerations. One naturally wonders as to what extent this may be a coincidence.
One immediate question is how to patch the gap between the best possible Theorem 1.1, which bounds additive energy via the product set and the holy grail sum–product inequality, to replace the exponent
$\frac {5}{3}$
in Corollary 1.2 with
$2$
. Here, contrary to the above-mentioned few products, many sums endpoint case, the Cauchy–Schwarz inequality shows no mercy. We would carefully hope for an opportunity to develop an asymmetric version of our argument, but it is by far not immediate and seems to be much less agreeable to the application of the martingale methods, to say the least.
One may question whether the few prime factor case we consider here is not too restrictive. (We emphasize that, of course, this does not mean that all elements of
$A$
are powers of two fixed primes only.) However, already in the two prime factor case the sum–product problem is rather involved, in particular calling for the martingale machinery, and the Balog–Wooley example can be given within the two prime factor model. We expect that more than a superficial look at the arguments in this paper will convince the reader that the few prime factor case captures a major aspect of the essence of the integer sum–product phenomenon.
Before proceeding with the proof proper, we will give a simpler argument in the § 3 which is still good enough to achieve the exponent
$\frac {3}{2}$
.
2. Notation
2.1 Asymptotic notation
For non-negative quantities
$X$
and
$Y$
,
$X\ll Y$
or
$X=O(Y)$
both mean that for some absolute constant
$C\gt 0$
, we have
$X\leq CY$
, while
$X=o(Y)$
means that
$X/Y\to 0$
. Dependence of the implicit constant
$C$
on parameters are indicated by subscripts, so for instance
$X\ll _{\alpha ,\beta } Y$
or
$X=O_{\alpha ,\beta }(Y)$
means that
$X\leq CY$
for some
$C(\alpha ,\beta )\gt 0$
. The symbol
$X\approx Y$
is meant to reflect that
$X\ll Y\ll X$
.
2.2 Number theoretic notation
We reserve the letter
$p$
for primes, and the fundamental theorem of arithmetic is then that, for integral
$n$
,

the product being over distinct primes, where
$v_p(n)$
denoted the
$p$
-adic valuation of
$n$
(i.e. the exponent of
$p$
in the unique factorization of
$n$
). This product is implicitly finite as
$v_p(n)=0$
for all but finitely many values of
$p$
. For those
$p$
where
$v_p(n)\gt 0$
, we write
$p|n$
and we let
$\omega (n)$
denote the number of such
$p$
. We write
$\mathrm {g.c.d.}(m,n)=\prod _{p}p^{\min (v_p(m),v_p(n))}$
for the greatest common divisor of
$m$
and
$n$
. If
$P=\{p_1,\ldots ,p_r\}$
is a set of primes, we can fix the ordering as an
$r$
-tuple
${{\boldsymbol {p}}}=(p_1,\ldots ,p_r)$
and then for an
$r$
-tuple
${{\boldsymbol {v}}}=(v_1,\ldots ,v_r)$
of integers, we write
${{\boldsymbol {p}}}^{{\boldsymbol {v}}}=p_1^{v_1}\cdots p_r^{v_r}$
.
We shall write
$\left \langle P\right \rangle$
, as well as
$\langle p_1,\ldots ,p_r\rangle$
for the multiplicative subgroup of
${\mathbb {Q}}^\times$
generated by the primes in
$P$
, and
$\left \langle P\right \rangle _+$
or
$\langle p_1,\ldots ,p_r\rangle _+$
for the set of integers divisible by every
$p\in P$
, namely

We will also use, for any non-zero integer
$n$
, the notation
$\boldsymbol v_{\boldsymbol p}(n)= p_1^{v_{p_1}(n)}\cdot \ldots \cdot p_r^{v_{p_r}(n)}$
for the product of the maximum powers of
$p_1,\ldots ,p_r$
dividing
$n$
.
2.3 Additive combinatorial notation
For subsets
$A$
and
$B$
of complex numbers, we write

for the sumset,

for the product set, and

for the translate of
$A$
by
$t\in {\mathbb {C}}$
and dilate of
$A$
by
$d\in {\mathbb {C}}$
, respectively. The quantities

and

denote, respectively, the additive and multiplicative energies of
$A$
and
$B$
.
2.4 Graph theoretic notation
If
$X$
and
$Y$
are sets, we will refer to
$G\subseteq X\times Y$
as a bipartite graph. It is of course a directed graph, but this point will not be emphasized. We further define, for subsets
$X^{\prime}\subseteq X,\;Y^{\prime}\subseteq Y$
,

called the neighbours of
$x$
and
$y$
in
$Y^{\prime}$
and
$X^{\prime}$
, respectively. The cardinalities
$|N_Y(x)|$
and
$|N_X(y)|$
are the degrees of
$x$
and
$y$
.
2.5 Analytic notation
We identify
$\mathbb {T}={\mathbb {R}}/{\mathbb {Z}}=[0,1)$
as the torus, and endow the sufficiently nice functions on
$\mathbb {T}$
with the
$L^q$
-norm

for
$q\geq 1$
and the inner product

A function
$f:\mathbb {T}\to {\mathbb {C}}$
of the form

is called a Fourier series. Here, the functions
$e(nt)=e^{2\pi int}$
are the standard characters on
$\mathbb {Z}$
and the coefficient
$\widehat {f}(n)$
is the
$n$
th Fourier coefficient of
$f$
. If
$\widehat {f}(n)$
is non-zero for finitely many
$n$
,
$f$
is called a trigonometric polynomial.
3. A simple argument yielding exponent
$\frac{3}{2}$
We begin this section with a rather simple structure theorem that will give a feel for the general problem. It will not be this version of the structure theorem that is ultimately used, but it is pretty simple to prove and illustrates the general strategy.
Lemma 3.1.
Suppose
$G\subseteq X\times Y$
. If
$|N_Y(x)|\leq k$
for each
$x\in X$
, then there is a subset
$Y^{\prime}\subseteq Y$
of size at most
$2k^2$
and such that for at least
$|X|^2/2$
pairs
$(x,x^{\prime})\in X$
, we have
$N_Y(x)\cap N_Y(x^{\prime})\subseteq Y^{\prime}$
.
Proof.
Indeed, let
$Y^{\prime}=\{y\in Y:|N_X(y)|\geq |X|/2k\}$
. Then

Let
$Y^{\prime\prime}=Y\setminus Y^{\prime}$
and observe that

and so at most half of the pairs
$(x,x^{\prime})\in X\times X$
have a common neighbour outside of
$Y^{\prime}$
.
Corollary 3.2.
Let
$A$
be a finite set of integers such that
$\omega (a)\leq k$
for
$a\in A$
. Then there is a set
$P=\{p_1,\ldots ,p_r\}$
of at most
$r\leq 2k^2$
primes such that

where
$B_{{\boldsymbol {v}}}$
is a set of integers prime to
$p_1p_2\cdots p_r$
and such that

Proof.
Let
$X=A$
and
$Y=\{p:p|a\text { for some }a\in A\}$
, and let
$G=\{(a,p)\in X\times Y:p|a\}$
. Applying Lemma 3.1, we find
$P$
with
$|P|=r\leq 2k^2$
such that for at least half of all pairs
$(a,a^{\prime})\in A\times A$
, all primes dividing both
$a$
and
$a^{\prime}$
belong to
$P$
. Each
$a\in A$
factors as

with
$b_a$
coprime to each
$p\in P$
, and we thus partition
$A$
according to the valuations
$v_{p}(a)$
with
$p\in P$
and we get

for some finite sets of integers
$B_{{\boldsymbol {v}}}$
which are coprime to each
$p\in P$
. Furthermore, if
$(a,a^{\prime})$
is a pair for which all common prime factors do belong to
$P$
, then
$b_a$
and
$b_{a^{\prime}}$
are coprime.
In this way, we have taken an arbitrary set
$A$
of integers and extracted from it a pseudo-product structure; one factor of which is from a low-rank multiplicative group, the other of which is multiplicatively independent in the sense that the fibres are relatively prime. Let
${{\boldsymbol {v}}}_1$
be fixed, and observe that the sets

are disjoint as
${{\boldsymbol {v}}}_2$
varies, since they are graded by the exponents
${{\boldsymbol {v}}}_1+{{\boldsymbol {v}}}_2$
. Now suppose that there are
$M({{\boldsymbol {v}}}_1,{{\boldsymbol {v}}}_2)$
pairs
$(b_1,b_2)\in B_{{{\boldsymbol {v}}}_1}\times B_{{{\boldsymbol {v}}}_2}$
which are relatively prime. Then,

from the conclusion of Corollary 3.2. But

so that for some choice of
${{\boldsymbol {v}}}_1$
, we have

Let
$V^{\prime}=\{{{\boldsymbol {v}}}_2:|B_{{{\boldsymbol {v}}}_2}|\leq |B_{{{\boldsymbol {v}}}_1}|\}$
and

so that

Further observe that on coprime pairs, the map
$(b_1,b_2)\mapsto b_1b_2$
is at most
$4^{k}$
-to-one by Lemma 1.3, so that

This is a good estimate if
$B_{{{\boldsymbol {v}}}_1}$
is sufficiently large. If not, we must resort to growth from addition. We begin with the aforementioned lemma of Chang, which in the special case we need, requires no Fourier analysis.
Lemma 3.3 (Chang). Let
$A$
be a finite set of integers admitting a decomposition of the form

where each
$B_{{\boldsymbol {v}}}$
is a finite set of integers coprime with
$p_1,\ldots ,p_r$
. Then

In particular,

Proof.
For convenience, replace
$A$
with
$A\cup -A$
so as to assume
$A=-A$
. Let
$p=p_1$
. Consider the equation

with variables in
$A$
. Suppose,
$a_1$
has the minimum
$p$
-adic valuation
$v_p(a_1)$
among all
$v_p(a_i)$
. Then, by reducing
$a_1=a_2+a_3-a_4$
modulo
$p^{v_p(a_1)+1}$
, we see a second term in the equation must have the same
$p$
-adic valuation, and at the cost of a constant factor (from rearrangement), this term is
$a_2$
.
It follows that

by Cauchy–Schwarz (applied to the additive energy) and cancelling
$p^v$
. Rearranging, and applying the resulting estimate for each prime from
$\{p_1,\ldots ,p_r\}$
, we find

The final claim comes from applying the trivial estimate

to each summand, whence

From Chang’s Lemma applied to the set
$A^{\prime}$
from above, we find that

Through this combined with (2) we obtain the sum–product estimate with exponent
$\frac{3}{2}$
. Moreover, by tracking the dependence in the proof of Chang’s Lemma, the implicit constant is singly exponential in
$r$
, which is at most
$2k^2$
.
4. A refined structure theorem
Our refined structure theorem will be similar in spirit to Corollary 3.2 but we shall seek, at the price of passing to a subset
$\tilde A$
of
$A$
, at most
$k$
common primes
$p_1,\ldots ,p_r$
so that at least half of the pairs
$(a,a^{\prime})\in \tilde A\times \tilde A$
have a g.c.d. in
$\langle p_1,\ldots ,p_r\rangle _+$
. As concerns the multiplicative structure, we will be interested in a dual decomposition of our set as is being made explicit in the following two statements.
Lemma 4.1 (Iteration Lemma). Suppose
$A$
is a finite set of integers with the property
$\omega (a)\leq k$
for all
$a\in A$
. Let
$p_1,\ldots ,p_j$
be distinct primes, and suppose
$A$
decomposes as

for some set
$B$
of integers coprime to
$p_1\cdots p_j$
, and sets
$\Gamma _b\subseteq \left \langle p_1,\ldots , p_j\right \rangle _+$
. Then one of the following holds:
-
(1) for at least half of the pairs
$(a,a^{\prime})\in A\times A$ ,
$\mathrm {g.c.d.}(a,a^{\prime})\in \left \langle p_1,\ldots ,p_j\right \rangle _+$ ; or
-
(2) there is a prime
$p_{j+1}$ , distinct from
$p_1,\ldots ,p_j$ , and a subset
$ A^{\prime}\subseteq A$ of size
\begin{equation*}| A^{\prime}|\geq \tfrac {|A|}{2(k-j)};\end{equation*}
\begin{equation*}A^{\prime}=\bigcup _{b\in B^{\prime}}b \cdot \Gamma ^{\prime}_b\end{equation*}
$B^{\prime}$ of integers coprime to
$p_1\cdots p_{j+1}$ , and sets
$ \Gamma ^{\prime}_b\subseteq \left \langle p_1,\ldots ,p_{j+1}\right \rangle _+$ .
Proof.
Let
$P_A$
denote the set of primes which divide some element of
$A$
, and let

Consider the bipartite graph

and observe that
$\mathrm {g.c.d.}(a,a^{\prime})\not \in \left \langle p_1,\ldots ,p_j\right \rangle _+$
if and only if
$a$
and
$a^{\prime}$
have a common neighbour in
$P$
. Now, denoting
$N_P(a)$
the neighbours of
$a$
in
$P$
, we have
$|N_P(a)|\leq k-j$
, since each
$a$
has at most
$k$
prime factors, and
$j$
of those are
$p_1,\ldots ,p_j$
. Thus, writing
$N_A(p)$
for the neighbours of
$p$
in
$A$
, we find by double counting that

If (1) fails, then the rightmost sum above is at least
$|A|^2/2$
and so there is some
$p_{j+1}\in P$
with
$|N_A(p_{j+1})|\geq |A|/2(k-j)$
. Let
$A^{\prime}=N_A(p_{j+1})$
so that
$A^{\prime}$
further decomposes (by factoring out the appropriate powers of
$p_{j+1}$
) as

where, denoting
$\boldsymbol p = (p_1,\ldots ,p_{j+1})$
, we have

Theorem 4.2.
Suppose
$A$
is a finite set of integers such that
$\omega (a)\leq k$
for all
$a\in A$
and
$|A\cdot A|\leq K|A|$
, with
$K\lt 2^{-2k-1}|A|$
. Then there is a set
$P=\{p_1,\ldots ,p_r\}$
of
$r\leq k$
primes, and a set
$\tilde A\subseteq A$
of size at least
$\frac {|A|}{ 2^{k+3} k!}$
and with the structural decomposition

where each set
$\Gamma _b$
is a subset of
$\left \langle p_1,\dots ,p_r\right \rangle _+$
. Each set
$B_{{\boldsymbol {v}}}$
is a finite set of integers prime to
$p_1\cdots p_r$
and
$B=\bigcup _{{{\boldsymbol {v}}}\in V}B_{{\boldsymbol {v}}}$
satisfies the bound

Proof.
Set
$A_0=B_0=A$
. By the upper bound on
$K$
and Lemma 1.3, at least
$|A_0|^2/2$
of pairs in
$A_0\times A_0$
are not coprime. Then there is some
$a\in A_0$
, not coprime with at least
$|A_0|/2$
elements of
$A_0$
, hence a prime
$p_1$
that divides at least
$\frac {|A_0|}{2k}$
elements of
$A_0$
; the subset of these elements of
$A_0$
is denoted as
$A_1$
. The corresponding ‘base set’
$B_1 = \{ {a}/{p_1^{v_{p_1}(a)}}:\;a\in A_1\}$
that is
$B_1$
arises by dividing each
$a\in A_1$
by the maximum power of
$p_1$
that divides
$a$
.
Now, beginning with
$j=1$
and each
$\Gamma _b\subset \langle p_1\rangle _+$
, repeatedly apply the Iteration Lemma to
$A_j$
until its outcome becomes (1); otherwise denote the resulting from outcome (2) set
$ A^{\prime}$
as
$A_{j+1}$
. Since each
$a\in A$
has at most
$k$
prime factors, the iteration will terminate. We obtain a sequence of distinct primes
$p_1,\ldots ,p_r$
, a sequence of sets
$A_1\supseteq \cdots \supseteq A_r$
such that
$p_1\cdots p_j$
divides each element of
$A_{j}$
, and sets
$B_1,\ldots ,B_r$
of integers such that
$B_j$
is coprime to
$p_1\cdots p_j$
. Namely
$B_j = \{{a}/{ \boldsymbol p^{\boldsymbol v_{\boldsymbol p}(a)}}:\,a\in A_j\}$
where
$\boldsymbol p=(p_1,\ldots ,p_j)$
.
Since
$p_1\cdots p_j$
divides every element of
$A_j$
, and
$B_j$
is coprime to
$p_1\cdots p_j$
, we have
$\omega (b)\leq k-j$
for
$b\in B_j$
, and
$r\leq k$
.
We set
$B^{\prime}=B_r$
and
$A^{\prime}=A_r$
, so that we have

for some sets
$\Gamma _b\subseteq \left \langle p_1,\ldots , p_r\right \rangle _+$
.
By the Iteration Lemma,

We now run a popularity argument, thinning the set
$A^{\prime}$
by a factor of at most
$8$
to get the lower bound one the corresponding base set
$|B|$
. Set

and consider the subset
$A^{\prime\prime}$
of
$A^{\prime}$
, which is the union of ‘poor’ fibres, namely

Let
$B^{\prime\prime} = \{b\in B^{\prime}:\, |\Gamma _b|\lt L\}$
be the corresponding subset of
$B^{\prime}$
. Suppose, for contradiction, that
$|A^{\prime\prime}|\geq \frac {7}{8}|A^{\prime}|.$
Then, since at least
$|A^{\prime}|^2/2$
pairs
$(a,a^{\prime}) \in A^{\prime}\times A^{\prime}$
have g.c.d. in
$\langle p_1,\ldots ,p_r\rangle _+$
, by the assumption on the cardinality of
$A^{\prime\prime}$
, at least
$|A^{\prime}|^2/4$
pairs
$(a,a^{\prime}) \in A^{\prime\prime}\times A^{\prime\prime}$
have g.c.d. in
$\langle p_1,\ldots ,p_r\rangle _+$
. Let
$G\subseteq A^{\prime\prime}\times A^{\prime\prime}$
be the set of these pairs.
By Lemma 1.3 and the definition of
$B^{\prime\prime}$
, a product
$q=aa^{\prime}$
must have fewer than
$4^k L$
representations with
$(a,a^{\prime})\in G$
. It follows that

which contradicts the definition of
$L$
.
Hence, there are at least
$|A^{\prime}|/8$
elements in the set
$\tilde A:= A^{\prime}\setminus A^{\prime\prime}$
, which is the union of the fibres
$\Gamma _b$
, with
$b\in B^{\prime}\setminus B^{\prime\prime}$
that have size at least
$L$
. Note that
$B := B^{\prime}\setminus B^{\prime\prime} = \{{a}/{ \boldsymbol p^{\boldsymbol v_{\boldsymbol p}(a)}}:\,a\in \tilde A\}$
where
$\boldsymbol p=(p_1,\ldots ,p_r)$
.
Observing that
$|B|\leq |A^{\prime}|/L$
, together with the lower bound for
$|A^{\prime}|$
and the definition of
$L$
completes the proof.
5. Fourier analysis
Chang’s estimate, Lemma 3.3, could be described as an estimate for the
$\Lambda (q)$
constant for subsets of a multiplicative group generated by a finite set of primes (often called
$S$
-units). However, the estimate is a bit crude in some cases, the result of an application of Hölder’s inequality which is at times inefficient. The next ingredient in our proof is a square-function estimate for a Littlewood–Paley decomposition along a sequence of multiples. The ultimate goal of this section is to prove the following theorem.
Theorem 5.1.
Let
${{\boldsymbol {p}}}=(p_1,\ldots ,p_r)$
be an
$r$
-tuple of distinct primes and let
$f(t)=\sum _{a\in A}\widehat {f}(a)e(at)$
be a trigonometric polynomial whose Fourier coefficients are supported in a set
$A$
of the form

where each set
$B_{{\boldsymbol {v}}}$
is a set of integers coprime to
$p_1\cdots p_r$
. Define

Then for any
$q$
with
$1\lt q\lt \infty$
, there is a constant
$C_q\gt 0$
such that

At least when
$r=1$
this result is a fairly straightforward consequence of Burkholder’s martingale Littlewood–Paley theorem.
Indeed, suppose
$p$
is a prime, change the above notation from vector
${\boldsymbol {v}}$
to scalar
$v$
. Namely, for some finite set of non-negative integers
$V=\{v_1,v_2,\ldots ,v_n\}$
, written in the increasing order, and non-empty sets
$B_v$
of coprime with
$p$
integers, one has

Denote

note that
$f_{\,\ge\, v}(t)$
is
$p^{-v}$
-periodic. Then the sequence
$f_{\,\ge\, v_n},\ldots ,f_{\,\geq\, v_1}=f$
is a martingale, owing to the identity

In this context, Theorem 5.1 was explicitly observed, in the case
$r=1$
, as a consequence of Burkholder’s theorem by Gundy and Varapoulos in [Reference Gundy and VaropoulosGV76].
The generalization to higher rank
$r$
is routine, and below we present the salient points of the proof, in order to have the necessary facts combined in a single source (they can be also located in various parts of [Reference Edwards and GaudryEG77]). Since each of [Reference Edwards and GaudryEG77], [Reference PisierPis16], and [Ste09] have readable expositions of Burkholder’s theorem, we stop short of giving its complete proof. Beyond specializing these facts to the application at hand, no originality is claimed.
Applying Theorem 5.1 with
$q=4$
and
$\widehat {f}={\mathbf {1}}_A$
, we have, we have the following refinement to Chang’s energy estimate.
Corollary 5.2.
Let
$A\subseteq {\mathbb {Z}}$
be a finite set of the form

Then there is a constant
$C\gt 0$
,

Corollary 5.2, when coupled with the Cauchy–Schwarz inequality, recovers Chang’s original estimate. However, we will see in the next section, that for many pairs
$({{\boldsymbol {v}}}_1,{{\boldsymbol {v}}}_2)$
, we have a substantial improvement on the trivial energy estimate.
As mentioned above, in order to prove Theorem 5.1, we will make use of Burkholder’s inequalities for martingale transforms. Specifically, we will use the following result which bounds the norm of multipliers which are constant on
$p$
-adic scales. In what follows, we write

Theorem 5.3.
Let
$p$
be a prime and suppose
$q$
is such that
$1\lt q\lt \infty$
. Let
$\varepsilon :{\mathbb {Z}}\to \{-1,1\}$
be a function such that
$\varepsilon (n)$
depends only on
$v_p(n)$
. Then there is an absolute constant
$C_q$
, depending only on
$q$
such that we have

One should think of choosing
$\varepsilon$
to be random, subject to the constraint that it be constant on
$p$
-adic scales. Then, the multiplier theorem above is seen to be equivalent to the square-function estimate quoted in Theorem 5.1 (in the case
$r=1$
) by way of Khintchine’s inequality. First, some notation: for a partition
$\mathcal {P}$
of
$\mathbb {Z}$
, write

Lemma 5.4.
Let
$\mathcal {P}$
be a partition of
$\mathbb {Z}$
. Then the following are equivalent:
-
(1) for any
$q\gt 1$ there are constants
$c_q$ and
$C_q$ such that for any trigonometric polynomial
$f$ ,
\begin{equation*}c_q\|f\|_{L^{q}}\leq \left \|S_{\mathcal {P}}f\right \|_{L^{q}}\leq C_q\|f\|_{L^{q}},\end{equation*}
-
(2) for any
$q\gt 1$ there is a constant
$C_q$ such that for any trigonometric polynomial
$f$ and any function
$\varepsilon :{\mathbb {Z}}\to \{-1,1\}$ which is constant on the parts of
$\mathcal {P}$ , we have
\begin{equation*}\left \|f\right \|_{L^{q}}\leq C_q\left \|f_\varepsilon \right \|_{L^{q}}.\end{equation*}
Proof. Let

Then assuming clause (1) of the lemma,

Conversely, if we assume clause (2) of the lemma, then we can write
$f=(f_{\varepsilon })_\varepsilon$
so the reverse inequality

holds, and taking expectation over all choices of
$\varepsilon$
,

We get from Khintchine’s inequality (see, for instance, Lemma 5.5 of [Reference Muscalu and SchlagMS13]) that

which proves the second inequality from clause (1) upon integration over
$t\in [0,1]$
.
To prove the first inequality in clause (1), we appeal to duality. Let
$q^{\prime}$
be the dual exponent to
$q$
and take a trigonometric polynomial
$g$
such that
$\|g\|_{L^{q^{\prime}}}=1$
. By orthogonality and the triangle inequality,

By the Cauchy–Schwarz inequality, the right-hand side is at most

where in the last estimate we have applied Hölder’s inequality and the second inequality from clause (1) to
$g$
.
Here we remark that, using the last part of the above proof, it will generally suffice to prove the second inequality from clause (1), whence the first can be derived from duality.
The reason for introducing the multiplier formulation is that it is well-suited to iteration, allowing us to prove the following.
Lemma 5.5.
Let
$q\geq 1$
and suppose
$\mathcal {P}_1$
and
$\mathcal {P}_2$
are two partitions of
$\mathbb {Z}$
such that for any trigonometric polynomial
$f$
,

Then if
$\mathcal {P}=\{P_1\cap P_2:P_1\in \mathcal {P}_1,P_2\in \mathcal {P}_2\}$
, there is a constant
$C_q(\mathcal {P})$
such that

Proof.
First, assume
$q\geq 2$
. Let
$\varepsilon :{\mathbb {Z}}\to \{-1,1\}$
be a function which is constant on the parts of
$\mathcal {P}_2$
, and suppose
$f$
is a trigonometric polynomial. By hypothesis and Lemma 5.4, there is a positive constant
$C$
such that

whence

Now

and taking expectation over
$\varepsilon$
yields

The expectation vanishes unless
$P_2=P_2^{\prime}$
, in which case it is 1, and hence

Raising to the power
$q/2$
, we find

by Jensen’s inequality, and integrating over
$t$
shows

as required.
To get the claim for
$1\leq q\lt 2$
, we use duality and the random multiplier formulation. Indeed, let
$q^{\prime}\geq 2$
be the exponent conjugate to
$q$
, let
$\varepsilon :{\mathbb {Z}}\to \{-1,1\}$
be a function which is constant on the parts of
$\mathcal {P}$
, and let
$g$
be a trigonometric polynomial with
$\|g\|_{L^{q^{\prime}}}=1$
. Then by Parseval and Hölder’s inequality,

which shows
$\|f_\varepsilon \|_{L^q}\leq C_{q}\|f\|_{L^q}$
for any
$\varepsilon :{\mathbb {Z}}\to \{-1,1\}$
which is constant on the parts of
$\mathcal {P}$
, and hence the boundedness of
$S_{\mathcal {P}}$
follows from Lemma 5.4.
Proof of Theorem
5.1. To each prime
$p_i$
with
$1\leq i\leq r$
we associate the partition
$\mathcal {P}_i$
of
$\mathbb {Z}$
into
$p_i$
-adic scales. From Theorem 5.3 and Lemma 5.4, we see that for each prime
$p_i$
with
$1\leq i\leq r$
, we find a constant
$C_q$
such that
$\|S_{\mathcal {P}_i}f\|_{L^q}\leq C_q\|f\|_{L^q}$
. From duality, it suffices to show that the common refinement of the partitions
$\mathcal {P}_i$
yields a bounded square function. This in turn follows from Lemma 5.5 applied
$r-1$
times after we note, from the proof of the lemma, that its constants
$C_q(\mathcal {P}_j)$
can be taken to ensure that in the notation of the lemma
$C_q(\mathcal {P}) = C_q(\mathcal {P}_1)\cdot C_q(\mathcal {P}_2).$
6. Energy estimates with dilates
In this section we prove estimates for the number of differences which lie in a fixed coset of a multiplicative group of bounded rank. When the rank is one, this can be achieved in an elementary fashion as described by the following theorem. This theorem will not be needed, unless
$k=2$
(although this special case, as has been discussed at the outset, is already quite non-trivial) but we include it as it may be of independent interest to prove our results without an appeal to much more sophisticated results.
Theorem 6.1.
Let
$B$
be a finite set of positive integers with
$|B|\geq 2$
, let
$p$
be a prime, and let
$n$
be a non-zero integer. Define

Then
$|X_p(B,n)|\leq |B|+4|B|\log _2|B|$
.
Proof.
Let
$p$
be a fixed prime. It will be convenient to normalize
$B$
as follows. First, if
$p$
divides
$n$
then we write
$n=p^{v}n^{\prime}$
with
$\mathrm {g.c.d.}(n^{\prime},p)=1$
. Then
$X_p(B,n)\subseteq X_p(B,n^{\prime})$
and so there is no loss of generality in assuming
$\mathrm {g.c.d.}(n,p)=1$
. Next, if
$B$
lies in a single congruence class modulo
$p^{r_0}$
for some
$r_0\gt 0$
, then we may replace
$B$
with
$B-\min B$
(or any other element of
$B$
), without affecting
$|X_p(B)|$
. The result would be that
$B-\min B$
consists of multiples of
$p^{r_0}$
, and since
$p$
does not divide
$n$
, we can then bound
$|X_p(B,n)|$
by
$|X_p(p^{-r_0}B,n)|$
. So we may further assume the elements of
$B$
are not all congruent modulo
$p$
.
We proceed by induction on
$|B|$
. When
$|B|=2$
, suppose
$B=\{b,b^{\prime}\}$
is a set and
$p$
is a prime. Then
$b-b^{\prime}=p^rn$
for at most one value of
$r$
, so
$|X_p(B)|\leq 1$
and this establishes the base case. For larger
$B$
, we condition on the value of
$b\ (\text {mod }p)$
. To do that, we write

and let
$\mu (u)=\frac {|B_u|}{|B|}$
be the accompanying probability measure. Given
$b\equiv u\, (\text {mod }p)$
, we either have
$b-b^{\prime}=n$
in which case
$b^{\prime}\equiv u-n\ (\text {mod }p)$
, and such solutions contribute at most

solutions, where
$u_{\max }$
is the residue class for which
$\mu (u)$
is largest. Otherwise
$b-b^{\prime}=p^rn$
for some
$r\gt 0$
in which case
$b$
and
$b^{\prime}$
agree modulo
$p$
. Thus,

Since
$B_u\neq B$
by our normalization, we apply induction and find

Putting this all together,

Here,

is the entropy of the measure
$\mu$
. We claim

If the minimum is achieved on the second term, the inequality is true since

Otherwise
$\mu (u_{\max })\leq 1/2$
, in which case the minimum value of entropy is
$1$
by its convexity properties (corresponding to the least uncertain case when there are only two values of
$u$
with equal probabilities
$1/2$
).
It may be that the above argument extends to the case of higher rank
$r\gt 1$
. However, we can just overwhelm the problem with some heavy machinery from the theory of
$S$
-unit equations. The following argument uses a quantitative estimate concerning linear equations in a multiplicative group, Theorem 6.2 from [Reference Amoroso and ViadaAV09], improving the work of [Reference Evertse, Schlickewei and SchmidtESS02].
Theorem 6.2 (
$S$
-unit bound). Let
$S=\{p_1,\ldots ,p_r\}$
be a set of rational primes and let
$\Gamma =\left \langle S\right \rangle$
be the multiplicative group they generate. For fixed
$a_1,\ldots ,a_l\in \mathbb C^\times$
,

where the notation
$\sum ^*$
indicates non-degeneracy in the sense that
$\sum _{i\in I}a_i\gamma _i\neq 0$
for non-empty proper subsets
$I\subset \{1,\ldots ,l\}$
.
We need this estimate for the following application taken from [Reference Roche-Newton and ZhelezovRZ15], see Lemma 2.1 therein. We include the proof so as to be quantitatively explicit. We quote the result for rational numbers, although it applies much more broadly.
Lemma 6.3 (Lemma 2.1 of [Reference Roche-Newton and ZhelezovRZ15]). Suppose
$0\lt \varepsilon \lt \frac {1}{6}$
. For any sufficiently large set
$B$
of rational numbers and a multiplicative group
$\Gamma \subseteq {\mathbb {C}}^\times$
generated by
$r$
rational primes such that
$r\leq (\log |B|)^{1-6\varepsilon }$
, one has the estimate

Proof.
For ease of notation, let
$|B|=n$
, and if necessary, augment
$\Gamma$
by adjoining
$-1$
to it. Consider the undirected graph
$G$
on the vertex set
$B$
, whose edges are those
$\{b_1,b_2\}$
satisfying
$b_1-b_2\in \Gamma$
. Let the number of edges be denoted by
$nf(n)$
, for some function
$f(n)$
. One can assume that
$f(n)$
is increasing and larger than
$(\log n)^r$
, or else there is nothing to prove.
Let
$d=f(n)/2$
. We first prune
$G$
by iteratively removing vertices with degree less than
$d$
, updating the degrees of the vertices (but not the threshold
$d$
) after each stage to reflect any removal. This process must terminate after at most
$n$
steps as there are at most
$n$
vertices that can be removed, and when it does terminate, we can have removed no more than
$n\cdot f(n)/2$
edges. If necessary, we redefine
$G$
to be the pruned graph, in which each vertex has degree at least
$f(n)/2$
.
Fix a vertex
$b_0$
, and consider a non-degenerate path of length
$l$
in
$G$
, starting from
$b_0$
. The path
$b_0,b_1,\ldots ,b_l$
corresponds to the telescopic sum

and we call the path non-degenerate if no subsum of the right-hand side vanishes. Given a non-degenerate path of length
$l$
, one can append to it at least
$f(n)/2 - (2^l-1)$
edges and get a non-degenerate path of length
$l+1$
. Indeed, there are only
$2^l-1$
edges that could lead to a degeneracy. Hence, we find by way of induction that the number of non-degenerate paths of length
$l$
is at least
$f(n)^l/4^l,$
provided that
$f(n)\geq 2^{l+2}$
.
So, there are at least
$f(n)^{l-1}/(4^{l-1} n)$
non-degenerate paths between
$b_0$
and some other element
$b\in B$
, and so
$f(n)^{l}/(4^{l} n)$
paths from
$b_0$
to some
$b_1\in B$
, by appending an edge
$b-b_1$
to the said path. On the other hand, Theorem 6.2 provides the upper bound for this number of paths, once one chooses
$a_i = 1/(b_1-b_0)$
for
$1\leq i\leq l$
. Taking logarithms and assuming that
$r\geq 2$
and, say
$l\geq 100$
, it simplifies the upper bound to

Upon choosing
$l \approx \left (\frac {\log n}{r}\right )^{\frac {2}{11}}$
to balance the terms of the right-hand side, we conclude that

which completes the proof in view of the bound on
$r$
assumed in the statement of the lemma.
Observe that in Lemma 6.3, the condition
$b_1-b_2\in \Gamma$
can be replaced by a coset membership
$b_1-b_2\in u\cdot \Gamma$
by dilating
$B$
.
7. The proof of Theorem 1.1
Let
$A$
be a finite set of integers such that
$\omega (a)\leq k$
for each
$a\in A$
. By passing to a subset and dilating by
$-1$
if necessary, we may assume that
$A\subset {\mathbb {N}}$
, at the cost of a constant factor. Set
$K=|A\cdot A|/|A|$
and apply Theorem 4.2, assuming that its necessary condition on
$K$
is satisfied or we end up with a much stronger claim than Theorem 1.1, to find a set of primes
$\{p_1\cdots p_r\}$
, generating the group
$\Gamma$
, and a structured set
$\tilde A\subseteq A$
of size

Namely, with the notation
$\boldsymbol p=(p_1,\ldots ,p_r)$
,

where each
$B_{{\boldsymbol {v}}}$
is a finite set of integers prime to the set
$p_1\cdots p_r$
, and the union
$B=\bigcup _{{{\boldsymbol {v}}}\in V}B_{{\boldsymbol {v}}}$
satisfies

We now estimate the additive energy of
$\tilde A$
, and in doing so, we may assume that
$\tilde A=-\tilde A$
, augmenting it if necessary. By Corollary 5.2, we have that

The sum above counts solutions in
$\tilde A$
to the equation

where
$a_1=b_1{{\boldsymbol {p}}}^{{{\boldsymbol {v}}}_1}$
,
$a_2=b_2{{\boldsymbol {p}}}^{{{\boldsymbol {v}}}_1}$
for some
$b_1,b_2\in B_{{{\boldsymbol {v}}}_1}$
and
$a_3={{\boldsymbol {p}}}^{{{\boldsymbol {v}}}_2}b_3$
,
$a_4={{\boldsymbol {p}}}^{{{\boldsymbol {v}}}_2}b_4$
for some
$b_3, b_4\in B_{{{\boldsymbol {v}}}_2}$
.
In other words, we have reduced to the case where the exponents appearing on the left- and right-hand sides of the energy equation are both repeated. Let us now write
$\gamma ={{\boldsymbol {p}}}^{{{\boldsymbol {v}}}_1}$
and
$\gamma ^{\prime}={{\boldsymbol {p}}}^{{{\boldsymbol {v}}}_2}$
so we are left counting solutions to

where now
$b_1,\ldots ,b_4\in B$
,
$\gamma \in \Gamma _{b_1}\cap \Gamma _{b_2}$
and
$\gamma ^{\prime}\in \Gamma _{b_3}\cap \Gamma _{b_4}$
. The only solutions where
$b_3=b_4$
correspond to trivial solutions to the energy equation in
$\tilde A$
, of which there are at most
$|\tilde A|^2$
. For the remaining solutions, observe that knowing
$a_4\in \tilde A$
gives us maximum
$|B|$
choices for
$a_3$
. Hence, the the total number of solutions of (5) is bounded, after multiplying
$B$
by the constant arising from the choice of
$(a_4,b_3)$
, by

We can assume that, say
$|A|^{1-\epsilon }\gt |B|\gt 2^k$
, and for otherwise we have proved more than enough. Then applying Lemma 6.3 yields

upon inserting the bounds for
$|B|$
and
$|\tilde A|$
from (4) and (3).
If the quantity
$C^k|\tilde A|^2$
dominates then we have proved more than enough. If not, then from the bound (3), it would suffice to prove

which is now obvious since the left-hand side is minimized when
$K\approx |A|^{2/3}$
.
Acknowledgements
The authors thank the Heilbronn Institute for Mathematical Research (HIMR) for funding a Focused Research Group Testing Additive Structure in May–June 2022, where this project had its inception, and the Johann Radon Institut (RICAM) Linz for being the venue provided. We personally thank Oleksiy Klurman for co-organising the FRG and Oliver Roche-Newton for hosting it. Special thanks to an anonymous referee for a slightly stronger and clearer version of Theorem 4.2.
Conflicts of interest
None.
Financial support
BH is supported by NSF Award 2135200. DZ was supported by the Austrian Science Fund FWF Project P 34180. Thanks to the Johann Radon Institut (RICAM) Linz for providing additional funding.
Journal information
Compositio Mathematica is owned by the Foundation Compositio Mathematica and published by the London Mathematical Society in partnership with Cambridge University Press. All surplus income from the publication of Compositio Mathematica is returned to mathematics and higher education through the charitable activities of the Foundation, the London Mathematical Society and Cambridge University Press.