1 Introduction
1.1 Context
In graph theory, Sidorenko’s conjecture [Reference SidorenkoSid93] predicts that if H is a bipartite graph, then among all graphs G with n vertices and fixed average degree, the number of copies of H in G is asymptotically (as
$|V(G)|\to \infty $
) minimised when G is random. We will call graphs H which satisfy this property Sidorenko. Sidorenko’s conjecture has been resolved for certain classes of bipartite graph (e.g., trees, cycles, complete bipartite graphs [Reference SidorenkoSid91], see also [Reference Conlon, Fox and SudakovCFS10, Reference HatamiHat10, Reference Li and SzegedyLS11, Reference SzegedySze14, Reference Kim, Lee and LeeKLL16, Reference Conlon and LeeCL17, Reference Conlon, Kim, Lee and LeeCKLL18, Reference Conlon and LeeCL21]) though the conjecture in general remains open, and is indeed of great interest. There is a related colouring property which is also of interest: for which graphs H is it true that, among all two-colourings of the edges of
$K_n$
, the number of monochromatic copies of H is minimised by a random two-colouring (asymptotically as
$n\to \infty $
)? Such graphs H are called common. It is not difficult to see that if a graph satisfies the Sidorenko property, then it is common. Goodman showed that
$K_3$
is common [Reference GoodmanGoo59]. Erdős [Reference ErdősErd62] subsequently conjectured that
$K_4$
is common and Burr–Rosta [Reference Burr and RostaBR80] that every graph is common. Sidorenko [Reference SidorenkoSid89] showed that a triangle with a pendant edge is not common and Thomason [Reference ThomasonTho89] showed that in fact
$K_4$
is not common, together disproving both conjectures. In fact, any graph which contains
$K_4$
is uncommon [Reference Jagger, Šťovíček and ThomasonJvT96]. It is not difficult to see that if H is not bipartite then it is not Sidorenko, and so we have a conjectural classification of all graphs. Even a reasonable conjectured classification of common graphs seems out of reach at the moment.
There has been recent interest in obtaining analogous conjectures and results in arithmetic contexts, initiated by Saad and Wolf [Reference Saad and WolfSW17]. Which systems of linear forms
$\Psi $
have the property that, among all subsets S of
$\mathbb {F}_p^n$
of fixed size, the number of solutions to
$\Psi $
in S is minimised when S is chosen randomly? Such systems are called Sidorenko. Those systems with the property that random two-colourings minimise the number of monochromatic solutions among all two-colourings are called common. Again, Sidorenko implies common. In the arithmetic setting, neither property has a conjectured classification.
One situation in which a classification is known is when the system of linear forms has codimension one (i.e., its image is described by a single equation). Here, an equation is Sidorenko if [Reference Saad and WolfSW17] and only if [Reference Fox, Pham and ZhaoFPZ21] its coefficients may be partitioned into pairs which sum to zero, and is common and not Sidorenko if and only if it has an odd number of variables. For systems of higher codimension, one makes the trivial observation that the union of two Sidorenko systems on disjoint variable sets is Sidorenko. Kamčev, Liebenau and Morrison give a nontrivial method by which higher codimension Sidorenko systems can be built from Sidorenko equations; see [Reference Kamčev, Liebenau and MorrisonKLM23, Theorem 1.3]. There are also a number of results in the negative direction [Reference Kamčev, Liebenau and MorrisonKLM24, Reference Kamčev, Liebenau and MorrisonKLM23, Reference Saad and WolfSW17, Reference VersteegenVer21]. Finally, we note that Král’–Lamaison–Pach have reported on a partial classification of common systems comprising two equations in
$\mathbb {F}_2^n$
[Reference Král’, Lamaison and PachKLP22] and have since obtained a near-complete classification of these systems, which is due to appear in forthcoming work (personal communications).
Lovász [Reference LovászLov11] showed that the graph-theoretic Sidorenko conjecture is locally true: for all bipartite graphs H, there is no suitably small perturbation of the random graph G which increases the number of copies of H in G. He subsequently classified locally Sidorenko graphs: a graph H is locally Sidorenko if and only if it is a forest or has even girth [Reference LovászLov12, Theorem 16.26] (see also [Reference Fox and WeiFW17] to this end). The study of local notions of commonness has also attracted recent interest: [Reference Cs’oka, Hubai and Lov’aszCHL22, Reference Hancock, Král’, Krnc and VolecHKKV22, Reference LovászLov12]. One of the purposes of this article is to introduce these local notions to the arithmetic setting. Finally, to further motivate the study of these local properties, we note that [Reference AltmanAlt23] uses a local Sidorenko property of a particular linear system as a key ingredient in the main result there.
1.2 Results
After Fox–Pham–Zhao [Reference Fox, Pham and ZhaoFPZ21] completed the classification of codimension one systems, the next natural goal ought perhaps to be a classification of codimension two systems.
Codimension one systems are Sidorenko/common if and only if they are locally Sidorenko/common. Therefore, the classification of these properties for codimension one systems is also complete, and local notions are only interesting when the codimension is at least two.
We study two different notions of locality. Loosely speaking, the weaker notion (‘weakly locally Sidorenko/common’) allows the radius of perturbation to depend on the function by which we perturb, and the stronger notion (‘locally Sidorenko/common’) asks that the radius is uniform in the choice of function.
One of our main theorems shows that a certain generic family of systems of codimension 2 is not Sidorenko, and (nearly) classifies this family by whether they are weakly locally Sidorenko or locally Sidorenko. We direct the reader to Subsection 2.1 and in particular Definition 2.2 and Definition 2.3 for formal definitions of the weak local Sidorenko property and the local Sidorenko property. We note that Sidorenko implies locally Sidorenko, which in turn implies weakly locally Sidorenko. We note also that in fact all proofs in the literature listed above which show that a particular system is not Sidorenko in fact show the stronger property that it is not locally Sidorenko.
Let p be an odd prime and let
$\Psi =(\psi _1,\ldots ,\psi _t)$
be a system of linear forms, each mapping
$\mathbb {F}_p^D$
to
$\mathbb {F}_p$
, whose image is determined by a
$2 \times t$
system of linear equations with coefficients
$$\begin{align*}M:=\begin{pmatrix} b_{11} & b_{12} & \cdots & b_{1t} \\ b_{21} & b_{22} & \cdots & b_{2t} \end{pmatrix}.\end{align*}$$
We will call
$\Psi $
linearly generic if every
$2 \times 2$
minor of M has nonzero determinant.Footnote
1
An additive k-tuple is a tuple
$(x_1, x_2, \ldots , x_k)$
satisfying the equation
$x_1-x_2+x_3-x_4 + \cdots + (-1)^{k+1}x_k=0$
. We say that
$\Psi $
contains an additive k-tuple if a vector of the form
$(0,\ldots , 0,1,-1,1,\ldots , (-1)^{k+1})$
lies in the row span of M, up to reordering of columns.
Theorem 1.1. Let
$\Psi $
be a codimension two system of linear equations in t variables which is linearly generic. Then
$\Psi $
is not locally Sidorenko (and so not Sidorenko). If t is even then
$\Psi $
is not weakly locally Sidorenko. Furthermore, for p sufficiently large in terms of t, if t is odd and
$\Psi $
is weakly locally Sidorenko, then
$\Psi $
contains an additive
$(t-1)$
-tuple.
Thus, in the situation that p is large in terms of t, we have classified all linearly generic systems by whether they are Sidorenko, locally Sidorenko, or weakly locally Sidorenko, except for those systems with t odd and which contain an additive
$(t-1)$
-tuple. We note that the condition of containing a
$(t-1)$
-tuple is more than a reflection of the limitation of our methods: there does exist a linearly generic system with t odd, which contains an additive
$(t-1)$
-tuple, and which is weakly locally Sidorenko (see Example 4.2).
As a consequence of Theorem 1.1, we answer the following conjecture and question of Kamčev–Liebenau–Morrison.
Conjecture 1.2 [Reference Kamčev, Liebenau and MorrisonKLM23, Conjecture 5.1].
Let
$\Psi $
be a system of
$5$
linear forms whose image has codimension 2. If
$\Psi $
is linearly generic then
$\Psi $
is not Sidorenko.
Question 1.3 [Reference Kamčev, Liebenau and MorrisonKLM23, Question 5.3].
Let
$t\geq 7$
be odd. Does there exist a system of t linear forms
$\Psi $
whose image has codimension 2, which is linearly generic and which is Sidorenko?
We note that the above conjecture and question as they appear in [Reference Kamčev, Liebenau and MorrisonKLM23] do not refer directly to the linearly generic condition, but the formulation there is equivalent. We have the following corollary of Theorem 1.1.
Define the complexity of
$\Psi =(\psi _1,\ldots ,\psi _t)$
, where each
$\psi _i$
is a linear map from
$\mathbb {F}_p^D$
to
$\mathbb {F}_p$
, to be the smallest positive integer s such that the functions
$(\psi _i^s)_{i=1}^t$
are linearly independent over
$\mathbb {F}_p$
. Loosely speaking, if the complexity of a system is 1, then its analysis should be controlled by Fourier analysis, and one ought not require the higher-order theory. We direct the reader to [Reference Gowers and WolfGW10, Reference Gowers and WolfGW11a, Reference Gowers and WolfGW11b], for relevant definitions, discussions and results to this end. One may easily compute that for
$t\geq 5$
and p sufficiently large, systems whose images have codimension two generically have complexity one. Where it is not difficult, using higher-order constructions, to contrive systems of complexity at least
$2$
which are weakly locally Sidorenko but not Sidorenko, the situation is less clear for complexity one systems (we will justify this claim in Subsection 4.2). We believe therefore that the following corollary of Theorem 1.1 is also worth noting.
Corollary 1.5. Among complexity one systems, the weak local Sidorenko property is not equivalent to the local Sidorenko property.
The explicit example demonstrating Corollary 1.5 is Example 4.2 in Section 4.
Of independent interest is the observation and method underlying Theorem 1.1 and its corollaries. We observe that although these systems have complexity one, their ‘dual’ system (i.e., the linear system obtained in frequency space after Fourier inversion) need not have complexity one. The most involved part of Theorem 1.1 utilises higher-order constructions in the frequency space (Theorem 3.5). More broadly, the proof of Theorem 1.1 also utilises other combinatorics on the dual pattern of a linear system (Theorem 3.1). Since the uploading of this document to the arXiv, the use of combinatorics on the dual pattern of a linear system has been called a Fourier template by [Reference Dong, Li and ZhaoDLZ24], and is used in both [Reference Altman and LiebenauAL25] and [Reference Dong, Li and ZhaoDLZ24].
We also provide the first example of a system which is locally Sidorenko but not Sidorenko; see Example 4.1.
Proposition 1.6. In the arithmetic setting, the Sidorenko property is not equivalent to the local Sidorenko property.
This result is perhaps slightly less trivial than it sounds; for example, in the graph-theoretic setting, it remains an open question whether local commonness (appropriately defined, see [Reference Cs’oka, Hubai and Lov’aszCHL22]) is equivalent to commonness. One of the reasons we have chosen to study these local properties is that all existing proofs in the literature that systems are not Sidorenko [Reference Fox, Pham and ZhaoFPZ21, Reference Kamčev, Liebenau and MorrisonKLM23, Reference Saad and WolfSW17, Reference VersteegenVer21] prove the stronger statement that in fact these systems are not locally Sidorenko.
Finally, we note that the linear genericity condition in Theorem 1.1 is necessary. For example, it has been observed by Kamčev–Liebenau–Morrison [Reference Kamčev, Liebenau and MorrisonKLM23] that the system
is Sidorenko, and it clearly does not contain an additive tuple.
In another direction, it has been known for some time [Reference Jagger, Šťovíček and ThomasonJvT96, Theorem 12] that any graph containing
$K_4$
is uncommon. Saad and Wolf [Reference Saad and WolfSW17] asked whether the same is true for four-term arithmetic progressions in the arithmetic context. This question was answered in the affirmative recently by Kamčev–Liebenau–Morrison [Reference Kamčev, Liebenau and MorrisonKLM24] and independently by Versteegen [Reference VersteegenVer21]. Both proofs build on a ‘quadratic’ construction of Gowers [Reference GowersGow20]. In Section 5, we give (together with a lemma from [Reference Kamčev, Liebenau and MorrisonKLM24]) a short proof of Saad and Wolf’s conjecture which does not rely on higher-order constructions. Our proof is also more amenable to generalisation to rank two systems in
$2k$
variables, which would be a significant step in the classification of weakly locally common systemsFootnote
2
. In fact, since this document has been on the arXiv, both the author and Liebenau [Reference Altman and LiebenauAL25] and Dong, Li and Zhao [Reference Dong, Li and ZhaoDLZ24] have used the method introduced in Section 5 below to achieve this generalisation.
We set up some notational and technical preliminaries in Section 2. We prove Theorem 1.1 in Section 3. In Section 4 we give examples to prove Corollary 1.5 and Proposition 1.6. Finally in Section 5 we give a new proof of the answer to Saad and Wolf’s question [Reference Saad and WolfSW17, Question 3.6].
2 Preliminaries
2.1 Definitions, basic properties
Throughout we let
$\Psi = (\psi _1,\ldots , \psi _t)$
be a system of
$\mathbb {F}_p$
-linear forms, each mapping
$\mathbb {F}_p^D$
to
$\mathbb {F}_p$
; we often abuse notation and let them induce linear maps from
$(\mathbb {F}_p^n)^D$
to
$\mathbb {F}_p^n$
in the natural way. Define the operator
$T_\Psi $
acting on functions
$f: \mathbb {F}_p^n \to \mathbb {C}$
by the formula
where throughout
$\mathbb {E}$
is a normalised count, so here it is a shorthand for
$\frac {1}{p^{nD}}\sum $
. We will always assume that
$D \leq t$
and that
$\Psi ^{-1}(0)=0$
(it is clear that one may reduce to this case). At times it will be more convenience to describe a system of linear forms by the equations which define its image. Suppose that
$M_\Psi $
is an
$\mathbb {F}_p$
-valued matrix such that
$\operatorname {\mathrm {Im}} \Psi = \ker M_\Psi $
. Then we have
where
$\boldsymbol {x} = (x_1,\ldots ,x_t)\in (\mathbb {F}_p^n)^t$
.
Throughout this document we will deal with functional versions of the Sidorenko and common properties in which we ask the following of all functions
$f:\mathbb {F}_p^n \to [0,1]$
, rather than just restricting to characteristic functions of sets.
Definition 2.1. A system of t linear forms
$\Psi $
is Sidorenko if, for all
$n\geq 1$
and all
$f:\mathbb {F}_p^n \to [0,1]$
, we have
Note that any function can be written as the sum of a constant function and a function with zero average, whence the inequality in Definition 2.1 can be written as
$T_\Psi (\alpha + f) \geq \alpha ^t$
, where the condition must hold for all
$\alpha \in [0,1]$
and all
$f: \mathbb {F}_p^n \to [-\alpha , 1-\alpha ]$
with
$\mathbb {E}_x f(x) = 0$
. A system is locally Sidorenko if the above inequality is satisfied for all sufficiently small perturbations of the constant function.
Definition 2.2. A system of t linear forms
$\Psi $
is locally Sidorenko if for all
$\alpha \in [0,1]$
there exists
$\varepsilon>0$
such that for all
$n \geq 1$
and all
$f:\mathbb {F}_p^n\to [-\alpha , 1-\alpha ]$
with
$\mathbb {E}_xf(x) = 0$
, we have
We remark that that this notion of locality considers small perturbations of the constant function with respect to the
$\ell ^\infty $
norm on the space of functions. One may also consider analogous notions of locality for different norms. We do not do so in this document, but note that, for example, considering the local Sidorenko property with respect to the
$\ell ^\infty $
norm on the Fourier side was an important part of the main result of [Reference AltmanAlt23].
Above, the size of the neighbourhood of the constant function is fixed independently of
$n,f$
. If the size of the neighbourhood is allowed to vary with these parameters, then we will say that the linear system is weakly locally Sidorenko.
Definition 2.3. A system of t linear forms
$\Psi $
is weakly locally Sidorenko if for all
$n\geq 1$
, for all
$\alpha \in [0,1]$
and all
$f:\mathbb {F}_p^n\to [-\alpha , 1-\alpha ]$
with
$\mathbb {E}_xf(x) = 0$
, there exists
$\varepsilon _f>0$
such that for all
$\varepsilon < \varepsilon _f$
we have
Definition 2.4. A system of t linear forms
$\Psi $
is common if, for all
$n\geq 1$
and all
$f:\mathbb {F}_p^n \to [0,1]$
, we have
Local commonness and weak local commonness may be defined analogously.
Let
$\Psi =(\psi _1,\ldots , \psi _t)$
be a system of t linear forms. Let
$\alpha $
be a constant and f a function on
$\mathbb {F}_p^n$
with
$\mathbb {E}_x f(x) = 0$
. For a subset
$S\subset [t]$
, we denote the corresponding subsystem of
$\Psi $
by
$\Psi (S)$
(i.e., the restriction of
$\Psi $
to the coordinates in S). Then, by the multilinearity of the operator T we have
Note that if f has
$\mathbb {E}_xf(x)=0$
and if the image of
$\Psi (S)$
contains a coordinate hyperplane then
$T_{\Psi (S)}(f) =0$
. It follows from the linear genericity condition (recall: every
$2\times 2$
minor of the matrix defining its equations has nonzero determinant), then for all nonempty
$S \subset [t]$
with
$|S| \leq t-2$
, the subsystem
$\Psi (S)$
contains a coordinate hyperplane and so yields
$T_{\Psi (S)}(f)=0$
. Thus for linearly generic systems we have
Furthermore, for such systems, if
$|S|=t-1$
then the image of
$\Psi (S)$
has codimension one (i.e., its image is described by a single equation).
2.2 Fourier inversion
The reader is directed to [Reference ZhaoZha23, Chapter 6.1] for further background on the basics of Fourier analysis in
$\mathbb {F}_p^n$
.
Recall the Fourier transform defined by
where
$e_p(\cdot ) := e^{\frac {2\pi i \cdot }{p}}$
. Recall also the Fourier inversion formula:
$$\begin{align*}f(x) = \sum_{h\in \mathbb{F}_p^n}\hat f(h)e_p(h\cdot x),\end{align*}$$
and Parseval’s identity
For a linear system
$\Psi $
, let
$M=M_\Psi $
have
$\operatorname {\mathrm {Im}} \Psi = \ker M$
. We will make use of the following consequence of the Fourier inversion formula:
where
$\boldsymbol {h} = (h_1,\ldots , h_t)\in (\mathbb {F}_p^n)^t$
.
2.3 Tensoring
Definition 2.5. Let
$f_1:\mathbb {F}_p^{n_1} \to \mathbb {C}$
and
$f_2: \mathbb {F}_p^{n_2} \to \mathbb {C}$
. Then the tensor product
$f:= f_1 \otimes f_2 : \mathbb {F}_p^{n_1+n_2} \to \mathbb {C}$
is the function defined by
$f(x) = f_1(\pi _1(x))f_2(\pi _2(x))$
, where
$\pi _1$
is the projection onto the first
$n_1$
coordinates of
$\mathbb {F}_p^{n_1+n_2}$
and
$\pi _2$
is the projection onto the last
$n_2$
coordinates.
It is clear that
$\otimes $
is an associative operation and so one may naturally define the tensor product of a finite collection of functions.
Observation 2.6. For a system of linear forms
$\Psi $
, we have
Note that there is a slight abuse of notation here: we interpret
$T_\Psi $
as an operator on
$\mathbb {F}_p^{n_1+n_2}$
,
$\mathbb {F}_p^{n_1}$
and
$\mathbb {F}_p^{n_2}$
as it appears respectively. The equation above is easily verified by direct computation.
3 Proof of Theorem 1.1
Recall that we call a system of two equations linearly generic if the
$2 \times t$
matrix
$$\begin{align*}M:=\begin{pmatrix} b_{11} & b_{12} & \cdots & b_{1t} \\ b_{21} & b_{22} & \cdots & b_{2t} \end{pmatrix}\end{align*}$$
which defines the equations has the property that all of its
$2 \times 2$
minors have nonzero determinant.
3.1 Systems without additive tuples are not weakly locally Sidorenko
We first work towards proving the following theorem, which is one of the statements in Theorem 1.1.
Theorem 3.1. Let
$\Psi $
be a system of two linear equations in t variables which is linearly generic. If t is even or if
$\Psi $
does not contain an additive tuple and p is sufficiently large in terms of t, then
$\Psi $
is not weakly locally Sidorenko.
We now prove the theorem, modulo the proof of two propositions which will appear subsequently and occupy the remainder of the subsection. In both propositions, we prove something more general than what is needed for the application in this document. We do so to illustrate the scope of the constructions and because it is clear that these more general statements can be used to obtain more general results. See also Remark 3.4.
Proof of Theorem 3.1.
Recall from Equation (2) that we have
It suffices to show that there exists f with
$\mathbb {E} f = 0$
such that
$\sum _{|S|=t-1} T_{\Psi (S)}(f)<0$
; then one makes
$\varepsilon $
appropriately small to conclude. If
$t-1$
is odd, then this is trivial: take f for which
$\sum _{|S|=t-1} T_{\Psi (S)}(f) \ne 0$
(it is not difficult to see that such an f must exist), and if
$\sum _{|S|=t-1} T_{\Psi (S)}(f)> 0$
then replace f with
$-f$
. Henceforth we assume that t is odd.
Our strategy will be to find, for each
$S \subset [t]$
with
$|S|=t-1$
, a function
$f_S$
with
$\mathbb {E} f_S=0$
such that
$T_{\Psi (S)}(f_S)=0$
and
$T_{\Psi }(f_S) \ne 0$
. Then we let
$f = \bigotimes _{|S|=t-1} f_S$
and by (4) we see that
where
$T_\Psi (f) = \prod _{|S|=t-1} T_{\Psi }(f_S)\ne 0$
. Recall that t is odd so potentially replacing f with
$-f$
depending on the sign of
$T_\Psi (f)$
, we may conclude that for all
$\varepsilon $
sufficiently small (indeed, for all
$\varepsilon $
), we have
$T_\Psi (\alpha + \varepsilon f) < \alpha ^t$
, so
$\Psi $
is not weakly locally Sidorenko.
It remains to prove the existence of the functions
$f_S$
. Recall that each of the systems
$\Psi (S)$
has codimension one, so is described by a single equation. This equation cannot be of the form
$\sum _{i} (-1)^i x_i =0$
by our assumption that
$\Psi $
does not contain an additive tuple. We prove the existence of such an
$f_S$
in Proposition 3.2 and Proposition 3.3. The first deals with the easier case in which there is a pair of coefficients whose ratio is not equal to
$\pm 1$
, and the second deals with the remaining cases which are not additive tuples.
In the following and throughout, we use Vinogradov notation
$\ll $
and
$\gg $
to conceal a multiplicative constant. The appearance of parameters in the subscript denotes that this constant may depend on these parameters.
Proposition 3.2. Let
$\Psi $
be a codimension
$2$
linear system in t variables which is linearly generic. Let
$\Phi $
be determined by a single equation
$\sum _{i=1}^{l}a_ix_i=0$
, where each
$a_i$
is nonzero, such that there are
$i,j$
satisfying
$a_i \ne \pm a_j$
. Then for all
$n\geq 2$
, there exists
$f: \mathbb {F}_p^n \to [-1,1]$
with
$\mathbb {E}_x f (x)=0$
,
$T_{\Phi }(f) = 0$
and
$|T_\Psi (f)| \gg _t 1$
.
Proof. Without loss of generality
$a_1,a_2$
are such that
$a_1\ne \pm a_2$
. We construct f from its Fourier coefficients. Let
$M=M_\Psi =(b_{ij})$
be the
$2 \times t$
matrix whose coefficients correspond to the two linear equations of
$\Psi $
. Using (3) we have that
$$ \begin{align} T_{\Phi} (f) = \sum_{h \in \mathbb{F}_p^n} \prod_{i=1}^{l} \hat f(a_ih), \end{align} $$
and
$$ \begin{align} T_\Psi(f) = \sum_{h_1,h_2\in \mathbb{F}_p^n} \prod_{i=1}^{t}\hat f (b_{1i}h_1 + b_{2i}h_2). \end{align} $$
We claim that if
$h_1,h_2$
are linearly independent, then
$(u_1,\cdots , u_t) := M^T\binom {h_1}{h_2}$
satisfies
$u_i \ne 0$
for all i and
$a_1u_i \ne \pm a_2u_j$
for all
$i,j$
. Indeed, that
$u_i \ne 0$
follows from linear independence and the fact that M cannot have a zero column, and if
$a_1u_i = \pm a_2u_j$
, then by the linear independence of
$h_1,h_2$
we have
$a_1\binom {b_{1i}}{b_{2i}} =\pm a_2\binom {b_{1j}}{b_{2j}}$
which contradicts the fact that all
$2\times 2$
minors of M are full rank.
Choose any two linearly independent
$h_1,h_2$
and define
$u_1,\ldots ,u_t$
as above. Now define
$\hat f(\pm u_i)=1/(2t)$
for all i and
$\hat f(h)=0$
otherwise so that f is clearly real-valued and takes values in
$[-1,1]$
by Fourier inversion and the triangle inequality. Since
$u_i\ne 0$
for all i we have
$\hat f (0) = \mathbb {E}_xf(x) = 0$
. Next, since
$a_1 \ne \pm a_2$
and since
$a_1u_i \ne \pm a_2u_j$
for all
$i,j$
, we have that for all h at least one of
$\hat f (a_1h), \hat f (a_2h)$
is zero and so
$T_{\Phi }(f)=0$
from (5). Finally,
$T_\Psi (f) \geq 2/(2t)^t$
from (6).
In the following proposition we prove a more general statement than what is needed for the proof of Theorem 3.1. Below we consider equations of arbitrary length with
$\pm 1$
coefficients. For the theorem above, we just need to consider equations
$\sum _{i=1}^{t-1}a_ix_i=0$
of length
$t-1$
(which we recall may be assumed to be even), where
$a_i \in \pm 1$
. In this case, since
$t-1$
is even, we have for parity reasons that
$|\#\{a_i=1\} - \#\{a_i=-1\}|$
must be even, and so the condition that
$t \ne (2m+1)|\#\{a_i=1\} - \#\{a_i=-1\}|$
for all
$m\in \mathbb {Z}$
from the statement of the proposition below is automatically satisfied.
Proposition 3.3. Let
$\Psi $
be a codimension
$2$
linear system in t variables which is linearly generic. Let
$\Phi $
be determined by the equation
$\sum _{i=1}^{t'} a_ix_i=0$
, where each
$a_i\in \{-1,1\}$
and
$\#\{a_i=1\} \ne \#\{a_i=-1\}$
. If p is sufficiently large in terms of t and
$t\ne (2m+1)|\#\{a_i=1\} - \#\{a_i=-1\}|$
for any integer m, then there exists
$f: \mathbb {F}_p \to [-1,1]$
with
$\mathbb {E}_x f (x)=0$
, with
$T_{\Phi }(f) = 0$
and with
$|T_\Psi (f)| \gg _{p,t} 1$
.
Proof. Let
$V = \operatorname {\mathrm {Im}} M_\Psi ^T \leq \mathbb {F}_p^t$
. For
$i=1,\ldots , t$
, let
$P_i$
be the ith coordinate hyperplane (i.e.,
$\{x \in \mathbb {F}_p^t : x_i = 0\}$
). Let the symmetric group on t elements
$\operatorname {\mathrm {Sym}}(t)$
act on
$\mathbb {F}_p^t$
by coordinate permutation, and let
$(\mathbb {Z}/2\mathbb {Z})^t$
act on
$\mathbb {F}_p^t$
by reflection in each coordinate hyperplane (so, e.g.,
$(1,0,1,0,\ldots )\in (\mathbb {Z}/2\mathbb {Z})^t$
maps
$(u_1,u_2,u_3,u_4,\ldots )$
to
$(-u_1,u_2,-u_3,u_4,\ldots )$
). Let G be the group of permutations of
$\mathbb {F}_p^t$
generated by
$\operatorname {\mathrm {Sym}}(t)$
and
$(\mathbb {Z}/2\mathbb {Z})^t$
and let
$S = G - \{\operatorname {\mathrm {id}} , (1,1,\ldots ,1)\}$
.
We claim that
$V - \left ( \bigcup _{i=1}^t P_i \cup \bigcup _{\sigma \in S} \sigma V\right )$
is nonempty. Indeed,
$$\begin{align*}\left| V - \left( \bigcup_{i=1}^t P_i \cup \bigcup_{\sigma \in S} \sigma V\right)\right| \geq |V| - \sum_{i=1}^t |V\cap P_i| - \sum_{\sigma\in S}|V\cap \sigma V|,\end{align*}$$
where each
$V\cap P_i$
and
$V\cap \sigma V$
is a subspace of V, so has size
$1,p$
or
$p^2$
. By taking p sufficiently large in terms of t, it suffices to show that
$V \ne V\cap P_i$
and
$V \ne \sigma V$
for each
$i\in [t]$
and
$\sigma \in S$
. That
$V \ne V \cap P_i$
follows from the fact that M cannot have a zero column, and that
$V\ne \sigma V$
for every
$\sigma \in S$
is an easy exercise in linear algebra, using the fact that each
$2\times 2$
minor of M has full rank and that every element of G may be written in the form
$\sigma \tau $
for some
$\sigma \in \operatorname {\mathrm {Sym}}(t)$
and
$\tau \in (\mathbb {Z}/2\mathbb {Z})^t$
.
Let
$u :=(u_1,\ldots ,u_t) \in V - \left ( \bigcup _{i=1}^t P_i \cup \bigcup _{\sigma \in S} \sigma V\right )$
. We may assume that
$\#\{a_i = 1\}> \#\{a_i=-1\}$
so that by Fourier inverting
$$ \begin{align} T_{\Phi} (f) = \sum_{h \in \mathbb{F}_p^n} |\hat f(h)|^{2k}\hat f(h)^l, \end{align} $$
where
$t'=2k+l$
,
$k \geq 0$
,
$l= \#\{a_i = 1\} - \#\{a_i=-1\}>0$
. We have by construction that each
$\pm u_i$
is distinct and nonzero. Let
$\frac {1}{2} \leq c_1,\ldots , c_t \leq 1$
be real numbers to be chosen later and define
$\hat f(u_i):= c_i e^{\frac {\pi i}{2l}}/(2t)$
and
$\hat f(-u_i) = \overline {\hat f(u_i)}$
for all i. Set
$\hat f(h) = 0$
for all other h. Then
$\mathbb {E}_xf(x) = \hat f(0) = 0$
, f takes values in
$[-1,1]$
and
$\Re (\hat f(\pm u_i)^l) = 0$
for all i. Thus we have
$$\begin{align*}T_{\Phi} (f) = \Re\left( T_{\Phi} (f)\right) = \sum_{h \in \mathbb{F}_p^n} |\hat f(h)|^{2k}\Re(\hat f(h)^l)=0.\end{align*}$$
It remains to argue that
$|T_\Psi (f)| \gg _{p,t} 1$
. Fourier inverting we have
$$\begin{align*}T_\Psi(f) = \sum_{y_1,y_2\in \mathbb{F}_p}\prod_{i=1}^t \hat f(b_{1i}y_1 + b_{2i}y_2).\end{align*}$$
Let
$U=\{\pm u_1,\ldots , \pm u_t\}$
and let
$\mathcal {I}$
be the collection of multisubsets of
$[t]$
which have t elements. For a multisubset W of U with t elements, let
$i(W)\in \mathcal {I}$
be the multisubset of indices of W (e.g., if
$t=4$
we may have
$i([u_1,u_1,u_3,-u_1]) = [1,1,3,1]$
), and let
$\Sigma (W)$
be the sum of the signs in W (so
$\Sigma ([u_1,u_1,u_3,-u_1]) = 1+1+1-1=2$
). Then
$$ \begin{align} T_\Psi(f) = \sum_{I\in \mathcal{I}} \left( \sum_{W:i(W)=I} \#\left\lbrace y_1,y_2:M^T\binom{y_1}{y_2}=W\right \rbrace \cdot \frac{1}{(2t)^t}e^{\frac{\Sigma(W)\pi i}{2l}} \right) \prod_{i\in I}c_i, \end{align} $$
where we abuse notation by implicitly ‘unordering’ the vector
$M^T\binom {y_1}{y_2}$
and identifying it with the corresponding multiset. Viewing (8) as a polynomial in
$c_1,\ldots , c_t$
, we may choose
$c_1,\ldots , c_t$
in such a way as to force
$|T_\Psi (f)|\gg _{p,t} 1$
unless (8) is the zero polynomial. Note that when
$I=[t]$
, we have by our construction of
$u=(u_1,\ldots , u_t)$
that the only multisubsets W for which
$\#\{y_1,y_2:M^T\binom {y_1}{y_2}=W\}$
is nonzero are
$W=[u_1,u_2,\ldots , u_t]$
and
$W=[-u_1,-u_2,\ldots , -u_t]$
, and in these cases
$\#\{y_1,y_2:M^T\binom {y_1}{y_2}=W\}=1$
. Furthermore, for these W we have that
$\Sigma (W) = t$
and
$-t$
respectively. Therefore, the coefficient of the term
$\prod _{i=1}^t c_i$
in (8) is
$\frac {2}{(2t)^t}\Re \left (e^{\frac {t\pi i}{2l}}\right )$
, which is nonzero since t is not of the form
$(2m+1)l$
for any integer m. Thus we may choose
$c_1,\ldots , c_t$
so that
$|T_\Psi (f)|\gg _{p,t} 1$
, completing the proof.
Remark 3.4. It is not difficult to adapt the construction from the previous proposition to relax the assumption that
$t \ne (2m+1)l$
to the assumption that
$l\ne 1$
or t is even; one proceeds by defining
$\hat f(u_i):= c_i e^{\frac {(2n_i+1)\pi i}{2l}}$
for an appropriate choice of integers
$\{n_i\}_{i=1}^t$
. In fact, it seems likely that a modest perturbation of the above argument may allow us to remove any assumption of this kind completely. We do not pursue the matter in this document as we have no immediate need for the more general statement. We hope that the above example is illustrative.
3.2 Systems with additive tuples are not locally Sidorenko
Together with Theorem 3.1, the following completes the proof of Theorem 1.1.
Theorem 3.5. Let
$\Psi $
be be a system of two linear equations which is linearly generic. Then
$\Psi $
is not locally Sidorenko.
The main observation underpinning the upcoming construction is that the complexity of a system of linear forms is not maintained under Fourier inversion. We have not investigated the extent to which higher-order Fourier analysis may be applied in frequency space, but the upcoming construction (and natural generalisations thereof) at least demonstrates one application of the aforementioned observation.
In light of Theorem 3.1 we may assume that t is odd. We note that if
$\Psi (S)$
corresponds to an additive
$(t-1)$
-tuple, then by Fourier inversion,
$T_{\Psi (S)}$
is the (
$(t-1)$
th power of the)
$\ell ^{t-1}$
norm in frequency space. In particular, if
$T_{\Psi (S)}(f)=0$
then
$f=0$
, so the proof strategy from the previous subsection fails. We will have to settle for a function f for which the ratio
$T_{\Psi (S)}(f)/T_\Psi (f)$
is arbitrarily large in absolute value. The existence of such a function is proven in Proposition 3.7. First, we need a lemma.
Lemma 3.6. The following facts hold:
-
1. Define the quadratic
$q: \mathbb {F}_p^n \to \mathbb {F}_p$
by
$q(x) = x^TMx + h^T x$
, where M is a matrix of rank r and
$h \in \mathbb {F}_p^n$
. Then
$$\begin{align*}|\mathbb{E}_{x\in \mathbb{F}_p^n} e_p(q(x))|\leq p^{-r/2}.\end{align*}$$
-
2. Let
$\Psi $
be a system of linear forms, each mapping
$(\mathbb {F}_p^n)^D$
to
$\mathbb {F}_p^n$
. Let
$A \subset \mathbb {F}_p^n$
be the zero set of the quadratic defined by
$x^Tx$
. Then where k is the dimension of the
$$\begin{align*}|T_\Psi(1_A) - p^{-k}| \leq p^{-n/2},\end{align*}$$
$\mathbb {F}_p$
-span of the functions
$\{\psi _i(x)^T\psi _i(x)\}_{i=1}^t$
.
Sketch proof.
Part 1 is a very standard Gauss sum estimate; a proof may be found in [Reference GreenGre07], for example. In [Reference Gowers and WolfGW10, Proof of Theorem 3.1] it is shown (using Part 1 as the main ingredient) that if a system of linear forms
$\tilde \Psi := (\tilde \psi _i)_{i=1}^l$
has
$\{\tilde \psi _i(x)^T\tilde \psi _i(x)\}_{i\in [l]}$
linearly independent, then
$|T_{\tilde \Psi }(1_A) - p^{-l}| \leq p^{-n/2}$
. Let
$S\subset [t]$
be a set such that
$\{\psi _i(x)^T\psi _i(x)\}_{i\in S}$
is a basis for
$\{\psi _i(x)^T\psi _i(x)\}_{i=1}^t$
. Then we observe the equality
$\prod _{i\in S}1_A(\psi _i(x)) = \prod _{i\in [t]}1_A(\psi _i(x))$
for all x, so
$T_\Psi (1_A) = T_{\Psi (S)}(1_A)$
and our result then follows from the second sentence of this proof.
For a
$2 \times k$
matrix M with values in
$\mathbb {F}_p$
, let
$M^{(2)}$
be the
$3 \times k$
matrix defined by
$$\begin{align*}M^{(2)} := \begin{pmatrix} \boldsymbol{r_1}^2 \\ \boldsymbol{r_2}^2 \\ \boldsymbol{r_1}\boldsymbol{r_2} \end{pmatrix},\end{align*}$$
where
$\boldsymbol {r}_1$
and
$\boldsymbol {r}_2$
are the first and second rows of M respectively, and where multiplication of vectors is conducted coordinatewise. We claim that if M is linearly generic and
$k\geq 3$
then
$M^{(2)}$
has rank
$3$
. One may, for example, observe that if
$3$
values of
$\boldsymbol {r_1}$
are nonzero, then
$M^{(2)}$
contains a
$3\times 3$
Vandermonde determinant (up to rescaling columns). After similarly dealing with the case that
$3$
values of
$\boldsymbol {r_2}$
are nonzero, and noting that each of
$\boldsymbol {r_1}$
and
$\boldsymbol {r_2}$
cannot contain more than one zero entry, we may conclude immediately in the remaining case by a cofactor expansion along the row
$\boldsymbol {r_1r_2}$
(two of whose entries are zero).
Proposition 3.7. Let
$t \geq 5$
be odd. Let
$\Psi $
be a system of t linear forms whose image has codimension
$2$
and which is linearly generic. Let
$\Phi $
be the system defined by an additive
$(t-1)$
-tuple. Then, for all n sufficiently large in terms of p and t, there is a function
$f:\mathbb {F}_p^n \to [-1,1]$
such that
$\mathbb {E}_x f(x) = 0$
,
$|T_\Psi (f)| \gg _{p,t} p^{-n(t-4)/2}$
and
$T_{\Phi }(f) < p^{-n(t-3)/2}$
.
Proof. We construct f by specifying its Fourier transform
$\hat f: \mathbb {F}_p^n \to \mathbb {C}$
. Let A be the zero set of the quadratic form given by
$x^Tx$
. Define
$\hat f(0) =0$
and
$\hat f(h) = \frac {1}{p^{n/2}+1}(1_A(h)-p^{-1})$
otherwise. Firstly,
$T_{\Phi }(f) = \sum _h |\hat f(h)|^{t-1} < p^n \cdot \left (p^{-n/2}\right )^{t-1} = p^{-n(t-3)/2}$
.
It is clear that for all
$h \in \mathbb {F}_p^n$
we have
$\hat f(h) = \overline {\hat f(-h)}$
, and so f takes values in
$\mathbb {R}$
. Furthermore, for all
$x\ne 0$
:
$$ \begin{align*} |f(x)| &= \left|\sum_h \hat f(h)e_p(x \cdot h)\right|\\ &= \frac{1}{p^{n/2}+1}\left|\sum_h (1_A(h) - p^{-1})e_p(x \cdot h) - (1-p^{-1})\right| \\ &\leq \frac{1}{p^{n/2}+1}\left(\left|p^n \mathbb{E}_h \left(\mathbb{E}_{a\in \mathbb{F}_p} e_p(ah^Th)\right) e_p(x^T h)\right| + (1-p^{-1})\right)\\&\leq \frac{1}{p^{n/2}+1}\left(p^n\mathbb{E}_{a \in \mathbb{F}_p}\left| \mathbb{E}_h e_p(ah^Th+x^T h)\right| + (1-p^{-1})\right)\\ &\leq \frac{1}{p^{n/2}+1}\left( p^{n/2} + 1-p^{-1}\right) \\ &< 1, \end{align*} $$
by Fourier inverting in the first line, an expansion of
$1_A$
in terms of quadratic phases in the third line (this follows from the orthogonality of characters), and Lemma 3.6 Part 1 in the fourth. For
$x=0$
we have
$$\begin{align*}|f(0)| = \frac{1}{p^{n/2}+1}\left| \sum_h (1_A(h) - p^{-1}) - 1+p^{-1}\right| < 1,\end{align*}$$
by Lemma 3.6 Part 2 with the trivial set of linear forms
$x \mapsto (x)$
. Thus f takes values in
$[-1,1]$
.
By (3) and the condition that the image of
$\Psi $
has codimension 2 we have
where
$\Psi ^\perp $
denotes the dual set of linear forms which may be determined as in (3). It remains to show that
$|T_{\Psi ^\perp }(1_A-p^{-1})| \gg _{p,t} 1$
. We have
$$ \begin{align*} T_{\Psi^\perp}(1_A-p^{-1}) &= \sum_{S \subset [t]}(-p)^{-t+|S|}T_{\Psi^\perp(S)}(1_A)\\ &=\sum_{S\subset [t]}(-p)^{-t+|S|}\mathbb{E}_{x_1,x_2} \prod_{i\in S}1_0\left(\psi_i^\perp(x_1,x_2)^T\psi_i^\perp(x_1,x_2)\right). \end{align*} $$
Since the image of
$\Psi $
has codimension 2, we have that each
$\psi _i^\perp \in \Psi ^\perp $
maps
$(\mathbb {F}_p^n)^2 \to \mathbb {F}_p^n$
. Thus each
$\psi _i^\perp (x)^T\psi _i^\perp (x)$
maps
$x=(x_1,x_2) \in (\mathbb {F}_p^n)^2$
to a linear combination of
$\{x_1^Tx_1, x_1^Tx_2, x_2^Tx_2\}$
, so the dimension of the span of the functions
$\{\psi _i^\perp (x)^T\psi _i^\perp (x)\}_{i=1}^t$
is at most
$3$
. By Lemma 3.6,
One observes that
$\dim \operatorname {\mathrm {span}} \{\psi _i^\perp (x)^T\psi _i^\perp (x)\}_{i\in S} = \operatorname {\mathrm {rank}} M^{(2)}_S$
where
$M_S$
is the matrix obtained by restricting the columns of
$M_\Psi $
to the coordinates in S. We show that
$|T_\Psi (1_A-p^{-1})| \gg _{p,t} 1$
by showing that
$\sum _{S \subset [t]}(-p)^{-t+|S|}p^{-\operatorname {\mathrm {rank}} M^{(2)}_S}$
is nonzero. As discussed above, the fact that every
$2\times 2$
minor of
$M_{\Psi }$
has nonzero determinant implies that for each
$S \subset [t]$
with
$|S| \geq 3$
,
$\operatorname {\mathrm {rank}} M_S^{(2)} = 3$
. Furthermore it is clear that if
$|S|=1$
,
$2$
then
$\operatorname {\mathrm {rank}} M_S^{(2)} = 1$
,
$2$
respectively. Thus we have
$$ \begin{align*} \sum_{S \subset [t]}(-p)^{-t+|S|}p^{-\operatorname{\mathrm{rank}} M^{(2)}_S} &= p^{-3}\sum_{|S| \geq 3}(-p)^{-t+|S|} + (\binom{t}{2} - t + 1)(-p)^{-t} \\ &= \frac{(1-p)^{t} + p^3(\binom{t}{2} - t + 1) - (p^2\binom{t}{2} - pt + 1)}{(-p)^{t}p^{3}}. \end{align*} $$
We claim that the numerator is nonzero for pairs
$(p,t)$
with p prime and
$t\geq 5$
. Indeed, for
$p=2$
and t odd we obtain a quadratic whose roots satisfy
$t<5$
. The same is true of
$p=2$
and t even. Otherwise, for large values of
$(p,t)$
it is clear that the term
$(1-p)^t$
dominates, and the claim may be checked by direct computation for small values of
$(p,t)$
(the author obtained a bona fide proof when
$t \geq 11$
, or
$t \in \{5,7,9\}$
and
$p\geq 11$
, and checked the cases
$(p,t)\in \{3,5,7\}\times \{5,7,9\}$
by hand; for the sake of brevity we do not include the details).
Remark 3.8. In the above proposition, the assumption that
$\Psi $
is linearly generic may be removed quite easily, at least if one allows p to be sufficiently large in terms of t. Indeed one writes
$\sum _{S \subset [t]}(-p)^{-t+|S|}p^{-\operatorname {\mathrm {rank}} M^{(2)}_S}$
as a p-adic series
$\sum _{i=-t}^{-1} a_ip^i$
. Each
$|a_i|$
may be bounded in terms of t, so if p is large enough in terms of t then this series representation is unique and so the sum is zero if and only if each
$a_i$
is zero. One may use some linear algebra to show that at least one of the
$a_i$
must be nonzero. It is certainly plausible that the more general result may be used in the further classification of Sidorenko systems in a directly analogous way to how Proposition 3.7 is used here, but we have no need for such a statement in this document and so we do not write out the details.
Proof of Theorem 3.5.
Recall that we may assume that t is odd. As in the proof of Theorem 3.1, using (2), we have
As we have seen already, each
$\Psi (S)$
has codimension one and so
$T_{\Psi (S)}(f)$
is of the form
$\sum _h \prod _{i=1}^{t-1} \hat f(a_i h)$
. By Hölder’s inequality then
where we let
$\Phi $
denote an additive
$(t-1)$
-tuple. Thus
Let f be the function from Proposition 3.7, and if
$T_\Psi (f)> 0$
then replace f with
$-f$
, so
For
$\varepsilon , \alpha $
fixed, we may choose n large enough so that
$T_\Psi (\alpha + \varepsilon f) < \alpha ^t$
, completing the proof.
This also completes the proof of Theorem 1.1, which is a union of Theorem 3.1 and Theorem 3.5.
4 Examples
4.1 Locally Sidorenko but not Sidorenko
Recall that a system
$\Psi $
comprising t linear forms is locally Sidorenko if for all
$\alpha \in [0,1]$
there exists
$\varepsilon>0$
such that for all
$n\geq 1$
and all
$f : \mathbb {F}_p^n \to [-\alpha , 1-\alpha ]$
with
$\mathbb {E} f = 0$
we have
$T_\Psi (\alpha + \varepsilon f) \geq \alpha ^t$
. We begin by providing the following example of a system which is locally Sidorenko but not Sidorenko. This proves Proposition 1.6.
Example 4.1. Let
$\Phi $
denote the following system of two linear equations in nine variables:
We firstly observe that
$\Phi $
is not Sidorenko. Indeed, let
$A = \{ 1\}\subset \mathbb {F}_p$
. Then
$\mathbb {E}_{x \in \mathbb {F}_p}1_A(x) = 1/p$
, but A contains no solutions to
$\Phi $
, so
$T_\Phi (1_A)=0$
.
We claim that
$\Phi $
is locally Sidorenko. Let
$\operatorname {\mathrm {A4}}$
be shorthand for the first equation in
$\Phi $
and
$\operatorname {\mathrm {A5}}$
for the second. We observe (see Observation 2.6) that for a function f on
$\mathbb {F}_p^n$
we have
$T_\Phi (f) = T_{\operatorname {\mathrm {A4}}}(f)T_{\operatorname {\mathrm {A5}}}(f)$
. Also, since
$\operatorname {\mathrm {A4}}$
has codimension one, we get
$T_{\operatorname {\mathrm {A4}}}(\alpha + f)=\alpha ^4 + T_{\operatorname {\mathrm {A4}}}(f)$
for functions f with
$\mathbb {E} f=0$
, and similarly for
$\operatorname {\mathrm {A5}}$
. Thus, for
$f:\mathbb {F}_p^n \to [-\alpha , 1-\alpha ]$
with
$\mathbb {E} f=0$
, we see using (1) that
$$ \begin{align*} T_\Phi(\alpha + \varepsilon f) &= (\alpha^4+T_{\operatorname{\mathrm{A4}}}(\varepsilon f))(\alpha^5+T_{\operatorname{\mathrm{A5}}}(\varepsilon f)) \\ &= \alpha^9 + \varepsilon^4\left(\alpha^5 T_{\operatorname{\mathrm{A4}}}(f) + \alpha^4\varepsilon T_{\operatorname{\mathrm{A5}}}(f) + \varepsilon^5T_{\operatorname{\mathrm{A4}}}(f)T_{\operatorname{\mathrm{A5}}}(f)\right). \end{align*} $$
Fourier-inverting, we have that
$T_{\operatorname {\mathrm {A4}}}(f) = \sum _h |\hat f(h)|^4$
and
$T_{\operatorname {\mathrm {A5}}}(f)=\sum _{h}|\hat f(h)|^4 \hat f(h)$
. In particular we have that
$|T_{\operatorname {\mathrm {A5}}}(f)| \leq T_{\operatorname {\mathrm {A4}}}(f) \leq 1$
. Thus,
and so setting
$\varepsilon \leq \alpha /2$
we have that
$T_\Phi (\alpha + \varepsilon f)\geq \alpha ^9$
, so
$\Phi $
is indeed locally Sidorenko.
4.2 Weakly locally Sidorenko but not locally Sidorenko
Recall that a system
$\Psi $
comprising t linear forms is weakly locally Sidorenko if for all
$n\geq 1$
, for all
$\alpha \in [0,1]$
and all
$f:\mathbb {F}_p^n \to [-\alpha ,1-\alpha ]$
with
$\mathbb {E} f =0$
, there exists
$\varepsilon _f>0$
such that for all
$\varepsilon < \varepsilon _f$
we have
$T_\Psi (\alpha + \varepsilon f) \geq \alpha ^t$
.
We claimed earlier that it is easy to contrive systems of complexity at least two which are weakly locally Sidorenko, but not Sidorenko. One may, for example, begin with a system of complexity at least two whose shortest subsystems have length at least
$5$
, and take the union of this system with an additive
$4$
-tuple on a separate variable set. The weak local Sidorenko property follows from the fact that its smallest subsystem is an additive
$4$
-tuple, and the (non) Sidorenko property of the entire systems essentially follows from that of the complexity two part by tensoring with an appropriate quadratic phase function. We omit the details.
In this subsection we exhibit a complexity one, linearly generic system which is weakly locally Sidorenko, but not locally Sidorenko, proving Corollary 1.5. The upcoming example is taken from [Reference Kamčev, Liebenau and MorrisonKLM23, Example 4.6], as is most of the analysis towards showing that it is weakly locally Sidorenko. That it is not locally Sidorenko follows from Theorem 1.1.
Example 4.2. Let
$p \not \in \{2,3\}$
and let
$\Phi $
be the following system of two equations in five variables:
One sees that
$\Phi $
is linearly generic. By Theorem 1.1,
$\Phi $
is not locally Sidorenko. We claim that
$\Phi $
is weakly locally Sidorenko. For f with
$\mathbb {E} f=0$
we have from (2) that
so it suffices to show that
$\sum _{|S|=4}T_{\Phi (S)}(f)>0$
whenever
$f \ne 0$
. One sees that equations for the five subsystems S of size
$4$
may be obtained by performing row operations to eliminate the variable
$x_i$
,
$i=1,\ldots , 5$
. Therefore, applying (3) to each of these subsystems, and recalling that
$T_{\Phi (S)}(f)$
is real-valued, we may obtain that
$$ \begin{align*} \sum_{|S|=4}T_{\Phi(S)}(f) &= \sum_{h} |\hat f(h)|^4 + 2|\hat f(h)|^2|\hat f(2h)|^2 + 2\hat f(-h)\hat f(2h)^2 \hat f(-3h) \\ &= \sum_{h} \frac{1}{2}|\hat f(2h)|^4+ \frac{1}{2}|\hat f(3h)|^4+ 2|\hat f(h)|^2|\hat f(2h)|^2 \\ & \qquad \qquad \qquad \qquad \qquad \qquad + 2\Re \left(\hat f(-h)\hat f(2h)^2 \hat f(-3h)\right)\\&\geq \sum_h |\hat f(-2h)\hat f(3 h)|^2 + 2|\hat f(h)|^2|\hat f(2h)|^2 \\[4pt] & \qquad \qquad \qquad \qquad \qquad \qquad + 2\Re \left(\hat f(-h)\hat f(2h)^2 \hat f(-3h)\right)\\[4pt] &= \sum_h \left|\hat f(h)\hat f(-2h) + \overline{\hat f(-2h)\hat f(3h)}\right|{}^2 + |\hat f(h)|^2|\hat f(2h)|^2\\[4pt] &\geq \sum_h |\hat f(h)|^2|\hat f(2h)|^2. \end{align*} $$
This is positive unless
$|\hat f(h)|^2|\hat f(2h)|^2=0$
for all h. But in this case
$\sum _{|S|=4}T_{\Phi (S)}(f) = \sum _h |\hat f(h)|^4$
, which is positive unless f is zero. This completes the proof of the claim that f is weakly locally Sidorenko.
4.3 Locally Sidorenko, not Sidorenko, and no additive tuple
Recall that if
$\Phi $
comprises only an additive
$(2k)$
-tuple, the corresponding functional
$T_{\Phi }$
is a norm on the space of functions and in particular is positive definite:
$T_{\Phi }(f) = 0 \implies f =0 \implies T_{\Psi }(f)=0$
, for any
$\Psi $
. This is the main obstruction to the proof of Theorem 3.1 going through for systems with an additive
$(2k)$
-tuple. It transpires that there exists a nonlinearly generic
$\Psi $
with a subsystem
$\Phi $
whose corresponding functional is not positive definite but with the property that
$T_{\Phi }(f) = 0 \implies T_{\Psi (S)}(f)=0$
for all subsystems
$\Psi (S) \subseteq \Psi $
. This motivates the upcoming example.
Example 4.3. Let
$\Phi $
be the following system of two equations in eight variables:
We claim that
$\Phi $
is locally Sidorenko (and it clearly does not contain any additive tuples). Fix
$\alpha \in (0,1)$
and let
$f: \mathbb {F}_p^n \to [-\alpha , 1-\alpha ]$
. Firstly invoke (1) to write
$T_\Phi (\alpha + \varepsilon f)$
in terms of the subsystems of
$\Phi $
. These subsystems may be computed by parameterising the solution space to
$\Phi $
. Alternatively, one sees that they may be read off the defining equations for
$\Phi $
after performing row operations (recall that to identify subsystems which contribute nontrivially, we simply need to find those coordinate restrictions whose image do not contain a coordinate hyperplane; see remarks after (1)). Doing so, and invoking (3) to pass to frequency space, one finds that
$$ \begin{align*} T_\Phi(\alpha + \varepsilon f) = \alpha^8 &+ \alpha^4\varepsilon^4 \sum_h |\hat f(h)|^2|\hat f(2h)|^2 \\[4pt] &+ \alpha\varepsilon^7 \sum_h |\hat f(h)|^2|\hat f(2h)|^2 \hat f(-2h)\hat f(h)\hat f(-3h)\\[4pt] &+ \alpha \varepsilon^7 \sum_h |\hat f(h)|^2|\hat f(2h)|^2 \hat f(2h)\hat f(3h)\hat f(-h)\\[4pt] &+ \alpha \varepsilon^7 \sum_h |\hat f(h)|^2|\hat f(2h)|^2 \hat f(-h)\hat f(-3h)\hat f(-4h)\\[4pt] &+ \alpha \varepsilon^7 \sum_h |\hat f(h)|^2|\hat f(2h)|^2 \hat f(3h)\hat f(h)\hat f(4h)\\[4pt] &+ \varepsilon^8 \sum_{h_1,h_2} |\hat f(h_1)|^2|\hat f(2h_1)|^2 \\[4pt] & \qquad \qquad \cdot \hat f(h_1+h_2)\hat f(h_1-h_2)\hat f(h_1+2h_2)\hat f(h_1-2h_2). \end{align*} $$
The absolute value of the final sum is at most
by the bound
$||\hat f||_\infty \leq 1$
. Applying Cauchy–Schwarz to the sum over
$h_2$
and subsequently Parseval’s identity yields an upper bound of
$\sum _{h}|f(h)|^2|f(2h)|^2$
. Thus
$$\begin{align*}T_\Phi(\alpha + \varepsilon f) \geq \alpha^8 + \varepsilon^4 \left(\sum_h |\hat f(h)|^2 |\hat f(2h)|^2\right)\left(\alpha^4 - 4\alpha \varepsilon^3 - \varepsilon^4\right).\end{align*}$$
Thus setting
$\varepsilon \leq \alpha /2$
, we have that
$T_\Phi (\alpha + \varepsilon f)\geq \alpha ^8$
, and so
$\Phi $
is locally Sidorenko.
We note that the previous example is not Sidorenko, so it gives another, albeit less simple, example verifying Proposition 1.6.
5 Localising to subsystems with fewest variables
In this section we observe how to ‘localise’ to subsystems with the fewest variables. As an application we outline a relatively short proof of the recent result of Kamčev–Liebenau–Morrison [Reference Kamčev, Liebenau and MorrisonKLM24] and independently Versteegen [Reference VersteegenVer21] that any system containing a four-term arithmetic progression (or indeed any system of two independent equations in four variables) is uncommon. Our proof does not use any quadratic methods, which is also new here.
Previously we have localised by considering small perturbations of a constant function
$\alpha + \varepsilon f$
, whereupon, defining s to be the size of smallest subsystem of
$\Psi $
for which some
$T_{\Psi (S)}(f)$
is nonzero, we have
For fixed positive integers n, k and
$f:\mathbb {F}_p^n \to [-1,1]$
, define the function
$f^{(k)}:\mathbb {F}_p^{n+k}\to [-1,1]$
by
$f^{(k)} := f\otimes 1_0$
where
$1_0$
is the characteristic function for
$0\in \mathbb {F}_p^k$
(i.e.,
$f^{(k)}=(f\circ \pi _1)(1_0\circ \pi _2)$
, where
$\pi _1$
projects onto the first n coordinates,
$\pi _2$
projects onto the final k coordinates). Then for an injective system
$\Psi =(\psi _1,\ldots ,\psi _t)$
in D variables, we have recalling (4) that
Thus, expanding
$T_\Psi (\alpha + f\otimes 1_0)$
multilinearly and letting k be large we have that
where
$\dim \Psi (S)$
denotes the dimension of the image of
$\Psi (S)$
and d is the minimal such quantity for a subsystem for which
$T_{\Psi (S)}(f)$
is nonzero.
Now we outline an alternative proof of the fact that any system containing a
$4$
AP is uncommon. In fact we will prove the stronger result (as [Reference Kamčev, Liebenau and MorrisonKLM24] did) that any system containing a rank two system with four linear forms is uncommon. We note that some parts of the argument remain the same. In particular, we need the result that rank 2 systems in 4 variables can have
$T_\Psi (f)< 0$
. Rather than reproving it we just quote the following weak version of [Reference Kamčev, Liebenau and MorrisonKLM24, Lemma 4.1] whose proof we note does not use higher-order methods.
Lemma 5.1 [Reference Kamčev, Liebenau and MorrisonKLM24, Lemma 4.1].
Let p be a prime. There exists an integer
$n\geq 1$
such that the following is true. Let
$(\Psi _i)_{i=1}^k$
be a collection of linear systems each in four variables with codimension two. Suppose that
$\Psi _i$
is linearly generic for all i. Then there exists
$f: \mathbb {F}_p^n \to [-\frac {1}{2}, \frac {1}{2}]$
with
$\mathbb {E} f =0$
such that
$T_{\Psi _i}(f) < 0$
for
$i=1,\ldots , k$
.
Theorem 5.2 [Reference Kamčev, Liebenau and MorrisonKLM24, Theorem 1.1], generalisation of [Reference VersteegenVer21, Theorem 1.4].
Any linear system of distinct, nonzero forms which contains a dimension 4 subsystem of rank two is uncommon.
Proof. Let t be the number linear forms in
$\Psi $
and let
$f:\mathbb {F}_p^n \to [-\frac {1}{2},\frac {1}{2}]$
have
$\mathbb {E} f=0$
and be otherwise unspecified for the meantime. Let
$k\geq 0$
to be determined later,
$\varepsilon = p^{-k}$
and
$1_0:\mathbb {F}_p^k \to \{0,1\}$
be the characteristic function of zero. Then
$$ \begin{align*} & T_\Psi\left(\frac{1}{2} + (\varepsilon f)\otimes 1_0\right) + T_\Psi\left(\frac{1}{2}-(\varepsilon f)\otimes 1_0\right) \\ &\qquad \qquad \qquad \qquad \qquad= \frac{1}{2^{t-1}} + \sum_{\substack{\emptyset \ne S \subset [t] \\ |S| \text{ even}}}\frac{1}{2^{t-|S|-1}}p^{-k(|S|+\dim \Psi(S))}T_{\Psi(S)}(f). \end{align*} $$
Letting k be arbitrarily large, we see it suffices to show that there is a choice of f such that
$\sum _{S:\dim \Psi (S)+|S| \text { minimal}} T_{\Psi (S)}(f) < 0$
, where the minimum is taken over subsystems whose image does not contain a coordinate hyperplane (if the image contains a coordinate hyperplane then
$T_{\Psi (S)}(f) = 0$
since
$\mathbb {E} f = 0$
). Note that this minimum is at most
$6$
since
$\Psi $
contains a system of four nonzero linear forms with codimension two, and for the minimum we must have that
$\dim \Psi (S)<|S|$
.
If the minimum is equal to
$6$
, then the minimal systems
$\Psi (S)$
must have
$|S|=4, \dim \Psi (S))=2$
. If this is the case then we must have that
$\Psi (S)$
is linearly generic for all of these minimal systems, or else by an easy linear algebraic check they would give rise to a further subsystem
$S'$
with
$|S'| + \dim \Psi (S')$
smaller. Thus we may conclude by invoking Lemma 5.1 (with an appropriate choice of n) in this case.
Otherwise any
$\Psi (S)$
contributing to the minimum must have
$\dim \Psi (S)=1$
, and indeed cannot have
$|S|=4$
since any system
$\Psi (S)$
with
$|S|=4, D=1$
gives rise to
$\binom {4}{2}$
subsystems
$\Psi (S_i)$
with
$\dim \Psi (S_i)=1, |S_i|=2$
. In the remaining case that the minimal systems have
$\dim \Psi (S)=1, |S|=2$
we note that these systems are described by a single equation which cannot have its two coefficients summing to zero since this would mean that
$\Psi $
has a pair of repeated forms. Thus we may conclude by the Fox–Pham–Zhao random Fourier sampling argument to find f such that
$\sum _{S:\dim \Psi (S)+|S| \text { minimal}} T_{\Psi (S)}(f) < 0$
.Footnote
3
This completes the proof.
Remark 5.3. Kamčev–Liebenau–Morrison conjecture [Reference Kamčev, Liebenau and MorrisonKLM24, Conjecture 6.1] that an analogue of Lemma 5.1 holds for systems comprising an even number of linear forms whose image has codimension two. They note that their proof of [Reference Kamčev, Liebenau and MorrisonKLM24, Theorem 1.1] however does not generalise to the case of higher number of variables, even if [Reference Kamčev, Liebenau and MorrisonKLM24, Conjecture 6.1] is confirmed. We note that our proof above would generalise should [Reference Kamčev, Liebenau and MorrisonKLM24, Conjecture 6.1] be confirmed. This would go some way towards the classification of weakly locally common systems.
Remark 5.4. Since this document was released on the arXiv, the generalisation suggested in Remark 5.3 has been carried out using the methods of this chapter in both [Reference Altman and LiebenauAL25] and [Reference Dong, Li and ZhaoDLZ24].
Acknowledgements
The author thanks Ben Green for drawing his attention to the study of the local Sidorenko property in the graph-theoretic setting (in particular [Reference LovászLov11]) and for valuable feedback on an earlier version of this document. We also thank Julia Wolf for valuable feedback on an earlier version of this document and an anonymous referee for a careful reading and feedback.
Competing interests
The author has no competing interests to declare.




