1. Introduction
The moduli space
$\mathcal{M}_{0,n}$
of
$n$
labeled points on the projective line
${\mathbb{P}}^1$
plays a prominent role in algebraic geometry and its interactions with combinatorics. It is equally important in physics, where it is used, for example, in the CHY model [Reference Cachazo, He and YuanCHY14] to compute scattering amplitudes. The space
$\mathcal{M}_{0,n}$
is a very affine variety of dimension
$n-3$
, with coordinates given by the matrix

More abstractly,
$\mathcal{M}_{0,n}$
is the quotient of the open Grassmannian
$\textrm {Gr}(2,n)^o$
by the action of the torus
$({\mathbb{C}}^*)^n$
. We write
$p_{ij}$
for the Plücker coordinates on
$\textrm {Gr}(2,n)$
. These are the
$2 \times 2$
subdeterminants of
$X$
, and being in
$\textrm {Gr}(2,n)^o$
means that
$p_{ij} \not=0$
for all
$1 \leq i \lt j \leq n$
.
A basic ingredient in the CHY model is the following scattering potential on
$\mathcal{M}_{0,n}$
:

The coefficients
$s_{ij}$
are known as Mandelstam invariants. Using the conventions that
$\,s_{ii} = 0$
and
$s_{ji} = s_{ij}$
, the Mandelstam invariants must satisfy the momentum conservation relations

These relations ensure that
$L$
is well-defined on
$\mathcal{M}_{0,n} = \textrm {Gr}(2,n)^o/({\mathbb{C}}^*)^n$
, for any branch of the logarithm function. The following result on the critical points of
$L$
is well-known.
Proposition 1.1.
For a general choice of
$s_{ij}$
, the scattering potential
$L$
has
$(n-3)!$
complex critical points on
$\mathcal{M}_{0,n}$
. If the
$s_{ij}$
are real numbers, then all
$(n-3)!$
critical points are real.
We refer to [Reference Sturmfels and TelenST21, § 2] for a proof, computational aspects, and a statistics perspective.
Example 1.2 (
$n=4$
). Up to an additive constant
$s_{23} \cdot \textrm {log}(-1)$
, we have

This is the log-likelihood function for coin flips with bias
$x_1$
, when heads, respectively tails, were observed
$s_{13}$
, respectively
$s_{23}$
, times. The unique critical point of
$L$
is the maximum likelihood (ML) estimate:

For
$n \geq 5$
there is no simple formula because the ML degree is
$(n-3)!$
. Minimal kinematics [Reference Cachazo and EarlyCE21, Reference EarlyEarly] provides an attractive alternative. The idea is to set a few
$s_{ij}$
to zero until we reach a model of ML degree
$1$
[Reference Agostini, Brysiewicz, Fevola, Kühne, Sturmfels and TelenABFKST23, Reference Duarte and MariglianoDMS21, Reference HuhHuh14]. This yields nice rational functions, such as (5), for all
$n$
.
Example 1.3 (
$n=6$
). One choice of minimal kinematics is the codimension
$3$
subspace defined by
$s_{24} = s_{25} = s_{35} = 0$
. After this substitution, the scattering potential (2) becomes

Our task is to solve the scattering equations
$\nabla L_T=0$
. Explicitly, these equations are

This system is easily solved, starting from the end. Back-substituting gives the critical point:

The key observation is that the numerators and denominators are products of linear forms with positive coefficients. This is characteristic of all models of ML degree
$1$
, thanks to a theorem of Huh [Reference HuhHuh14]. This goes back to Kapranov [Reference KapranovKap91], who coined the term Horn uniformization. We seek to find all Horn uniformizations of the moduli space
$\mathcal{M}_{0,n}$
. Assuming the representation in (1), the solution is given by graphs called
$2$
-trees. Their vertices are indexed by
$[n-1] = \{1,2,\ldots ,n-1\}$
. The following is our first main result in this article.
Theorem 1.4.
Choices of minimal kinematics on
$\mathcal{M}_{0,n}$
are in bijection with 2-trees on
$[n{-}1]$
.
Here, we use the following formal definition of minimal kinematics. Fix the set of index pairs
$(i,j)$
in (2) whose corresponding
$ 2 \times 2$
minor in the matrix
$X$
is non-constant. This is

For any subset
$T$
of
$S$
, we restrict to the kinematic subspace where
$s_{ij}=0$
for all
$ij \in S\setminus T$
:

We say that
$T$
exhibits minimal kinematics for
$\mathcal{M}_{0,n}$
if the function (7) has exactly one critical point, which is hence rational in the
$s_{ij}$
, and
$T$
is inclusion-maximal with this property. With this definition, the minimal kinematics for
$\mathcal{M}_{0,6}$
in Example 1.3 is exhibited by the subset

The proof of Theorem1.4 is given in § 2. Section 3 features the Horn uniformization. Formulas for the unique critical point of
$L_T$
are given in Theorem3.1 and Corollary 3.3.
In § 4 we introduce an amplitude
$m_T$
for any 2-tree
$T$
, and we compute
$m_T$
in Theorem4.1. The following example explains why we use the term amplitude. It is aimed at readers from physics who are familiar with the biadjoint scalar amplitude
$m_n$
; see [Reference Cachazo, He and YuanCHYB14, § 3]. We recall that
$m_n$
is the integral of the Parke–Taylor factor
$1/(p_{12} p_{23} \cdots p_{n-1,n} p_{n,1})$
over the moduli space
$\mathcal{M}_{0,n}$
, localized to the solutions to the scattering equations
$\nabla L=0$
.
Example 1.5 (
$n=6$
). The biadjoint scalar amplitude for
$\mathcal{M}_{0,6}$
is the rational function

Here
$s_{ijk} = s_{ij} + s_{ik} + s_{jk}$
. In the physics literature this amplitude is usually denoted
$ m(\mathbb{I}_6,\mathbb{I}_6)$
, where
$\mathbb{I}_6 = (123456)$
is the standard cyclic order. See also [Reference de la CruzCruz23, (2.7)] or [Reference Sturmfels and TelenST21, (23)].
The
$14$
summands in (9) correspond to the vertices of the
$3$
-dimensional associahedron. The formula is unique since the nine planar kinematic invariants which appear form a basis of the dual kinematic space. With an eye towards more general situations, to achieve a unique formula for
$m_n$
modulo the relations (3), we may also use the basis
$S$
from (6). Thus, we set

We now restrict to the minimal kinematics in Example 1.3. For
$s_{24} = s_{25} = s_{35} = 0$
, we find

Up to relabeling, this is the amplitude
$m_{T_1}$
we associate with the 2-tree
$T_1$
in (10) below. This illustrates Theorem 4.1. The factors are explained by the Horn matrix
$H_{T_1}$
in Example3.2. We note that, modulo momentum conservation (3), our amplitude simplifies to

In § 5 we venture into territory that is of great significance for both algebraic geometry and particle physics. Structures we attach to 2-trees generalize naturally to the hypertrees of Castravet and Tevelev [Reference Castravet and TevelevCT13]. By [Reference TevelevTev, Lemma 9.5.(3)], hypertrees are equivalent to the on-shell diagrams of Arkani-Hamed, Bourjaily, Cachazo, Postnikov and Trnka [Reference Arkani-Hamed, Bourjaily, Cachazo, Postnikov and TrnkaABCPT15]. Every hypertree
$T$
has an associated amplitude
$m_T$
. This rational function does not admit a Horn formula like that in (13) for 2-trees
$T$
. Indeed, now the ML degree for
$T$
is larger than
$1$
. Here, we aim to reach minimal kinematics by restricting to subspaces of the kinematic space. This is seen in Example 5.8. The paper concludes with questions for future research.
2. 2-trees
We begin by defining the class of graphs referred to in Theorem1.4. A 2-tree is a graph
$T$
with
$2n-5$
edges on the vertex set
$[n-1] = \{1,2,\ldots ,n-1\}$
, which can be constructed inductively as follows. We start with the single edge graph
$\{12\}$
, which is the unique 2-tree for
$n-1=2$
. For
$k=3,4,5,\ldots ,n-1$
, we proceed inductively as follows: we select an edge
$ij$
whose vertices
$i$
and
$j$
are in
$\{1,\ldots ,k-1\}$
, and we introduce the two new edges
$ik$
and
$jk$
. There are
$2k-5$
choices at this stage, so our process leads to
$\,(2n-7)!! = 1 \cdot 3 \cdot 5 \cdot 7 \cdots (2n-7)\,$
distinct 2-trees. Of course, many pairs of these 2-trees will be isomorphic as graphs.
The number of 2-trees up to isomorphism appears as the entry A054581 in the Online Encyclopedia of Integer Sequences (OEIS). That sequence begins with the counts

For instance, there are
$\textbf { 2}$
distinct unlabeled 2-trees for
$n=\textbf { 6}$
. Representatives are given by

We identify each 2-tree
$T$
with a subset of the set
$S$
in (6), by removing the initial edge
$12$
. This thus specifies a function (7). For example, the 2-tree
$T_1$
in (10) is identified with
$T$
in (8).
Theorem1.4 has two directions. First, (7) has only one critical point when
$T$
is a 2-tree. Second, every maximal graph with this property is a 2-tree. We begin with the first direction.
Lemma 2.1. Every 2-tree exhibits minimal kinematics.
Proof.
Let
$m=2n-6$
, and identify the coordinates on
$({\mathbb{C}}^*)^m$
with the pairs in
$T$
. We write
$X_T$
for the
$(n-3)$
-dimensional subvariety of
$({\mathbb{C}}^*)^m$
that is parametrized by the
$2 \times 2$
-minors
$p_{ij}$
with
$(i,j) \in T$
. Thus,
$X_T$
is a very affine variety, defined as the complement of an arrangement of
$m$
hyperplanes in
${\mathbb{C}}^{n-3}$
. The scattering potential (7) is the log-likelihood function for
$X_T$
. The number of critical points, also known as the ML degree, equals the signed Euler characteristic of
$X_T$
; see [Reference Agostini, Brysiewicz, Fevola, Kühne, Sturmfels and TelenABFKST23, Reference Huh and SturmfelsHS14, Reference Sturmfels and TelenST21]. Therefore, our claim says that
$|\chi (X_T)| = 1$
.
To prove this, we use the multiplicativity of the Euler characteristic. This states that, for any fibration
$f: E \to B$
, with fiber
$F$
, the following relation holds:
$\chi (E)=\chi (F) \cdot \chi (B)$
.
We proceed by induction on
$n$
. The base case is
$n=4$
, where
$T = \{(1,3),(2,3)\}$
. This set represents the unique
$2$
-tree on
$[3]$
, which is the triangle graph with vertices
$1,2,3$
. Here, the very affine variety is the affine line
${\mathbb{C}}^1$
with two points removed. In symbols, we have

This punctured curve satisfies
$\chi (X_T)=-1$
, so the base case of our induction is verified.
We now fix
$n \geq 4$
, and we assume that
$X_T \subset ({\mathbb{C}}^*)^{2n-6}$
has Euler characteristic
$\pm 1$
for all 2-trees
$T$
with
$n-1$
vertices. Note that
$\textrm {dim}(X_T) = n-3$
. Let
$T^{\prime}$
be any 2-tree with
$n$
vertices. The vertex
$n$
is connected to exactly two vertices
$i$
and
$j$
. Suppose
$i \lt j$
. The associated very affine variety
$X_{T^{\prime}}$
lives in
$({\mathbb{C}}^*)^{2n-4}$
. Note that
$\textrm {dim}(X_{T^{\prime}}) = n-2$
.
We write the coordinates on
$({\mathbb{C}}^*)^{2n-4}$
as
$(\,p,\,p_{in}, p_{jn})$
, where
$p \in ({\mathbb{C}}^*)^{2n-6}$
. Consider the the map
$\,\pi \,:\,({\mathbb{C}}^*)^{2n-4} \rightarrow ({\mathbb{C}}^*)^{2n-6}, \,(\,p,\,p_{in}, p_{jn}) \mapsto p\,$
which deletes the last two coordinates.
Let
$T$
be the 2-tree without the vertex
$n$
and the two edges
$in$
and
$jn$
. Then
$ij$
is an edge of
$T$
, so
$p_{ij}$
is among the coordinates of
$p$
. The restriction of
$\pi$
to
$X_{T^{\prime}}$
defines a fibration

Indeed, the fiber over
$p \in X_T$
equals

This is the punctured line above, i.e.
$F \simeq \mathcal{M}_{0,4}$
. We know that
$\chi (F)=-1$
. Using the induction hypothesis, and the multiplicativity of the Euler characteristic, we conclude that

This means that the 2-tree
$T^{\prime}$
exhibits minimal kinematics, and the lemma is proved.
From the induction step in the proof above, we also see how to write down the three-term linear equations that define
$X_T$
. We display these linear equations for the 2-trees with
$n=6$
.
Example 2.2 (
$n=6$
). Consider the two 2-trees in (10). Each of them defines a very affine threefold of Euler characteristic
$-1$
. Explicitly, these two threefolds are given as follows:

We now come to the converse direction, which asserts that 2-trees are the only maximal graphs
$T$
satisfying
$\chi (X_T) = \pm 1$
. This will follow from known results on graphs and matroids.
Proof of Theorem 1.4
. Consider any arrangement of
$m+1$
hyperplanes in the real projective space
${\mathbb{P}}^d$
such that the intersection of all hyperplanes is empty. This data defines a matroid
$M$
of rank
$d+1$
on
$m+1$
elements. The complement of the hyperplanes is a very affine variety
$X$
of dimension
$d$
. Let
${\mathbb{R}}^d$
be the affine space obtained from
${\mathbb{P}}^d$
by removing any one of the hyperplanes. We are left with an arrangement of
$m$
hyperplanes in
${\mathbb{R}}^d$
. By [Reference Huh and SturmfelsHS14, Theorem 1.20], the number of bounded regions in that affine arrangement is equal to
$|\chi (X)|$
.
The number of bounded regions described above depends only on the matroid
$M$
, and it is known as the beta invariant. This is a result due to Zaslavsky [Reference ZaslavskyZas75]. The beta invariant can be computed by substituting
$1$
into the reduced characteristic polynomial of
$M$
; see [Reference OxleyOxl82].
In our situation, we are considering arrangements of hyperplanes in
${\mathbb{R}}^d$
of the special types
$\{x_i=0\}$
,
$\{x_j=1\}$
, or
$\{x_k=x_l\}$
. These correspond to graphic matroids, and here the characteristic polynomial is essentially the chromatic polynomial of the underlying graph. Our problem is this: for which graphs does this affine hyperplane arrangement have precisely one bounded region? Or, more generally, which matroids have a beta invariant equal to one?
The answer to this question was given by Brylawski [Reference BrylawskiBry71, Theorem 7.6]: the beta invariant of a matroid
$M$
equals one if and only if
$M$
is series-parallel. This means that
$M$
is a graphic matroid, where the graph is series-parallel. We finally cite the following from Bodirsky et al. [Reference Bodirsky, Giménez, Kang and NoyBGKN07, p. 2092]: A series-parallel graph on
$N$
vertices has at most
$2N - 3$
edges. Those having this number of edges are precisely the 2-trees. Setting
$N=n-1$
, we can now conclude that the 2-trees
$T$
are the only subsets of
$S$
that exhibit minimal kinematics for
$\mathcal{M}_{0,n}$
.
3. Horn matrices
A remarkable theorem due to June Huh [Reference HuhHuh14] characterizes very affine varieties
$X \subset ({\mathbb{C}}^*)^m$
that have ML degree
$1$
. The unique critical point
$\hat {p}$
of the log-likelihood function on
$X$
is given by the Horn uniformization, due to Kapranov [Reference KapranovKap91]. Huh’s result was adapted to the setting of algebraic statistics by Duarte et al. in [Reference Duarte and MariglianoDMS21]. For an exposition see also [Reference Huh and SturmfelsHS14, § 3]. The Horn uniformization can be written in concise notation as follows:

Here,
$H$
is an integer matrix with
$m$
columns and
$\lambda$
is a vector in
${\mathbb{Z}}^m$
. The pair
$(H,\lambda )$
is an invariant of the variety
$X$
, referred to as the Horn pair in [Reference Duarte and MariglianoDMS21]. The coefficients
$s_{ij}$
in the log-likelihood function (aka Mandelstam invariants) form the column vector
$s$
of length
$m$
, so
$Hs$
is a vector of linear forms in the coordinates of
$s$
. The notation
$(Hs)^H$
means that we regard each column of
$H$
as an exponent vector, and we form
$m$
Laurent monomials in the linear forms
$H s$
. Finally,
$\star$
denotes the Hadamard product of two vectors of length
$m$
.
The formula for
$\hat {p}$
given in (11) is elegant, but one needs to get used to it. We encourage our readers to work through Example 3.2, where
$(Hs)^H$
is shown for two Horn matrices
$H$
.
We now present the main result of this section, namely the construction of the Horn matrix
$H = H_T$
for the very affine variety
$X_T$
associated with any 2-tree
$T$
on
$[n-1]$
. The matrix
$H_T$
has
$3n-9$
rows and
$m=2n-6$
columns, one for each edge of
$T$
. It is constructed inductively as follows. If
$n=4$
with
$T = \{13,23\}$
, which represents the triangle graph, then

Now, for
$k \geq 5$
, let
$T^{\prime}$
be any 2-tree with
$k$
vertices, where vertex
$k$
is connected to
$i$
and
$j$
, and
$T = T^{\prime} \,\backslash \{ik,jk\}$
as in the proof of Lemma 2.1. Then the Horn matrix for
$T^{\prime}$
equals

where
$\textbf { 0}$
is the zero row vector and
$\textbf { h}_{ij}$
is the column of
$H_T$
that is indexed by the edge
$ij$
.
Theorem 3.1.
Given any 2-tree
$T$
on
$[n-1]$
, the
$(3n-9) \times (2n-6)$
matrix
$H_T$
constructed above equals the Horn matrix
$H$
for the very affine variety
$X_T$
. There exists a sign vector
$\lambda \in \{-1,+1\}^{2n-6}$
such that (
11
) is the unique critical point
$\hat {p}$
of the scattering potential (
7
).
Before proving this theorem, we illustrate the construction of
$H_T$
and the statement.
Example 3.2 (
$n=6$
). We consider the two 2-trees that are shown in (10). In each case, the Horn matrix has nine rows and six columns. We find that the two Horn matrices are

For
$H = H_{T_i}$
, the column vector
$Hs$
has nine entries, each a linear form in six
$s$
-variables. Each column of
$H$
specifies an alternating product of these linear forms, and these are the entries of
$(Hs)^H$
. By adjusting signs when needed, we obtain the six coordinates of
$\hat {p}$
.
For the second 2-tree
$T_2$
, the six coordinates of the critical point
$\hat {p}$
are

For the first 2-tree
$T_1$
, the six coordinates of the critical point
$\hat {p}$
are

We note that these
$\hat {p}_{ij}$
satisfy the trinomial equations given for
$X_{T_1}$
, respectively
$X_{T_2}$
, in Example 2.2.
Proof of Theorem 3.1
. We start by reviewing the Horn uniformization (11) in the version proved by Huh [Reference HuhHuh14]. Let
$X \subset ({\mathbb{C}}^*)^m$
be any very affine variety. The following are equivalent:
-
(i) the variety
$X$ has ML degree
$1$ ;
-
(ii) there exists
$\lambda =(\lambda _1, \ldots ,\lambda _m) \in ({\mathbb{C}}^*)^m$ and a matrix
$H = (h_{ij})$ in
$ {\mathbb{Z}}^{\ell \times m}$ with zero column sums and left kernel
$A$ , such that the monomial map
\begin{equation*} ({\mathbb{C}}^*)^\ell \,\to \, ({\mathbb{C}}^*)^m, \,\, \,{\bf q}=(q_1,\ldots ,q_\ell ) \,\mapsto \,\biggl (\lambda _1 \underset {i=1}{\overset {\ell }{\prod }}q_i^{h_{i1}},\, \ldots ,\,\lambda _m \underset {i=1}{\overset {\ell }{\prod }}q_i^{h_{im}} \biggr ),\end{equation*}
$A$ -discriminantal variety
$\Delta _{A}$ dominantly onto
$X$ .
Constructing a Horn uniformization proves that the variety
$X$
has ML degree
$1$
. The corresponding Horn map (11) is precisely the unique critical point of the log-likelihood function. The following argument thus also serves as an alternative proof of Theorem1.4.
Fix a 2-tree
$T$
on
$[n-1]$
, with associated variety
$X= X_T$
in
$ ({\mathbb{C}}^*)^m$
, where
$m=2n-6$
. Let
$H = H_T$
be the
$(3n-9) \times (2n-6)$
matrix constructed previously. Its left kernel is given by

This matrix has
$n-3$
rows. Its toric variety is the
$(n-4)$
-dimensional linear space

The
$A$
-discriminantal variety is the variety projectively dual to
$X_A$
. Here, it is the linear space

Disregarding
$\lambda$
for now, the monomial map in item (ii) above takes
$q \in \Delta _A$
to the vector

Using the trinomial equations that define
$\Delta _A$
, we can write this as follows

The vector
$p$
satisfies the equation
$\,p_{13} + p_{23} + 1=0\,$
and it satisfies all subsequent equations
$p_{ik} + p_{jk} + p_{ij} = 0$
that arise from the construction of the 2-tree
$T$
.
In a final step, we need to adjust the signs of the coordinates in order for
$p$
to satisfy the equations
$p_{ik} - p_{jk} - p_{ij} = 0$
that cut out
$X_T$
; see e.g. Example 2.2. This is done by replacing
$\hat {p}_{lm} = \lambda _{lm} p_{lm}$
for appropriate signs
$\lambda _{lm} \in \{-1,+1\}$
, or, equivalently, by replacing
$p$
with the Hadamard product
$\hat {p} = \lambda \star p$
for an appropriate sign vector
$\lambda \in \{-1,+1\}^{2n-6}$
.
The signs can be constructed recursively as follows. We set
$\lambda _{13}=-1, \lambda _{23}=1$
and, with the same notation as earlier, we define
$\lambda _{ik}=-\lambda _{ij}$
and
$\lambda _{jk}=\lambda _{ij}.$
Indeed, if
$(\hat {p}_{ij},\hat {p}_{ik},\hat {p}_{jk})$
satisfy
$p_{ik} + p_{jk} + p_{ij} = 0$
, then
$(\lambda _{ij}\hat {p}_{ij},\lambda _{ik}\hat {p}_{ik},\lambda _{jk} \hat {p}_{jk})$
satisfy
$p_{ik} - p_{jk} - p_{ij} = 0$
. This now gives the desired birational map from
$\Delta _A$
onto
$X_T$
. That map furnishes the rational formula for the unique critical point
$\hat {p}$
. To obtain the version
$\,\hat {p} = \lambda \star (Hs)^H$
seen in (11), we note that the map
$s \mapsto Hs$
parametrizes the linear space
$\Delta _A$
. This completes the proof.
We conclude this section by making the coordinates of
$\hat {p}= \lambda \star (Hs)^H$
more explicit. Given any 2-tree
$T$
and any edge
$ij$
of
$T$
, we write
$[s_{ij}]$
for the sum of all Mandelstam invariants
$s_{lm}$
where
$lm$
is any descendant of the edge
$ij$
in
$T$
. Here descendant refers to the transitive closure of the parent-child relation in the iterative construction of
$T$
: the new edges
$ik$
and
$jk$
are children of the old edge
$ij$
. In this case we call
$\{i,j,k\}$
a triangle of the 2-tree
$T$
. This triangle is an ancestral triangle of an edge
$lm$
of
$T$
if
$lm$
is a descendant of the edge
$ik$
. The entries of the vector
$H s$
are the linear forms
$[s_{ij}]$
and their negated sums
$-[s_{ik}] - [s_{jk}]$
.
Corollary 3.3.
The evaluation of the Plücker coordinate
$p_{lm}$
at the critical point of
$L_T$
equals

where the product runs over all ancestral triangles
$\{i,j,k\}$
of the edge
$lm$
, and
$\lambda _{ik} \in \{-1,+1\}$
is the sign chosen in the proof of Theorem 3.1
.
This is a corollary of Theorem3.1. The proof is given by inspecting the Horn matrix
$H_T$
. It is instructive to rewrite the rational functions
$\hat {p}_{ij}$
in Example 3.2 using the notation in (12).
4. Amplitudes
In this section we take a step towards particle physics. We define an amplitude
$m_T$
for any 2-tree
$T$
. This is a rational function in the Mandelstam invariants
$s_{ij}$
. When
$T$
is planar,
$m_T$
is a degeneration of the biadjoint scalar amplitude
$m_n$
. We saw this in Example 1.5. Cachazo and Early in [Reference Cachazo and EarlyCE21] introduced minimal kinematics as a means to study such degenerations.
We shall express the amplitude
$m_T$
in terms of the Horn matrix
$H=H_T$
. Given any 2-tree
$T$
, the entries of the column vector
$s$
are the
$2n-6$
Mandelstam invariants
$s_{ij}$
. The entries of the column vector
$H s$
are
$3n-9$
linear forms in
$s$
. Our main result is the following theorem.
Theorem 4.1.
Fix a 2-tree
$T$
on
$[n-1]$
. The amplitude
$m_T$
associated with
$T$
equals

The product is over all triangles in
$T$
. This rational function of degree
$3-n$
is the product of
$n-3$
of the linear forms in
$Hs$
divided by the product of the other
$2n-6$
linear forms in
$Hs$
.
In order for this theorem to make sense, we first need a definition of the amplitude
$m_T$
. We shall work in the framework of beyond-planar MHV amplitudes developed by Arkani-Hamed et al. in [Reference Arkani-Hamed, Bourjaily, Cachazo, Postnikov and TrnkaABCPT15]. Our point of departure is the observation that every 2-tree on
$[n-1]$
defines an on-shell diagram. Here we view
$T$
as a list of
$n-2$
triples
$ijk$
, starting with
$123$
and ending with
$12n$
. The last triple
$12n$
is special because it uses the vertex
$n$
. Moreover, it does not appear in the formula (13). For a concrete example, we identify the 2-trees
$T_1$
and
$T_2$
in (10) with the following two on-shell diagrams, one planar and one non-planar:

With each triple
$ijk$
in
$T$
we associate the row vector
$p_{jk} e_i - p_{ik} e_j + p_{ij} e_k$
, and we define
$M_T$
to be the
$(n-2) \times n$
matrix whose rows are these vectors for all triples in
$T$
. For instance,

Note that the kernel of
$M_T$
coincides with the row span of the matrix
$X$
in (1). This implies that there exists a polynomial
$\Delta (M_T)$
of degree
$n-3$
in the Plücker coordinates such that the maximal minor of
$M_T$
obtained by deleting columns
$i$
and
$j$
is equal to
$\,\pm \,p_{ij} \cdot \Delta (M_T)$
.
Lemma 4.2.
For any 2-tree
$T$
, the gcd of the maximal minors of the matrix
$M_T$
equals

where the product is over all edges of
$T$
, and
$v_T(ij)$
is the number of triangles containing
$ij$
.
Proof.
The rightmost maximal square submatrix of
$M_T$
is lower triangular. Its determinant equals the product of the
$p_{ij}$
where
$\{i,j,k\}$
runs over all triangles in the 2-tree. By construction of
$T$
, the number of occurrences of
$p_{ij}$
is
$v_T(ij)-1$
. We divide this product by
$p_{12}$
to get
$\Delta (M_T)$
, since the triangle
$\{1,2,n\}$
has to be disregarded for a 2-tree on
$[n-1]$
.
Following [Reference Arkani-Hamed, Bourjaily, Cachazo, Postnikov and TrnkaABCPT15, (2.15)], we define the integrand associated with the 2-tree
$T$
to be

This is a rational function of degree
$-n$
in the Plücker coordinates. Lemma 4.2 now implies the following corollary.
Corollary 4.3.
Given any 2-tree
$T$
, the associated integrand equals

The integrands for the 2-trees
$T_1$
and
$T_2$
from our running example in (10) and (14) are

Note that
$\mathcal{I}_{T_1}$
is a Parke–Taylor factor for
$n=6$
. We are now using the unconventional labeling
$1,n,2,3,4,\ldots ,n-1$
for the vertices of the
$n$
-gon. Any triangulation of this
$n$
-gon is a 2-tree
$T$
. Such 2-trees are called planar. To be precise, the triangulation consists of the three edges of the triangle
$\{1,n,2\}$
together with the
$2n-6$
edges given by the 2-tree
$T$
.
Corollary 4.4.
For any planar 2-tree
$T$
, our integrand equals the Parke–Taylor factor.

Proof. This was observed in [Reference Arkani-Hamed, Bourjaily, Cachazo, Postnikov and TrnkaABCPT15, § 3]. It follows directly from Corollary 4.3.
We now define the amplitude associated with a 2-tree
$T$
to be the following expression:

Here
$\textrm {Hess}(L_T)$
is the determinant of the Hessian of the log-likelihood function in (7). The numerator and the denominator are evaluated at the critical point
$\hat {p}$
which is given by Theorem3.1.
Example 4.5 (
$n=6$
). Let
$T = T_2$
be the non-planar 2-tree in our running example. Then

The numerator
$\mathcal{(I}_T)^2(\hat {p})$
is the same expression but with each exponent increased by one. Therefore
$m_T$
is equal to the ratio given by the Horn matrix
$H_T$
, as promised in Theorem 4.1.
Proof of Theorem 4.1
. Fix a 2-tree
$T$
on
$[n-1]$
. For any edge
$ij$
of
$T$
, we write
$a_{ij}$
for the number of descendants of that edge. For any triangle
$\{i , j \lt k\}$
of
$T$
, we set
$b_k = a_{ik} + a_{jk}+1$
.
A key combinatorial lemma about 2-trees is that
$a_{ij}+1$
equals the sum of the integers
$2 \bigl (2-v_T(lm) \bigr )$
where
$lm$
runs over the set
$\textrm {dec}_T(ij)$
of descendants of the edge
$ij$
. To be precise,

The initial edge
$12$
is excluded. Recall that
$v_T(lm)$
is the number the triangles containing the edge
$lm$
. We prove (20) by induction on the construction of
$T$
, after checking it for
$n \leq 5$
. Indeed, suppose a new vertex
$n$
enters the 2-tree, with edges
$rn$
and
$sn$
. Then
$v_T(rs)$
increases by
$1$
and
$v_T(rn) = v_T(sn)=1$
. Otherwise
$v_T$
is unchanged. For all ancestors
$ij$
of
$rs$
, the right hand side and the left hand side of (20) increase by
$2$
. For
$ij \in \{rn,sn\}$
, both sides are
$2$
. For all other edges
$ij$
, the two sides remain unchanged. Hence (20) is proved.
We now evaluate
$(\mathcal{I}_T)^2$
at
$\hat {p}$
by plugging (12) into the square of (17). This gives

where the inner product is over ancestral triangles
$ijk$
of
$lm$
. Switching the products yields

where the product is over all triangles
$ijk$
of
$T$
. In conclusion, we have derived the formula

In order to prove Theorem4.1, we must show that the Hessian at the critical point equals

The proof is organized by an induction on
$k$
, where the Mandelstam invariants
$s_{ij}$
are transformed as we deduce the desired formula for
$k$
from the corresponding formula for
$k-1$
.
As a warm-up, it is instructive to examine the case
$k=4$
, where the Hessian is a
$1 \times 1$
matrix. The entry of that matrix is the second derivative of (4) evaluated at (5). We find

This equation matches (22), and it serves as the blueprint for the identity in (24) below.
Our first step towards (22) is to get rid of the minus sign. To this end, we write
$\mathcal{H}$
for the Hessian matrix of the negated scattering potential
$L_T$
, evaluated at
$\hat {p}$
. Its entries are

We shall prove that
$\textrm {det}(\mathcal{H})$
equals the product on the right-hand side of (22). This will be done by downward induction. Let
$k$
be the last vertex, connected to earlier vertices
$i,j$
. We display the rows and columns of the Hessian that are indexed by the triangle
$\{i,j,k\}$
:

Since
$k$
is a terminal node in the 2-tree
$T$
, its two edges satisfy a variant of (5), namely

Using these identities, the lower right entry of the matrix
$\mathcal{H}$
can be written as follows:

By factoring out
$\mathcal{H}_{kk}$
from the last row, the lower right entry becomes
$1$
. We add multiples of the last row to rows
$i$
and
$j$
, so as to cancel their last entries. The resulting upper left block with one fewer row and one fewer column is the Hessian matrix of the scattering potential for the 2-tree which is obtained from
$T$
by removing vertex
$k$
and its two incident edges
$ik$
and
$jk$
. However, in the new matrix
$s_{ij}$
is now replaced by
$s_{ij} + s_{ik} + s_{jk}$
. Note that this sum equals
$[s_{ij}]$
if
$v_T(ij)=2$
. Proceeding inductively, more and more terms get added, and eventually each Mandelstam invariant
$s_{lm}$
is replaced by the corresponding sum
$[s_{lm}]$
.
In the end, we find that the determinant of
$\mathcal{H}$
is equal to the product of the quantities

where
$ijk$
ranges over all triangles of the 2-tree
$T$
. A combinatorial argument like that presented above shows that this product is equal to (22). This completes the proof.
Remark 4.6. Our proof rests on the fact that the Hessian is the product of the expressions (25). Another way to get this is to directly triangularize the scattering equations
$\nabla L_T=0$
. Using Corollary 3.3 and the trinomials defining
$X_T$
, we see that
$\nabla L_T=0$
is equivalent to

This is a triangular system of
$n-3$
equations in the unknowns
$x_1,x_2,\ldots ,x_{n-3}$
. The Jacobian of this system is upper triangular, and its determinant is the product of the expressions (25).
5. On-shell diagrams and hypertrees
On-shell diagrams were developed by Arkani-Hamed, Bourjaily, Cachazo, Postnikov and Trnka [Reference Arkani-Hamed, Bourjaily, Cachazo, Postnikov and TrnkaABCPT16]. They encode rational parts, or leading singularities [Reference CachazoCacha08], occurring in scattering amplitudes for
$\mathcal{N}=4$
Super Yang–Mills theory [Reference Arkani-Hamed, Bourjaily, Cachazo, Postnikov and TrnkaABCPT16, § 4.7]. These are defined using the spinor-helicity formalism on a product of Grassmannians,
$\textrm {Gr}(2,n) \times \textrm {Gr}(2,n)$
. We focus on the special case of MHV on-shell diagrams, where leading singularities are rational functions in the coordinates
$p_{ij}$
(
$=\langle ij\rangle$
, in physics notation [Reference Arkani-Hamed, Bourjaily, Cachazo, Postnikov and TrnkaABCPT15, § 2.3]) of a single Grassmannian
$\textrm {Gr}(2,n)$
. The resulting discontinuities of MHV amplitudes are, for us, CHY integrands.
In [Reference Arkani-Hamed, Bourjaily, Cachazo, Postnikov and TrnkaABCPT15, § 2.2], an identification was proposed between on-shell diagrams and certain collections of
$n-2$
triples in
$[n]=\{1,\ldots , n\}$
. Among these on-shell diagrams are the hypertrees of Castravet–Tevelev [Reference Castravet and TevelevCT13]. This identification is again noted in [Reference TevelevTev, Lemma 9.5].
In algebraic geometry, hypertrees represent effective divisors on the moduli space
$\mathcal{M}_{0,n}$
. In physics, one can also consider on-shell diagrams for higher Grassmannians
$\textrm {Gr}(k,n)$
; cf. [Reference Franco, Galloni, Penante and WenFGPW15]. The analogs to hypertrees are now
$(n-k)$
-element collections of
$(k+1)$
-sets in
$[n]$
. These define effective divisors on the configuration spaces
$X(k,n) = \textrm {Gr}(k,n)^o/({\mathbb{C}}^*)^n$
. It would be interesting to examine these through the lens of [Reference Castravet and TevelevCT13, Reference TevelevTev]. In this paper, we stay with
$k=2$
.
We define a hypertree to be a collection
$T$
of
$n-2$
triples
$\Gamma _1,\ldots , \Gamma _{n-2}$
in
$[n]$
such that:
-
(a) each
$i\in [n]$ appears in at least two triples; and
-
(b)
$\big \vert \bigcup _{i\in S}\Gamma _i\big \vert \,\ge \, \vert S\vert+2\,$ for all non-empty subsets
$S \subseteq [n-2]$ .
Hypertrees have the same number of triples as 2-trees, but 2-trees are not hypertrees because some
$i$
appears in only one triple. However, if axiom (a) for hypertrees is dropped but (b) is kept, one obtains all nonzero leading singularities of MHV amplitudes. In particular, 2-trees satisfy (b), and one might view them as hypertrees in a weak sense. A hypertree is called irreducible if the inequality in (b) is strict for
$2 \leq |S| \leq n-3$
; see [Reference Castravet and TevelevCT13, Definition 1.2].
Example 5.1 (
$n=6$
). The 2-trees in (14) are not hypertrees. The following is a hypertree:

but it is not a 2-tree. The hypertree
$T$
corresponds to the octahedral on-shell diagram in [Reference Cachazo, Early, Guevara and MizeraCEGM19, Figure 1.1]. See also [Reference Castravet and TevelevCT13, Figure 2] and [Reference He, Yan, Zhang and ZhangHYZZ18, § 5.2]. This hypertree is irreducible, in the sense defined above, and it will serve as our running example throughout this section.
Many of the concepts for 2-trees from the previous sections make sense for hypertrees
$T$
. We define the
$(n-2) \times n$
matrix
$M_T$
as in § 4. The rows of
$M_T$
are the vectors
$p_{jk} e_i - p_{ik} e_j + p_{ij} e_k$
for
$\{i,j,k\} \in T$
. These span the kernel of the
$2 \times n$
matrix
$X$
in (1). The gcd of the maximal minors of
$M_T$
is a polynomial
$\Delta (M_T)$
of degree
$n-3$
in the Plücker coordinates
$p_{ij}$
. The equation
$\Delta (M_T)=0$
defines a divisor in
$\mathcal{M}_{0,n}$
, namely the hypertree divisor. If
$T$
is a 2-tree then the hypertree divisor is a union of Schubert divisors
$\{p_{ij} = 0\}$
, as seen in Lemma 4.2. For the geometric application in [Reference Castravet and TevelevCT13], this case is uninteresting. Instead, Castravet and Tevelev focus on hypertree divisors that are irreducible; see [Reference Castravet and TevelevCT13, Theorem 1.5].
Example 5.2 (Irreducible hypertree). The hypertree
$T$
in (26) is irreducible. Its matrix is

The hypertree divisor for
$T$
is an irreducible surface in the threefold
$\mathcal{M}_{0,6}$
. It is defined by

This polynomial is irreducible in the coordinate ring of
$\textrm {Gr}(2,6)$
. See also [Reference Castravet and TevelevCT13, Figure 2]. The corresponding on-shell diagram appears in [Reference Arkani-Hamed, Bourjaily, Cachazo, Postnikov and TrnkaABCPT15, (2.18)] where different labelings are used.
For every hypertree
$T$
, we define the CHY integrand
$\,\mathcal{I}_T\,$
by the formula in (16). This is a rational function of degree
$-n$
in the Plücker coordinates
$p_{ij}$
. The scattering potential for
$T$
is the log-likelihood function
$L_T$
in (7) where the sum is over all pairs
$(i,j)$
that are contained in some triple of
$T$
. Thus we set
$s_{ij} = 0$
in (2) for all non-edges
$ij$
of
$T$
. The hypertree
$T$
in (26) has three non-edges, namely
$14$
,
$25$
and
$36$
. Furthermore, we use the same formulas as in (19) to define the hypertree amplitude for
$T$
. To be precise, we set

where the sum ranges over all critical points
$\hat {p}$
of the scattering potential
$L_T$
. This is a rational function of degree
$3-n$
in the Mandelstam invariants
$s_{ij}$
where
$(i,j)$
appears in
$T$
.
Remark 5.3. Our definition of
$L_T$
for hypertrees
$T$
differs from that for 2-trees in (7). The difference arises from the restriction to the basis
$S$
in (6) which reflects the gauge fixing in (1). In the new definition,
$L_T$
has no critical point when
$T$
is a 2-tree. For instance, let
$n=4$
and
$T = \{123, 124\}$
. This
$T$
has one non-edge, namely
$34$
, which is not in
$S = \{13,23\}$
. The new definition requires us to set
$s_{34} = -s_{13} - s_{23}$
to zero, so that
$s_{23} = - s_{13}$
. In this case, the function in (4) becomes
$L_T = s_{13}\cdot \textrm {log}(x_1/(1-x_1))$
, which has no critical point. Minimal kinematics only arises when
$s_{34}$
remains an unknown, resulting in the critical point (5).
Example 5.4 (Hypertree amplitude). We compute the amplitude
$m_T$
for the hypertree
$T$
in (26). The generic
$n=6$
scattering potential
$L$
has six critical points
$\hat {p}$
, but the restricted scattering potential
$L_T$
has only two. The expression (27) also makes sense for
$L$
. We have

This evaluates to a rational function in the Mandelstam invariants. For generic
$s_{ij}$
, we find

We next impose the constraints
$s_{14} = s_{25} = s_{36}=0$
coming from the hypertree
$T$
. Some poles now become spurious. By collecting distinct maximal nonzero residues and then canceling spurious poles, we obtain the Feynman diagram expansion for the hypertree amplitude:

This is the rational function (27), where we sum over the two critical points of
$L_T$
. This amplitude has
$12$
poles,
$24$
compatible pairs, and
$14$
Feynman diagrams, in bijection with the faces of a rhombic dodecahedron shown in Figure 1. By comparison, the amplitude in (9) has
$ 9$
poles and is a sum over
$14$
Feynman diagrams, given combinatorially by the associahedron.

Figure 1. Combinatorics of the octahedral hypertree amplitude
$m_T$
.
It was shown in [Reference Arkani-Hamed, Bourjaily, Cachazo, Postnikov and TrnkaABCPT15, § 3.2] that, for any on-shell diagram
$T$
, the CHY integrand
$\mathcal{I}_T$
decomposes as a sum of Parke–Taylor factors
$PT(\alpha _1,\ldots , \alpha _n)$
. Therefore,
$m_T$
is a linear combination of biadjoint amplitudes
$m(\alpha ,\beta )$
as
$\alpha ,\beta$
range over pairs of cyclic orders on
$\lbrack n\rbrack$
that are both cyclic shuffles of the triples in
$T$
. See [Reference Cachazo, He and YuanCHYB14] for details of this construction. Such a decomposition with seven terms is shown in [Reference Arkani-Hamed, Bourjaily, Cachazo, Postnikov and TrnkaABCPT15, (3.12)] for the integrand
$\mathcal{I}_T$
in (28).
This example raises several questions for future research. The first concerns the ML degree of an arbitrary hypertree
$T$
. By this we mean the number of complex critical points of the function
$L_T$
. The hypertree
$T$
in (26) has
$\textrm {MLdegree}(T)=2$
.
Question 5.5. Can we find a formula for the ML degree of an arbitrary on-shell diagram, and in particular for an arbitrary hypertree
$T$
? How is that ML degree related to the geometry of the hypertree divisor
$\{\Delta (M_T)=0\}$
on the moduli space
$\mathcal{M}_{0,n}$
?
Remark 5.6. We computed the ML degrees for the hypertrees in the Opie–Tevelev database https://people.math.umass.edu/~tevelev/HT_database/database.html. For instance, for
$n=9$
, the ML degrees range from
$8$
to
$16$
. For a concrete example consider the hypertree
$T = \{123,129,456,789,147,258,367\}$
. Here the ML degree equals
$10$
, i.e. the function
$L_T$
has
$10$
critical points. By contrast, the general scattering potential
$L$
in (2) has
$720$
critical points, and its
$m(\alpha ,\beta )$
expansion involves
$3185$
Feynman diagrams. After imposing
$s_{ij}=0$
for every non-edge
$ij$
, we find that
$297$
distinct maximal residues remain in the amplitude
$m_T$
. This is still a considerable amount of structure to be found from only
$10$
critical points.
Returning to the title of this paper, we ought to be looking for minimal kinematics.
Question 5.7. For any hypertree
$T$
, how can we best reach ML degree
$1$
by restricting the scattering potential
$L_T$
to a subspace of kinematic space? In particular, can we always reach a Horn uniformization formula (11) for the critical points
$\hat {p}$
by setting some multiple-particle poles
$s_{ij \cdots k}$
to zero? This would lead to a formula like (13) for the specialized amplitude
$m_T$
.
The following computation shows that we can indeed do this for our running example.
Example 5.8.
In Example 5.4 we set the three-particle pole
$s_{234}= s_{23} + s_{34} + s_{24}$
to zero, in addition to
$s_{14} = s_{25} = s_{36}=0$
. This results in a dramatic simplification of the amplitude:

The ML degree is now
$1$
. By Huh’s theorem [Reference HuhHuh14], the critical point is given by a Horn pair
$(H,\lambda )$
. In short, the subspace
$\{s_{14} = s_{25} = s_{36} = s_{234} = 0\}$
exhibits minimal kinematics.
Finally, all of our questions extend naturally from
$\textrm {Gr}(2,n)$
to
$\textrm {Gr}(k,n)$
. Using physics acronyms, we seek to extend our amplitudes
$m_T$
from CHY theory [Reference Cachazo, He and YuanCHY14] to CEGM theory [Reference Cachazo, Early, Guevara and MizeraCEGM19]. For example, the CEGM potential on the
$4$
-dimensional space
$X(3,6) = \textrm {Gr}(3,6)^o/({\mathbb{C}}^*)^6$
is

This log-likelihood function is known to have
$26$
critical points; see e.g. [Reference Sturmfels and TelenST21, Proposition 5]. We now restrict to the kinematic subspace
$\{\mathfrak{s}_{135}=\mathfrak{s}_{235}=\mathfrak{s}_{246}=\mathfrak{s}_{256}=\mathfrak{s}_{356}=\mathfrak{s}_{245}=0\}$
. Then the ML degree drops from
$26$
to
$1$
. In short, this subspace exhibits minimal kinematics.
Question 5.9. Can we characterize minimal kinematics for the configuration space
$X(k,n) = \textrm {Gr}(k,n)^o/({\mathbb{C}}^*)^n$
? What plays the role that 2-trees play in Theorem 1.4? Can we determine the ML degree of on-shell diagrams for
$k \geq 3$
? Ambitiously, we seek an all
$k$
and
$n$
peek [Reference Cachazo and EarlyCE21]. Furthermore, we ask the same question for the realization space of any rank
$k$
matroid on
$[n]$
.
Revisiting the case
$k=2$
once more, here is a complexity question about graphs for
$\mathcal{M}_{0,n}$
.
Question 5.10. The
$2$
-trees are exactly the maximal graphs of treewidth
$2$
. Can we compute the number of critical points of the scattering potential function for a graph of treewidth
$t$
?
Conflicts of Interest
None.
Journal Information
Moduli is published as a joint venture of the Foundation Compositio Mathematica and the London Mathematical Society. As not-for-profit organisations, the Foundation and Society reinvest
$100\%$
of any surplus generated from their publications back into mathematics through their charitable activities.