To save content items to your account,
please confirm that you agree to abide by our usage policies.
If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account.
Find out more about saving content to .
To save content items to your Kindle, first ensure no-reply@cambridge.org
is added to your Approved Personal Document E-mail List under your Personal Document Settings
on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part
of your Kindle email address below.
Find out more about saving to your Kindle.
Note you can select to save to either the @free.kindle.com or @kindle.com variations.
‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi.
‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.
We consider the community detection problem in sparse random hypergraphs under the non-uniform hypergraph stochastic block model (HSBM), a general model of random networks with community structure and higher-order interactions. When the random hypergraph has bounded expected degrees, we provide a spectral algorithm that outputs a partition with at least a $\gamma$ fraction of the vertices classified correctly, where $\gamma \in (0.5,1)$ depends on the signal-to-noise ratio (SNR) of the model. When the SNR grows slowly as the number of vertices goes to infinity, our algorithm achieves weak consistency, which improves the previous results in Ghoshdastidar and Dukkipati ((2017) Ann. Stat.45(1) 289–315.) for non-uniform HSBMs.
Our spectral algorithm consists of three major steps: (1) Hyperedge selection: select hyperedges of certain sizes to provide the maximal signal-to-noise ratio for the induced sub-hypergraph; (2) Spectral partition: construct a regularised adjacency matrix and obtain an approximate partition based on singular vectors; (3) Correction and merging: incorporate the hyperedge information from adjacency tensors to upgrade the error rate guarantee. The theoretical analysis of our algorithm relies on the concentration and regularisation of the adjacency matrix for sparse non-uniform random hypergraphs, which can be of independent interest.
We derive a sufficient condition for a sparse random matrix with given numbers of non-zero entries in the rows and columns having full row rank. The result covers both matrices over finite fields with independent non-zero entries and $\{0,1\}$-matrices over the rationals. The sufficient condition is generally necessary as well.
We explore the limiting spectral distribution of large-dimensional random permutation matrices, assuming the underlying population distribution possesses a general dependence structure. Let $\textbf X = (\textbf x_1,\ldots,\textbf x_n)$$\in \mathbb{C} ^{m \times n}$ be an $m \times n$ data matrix after self-normalization (n samples and m features), where $\textbf x_j = (x_{1j}^{*},\ldots, x_{mj}^{*} )^{*}$. Specifically, we generate a permutation matrix $\textbf X_\pi$ by permuting the entries of $\textbf x_j$$(j=1,\ldots,n)$ and demonstrate that the empirical spectral distribution of $\textbf {B}_n = ({m}/{n})\textbf{U} _{n} \textbf{X} _\pi \textbf{X} _\pi^{*} \textbf{U} _{n}^{*}$ weakly converges to the generalized Marčenko–Pastur distribution with probability 1, where $\textbf{U} _n$ is a sequence of $p \times m$ non-random complex matrices. The conditions we require are $p/n \to c >0$ and $m/n \to \gamma > 0$.
We study the distribution of the length of longest increasing subsequences in random permutations of n integers as n grows large and establish an asymptotic expansion in powers of $n^{-1/3}$. Whilst the limit law was already shown by Baik, Deift and Johansson to be the GUE Tracy–Widom distribution F, we find explicit analytic expressions of the first few finite-size correction terms as linear combinations of higher order derivatives of F with rational polynomial coefficients. Our proof replaces Johansson’s de-Poissonization, which is based on monotonicity as a Tauberian condition, by analytic de-Poissonization of Jacquet and Szpankowski, which is based on growth conditions in the complex plane; it is subject to a tameness hypothesis concerning complex zeros of the analytically continued Poissonized length distribution. In a preparatory step an expansion of the hard-to-soft edge transition law of LUE is studied, which is lifted to an expansion of the Poissonized length distribution for large intensities. Finally, expansions of Stirling-type approximations and of the expected value and variance of the length distribution are given.
Given the full shift over a countable state space on a countable amenable group, we develop its thermodynamic formalism. First, we introduce the concept of pressure and, using tiling techniques, prove its existence and further properties, such as an infimum rule. Next, we extend the definitions of different notions of Gibbs measures and prove their existence and equivalence, given some regularity and normalization criteria on the potential. Finally, we provide a family of potentials that nontrivially satisfy the conditions for having this equivalence and a nonempty range of inverse temperatures where uniqueness holds.
We establish the asymptotic expansion in $\beta $ matrix models with a confining, off-critical potential in the regime where the support of the equilibrium measure is a finite union of segments. We first address the case where the filling fractions of these segments are fixed and show the existence of a $\frac {1}{N}$ expansion. We then study the asymptotics of the sum over the filling fractions to obtain the full asymptotic expansion for the initial problem in the multi-cut regime. In particular, we identify the fluctuations of the linear statistics and show that they are approximated in law by the sum of a Gaussian random variable and an independent Gaussian discrete random variable with oscillating center. Fluctuations of filling fractions are also described by an oscillating discrete Gaussian random variable. We apply our results to study the all-order small dispersion asymptotics of solutions of the Toda chain associated with the one Hermitian matrix model ($\beta = 2$) as well as orthogonal ($\beta = 1$) and skew-orthogonal ($\beta = 4$) polynomials outside the bulk.
Let A be an $n \times n$ symmetric matrix with $(A_{i,j})_{i\leqslant j}$ independent and identically distributed according to a subgaussian distribution. We show that
where $\sigma _{\min }(A)$ denotes the least singular value of A and the constants $C,c>0 $ depend only on the distribution of the entries of A. This result confirms the folklore conjecture on the lower tail of the least singular value of such matrices and is best possible up to the dependence of the constants on the distribution of $A_{i,j}$. Along the way, we prove that the probability that A has a repeated eigenvalue is $e^{-\Omega (n)}$, thus confirming a conjecture of Nguyen, Tao and Vu [Probab. Theory Relat. Fields 167 (2017), 777–816].
Graphical models with heavy-tailed factors can be used to model extremal dependence or causality between extreme events. In a Bayesian network, variables are recursively defined in terms of their parents according to a directed acyclic graph (DAG). We focus on max-linear graphical models with respect to a special type of graph, which we call a tree of transitive tournaments. The latter is a block graph combining in a tree-like structure a finite number of transitive tournaments, each of which is a DAG in which every two nodes are connected. We study the limit of the joint tails of the max-linear model conditionally on the event that a given variable exceeds a high threshold. Under a suitable condition, the limiting distribution involves the factorization into independent increments along the shortest trail between two variables, thereby imitating the behaviour of a Markov random field.
We are also interested in the identifiability of the model parameters in the case when some variables are latent and only a subvector is observed. It turns out that the parameters are identifiable under a criterion on the nodes carrying the latent variables which is easy and quick to check.
We study the local convergence of critical Galton–Watson trees under various conditionings. We give a sufficient condition, which serves to cover all previous known results, for the convergence in distribution of a conditioned Galton–Watson tree to Kesten’s tree. We also propose a new proof to give the limit in distribution of a critical Galton–Watson tree, with finite support, conditioned on having a large width.
Starting from loop equations, we prove that the wave functions constructed from topological recursion on families of degree $2$ spectral curves with a global involution satisfy a system of partial differential equations, whose equations can be seen as quantizations of the original spectral curves. The families of spectral curves can be parametrized with the so-called times, defined as periods on second type cycles, and with the poles. These equations can be used to prove that the WKB solution of many isomonodromic systems coincides with the topological recursion wave function, which proves that the topological recursion wave function is annihilated by a quantum curve. This recovers many known quantum curves for genus zero spectral curves and generalizes this construction to hyperelliptic curves.
We find closed formulas for arbitrarily high mixed moments of characteristic polynomials of the Alternative Circular Unitary Ensemble, as well as closed formulas for the averages of ratios of characteristic polynomials in this ensemble. A comparison is made to analogous results for the Circular Unitary Ensemble. Both moments and ratios are studied via symmetric function theory and a general formula of Borodin-Olshanski-Strahov.
The total energy of an eigenstate in a composite quantum system tends to be distributed equally among its constituents. We identify the quantum fluctuation around this equipartition principle in the simplest disordered quantum system consisting of linear combinations of Wigner matrices. As our main ingredient, we prove the Eigenstate Thermalisation Hypothesis and Gaussian fluctuation for general quadratic forms of the bulk eigenvectors of Wigner matrices with an arbitrary deformation.
Extreme value theory plays an important role in providing approximation results for the extremes of a sequence of independent random variables when their distribution is unknown. An important one is given by the generalised Pareto distribution $H_\gamma(x)$ as an approximation of the distribution $F_t(s(t)x)$ of the excesses over a threshold t, where s(t) is a suitable norming function. We study the rate of convergence of $F_t(s(t)\cdot)$ to $H_\gamma$ in variational and Hellinger distances and translate it into that regarding the Kullback–Leibler divergence between the respective densities.
In this note, we give a precise description of the limiting empirical spectral distribution for the non-backtracking matrices for an Erdős-Rényi graph $G(n,p)$ assuming $np/\log n$ tends to infinity. We show that derandomizing part of the non-backtracking random matrix simplifies the spectrum considerably, and then, we use Tao and Vu’s replacement principle and the Bauer-Fike theorem to show that the partly derandomized spectrum is, in fact, very close to the original spectrum.
We study the problem of detecting the community structure from the generalized stochastic block model with two communities (G2-SBM). Based on analysis of the Stieljtes transform of the empirical spectral distribution, we prove a Baik–Ben Arous–Péché (BBP)-type transition for the largest eigenvalue of the G2-SBM. For specific models, such as a hidden community model and an unbalanced stochastic block model, we provide precise formulas for the two largest eigenvalues, establishing the gap in the BBP-type transition.
Arithmetic quasidensities are a large family of real-valued set functions partially defined on the power set of $\mathbb {N}$, including the asymptotic density, the Banach density and the analytic density. Let $B \subseteq \mathbb {N}$ be a nonempty set covering $o(n!)$ residue classes modulo $n!$ as $n\to \infty $ (for example, the primes or the perfect powers). We show that, for each $\alpha \in [0,1]$, there is a set $A\subseteq \mathbb {N}$ such that, for every arithmetic quasidensity $\mu $, both A and the sumset $A+B$ are in the domain of $\mu $ and, in addition, $\mu (A + B) = \alpha $. The proof relies on the properties of a little known density first considered by Buck [‘The measure theoretic approach to density’, Amer. J. Math.68 (1946), 560–580].
Let G be a real Lie group, $\Lambda <G$ a lattice and $H\leqslant G$ a connected semisimple subgroup without compact factors and with finite center. We define the notion of H-expanding measures $\mu $ on H and, applying recent work of Eskin–Lindenstrauss, prove that $\mu $-stationary probability measures on $G/\Lambda $ are homogeneous. Transferring a construction by Benoist–Quint and drawing on ideas of Eskin–Mirzakhani–Mohammadi, we construct Lyapunov/Margulis functions to show that H-expanding random walks on $G/\Lambda $ satisfy a recurrence condition and that homogeneous subspaces are repelling. Combined with a countability result, this allows us to prove equidistribution of trajectories in $G/\Lambda $ for H-expanding random walks and to obtain orbit closure descriptions. Finally, elaborating on an idea of Simmons–Weiss, we deduce Birkhoff genericity of a class of measures with respect to some diagonal flows and extend their applications to Diophantine approximation on similarity fractals to a nonconformal and weighted setting.
In this paper, we consider the convergence rate with respect to Wasserstein distance in the invariance principle for deterministic non-uniformly hyperbolic systems. Our results apply to uniformly hyperbolic systems and large classes of non-uniformly hyperbolic systems including intermittent maps, Viana maps, unimodal maps and others. Furthermore, as a non-trivial application to the homogenization problem, we investigate the Wasserstein convergence rate of a fast–slow discrete deterministic system to a stochastic differential equation.
Let $S=\{p_1, \ldots , p_r,\infty \}$ for prime integers $p_1, \ldots , p_r.$ Let X be an S-adic compact nilmanifold, equipped with the unique translation-invariant probability measure $\mu .$ We characterize the countable groups $\Gamma $ of automorphisms of X for which the Koopman representation $\kappa $ on $L^2(X,\mu )$ has a spectral gap. More specifically, let Y be the maximal quotient solenoid of X (thus, Y is a finite-dimensional, connected, compact abelian group). We show that $\kappa $ does not have a spectral gap if and only if there exists a $\Gamma $-invariant proper subsolenoid of Y on which $\Gamma $ acts as a virtually abelian group,
Let $\Gamma _{g}$ be the fundamental group of a closed connected orientable surface of genus $g\geq 2$. We develop a new method for integrating over the representation space $\mathbb {X}_{g,n}=\mathrm {Hom}(\Gamma _{g},S_{n})$, where $S_{n}$ is the symmetric group of permutations of $\{1,\ldots ,n\}$. Equivalently, this is the space of all vertex-labeled, n-sheeted covering spaces of the closed surface of genus g.
Given $\phi \in \mathbb {X}_{g,n}$ and $\gamma \in \Gamma _{g}$, we let $\mathsf {fix}_{\gamma }(\phi )$ be the number of fixed points of the permutation $\phi (\gamma )$. The function $\mathsf {fix}_{\gamma }$ is a special case of a natural family of functions on $\mathbb {X}_{g,n}$ called Wilson loops. Our new methodology leads to an asymptotic formula, as $n\to \infty $, for the expectation of $\mathsf {fix}_{\gamma }$ with respect to the uniform probability measure on $\mathbb {X}_{g,n}$, which is denoted by $\mathbb {E}_{g,n}[\mathsf {fix}_{\gamma }]$. We prove that if $\gamma \in \Gamma _{g}$ is not the identity and q is maximal such that $\gamma $ is a qth power in $\Gamma _{g}$, then
as $n\to \infty $, where $d\left (q\right )$ is the number of divisors of q. Even the weaker corollary that $\mathbb {E}_{g,n}[\mathsf {fix}_{\gamma }]=o(n)$ as $n\to \infty $ is a new result of this paper. We also prove that $\mathbb {E}_{g,n}[\mathsf {fix}_{\gamma }]$ can be approximated to any order $O(n^{-M})$ by a polynomial in $n^{-1}$.