Limiting empirical spectral distribution for the non-backtracking matrix of an Erdős-Rényi random graph

Ke Wang; Philip Matchett Wood

doi:10.1017/S096354832300024X

Limiting empirical spectral distribution for the non-backtracking matrix of an Erdős-Rényi random graph

Part of: Graph theory Probability theory on algebraic and topological structures

Published online by Cambridge University Press: 31 July 2023

Ke Wang

and

Philip Matchett Wood

Show author details

Ke Wang*: Affiliation:
Department of Mathematics, Hong Kong University of Science and Technology, Kowloon, Hong Kong
Philip Matchett Wood: Affiliation:
Department of Mathematics, Harvard University, Cambridge, MA, 02138, USA
*: Corresponding author: Ke Wang; Email: kewang@ust.hk

Article contents

Abstract
Introduction
The spectrum of $H_0$
The bulk distribution: proving Theorem
Perturbation theory: proving Theorem
Footnotes
References

Rights & Permissions

Abstract

In this note, we give a precise description of the limiting empirical spectral distribution for the non-backtracking matrices for an Erdős-Rényi graph $G(n,p)$ assuming $np/\log n$ tends to infinity. We show that derandomizing part of the non-backtracking random matrix simplifies the spectrum considerably, and then, we use Tao and Vu’s replacement principle and the Bauer-Fike theorem to show that the partly derandomized spectrum is, in fact, very close to the original spectrum.

Keywords

Random graph non-backtracking matrix eigenvalues

MSC classification

Primary: 05C80: Random graphs

Secondary: 60B20: Random matrices (probabilistic aspects; for algebraic aspects see )

Information

Type: Paper
Information: Combinatorics, Probability and Computing , Volume 32 , Issue 6 , November 2023 , pp. 956 - 973

DOI: https://doi.org/10.1017/S096354832300024X [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright: © The Author(s), 2023. Published by Cambridge University Press

1. Introduction

For a simple undirected graph $G=(V,E)$ , the non-backtracking matrix is defined as follows. For each $(i,j) \in E$ , form two directed edges $i \to j$ and $j \to i$ . The non-backtracking matrix $B$ is a $2|E| \times 2|E|$ matrix such that

\begin{equation*}B_{i\to j, k\to l}= \begin {cases} 1 & \text {if}\ j=k \, \text {and}\ i\neq l \\[5pt] 0 & \text {otherwise}. \end {cases} \end{equation*}

The central question of the current paper is the following:

Question 1.1. What can be said about the eigenvalues of the non-backtracking matrix $B$ of random graphs as $|V|\to \infty$ ?

The non-backtracking matrix was proposed by Hashimoto [Reference Hashimoto17]. The spectrum of the non-backtracking matrix for random graphs was studied by Angel, Friedman, and Hoory [Reference Angel, Friedman and Hoory2] in the case where the underlying graph is the tree covering of a finite graph. Motivated by the question of community detection (see [Reference Krzakala, Moore and Mossel23, Reference Massoulié29–Reference Mossel, Neeman and Sly31]), Bordenave, Lelarge, and Massoulié [Reference Bordenave, Lelarge and Massoulié8] determined the size of the largest eigenvalue and gave bounds for the sizes of all other eigenvalues for non-backtracking matrices when the underlying graph is drawn from a generalization of Erdős-Rényi random graphs called the Stochastic Block Model (see [Reference Holland, Laskey and Leinhardt19]), and this work was further extended to the Degree-Corrected Stochastic Block Model (see [Reference Karrer and Newman24]) by Gulikers, Lelarge, and Massoulié [Reference Gulikers, Lelarge and Massoulié15]. In recent work, Benaych-Georges, Bordenave, and Knowles [Reference Benaych-Georges, Bordenave and Knowles5] studied the spectral radii of the sparse inhomogeneous Erdős-Rényi graph through a novel application of non-backtracking matrices. Stephan and Massoulié [Reference Stephan and Massoulié32] also conducted a study on the non-backtracking spectra of weighted inhomogeneous random graphs.

In the current paper, we give a precise characterization of the limiting distribution of the eigenvalues for the non-backtracking matrix when the underlying graph is the Erdős-Rényi random graph $G(n,p)$ , where each edge $ij$ is present independently with probability $p$ , and where we exclude loops (edges of the form $ii$ ). We will allow $p$ to be constant or decreasing sublinearly with $n$ , which contrasts to the bounds proved in [Reference Bordenave, Lelarge and Massoulié8] and [Reference Gulikers, Lelarge and Massoulié15] corresponding to the case $p=c/n$ with $c$ a constant. Let $A$ be the adjacency matrix of $G(n,p)$ , so $A_{ij}=1$ exactly when edge $ij$ is part of the graph $G$ and $A_{ij}=0$ otherwise, and let $D$ the diagonal matrix with $D_{ii}=\sum _{j=1}^n A_{ij}$ . Much is known about the eigenvalues of $A$ , going back to works of Wigner in the 1950s [Reference Wigner38, Reference Wigner39] (see also [Reference Grenander16] and [Reference Arnold3]), who proved that the distribution of eigenvalues follows the semicircular law for any constant $p\in (0,1)$ . More recent results have considered the case where $p$ tends to zero, making the random graph sparse. It is known that assuming $np\to \infty$ , the empirical spectral distribution (ESD) of the adjacency matrix $A$ converges to the semicircle distribution (see for example [Reference Khorunzhy and Pastur25] or [Reference Tran, Van and Wang35]). Actually, much stronger results have been proved about the eigenvalues of $A$ (see the surveys [Reference Vu37] and [Reference Benaych-Georges and Knowles6]). For example, Erdős, Knowles, Yau, and Yin [Reference Erdős, Knowles, Yau and Yin12] proved that as long as there is a constant $C$ so that $np > (\log n)^{C \log \log n}$ (and thus $np\to \infty$ faster than logarithmic speed), the eigenvalues of the adjacency matrix $A$ satisfy a result called the local semicircle law. This law characterizes the distribution of the eigenvalues in small intervals that shrink as the size of the matrix $n$ increases. The most recent development regarding the local semicircle law can be found in [Reference He, Knowles and Marcozzi14] and [Reference Alt, Ducatez and Knowles1].

It has been shown in [Reference Angel, Friedman and Hoory2, Reference Bass4, Reference Hashimoto17] (for example, Theorem 1.5 from [Reference Angel, Friedman and Hoory2]) that the spectrum of $B$ is the set $\{ \pm 1\} \cup \{\mu : \det (\mu ^2 I - \mu A + D-I) =0\}$ , or equivalently, the set $\{ \pm 1\} \cup \{\text{eigenvalues of } H \},$ where

(1.1)

\begin{equation} H = \begin{pmatrix} A & I-D\\[5pt] I & 0 \end{pmatrix}. \end{equation}

We will call this $2n \times 2n$ matrix $H$ the non-backtracking spectrum operator for $A$ , and we will show that the spectrum of $H$ may be precisely described, thus giving a precise description of the eigenvalues of the non-backtracking matrix $B$ . We will study the eigenvalues of $H$ in two regions: the dense region, where $p \in (0,1)$ and $p$ is fixed constant; and the sparse region, where $p=o(1)$ and $np \to \infty$ . The diluted region, where $p=c/n$ for some constant $c>1$ , is the region for which the bounds in [Reference Bordenave, Lelarge and Massoulié8] and [Reference Gulikers, Lelarge and Massoulié15] apply, and, as pointed out by [Reference Bordenave, Lelarge and Massoulié8], it would be interesting to determine the limiting eigenvalue distribution of $H$ in this region.

Note that ${\mathbb E }(D) = (n-1)p I$ , and so we will let

\begin{equation*}\alpha = (n-1)p-1\end{equation*}

and consider the partly averaged matrix

(1.2)

\begin{equation} H_0= \begin{pmatrix} A & I-{\mathbb E }(D)\\[5pt] I & 0 \end{pmatrix} = \begin{pmatrix} A & -\alpha I\\[5pt] I & 0 \end{pmatrix}. \end{equation}

The partly averaged matrix $H_0$ will be an essential tool in quantifying the eigenvalues of the non-backtracking spectrum operator $H$ . Three main ideas are at the core of this paper: first, that partial derandomization can greatly simplify the spectrum; second, that Tao and Vu’s replacement principle [Reference Tao and Van34, Theorem 2.1] can be usefully applied to two sequences of random matrices that are highly dependent on each other; and third, that in this case, the partly derandomized matrix may be viewed as a small perturbation of the original matrix, allowing one to apply results from perturbation theory like the Bauer-Fike Theorem. The use of Tao and Vu’s replacement principle here is novel, as it is used to compare the spectra of a sequence of matrices with some dependencies among the entries to a sequence of matrices where all random entries are independent; typically, the Tao-Vu replacement principle has been applied in cases where the two sequences of random matrices both have independent entries, see for example [Reference Tao and Van34, Reference Wood40, Reference Wood41].

1.1 Results

Throughout the rest of this paper, we mainly focus on sparse random graphs where $p\in (0,1)$ may tend to zero as $n\to \infty$ . Our first result shows that the spectrum of $H_0$ can be determined very precisely in terms of the spectrum of the random Hermitian matrix $A$ , which is well-understood.

Proposition 1.2 (Spectrum of the partly averaged matrix). Let $H_0$ be defined as in (1.2), and let $0< p\le p_0<1$ for a constant $p_0$ . If $p\ge C/\sqrt{n}$ for some large constant $C>0$ , then, with probability $1-o(1)$ , $\frac{1}{\sqrt{\alpha }} H_0$ has two real eigenvalues $\mu _1$ and $\mu _2$ satisfying $\mu _1=\sqrt{\alpha } (1+o(1))$ and $\mu _2 = 1/\sqrt{np} (1+o(1))$ ; all other eigenvalues for $\frac{1}{\sqrt{\alpha }} H_0$ are complex with magnitude $1$ and occur in complex conjugate pairs. If $np \to \infty$ with $n$ , then the real parts of the eigenvalues in the circular arcs are distributed according to the semicircle law.

Remark 1.3. When $n^{-1+\epsilon }\le p\le n^{-1/2}$ , more real eigenvalues of $H_0$ will emerge. We provide a short discussion on the real eigenvalues of $H_0$ in Section 2.2. Note that as long as the number of real eigenvalues is bounded by a fixed constant, for example when $p \ge C/\sqrt n$ , the bulk distribution of $H_0$ is two arcs on the unit circle, with density so that real parts of the eigenvalues follow the semicircular law.

The spectrum of the non-backtracking matrix for a degree regular graph was studied in [Reference Bordenave7], including proving some precise eigenvalue estimates. One can view Proposition 1.2 as extending this general approach by using averaged degree counts, but allowing the graph to no longer be degree regular. Thus, Proposition 1.2 shows that partly averaging $H$ to get $H_0$ is enough to allow the spectrum to be computed very precisely. Our main results are the theorems below, which show that the empirical spectral measures $\mu _H$ for $H$ and $\mu _{H_0}$ for $H_0$ are very close to each other, even for $p$ a decreasing function of $n$ . (The definitions of the measure $\mu _M$ for a matrix $M$ and the definition of almost sure convergence of measures are given in Section 1.3).

Theorem 1.4. Let $A$ be the adjacency matrix for an Erdős-Rényi random graph $G(n,p)$ . Assume $0< p\le p_0<1$ for a constant $p_0$ and $np/\log n \to \infty$ with $n$ . Let $\frac{1}{\sqrt{\alpha }} H$ be a rescaling of the non-backtracking spectrum operator for $A$ defined in ( 1.1 ) with $\alpha = (n-1)p-1$ , and let $\frac{1}{\sqrt{\alpha }} H_0$ be its partial derandomization, defined in ( 1.2 ). Then, $\mu _{\frac{1}{\sqrt{\alpha }}H} - \mu _{\frac{1}{\sqrt{\alpha }} H_0}$ converges almost surely (thus, also in probability) to zero as $n$ goes to infinity.

Remark 1.5. When $p\gg \log n/n$ , the graph $G(n,p)$ is almost a random regular graph and thus $H_0$ appears to be a good approximation of $H$ . When $p$ becomes smaller, such an approximation is no longer accurate. In this sense, Theorem 1.4 is optimal.

In Figure 1, we plot the eigenvalues of $\frac{1}{\sqrt{\alpha }}H$ and $\frac{1}{\sqrt{\alpha }}H_0$ for an Erdős-Rényi random graph $G(n,p)$ , where $n=500$ . The blue circles mark the eigenvalues for $H/\sqrt{\alpha }$ and the red x’s mark the eigenvalues for $H_0/\sqrt{\alpha }$ . We can see that the empirical spectral measures of $H/\sqrt{\alpha }$ and $H_0/\sqrt{\alpha }$ are very close for $p$ not too small. As $p$ becomes smaller (note that here $\log n/n\approx 0.0054$ ), the eigenvalues of $H_0/\sqrt{\alpha }$ still lie on the arcs of the unit circle whereas the eigenvalues of $H/\sqrt{\alpha }$ start to escape and be attracted to the inside of the circle.

Figure 1. The eigenvalues of $H/\sqrt{\alpha }$ defined in (1.1) and $H_0/\sqrt{\alpha }$ defined in (1.2) for a sample of $G(n,p)$ with $n=500$ and different values of $p$ . The blue circles are the eigenvalues of $H/\sqrt{\alpha }$ , and the red x’s are for $H_0/\sqrt{\alpha }$ . For comparison, the black dashed line is the unit circle. For the figures from top to bottom and from left to right, the values of $p$ are taken to be $p=0.5, p=0.1, p=0.08$ and $p=0.05$ , respectively.

To prove that the bulk eigenvalue distributions converge in Theorem 1.4, we will use Tao and Vu’s replacement principle [Reference Tao and Van34, Theorem 2.1] (see also Theorem 3.2), which was a key step in proving the circular law. The replacement principle lets one compare eigenvalue distributions of two sequences of random matrices, and it has often been used in cases where one type of random input – for example, standard Gaussian normal entries – is replaced by a different type of random input – for example, arbitrary mean 0, variance 1 entries. This is how the replacement principle was used to prove the circular law in [Reference Tao and Van34], and it was used similarly in, for example, [Reference Wood40, Reference Wood41]. The application of the replacement principle in the current paper is distinct in that the entries for one ensemble of random matrices, namely $H$ , has some dependencies between the entries, whereas the random entries of $H_0$ are all independent.

Our third result (Theorem 1.6 below) proves that all eigenvalues of $H$ are close to those of $H_0$ with high probability when $p\gg \frac{\log ^{2/3} n}{n^{1/6}}$ , which implies that there are no outlier eigenvalues of $H$ , that is, no eigenvalues of $H$ that are far outside the support of the spectrum of $H_0$ (described in Theorem 1.2).

Theorem 1.6. Assume $0<p\le p_0<1$ for a constant $p_0$ and $p\ge \frac{\log ^{2/3+\varepsilon } n}{n^{1/6}}$ for $\varepsilon >0$ . Let $A$ be the adjacency matrix for an Erdős-Rényi random graph $G(n,p)$ . Let $\frac{1}{\sqrt{\alpha }} H$ be a rescaling of the non-backtracking spectrum operator for $A$ defined in ( 1.1 ). Then, with probability $1-o(1)$ , each eigenvalue of $\frac{1}{\sqrt{\alpha }} H$ is within distance $R= 40\sqrt{\frac{\log n}{np^2}}$ of an eigenvalue of $\frac{1}{\sqrt{\alpha }} H_0$ , defined in ( 1.2 ).

In the upcoming Section 2, it will be demonstrated that eigenvalues in the bulk of the distributions for $\frac{1}{\sqrt{\alpha }} H$ and for $\frac{1}{\sqrt{\alpha }} H_0$ have absolute value 1. Since $p\gg \frac{\log ^{2/3} n}{n^{1/6}}$ , we have $R=40\sqrt{\frac{\log n}{np^2}}=o(1)$ , and consequently, Theorem 1.6 provides informative results. We would like to mention that the above result has been improved to hold for $p \gg \log n/n$ and that each eigenvalue of $\frac{1}{\sqrt{\alpha }} H$ is within distance $ O((\frac{\log n}{np})^{1/4})$ of an eigenvalue of $\frac{1}{\sqrt{\alpha }} H_0$ , using a variant of Bauer-Fike perturbation theorem that appeared later in [Reference Coste and Zhu11, Corollary 2.4], as opposed to invoking the classical Bauer-Fike theorem in this paper (see Theorem 4.1).

1.2 Outline

We will describe the ESD of the partly averaged matrix $H_0$ to prove Proposition 1.2 in Section 2. In Section 3, we will show that the ESDs of $H$ and $H_0$ approach each other as $n$ goes to infinity by using the replacement principle [Reference Tao and Van34, Theorem 2.1] and in Section 4 we will use the Bauer-Fike theorem to prove Theorem 1.6, showing that the partly averaged matrix $H_0$ has eigenvalues close to those of $H$ in the limit as $n\to \infty$ .

1.3 Background definitions

We give a few definitions to make clear the convergence described in Theorem 1.4 between empirical spectral distribution measures of $H$ and $H_0$ . For an $n\times n$ matrix $M_n$ with eigenvalues $\lambda _1, \ldots, \lambda _n$ , the empirical spectral measure $\mu _{M_n}$ of $M_n$ is defined to be

\begin{equation*}\mu _{M_n} =\frac {1}{n} \sum _{i=1}^n \delta _{\lambda _i},\end{equation*}

where $\delta _x$ is the Dirac delta function with mass 1 at $x$ . Note that $\mu _{M_n}$ is a probability measure on the complex numbers $\mathbb C$ . The empirical spectral distribution (ESD) for $M_n$ is defined to be

\begin{equation*}F^{M_n}(x,y)=\frac {1}{n} \#\left \{\lambda _i \;:\; {\mathsf {Re}}(\lambda _i) \le x \mbox { and } {\mathsf {Im}}(\lambda _i) \le y \right \}.\end{equation*}

For $T$ a topological space (for example $\mathbb R$ or $\mathbb C$ ) and $\mathcal B$ its Borel $\sigma$ -field, we can define convergence of a sequence $(\mu _n)_{n \ge 1}$ of random probability measures on $(T,\mathcal B)$ to a nonrandom probability measure $\mu$ also on $(T,\mathcal B)$ as follows. We say that $\mu _n$ converges weakly to $\mu$ in probability as $n \to \infty$ (written $\mu _n \to \mu$ in probability) if for all bounded continuous functions $f\;:\; T \to \mathbb R$ and all $\epsilon >0$ we have

\begin{equation*}{\mathbb {P}}\left (\left \lvert {\int _T f \, d\mu _n - \int _T f\, d\mu }\right \rvert > \epsilon \right ) \to 0 \mbox { as } n \to \infty .\end{equation*}

Also, we say that $\mu _n$ converges weakly to $\mu$ almost surely as $n \to \infty$ (written $\mu _n \to \mu$ a.s.) if for all bounded continuous functions $f\;:\;T \to \mathbb R$ , we have that $\left \lvert{\int _T f \, d\mu _n - \int _T f\, d\mu }\right \rvert \to 0$ almost surely as $n \to \infty$ .

We will use $\|A\|_F \;:\!=\; \textrm{tr} \; ( AA^*)^{1/2}$ to denote the Frobenius norm or Hilbert-Schmidt norm, and $\left \|A\right \|$ to denote the operator norm. We denote $\|A\|_{\max }=\max _{ij}|a_{ij}|$ . We use the notation $o(1)$ to denote a small quantity that tends to zero as $n$ goes to infinity. We use the asymptotic notation $f(n)\ll g(n)$ if $f(n)/g(n)=o(1)$ and we write $f(n)= o(g(n))$ ; furthermore, we write $f(n)=O(g(n))$ if $f(n)\le C g(n)$ for a constant $C>0$ when $n$ is sufficiently large. Finally, we will use $I$ or $I_n$ to denote the identity matrix, where the subscript $n$ will be omitted when the dimension can be inferred by context.

2. The spectrum of $H_0$

We are interested in the limiting ESD of $H$ when $H$ is scaled to have bounded support (except for one outlier eigenvalue), and so we will work with the following rescaled conjugation of $H$ , which has the same eigenvalues as $H/\sqrt{\alpha }$ .

\begin{equation*} \widetilde {H}\;:\!=\;\frac {1}{\sqrt {\alpha }}\left ( \begin {array}{c@{\quad}c} \frac {1}{\sqrt {\alpha }}I & 0\\[5pt] 0&I \end {array} \right ) \left ( \begin {array}{c@{\quad}c} A &I-D \\[5pt] I &0 \end {array} \right ) \left ( \begin {array}{c@{\quad}c} \sqrt {\alpha } I & 0\\[5pt] 0 &I \end {array} \right )= \left ( \begin {array}{c@{\quad}c} \frac {1}{\sqrt {\alpha }}A & \frac {1}{\alpha }(I-D) \\[5pt] I &0 \end {array} \right ). \end{equation*}

Note that the diagonal matrix $\frac{1}{\alpha }(I-D)$ is equal to $-I$ in expectation, and so we will compare the eigenvalues of $\widetilde H$ to those of the partly averaged matrix $\widetilde H_0$ , noting that $\widetilde H = \widetilde H_0 + E$ , where

(2.1)

\begin{equation} \widetilde H_0 \;:\!=\; \left ( \begin{array}{c@{\quad}c} \frac{1}{\sqrt{\alpha }} A & -I \\[5pt] I & 0 \end{array} \right )\qquad \mbox{ and } \qquad E\;:\!=\; \left ( \begin{array}{c@{\quad}c} 0 & I+\frac{1}{\alpha }(I-D) \\[5pt] 0 & 0 \end{array} \right ). \end{equation}

Note that $H_0/\sqrt{\alpha }$ and $\widetilde H_0$ also have identical eigenvalues.

We will show that $\widetilde H_0$ is explicitly diagonalizable in terms of the eigenvectors and eigenvalues of $\frac{1}{\sqrt{\alpha }} A$ , and then use this information to find an explicit form for the characteristic polynomial for $\widetilde H_0$ .

2.1 Spectrum of $\widetilde H_0$ : proof of Proposition 1.2

Since $\frac{1}{\sqrt{\alpha }} A$ is a real symmetric matrix, it has a set $v_1,\ldots, v_n$ of orthonormal eigenvectors with corresponding real eigenvalues $\lambda _1 \ge \lambda _2 \ge \ldots \ge \lambda _n$ . Thus we may write $\frac{A}{\sqrt \alpha } = U^{T}{\textrm{diag}}(\lambda _1,\ldots,\lambda _n) U$ where $U$ is an orthogonal matrix. Consider the matrix $xI - \widetilde H_0$ , and note that

\begin{equation*} \begin {pmatrix} I & 0\\[5pt] -xI & I \end {pmatrix} (xI - \widetilde H_0) = \begin {pmatrix} xI-\frac {1}{\sqrt \alpha } A & I\\[5pt] -x\big (xI-\frac {1}{\sqrt \alpha } A\big ) -I & 0 \end {pmatrix}, \end{equation*}

we see that $\det (xI-\widetilde H_0) = \det (I + x(xI - \frac{1}{\sqrt{\alpha }} A) ) = \det (x^2 I - \frac{x}{\sqrt{\alpha }} A + I)$ . Conjugating to diagonalize $A$ , we see that

(2.2)

\begin{equation} \det (xI-\widetilde H_0) = \det (x^2 I - x{\textrm{diag}}(\lambda _1,\ldots,\lambda _n) + I) = \prod _{i=1}^n (x^2 - \lambda _i x + 1). \end{equation}

With the characteristic polynomial for $\widetilde H_0$ factored into quadratics as in (2.2), we see that for each $\lambda _i$ of $\frac{1}{\sqrt{\alpha }} A$ , there are two eigenvalues $\mu _{2i-1}$ and $\mu _{2i}$ for $\widetilde H_0$ which are the two solutions to $x^2-\lambda _i x +1=0$ ; thus,

(2.3)

\begin{equation} \mu _{2i-1} = \frac{\lambda _i + \sqrt{ \lambda _i^2 - 4}}{2} \qquad \mbox{ and } \qquad \mu _{2i} = \frac{\lambda _i - \sqrt{ \lambda _i^2 - 4}}{2}. \end{equation}

The eigenvalues of $A$ are well-understood. We use the following results that exist in literature.

Theorem 2.1 ([Reference Krivelevich and Sudakov26, Reference Lee and Schnelli27]). Let $A$ be the adjacency matrix for an Erdős-Rényi random graph $G(n,p)$ . Assume $0< p\le p_0<1$ for a constant $p_0$ and $p\ge n^{-1+\phi }$ for a small constant $\phi >0$ . Then for any $\epsilon >0$ , the following holds with probability $1-o(1)$ :

\begin{align*} \lambda _1(A) = np(1+o(1)); \end{align*}

\begin{align*} \max _{2\le i \le n}|\lambda _i(A)+p| \le L\sqrt{np(1-p)}+n^{\epsilon } \sqrt{np}\left (\frac{1}{(np)^2} + \frac{1}{n^{2/3}} \right )\!, \end{align*}

where $L=2+ \frac{s^{(4)}}{np} + O(\frac{1}{(np)^2})$ and $s^{(4)} =n^2p \left [ \frac{p^3 + (1-p)^3}{n^2 p(1-p)}-\frac{3}{n^2}\right ].$

Proof. We collect relevant results regarding the eigenvalues of $A$ from different works in the literature. In [Reference Krivelevich and Sudakov26], it is shown that with probability $1-o(1)$ , $\lambda _1(A) = (1+o(1))\max \{np, \sqrt{\Delta } \}$ where $\Delta$ is the maximum degree. As long as $np/\log n \to \infty$ , $\max \{np, \sqrt{\Delta } \} = np$ (for the bounds on $\Delta$ see, for instance, the proof of Lemma 3.5 below).

The operator norm of $A-{\mathbb E } A$ and the extreme eigenvalues of $A$ have been studied in various works (see [Reference Benaych-Georges, Bordenave and Knowles5, Reference Erdős, Knowles, Yau and Yin12, Reference Füredi and Komlós13, Reference He and Knowles18, Reference Huang, Landon and Yau21, Reference Lee and Schnelli27, Reference Vu36]). In particular, in [Reference Lee and Schnelli27, Theorem 2.9], assuming $p\ge n^{-1 + \phi }$ , the authors proved that for any $\epsilon >0$ and $C>0$ , the following estimate holds with probability at least $1-n^{-C}$ :

\begin{equation*}\left | \frac {1}{\sqrt {np(1-p)}} \| A - {\mathbb E } A \| -L \right | \le n^{\epsilon } \left (\frac {1}{(np)^2} + \frac {1}{n^{2/3}} \right )\end{equation*}

with $L=2+ \frac{s^{(4)}}{np} + O(\frac{1}{(np)^2})$ and $s^{(4)} =n^2p \left [ \frac{p^3 + (1-p)^3}{n^2 p(1-p)}-\frac{3}{n^2}\right ] = 1+ O(p).$ The conclusion of the theorem follows immediately from the classical Weyl’s inequality that $\max _{2\le i \le n}|\lambda _i(A)+p|=\max _{2\le i \le n}|\lambda _i(A)-\lambda _i({\mathbb E } A)|\le \|A-{\mathbb E } A\|$ .

Now we are ready to derive Proposition 1.2.

Proof of Proposition 1.2. Note that $\lambda _i=\lambda _i(A)/\sqrt{\alpha }$ and $\alpha =(n-1)p-1$ . We have that

(2.4)

\begin{align} \lambda _1=\sqrt{np}(1+o(1)) \quad \text{and}\quad \max _{2\le i \le n}|\lambda _i| \le 2\sqrt{1-p}(1+o(1)) \end{align}

with probability $1-o(1)$ by Theorem 2.1. Therefore, for $\lambda _1$ , we see from (2.3) that $\mu _1, \mu _2$ are real eigenvalues and

\begin{align*} \mu _1=\sqrt{np}(1+o(1)) \quad \text{and} \quad \mu _2=\frac{1}{\sqrt{np}}(1+o(1)) \end{align*}

with probability $1-o(1)$ . Next, by Theorem 2.1, it holds with probability $1-o(1)$ for any $2\le i \le n$ that

\begin{align*} \lambda _i^2=\frac{\lambda _i^2(A)}{\alpha } \le \frac{1}{\alpha }\left [ L\sqrt{np(1-p)}+p+n^{\epsilon } \sqrt{np}\left (\frac{1}{(np)^2} + \frac{1}{n^{2/3}} \right )\right ]^2. \end{align*}

Since $p \ge C/\sqrt{n}$ for a sufficiently large constant $C$ , we have

\begin{align*} \lambda _i^2 &\le \frac{1}{\alpha } \left [ 2\sqrt{np(1-p)}+ O(\max \{p,(np)^{-1/2}, p^{1/2} n^{-1/6+\epsilon }\}) \right ]^2\\[5pt] &=\frac{4 np (1-p) + O(\max \{p \sqrt{np},1, pn^{1/3 + \epsilon }\})}{np-(p+1)}\\[5pt] &=\frac{4 (1-p) + O(\max \{\sqrt{p/n},(np)^{-1}, n^{-2/3 + \epsilon }\})}{1-\frac{p+1}{np}} \\[5pt] &= 4(1-p)+O\left (\max \{\sqrt{p/n},(np)^{-1}, n^{-2/3 + \epsilon }\}\right ) + O\left (\frac{1}{np}\right )\le 4-3p, \end{align*}

for all sufficiently large $n$ . Hence, for all $i\ge 2$ , we have $\lambda _i^2 <4$ and thus $\mu _{2i-1}, \mu _{2i}$ are complex eigenvalues with magnitude $1$ (since $|\mu _{2i-1}|=|\mu _{2i}|=1$ ). One should also note that $\mu _{2i-1}\mu _{2i} = 1$ for every $i$ , and that whenever $\mu _{2i-1}$ is complex (i.e., $i\ge 2$ ), its complex conjugate is $\overline \mu _{2i-1}=\mu _{2i}$ .

Furthermore, note that ${\mathsf{Re}}\mu _{2i-1}={\mathsf{Re}} \mu _{2i} =\lambda _i/2 = \lambda _i(A)/2\sqrt{\alpha }$ . It is known that the empirical spectral measure of $A/\sqrt{np(1-p)}$ converges to the semicircular law supported on $[-2,2]$ assuming $np\to \infty$ (see for instance [Reference Khorunzhy and Pastur25] or [Reference Tran, Van and Wang35]). We have the ESD of the scaled real parts of $\mu _j$

\begin{equation*}\frac {1}{2n}\sum _{j=1}^{2n} \delta _{\frac {2{\mathsf {Re}}\mu _{j}}{\sqrt {1-p}}} \to \mu _{\text{sc}}\end{equation*}

weakly almost surely where $\mu _{\text{sc}}$ is the semicircular law supported on $[-2,2]$ . The proof of Proposition 1.2 is now complete.

2.2 Real eigenvalues of $\widetilde H_0$ when $p\le n^{-1/2}$

As mentioned in Remark 1.3, when $p$ becomes smaller than $n^{-1/2}$ , more real eigenvalues of $\widetilde H_0$ will emerge. We can identify some of these eigenvalues, using recent results of [Reference Erdős, Knowles, Yau and Yin12, Reference He and Knowles18, Reference Huang, Landon and Yau21, Reference Lee and Schnelli27] in the study of the extreme eigenvalues of $A$ . For instance, in [Reference Lee and Schnelli27, Corollary 2.13], assume $n^{2\phi -1} \le p \le n^{-2\phi '}$ for $\phi >1/6$ and $\phi '>0$ . Then

(2.5)

\begin{align} \lim _{n\to \infty }{\mathbb{P}}\left ( n^{2/3} \Big (\frac{1}{\sqrt{np(1-p)}}\lambda _2(A) - \mathcal L -a \Big )\le s\right ) = F_{1}^{\text{TW}}(s), \end{align}

where $\mathcal L= 2+\frac{1}{np} + O(\frac{1}{n^2p^2})$ , $a=\sqrt{\frac{p}{n(1-p)}}$ and $F_1^{\text{TW}}(s)$ is the Tracy-Widom distribution function. Therefore, when $p\ge n^{-2/3+\epsilon }$ , by noting that $F_1^{\text{TW}}(s) \to 1$ as $s\to \infty$ and selecting $s$ to be a large constant in (2.5), we see that

\begin{equation*}\lambda _2(A) = 2\sqrt {np(1-p)} + p + \sqrt {\frac {1-p}{np}} + O\left (\frac {\sqrt {np}}{n^{2/3}}\right ).\end{equation*}

Note that if $p<\frac{1-p}{n^{1/3}}$ , then $p<\sqrt{\frac{1-p}{np}}$ and thus for $ n^{-2/3+\epsilon }\le p \le n^{-1/2}\ll n^{-1/3}$ ,

\begin{align*} \lambda _2^2 -4 &= \left (\frac{\lambda _2(A)}{\sqrt \alpha } \right )^2-4 = \frac{\left (2\sqrt{np(1-p)} + \sqrt{\frac{1-p}{np}} +p+ O\big (\frac{\sqrt{np}}{n^{2/3}} \big ) \right )^2}{np-(p+1)}-4\\[5pt] &= \frac{\left (2\sqrt{np(1-p)} + \sqrt{\frac{1-p}{np}} +p \right )^2+O\left (\frac{np}{n^{2/3}} \right )}{np-(p+1)}-4\\[5pt] &= \frac{4np(1-p) + 4(1-p) + 4p\sqrt{np(1-p)}+O(\frac{np}{n^{2/3}})}{np-(p+1)}-4\\[5pt] &=\frac{4(1-p) +\frac{4(1-p)}{np} + 4p \sqrt{\frac{1-p}{np}}+O\left (\frac{1}{n^{2/3}} \right )}{1-\frac{p+1}{np}}-4\\[5pt] &=-4p + 4p \sqrt{\frac{1-p}{np}}+ \frac{4(1-p)(2+p)}{np} +O(n^{-2/3}) > 0. \end{align*}

Hence, from (2.3), both $\mu _3$ and $\mu _4$ are real. The convergence result (2.5) holds for finitely many extreme eigenvalues of $A$ and thus they also generate real eigenvalues for $\widetilde H_0$ .

The fluctuation of the extreme eigenvalues of $A$ has been obtained in [Reference Huang, Landon and Yau21, Corollary 1.5] for $n^{-7/9} \ll p \ll n^{-2/3}$ and in [Reference He and Knowles18] for the remaining range of $p$ up to $p\ge n^{-1+\epsilon }$ . One could use similar discussion as above to extract information about the real eigenvalues of $\widetilde H_0$ . The details are omitted.

2.3 $\widetilde H_0$ is diagonalizable

We can now demonstrate an explicit diagonalization for $\widetilde H_0$ . Since $\mu _{2i-1}$ and $\mu _{2i}$ are solutions to $\mu ^2-\mu \lambda _i +1=0$ , one can check that the following vectors

(2.6)

\begin{align} y_{2i-1}^*=\frac{1}{\sqrt{1+|\mu _{2i-1}|^2}}\left (-\mu _{2i-1} v_i^T \quad v_i^T\right ) \quad \mbox{ and }\quad y_{2i}^*=\frac{1}{\sqrt{1+|\mu _{2i}|^2}}\left (-\mu _{2i}v_i^T \quad v_i^T\right ) \end{align}

satisfy $y_{2i-1}^* \widetilde H_0 = \mu _{2i-1} y_{2i-1}^*$ and $y_{2i}^* \widetilde H_0 = \mu _{2i} y_{2i}^*$ for all $i$ . Furthermore, $y_{2i-1}$ and $y_{2i}$ are unit vectors. For $1\le i\le n$ , define the vectors

(2.7)

\begin{equation} x_{2i-1}=\frac{\sqrt{1+|\mu _{2i-1}|^2}}{\mu _{2i}-\mu _{2i-1}}\begin{pmatrix} v_i \\[5pt] \mu _{2i} v_i \end{pmatrix} \quad \text{ and }\quad x_{2i}=\frac{\sqrt{1+|\mu _{2i}|^2}}{\mu _{2i-1}-\mu _{2i}}\begin{pmatrix} v_i \\[5pt] \mu _{2i-1}v_i \end{pmatrix}\!. \end{equation}

Defining

\begin{equation*} Y=\begin{pmatrix} y_1^*\\[5pt] y_2^*\\[5pt] \vdots \\[5pt] y_{2n}^* \end{pmatrix} \quad \mbox{ and }\quad X=\begin{pmatrix} x_1, x_2, \ldots, x_{2n} \end{pmatrix} \end{equation*}

we see that $X=Y^{-1}$ since $v_1,\ldots,v_n$ are orthonormal. Also it is easy to check that $Y\widetilde H_0 X = \text{diag}(\mu _1,\ldots,\mu _{2n})$ .

3. The bulk distribution: proving Theorem 1.4

We begin by re-stating Theorem 1.4 using the conjugated matrices defined in (2.1).

Theorem 3.1. Let $A$ be the adjacency matrix for an Erdős-Rényi random graph $G(n,p)$ . Assume $0< p\le p_0<1$ for a constant $p_0$ and $np/\log n \to \infty$ with $n$ . Let $\widetilde H$ be the rescaled conjugation of the non-backtracking spectrum operator for $A$ defined in ( 2.1 ), and let $\widetilde H_0$ be its partial derandomization, also defined in ( 2.1 ). Then, $\mu _{\widetilde H} - \mu _{\widetilde H_0}$ converges almost surely (thus, also in probability) to zero as $n$ goes to infinity.

To prove Theorem 3.1, we will show that the bulk distribution of $\widetilde H$ matches that of $\widetilde H_0$ using the replacement principle [Reference Tao and Van34, Theorem 2.1], which we rephrase slightly as a perturbation result below (see Theorem 3.2). First, we give a few definitions that we will use throughout this section. We say that a random variable $X_n \in \mathbb C$ is bounded in probability if

\begin{equation*}\lim _{C\to \infty } \liminf _{n \to \infty } {\mathbb {P}}(\left \lvert {X_n}\right \rvert \le C) = 1\end{equation*}

and we say that $X_n$ is almost surely bounded if

\begin{equation*} {\mathbb {P}}\left ( \limsup _{n\to \infty } \left \lvert { X_n}\right \rvert < \infty \right ) = 1.\end{equation*}

Theorem 3.2 (Replacement principle [Reference Tao and Van34]). Suppose for each $m$ that $M_m$ and $M_m+P_m$ are random $m \times m$ matrices with entries in the complex numbers. Assume that

(3.1)

\begin{equation} \frac{1}{m} \left \|{ M_m }^2\right \|_F + \frac{1}{m} \left \|{M_m+P_m}^2\right \|_F \mbox{ is bounded in probability (resp., almost surely)} \end{equation}

and that, for almost all complex numbers $z \in \mathbb C$ ,

(3.2)

\begin{equation} \frac{1}{m} \log \left \lvert{\rule{0pt}{10pt}\det \left ( M_m+P_m - zI\right ) }\right \rvert - \frac{1}{m} \log \left \lvert{\rule{0pt}{10pt} \det \left (M_m -zI\right )}\right \rvert \end{equation}

converges in probability (resp., almost surely) to zero; in particular, this second condition requires that for almost all $z \in \mathbb C$ , the matrices $M_m+P_m - zI$ and $M_m - zI$ have non-zero determinant with probability $1-o(1)$ (resp., almost surely non-zero for all but finitely many $m$ ).

Then $\mu _{M_m} - \mu _{M_m+P_m}$ converges in probability (resp., almost surely) to zero.

Note that there is no independence assumption anywhere in Theorem 3.2; thus, entries in $P_m$ may depend on entries in $M_m$ and vice versa.

We will use the following corollary of Theorem 3.2, which essentially says that if the perturbation $P_m$ has largest singular value of order less than the smallest singular value for $M_m-zI$ for almost every $z\in \mathbb C$ , then adding the perturbation $P_m$ does not appreciably change the bulk distribution of $M_m$ .

Corollary 3.3. For each $m$ , let $M_m$ and $P_m$ be random $m\times m$ matrices with entries in the complex numbers, and let $f(z,m)\ge 1$ be a real function depending on $z$ and $m$ . Assume that

(3.3)

\begin{equation} \frac{1}{m} \|M_m\|_F^2 + \frac{1}{m}\|M_m+P_m\|_F^2 \mbox{ is bounded in probability (resp., almost surely)}, \end{equation}

and

(3.4)

\begin{equation} f(z,m) \|P_m\| \mbox{ converges in probability (resp., almost surely) to zero}, \end{equation}

and, for almost every complex number $z \in \mathbb C$ ,

(3.5)

\begin{equation} \left \|{\left (M_m-zI\right )^{-1}}\right \| \le f(z,m), \end{equation}

with probability tending to 1 (resp., almost surely for all but finitely many $m$ ).

Then $\mu _{M_m} - \mu _{M_m+P_m}$ converges in probability (resp., almost surely) to zero.

Proof. We will show that the three conditions (3.3), (3.4), and (3.5) of Corollary 3.3 together imply the two conditions needed to apply Theorem 3.2.

First note that (3.3) is identical to the first condition (3.1) of Theorem 3.2. Next, we will show in the remainder of the proof that condition (3.2) of Theorem 3.2 holds by noting that sufficiently small perturbations have a small effect on the singular values, and also the absolute value of the determinant is equal to the product of the singular values.

Let $z$ be a complex number for which (3.5) holds, let $M_m-zI$ have singular values $\sigma _1 \ge \ldots \ge \sigma _m$ , and let $M_m+P_m-zI$ have singular values $\sigma _1+s_1 \ge \sigma _2+s_2 \ge \ldots \ge \sigma _m+s_m$ . We will use the following result, which is sometimes called Weyl’s perturbation theorem for singular values, to show that the $s_i$ are small.

Lemma 3.4 ([Reference Chafaï9, Theorem 1.3]). Let $A$ and $B$ be $m\times n$ real or complex matrices with singular values $\sigma _1(A)\ge \ldots \ge \sigma _{\min \{m, n\}}(A) \ge 0$ and $\sigma _1(B)\ge \ldots \ge \sigma _{\min \{m, n\}}(B) \ge 0$ , respectively. Then

\begin{equation*} \max _{1\le j \le \min \{m, n\}} \left \lvert { \sigma _j(A) -\sigma _j(B) }\right \rvert \le \left \|{A-B}\right \|.\end{equation*}

We then have that

\begin{align*} \max _{1\le i \le m}\left \lvert{s_i}\right \rvert &\le \|P_m\|, \end{align*}

and by (3.5),

\begin{align*} \max _{1\le i \le m}\frac{\left \lvert{s_i}\right \rvert }{\sigma _i} \le f(z,m)\|P_m\| \end{align*}

which converges to zero in probability (resp., almost surely) by (3.4). Thus we know that

\begin{equation*} \left | \log (1+s_i/\sigma _i )\right | \le 2\left \lvert { s_i/\sigma _i}\right \rvert \le 2f(z,m)\|P_m\|, \end{equation*}

where the inequalities hold with probability tending to 1 (resp., almost surely for all sufficiently large $m$ ). Using the fact that the absolute value of the determinant is the product of the singular values, we may write (3.2) as

\begin{align*} \left |\frac{1}{m} \left (\log \prod _{i=1}^m (\sigma _i+s_i)- \log \prod _{i=1}^m\sigma _i\right ) \right | &= \frac{1}{m} \left | \sum _{i=1}^m \log \left ( 1+\frac{s_i}{\sigma _i}\right )\right | \le 2f(z,m)\|P_m\|, \end{align*}

which converges to zero in probability (resp., almost surely) by (3.4). Thus, we have shown that (3.1) and (3.2) hold, which completes the proof.

3.1 Proof of Theorem 3.1

The proof of Theorem 3.1 will follow from Corollary 3.3 combined with lemmas showing that the conditions (3.3), (3.4), and (3.5) of Corollary 3.3 are satisfied. Indeed, Lemma 3.7 verifies (3.3), Lemma 3.9 verifies (3.5) and (3.4) follows by combining Lemma 3.5 and Lemma 3.9. Note that the assumption $np/\log n \to \infty$ in Theorem 3.1 is only needed to prove conditions (3.3) and (3.4). Condition (3.5) in fact follows for any $p$ and for more general matrices – see the proof of Lemma 3.9.

In Corollary 3.3, we will take $M_m$ to be the partly derandomized matrix $\widetilde H_0$ and $P_m$ to be the matrix $E$ [see (2.1)], where we suppress the dependence of $\widetilde H_0$ and $E$ on $n=m/2$ to simplify the notation. There are two interesting features: first, the singular values of $\widetilde H_0$ may be written out explicitly in terms of the eigenvalues of the Hermitian matrix $A$ (which are well-understood; see Lemma 3.9); and second, the matrix $E$ is completely determined by the matrix $\widetilde H_0$ , making this a novel application of the replacement principle (Theorem 3.2 and Corollary 3.3) where the sequence of matrices $\widetilde H_0 + E= \widetilde H$ has some dependencies among the entries.

Lemma 3.5. Assume $0< p \le p_0<1$ for a constant $p_0$ . Further assume $np/\log n\to \infty$ . For $E$ as defined in ( 2.1 ), we have that $\|E\|\le 20 \sqrt{\frac{{\log n}}{np}}$ almost surely for all but finitely many $n$ . In particular, $\|E\|$ converges to zero almost surely for all but finitely many $n$ .

Proof. First, note that ${\mathbb E } D= (n-1)p I = (\alpha +1)I$ and thus

\begin{equation*}E\;:\!=\; \left ( \begin{array}{c@{\quad}c} 0 & I+\frac {1}{\alpha }(I-D) \\[5pt] 0 & 0 \end {array} \right )= \left ( \begin {array}{c@{\quad}c} 0 & \frac {1}{\alpha }({\mathbb E } D-D) \\[5pt] 0 & 0 \end {array} \right ).\end{equation*}

Since ${\mathbb E } D-D$ is a diagonal matrix, it is easy to check that

\begin{equation*}\|E\|=\|E\|_{\max } = \frac {1}{\alpha } \|D-{\mathbb E } D\|_{\max }=\frac {1}{\alpha }\max _{1\le i \le n} |D_{ii}- {\mathbb E } D_{ii}|=\frac {1}{\alpha }\max _{1\le i \le n} |D_{ii}- (n-1)p|.\end{equation*}

Note that $D_{ii}$ ’s have the same distribution. By the union bound, it follows that for any $s>0$ ,

\begin{equation*}{\mathbb {P}}(\|E\| \ge s) \le n {\mathbb {P}}\left (\frac {1}{\alpha } |D_{11}- (n-1)p| \ge s \right )=n {\mathbb {P}}\left (\frac {1}{\alpha } \Big |\sum _{j=2}^n a_{1j}- (n-1)p \Big | \ge s \right ).\end{equation*}

Next we will apply the following general form of Chernoff bound.

Theorem 3.6 (Chernoff bound [Reference Chernoff10]). Assume $\xi _1,\ldots,\xi _n$ are iid random variables and $\xi _i\in [0,1]$ for all $i$ . Let $p={\mathbb E } \xi _i$ and $S_n = \sum _{i=1}^n \xi _i$ , then for any $\varepsilon >0$ ,

\begin{align*} &{\mathbb{P}}(S_n - np \ge n\varepsilon ) \le \exp \left (-\mbox{RE}(p+\varepsilon || p) n \right );\\[5pt] &{\mathbb{P}}(S_n - np \le -n\varepsilon ) \le \exp \left (-\mbox{RE}(p-\varepsilon || p) n \right ) \end{align*}

where $\mbox{RE}(p||q)= p\log (\frac{p}{q}) + (1-p) \log (\frac{1-p}{1-q})$ is the relative entropy or Kullback-Leibler divergence.

By our assumption, $np=\omega (n)\log n$ where $\omega (n)$ is a positive function that tends to infinity with $n$ . Now take $K=(n-1)p+npt$ where $t=t(n)=10 \sqrt{\frac{\log n}{np}}$ (say). Our assumption $np/\log n\to \infty$ implies $t\to 0$ with $n$ . Thus

\begin{align*}{\mathbb{P}} \left (\sum _{j=2}^n a_{1j} \ge K \right ) ={\mathbb{P}} \left (\sum _{j=2}^n a_{1j} -(n-1)p \ge npt \right ) \le \exp (-\mbox{RE}(p+pt|| p) n) \end{align*}

where

\begin{align*} \mbox{RE}\left (p+pt ||p \right )&= p(1+t)\log (1+t) + (1-p-pt)\log \left (\frac{1-p-pt}{1-p} \right )\\[5pt] &=p(1+t)\log (1+t) - (1-p-pt )\log \left (1+\frac{pt}{1-p-pt} \right )\\[5pt] & > p(1+t)(t-t^2/2) - pt =pt^2(1-t)/2 \end{align*}

by the elementary inequalities $x-x^2/2<\log (1+x)< x$ for $x>0$ .

Therefore, for $n$ sufficiently large, taking $t=5 \sqrt{\frac{\log n}{np}}$ , we get

\begin{equation*}{\mathbb {P}} \left (\sum _{j=2}^n a_{1j} \ge (n-1)p+npt \right ) \le \exp \left (-\frac {np t^2(1-t)}{2} \right )\le \exp (-10\log n)= n^{-10}.\end{equation*}

Similarly, take $L=(n-1)p-npt$ where $t=5 \sqrt{\frac{\log n}{np}}$ . Applying the Chernoff bound yields that

\begin{align*}{\mathbb{P}} \left (\sum _{j=2}^n a_{1j} \le L \right ) ={\mathbb{P}} \left (\sum _{j=1}^n a_{1j} -(n-1)p \le -npt \right ) \le \exp \left (-\mbox{RE}(p-pt|| p) n \right ). \end{align*}

We take $n$ sufficiently large such that $t=t(n)<0.01$ (say). Then

\begin{align*} \mbox{RE}(p-pt ||p)&= p(1-t)\log (1-t) + (1-p+pt)\log \left (\frac{1-p+pt}{1-p} \right )\\[5pt] &=p(1-t)\log (1-t) - (1-p+pt )\log \left (1-\frac{pt}{1-p+pt} \right )\\[5pt] & > p(1-t)(-t-\frac{3}{5}t^2) + pt = \frac{1}{5}pt^2(2+3t) \ge \frac{2}{5}pt^2 \end{align*}

where we use the fact that $\log (1-x)<-x$ for $x\in (0,1)$ and $\log (1-x)>-x-\frac{3}{5}x^2$ for $x\in (0,0.01)$ . Hence, we get

(3.6)

\begin{align} {\mathbb{P}}\left (\sum _{j=2}^n a_{1j} \le (n-1)p-npt \right ) \le \exp \left ( - \frac{2}{5}pt^2\right )=\exp (-10\log n) = n^{-10}. \end{align}

Since $2\alpha t = 2((n-1)p-1) t \ge npt$ for $n$ sufficiently large, it follows that

\begin{align*} &{\mathbb{P}}\left (\|E\| \ge 10 \sqrt{\frac{\log n}{np}} \right ) \le n{\mathbb{P}} \left (\left |\sum _{j=2}^n a_{1j}-(n-1)p \right | \ge 2\alpha t \right ) \\[5pt] &\le n{\mathbb{P}}\left (\sum _{j=2}^n a_{1j}\ge (n-1)p + npt \right ) +n{\mathbb{P}}\left (\sum _{j=2}^n a_{1j}\le (n-1)p - npt \right ) \le 2n^{-10}. \end{align*}

By the Borel-Cantelli lemma, we have that $\|E\| \le 10 \sqrt{\frac{\log n}{np}}$ almost surely for all but finitely many $n$ .

To show (3.3), we combine Hoeffding’s inequality and Lemma 3.5 to prove the following lemma.

Lemma 3.7. Assume $0< p \le p_0<1$ for a constant $p_0$ . Further assume $np/\log n\to \infty$ . For $\widetilde H_0$ and $E$ as defined in ( 2.1 ), we have that both $\frac{1}{2n} \|\widetilde H_0\|_F^2$ and $\frac{1}{2n} \|\widetilde H_0+E\|_F^2$ are almost surely bounded.

Proof. We begin by stating Hoeffding’s inequality [Reference Hoeffding20].

Theorem 3.8 (Hoeffding’s inequality [Reference Hoeffding20]). Let $\beta _1,\ldots,\beta _k$ be independent random variables such that for $1\le i \le k$ we have ${\mathbb{P}}(\beta _i \in [a_i,b_i]) = 1.$ Let $S\;:\!=\; \sum _{i=1}^k \beta _i$ . Then for any real $t$ ,

\begin{equation*}{\mathbb {P}}(\left \lvert {S- {\mathbb E }(S) }\right \rvert \ge kt ) \le 2\exp \left ( - \frac {2 k^2 t^2}{\sum _{i=1}^k (b_i -a_i)^2}\right ).\end{equation*}

Recall that $\alpha =(n-1)p-1$ and $\widetilde H_0= \left ( \begin{matrix} \frac{1}{\sqrt{\alpha }} A & -I \\[5pt] I & 0 \end{matrix} \right )$ , where $A=(a_{ij})_{1\le i, j \le n}$ is the adjacency matrix of an Erdős-Rényi random graph $G(n,p)$ . Thus

\begin{align*} \|\widetilde H_0\|_F^2 &= \frac{1}{\alpha }\|A\|_F^2 + 2\|I\|_F^2 = \frac{1}{\alpha }\sum _{i,j} a_{ij}^2 + 2n =\frac{2}{\alpha }\sum _{i < j} a_{ij} + 2n. \end{align*}

To apply Hoeffding’s inequality, note that $a_{ij}$ $(i<j)$ are iid random variables each taking the value 1 with probability $p$ and 0 otherwise. Let $b_i=1$ and $a_i=0$ for all $i$ , and let $k=\left(\substack{n\\[2pt] 2}\right)$ , which is the number of random entries in $A$ (recall that the diagonal of $A$ is all zeros by assumption). Letting $S=\sum _{i<j} a_{ij}$ , we see that $\mathbb E S = kp$ and so

\begin{align*}{\mathbb{P}}\left (\rule{0pt}{12pt}\left \lvert{ S - kp }\right \rvert \ge kt\right ) \le 2 \exp (-2kt^2). \end{align*}

Since $\|\widetilde H_0\|_F^2 = \frac{2}{\alpha } S + 2n$ , we obtain that

\begin{align*}{\mathbb{P}}\left (\rule{0pt}{12pt}\left \lvert{ \frac{1}{2n}\|\tilde{H}_0\|_F^2 - 1 -\frac{kp}{n\alpha }}\right \rvert \ge \frac{kt}{n\alpha }\right ) \le 2 \exp (-2kt^2). \end{align*}

Take $t=p$ . For $n$ sufficiently large,

\begin{equation*}\frac {kt}{n\alpha } =\frac {n(n-1)p/2}{n[(n-1)p-1]}\le \frac {n(n-1)p/2}{n(n-1)p/2}=1\end{equation*}

and since $p\ge \omega (n) \log n/n$ for $\omega (n)>0$ and $\omega (n)\to \infty$ with $n$ , we get

\begin{align*}{\mathbb{P}}\left (\rule{0pt}{12pt} \frac{1}{2n}\|\tilde{H}_0\|_F^2 \ge 3\right ) \le 2 \exp (-2kt^2)\le 2\exp (-\omega (n)^2 \log ^2 n/2). \end{align*}

By the Borel-Cantelli lemma, we conclude that $\frac{1}{2n}\|\tilde{H}_0\|_F^2$ is bounded almost surely. Since $\|E\|_{\max }=\|E\|$ , by triangle inequality, we see

\begin{align*} \frac{1}{2n} \|\widetilde H_0+E\|_F^2 &\le \frac{1}{2n} (\|\tilde{H}_0\|_F + \|E\|_F)^2\le \frac{1}{n} \|\tilde{H}_0\|_F^2 + \frac{1}{n} \|E\|_F^2 \\[5pt] &\le \frac{1}{n} \|\tilde{H}_0\|_F^2 + \|E\|. \end{align*}

By Lemma 3.5, we get $\frac{1}{2n} \|\widetilde H_0+E\|_F^2$ is bounded almost surely. This completes the proof.

The last part of proving Theorem 3.1 by way of Corollary 3.3 is proving that (3.5) holds with $M_m=\widetilde H_0$ and $f(z,m)=C_z$ , a constant depending only on $z$ , as given below in Lemma 3.9. Note that (3.4) follows by combining Lemma 3.5 and Lemma 3.9. The following lemma will be proved by writing a formula for the singular values of $\widetilde H_0$ in terms of the eigenvalues of the adjacency matrix $A$ , which are well-understood. A number of elementary technical details will be needed to prove that the smallest singular value is bounded away from zero, and these appear in Lemma 3.10.

Lemma 3.9. Let $\widetilde H_0$ be as defined in ( 2.1 ) and let $z$ be a complex number such that ${\mathsf{Im}}(z) \ne 0$ and $\left \lvert{z}\right \rvert \ne 1$ (note that these conditions exclude a set of complex numbers of Lebesgue measure zero). Then there exists a constant $C_z$ depending only on $z$ such that $\left \|{ (\widetilde H_0 -zI)^{-1}}\right \| \le C_z$ with probability 1 for all but finitely many $n$ .

Proof. We will compute all the singular values of $\widetilde H_0 -zI$ , showing that they are bounded away from zero by a constant depending on $z$ . The proof does not use randomness and depends only on facts about the determinant and singular values and on the structure of $\widetilde H_0$ ; in fact, the proof is the same if $\widetilde H_0$ is replaced with any matrix $\begin{pmatrix} M& -I \\[5pt] I & 0 \end{pmatrix}$ with $M$ Hermitian.

To find the singular values of $\widetilde H_0$ we will compute the characteristic polynomial $\chi (\tilde w)$ for $(\widetilde H_0 -zI)(\widetilde H_0 -zI)^*$ , using the definition of $\widetilde H_0$ in (2.1), and assuming that $\tilde w= w + 1+{\left \lvert{z}\right \rvert }^2$ ; thus,

\begin{eqnarray*} \chi (\tilde w)&\;:\!=\;\det \left ((\widetilde H_0 -zI)(\widetilde H_0 -zI)^* - (w+1+\left \lvert{z}\right \rvert ^2)I\right ) \\[5pt] &=\det \begin{pmatrix} \frac{A^2}{\alpha } - (z+\bar z) \frac{A}{\sqrt{\alpha }} - wI\;\;\;\;\;\; & \frac{A}{\sqrt{\alpha }} +(\bar z -z) I\\[5pt] \frac{A}{\sqrt{\alpha }} + (z - \bar z)I & -wI \end{pmatrix}. \end{eqnarray*}

We can use the fact that if $\begin{pmatrix}X & Y \\[5pt] Z & W\end{pmatrix}$ is a matrix composed of four $n\times n$ square blocks where $W$ and $Z$ commute, then $\det \begin{pmatrix}X & Y \\[5pt] Z & W\end{pmatrix} = \det (XW-YZ)$ (see [Reference Silvester33, Theorem 3]). Thus, it is equivalent to consider

\begin{align*} \det \left ( w\left (\frac{A^2}{\alpha } - (z+\bar z) \frac{A}{\sqrt{\alpha }} -w I\right ) + \left (\frac{A}{\sqrt{\alpha }} +(\bar z -z) I\right )\left (\frac{A}{\sqrt{\alpha }} + (z - \bar z)I\right ) \right ). \end{align*}

Because $\frac{A}{\sqrt{\alpha }}$ is Hermitian, it can be diagonalized to $L={\textrm{diag}}(\lambda _1,\ldots,\lambda _n)$ , and thus the above determinant becomes:

\begin{align*} &\det \left ( w\left (\frac{A^2}{\alpha } - (z+\bar z) \frac{A}{\sqrt{\alpha }} -w I\right ) + \left (\frac{A}{\sqrt{\alpha }} +(\bar z -z) I\right )\left (\frac{A}{\sqrt{\alpha }} + (z - \bar z)I\right ) \right )\\[5pt] &\quad = \det \left ( w\left (L^2 - (z+\bar z) L -w I\right ) + \left (L +(\bar z -z) I\right )\left ( L + (z - \bar z)I\right ) \right )\\[5pt] &\quad = \prod _{i=1}^n \left (w \left (\lambda _i^2 - (z+\bar z) \lambda _i -w \right )+ \left ( \lambda _i + (z - \bar z)\right )\left (\lambda _i +(\bar z -z) \right ) \right )\\[5pt] &\quad = \prod _{i=1}^n \left ( -w^2 + w\left ( \lambda _i^2 - (z+\bar z) \lambda _i \right ) + \lambda _i^2-(z-\bar z)^2 \right ). \end{align*}

The quadratic factors can then be explicitly factored, showing that each $\lambda _i$ generates two singular values for $\widetilde H_0 -zI$ , each being the positive square root of

\begin{equation*} 1+\left \lvert {z}\right \rvert ^2+\frac 12\left ( \lambda _i^2 - (z+\bar z)\lambda _i\right ) \pm \frac 12 \sqrt { \left ( \lambda _i^2 - (z+\bar z) \lambda _i \right )^2+4(\lambda _i^2-(z-\bar z)^2) }. \end{equation*}

The proof of Lemma 3.9 is thus completed by Lemma 3.10 (stated and proved below), which shows that the quantity above is bounded from below by a positive constant depending only on $z$ .

Lemma 3.10. Let $z$ be a complex number satisfying ${\mathsf{Im}}(z) \ne 0$ and $\left \lvert{z}\right \rvert \ne 1$ . Then for any real number $\lambda$ , we have that

(3.7)

\begin{equation} 1+\left \lvert{z}\right \rvert ^2+\frac 12\left ( \lambda ^2 - (z+\bar z)\lambda \right ) \pm \frac 12 \sqrt{ \left ( \lambda ^2 - (z+\bar z) \lambda \right )^2+4(\lambda ^2-(z-\bar z)^2) } \ge C_z, \end{equation}

where $C_z$ is a positive real constant depending only on $z$ .

The proof of Lemma 3.10 is given in the appendixFootnote ¹ using elementary calculus, facts about matrices, and case analysis. Lemma 3.10 completes the proof of Lemma 3.5 and thus of Theorem 3.1.

4. Perturbation theory: proving Theorem 1.6

In this section, we study the eigenvalues of $H$ via perturbation theory. Recall from that the discussion in the beginning of Section 2 that $\widetilde H$ in (2.1) has the same eigenvalues as $H/\sqrt{\alpha }$ . We consider $\widetilde H = \widetilde H_0 + E$ where $E= \left ( \begin{array}{c@{\quad}c} 0 & I+\frac{1}{\alpha }(I-D) \\[5pt] 0 & 0 \end{array} \right )\!.$ Note that $H_0/\sqrt{\alpha }$ and $\widetilde H_0$ also have identical eigenvalues.

Let us begin by defining the spectral separation of matrices. Denote the eigenvalues of a matrix $M$ by $\eta _i(M)$ ’s. The spectral variation of $M+E$ with respect to $M$ is defined by

\begin{equation*}S_{M}(M + E) = \max _{j} \min _i |\eta _j(M+E) - \eta _i (M)|.\end{equation*}

Theorem 4.1 (Bauer-Fike theorem; see Theorem 6 from [Reference Bordenave, Lelarge and Massoulié8]). If $H_0$ is diagonalizable by the matrix $Y$ , then

\begin{equation*}S_{H_0}(H_0 + E) \le \left \|{E}\right \| \cdot \left \|{Y}\right \| \cdot \|Y^{-1}\|.\end{equation*}

Denote by $\mathcal{C}_i\;:\!=\;\mathcal{B}(\mu _i(H_0), R)$ the ball in $\mathbb{C}$ centred at $\mu _i(H_0)$ with radius $R=\left \|{E}\right \| \cdot \left \|{Y}\right \| \cdot \|Y^{-1}\|$ . Let $\mathcal{I}$ be a set of indices such that

\begin{equation*}(\cup _{i\in \mathcal {I}} \mathcal {C}_i) \cap (\cup _{i\notin \mathcal {I}} \mathcal {C}_i) = \emptyset .\end{equation*}

Then the number of eigenvalues of $H_0 + E$ in $\cup _{i\in \mathcal{I}} \mathcal{C}_i$ is exactly $|\mathcal{I}|$ .

We will bound the operator norm of $E$ and the condition number $\left \|{Y}\right \| \left \|{Y^{-1}}\right \|$ of $Y$ to prove Theorem 1.6.

By Lemma 3.5, we know that $\left \|{E}\right \| \le 20\sqrt{\frac{\log n}{np}}$ with probability 1 for all but finitely many $n$ .

To bound the condition number of $Y$ , we note that the square of the condition number of $Y$ is equal to the largest eigenvalue of $YY^*$ divided by the smallest eigenvalue of $YY^*$ . Using the explicit definition of $Y$ from (2.6), we see from the fact that the $v_i$ are orthonormal that

\begin{align*} YY^*={\textrm{diag}}(Y_1,\ldots,Y_n) \end{align*}

where $Y_i$ ’s are $2\times 2$ block matrices of the following form

\begin{equation*} Y_i=\begin {pmatrix} y_{2i-1}^* y_{2i-1} & y_{2i-1}^* y_{2i}\\[5pt] y_{2i}^* y_{2i-1} & y_{2i}^* y_{2i} \end {pmatrix}. \end{equation*}

Recall that

\begin{equation*} y_{2i-1}^*=\frac {1}{\sqrt {1+|\mu _{2i-1}|^2}}\begin {pmatrix} -\mu _{2i-1} v_i^T & v_i^T\\[5pt] \end {pmatrix} \quad \mbox { and }\quad y_{2i}^*=\frac {1}{\sqrt {1+|\mu _{2i}|^2}}\begin {pmatrix} -\mu _{2i}v_i^T & v_i^T \end {pmatrix} \end{equation*}

We then have $ Y_i=\begin{pmatrix} 1 & \gamma _i\\[5pt] \overline{\gamma }_i & 1 \end{pmatrix}$ where

\begin{equation*}\gamma _i\;:\!=\; \frac {\mu _{2i-1}\overline {\mu }_{2i}+1}{\sqrt {(1+|\mu _{2i-1}|^2)(1+|\mu _{2i}|^2)}}.\end{equation*}

It is easy to check that the eigenvalues of $Y_i$ are $1\pm |\gamma _i|$ . The eigenvalues of $YY^*$ are the union of all the eigenvalues of the blocks, and so we will compute the eigenvalues $1\pm |\gamma _i|$ based on whether $\lambda _i$ produced real or complex eigenvalues for $\widetilde H_0$ .

For $i=1$ , the eigenvalue $\lambda _1$ produces two real eigenvalues for $\widetilde H_0$ . Using the facts that $\mu _{1}\mu _{2}=1$ and $\mu _{1}+ \mu _{2}=\lambda _1$ , which together imply that $\mu _{1}^2+\mu _{2}^2 = \lambda _1^2-2$ , we see that in this case $\gamma _1^2=\frac{4}{\lambda _1^2}$ , and so the two eigenvalues corresponding to this block are $1\pm |\gamma _1|=1 \pm 2/\left \lvert{\lambda _1}\right \rvert$ . By (2.4), we see that $1\pm |\gamma _i| = 1\pm \frac{2}{\sqrt{np}}(1+o(1))$ with probability $1-o(1)$ .

For $i\ge 2$ , the eigenvalue $\lambda _i$ produces two complex eigenvalues for $\widetilde H_0$ , both with absolute value 1 (see Section 2). In this case, $\gamma _i = \frac{1+\mu _{2i-1}^2}{2}$ . Again using the facts that $\mu _{2i-1}\mu _{2i}=1$ and $\mu _{2i-1}^2+\mu _{2i}^2 = \lambda _i^2-2$ , we see that $\overline \gamma _i \gamma _i = \lambda _i^2/4$ , which shows that the two eigenvalues corresponding to this block are $1 \pm \left \lvert{\lambda _i}\right \rvert/{2}$ .

By [Reference Vu36] (see Theorem 2.1 in Section 2) we know that when $p\ge \frac{\log ^{2/3+\varepsilon } n}{n^{1/6}}$ , $\max _{2\le i\le n} \left \lvert{\lambda _i}\right \rvert \le 2 \sqrt{1-p} + O(n^{1/4}\log n/\sqrt{np})$ with probability tending to 1, and thus the largest and smallest eigenvalues coming from any of the blocks corresponding to $i\ge 2$ are $1 + \sqrt{1-p} +O(n^{1/4}\log n/\sqrt{np})$ and $1-\sqrt{1-p}+ O(n^{1/4}\log n/\sqrt{np})$ with probability tending to 1. Combining this information with the previous paragraph, we see that the condition number for $Y$ is

\begin{align*} &\sqrt{\frac{1+\sqrt{1-p}+O(n^{1/4}\log n/\sqrt{np})}{1-\sqrt{1-p}+O(n^{1/4}\log n/\sqrt{np})}}=\sqrt{\frac{(1+\sqrt{1-p})^2 + O(n^{-1/4}p^{-1/2}\log n)}{p+ O(n^{-1/4}p^{-1/2}\log n)}}\\[5pt] &=\sqrt{\frac{2}{p} \frac{1+\sqrt{1-p} -p/2 +O(n^{-1/4}p^{-1/2}\log n)}{1+O(n^{-1/4}p^{-3/2}\log n)}} \\[5pt] & = \sqrt{\frac{2}{p} \left ( (1+\sqrt{1-p}) + O(n^{-1/4}p^{-3/2}\log n) \right )} \le \frac{2}{\sqrt{p}} \end{align*}

for $n$ sufficiently large. In the third equation above, we use the Taylor expansion. In the last inequality, we use that $n^{-1/4}p^{-3/2}\log n \le \log ^{-3\varepsilon/2} n=o(1)$ since $p\ge \frac{\log ^{2/3+\varepsilon } n}{n^{1/6}}$ .

Finally, we apply Lemma 3.5 and Bauer-Fike Theorem (Theorem 4.1) with $R=\frac{40}{\sqrt{p}}\sqrt{\frac{\log n}{np}}=40\sqrt{\frac{\log n}{np^2}}$ to complete the proof.

Acknowledgements

The first author would like to thank Zhigang Bao for useful discussions. The authors would like to thank the anonymous referees for their valuable suggestions that have enhanced the readability of this paper.

Footnotes

Ke Wang was supported in part by Hong Kong RGC grants GRF 16308219, GRF 16304222, and ECS 26304920. Philip Matchett Wood was supported in part by NSA grant H98230-16-1-0301.

¹ The appendix is available on the arXiv version.

References

Alt, J., Ducatez, R. and Knowles, A. (2022) The completely delocalized region of the Erdős-Rényi graph. Electron. Commun. Probab. 27 1–9.CrossRef Google Scholar

Angel, O., Friedman, J. and Hoory, S. (2015) The non-backtracking spectrum of the universal cover of a graph. Trans. Am. Math. Soc. 367(6) 4287–4318.CrossRef Google Scholar

Arnold, L. (1967) On the asymptotic distribution of the eigenvalues of random matrices. J. Math. Anal. Appl. 20(2) 262–268.CrossRef Google Scholar

Bass, H. (1992) The ihara-selberg zeta function of a tree lattice. Int. J. Math. 3(06) 717–797.CrossRef Google Scholar

Benaych-Georges, F., Bordenave, C. and Knowles, A. (2020) Spectral radii of sparse random matrices. Ann. Inst. Henri Poincaré Probab. Stat. 56(3) 2141–2161.CrossRef Google Scholar

Benaych-Georges, F. and Knowles, A. (2016) Lectures on the local semicircle law for Wigner matrices. arXiv preprint arXiv:1601.04055.Google Scholar

Bordenave, C. (2015) A new proof of Friedman’s second eigenvalue Theorem and its extension to random lifts. Ann. Sci. Éc. Norm. Supér. 53(6) 1393–1439.CrossRef Google Scholar

Bordenave, C., Lelarge, M. and Massoulié, L. (2018) Non-backtracking spectrum of random graphs: community detection and non-regular ramanujan graphs. Ann. Probab. 46(1) 1–71.CrossRef Google Scholar

Chafaï, D. (2009) Singular values of random matrices. In: Lecture Notes. https://djalil.chafai.net/docs/sing.pdf Google Scholar

Chernoff, H. (1952) A measure of asymptotic efficiency for tests of a hypothesis based on the sum of observations. Ann. Math. Stat. 23(4) 493–507.CrossRef Google Scholar

Coste, S. and Zhu, Y. (2021) Eigenvalues of the non-backtracking operator detached from the bulk. Random Matrices Theory Appl. 10(03) 2150028.CrossRef Google Scholar

Erdős, L., Knowles, A., Yau, H.-T. and Yin, J. (2013) Spectral statistics of Erdős-Rényi graphs I: local semicircle law. Ann. Probab. 41(3B) 2279–2375.CrossRef Google Scholar

Füredi, Z. and Komlós, J. (1981) The eigenvalues of random symmetric matrices. Combinatorica 1(3) 233–241.CrossRef Google Scholar

He, Y., Knowles, A. and Marcozzi, M. (2019) Local law and complete eigenvector delocalization for supercritical Erdős–Rényi graphs. Ann. Probab. 47(5) 3278–3302.CrossRef Google Scholar

Gulikers, L., Lelarge, M. and Massoulié, L. (2017) Non-backtracking spectrum of degree-corrected stochastic block models. In ITCS 2017-8th Innovations in Theoretical Computer Science, pp. 1–52.Google Scholar

Grenander, U. (1963) Probabilities on Algebraic Structures. Wiley.Google Scholar

Hashimoto, K. (1989) Zeta functions of finite graphs and representations of p-adic groups. In Automorphic Forms and Geometry of Arithmetic Varieties, Vol. 15 of Advanced Studies in Pure Mathematics, Academic Press, pp. 211–280.CrossRef Google Scholar

He, Y. and Knowles, A. (2021) Fluctuations of extreme eigenvalues of sparse Erdős–Rényi graphs. Probab. Theory Relat. Fields 180 985–1056.CrossRef Google Scholar PubMed

Holland, P. W., Laskey, K. B. and Leinhardt, S. (1983) Stochastic blockmodels: first steps. Soc. Netw. 5(2) 109–137.CrossRef Google Scholar

Hoeffding, W. (1963) Probability inequalities for sums of bounded random variables. J. Am. Stat. Assoc. 58 13–30.CrossRef Google Scholar

Huang, J., Landon, B. and Yau, H.-T. (2020) Transition from Tracy–Widom to Gaussian fluctuations of extremal eigenvalues of sparse Erdős–Rényi graphs. Ann. Probab. 48(2) 916–962.CrossRef Google Scholar

Ivchenko, G. I. (1973) On the asymptotic behavior of degrees of vertices in a random graph. Theory Probab. Appl. 18(1) 188–195.CrossRef Google Scholar

Krzakala, F., Moore, C., Mossel, E., et al. (2013) Spectral redemption in clustering sparse networks. Proc. Natl. Acad. Sci. USA 110(52) 20935–20940.CrossRef Google Scholar PubMed

Karrer, B. and Newman, M. E. J. (2011) Stochastic blockmodels and community structure in networks. Phys. Rev. E 83(1) 016107,–10.CrossRef Google Scholar PubMed

Khorunzhy, A. M. and Pastur, L. A. (1993) Limits of infinite interaction radius, dimensionality and the number of components for random operators with off-diagonal randomness. Commun. Math. Phys. 153(3) 605–646.CrossRef Google Scholar

Krivelevich, M. and Sudakov, B. (2003) The largest eigenvalue of sparse random graphs. Comb. Probab. Comput. 12(1) 61–72.10.1017/S0963548302005424CrossRef Google Scholar

Lee, J. O. and Schnelli, K. (2018) Local law and Tracy–Widom limit for sparse random matrices. Probab. Theory Relat. Fields 171(1) 543–616.CrossRef Google Scholar

Latała, R., van Handel, R. and Youssef, P. (2018) The dimension-free structure of nonhomogeneous random matrices. Invent. Math. 214(3) 1031–1080.CrossRef Google Scholar

Massoulié, L. (2014) Community detection thresholds and the weak Ramanujan property. In STOC’14—Proceedings of the 2014 ACM Symposium on Theory of Computing. ACM, pp. 694–703.Google Scholar

Mossel, E., Neeman, J. and Sly, A. (2018) A proof of the block model threshold conjecture. Combinatorica 38(3) 665–708.CrossRef Google Scholar

Mossel, E., Neeman, J. and Sly, A. (2015) Reconstruction and estimation in the planted partition model. Probab. Theory Relat. Fields 162(3-4) 431–461.CrossRef Google Scholar

Stephan, L. and Massoulié, L. (2022) Non-backtracking spectra of weighted inhomogeneous random graphs. Math. Stat. Learn. 5(3) 201–271.CrossRef Google Scholar

Silvester, J. R. (2000) Determinants of block matrices. Math. Gaz. 84(501) 460–467.CrossRef Google Scholar

Tao, T. and Van, V. (2010) Random matrices: universality of ESDs and the circular law. Ann. Probab. 38(5) 2023–2065. With an appendix by Manjunath Krishnapur.CrossRef Google Scholar

Tran, L. V., Van, H. V. and Wang, K. (2013) Sparse random graphs: eigenvalues and eigenvectors. Random Struct. Algorithms 42(1) 110–134.CrossRef Google Scholar

Vu, V. (2007) Spectral norm of random matrices. Combinatorica 27(6) 721–736.CrossRef Google Scholar

Vu, V. (2008) Random Discrete Matrices. Springer, pp. 257–280.Google Scholar

Wigner, E. P. (1955) Characteristic vectors of bordered matrices with infinite dimensions. Ann. Math. 62(2) 548–564.CrossRef Google Scholar

Wigner, E. P. (1958) On the distribution of the roots of certain symmetric matrices. Ann. Math. 67(2) 325–327.CrossRef Google Scholar

Wood, P. M. (2012) Universality and the circular law for sparse random matrices. Ann. Appl. Probab. 22(3) 1266–1300.CrossRef Google Scholar

Wood, P. M. (2016) Universality of the ESD for a fixed matrix plus small random noise: a stability approach. Ann. Inst. Henri Poincaré Probab. Stat. 52(4) 1877–1896.CrossRef Google Scholar

Figure 1. The eigenvalues of $H/\sqrt{\alpha }$ defined in (1.1) and $H_0/\sqrt{\alpha }$ defined in (1.2) for a sample of $G(n,p)$ with $n=500$ and different values of $p$. The blue circles are the eigenvalues of $H/\sqrt{\alpha }$, and the red x’s are for $H_0/\sqrt{\alpha }$. For comparison, the black dashed line is the unit circle. For the figures from top to bottom and from left to right, the values of $p$ are taken to be $p=0.5, p=0.1, p=0.08$ and $p=0.05$, respectively.

Article contents

Limiting empirical spectral distribution for the non-backtracking matrix of an Erdős-Rényi random graph

Abstract

Keywords

MSC classification

Information

1. Introduction

1.1 Results

1.2 Outline

1.3 Background definitions

2. The spectrum of $H_0$

2.1 Spectrum of $\widetilde H_0$ : proof of Proposition 1.2

2.2 Real eigenvalues of $\widetilde H_0$ when $p\le n^{-1/2}$

2.3 $\widetilde H_0$ is diagonalizable

3. The bulk distribution: proving Theorem 1.4

3.1 Proof of Theorem 3.1

4. Perturbation theory: proving Theorem 1.6

Acknowledgements

Footnotes

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests