Counting independent sets in structured graphs

Matija Bucić; Maria Chudnovsky; Julien Codsi

doi:10.1017/S0963548325000124

Part of: Graph theory

Published online by Cambridge University Press: 07 July 2025

Matija Bucić*: Affiliation:
School of Mathematics, Institute for Advanced Study and Department of Mathematics, Princeton University, Princeton, USA
Maria Chudnovsky: Affiliation:
School of Mathematics, Institute for Advanced Study and Department of Mathematics, Princeton University, Princeton, USA
Julien Codsi: Affiliation:
School of Mathematics, Institute for Advanced Study and Department of Mathematics, Princeton University, Princeton, USA
*: Corresponding author: Matija Bucić; Email: mb5225@princeton.edu

Abstract

Counting independent sets in graphs and hypergraphs under a variety of restrictions is a classical question with a long history. It is the subject of the celebrated container method, which has found numerous spectacular applications over the years. We consider the question of how many independent sets we can have in a graph under structural restrictions. We show that any $n$-vertex graph with independence number $\alpha$ without $bK_a$ as an induced subgraph has at most $n^{O(1)} \cdot \alpha ^{O(\alpha )}$ independent sets. This substantially improves the trivial upper bound of $n^{\alpha }$ whenever $\alpha \le n^{o(1)}$, and gives a characterisation of the graphs whose exclusion allows for such an improvement. It is also in general tight up to a constant in the exponent, since there exist triangle-free graphs with $\alpha ^{\Omega (\alpha )}$ independent sets. We also prove that if one in addition assumes the ground graph is chi-bounded, one can improve the bound to $n^{O(1)} \cdot 2^{O(\alpha )}$, which is tight up to a constant factor in the exponent.

Keywords

Counting independent sets; container method; dependent random choice

MSC classification

Primary: 05C75: Structural characterization of families of graphs

Secondary: 05C85: Graph algorithms

Information

Type: Paper
Information: Combinatorics, Probability and Computing, Volume 34, Issue 5, September 2025, pp. 625-634

DOI: https://doi.org/10.1017/S0963548325000124
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives licence (https://creativecommons.org/licenses/by-nc-nd/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is unaltered and is properly cited. The written permission of Cambridge University Press must be obtained for commercial re-use or in order to create a derivative work.
Copyright: © The Author(s), 2025. Published by Cambridge University Press

1. Introduction

Problems involving counting independent sets in graphs and hypergraphs have a long history. They have been studied both due to their intrinsic interest and since one can encapsulate many natural questions in terms of counting independent sets in an appropriate (hyper)graph, see e.g. a recent survey [Reference Samotij27] for many examples and a detailed history of such questions. Let us denote by $\alpha (G)$ the maximum size of an independent set and by $i(G)$ the number of independent sets in a graph $G$ . There are two trivial bounds which often serve as a baseline for more involved arguments.

(1)

\begin{equation} 2^{\alpha (G)} \le i(G) \le \sum _{j=0}^{\alpha (G)} \binom {n}{j}. \end{equation}

The lower bound follows since all subsets of a maximum-size independent set are themselves independent, and the upper bound simply accounts for all subsets of size up to $\alpha (G)$. Both of these bounds can be tight, for example, if $G$ is an empty or complete (hyper)graph, respectively. Note also that if $\alpha (G)=\Omega (n)$ the upper bound is also exponential in $\alpha (G)$, so the two bounds match up to a constant in the exponent. Consequently, we will be mostly interested in the regime where $\alpha$ is somewhat small compared to $n$, in which case one can approximate the upper bound on the right by $\left (\frac {n}{\alpha (G)}\right )^{\alpha (G)}$ or, even more loosely, by just $n^{\alpha (G)}$.
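As a small sanity check of the two bounds in (1), the following brute-force sketch enumerates the independent sets of a tiny graph and compares the count against both sides. The function names and the edge-list representation are our own illustrative choices, and the enumeration is exponential in $n$, so it is only meant for very small examples.

```python
from itertools import combinations
from math import comb

def independent_sets(n, edges):
    """Return all independent sets of the graph on vertices 0..n-1 (brute force)."""
    edge_set = {frozenset(e) for e in edges}
    found = []
    for k in range(n + 1):
        for s in combinations(range(n), k):
            # s is independent if no pair inside it is an edge
            if all(frozenset(p) not in edge_set for p in combinations(s, 2)):
                found.append(s)
    return found

def check_trivial_bounds(n, edges):
    """Return (alpha(G), i(G), whether 2^alpha <= i(G) <= sum_j binom(n, j))."""
    sets = independent_sets(n, edges)
    alpha = max(len(s) for s in sets)
    count = len(sets)  # the empty set is counted as independent
    upper = sum(comb(n, j) for j in range(alpha + 1))
    return alpha, count, 2 ** alpha <= count <= upper

# Illustrative input: the 5-cycle.
c5_edges = [(i, (i + 1) % 5) for i in range(5)]
c5_result = check_trivial_bounds(5, c5_edges)
```

For the 5-cycle this gives independence number 2 and 11 independent sets, which indeed lies between $2^2=4$ and $1+5+10=16$.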

Trying to improve the upper bound in (1) has garnered a lot of interest over the years and is the subject of the celebrated container method. In the case of graphs it was introduced in the 1980s by Kleitman and Winston [Reference Kleitman and Winston21, Reference Kleitman and Winston22], who used it to count lattices and graphs without four-cycles. Variations of the method have been used over the years to attack a variety of problems. For example, Alon [Reference Alon1] and Sapozhenko [Reference Sapozhenko28] used it to count the number of independent sets in regular graphs, which in turn has applications to counting sum-free sets in Abelian groups (see e.g. [Reference Alon1, Reference Lev, Łuczak and Schoen24, Reference Sapozhenko29]). Another remarkable example is the recent breakthrough lower bound on the off-diagonal Ramsey numbers [Reference Mattheus and Verstraete26]. The method has been extended to the hypergraph case independently by Balogh, Morris, and Samotij [Reference Balogh, Morris and Samotij3] and Saxton and Thomason [Reference Saxton and Thomason30] and has found an even more impressive array of applications; see e.g. the survey [Reference Balogh, Morris and Samotij4], produced to accompany an ICM 2018 talk on the container method, for more examples and information. At a high level, the container method allows one to translate knowledge about a variety of statistics in a (hyper)graph (for example, control of codegrees) into improvements on the upper bound in (1).

In this paper, we study whether having structural information about a graph leads to an improvement in the upper bound (1). Perhaps one of the most studied structural properties of graphs is being $H$-free for a small fixed graph $H$. Here and throughout the paper, a graph $G$ being $H$-free stands for not having $H$ as an induced subgraph. This leads to a very natural question: does $G$ being $H$-free imply a substantial improvement in the upper bound (1)? On the positive side, Farber [Reference Farber14] showed in 1987 that forbidding a $2K_2$ does indeed lead to such an improved bound. Unfortunately, the answer is in general no. If one takes a graph consisting of $\alpha -1$ vertex-disjoint complete graphs of orders as equal as possible (namely the complement of the $(\alpha -1)$-partite Turán graph on $n$ vertices), one obtains a graph with roughly $(n/\alpha )^{\alpha -1}$ independent sets, which is quite close to the upper bound in (1). On the other hand, this graph avoids most graphs as induced subgraphs; in fact, all of its induced subgraphs are themselves vertex-disjoint unions of complete graphs. Our first result shows that one can improve the upper bound substantially if we forbid any such graph as an induced subgraph.
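To make the disjoint-cliques example concrete: an independent set in a vertex-disjoint union of cliques picks at most one vertex from each clique, so the count is a simple product. The sketch below uses a hypothetical parameter `k` for the number of cliques; the helper names are ours.

```python
from math import prod

def balanced_clique_sizes(n, k):
    """Sizes of k vertex-disjoint cliques on n vertices, as equal as possible."""
    q, r = divmod(n, k)
    return [q + 1] * r + [q] * (k - r)

def count_independent_sets_in_clique_union(sizes):
    """Each clique contributes (size + 1) choices: one of its vertices, or none."""
    return prod(s + 1 for s in sizes)

# Illustrative input: 12 vertices split into 3 cliques of order 4.
example_sizes = balanced_clique_sizes(12, 3)
example_count = count_independent_sets_in_clique_union(example_sizes)
```

Here `example_count` equals $5^3 = 125$, already a sizeable fraction of the $\binom{12}{0}+\binom{12}{1}+\binom{12}{2}+\binom{12}{3}=299$ subsets of size at most 3.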

Theorem 1. Any $bK_a$ -free $n$ -vertex graph $G$ with $\alpha =\alpha (G)$ has $i(G) \le n^{O(1)}\cdot \alpha ^{O(\alpha )}.$

For a version of this theorem with precise dependencies see Theorem 7.

Besides characterising which forbidden graphs lead to an improvement, this result is also tight up to a constant factor in the exponent so long as $a \ge 3$ and $b \ge 2$. Indeed, taking $b-1$ disjoint complete graphs of order about $n/b$ each gives an $n$-vertex $bK_a$-free graph with roughly $(n/b)^{b-1}$ independent sets and independence number $b-1$. This shows the polynomial factor in $n$ needs to grow with $b$. One can show the same is true for $a$ by taking a graph consisting of $a$ disjoint complete graphs of order about $n/a$ each and then placing additional edges between parts independently with suitable probability to ensure there will be few copies of (not necessarily induced) $K_{a,a}$'s in the complement but still many independent sets. One can then add a few edges to destroy these copies without destroying too many independent sets. More interestingly, one cannot, in general, improve the second term either (beyond a constant factor in the exponent). This follows since there exist triangle-free graphs (so $bK_a$-free for any $a \ge 3$) with at least $\alpha (G)^{\Omega (\alpha (G))}$ independent sets. This in turn follows by combining two results. The first one is due to Davies, Jenssen, Perkins and Roberts [Reference Davies, Jenssen, Perkins and Roberts12] (see also [Reference Cooper and Mubayi9] for a slightly weaker but qualitatively similar result), showing that any $n$-vertex triangle-free graph with maximum degree $d$ has $i(G) \ge \exp \left (\left (\frac 12+o_d(1)\right )\cdot \frac {\log ^2 d}{d}\cdot n\right )$. The second is that there exist triangle-free graphs $G$ with all degrees being $(1+o(1))\sqrt {\frac 12 n \log n}$ and $\alpha (G)\le (1+o(1))\sqrt {2n \log n}$. One obtains such a graph with high probability from the famous triangle-free process; see e.g. [Reference Bohman and Keevash5, Reference Pontiveros, Griffiths and Morris15] for more details on this topic. This produces a triangle-free graph with at least $\alpha ^{(\sqrt {2}/4+o(1))\alpha }$ independent sets and $\alpha =\Theta (\sqrt {n \log n})$.

Our second main result shows that we can in fact go further and even match the trivial lower bound from (1) up to a constant factor in the exponent if we in addition assume our ground graph is chi-bounded. Here, a hereditary class of graphs $\mathcal{G}$ is said to be chi-bounded if there exists some function $g:\mathbb{N} \to \mathbb{R}$ such that $\chi (G) \le g(\omega (G))$ for every $G \in \mathcal{G}$ . It is a well-studied notion with many interesting applications and connections, see e.g. a recent survey of Scott and Seymour [Reference Scott and Seymour31].

Theorem 2. Let $\mathcal{G}$ be a chi-bounded hereditary class of graphs. For any $bK_a$ -free $n$ -vertex graph $G \in \mathcal{G}$ with $\alpha =\alpha (G)$ we have $i(G) \le n^{O(1)} \cdot 2^{O(\alpha )}.$

For a version of this theorem with precise dependencies see Theorem 8.

Since forbidding $bK_2$ as an induced subgraph implies chi-boundedness, we conclude that the stronger bound of Theorem 2 also holds in Theorem 1 when $a=2$ , completing the picture as we have shown such an improvement is impossible when $a\ge 3$ .

There are two main tools behind our arguments. The first one is a certain hypergraph analogue of an induced Kövári-Sós-Turán Theorem introduced in the graph case by Loh, Tait, Timmons and Zhou [Reference Loh, Tait, Timmons and Zhou25] and extended and used to settle a variety of problems recently in [Reference Axenovich and Zimmermann2, Reference Bourneuf, Bucić, Cook and Davies6, Reference Girão and Hunter17–Reference Illingworth19]. The proof of this lemma is based on the dependent random choice technique (see e.g. the survey [Reference Fox and Sudakov16] for more information). The second ingredient is a certain local-to-global transference lemma for independent set counts, the proof of which is based on the ideas behind the container method.

1.1. Notation

Given a graph $G$ we denote by $\alpha (G),\omega (G),\chi (G)$ and $i(G)$ the independence number, clique number, chromatic number, and number of independent sets in $G$ respectively. We denote by $I_t(G)$ the family of all independent sets of size $t$ in $G$ and write $i_t(G)=|I_t(G)|$ . Given a set of vertices $X$ in a graph $G$ we write $d_G(X)$ for the number of common neighbours of all vertices in $X$ . For counting simplification purposes we consider an empty set of vertices as independent. For the remainder of this paper, all logs are in base 2.

We note that, for the purposes of intuition, we often refer to counts as local or global. In general, the former refers to counting objects within (all) subgraphs of our original graph of a certain size, whereas the latter refers to counting objects in the whole graph.

2. Counting independent sets locally

In this section, we will prove our key technical tool, namely a hypergraph variant of the induced version of the Kövári-Sós-Turán Theorem [Reference Kövari, Sós and Turán23]. The classical theorem of Kövári, Sós, and Turán dating back to 1954 states that if an $n$ vertex graph does not contain a $K_{s,s}$ as a not necessarily induced subgraph, then it has at most $O(n^{2-1/s})$ edges. It proved itself as an incredibly useful tool over the years as in many problems one can easily verify there are no $K_{s,s}$ -subgraphs and as a result conclude that the graph in question is “locally sparse”. Erdős extended this result to hypergraphs in 1964 [Reference Erdős13] showing that forbidding the $r$ -partite $r$ -uniform complete hypergraph as a not necessarily induced subgraph of an $r$ -uniform $n$ -vertex hypergraph implies the number of edges is at most $O(n^{r-\varepsilon })$ for some $\varepsilon \gt 0$ depending on the forbidden hypergraph. Another natural extension of the classical theorem in which one only forbids $K_{s,s}$ as an induced subgraph was only considered about 10 years ago in the graph case by Loh, Tait, Timmons, and Zhou [Reference Loh, Tait, Timmons and Zhou25] and has found some very interesting further extensions and applications in the last few years. Unfortunately, the straightforward generalisation can not hold, as even a complete graph is $K_{s,s}$ -free so long as $s \ge 2$ . However, if one, in addition, forbids a clique (even of a size polynomial in $n$ ) one suddenly recovers the classical bound.

Our technical lemma gives in a sense a common extension of both of these results. Roughly speaking our result states that if one forbids a complete $r$ -partite graph as an induced subgraph and assumes there are no large cliques in our graph then there are at most $O(n^{r-\varepsilon })$ cliques of size $r$ for some $\varepsilon \gt 0$ depending on the forbidden graph. Our result is actually slightly stronger, we assume a more flexible condition that every $m$ vertices contain an independent set of size $a$ (not having a large clique implies such a condition holds via Ramsey’s Theorem). We note that in [Reference Loh, Tait, Timmons and Zhou25] a similar result focusing on the number of larger cliques has been proved but only under a much stronger assumption that a complete bipartite graph is forbidden. We also note that our result is not a full-fledged extension of Erdős’ result as it only applies to clique complexes, namely hypergraphs whose edges are cliques of a graph. One can actually prove such a stronger variant using a similar approach although since we do not need it here we choose not to do so. Part of the reason for this is that for our chi-bounded result, we need a very precise bound here, namely one that gives us a (slight) improvement even under a much weaker local assumption. We note also that the result is stated in the complement compared to the above discussion since this is how we will use it.

We start with a precise definition of the local condition we will use.

Definition 3. A graph $G$ is $(m,a)$ -cliquey if all $m$ -vertex induced subgraphs of $G$ contain a $K_a$ .

So in particular, by Ramsey’s theorem, we know that any graph $G$ is $(\alpha (G)^a,a)$ -cliquey for any $a$ .
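Definition 3 can be checked directly on small graphs. The following brute-force sketch assumes the graph is given as a dictionary `adj` mapping each vertex to its set of neighbours; both the representation and the function names are our own, and the running time is exponential.

```python
from itertools import combinations

def has_clique(vertices, adj, a):
    """True if some a-subset of `vertices` is pairwise adjacent."""
    return any(
        all(v in adj[u] for u, v in combinations(c, 2))
        for c in combinations(vertices, a)
    )

def is_cliquey(adj, m, a):
    """True if every m-vertex induced subgraph contains a K_a (Definition 3)."""
    return all(has_clique(sub, adj, a) for sub in combinations(sorted(adj), m))

# Illustrative input: the 5-cycle, which has independence number 2.
c5 = {i: {(i - 1) % 5, (i + 1) % 5} for i in range(5)}
c5_is_3_2_cliquey = is_cliquey(c5, 3, 2)
c5_is_2_2_cliquey = is_cliquey(c5, 2, 2)
```

Since the 5-cycle has no independent set of size 3, every 3 vertices span an edge, so it is $(3,2)$-cliquey; it is not $(2,2)$-cliquey because it has non-adjacent pairs.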

We advise the reader that the following lemma and its proof might be initially easier to read under the assumption that $m$ is polynomially smaller than $n$ which ensures $\varepsilon \gt 0$ is an absolute constant depending only on $a$ and $b$ . Indeed, this is sufficient for our proof of Theorem 1 and we suspect for most future applications as well. However, as mentioned above we need the more precise version, which allows for smaller, subpolynomial gains under weaker assumptions to prove Theorem 2.

Lemma 4. Let $a,b \ge 1$ and $n \ge m$ be integers. Let $G$ be a $bK_a$ -free $n$ -vertex graph which is $(m,a)$ -cliquey. Then there are at most ${n^{b-\varepsilon }}/{b!}$ independent sets of size $b$ in $G$ , where

\begin{equation*}\varepsilon :=\left (\frac {\log \frac nm}{8ab \log n}\right )^{b}.\end{equation*}
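To get a feel for the size of the gain $\varepsilon$, one can simply evaluate the formula. The helper below is an illustrative sketch (the function name is ours), using base-2 logarithms in line with the convention of Section 1.1.

```python
from math import log2

def lemma4_epsilon(n, m, a, b):
    """Evaluate epsilon = (log(n/m) / (8ab log n))^b with logs in base 2."""
    if not (a >= 1 and b >= 1 and n >= m >= 1 and n >= 2):
        raise ValueError("expected a, b >= 1 and n >= m >= 1 with n >= 2")
    return (log2(n / m) / (8 * a * b * log2(n))) ** b

# Illustrative values: m is polynomially smaller than n (here m = sqrt(n)).
eps_example = lemma4_epsilon(2 ** 20, 2 ** 10, 2, 2)
```

With $n=2^{20}$, $m=2^{10}$ and $a=b=2$ this evaluates to $(10/640)^2 = 1/4096$; when $m=n$ the gain is $0$ and the lemma reduces to the trivial bound.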

Proof. We write $\varepsilon _b:=\left (\frac {\log \frac nm}{8ab \log n}\right )^{b}$ as the values of $m,a$ and $n$ for which we will use this expression always remain the same. We prove the lemma by induction on $b$ . If $b=1$ , the lemma is vacuous since $G$ being $(m,a)$ -cliquey and $K_a$ -free imply we must have $n \lt m$ . Let us now assume $b \ge 2$ and that the lemma holds for any $(b-1)K_a$ -free graph. Let $G$ be a $bK_a$ -free $(m,a)$ -cliquey graph with $n$ vertices. If $a=1$ or $n\lt b$ there are no independent sets of size $b$ in $G$ and the lemma holds, so we may in addition assume $a \ge 2$ and $n \ge b$ .

We may assume $n \gt m$, as otherwise $\varepsilon _b=0$ and the desired bound is larger than $\binom {n}{b}$, so it is trivially true. Suppose towards a contradiction that $G$ has more than $\frac {n^{b-\varepsilon _b}}{b!}$ independent sets of size $b$. Our general strategy will be to find a set of vertices containing many independent sets of size $b-1$ in which every subset of size $a(b-1)$ has at least $m$ common non-neighbours. By induction, this set will contain a $(b-1)K_a$, which will then be extended to an induced $bK_a$.

Let $T$ be a random subset of vertices of $G$ obtained by sampling $t= \left \lfloor \frac {4ab \log n}{\log \frac nm} \right \rfloor \gt 2ab\cdot \frac {\log n}{\log \frac nm}$ times uniformly at random, with repetitions, a vertex of $G$ . Let $U$ be the set of vertices not adjacent to any vertex of $T$ . Let $X$ count the number of independent sets of size $b-1$ contained in $U$ . An independent set $I$ of size $b-1$ is contained within $U$ only if all $t$ vertices we sampled belong to its common non-neighbourhood, which happens with probability $(d_{\bar {G}}(I)/n)^{t}$ so by using Jensen’s inequality (and convexity of $f(x)=x^t$ for $x$ positive) we get

\begin{align*} {\mathbb{E}} X = \sum _{I \in I_{b-1}(G)} \left (\frac {d_{\bar {G}}(I)}{n}\right )^t \ge |I_{b-1}(G)| \left (\frac {\sum _{I \in I_{b-1}(G)} d_{\bar {G}}(I)}{n|I_{b-1}(G)|}\right )^t &\ge \frac {n^{b-1}}{(b-1)!} \left (\frac {b|I_b(G)|}{n\cdot \frac {n^{b-1}}{(b-1)!}}\right )^t\\ &\gt \frac {n^{b-1}}{(b-1)!} \cdot n^{-t\varepsilon _b}\\ &\gt \frac {n^{b-1}}{(b-1)!}\cdot n^{-\varepsilon _{b-1}/2}, \end{align*}

where in the second inequality we used $|I_{b-1}(G)|\le \binom {n}{b-1}\le \frac {n^{b-1}}{(b-1)!}$ (and the fact this term appears with a power $1-t \le 0$ ) and the hypergraph handshake lemma. In the third inequality, we used the assumed lower bound on $|I_b(G)|$ . In the final inequality we used $t\varepsilon _b= \left \lfloor \frac {4ab \log n}{\log \frac nm} \right \rfloor \cdot \left (\frac {\log \frac nm}{8ab \log n}\right )^{b}\lt \frac 12 \left (\frac {\log \frac nm}{8a(b-1) \log n}\right )^{b-1}=\frac {\varepsilon _{b-1}}2.$ On the other hand, given a set of $a(b-1)$ vertices with less than $m$ common non-neighbours the probability that this set is a subset of $U$ is at most $\left (\frac {m}{n}\right )^t$ . So if we let $Y$ be the random variable counting the number of $a(b-1)$ -sized subsets of $U$ with less than $m$ common non-neighbours we have

\begin{equation*}{\mathbb{E}} Y \le \binom {n}{a(b-1)}\left (\frac {m}{n}\right )^t \le {n^{a(b-1)}}\cdot 2^{-t \log \frac nm}\lt {n^{a(b-1)}}\cdot n^{-2ab} ={n^{-a(b+1)}}. \end{equation*}

This shows that there is an outcome for which

\begin{equation*}X-\binom {n}{b-1}\cdot Y \gt \frac {n^{b-1-\varepsilon _{b-1}/2}-n^{-2-(a-1)(b+1)}}{(b-1)!}\ge \frac {n^{b-1-\varepsilon _{b-1}/2}-n^{-2}}{(b-1)!}.\end{equation*}

Let us consider such an outcome $U$ . Note that since $X \le \binom {n}{b-1}$ and $n^{b-1-\varepsilon _{b-1}/2}-n^{-2}\gt 0$ we must have $Y=0$ . Furthermore, since $b \ge 2$ we have $\varepsilon _{b-1} \le 2(b-1)\frac {\log \frac {n}{m}}{\log n},$ which is equivalent to $n^{b-1-\varepsilon _{b-1}/2}\ge m^{b-1}.$ This implies that $X\gt \frac {m^{b-1}-n^{-2}}{(b-1)!}\gt \binom {m-1}{b-1},$ so $|U| \ge m.$ This, together with $a \ge 2$ implies there must be an edge inside $G[U],$ so $X \le \binom {n}{b-1}-2\le \frac {n^{b-1}-2}{(b-1)!},$ where we used $n \ge b \ge 2$ . Combined with the above lower bound on $X$ we get $n^{b-1}-2 \gt n^{b-1-\varepsilon _{b-1}/2}-n^{-2}$ which implies $n^{\varepsilon _{b-1}/2}\gt 1+\frac {1}{n^{b-1}}$ which in turn gives $X\gt \frac {n^{b-1-\varepsilon _{b-1}/2}-n^{-2}}{(b-1)!}\ge \frac {n^{b-1-\varepsilon _{b-1}}}{(b-1)!}.$

If $b=2$ this implies $|U|=X\gt n^{1-\varepsilon _1}\gt m$ so we can find a $K_a$ inside of $G[U]$ . For $b \ge 3$ consider an auxiliary graph $G'$ on the same vertex set for which $G'[U]=G[U]$ but every vertex in $V(G) \setminus U$ is adjacent to every other vertex of $G'$ . $G'$ is an $n$ vertex graph, is clearly $(m,a)$ -cliquey and has more than $\frac {n^{b-1-\varepsilon _{b-1}}}{(b-1)!}$ independent sets of size $b-1$ . So by the inductive assumption, it must contain a $(b-1)K_a$ as an induced subgraph. Since $b-1\ge 2$ the vertices of this $(b-1)K_a$ are not adjacent to all other vertices and hence must belong to $U$ . Hence, in either case, we find an induced copy of $(b-1)K_a$ inside $G[U]$ . Finally, since $Y=0$ the $a(b-1)$ vertices comprising this $(b-1)K_a$ have at least $m$ common non-neighbours. Among them, we can find a $K_a$ giving us an induced $bK_a$ in $G$ and the desired contradiction.
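The sampling step at the heart of the proof above (choose $t$ vertices with repetition and let $U$ be the set of vertices with no neighbour among them) can be sketched as follows. This is only an illustration of a single random round on an explicitly given graph, not of the probabilistic analysis; the adjacency-dictionary representation and the function name are our own assumptions.

```python
import random
from math import floor, log2

def sample_T_and_U(adj, m, a, b, rng):
    """One round of the sampling step: returns (t, T, U)."""
    n = len(adj)
    if n <= m:
        raise ValueError("the sampling step assumes n > m")
    # t = floor(4ab log n / log(n/m)), as in the proof
    t = floor(4 * a * b * log2(n) / log2(n / m))
    vertices = sorted(adj)
    # sample t times uniformly at random, with repetitions
    T = {rng.choice(vertices) for _ in range(t)}
    # U: vertices not adjacent to any vertex of T
    U = {u for u in vertices if adj[u].isdisjoint(T)}
    return t, T, U

# Illustrative input: four vertex-disjoint cliques of order 4.
cliques = {v: {u for u in range(16) if u != v and u // 4 == v // 4} for v in range(16)}
t_example, T_example, U_example = sample_T_and_U(cliques, 4, 2, 2, random.Random(0))
```

For this input $t=32$. In the proof one then argues about the expected number of independent $(b-1)$-sets inside $U$ and about $a(b-1)$-subsets of $U$ with few common non-neighbours, which a single sample of course does not capture.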

3. Translating counts from local to global

In this section, we prove our second technical result which will allow us to propagate a tiny gain in the number of independent sets of some small size $b$ to a more substantial one globally. The basic idea behind the proof is reminiscent of the proofs of the container theorems (see e.g. [Reference Balogh, Morris and Samotij3, Reference Saxton and Thomason30]) although for the specific regime we work with we have a very simple proof (motivated in part by the ideas in [Reference Bucić, Fox and Pham7]).

The following definition will come in useful in tracking independent set counts on subgraphs.

Definition 5. Given a graph $G$ and a real $m \ge 0$ let

\begin{equation*}i(G,m):=\max _{\substack {G' \subseteq G, |G'| \le m}} i(G').\end{equation*}

In other words, $i(G,m)$ denotes the maximum number of independent sets contained in an induced subgraph of $G$ with up to $m$ vertices.
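For very small graphs, $i(G,m)$ can be computed by brute force. The sketch below (names and adjacency-dictionary representation are ours) uses the observation that adding a vertex to an induced subgraph never decreases its number of independent sets, so it suffices to look at vertex sets of size exactly $\min (\lfloor m \rfloor, |G|)$.

```python
from itertools import combinations

def count_independent_sets(vertices, adj):
    """Number of independent sets (including the empty set) inside `vertices`."""
    vertices = list(vertices)
    return sum(
        1
        for k in range(len(vertices) + 1)
        for s in combinations(vertices, k)
        if all(v not in adj[u] for u, v in combinations(s, 2))
    )

def local_count(adj, m):
    """i(G, m): the maximum of i(G[S]) over vertex sets S with |S| <= m."""
    size = min(int(m), len(adj))
    return max(
        count_independent_sets(s, adj) for s in combinations(sorted(adj), size)
    )

# Illustrative input: the 5-cycle.
c5 = {i: {(i - 1) % 5, (i + 1) % 5} for i in range(5)}
c5_local_3 = local_count(c5, 3)
```

On the 5-cycle, the best choice of 3 vertices spans a single edge and gives $i(C_5,3)=6$, while $i(C_5,5)=i(C_5)=11$.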

Lemma 6. Let $b \ge 1,$ and suppose $G$ is an $n$ -vertex graph with at most $n^{b-\varepsilon }/b!$ independent sets of size $b$ . Then, $i(G) \le n^{b-1}\cdot i(G, n^{1-\varepsilon /2^{b-1}})$ .

Proof. We call any independent set of size $b$ atypical. Now for any $1 \le i \le b-1$ we call an independent set of size $b-i$ typical if it belongs to fewer than $n^{1-\varepsilon /2^i}$ atypical independent sets of size $b-i+1$ and we say it is atypical otherwise. Let ${\mathcal{I}}_t$ denote the collection of atypical independent sets of size $t$ . So ${\mathcal{I}}_b$ consists of all independent sets of size $b$ in $G$ and has size $|{\mathcal{I}}_b| \le n^{b-\varepsilon }/b!$ . Since each atypical independent set of size $b-i$ belongs to at least $n^{1-\varepsilon /2^i}$ atypical independent sets of size $b-i+1$ and each of these sets can contain at most $b-i+1$ of them we conclude $|{\mathcal{I}}_{b-i}|\cdot n^{1-\varepsilon /2^i} \le |{\mathcal{I}}_{b-i+1}|\cdot (b-i+1)$ . We conclude that

(2)

\begin{equation} |{\mathcal{I}}_{b-i}| \le \frac {|{\mathcal{I}}_{b}|\cdot b\cdot (b-1) \cdots (b-i+1)}{n^{1-\varepsilon /2}\cdot n^{1-\varepsilon /4}\cdots n^{1-\varepsilon /2^i}}\le \frac {n^{b-i-\varepsilon /2^i}}{(b-i)!}. \end{equation}

We now count the independent sets of $G$ based on the size of the largest typical independent set they contain. Note that there are at most $\binom {n}{b-i}i(G,n^{1-\varepsilon /2^{i}})$ independent sets in $G$ with the largest typical independent set they contain having size $b-i$ . Indeed, there are $\binom {n}{b-i}$ choices for the typical independent set, and once this is fixed the rest of the independent set is restricted to vertices which extend it into any atypical independent set of size $b-i+1$ (by maximality) and by definition of typicality there are at most $n^{1-\varepsilon /2^i}$ such vertices. This only leaves the independent sets not containing any typical independent sets at all. Note that any such set is restricted to use only the vertices which are atypical independent sets of size one, of which there are by (2) at most $n^{1-\varepsilon /2^{b-1}}.$ Putting all of this together we conclude that the number of independent sets in $G$ is at most

\begin{equation*} i(G,n^{1-\varepsilon /2^{b-1}})+\sum _{i=1}^{b-1} \binom {n}{b-i}i(G,n^{1-\varepsilon /2^{i}}) \le {n}^{b-1}\cdot i(G,n^{1-\varepsilon /2^{b-1}}), \end{equation*}

as desired.
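For the reader's convenience, we spell out the exponent bookkeeping behind the second inequality in (2); this is only an expansion of the computation used in the proof, combining the geometric sum in the denominator with the assumed bound $|{\mathcal{I}}_b| \le n^{b-\varepsilon }/b!$.

\begin{align*} \sum _{j=1}^{i}\left (1-\frac {\varepsilon }{2^j}\right ) &= i-\varepsilon \left (1-\frac {1}{2^i}\right ),\\ \frac {|{\mathcal{I}}_{b}|\cdot b\cdot (b-1) \cdots (b-i+1)}{n^{i-\varepsilon (1-2^{-i})}} &\le \frac {n^{b-\varepsilon }}{b!}\cdot \frac {b!}{(b-i)!}\cdot n^{-i+\varepsilon -\varepsilon /2^i} = \frac {n^{b-i-\varepsilon /2^i}}{(b-i)!}. \end{align*}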

4. Counting independent sets in $H$ -free graphs

In this section, we prove our main results. We begin with Theorem 1 which we state here in a slightly more precise form.

Theorem 7. Let $a,b \ge 1$ be integers. There exists $C=C(a,b)\ge 0$ such that any $bK_a$ -free $n$ -vertex graph $G$ with $\alpha =\alpha (G)$ has $i(G) \le n^C\cdot \alpha ^{2a\alpha }.$

Proof. We will prove the theorem with $\varepsilon =\varepsilon (a,b):=(16ab)^{-b}$ , $C=C(a,b)=(b-1)2^{b-1}/\varepsilon$ . We proceed by induction on $n$ . Observe first that if $n \le \alpha ^{2a},$ then the desired inequality holds since $i(G)\le n^\alpha \le \alpha ^{2a\alpha }$ . Let us now assume that $n \gt \alpha ^{2a}$ and that any induced subgraph $H$ of $G$ on $m\lt n$ vertices satisfies the desired inequality. In particular, this implies $i(G,m)\le m^C\cdot \alpha ^{2a\alpha }$ for any $m \lt n$ .

Ramsey’s Theorem implies that $G$ (and in fact any graph) is $(\alpha ^a,a)$ -cliquey. Lemma 4 implies there are at most $n^{b-\varepsilon }/b!$ independent sets of size $b$ in $G$ since

\begin{equation*}\left (\frac {\log \frac n{\alpha ^a}}{8ab \log n}\right )^{b} \ge (16ab)^{-b}.\end{equation*}

By Lemma 6 this implies

\begin{equation*} i(G) \le n^{b-1}\cdot i(G, n^{1-\varepsilon /2^{b-1}}) \le {n}^{b-1}\cdot n^{C-C\varepsilon /2^{b-1}}\cdot \alpha ^{2a\alpha }= n^C\alpha ^{2a\alpha }. \end{equation*}

This completes the induction and the proof.
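The choice of $C$ in the proof is exactly what makes the exponents balance in the last display: $(b-1)+C(1-\varepsilon /2^{b-1})=C$. The following sketch verifies this identity with exact rational arithmetic for a few small values of $a$ and $b$; the function names are ours.

```python
from fractions import Fraction

def theorem7_constants(a, b):
    """Return (epsilon, C) with epsilon = (16ab)^(-b), C = (b-1) 2^(b-1) / epsilon."""
    eps = Fraction(1, (16 * a * b) ** b)
    C = Fraction((b - 1) * 2 ** (b - 1)) / eps
    return eps, C

def inductive_exponent(a, b):
    """Exponent of n obtained from Lemma 6 plus the inductive hypothesis."""
    eps, C = theorem7_constants(a, b)
    return (b - 1) + C * (1 - eps / 2 ** (b - 1))

# Illustrative values a = b = 2: epsilon = 1/4096 and C = 8192.
eps_22, C_22 = theorem7_constants(2, 2)
```

For $a=b=2$ this gives $\varepsilon =1/4096$ and $C=8192$, and `inductive_exponent(2, 2)` returns exactly $8192$.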

We note that by being more careful with the numbers in the above argument one can improve the bound to $n^{O(1)}\cdot \alpha ^{(1+o(1))a\alpha }.$

We proceed with our result in the chi-bounded case, namely Theorem 2. The proof is similar to the above, the main distinction being that we can lower the base case to $n$ being linear in $\alpha$ rather than polynomial. This however comes with additional issues concerning the fact that Lemma 4 stops giving us a polynomial gain in counts of small independent sets. The gain is still sufficient for our purposes though.

Theorem 8. Let $a,b \ge 1$ be integers and let $\mathcal{G}$ be a chi-bounded, $bK_a$ -free hereditary class of graphs. There exists $C=C(\mathcal{G})\gt 0$ such that for every $G \in \mathcal{G}$ we have $i(G) \le |G|^{C} \cdot 2^{C\alpha (G)}.$

Proof. Let $g$ be a non-decreasing integral chi-bounding function, so that $\frac {|G'|}{\alpha (G')}\le \chi (G') \lt g(\omega (G'))$ for any $G' \in \mathcal{G}$ . Let $C=g(a)\cdot (32ab)^{2b}.$

Let $G \in \mathcal{G}$ be an $N$ -vertex $bK_a$ -free graph and let $\alpha =\alpha (G)$ .

Let $G'$ be an induced subgraph of $G$ with $\alpha g(a)$ vertices. By our chi-boundedness assumption we have $\alpha (G)g(a) =|G'| \lt \alpha (G')g(\omega (G'))\le \alpha (G)g(\omega (G'))$, so $g(a) \lt g(\omega (G'))$ and, since $g$ is non-decreasing, $\omega (G')\gt a$. In particular, this implies that $G$, as well as any of its induced subgraphs, is $(\alpha g(a),a)$-cliquey.

Let $m:=\alpha g(a)$ and $\varepsilon _n:=\left (\frac {\log \frac nm}{8ab \log n}\right )^{b}$. We note that while $\varepsilon _n$ is a function of $a,b,m$ as well as $n$, the values of $a,b,m$ will remain fixed throughout the argument. Now for any $n \ge m$, Lemma 4 implies that any $n$-vertex induced subgraph of $G$ contains at most $n^{b-\varepsilon _n}/b!$ independent sets of size $b$. This in turn, via Lemma 6, implies it has at most $n^{b-1} \cdot i(G,n^{1-\varepsilon _n/2^{b-1}})$ independent sets in total. Since this subgraph was arbitrary, for any real $n \ge m$ we have

(3)

\begin{equation} i(G,n) \le n^{b-1} \cdot i\left (G,n^{1-\varepsilon _n/2^{b-1}}\right ). \end{equation}

Suppose first that $2m\le n \le m^2$ . Then, combined with (3) we get

\begin{equation*} \varepsilon _n/2^{b-1}= \left (\frac {\log \frac nm}{8ab \log n}\right )^{b}/2^{b-1} \ge \frac {1}{(32ab)^b\log ^b m}=:c \implies i(G,n) \le n^{b-1} \cdot i(G,n^{1-c}).\end{equation*}

The main benefit compared to (3) is that the exponent $c$ does not depend on $n$ (which is also why we require an assumption on the rough size of $n$ ). This makes it easier to iterate the bound to get for any integer $j \ge 1$ that:

\begin{equation*} i(G,n) \le n^{b-1} \cdot i\left (G,n^{1-c}\right ) \le n^{(b-1)j} \cdot i\left (G,\max \{n^{(1-c)^j},2m\}\right ).\end{equation*}

By choosing $j= \left \lfloor 1/c \right \rfloor$ we get

\begin{align*} i(G,n) &\le n^{(b-1)/c} \cdot i(G,2m)\\ &\le (m^2)^{(b-1)(32ab)^b\log ^{b} m} \cdot 2^{2m} \le 2^{2(b-1)(32ab)^b\log ^{b+1} m +2m}\le 2^{2b(32ab)^b(b+1)^{b+1} m}\le 2^{C\alpha }. \end{align*}

Suppose now $n \ge m^2$ . This combined with (3) gives

\begin{equation*} \varepsilon _n/2^{b-1}= \left (\frac {\log \frac nm}{8ab \log n}\right )^{b}/2^{b-1} \ge \frac {1}{(16ab)^b}=:c \implies i(G,n) \le n^{b-1} \cdot i(G,n^{1-c}).\end{equation*}

Similarly to the above, we get that in this range for any integer $j\ge 1$

\begin{equation*} i(G,n) \le n^{(b-1)(1+(1-c)+(1-c)^{2}+\ldots +(1-c)^{j})} \cdot i\left (G, \max \{n^{(1-c)^{j+1}},m^2\}\right ).\end{equation*}

By choosing $j$ large enough we get

\begin{equation*}i(G,n)\le n^{(b-1)/c}\cdot i\left (G,m^2\right ) \le n^{(b-1)(16ab)^b} \cdot 2^{C\alpha }.\end{equation*}
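The passage from the iterated bound to the exponent $(b-1)/c$ rests only on a geometric series; we record the step explicitly, assuming $0 \lt c \le 1$ as is the case here.

\begin{equation*} \sum _{k=0}^{j}(1-c)^{k}=\frac {1-(1-c)^{j+1}}{c}\le \frac 1c \quad \implies \quad n^{(b-1)(1+(1-c)+(1-c)^{2}+\ldots +(1-c)^{j})}\le n^{(b-1)/c}. \end{equation*}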

Putting all of this together, since $i(G)=i(G,N),$ this completes the proof if $N \ge 2m$ . In the remaining case the trivial bound of $2^{N}\le 2^{2g(a)\alpha }$ suffices.

We note that by being more careful with the numbers in the above argument one can improve the bound to $n^{O(1)} \cdot 2^{(1+o(1))g(a)\alpha }$ .

5. Concluding remarks and open problems

In this paper, we improve the trivial upper bound on the number of independent sets in $bK_a$-free graphs. One of the main points of interest in improving upper bounds on the number of independent sets in a variety of graphs is that it allows for reducing the number of events over which we need to run various union bound arguments. It would be very interesting to find such applications of our results.

Given a graph $G$ with weights on its vertices, the Maximum Weight Independent Set (MWIS) problem asks for an independent set in $G$ of maximum weight. It is well known that MWIS is (very) computationally difficult in general, and is in particular (strongly) NP-hard [Reference Karp20]. This motivated a considerable amount of work on efficient algorithms for MWIS in graphs under various restrictions; see [Reference Dallard, Milanič and Štorgel11] for a more detailed treatment of the history and many examples. For example, it is known that a polynomial time algorithm exists for graphs with bounded treewidth. In fact, combining the results of [Reference Dallard, Fomin, Golovach, Korhonen and Milanič10, Reference Dallard, Milanič and Štorgel11] gives that it is enough to have a bounded independence number in every bag of some tree decomposition.¹ A natural next step is to explore whether this can be further relaxed to allow the independence number bound to grow with $n$, in particular given several recent results (see [Reference Chudnovsky, Hajebi, Lokshtanov and Spirkl8, Reference Dallard, Fomin, Golovach, Korhonen and Milanič10, Reference Dallard, Milanič and Štorgel11] and references therein) showing that various structural restrictions imply the existence of a tree decomposition in which each bag has independence number at most polynomial in $\log n$. In addition, there is a polynomial time algorithm for MWIS if we are given a tree decomposition in which every bag has only polynomially many independent sets. All of this motivates the question of which structural restrictions guarantee that there are only polynomially many independent sets in a graph with a small independence number. Theorem 1 tells us that an $n$-vertex, $bK_a$-free graph $G$ with $\alpha (G)\le O(\log n / \log \log n)$ has $n^{O(1)}$ independent sets. Theorem 2 tells us that under the additional assumption that $G$ belongs to a chi-bounded class of graphs the same holds already when $\alpha (G) \le O(\log n)$.
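To see where these two thresholds come from, one can substitute them into the bounds $n^{O(1)} \cdot \alpha^{O(\alpha)}$ and $n^{O(1)} \cdot 2^{O(\alpha)}$ of Theorems 1 and 2. The following is only a sketch of that arithmetic: the constants $C$ and $c$ are placeholders for the implied constants in the $O(\cdot)$ notation (they are not named in the text), logarithms are taken to base $2$ for concreteness, and $n$ is assumed large enough that $\log \alpha \le \log c + \log\log n \le 2\log\log n$.

```latex
\begin{align*}
\alpha \le \frac{c \log n}{\log\log n}
  &\;\Longrightarrow\;
  \alpha^{C\alpha} = 2^{C \alpha \log \alpha}
  \le 2^{C \cdot \frac{c \log n}{\log\log n} \cdot 2\log\log n}
  = n^{2Cc}, \\
\alpha \le c \log n
  &\;\Longrightarrow\;
  2^{C\alpha} \le 2^{C c \log n} = n^{Cc}.
\end{align*}
```

In both cases the factor depending on $\alpha$ is absorbed into the $n^{O(1)}$ term, which is exactly the polynomial count of independent sets stated above.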

Another interesting future direction might be to try to obtain improvements similar to ours under other structural restrictions.

Our bounds are tight up to a constant in the exponent in general. It would be interesting to obtain optimal exponents, at least asymptotically. This certainly seems to require additional ideas. With this in mind, we sketch here an alternative argument, inspired by the one Farber [Reference Farber14] used to settle the $2K_2$-free case, which can be used to prove both of our main results in the case one forbids a disjoint union of $(b-1)K_2$ and $K_a$. Instead of counting all independent sets, we will only count maximal ones. Since each maximal independent set contains at most $2^{\alpha (G)}$ independent sets of $G$ and every independent set is contained in at least one maximal one, the two counts differ by a factor of at most $2^{\alpha (G)}$. The argument now proceeds by induction on $b$. The case $b=1$ follows immediately from Ramsey’s Theorem. For larger $b$, let us fix an arbitrary vertex $v$. There are three types of maximal independent sets in $G$:

1. Ones that do not contain $v$, and are hence also maximal in $G \setminus \{v\}$.
2. Ones that do contain $v$, and are, after removal of $v$, maximal in $G \setminus \{v\}$.
3. Ones that do contain $v$, but are, after removal of $v$, not maximal in $G \setminus \{v\}$.

It is easy to see that the number of maximal independent sets of type 1 plus the number of maximal independent sets of type 2 equals the number of maximal independent sets of $G \setminus \{v\}$. On the other hand, for any maximal independent set of type 3 there must exist a vertex $u$ adjacent to $v$ but otherwise having no neighbours in the maximal independent set. This means that we can upper bound the number of such maximal independent sets by going through all neighbours $u$ of $v$ (at most $n$ choices) and counting maximal independent sets in the set of common non-neighbours of $v$ and $u$. The crucial observation is that this set must be $(b-2)K_2+K_a$-free, as we could extend any such induced subgraph by $vu$ to a $(b-1)K_2+K_a$. So we can use induction to get a strong upper bound on the number of maximal independent sets of type 3, which suffices to prove the desired bounds.
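The three-type decomposition can be checked by brute force on small graphs. The sketch below is only an illustration and not part of the argument: the function names (`maximal_independent_sets`, `classify_by_type`, `type3_upper_bound`, `cycle_adjacency`) and the choice of the $5$-cycle as a test graph are assumptions made here. It verifies that types 1 and 2 together account for exactly the maximal independent sets of $G \setminus \{v\}$, and that the number of sets of type 3 is at most the sum, over neighbours $u$ of $v$, of the number of maximal independent sets among the common non-neighbours of $u$ and $v$.

```python
from itertools import combinations


def maximal_independent_sets(vertices, adj):
    """Brute-force list of the maximal independent sets of the graph induced on `vertices`."""
    vs = sorted(vertices)
    found = []
    for r in range(len(vs) + 1):
        for cand in combinations(vs, r):
            s = set(cand)
            independent = all(adj[x].isdisjoint(s) for x in s)
            # Maximal: every vertex outside s has a neighbour inside s.
            maximal = all(adj[w] & s for w in vs if w not in s)
            if independent and maximal:
                found.append(frozenset(s))
    return found


def classify_by_type(vertices, adj, v):
    """Count maximal independent sets of types 1, 2, 3 relative to the vertex v."""
    rest = set(vertices) - {v}
    mis_rest = set(maximal_independent_sets(rest, adj))
    counts = {1: 0, 2: 0, 3: 0}
    for s in maximal_independent_sets(vertices, adj):
        if v not in s:
            counts[1] += 1          # type 1: avoids v
        elif s - {v} in mis_rest:
            counts[2] += 1          # type 2: still maximal after removing v
        else:
            counts[3] += 1          # type 3: not maximal after removing v
    return counts, len(mis_rest)


def type3_upper_bound(vertices, adj, v):
    """Sum over neighbours u of v of the number of maximal independent sets
    in the common non-neighbourhood of u and v."""
    vs = set(vertices)
    total = 0
    for u in adj[v] & vs:
        common = vs - {u, v} - adj[u] - adj[v]
        total += len(maximal_independent_sets(common, adj))
    return total


def cycle_adjacency(n):
    """Adjacency sets of the cycle on vertices 0, ..., n-1 (hypothetical test graph)."""
    return {i: {(i - 1) % n, (i + 1) % n} for i in range(n)}


if __name__ == "__main__":
    adj = cycle_adjacency(5)
    for v in range(5):
        counts, n_rest = classify_by_type(range(5), adj, v)
        assert counts[1] + counts[2] == n_rest
        assert counts[3] <= type3_upper_bound(range(5), adj, v)
    print(classify_by_type(range(5), adj, 0))
```

The enumeration is exponential in the number of vertices, so this is suitable only for sanity checks on very small graphs; it says nothing about the quality of the bound obtained by the induction.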

As we already mentioned, the number of independent sets and the number of maximal independent sets are at most a factor of $2^{\alpha (G)}$ apart, so they behave similarly. On the other hand, no such relation seems to hold for the number of maximum independent sets. This leads to the natural question of whether one can improve our results if instead of counting independent sets we count only maximum ones. For example, even the following initial question remains open.

Question 9. Does every $n$-vertex triangle-free graph with independence number $\alpha$ contain at most $2^{O(\alpha )}$ maximum independent sets?

If true, this would be tight (up to a constant factor in the exponent), as can be seen by considering $nK_2$. More generally, taking a disjoint union of $mK_2$ and a triangle-free graph on $n-2m$ vertices with as small an independence number as possible shows that the above result would be tight for essentially any choice of $\alpha$.
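The tightness example can be confirmed numerically for small cases. The following sketch is an illustration only, with helper names (`disjoint_edges`, `independent_sets_by_size`, `alpha_and_maximum_count`) chosen here. It enumerates the independent sets of $mK_2$ by brute force and reads off the independence number and the number of maximum independent sets, which should be $m$ and $2^m$ respectively, since a maximum independent set picks exactly one endpoint of each edge.

```python
from itertools import combinations


def disjoint_edges(m):
    """Vertex count and edge list of mK_2, i.e. m pairwise disjoint edges."""
    return 2 * m, [(2 * i, 2 * i + 1) for i in range(m)]


def independent_sets_by_size(n, edges):
    """Return counts where counts[k] is the number of independent sets of size k."""
    counts = [0] * (n + 1)
    for k in range(n + 1):
        for cand in combinations(range(n), k):
            s = set(cand)
            # Independent: no edge has both endpoints in s.
            if not any(a in s and b in s for a, b in edges):
                counts[k] += 1
    return counts


def alpha_and_maximum_count(n, edges):
    """Independence number and number of maximum independent sets."""
    counts = independent_sets_by_size(n, edges)
    alpha = max(k for k, c in enumerate(counts) if c > 0)
    return alpha, counts[alpha]


if __name__ == "__main__":
    for m in range(1, 6):
        n, edges = disjoint_edges(m)
        alpha, maximum_sets = alpha_and_maximum_count(n, edges)
        assert (alpha, maximum_sets) == (m, 2 ** m)
        print(m, alpha, maximum_sets)
```

Here the graph has $2m$ vertices and independence number $m$, so the count $2^m = 2^{\alpha}$ matches the bound asked for in Question 9 with constant $1$ in the exponent.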

Acknowledgements

We would like to thank Nicolas Trotignon for useful discussions in the early stages of this project. We would also like to thank Wojciech Samotij for multiple useful comments and for suggesting one of the ideas behind the proof of Lemma 6. We also thank Zach Hunter for pointing out several typos in the first version of this paper. We are very grateful to the anonymous referee for a very careful reading of the paper and numerous useful comments and suggestions. The first author would like to gratefully acknowledge the support of the Oswald Veblen Fund.

Supported by NSF-EPSRC Grant DMS-2120644 and by AFOSR grant FA9550-22-1-0083

Footnotes

1 See [Reference Chudnovsky, Hajebi, Lokshtanov and Spirkl8] for definitions of a variety of concepts discussed in this paragraph.

References

Alon, N. (1991) Independent sets in regular graphs and sum-free subsets of finite groups. Israel J. Math. 73(2) 247–256. doi:10.1007/BF02772952

Axenovich, M. and Zimmermann, J. (2025) Induced Turán problem in bipartite graphs. Discret. Appl. Math. 360 497–505. doi:10.1016/j.dam.2024.10.007

Balogh, J., Morris, R. and Samotij, W. (2015) Independent sets in hypergraphs. J. Amer. Math. Soc. 28(3) 669–709.

Balogh, J., Morris, R. and Samotij, W. (2018) The method of hypergraph containers. In Proceedings of the International Congress of Mathematicians—Rio de Janeiro 2018, Vol. IV. Invited lectures, World Sci. Publ., Hackensack, NJ, pp. 3059–3092.

Bohman, T. and Keevash, P. (2021) Dynamic concentration of the triangle-free process. Random Struct. Algorithms 58(2) 221–293. doi:10.1002/rsa.20973

Bourneuf, R., Bucić, M., Cook, L. and Davies, J. (2024) On polynomial degree-boundedness. Adv. Comb. 5.

Bucić, M., Fox, J. and Pham, H. T. (2024) Equivalence between Erdős-Hajnal and polynomial Rödl and Nikiforov conjectures. arXiv preprint arXiv:2403.08303.

Chudnovsky, M., Hajebi, S., Lokshtanov, D. and Spirkl, S. (2024) Tree independence number II. Three-path-configurations. arXiv preprint arXiv:2405.00265.

Cooper, J. and Mubayi, D. (2014) Counting independent sets in triangle-free graphs. Proc. Amer. Math. Soc. 142(10) 3325–3334. doi:10.1090/S0002-9939-2014-12068-5

Dallard, C., Fomin, F.V., Golovach, P.A., Korhonen, T. and Milanič, M. (2024) Computing tree decompositions with small independence number. In 51st International Colloquium on Automata, Languages, and Programming (ICALP 2024). Leibniz International Proceedings in Informatics (LIPIcs), Vol. 297, Schloss Dagstuhl – Leibniz-Zentrum für Informatik, pp. 51:1–51:18.

Dallard, C., Milanič, M. and Štorgel, K. (2024) Treewidth versus clique number. II. Tree-independence number. J. Combin. Theory Ser. B 164 404–442. doi:10.1016/j.jctb.2023.10.006

Davies, E., Jenssen, M., Perkins, W. and Roberts, B. (2018) On the average size of independent sets in triangle-free graphs. Proc. Amer. Math. Soc. 146(1) 111–124. doi:10.1090/proc/13728

Erdős, P. (1964) On extremal problems of graphs and generalized graphs. Israel J. Math. 2(3) 183–190. doi:10.1007/BF02759942

Farber, M. (1989) On diameters and radii of bridged graphs. Discrete Math. 73(3) 249–260. doi:10.1016/0012-365X(89)90268-9

Pontiveros, G. F., Griffiths, S. and Morris, R. (2020) The triangle-free process and the Ramsey number $R(3,k)$. Mem. Amer. Math. Soc. 263(1274) v+125.

Fox, J. and Sudakov, B. (2011) Dependent random choice. Random Struct. Algorithms 38(1–2) 68–99. doi:10.1002/rsa.20344

Girão, A. and Hunter, Z. (2025) Induced subdivisions in $K_{s, s}$-free graphs with polynomial average degree. Int. Math. Res. Not. (IMRN) 2025(4) rnaf025. doi:10.1093/imrn/rnaf025

Hunter, Z., Milojević, A., Sudakov, B. and Tomon, I. (2025) Kővári-Sós-Turán theorem for hereditary families. J. Comb. Theory Ser. B 172 168–197. doi:10.1016/j.jctb.2024.12.009

Illingworth, F. (2021) Graphs with no induced $K_{2,t}$. Electron. J. Combin. 28(1) 1.19. doi:10.37236/9223

Kleitman, D. J. and Winston, K. J. (1980) The asymptotic number of lattices. Ann. Discrete Math. 6 243–249. doi:10.1016/S0167-5060(08)70708-8

Kleitman, D. J. and Winston, K. J. (1982) On the number of graphs without $4$-cycles. Discrete Math. 41(2) 167–172. doi:10.1016/0012-365X(82)90204-7

Kövari, T., Sós, V. T. and Turán, P. (1954) On a problem of K. Zarankiewicz. Colloq. Math. 3(1) 50–57. doi:10.4064/cm-3-1-50-57

Lev, V. F., Łuczak, T. and Schoen, T. (2001) Sum-free sets in abelian groups. Israel J. Math. 125(1) 347–367.

Loh, P.-S., Tait, M., Timmons, C. and Zhou, R. M. (2018) Induced Turán numbers. Combin. Probab. Comput. 27(2) 274–288. doi:10.1017/S0963548317000542

Mattheus, S. and Verstraete, J. (2024) The asymptotics of $r(4, t)$. Ann. Math. 199(2) 919–941. doi:10.4007/annals.2024.199.2.8

Samotij, W. (2015) Counting independent sets in graphs. European J. Combin. 48 5–18. doi:10.1016/j.ejc.2015.02.005

Sapozhenko, A. A. (2001) On the number of independent sets in extenders. Diskret. Mat. 13(1) 56–62.

Sapozhenko, A. A. (2002) Asymptotics of the number of sum-free sets in abelian groups of even order. Dokl. Akad. Nauk 383(4) 454–457.

Saxton, D. and Thomason, A. (2015) Hypergraph containers. Invent. Math. 201(3) 925–992. doi:10.1007/s00222-014-0562-8

Scott, A. and Seymour, P. (2020) A survey of $\chi$-boundedness. J. Graph Theory 95(3) 473–504. doi:10.1002/jgt.22601
