Hostname: page-component-77f85d65b8-9nbrm Total loading time: 0 Render date: 2026-04-22T05:49:24.453Z Has data issue: false hasContentIssue false

Noisy group testing via spatial coupling

Published online by Cambridge University Press:  19 November 2024

Amin Coja-Oghlan
Affiliation:
Faculty of Computer Science and Faculty of Mathematics, TU Dortmund, Dortmund, Germany
Max Hahn-Klimroth
Affiliation:
Faculty of Computer Science, TU Dortmund, Dortmund, Germany
Lukas Hintze
Affiliation:
Fakultät für Mathematik, Informatik und Naturwissenschaften, Universität Hamburg, Hamburg, Germany
Dominik Kaaser
Affiliation:
Institute for Data Engineering, TU Hamburg, Hamburg, Germany
Lena Krieg*
Affiliation:
Faculty of Computer Science, TU Dortmund, Dortmund, Germany
Maurice Rolvien
Affiliation:
Faculty of Computer Science, TU Dortmund, Dortmund, Germany
Olga Scheftelowitsch
Affiliation:
Faculty of Computer Science, TU Dortmund, Dortmund, Germany
*
Corresponding author: L. Krieg; Email: lena.krieg@tu-dortmund.de
Rights & Permissions [Opens in a new window]

Abstract

We study the problem of identifying a small number $k\sim n^\theta$, $0\lt \theta \lt 1$, of infected individuals within a large population of size $n$ by testing groups of individuals simultaneously. All tests are conducted concurrently. The goal is to minimise the total number of tests required. In this paper, we make the (realistic) assumption that tests are noisy, that is, that a group that contains an infected individual may return a negative test result or one that does not contain an infected individual may return a positive test result with a certain probability. The noise need not be symmetric. We develop an algorithm called SPARC that correctly identifies the set of infected individuals up to $o(k)$ errors with high probability with the asymptotically minimum number of tests. Additionally, we develop an algorithm called SPEX that exactly identifies the set of infected individuals w.h.p. with a number of tests that match the information-theoretic lower bound for the constant column design, a powerful and well-studied test design.

Information

Type
Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press
Figure 0

Figure 1. Information rates on different channels in nats. The horizontal axis displays the infection density parameter $0\lt \theta \lt 1$. The colour indicates the optimal value of $d$ for a given $\theta$.

Figure 1

Algorithm 1 SPARC, steps 1–2

Figure 2

Algorithm 2 SPARC, steps 3–9.

Figure 3

Figure 2. The threshold function $\mathfrak z(\cdot)$ (red) on the interval $\mathscr{Y}(c_{\mathrm{ex},1}(d,\theta ),d,\theta )$ and the resulting large deviations rate $c_{\mathrm{ex},1}(d,\theta )d(1-\theta )({D_{\mathrm{KL}}({{{\alpha }\|{\exp ({-}d)}}})+\alpha D_{\mathrm{KL}}({{{\mathfrak z(\alpha )}\|{p_{01}}}})})$ (black) with $\theta =1/2$, $p_{00}=0.972$, $p_{11}=0.9$ at the optimal choice of $d$.

Figure 4

Algorithm 3 The SPEX algorithm.

Figure 5

Table 1. Overview of notation