Group Testing with General Correlation Using Hypergraphs
Hesam Nikpey, Saswati Sarkar, Shirin Saeedi Bidokhti
TL;DR
This work develops a unified probabilistic framework for group testing under general correlation among node states: each candidate infection pattern is an edge of a hypergraph, and a probability mass function over the edges specifies how likely each pattern is. It introduces an adaptive greedy algorithm that updates the posterior edge probabilities after each test and achieves an upper bound on the expected number of tests that scales with $H(X)$, the entropy of the edge distribution, plus the mean number of infections $\mu$; the method is shown to be near-optimal in several random-hypergraph regimes. The authors extend the framework to semi-non-adaptive and noisy settings, providing theoretical bounds and arguing that entropy is not always a tight lower bound, while $\mu$ may be a lower bound in certain regimes. They also show how the model subsumes prior independent and correlated models, recover or improve prior results, and discuss the potential for non-adaptive designs and broader correlation structures. The results highlight the fundamental role of correlation structure in reducing testing resources and offer practical guidance for adaptive, semi-non-adaptive, and noisy group testing in networks with complex dependencies.
Abstract
Group testing, a problem with diverse applications across multiple disciplines, traditionally assumes independence across nodes' states. Recent research, however, focuses on real-world scenarios that often involve correlations among nodes, challenging the simplifying assumptions made in existing models. In this work, we consider a comprehensive model for arbitrary statistical correlation among nodes' states. To capture and leverage these correlations effectively, we model the problem by hypergraphs, inspired by [GLS22], augmented by a probability mass function on the hyper-edges. Using this model, we first design a novel greedy adaptive algorithm capable of conducting informative tests and dynamically updating the distribution. Performance analysis provides upper bounds on the number of tests required, which depend solely on the entropy of the underlying probability distribution and the average number of infections. We demonstrate that the algorithm recovers or improves upon all previously known results for group testing settings with correlation. Additionally, we provide families of graphs where the algorithm is order-wise optimal and give examples where the algorithm or its analysis is not tight. We then generalize the proposed framework of group testing with general correlation in two directions, namely noisy group testing and semi-non-adaptive group testing. In both settings, we provide novel theoretical bounds on the number of tests required.
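To make the model concrete, the following is a minimal sketch, not the authors' algorithm, of the objects the abstract refers to: a probability mass function over hyper-edges (each edge is one candidate set of infected nodes), its entropy $H(X)$, the mean number of infections $\mu$, and the posterior update after a noiseless group test. The function names, the toy distribution, and the "closest to one half" selection heuristic in `pick_test` are all assumptions introduced for illustration; the paper's actual test-selection rule and its analysis are not reproduced here.

```python
import math
from typing import Dict, FrozenSet, Iterable, List

Edge = FrozenSet[int]      # one hyper-edge = one candidate set of infected nodes
PMF = Dict[Edge, float]    # probability mass function over hyper-edges


def entropy(pmf: PMF) -> float:
    """Shannon entropy H(X) of the edge distribution, in bits."""
    return -sum(p * math.log2(p) for p in pmf.values() if p > 0)


def mean_infections(pmf: PMF) -> float:
    """Expected number of infected nodes, mu = sum over edges of p(e) * |e|."""
    return sum(p * len(e) for e, p in pmf.items())


def prob_positive(pmf: PMF, test: Iterable[int]) -> float:
    """Probability that a noiseless pooled test on `test` comes back positive."""
    pool = frozenset(test)
    return sum(p for e, p in pmf.items() if e & pool)


def update(pmf: PMF, test: Iterable[int], positive: bool) -> PMF:
    """Posterior over edges after observing a noiseless test outcome."""
    pool = frozenset(test)
    # Keep only the edges consistent with the outcome, then renormalize.
    kept = {e: p for e, p in pmf.items() if bool(e & pool) == positive}
    total = sum(kept.values())
    if total == 0:
        raise ValueError("observed outcome has zero probability under the pmf")
    return {e: p / total for e, p in kept.items()}


def pick_test(pmf: PMF, candidates: List[FrozenSet[int]]) -> FrozenSet[int]:
    """Illustrative heuristic (an assumption): choose the candidate pool whose
    positive probability is closest to 1/2, i.e. the most informative split."""
    return min(candidates, key=lambda t: abs(prob_positive(pmf, t) - 0.5))


# Toy distribution (hypothetical): nobody infected, nodes {0, 1} infected
# together, or node 2 infected alone.
toy: PMF = {
    frozenset(): 0.5,
    frozenset({0, 1}): 0.25,
    frozenset({2}): 0.25,
}
```

On the toy distribution, `entropy(toy)` is 1.5 bits and `mean_infections(toy)` is 0.75. A positive test on node 0 alone leaves only the edge `{0, 1}` in the posterior, so node 1 is identified as infected without being tested, which is the kind of saving that correlation makes possible.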
