1. Introduction
The group testing approach is proposed by Dorfman in [
1]. Its main objective is to identify the infection status of a group of individuals by performing fewer tests than the number of individuals. The main idea behind group testing is to pool the obtained test samples and to test these pooled samples, instead of testing each sample individually. By this way, when a mixed sample results as negative, it implies that every test sample included in that mixed sample is negative. On the other hand, if a mixed sample results as positive, it implies that there is at least one positive test sample among the test samples that are mixed in that mixed sample. The main objective of the group testing problem is to design the testing scheme by specifying the test pools such that the infected set is identified with a minimum number of tests.
Since the group testing scheme that was proposed by Dorfman in [
1], various group testing algorithms have been studied. Mainly, two standard infection models have been considered in the majority of works: a combinatorial model where the number of infections is fixed and the infected set is uniformly randomly selected from all fixed sized subsets of all individuals, and a probabilistic model where each individual is infected with a fixed probability, independently. The group testing algorithms can be divided into several categories: adaptive group testing algorithms where the tests are performed in multiple steps and test results from previous steps can be used while designing the future test pools, and non-adaptive group testing algorithms where all of the tests are designed to be performed in a single step. The group testing algorithms can be categorized depending on their decoding error performance where zero-error group testing algorithms identify the infected set perfectly and small (vanishing) error algorithms identify the infected set with a small error that vanishes with a large population. As a variation of the test result model, noisy test results setting has been proposed and studied. For these standard settings, various algorithms have been proposed, analyzed and the capacities have been characterized for different parameter regimes [
2,
3,
4,
5,
6,
7,
8,
9,
10,
11,
12,
13,
14,
15,
16,
17,
18,
19,
20,
21].
Group testing works that are based on the standard settings have a limitation in terms of the overall performance: the advantage of group testing diminishes as the prevalence rate of the infection increases, due to the nature of the problem [
5,
15]. In more recent works, the standard settings have been challenged: non-identical infection probabilities, correlations between infection status of the individuals and community structure-based available side information scenarios have been proposed and studied [
22,
23,
24,
25,
26,
27,
28,
29,
30,
31,
32,
33]. These practical settings prove that by challenging the standard settings, further improvements can be obtained: the group testing approach can be used efficiently even when the prevalence rate is high among the population if further available side information is used in test designs. However, the overall static nature of the group testing problem is not challenged in these works. For a given group of individuals, the aim is to identify the infected set, as in the standard settings. In a recent line of works, the static nature of the majority of group testing works has been challenged, and dynamic models are proposed [
34,
35,
36,
37,
38]. In these works, SIR [
39] based dynamic infection spread models are studied. Testing and identification are done over time while the infection status of the individuals also evolve. In this paper, we aim to challenge the standard, static definition of the group testing problem and approach it over a time dimension over a similar setting with our prior work [
38]. By this way, we consider a setting where a group testing approach can be applied in systems that evolve over time, such as contagious diseases that spread within a population.
We consider the discrete time, SIR-based dynamic infection spread model that we introduced in [
38]. In our model, at each time instance, the testing capacity is limited and fixed, and full identification of the infections in the system is not possible. Each discrete time instance consists of two phases: infection spread phase and testing phase. Identified infections are isolated and further infection spread by them is prevented. Since the testing capacity is limited at each time instance, not all of the infections are detected and infected individuals that are not detected and isolated keep spreading the infection. The dynamic infection spread model is based on the SIR model, where the population is divided into three disjoint groups: susceptible individuals (S), non-isolated infections (I), and isolated infections (R). At the initial time instance, we assume i.i.d. initial infections in the system and at each time instance that follows, non-isolated infections spread the disease to the susceptible individuals in the system. Consecutively, during the testing phase, limited number of tests are performed and detected infections are isolated. We assume that, for the rest of the testing process, isolated infections remain in the same state. This is a reasonable assumption since they eventually recover, and even when their isolation ends, they can be immune to the infection for a disease-specific time period. We assume that the testing process finalizes prior to the end of that time period. In our model, in contrast to the prior works, the objective is not to identify all of the infections while minimizing the number of tests, but to identify as many infections as possible at each time instance by using limited number of
T tests and to eventually control the spread of the disease.
Novelty
The system model that we consider in this paper is based on a dynamic, time-evolving disease spread model. Utilizing group testing algorithms for dynamic systems is a very recent line of group testing literature that has been studied in [
35,
36,
37,
38]. These works are based on variations of the well-known SIR model, but our prior work [
38] and this paper are distinct from other dynamic group testing works as we approach the overall objective of the system differently: we consider a limited testing capacity at each time instance and instead of minimizing the number of tests required to identify every infection in the system at each time instance, we aim to utilize the given test capacity in terms of novel performance metrics that we present. In this paper, we further expand our prior work [
38] by proposing novel performance metrics: disease control time,
and
-disease control time
. We characterize the performance of dynamic individual testing algorithms in terms of these novel performance metrics. Moreover, we introduce a novel dynamic algorithm: dynamic SAFFRON-based group testing algorithm which is inspired by the static SAFFRON scheme that is studied in [
40] and introduced in [
41]. We present the mean-sense theoretical analysis of both a dynamic individual testing algorithm and dynamic SAFFRON-based group testing algorithm, in (
6) and (
12), respectively. Finally, we implement the discrete time SIR-based dynamic infection spread model and the dynamic algorithms to simulate them and compare the numerical results with the theoretical results that we obtain.
2. System Model
There are n individuals in the system and their infection status, which we denote by for individual i at time t, changes over discrete time instances. At any given time, there are three disjoint groups in the population: susceptible individuals who can become infected (S), non-isolated infections who have not been detected (I) and isolated infections who have been detected by performed tests and isolated indefinitely (R). can take values 0 (susceptible), 1 (non-isolated infection) and 2 (isolated infection), representing the infection state of the individual i at time t. At , we introduce the initial i.i.d. infections to the system: each individual is infected with probability p, independently. We introduce three random processes: denotes the number of susceptible individuals, denotes the number of non-isolated infections and denotes the isolated infections, at time t.
Each discrete time instance
, consists of two consecutive phases: infection spread phase and testing phase. During the infection spread phase, each non-isolated and infected individual can infect each susceptible individual with probability
q, independently. Here, since each susceptible individual can be infected by each non-isolated and infected individual, this can be modeled as the realization of
independent Bernoulli random variables with parameter
q, for each time
. Following the infection spread phase, the testing phase starts in which the testing capacity is limited to a given, fixed number,
T. Thus, the aim of designing the group testing algorithm is to construct
binary test matrices that specify the testing pools. Let
denote the binary test matrix for the tests at time
t, and
denote the
ith row and
jth column of
, where
is 1 if the test sample of
jth individual is mixed in the
ith mixed test sample, and 0 otherwise. Let
denote the test result vector of the tests performed at time
t and
denote the
ith test result at time
t. Then, we have
We assume that results of the tests that are performed at time t are available before the infection spread phase at time , i.e., vector is known while designing the binary test matrix . Moreover, when infected individuals are detected during the testing phase at time t, they are immediately isolated and thus, prevented from spreading the infection for the rest of the testing process, i.e., for all times , if the individual i is found to be infected by the tests performed at time . Note that the only possible infection state changes in our system are either from susceptible individuals to non-isolated infections, which happen via infection spread from non-isolated and infected individuals to susceptible individuals during the infection spread phase at each time, or from non-isolated infections to isolated infections, which happen via infection detection during the testing phase at each time instance. A group testing policy is a scheme that specifies the algorithm to construct binary test matrices for each time instance , until the infection is under control.
Since the source of the infection state evolution in our dynamic model is the non-isolated infections, we define the disease control time to be the first time when there are no non-isolated infections remaining in the population, i.e., . After time , every individual in the population is either susceptible or isolated, i.e., or for and for all . Furthermore, we introduce -disease control time for probabilistic analysis, denoted by where holds. While characterizing and analyzing the performance of a group testing algorithm , there are two performance metrics associated with it: the disease control time (or ) and the number of individuals that have never gotten infected throughout the process, i.e., . Lower is favored to control the disease faster, and higher is favored to control the disease with a lower number of total infections.
3. Proposed Algorithms and Analysis
In this section, we introduce a novel dynamic group testing algorithm: the dynamic SAFFRON-based group testing algorithm. This algorithm is inspired by the static group testing algorithm introduced in [
41], coined as SAFFRON scheme, which is further studied in [
40] for partial detection problem. We analyze dynamic SAFFRON-based group testing algorithm in terms of two performance metrics that we consider: disease control time and number of susceptible individuals when the disease is brought under control. Furthermore, we analyze the dynamic individual testing algorithm that we introduced in [
38], in terms of its
-disease control time performance.
3.1. Related Prior Results
For the sake of completeness, here we present a revised version of the results from [
38], that we use for the analysis in this paper. To keep it compact, we only include the main results that we use in the analysis in this paper.
We consider symmetric and converging dynamic testing algorithms, which must satisfy the symmetry criterion:
for each time instance
and for all
pair. Moreover, they must satisfy the convergence criterion:
Let denote the probability of an individual not being identified during the testing phase at time t. Note that, since we consider symmetric and converging dynamic testing algorithms, this probability is the same for all individuals. We consider dynamic testing algorithms where only depends on the testing capacity T, , and . We assume the infection spread probability q scales as . Since the infection spread is realized for every susceptible and non-isolated infected individual pair and there can be such pairs at each time instance, assuming q to be is a mild assumption.
In the remainder of this subsection, we present our prior result from [
38] that we use in our analysis in this paper, regarding symmetric and converging dynamic testing algorithms.
We start with the following lemma, where we prove that symmetric and converging dynamic testing algorithms guarantee the disease to be controlled eventually.
Lemma 1. When a symmetric and converging dynamic testing algorithm is implemented, and thus, the system approaches the steady state, in the mean sense.
The statements of Lemma 2 and Theorem 1 give improved characterizations of the approximation terms compared to the original statements in [
38]. The approximations are characterized by
terms in this paper, which follow from the linear approximations in Taylor series expansions for exponential terms and hold since
. Moreover, arbitrarily small variance conditions in [
38] are also characterized to scale as
here as well. In the following lemma, we characterize an approximation of the probability of an individual being infected and non-isolated at time
t, i.e., probability of the event of
.
Lemma 2. When a symmetric and converging dynamic testing algorithm is implemented, we havewhere the conditional probability of an individual not being identified in the tests at time t given is denoted by and when and both scale as with respect to n for all . In the following theorem, we present a general result that characterizes the approximation of the expectation of . This result holds for all symmetric and converging dynamic testing algorithms that satisfy the small covariance conditions that we state in Lemma 2.
Theorem 1. When a symmetric and converging dynamic testing algorithm is implemented and vanishing covariance constraints in Lemma 2 are satisfied for all , we have We use Theorem 1 to characterize the expected number of susceptible individuals when the disease is brought under control, which is one of the two performance metrics that we consider. On the other hand, for the characterization of the disease control time , our analysis is based on Lemma 2.
3.2. Dynamic Individual Testing Algorithm
In the dynamic individual testing algorithm, we consider randomized, individual testing of
T individuals at each time instance
, where
T individuals to be tested are uniformly randomly selected from the whole population, independent across both individuals and time. In [
38], we proved that the dynamic individual testing algorithm (weak dynamic individual testing algorithm in that paper) is a symmetric and converging dynamic testing algorithm. Furthermore, we derived
results for the disease control time. Here, we derive
-disease control time performance results of the dynamic individual testing algorithm.
In the following theorem, we present our result for the -disease control time performance metric when the dynamic individual testing algorithm is used in our proposed dynamic system.
Theorem 2. When the dynamic individual testing algorithm is used, we have Proof. We start with the mean of non-isolated infections
where (
7) follows from the definition of
as the total number of non-isolated infected individuals and (
8) follows from Lemma 2. Then, we have
where (
9) follows from the definition of
-disease control time and (
10) follows from the fact that
for all
when the dynamic individual testing algorithm is used. Finally, we obtain
by arranging the terms. □
In Theorem 2, we characterize the -disease control time metric for dynamic individual testing algorithm. Since it is defined as the time instance when there are non-isolated infected individuals in the mean sense, -disease control time metric presents a characterization of how fast a given algorithm can control the disease spread.
3.3. Dynamic SAFFRON Based Group Testing Algorithm
We propose and analyze a novel algorithm, which we coin as dynamic SAFFRON based group testing algorithm. It is inspired by the static group testing algorithm, SAFFRON, introduced in [
41] and studied in [
40] for partial detection.
The static SAFFRON scheme-inspired algorithm presented in [
40] is based on the idea of constructing binary test matrices for the groups of individuals that contain approximately one infected individual. Constructed SAFFRON scheme test matrices can be used to recover exactly one infected individual within the tested group, while also indicating whether there is zero or more than one infected individuals within the tested group if there is not exactly one infected individual.
Here, we propose the novel dynamic SAFFRON-based group testing algorithm, where at each time instance , groups of size are selected uniformly and randomly from the set of non-isolated individuals. For the values that are close to their mean (Our numerical results suggest a concentration around the mean for . An in-depth concentration analysis can be the subject of future works.), each of these groups has approximately one infected and non-isolated individual.
By designing SAFFRON scheme-inspired binary test sub-matrices that are guaranteed to identify one infection from each group that has exactly one infection, we detect and isolate infections over time.
For a selected group of individuals, the binary test sub-matrix is constructed as follows:
Let denote the size of the selected individuals, i.e., .
First, rows of the test sub-matrix are constructed where column vector that corresponds to column i is set to the binary representation of the number .
Then, remaining rows of the test sub-matrix are constructed by concatenating the created sub-matrix in the previous step with the binary matrix where the value of each element is flipped, i.e., XORed with 1.
By using this construction, it is guaranteed that, if there is exactly one infection within the tested individuals, there will be exactly positive tests and the positive test indices will be the binary representation of where the infected individual index is i. If there is no infection, then all tests will be negative. In the case of more than one infection, strictly more than tests will be positive. Thus, this construction guarantees the detection of a single infection within the selected group.
In the following lemma, we characterize the expected number of detected infections at each time instance.
Lemma 3. When dynamic SAFFRON-based group testing algorithm is used, at each time instance , in averageinfections are detected and isolated. Proof. We choose the group size as
since there will be approximately one infected individual within each group when the groups are chosen uniformly randomly from the set of all non-isolated individuals. Since the testing capacity is
T at each time instance, there will be
such groups. Recall that the probability that a non-isolated individual is infected is
due to the fact that dynamic SAFFRON-based group testing algorithm uniformly randomly selects the groups at each time instance and since the infection statistics are symmetric over individuals. Furthermore, the group size is inverse of this term. Therefore, each group contains exactly one infected individual with probability
where lemma statement follows. □
Similar to our analysis for dynamic individual testing algorithm, we can use the results of Lemma 2 and Theorem 1 for dynamic SAFFRON-based group testing algorithm where we have
where
. To justify using Lemma 2 and Theorem 1 results, we need to prove that the dynamic SAFFRON-based group testing algorithm satisfies the symmetry and convergence criteria in (
2) and (
3).
Since the selection of tested groups is independent across individuals for each time instance, the dynamic SAFFRON-based group testing algorithm is symmetric. To guarantee convergence, we consider using the dynamic individual testing algorithm whenever , i.e., the regime where the expected number of non-isolated infections is small with respect to the total number of non-isolated individuals. In that regime, SAFFRON-based construction cannot be used efficiently to detect infections. In the regime where the expected number of non-isolated infections is high, the SAFFRON-based scheme can consistently detect infections as characterized in Lemma 3.
4. Numerical Results
In our simulations, we implement the dynamic discrete SIR-based infection spread system. We initialize our system with random independent initial infections with probability p and in following time instances, we simulate the random disease spread from non-isolated infections to susceptible individuals, independently with probability q. For both dynamic algorithms, we run our system for 500 time instances and iterate realizing this dynamic system for 1000 times. We save the sample path trajectories of the random processes , and , take their averages and plot.
To control disease spread, in our first simulation, we implement the dynamic individual testing algorithm, where randomly selected
T individuals are individually tested at each time instance. Here, by using our theoretical result in Theorem 2, we have
when
. This is consistent with the numerical results plotted in
Figure 1. In our second simulation, we implement a hybrid version of the dynamic SAFFRON-based group testing algorithm. At each time instance, we randomly select groups of people where each group has one infection in average, where we use the expected
that we calculate by using Lemma 3, to obtain the group size. Then, we construct SAFFRON scheme-based test sub-matrices for each of these groups and perform tests, with maximum
T tests in total. In practice, SAFFRON scheme-based construction can have a non-utilized testing capacity at each discrete time instance due to the fact that the testing capacity
T may not be divisible by the calculated group size. We further utilize these available testing capacities and perform random individual testing when non-utilized tests remain. Moreover, later in the testing process, since the expected
becomes smaller, the calculated group size grows larger and the required number of tests for SAFFRON-based construction exceeds the testing capacity
T. For these later times in the testing process, we switch to the dynamic individual testing algorithm to detect and isolate the remaining infections. We also plot the theoretical calculation of expected
in
Figure 2, where we use Lemma 3 for calculation, which align with the numerical results that we obtain, as we plot in
Figure 2. We observe that the steady state mean number of susceptible individuals is slightly higher in the dynamic individual testing algorithm while the convergence time is slightly lower in the dynamic SAFFRON-based group testing algorithm. This observation for this set of parameters suggests that both algorithms can be used to optimize different performance metrics, depending on the system requirements and parameters, as well as hybrid usage of the dynamic algorithms is also possible as in our hybrid dynamic SAFFRON-based group testing algorithm implementation.