Brought to you by:
Paper The following article is Open access

A minimal model for household effects in epidemics

, , , , , and

Published 15 October 2020 © 2020 The Author(s). Published by IOP Publishing Ltd
, , Special Issue on the Biological Physics of the Coronavirus Disease 2019 Pandemic Citation Greg Huber et al 2020 Phys. Biol. 17 065010 DOI 10.1088/1478-3975/abb209

1478-3975/17/6/065010

Abstract

Shelter-in-place and other confinement strategies implemented in the current COVID-19 pandemic have created stratified patterns of contacts between people: close contacts within households and more distant contacts between the households. The epidemic transmission dynamics is significantly modified as a consequence. We introduce a minimal model that incorporates these household effects in the framework of mean-field theory and numerical simulations. We show that the reproduction number R0 depends on the household size in a surprising way: linearly for relatively small households, and as a square root of size for larger households. We discuss the implications of the findings for the lockdown, test, tracing, and isolation policies.

Export citation and abstract BibTeX RIS

Content from this work may be used under the terms of the Creative Commons Attribution 4.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.

1. Introduction

The rapid spread of the SARS-CoV-2 virus and the COVID-19 pandemic is a threat to global public health with very few precedents. The policy response thus far has often been informed by epidemiological modeling, such as [1]. This paper offers some suggestions for improving such models by accounting for intra-household effects. The role of inside-household transmission for COVID epidemiology has been recognized early [24]. In this work we investigate the combination of relatively fast transmission inside the households and relatively slower transmission between them.

An important feature in epidemiological models is the way the individuals come in contact with each other within the model framework. The simplest assumption is the panmictic one: everybody comes in contact with everybody else with equal probability, and thus has a chance to infect everybody else [57]. On the other end of the spectrum are detailed individual-based models where we know exactly who contacts whom at the individual level [810].

The widespread COVID-19 pandemic and resultant partial quarantine measures make the panmictic assumption unrealistic: if confinement works, one hopes to lower the number of contacts outside the immediate surroundings of every individual. On the other hand, detailed models are quite effective at explaining what has happened in the past, but predictions about the future and the results of lockdown policies in the context of these models require detailed assumptions on the mode of future contacts between individuals. These details are knowable in hindsight, but are rather difficult to predict.

These considerations suggest a third approach: idealized models that are more realistic than the classical panmictic ones, but less detailed than those based on individual data. In this class of models, we describe the way the population interacts through a limited number of well-defined parameters. These parameters are influenced by policy decisions, so models of this kind can be used to predict the result of specific policies on the spread of contagion. An example of such models is Stroud's model of influenza spreading in a mixed population [11], where the inhomogeneity was represented by a semi-empirical power law, partially justified by simulations, for the probability of a new infection.

In this paper, we describe the effect of household size on sheltering in place, with application to the current COVID-19 epidemic. We assume that, due to shelter-in-place orders, the contacts between people in different households are uniformly reduced. However, no shelter-in-place policies restrict the interactions between the members of the same household. In fact, these interactions may become much closer and more frequent than before the epidemic. The epidemiological data on COVID-19 in close quarters such as cruise ships [12] suggest very high contagion rates in such situations. Therefore, it makes sense to discuss the spread of infection in a model where people are separated into close-knit groups ('households') with a high rate of contagion within groups relative to the rate of contagion between groups.

The idea of modeling epidemics in a stratified population is, of course, not new. Approaches similar to ours were employed in [13] to simulate the 1918 flu pandemic and in [14, 15] to study the effect of isolation and vaccination. In [16, 17] the dependence of the spread of infection was obtained in a household model, but with different assumptions from ours, which lead to different predictions.

Usually these approaches rely on many parameters to describe the dynamics of transmission. Here, we instead employ a minimal model that simply introduces within-household transmission in a standard SIR (susceptible–infected–recovered) formalism. This approach allows us to isolate the effect of household sizes from other factors that would affect more detailed descriptions. It is also analytically tractable and presents a phase space that depends on only a few parameters, so it can be thoroughly explored numerically. We are thus able to obtain a simple mathematical law that describes the growth of the epidemic's rate of spread as a function of the household size. While more finely grained models could offer a more detailed description, we expect this law to be universal as long as our main premises—high transmission rate for close contacts and high rate of asymptomatic transmission—are satisfied.

Figure 1.

Figure 1. Household model. Black symbols correspond to infected individuals.

Standard image High-resolution image

Below, in section 2, we describe the model in detail and calculate the fast-growing modes. This analysis provides insight into the initial dynamics of epidemics and the influence of various parameters on the rate of infection. In section 3, we report on the results of simulations and numerical solution of the mean-field equations. In section 4, we discuss the results and their meaning for lockdown strategies. The conclusions are stated in section 5.

2. Mean-field models

Let us discuss a population separated into households with high contact rates within a household and low contact rates between different households (figure 1).

We will use the SIR model with three states for each individual: susceptible, infected, and recovered, as shown on figure 2. The state of a household is described by the number of individuals in each state (s, i, r). If H is the household size, then s + i + r = H. In this section, we use a simple model where the first infection quickly leads to the whole household being infected, so the simplified state diagram becomes the one shown in figure 3. Let S be the number of households with no one infected, F be the number of households with one person infected, and G be the number of households with everyone infected. The total number of infected persons is F + HG. Let us neglect secondary outside transmission, i.e., the situation when a member of an already infected household gets infected through an outside contact. Then the probability that an arbitrary contact of an infected person is susceptible is SH/N, where N is the total number of persons. If β is the transmission rate, then the number of households is governed by the equations

Equation (1)

Equation (2)

where α is the rate of having the whole household infected, and γ is the recovery rate. We assume the recovery rate for all household members is the same and neglect the difference in the time of infections of different household members.

Figure 2.

Figure 2. State diagram for an individual in the simplest model.

Standard image High-resolution image
Figure 3.

Figure 3. Simplified state diagram for a household. The household state is (s, i, r): susceptible, infected and recovered.

Standard image High-resolution image

If all members of the household are quickly infected by the index patient, the rate constant α does not depend on the household size H. If contagion spreads through aerosols or shared surfaces [18], the rate of spread does not depend on H. This suggests that the dependence α(H) is weak or non-existent. Therefore, in the subsequent models we neglect this dependence, and discuss its possible effects in the appendix A.

In the early stages of the epidemic, far from saturation, SHN, and equations (1) and (2) are linear. The positive eigenvalue of the system corresponds to the growing mode and is equal to

Equation (3)

A fast in-household transmission corresponds to αβ. Equation (3) shows that there are two regimes for the transmission. For small household sizes, when 2(2H − 1)βα, the exponent for the growth of infections depends on H linearly, as λγ. However, for larger households, when 2(2H − 1)βα, the effective growth rate is proportional to the square root of H, $\lambda \approx \sqrt{\alpha \beta H}-\gamma $. This finding is in contrast with the one in [16], where a linear dependence on H was found under different assumptions.

It is customary to quantify epidemic-growth potential using the basic reproduction number R0, which is equal to the increase in the number of infections during the course of disease for an individual patient. For this model

Equation (4)

We will assume γ ≈ 0.125 days−1 and the rate of spread in the absence of lockdown β ≈ 0.3 [19]. Setting different values for β under lockdown, we plot the values of R0 on figure 4.

Figure 4.

Figure 4.  R0 for the model with γ = 0.125 days−1. The horizontal line corresponds to R0 = 1. Each subfigure corresponds to a fixed β with a curve for each value of α.

Standard image High-resolution image

3. Numerical modeling

Eigenvalue analysis of equations (1) and (2) is applicable in the linear regime only. For the situation where the depletion of the susceptible population is non-negligible, we need either to solve these equations numerically or perform simulations. Below we report the results of both approaches.

We performed numerical simulations to validate the analytical models of household infections described in the above section. In each scenario that we studied, we took a population of N = 200 000 individuals, placed each individual into a house for a range of different household sizes, and subsequently initialized a small random subset of 20 individuals in the population with an infection. For each step, we divided the infection into two phases:

  • (a)  
    Panmictic phase: each infected individual can infect any uninfected individual in the simulation, with a probability ${p}_{0}={\Delta}t\frac{\beta }{N}$, where β is the daily rate of infection in the panmictic phase, N is the population size and Δt is the time step of the simulation.
  • (b)  
    Household phase: each infected individual can infect any uninfected individual in their house, with a probability Δ, where α is the daily rate of infection inside the household.

We studied these regimes using Δt = 0.1 and γ = 0.125. We ran a different simulation for various combinations of α, β and H. For each combination of values we ran the simulations n = 10 times. For the same parameters we numerically solved equations (1) and (2). The results are shown in figure 5.

Figure 5.

Figure 5. Simulations and mean-field predictions. The lines correspond to numerical solution of equations (1) and (2). The error bars correspond to the confidence intervals for repeated simulations computed as $1.58\text{IQR}/\sqrt{n}$ [20]. Each subfigure corresponds to a combination of α and β as shown on the panels with a curve for each value of H.

Standard image High-resolution image

The model shows a very good agreement between the predictions of the mean-field theory and numerical experiments, especially for large in-household transmission α. It should be noted that the meaning of α is subtly different for simulations and mean-field theory. For simulations, α is the rate for an individual to become infected inside a household; while for mean-field theory, α is the rate for a household as a whole to become fully infected. These two definitions are expected to be convergent in the limit of fast inside infections, as seems to be the case from figure 5.

It is interesting to look at the total number of cases during the epidemics. The results of our simulations for N = 200 000 are shown on figure 6.

Figure 6.

Figure 6. The total number of cases for simulations with N = 200 000. Each subfigure corresponds to a value of H.

Standard image High-resolution image

4. Discussion

Our approach is based on the existence of two time scales: fast transmission inside households and slow transmission between the households. Following the scaling approach (see, for example, [21]), we can imagine the households being 'super-individuals' with renormalized interaction parameters. Since, in this model, H people have H times as many contacts as one person, one can naïvely expect that the spread of contagion scales as the household size H. However, as confirmed by mean-field calculations and numerical simulations, the scaling law is less steep: the spread grows only as $\sqrt{H}$. This fact may have profound policy implications, suggesting the possible establishment of larger 'households' such as social circles) rather than a uniform reintegration of all contacts during reopening. However, further modeling and comparison with empirical data would be necessary before implementing the results of this paper as policy. A rather strong dependence of the total number of cases over the time span of epidemic on the household size (figure 6) stresses the necessity of caution at reopening.

One of the interesting questions for epidemics control is the relative importance of measures to suppress the transmission between households and the measures to suppress the transmission inside households (for example, by wearing masks at home). Figure 5 suggests that the former measures are more efficient than the latter, at least within the assumptions of the model: the variation in the inter-household rate β has more influence on the outcome than the variation in the intra-household rate α.

The assumptions underlying the model reveal its natural limitations. For mean-field calculations (but not numerical simulations), we assumed that inside-household dynamics are much faster than the transmission between households. This allowed us to neglect the details of transmission inside households and use an idealized model depending just on the mean time between the first infection and the completely infected household. This also allowed us to neglect secondary between-household infections (i.e., household members of an infected individual getting infected by their outside contacts rather than by the inside ones). Our numerical simulations did not employ this assumption explicitly. However, by selecting a very simple inside-household infection model (pair interactions between household members) we made the assumption implicitly: the details of the inside transmission should not matter if the transmission is fast. Another assumption for the mean-field model is the neglect of random fluctuations: we used mean values to construct the equations. We also neglected the dependence of the time for a household to become infected on the household size. This dependence should be significant for very large 'households' (see appendix A).

Another assumption in our calculations is the homogeneity inside households: all members have the same contact rate. In reality some members might have more contacts with the outside world than others. It is easy to see, however, that one can account for this inhomogeneity by rescaling the rate β of outside contacts.

While we use the word 'households' throughout this paper, our model may describe situations beyond the simple picture of families waiting out the epidemics together. A dormitory, cruise ship, or even apartment building with a common ventilation system may represent examples of a large 'household' (see [22] on the role of common ventilation systems). While in the calculations here we assume 'households' of equal size, the model could be generalized for a mixture of different groups with a high level of transmission inside groups and a low level of transmission between the groups. In particular, it can be shown (see appendix B) that the main result of our model is valid in the inhomogeneous case. If we increase all house sizes by the same factor, the infection rate increases by the square root of the factor.

Another possible application of our model is related to the observation that one of the ways to slowly reopen may involve the relaxing of social distancing within certain groups while maintaining it between the groups. In this case, these 'relaxed' groups can be treated as 'super-households'. An important application of the super-household concept could be a rudimentary testing-and-tracing policy, in which we test all members of the super-household of any asymptomatic infected individual, instead of a more expensive attempt to trace and test all their contacts. This application is important because asymptomatic transmission seems to be one of the salient and dangerous features of the current COVID-19 epidemic [23].

In subsequent papers, we will analyze testing-and-tracing strategies for the household-based shelter-in-place policy.

5. Conclusions

We studied epidemic spread in a stratified population, such as under shelter-in-place orders, using mean-field theories and numerical simulations. We found that:

  • (a)  
    Under shelter-in-place conditions the spread of the epidemics depends on the size of the 'household' (i.e., the group of people sheltered together).
  • (b)  
    The dependence of the rate of spread on the household size is linear for relatively small households and a square-root law for larger households.

We used idealized models to study the general features of the epidemics in the shelter-in-place conditions. More realistic models may add to these findings, but we expect the main results above to hold.

Acknowledgments

GH, LML, and DY are supported by the Chan Zuckerberg Biohub. The authors are grateful to Daniel S Fisher, who suggested an elegant formalism for the analytic model, and to Rob Phillips for his inspiration and for including us in his stimulating COVID discussion group.

Appendix A.: Dependence of α on H for large households

In models with small households, α has very little dependence on H since most household members will be infected in short order by the household's first infected individual. In this case, we ignore secondary infections. However, in large households, it may be required to take secondary infections within a household into account.

Here, we look at the dependence of α on H in a regime where secondary interactions are relevant. In particular, this model uses the same method of in-house infection as the numerical simulations.

Assume that the infection rate within a household is proportional to the number of 'connections' between infecteds and susceptibles. In a household of size H > 1 with n infecteds, we may draw H(H − 1)/2 connections between household members; n(Hn) of those connections will be between infecteds and susceptibles. That is to say, the infection rate is αn = α0 n(Hn) for some constant α0 which is independent of H and n.

The total time to infect the entire household starting with one infected is therefore

Equation (A.1)

We now reduce this expression for T.

Equation (A.2)

Equation (A.3)

Equation (A.4)

Equation (A.5)

where γ ≈ 0.577 is the Euler–Mascheroni constant and Bm is the mth Bernoulli number. Although it is not obvious that the terms after γ become vanishingly small for large H (since the Bernoulli numbers grow quickly), it is well known that these terms are only a small correction in this expression for the harmonic series. Therefore, defining α = 1/T, we may write the approximate expression

Equation (A.6)

for H > 1.

Appendix B.: Different household sizes

In the model that we study in the main body of this paper, we restrict all households to have the same size H. In this appendix, we relax this assumption and construct a model which supports a distribution of household sizes. Let FH (t) be the number of households of size H at a time t that have exactly one person infected. We also let GH (t) be the analogous quantity for fully infected households. For simplicity, we will always assume that H ⩾ 2. For these quantities, we assume the dynamics

Equation (B.1)

where γ and β have exactly the same interpretation as in the main text, αH is the rate of having the whole household of size H infected, and SH is the number of susceptible households of size H. For early times, $\sum _{J=2}^{\infty }{S}_{H}H=N$. For ease of notation, we define ${\beta }_{H}=\beta \frac{{S}_{H}H}{N}$.

We now set out to solve for the early time dynamics of this model by computing its largest eigenvalue. As a simplifying assumption, we assume that the distribution of SH has a small enough width that we may assume that αH = α is roughly constant. As discussed in appendix A, in general αH should depend on H for large H.

Using the definitions

Equation (B.2)

we write down equations of motion

Equation (B.3)

The matrix governing the dynamics of this new system is

Equation (B.4)

where we have done the following redefinition: let σ(k) be the kth smallest value of H for which βH ≠ 0. We assume that there are a finite number M of such values. Then we write α'k = ασ(k) β'k = βσ(k), Hk = σ(k) − 1.

From this, we may write down a characteristic equation as

Equation (B.5)

where

Equation (B.6)

We now use our assumption that the distribution of SH is not very wide. That is, αH α. In that case, we obtain the characteristic equation

Equation (B.7)

Therefore, the largest eigenvalue of the original system (after subtracting off the γ) is

Equation (B.8)

which reduces to our original expression. If we again invoke the assumption that the distribution of SH is not very wide and we take small enough α, then we recover the square root law.

Please wait… references are loading.
10.1088/1478-3975/abb209