Introduction

The outbreak of COVID-19 has caused major public health concerns around the world. Since the first cases in late 2019 in China, COVID-19 has spread exponentially around the world, indicating an endemic person-to-person transmission.

In an attempt to flatten the transmission of the virus, governments in many countries implemented a range of non-pharmaceutical interventions (NPIs), including closures of schools, workplaces, public transport, cancellations of public events, restrictions of internal movement, tracing infected persons contacts, enhanced testing, and more. Timelines of these policies vary across countries, but by mid-April 2020, due to the virus crisis around 70% of countries have enacted one or more of these measures.Footnote 1 The effectiveness of each NPI may vary with a range of other actions taken by the government and communities at the time of the crisis. To assess these differences, we examine how the effectiveness of NPIs varies with country characteristics.

Using a dynamic version of the Susceptible-Infected-Recovered (SIR) model, we examine how the impact of NPIs on transmission rate of COVID-19 varies with a range of country characteristics along economic, public health and environment dimensions. These characteristics are associated with differences in behavioral response and differences in resources available to governments that might be required to enforce these policies. Our findings suggests that these factors play important roles in slowing down the spread of the virus during NPIs; the results also suggest that the economic and social systems as well as incentives and attitudes may lead to different outcomes of NPIs.

At the environmental dimension, we account for population density, pollution, and surface area factors. Higher population density may increase the frequency of human interactions and affect the compliance with the social distancing rules; therefore, the spread of the virus might be higher in such communities and lockdown policies less effective. It has also been previously established that an epidemic of a respiratory disease, such as influenza, initially affects the more densely populated urban areas (see for example, Zachreson et al. (2018), using Australian data). Higher concentrations of air pollutants are associated with higher likelihood of chronic respiratory conditions and therefore present an additional risk factor. For example, Conticini et al. (2020), using Italian data, conclude that pollution is an additional factor of increased lethality from COVID-19; Wu and McGoogan (2020b) reach similar conclusions using US data. Additionally, Setti et al. (2020) show that the airborne transmission of COVID-19 might be more prevalent in areas with higher air pollution.

At the socio-economic dimension, we account for GDP per capita and employment rate. Higher employment rates may increase the risk of exposure to viruses and therefore increase the spread. For example, Markowitz et al. (2019), using the US data, show that increases in employment are associated with increased incidence of influenza; Adda (2016) uses French data and shows that viruses spread faster during economic booms. Access to leave and sick benefits may increase the incentives to work for those who display symptoms of sickness and increase the spread of the virus, see for example, Barmby and Larguem (2009) (using UK data) and Pichler and Ziebarth (2017) (using US and German data). Better economic conditions at the onset of the pandemic may also provide better access to funds for governments, providing resources for better benefits and enforcement practices.

At the health system dimension, we account for health expenditure as % GDP, proportion of physicians in population, and proportion of elderly population. Research shows that elderly face higher COVID-19 associated risks.Footnote 2 A response to the higher risk might be in form of higher prevalence in voluntary social distancing. On the other hand, for the general population, taking into account the proportion of elderly, access to a more advanced health system might be correlated with a lower perceived risk of becoming severely ill; therefore, lower incentives to exercise voluntary social distancing before NPIs are enacted.

The dataset is constructed using four data sources. The Oxford COVID-19 Government Response Tracker provides measures of government responses to COVID-19 spread. For more details about the dataset, see Hale et al. (2020). We obtain data on the number of recorded COVID-19 cases from the Center for Systems Science and Engineering at Johns Hopkins University. Google Mobility Reports provide daily country level information on human mobility across a range of location categories. Country-level characteristics are from the World Bank data.

We show that NPIs which impose strict social distancing rules, such as school and workplace closures, are more effective in countries with lower population density, lower surface area, lower air pollution, higher GDP per capita, lower employment rate, higher health expenditure, and lower proportion of elderly population. We attribute the differences in effectiveness of NPIs to incentives driven behaviors and public resource constraints. Compliance with closures and lockdowns may demand more resources in places with higher population density, larger geographical area, and higher employment rate, impairing the effectiveness of these policies. Communities with access to a better health care, measured by health expenditure as % of GDP, may have less incentives to voluntary reduce social interactions; therefore, social distancing measures in such communities could be more effective. A cross-country analysis of Google mobility data reinforces these results and interpretations. Individuals in countries with higher proportion of physicians per 1000 persons and higher health expenditure as % of GDP are less likely to increase their time spent at home prior to schools and workplace closures.Footnote 3 On the other hand, after the implementation of workplaces and/or school closures, the proportional increase in time spent at home is higher in countries with better access to health system and lower in countries with higher proportion of elderly.

Extensive testing is more effective in countries with higher GDP per capita, larger surface area, higher air pollution, lower health expenditure, and higher proportion of elderly in population. Variation in testing intensity and its potential correlation with the GDP per capita could explain some of these results. Recent studies point out the existence of complementarities between social distancing behaviors and testing: enhanced testing policy can lead to more social activity and less social distancing, (see for example, Acemoglu et al. 2020). Such complementarities appear to be more likely in countries with better health systems, proxied by health expenditure as a proportion of GDP, and lower proportions of elderly population.

There are now a number of studies evaluating the effectiveness of NPIs. Askitas et al. (2020) show that some NPIs are more effective than others; Chen and Qiu (2020) detail these differences in terms of costs and benefits. Jinjarak et al. (2020) document that more stringent policies are associated with lower mortality growth rates and report cross-country differences. Kucharski et al. (2020) estimate that in China, the basic reproduction rate declines from 2.35 one week before travel restrictions to 1.05 one week after travel restrictions. Friedson et al. (2020), using daily COVID-19 data in California, find that the lockdown reduced the number of cases by 125.5 to 219.7 (per 100,000 population) and led to 1,661 fewer deaths during the first month following its implementation. Fang et al. (2020) study the impact of lockdown in Wuhan, China enacted on January 23, 2020; they show that the COVID-19 cases would be 64.8% higher in the 347 Chinese cities outside Hubei province, and 52.64% higher in the 16 non-Wuhan cities inside Hubei, in the counterfactual world in which the city of Wuhan were not locked down. Cho (2020) shows that lockdown policies are effective in lowering excess mortality. Ullah and Ajala (2020) find that lockdown significantly affects the number of confirmed cases after 7 days of its implementation; Hartl et al. (2020) show a trend break in the transmission rate on the 9th day in Germany.

A number of studies examine the socio-economic inequalities associated with the COVID crisis along several dimensions. Ahmed et al. (2020) show that addressing inequality could be important in mitigating the spread of the virus. Borjas (2020) shows that more disadvantaged populations are less likely to be tested but more likely to be infected conditional on testing. Brzezinski et al. (2020) show that the more disadvantaged communities tend to have lower uptake of voluntary physical distancing in response to the outbreak of the crisis. Earlier epidemiology literature shows that there is considerable variation in individual infectiousness, model predictions that control for individual variation differ sharply from average-based approaches, with disease extinction more likely and outbreaks more rare but more explosive; suggesting that targeted control policies are more effective than general policies (see for example, Lloyd-Smith et al. 2005). Our results complement this literature showing that there is a substantial heterogeneity in the effectiveness of government response policies. This heterogeneity can be attributed to endogenous social distancing behavior; and differences in resources devoted to the policy.

Data

The dataset combines daily government response data, daily reported COVID-19 cases data, and country-level aggregate characteristics.

Cross-country government response policies data are from the Oxford COVID-19 Government Response Tracker (OxCGRT).Footnote 4Hale et al. (2020) provide an extensive description of the data and its collection. There are 201 countries and territories, 178 of which have reported at least one case of COVID-19 by April 15 2020, 139 have implemented at least one NPI. Appendix Table 6 summarizes the policies and average dates.

The OxCGRT also reports the COVID-19 Government Response Stringency Index, calculated using selected NPIs excluding contact tracing and extended testing). The value of the index on any given day is the average of the indicators for each policy. Each NPI receives a score between 0 and 3 (or 0 and 2), where zero implies no policy or missing information, 1 implies targeted policy and 2 or 3 implies more general policy. The index is rescaled to create a score between 0 and 100.

The number of COVID-19 cases for 22 January - 11 May 2020 is provided by Center for Systems Science and Engineering (CSSE) at Johns Hopkins University.Footnote 5 Since the accounting of positive cases may vary substantially across countries due to different testing and reporting practices, most estimations control for country fixed effects.

A significant amount of effort has been put into creating the OxCGRT and CSSE datasets. However, there are differences in how COVID-19 cases are tested and reported across countries. For a number of countries response policies such as schools and workplace closures are reported as initiated on the same day as the first reported case. We assume that these reportings are due to measurement errors and include only those countries which had at least 20 reported cases on the day of the school closures policy implementation. This restriction leaves us with 55 countries.

The third data source are daily Google Mobility Reports released by Google in May 2020.Footnote 6 These aggregated data are from mobile device location history reported for six location categories: retail and recreation, grocery and pharmacy, parks, public transit, workplaces and residential areas. The reports show how visits to these locations and length of stay in a given location change over time. A baseline day represents a normal value for that day of the week. The baseline day is the median value for the 5 week period between Jan 3 to Feb 6, 2020.

The fourth data source is the World Bank Databank, which provides a range of aggregate measures on health, geography and the state of economy.Footnote 7 We use the most recent available measures. The measures include health expenditure %GDP (2017), number of physicians per 1000 population (2008-2017), proportion of population above 65 years old (2017), GDP per capita (2017, in constant $US), employment rate (2017), surface area in km2 (2013-2017), population density (2017), and air pollution (PM2.5) (2017). Appendix Table 7 provides summary statistics for the selected 55 countries. All estimations use normalized standardized measures of country specific characteristics.

Estimation

To assess the effectiveness of NPIs we use the Susceptible-Infected-Recovered (SIR) model with time-varying parameters. We utilize the dynamic version of the SIR model, as in Chen and Qiu (2020), who augment this model to test for the effectiveness of various NPIs across nine countries. We apply their methodology in our analysis.

The time-varying SIR model is described by a system of differential equations,

$$ \begin{array}{@{}rcl@{}} \frac{\partial S_{j}}{\partial t} &=& -\frac{\beta_{j}(t)S_{j}(t)}{N_{j}}I_{j}(t) \\ \frac{\partial I_{j}}{\partial t} &= &\frac{\beta_{j}(t)S_{j}(t)}{N_{j}}I(t)-\gamma_{j}(t)I_{j}(t)\\ \frac{\partial R_{j}}{\partial t} &= &\gamma_{j}(t)I_{j}(t) \end{array} $$
(1)

The model assumes that the population, Nj, in country j is divided into three groups: susceptible, Sj(t); infected, Ij(t), and recovered, Rj(t). Population size is assumed to be constant since we focus on a relatively short time period. The country-specific time varying parameter βj(t) represents the transmission rate of the virus in country j, the value β reflects the rate at which contacts occur in the population and the probability of transmission occurring when a susceptible and infected meet. Each of the Ij(t) infected individuals can transmit the disease with probability \(\beta _{j}(t) \frac {S_{j}(t)}{N_{j}}\). Infected individuals recover at rate γj(t). Without behavioral changes or interventions βj(t) does not vary over time; however, βj(t) may change with NPIs or due to changing behavior.

In the classical SIR model, assuming that for a large N early in the outbreak, S(t)/N ≈ 1, the reproduction rate, R0j = βj(t)/γj(t), which determines the number of additional infections by an infected person before he/she recovers. If R0j > 1, the disease will spread exponentially and will infect a large fraction of the total population. In the dynamic SIR model, R0j can change with βj(t). Assuming constant γ, we estimate the change in βj(t) as a function of country-specific characteristics to examine the effectiveness of NPIs.

The change in the number of infected in country j is given as:

$$ I_{j}(t+1)-I_{j}(t) = \frac{\beta_{j}(t) S_{j}(t)}{N_{j}}I(t) - \gamma_{j}(t)I_{j}(t) $$
(2)

Solving for βj(t), leads to the following equation,

$$ \beta_{j}(t) = \frac{I_{j}(t+1)-I_{j}(t)}{I_{j}(t)}\frac{N_{j}}{S_{j}(t)}+\gamma_{j}(t) \frac{N_{j}}{S_{j}(t)} $$
(3)

We assume that the recovery rate, γj(t), is constant across time and countries, we set γ = 1/18 (following findings in Atkeson (2020) and consistent with the fraction of infected that recovered or died according to the WHO as compiled in JHU CCSE).

Following Chen and Qiu (2020), the impacts of k NPIs in country j are measured as follows,

$$ \beta_{j} (t)=\exp \left( \alpha_{j} + \sum\limits_{k=1}^{K} \delta_{jk} NPI_{jtk}\right). $$
(4)

where αj is a country fixed effect and

$$ NPI_{j,t,k}= \begin{cases} 1 & \text{if}\ t<t^{*}_{k}\\ \exp(-\frac{(t-t^{*}_{k})}{\tau} )& \text{if}| t\geq t^{*}_{k} \end{cases} $$
(5)

where \(t^{*}_{k}\) represents the time when the NPIk was enacted. The parameter τ controls for the time-lag effect of interventions. We assume τ = 8, which reflects findings on COVID-19 incubation period.Footnote 8

The differences in the impacts of NPIs across countries, summarized by the δjk, can reflect a variety of factors. First, there can be differences in compliance rates due to country-specific characteristics or due to differences in enforcement. For example, higher population density may interfere with the effectiveness of NPIs that impose social distancing and making them less effective. Second, some populations may practice voluntary social distancing more extensively than others prior to the interventions (for example, due to a higher proportion of high risk population). This mechanism implies that in such communities the effect of NPIs that impose social distancing on transmission rate should be less pronounced.

Allowing the impacts of NPIs to vary with country-specific characteristics, the empirical specification of equation (4) takes the following form,

$$ \log\beta_{jt}=\alpha_{0j} + \sum\limits_{k=1}^{K}\alpha_{1k}NPI_{jtk} + \sum\limits_{k=1}^{K}\alpha_{2k}\mathbb{Z}_{j} NPI_{jtk} + \epsilon_{jt}, $$
(6)

where vector \(\mathbb {Z}_{j}\) includes a range of country specific characteristics, including aggregate health variables: health spending as a % of GDP, number of physicians per 1000 people, and the proportion of population above 65; economic indicators: log GDP per capita and ratio of employed in population; environmental indicators: log of surface area, population density (people per km2), and air pollution measured (concentration of PM2.5). Measurement error in the data is denoted by 𝜖jt.

To estimate (6), we focus on a selected set of NPIs. Including all NPIs presents an identification challenge since there are high correlations between implementations and timings of different policies. Appendix Table 6 shows that policies follow very tightly; for example, the policies of workplace closures, “stay-at-home” orders, restrictions of public gatherings and public transport closures are all implemented within five days, on average. Therefore, we focus on major policies that enforce social distancing, schools and workplaces closures while controlling for the “stay-at-home” policy, and extensive testing policy. The period for the estimation is between 7 days before the first “major” NPI until country j begins to relax its policies, where “major” NPIs include schools or/end workplace closures.

Table 1 reports fixed effects estimation results of Eq. 6. Extensive testing (ET) is more effective in countries with higher GDP per capita, larger surface area, higher air pollution, lower health expenditure (as % of GDP), and higher proportion of older population (65 plus). On the other hand, the schools and workplace closures policies are more effective in countries with higher GDP per capita, lower population density, smaller surface area, lower pollution, lower employment rate, higher health expenditure and lower proportion of older population.

Table 1 Selected NPIs and COVID-19 transmission rate, with country characteristics interactions, FE, selected countries, N = 3815

Appendix Table 8 reports estimation results of Eq. 6 using observations for all available countries and shows similar results. In Appendix Table 9 we report fixed effects estimation results of Eq. 6 using the alternative measure of COVID-19 cases, which we construct using the death toll.Footnote 9 The estimations show similar relationships between the impacts of schools and workplace closures policies and country-level characteristics on transmission rates.

We test for robustness of the findings using an alternative measure of government response, the The OxCGRT Stringency Index (SI). We estimate how the relationship between SI and transmission rate varies with country-specific characteristics using the following specification,

$$ \log\beta_{jt}=\gamma_{0}+\gamma_{1} SI_{jt} + \mathbb{Z}_{j}\gamma_{2} SI_{jt}+\varepsilon_{jt}, $$
(7)

the measure of SIjt accounts for the lag between the policy and its effect on the transmission rate by using the average SI of prior 12 to 5 days. The estimation is limited to the periods where the SI is increasing.Footnote 10

Table 2 reports the results. Negative coefficients imply higher impact of the policies. Column (1) shows that there is a negative correlation between SI and transmission rate. Columns (2)-(6) gradually introduce aggregate country-specific controls. The patterns are similar to those reported in Table 1. The effectiveness of policies is declining with population density and surface area; and increasing with health expenditure and proportion of physicians in population.

Table 2 Government response stringency index (SI) and COVID-19 transmission rate, with country characteristics interactions, FE, selected countries, N = 4179

Internal mobility

To further investigate the cross-country differences in transmission rates, we analyze the variation in internal mobility (from Google Mobility Reports data). First, using the sample of all countries, we estimate how internal mobility varies before NPIs are enacted, focusing on five location categories: residential areas, retail and restaurants, public transit stations, parks and beaches, and workplaces. We use the following specification,

$$ MI_{ijt}=\lambda_{i0} + \lambda_{i1}(t-t^{*}) + \mathbb{Z}_{ij} \lambda_{i2}\ + u_{ijt}, $$
(8)

where MIijt is a mobility index for category i.

Table 3 reports the results of Eq. 8, where the “before” period is defined as 10 to 6 days before school/workplaces closures.Footnote 11 Mobility changes vary across countries; in the period immediately before the implementation of major social distancing NPIs, people spend relatively less time in residential areas (i.e., at home) in countries with higher proportion of physicians, higher health spendings, higher employment and lower GDP per capita.

Table 3 Change in daily Google mobility indexes before selected NPIs and aggregate measures, OLS, N = 235

The empirical model of response of internal mobility to the major NPIs is specified as follows,

$$ MI_{ijt}=\mu_{i0} + \sum\limits_{k=1}^{K}\mu_{i1k}\mathbb{I}_{jtk} + \sum\limits_{k=1}^{K}\mu_{i2k}\mathbb{Z}_{j} \mathbb{I}_{jtk} + \epsilon_{ijt}, $$
(9)

where \(\mathbb {I}_{jtk}\) is a binary indicator of policy k.Footnote 12 We estimate (9) using information between 7 days before and 7 days after the school or workplace closures. We focus on this short time period because of the relationship between mobility and transmission rates. Individuals make mobility decisions based on the threat of infection, we assume that this threat is relatively stable in the selected short period because of the delayed effect of social distancing NPIs on transmission rates (due to incubation and detection periods).Footnote 13

First, we estimate (9) excluding country-specific controls. Results in Table 4 show a strong relationship between the NPIs and internal mobility. In the period immediately after the implementation of major social distancing NPIs, time spent in residential areas is increasing when social distancing policies are enacted; whereas time spent on retail shopping, recreation, travel and work is declining. Table 5 reports estimation results of Eq. 9. The decline in time spent on activities in non-residential areas reflects the effectiveness of NPIs; it is higher in countries with lower proportion of older population, higher proportion of physicians, lower expenditure on health, lower employment rate, and smaller surface area. These findings are consistent with results in Table 1, where we estimate the relationship between social distancing NPIs and transmission rate.Footnote 14

Table 4 Post-NPIs changes in Google mobility indexes, selected countries, FE
Table 5 Post-NPIs changes in daily Google mobility indexes, with country characteristics interactions, FE, all countries, N = 1582

Discussion and Conclusion

Governments around the world have responded to the COVID-19 crisis with a range of NPIs aiming to flatten the spread of the virus. Recent studies show that such policies are effective in reducing the spread of the virus. However, it is not determined whether the same combination of NPIs is optimal for each country, or whether country specific characteristics should dictate which policies should be implemented or how they should be implemented. This paper aims to fill this gap by studying the interactions between the effectiveness of NPIs at a country-level and country-specific characteristics.

We embed NPIs in a dynamic epidemiological SIR model and show empirically how the effectiveness of NPIs varies with country-specific characteristics. The effectiveness of social distancing policies on transmission rates is declining with population density, surface area, pollution, employment rate and proportion of population over 65 years old; and increasing with GDP per capita and health expenditure. The effectiveness of the enhanced testing policy is increasing with GDP per capita, surface area, air pollution, proportion of older population; and declining with health expenditure.

Estimations that use the Google mobility data suggest that the behavioral response might be important in explaining the results, but it cannot explain all the differences in the effectiveness of NPIs across countries. Access to a better health system and higher employment rate are negatively correlated with staying at home before the social distancing policies are enacted. After the policies are enacted, we observe larger increases in staying at home in countries with lower proportion of elderly population and higher proportion of physicians. This result is consistent with the cross-country differences in transmission rates. This heterogeneity in the effectiveness of NPIs can be explained by differences in voluntary social distancing which varies with country-specific characteristics. Access to a better public health system may reduce voluntary social distancing due to decreased risks and NPIs designed to enforce social distancing are more effective in such communities. On the other hand, in communities with higher proportions of elderly, who are in a higher risk group to become severely ill, voluntary social distancing might be more prevalent before intervention policies are enacted.

Our behavioral interpretation of the variation in mobility and transmission rates across age and health system related dimensions is consistent with findings in other recent studies. For example, Andersen et al. (2020) compare consumption response to shutdowns using data from Sweden and Denmark. They show that there is a considerable heterogeneity across groups with different levels of infection risk. The effect of shutdowns on spending exhibits a significant age gradient: the effect is negative for young adults (18-29 years) and positive for the oldest group (70+ years). Andersen et al. (2020) conclude that shutdowns decrease the total spending of individuals with low health risk. Moreover, they show that the effects are particularly pronounced for categories involving high social proximity such as personal care and social spending. On the other hand, shutdowns increase the total spending of individuals with high health risk. Our findings suggest similar behavioral patterns in a cross-country comparison.

Differences in internal mobility and suggested differences in infection risk perceptions cannot explain the variation in transmission rates with respect to GDP per capita, size of geographical area, and population density. Higher population density, larger geographical area, higher employment rate, and lower GDP per capita are associated with less effective social distancing polices. Previous studies show that the spread of viral diseases is more pronounced in more densely populated areas (see for example, Zachreson et al. 2018) and when economic activity is higher (see for example, Markowitz et al. 2019 and Adda 2016), combining this existing evidence with our new findings suggests that the lower effectiveness might be related to resource requirements and the ability to enforce social distancing policies.Footnote 15

Recent studies propose models of complementarities between social distancing and testing policies. For example, Acemoglu et al. (2020) develop a theoretical model showing that enhanced testing policy can lead to more social activity and less social distancing (since people feel more confident), which may lead to an increase in infections. In Table 1 we show that testing is less effective in countries with higher health expenditure and more effective in countries with higher proportion of elderly population. Within the Acemoglu et al. (2020) model this result suggests that the increase in social activity due to the enhanced testing NPI is more likely to occur in places with more advanced health systems and in places with lower proportions of elderly.Footnote 16 It should also be considered that testing intensity varies across countries. Our results show that testing NPI is more effective in countries with higher GDP per capita, which is in line with the testing intensity assumption. We also find that the effectiveness of testing is increasing with surface area, which suggests that this policy is relatively more efficient in such geographic conditions.

Our results complement earlier epidemiological research and the expanding literature on interactions between behavioral response, economic incentives and epidemiological dynamics during the COVID-19 pandemic. Our analysis also complements recent literature on socio-economic inequalities associated with the COVID-19 crisis. The findings emphasize that the effectiveness of a given NPI policy varies with the socio-economic and geographic characteristics of a given community and highlight that targeted policies may improve the outcomes of government response policies.