Counterfactual time series analysis of short-term change in air pollution following the COVID-19 state of emergency in the United States

Dey, Tanujit; Tyagi, Pooja; Sabath, M. Benjamin; Kamareddine, Leila; Henneman, Lucas; Braun, Danielle; Dominici, Francesca

doi:10.1038/s41598-021-02776-0

Article
Open access
Published: 07 December 2021

Scientific Reports volume 11, Article number: 23517 (2021)

Abstract

Lockdown measures implemented in response to the COVID-19 pandemic produced sudden behavioral changes. We implement counterfactual time series analysis based on seasonal autoregressive integrated moving average (SARIMA) models to examine the extent of air pollution reduction attained following state-level emergency declarations. We also investigate whether these reductions occurred everywhere in the US, and the local factors (geography, population density, and sources of emission) that drove them. Following state-level emergency declarations, we found evidence of a statistically significant decrease in nitrogen dioxide (NO₂) levels in 34 of the 36 states and in fine particulate matter (PM_2.5) levels in 16 of the 48 states that were investigated. The lockdown produced a decrease of up to 3.4 µg/m³ in PM_2.5 (observed in California) with range (−2.3, 3.4) and up to 11.6 ppb in NO₂ (observed in Nevada) with range (−0.6, 11.6). Because the state of emergency was declared on different dates in different states, the period "before" the state of emergency in our analysis ranged from 8 to 10 weeks and the corresponding "after" period ranged from 8 to 6 weeks. These changes in PM_2.5 and NO₂ represent a substantial fraction of the annual mean National Ambient Air Quality Standards (NAAQS) of 12 µg/m³ and 53 ppb, respectively. As expected, we also found evidence that states with a higher percentage of mobile source emissions (based on 2014 emissions data) experienced a greater decline in NO₂ levels after the lockdown. Although the socioeconomic restrictions are not sustainable, our results provide a benchmark to estimate the extent of achievable air pollution reductions. Identification of factors contributing to pollutant reduction can help guide state-level policies to sustainably reduce air pollution.

Introduction

There is consistent evidence that short- and long-term exposure to fine particulate matter (PM_2.5) and nitrogen dioxide (NO₂) increases the risk of mortality, hospitalization, and other adverse health outcomes^{1,2,3,4,5,6,11,12}. Furthermore, several studies have provided preliminary evidence that short- and long-term air pollution exposure increases the risk of hospitalization and death among individuals with COVID-19^{4,5,6,7,8,9,10}.

The United States mitigates air pollution through a combination of federal, state, and local air pollution regulations¹³. For example, the federal government sets emissions standards and the NAAQS. It also requires states to prepare State Implementation Plans (SIPs) that detail emissions reduction strategies for areas that are not in compliance with the NAAQS (non-attainment areas). SIPs use air quality models to demonstrate how regulating local emissions sources helps a non-attainment area meet the NAAQS. Geographically heterogeneous regulations, emission sources, and meteorology result in air pollution concentrations that vary by geographic location^13,14.

Several studies have examined the impact of a sudden intervention on changes in air pollution (see¹⁵ for a review). For example, researchers used interrupted time-series designs to quantify the impact of the 1990 Dublin coal ban¹⁶ and regression discontinuity to identify the arbitrary spatial impact of the China Huai River Policy¹⁷. An important feature of these studies is that they investigated abrupt and localized changes across a relatively short time span (Dublin coal ban) and spatial scale (Huai River policy)¹⁸. Because of the abrupt nature of these interventions, defining a hypothetical experiment in these studies was straightforward.

Similarly, we examined the effect of the abrupt lockdown measures implemented in response to the COVID-19 pandemic, which produced sudden and significant changes in how society functions, with decreases in road traffic, air traffic, and economic activity¹⁹. This provided us with an unprecedented opportunity to implement a quasi-experimental design with a well-defined control condition (no pandemic) to estimate the changes in air pollution because of the implementation of these extreme measures. In a quasi-experimental design, the researcher compares outcomes between a treatment group and a control group, just as in a classical experiment; but treatment status (in our context the COVID-19 related intervention) is determined by politics, an accident, a regulatory action, or some other action beyond the researcher’s control (in our context the start of the pandemic). See²⁰ for a discussion of strengths and limitations of a quasi-experimental design. Furthermore, the spatial heterogeneity in the extent to which air pollution levels changed because of the lockdown measures allowed us to identify factors contributing to these changes.

A number of recent studies have investigated the effect of the COVID-19 pandemic on the levels of different air pollutants in the US^{21,22,23,24,25,26,27,28,29,30,31,32,33,34}, globally^{35,36,37,38,39} and for several cities around the world^{40,41,42,43,44,45,46,47,48,49,50,51}. Table 1 summarizes studies that have estimated changes in air pollution levels by comparing air pollution levels during the COVID-19 pandemic period to historical data both in the US and globally.

Table 1 Summary of published studies examining changes in air pollution attributable to COVID-19 related interventions in the US and globally.

Despite the emerging literature on this topic, these studies for the most part do not simultaneously account for autocorrelation, time trends and seasonality, and meteorological factors. To our knowledge, none of these studies attempts to identify state-level factors contributing to heterogeneity in the air pollution declines across states for both PM_2.5 and NO₂.

In this study, we had several scientific objectives that distinguish this paper from existing contributions in the literature. More specifically, we (1) develop and implement state-of-the-art time series approaches for counterfactual forecasting to predict weekly state-level concentrations of PM_2.5 and NO₂ from January 1, 2020, to April 23, 2020, under the hypothetical scenario that the pandemic did not occur. These models account for measured confounding (e.g., meteorological factors), unmeasured confounding (e.g., seasonal variation and time trends), and residual autocorrelation; (2) validate the accuracy of the model fitting and account for the uncertainty in the counterfactual forecasts via bootstrap; (3) estimate the weekly state-level deviations, with 95% CI, between counterfactual (i.e., absent the pandemic) and observed levels of PM_2.5 and NO₂ from January 1, 2020, to April 23, 2020; (4) assess whether the onset of deviations between the counterfactual and observed values coincides with key interventions implemented as a result of the pandemic; (5) assess, within each state, changes in both PM_2.5 and NO₂; and finally (6) investigate which state-level characteristics, including emissions sources, contributed the most to these changes, while adjusting for geography and population density.

Materials and methods

Data acquisition

We gathered and harmonized data from several databases (Table S1). We obtained historical daily monitor data of PM_2.5 and NO₂ concentrations for January 1, 2015 to August 31, 2019 from the US EPA Air Quality System⁵². We obtained current levels of these air pollutants for August 31, 2019 to April 23, 2020 from the EPA AirNow application programming interface⁵³. We linked historical and current monitor data within each state. These data were available for 48 states for PM_2.5 and 36 states for NO₂. We obtained daily temperature, humidity, and precipitation data from the University of Idaho’s GRIDMET project, which were then aggregated to the state level using Google Earth Engine⁵⁴.

We obtained state-level source emissions totals from the National Emissions Inventory for 2014⁵⁵, and gathered information on population density and geographic region classification of the states from the United States Census Bureau^56,57. Finally, we accessed the COVID-19 US State Policy Database⁵⁸ to extract information regarding the dates of COVID-19 related state interventions, including state-level declaration of emergency, shelter-in-place orders, and non-essential business closures for each state. All the data sources are publicly available and are summarized in Table S1. They are also available on GitHub, along with all code necessary to conduct the analysis: https://github.com/NSAPH/USA-COVID-state-level-air-pollution-SARIMA-analysis.

Statistical methods

Counterfactual forecasting of air pollution levels starting January 1, 2020

SARIMA models are autoregressive models often used to forecast time series where future observations are correlated with past observations^59,60. They have the advantage of accounting for the time trend, seasonality, confounders (e.g., meteorological variables), and residual autocorrelation. We fitted SARIMA models to historical data using weekly state-level air pollution levels (from January 1, 2015, to December 31, 2019) accounting for time trend, seasonality, autocorrelation and also accounting for the effect of weather by including temperature, precipitation, and humidity as covariates in the model.

The basis of the SARIMA model is a linear regression of a response variable $Y_{t}$ at time $t$ against the past values ($Y_{t-1}, Y_{t-2}, \ldots$) of $Y$ and the past forecast errors ($\varepsilon_{t-1}, \varepsilon_{t-2}, \ldots$). A detailed example of this analysis for NO₂ in California is provided in the supplementary materials, including model validation measures (Figures S1-S5).
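As a sketch of this regression form, a nonseasonal ARMA(p, q) model can be written as below. The orders $p$ and $q$, the intercept $c$, and the coefficients $\phi_k$ and $\theta_k$ are notation assumed here for illustration; the differencing, seasonal terms, and meteorological covariates used in the fitted models are omitted for brevity.

$$Y_{t} = c + \sum_{k=1}^{p} \phi_{k} Y_{t-k} + \sum_{k=1}^{q} \theta_{k} \varepsilon_{t-k} + \varepsilon_{t}$$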

We conducted the following analyses separately for PM_2.5 and NO₂ and for each state. The algorithm of the model construction and prediction is presented below.

1. We created 1,000 bootstrapped time series using Box-Cox and Loess-based decomposition⁶¹, which separates the time series into trend, seasonal, and remainder components; the remainder is then bootstrapped. We used historical data from January 1, 2015, to December 31, 2019 (see Figure S2 for an example of NO₂ in California).
2. For each bootstrapped time series, we:
- Fit SARIMA models^59,60,61 adjusting for meteorological factors, namely temperature, precipitation, and humidity (see Figure S3 for an example of NO₂ in California).
- Predict, from the fitted SARIMA models, counterfactual air pollution levels (absent the pandemic) during a 16-week period from January 01, 2020, to April 23, 2020 (see Figure S4 for an example of NO₂ in California).
3. For each state and for each week, we average the predicted counterfactual air pollution levels across all bootstrap replicates. We denote these averages by $C_{i,j}^{pred}$, where $i = 1,2, \ldots ,16$ indicates the week and $j$ indicates the state (see Figure S4 for an example of NO₂ in California).
4. For each state $j$ and for each week $i$, we estimate the weekly differences $\delta_{i,j} = C_{i,j}^{obs} - C_{i,j}^{pred}$, $i = 1,2, \ldots ,16$, between the observed values (under pandemic conditions) and the predicted values (assuming that the pandemic did not occur) (see Figure S5 for an example of NO₂ in California). We quantify the statistical uncertainty of these weekly differences using the bootstrap replicates; this approach is called “bagged SARIMA” (see Figures S4 and S5 for an example of NO₂ in California).
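Steps 3 and 4 can be illustrated with a minimal Python sketch (the authors' own code is in the repository they cite). It assumes the bootstrap forecasts from step 2 are already available as a list of replicates. The function names, the percentile-based interval, and the toy numbers are illustrative assumptions, not taken from the paper.

```python
from statistics import mean


def percentile(sorted_vals, q):
    """Linear-interpolation percentile of an already sorted list, q in [0, 1]."""
    pos = q * (len(sorted_vals) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(sorted_vals) - 1)
    frac = pos - lo
    return sorted_vals[lo] * (1 - frac) + sorted_vals[hi] * frac


def bagged_differences(observed, boot_forecasts, level=0.95):
    """Return one (C_pred, delta, delta_lower, delta_upper) tuple per week.

    observed       : weekly observed concentrations (C_obs)
    boot_forecasts : list of bootstrap replicates, each a list of weekly forecasts
    """
    alpha = (1 - level) / 2
    out = []
    for i, c_obs in enumerate(observed):
        week_preds = sorted(rep[i] for rep in boot_forecasts)
        c_pred = mean(week_preds)   # step 3: average over replicates
        delta = c_obs - c_pred      # step 4: observed minus counterfactual
        # Assumed percentile interval for delta, derived from the replicate spread
        lower = c_obs - percentile(week_preds, 1 - alpha)
        upper = c_obs - percentile(week_preds, alpha)
        out.append((c_pred, delta, lower, upper))
    return out


# Toy example: 3 weeks and 4 bootstrap replicates (made-up numbers)
observed = [10.0, 9.0, 6.0]
boot = [[10.0, 9.5, 9.0], [11.0, 9.0, 10.0], [9.0, 10.0, 9.5], [10.0, 9.5, 9.5]]
results = bagged_differences(observed, boot)
for week, (c_pred, delta, lower, upper) in enumerate(results, start=1):
    print(week, round(c_pred, 2), round(delta, 2), round(lower, 2), round(upper, 2))
```

In this toy input the third week has a clearly negative difference whose interval excludes zero, which is the kind of pattern the weekly comparison is meant to surface.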

The data and code for the analysis are available at https://github.com/NSAPH/USA-COVID-state-level-air-pollution-SARIMA-analysis.

Model assessment

To assess the overall predictive performance of the SARIMA model, we repeated the same procedure of model building and prediction as described in the algorithm above, this time training the model on data from January 1, 2015, to December 31, 2018, and predicting for a 16-week period from January 01, 2019, to April 23, 2019. This allows us to assess model fit and evaluate our modeling approach absent the pandemic. The main goal of this assessment is to measure the model's predictive performance in a period without the pandemic and to compare it with its performance during the pandemic, using the average prediction error defined below.

Average prediction error (APE) for state $j$:

$${\text{APE}}_{j} = \frac{1}{16}\sum\limits_{i = 1}^{16} \delta_{i,j} ,\quad \text{where}\;\delta_{i,j} = C_{i,j}^{obs} - C_{i,j}^{pred}$$

as defined in Step 4 of the algorithm above.
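The APE formula can be checked with a short Python sketch. The function name and the 16-week series below are made up for illustration and are not values from the paper.

```python
def average_prediction_error(observed, predicted):
    """APE_j: mean of the weekly differences delta_{i,j} = C_obs - C_pred."""
    if len(observed) != len(predicted):
        raise ValueError("observed and predicted must have the same length")
    deltas = [o - p for o, p in zip(observed, predicted)]
    return sum(deltas) / len(deltas)


# Made-up 2019-style series: forecast errors of opposite sign cancel out
obs_2019 = [10.0] * 16
pred_2019 = [10.5] * 8 + [9.5] * 8
ape_2019 = average_prediction_error(obs_2019, pred_2019)

# Made-up 2020-style series: observed values drop in the last 8 weeks
obs_2020 = [10.0] * 8 + [7.0] * 8
pred_2020 = [10.0] * 16
ape_2020 = average_prediction_error(obs_2020, pred_2020)

print(ape_2019, ape_2020)
```

Because the differences are signed, errors in opposite directions cancel in the average, as the first toy series shows. This is worth keeping in mind when comparing APE values across years.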

We used the auto.arima function from the R package forecast to select the model orders and coefficients with the best predictive capability based on the bias-corrected Akaike Information Criterion (AICc)^62,63, and then used the mean absolute scaled error (MASE) to evaluate the fit of the model⁶⁴.

Estimating air pollution changes attributable to state-level emergency declarations

In step 2 described above, we start the counterfactual forecasting for the period January 01, 2020, to April 23, 2020, without any consideration of the date of the intervention (such as the declaration of the state of emergency). After the forecasting was complete, we chose the declaration of the state of emergency as the intervention because it aligned most closely, by visual inspection, with the onset of deviations from the forecasted pollutant concentrations. Other interventions, including the timing of non-essential business closures and shelter-in-place orders, were also considered visually (see Figures S7 and S8 in the supplementary material); the dates of these interventions differ by less than two weeks.

We use $T_{int,j}$ to denote the date of the state intervention (declaration of the state of emergency) for each state $j$. For each state and for each of the two pollutants (PM_2.5 and NO₂), we estimated the parameter $\Delta_{j}$ denoting the change in pollutant concentrations following the state intervention compared to before by calculating:

$$\Delta_{j} = \Delta_{before,j} - \Delta_{after,j} \tag{1}$$

where $\Delta_{before,j}$ is the median of the weekly deviations $\delta_{i,j}$ (as defined in step 4 above) for the weeks before the date of the declaration of the state of emergency ($T_{int,j}$), and $\Delta_{after,j}$ is the median of these weekly deviations for the weeks after $T_{int,j}$. Because of the good fit of the SARIMA model to the historical data (Figure S6), and because the counterfactual forecasting is agnostic to the date of the state-level emergency (see Figures S4, S5 for an example of NO₂ forecasting in California), we argue that positive estimated values of $\Delta_{j}$ indicate that air pollution levels declined because of the state-level emergency: since $\delta_{i,j}$ is the observed value minus the predicted value, a decline after the declaration makes $\Delta_{after,j}$ negative and hence $\Delta_{j}$ positive. Because the state of emergency was declared on different dates in different states and the total length of the prediction period was 16 weeks in 2020, the period "before" the state of emergency in our analysis ranged from 8 to 10 weeks and the corresponding "after" period ranged from 8 to 6 weeks.
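A minimal Python sketch of Eq. 1 follows. The function name, the split index `n_before`, and the 16 weekly deviations are hypothetical values chosen for illustration. With the weekly deviations defined as observed minus predicted, a drop after the declaration gives negative deviations in the "after" weeks and therefore a positive $\Delta_{j}$.

```python
from statistics import median


def delta_j(weekly_deltas, n_before):
    """Eq. 1: median weekly deviation before the declaration minus median after."""
    before = weekly_deltas[:n_before]
    after = weekly_deltas[n_before:]
    if not before or not after:
        raise ValueError("need at least one week on each side of the declaration")
    return median(before) - median(after)


# 16 made-up weekly deviations (observed minus predicted);
# the declaration is assumed to fall after week 10
deltas = [0.2, -0.1, 0.0, 0.1, -0.2, 0.0, 0.1, -0.1, 0.0, 0.1,
          -2.8, -3.1, -3.0, -2.9, -3.2, -3.0]
change = delta_j(deltas, n_before=10)
print(change)
```

Using the median rather than the mean keeps a single unusual week from dominating either side of the comparison.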

To identify the states with the most pronounced discrepancy between the pattern of change in PM_2.5 and NO₂, we calculated the ratio (ρ_j) for each state j, defined as:

$$\rho_{j} = \Delta_{{NO_{2} ,j}} /\Delta_{{PM_{2.5} ,j}} \tag{2}$$

If ${\rho }_{j} <0$, the two pollutants changed in opposite directions (i.e., one increased while the other decreased), and the larger the magnitude of ${\rho }_{j}$, the larger the discrepancy between the pollutants’ patterns of change.
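The ratio in Eq. 2 and its sign interpretation can be sketched in a few lines of Python. The function names and the input values are hypothetical and are not taken from the paper's tables.

```python
def pollutant_ratio(delta_no2, delta_pm25):
    """Eq. 2: rho_j = Delta_{NO2,j} / Delta_{PM2.5,j}."""
    if delta_pm25 == 0:
        raise ZeroDivisionError("rho_j is undefined when the PM2.5 change is zero")
    return delta_no2 / delta_pm25


def opposite_directions(rho):
    """True when the two pollutant changes have opposite signs."""
    return rho < 0


# Hypothetical state-level changes
rho_a = pollutant_ratio(3.0, 1.5)    # changes with the same sign
rho_b = pollutant_ratio(3.0, -0.5)   # changes with opposite signs
print(rho_a, opposite_directions(rho_a))
print(rho_b, opposite_directions(rho_b))
```

One caveat that follows directly from the formula: a PM_2.5 change close to zero inflates the magnitude of the ratio, so a large value can reflect a small denominator as much as a large NO₂ change.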

Regression modeling to identify state-level factors contributing to heterogeneity in the air pollution across states

In this part of the analysis, our goal is to quantify the associations between the change in pollutant concentrations during the forecasting period January 01, 2020, to April 23, 2020, and several sources of pollutants along with a few geographical variables. The estimated ${\Delta }_{j}$ (as defined in Eq. 1) is the outcome for each state for each of the two pollutants, separately. We used the following independent variables: the proportion of emissions from fire sources, stationary sources, and mobile sources (based on 2014 emissions data); population density; and region of the state. Note that the NEI reports four sources of emission: fire, mobile, stationary, and biogenic. We used only three of these (fire, stationary, and mobile sources) as predictors in the regression model, and therefore their proportions do not sum to 1. Instead of using a regular multivariable linear regression model, we chose to use a weighted multivariable linear regression (WMLR) model. The reasons for using this model are: (1) it can incorporate the covariance matrix of the errors, which is beneficial for heteroscedastic data such as these; and (2) because of the variability in pollutant concentrations across states, the WMLR model is more robust to outliers than regular regression models. Lastly, based on pairwise correlation assessments, we included all two-factor interaction terms of the predictors in addition to their main effects. Models with the interaction terms not only better quantify the associations between the outcome and the covariates; they also show better goodness of fit, as measured by R² and adjusted R², than models without the interaction terms.
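To make the weighted regression with two-factor interactions concrete, here is a small pure-Python sketch that solves the weighted normal equations for two predictors and their interaction. Everything in it is an illustrative assumption: the paper does not state how the weights were chosen, the data are synthetic, and only two of the predictors are included to keep the example short.

```python
def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented copy
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("singular system")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x


def weighted_least_squares(rows, y, weights):
    """Minimise sum_j w_j * (y_j - x_j . beta)^2 via the normal equations."""
    k = len(rows[0])
    xtwx = [[sum(w * r[a] * r[b] for r, w in zip(rows, weights))
             for b in range(k)] for a in range(k)]
    xtwy = [sum(w * r[a] * yj for r, yj, w in zip(rows, y, weights))
            for a in range(k)]
    return solve_linear_system(xtwx, xtwy)


def design_row(mobile, stationary):
    """Intercept, two main effects, and their two-factor interaction."""
    return [1.0, mobile, stationary, mobile * stationary]


# Synthetic "states": emission proportions, weights, and a noise-free outcome
mobile = [0.2, 0.5, 0.3, 0.6, 0.4, 0.1]
stationary = [0.5, 0.3, 0.2, 0.1, 0.4, 0.6]
weights = [1.0, 2.0, 0.5, 1.5, 1.0, 2.5]   # hypothetical weights
true_beta = [0.5, 4.0, 1.0, -2.0]          # used only to generate the toy outcome
rows = [design_row(m, s) for m, s in zip(mobile, stationary)]
y = [sum(b * x for b, x in zip(true_beta, row)) for row in rows]

beta_hat = weighted_least_squares(rows, y, weights)
print([round(b, 6) for b in beta_hat])
```

Because the toy outcome is generated without noise, the fit recovers the generating coefficients regardless of the weights. With real data the weights change the estimates by down-weighting observations judged to be noisier.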

Results

Short-term change in air pollutants following the COVID-19 state of emergency

For most states, the differences between the SARIMA counterfactual predictions (i.e., assuming the pandemic did not occur) and the observed pollutant values were close to zero during the period before the state-level emergency declaration (Figs. 1, 2), but there were significant deviations following the intervention lockdown measures.

We found evidence of a statistically significant decrease in NO₂ concentrations following the declaration of a state of emergency in 34 of the 36 states that were investigated (Fig. 1 and Tables S2-a, S3). The change in NO₂ following the declaration of a state of emergency, ${\Delta }_{j}$, calculated using Eq. 1, ranged from -0.6 ppb to 11.6 ppb across the states, with an average change of 3.1 ppb and standard deviation 2.4 ppb.

We also found evidence of a statistically significant decline in PM_2.5 concentrations in 16 of the 48 states that were studied, including New York and other states in the Northeast and West Coast (Figs. 2, 3, Tables S2-b, S4, and Figure S9). The change in PM_2.5 following the declaration of a state of emergency ranged from -2.3 µg/m³ to 3.4 µg/m³ across the states, with an average change of 0.3 µg/m³ and standard deviation 1.3 µg/m³.

In Figures S10 and S11 we show the difference between actual and predicted levels of NO₂ and PM_2.5, respectively for all states. Even though the date of the lockdown was not incorporated into the SARIMA model for counterfactual forecasting, the observed values were closer to the predicted values of NO₂ before the state of emergency declarations compared to after the state of emergency declarations. For PM_2.5 the differences before and after the state of emergency declarations are not as large.

To quantify how well the SARIMA models can predict for a given period, as described in the Methods section we compared the predictive performances of the models during the same period for 2019 (no pandemic) compared to the main analysis for 2020 (pandemic). We assessed how the APE behaves for both 2019 and 2020 for each state. From Figs. 4 and 5, except for a few states, the APEs are higher for 2020 compared to 2019 for each pollutant model. In addition to using the APE, we also summarize information regarding the model evaluation in the supplementary materials. Figure S3 shows that, for example, the SARIMA model has excellent goodness of fit for the historical data for NO₂ in California.

We also found that the MASE for the fitted models was less than 1 for each pollutant in each state (Figure S6), indicating that the SARIMA model outperformed one-step naïve forecasts, which use the value at time $t$ to predict the outcome at time $t + 1$.
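The MASE criterion can be sketched as follows, using the common definition in which forecast errors are scaled by the in-sample mean absolute error of the one-step naïve forecast. The function name and the numbers are illustrative assumptions; the paper does not list the exact computation.

```python
def mase(train, actual, forecast):
    """Mean absolute scaled error, scaled by the one-step naive forecast error."""
    if len(train) < 2:
        raise ValueError("need at least two training observations")
    # In-sample error of the naive forecast (value at t predicts value at t + 1)
    naive_mae = sum(abs(train[t] - train[t - 1])
                    for t in range(1, len(train))) / (len(train) - 1)
    if naive_mae == 0:
        raise ZeroDivisionError("naive forecast error is zero; MASE is undefined")
    mae = sum(abs(a - f) for a, f in zip(actual, forecast)) / len(actual)
    return mae / naive_mae


# Made-up series
train = [10.0, 12.0, 11.0, 13.0, 12.0]
actual = [12.0, 14.0]
forecast = [12.5, 13.0]
score = mase(train, actual, forecast)
print(score)
```

A value below 1 means the forecasts have a smaller average absolute error than the naïve benchmark on the training series, which is the reading applied in the text above.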

Figure 6 shows the estimated ρ_j defined as the ratio of the estimated ${\Delta }_{j}$ for NO₂ divided by the estimated ${\Delta }_{j}$ for PM_2.5. More than one-third of the states (13 states) had $\rho <0$ , i.e., these states experienced a decrease in NO₂ and a simultaneous increase in PM_2.5. The contrast between the pattern of change of NO₂ and PM_2.5 following state-level emergency declarations suggests that dominating sources of these two pollutants are different in those states. It is also noticeable from Fig. 6 that, for these 13 states the changes in PM_2.5 are not statistically significant (Table S2-b) and in 3 states (CO, GA, WA), the NO₂ changes are not significant either (Table S2-a). We see more states with statistically significant changes in NO₂, than PM_2.5, following state-level emergency declarations.

State-level factors may explain the heterogeneity in air pollution declines across states

To ascertain which state-level factors might explain the heterogeneity in the extent to which air pollution declined across states, we fit a weighted multivariable linear regression model with the estimated ${\Delta }_{j}$ (for each pollutant separately) for each state as the dependent variable, and geography, population density, and sources of emission as predictors, accounting for the main effects and their corresponding two-factor interactions. For the PM_2.5 model, none of the proportions of annual emissions from a state’s stationary (e.g., industrial processes), mobile (e.g., road and air traffic), and fire sources (e.g., agricultural field burning) (Table S5, from 2014) is statistically significant, and all have negative associations with the change in PM_2.5 concentration (Table S6). In contrast, the proportions of annual emissions from mobile sources and stationary sources were statistically significantly associated with the change in NO₂ concentration (Table S6).

Discussion

Following the declaration of a state of emergency, we found that NO₂ concentrations showed a statistically significant decline in 34 of the 36 states included in this analysis. In contrast, PM_2.5 concentrations declined in only 16 of the 48 states included in this analysis. These 16 states are in the Northeast and on the West Coast. Furthermore, as expected, we found that the proportions of a state’s annual emissions from mobile sources and stationary sources are statistically significant factors in NO₂ changes in response to the state emergency declaration. For PM_2.5 reductions, none of the three sources (mobile, stationary, and fire) was a statistically significant predictor, and all have negative associations with the changes in PM_2.5 concentrations. We concluded that state of emergency declarations implemented in response to the COVID-19 pandemic predominantly affected mobile sources (e.g., cancelled flights and reduced traffic)⁶⁵ and stationary sources and led to a decline in NO₂. However, because the major sources of PM_2.5 are stationary (e.g., industrial fuel combustion), these were less affected by state-level emergency declarations (Table S5).

SARIMA models have some advantageous features compared to other statistical approaches. Recent studies^22,23,29,36 have used t-tests²², a robust difference approach²³, linear regression³⁶, and synthetic control methods⁶⁶ to study the changes in US air pollution attributable to the COVID-19 shutdown. Bekbulat et al. used temporal correction in their robust differences approach²³ (not peer-reviewed as of August 03, 2020) and Venter et al. included meteorological factors in their regression³⁶. The latter was a global study of air pollution changes during the pandemic, which found that a decline in NO₂ in the United States occurred on a national level. However, these methods do not directly incorporate the correlations between observed pollutant concentrations, or the trends and seasonality in the data. We accounted for both factors using SARIMA models. Any contribution to the data from generally decreasing air pollution trends and weather seasonality must be removed to best estimate the effect of pandemic-related extreme measures on air pollution. By further combining SARIMA with bootstrapping, we were able to quantify the uncertainty in the estimated mean predictions.

We note that our counterfactual predictions of pollutant concentrations assume that the trend and seasonality during the last five years (i.e., the training period for the model) persisted during the prediction period (January 1, 2020, to April 23, 2020). Another assumption was that the relationship between the meteorological variables used in the SARIMA model (temperature, humidity, and precipitation) and the pollutant concentrations was the same in both the training and prediction periods^67,68. While in California for NO₂ (see Figures S1 to S5) we see smaller differences between predicted and observed concentrations before the state of emergency, in some states this was not the case, and we see differences between the predicted and observed concentrations for the entire January 2020 to April 2020 period. We would not expect the model’s predictive capability to affect the estimation of the pollutant concentrations before and after the state intervention differently. Therefore, where we observed significant deviations from the predicted concentrations following the state intervention, we can be confident that they are due to the intervention and not to the model’s predictive capability. Additionally, we fit a separate SARIMA model to predict the same period in the previous year (January 1, 2019, to April 23, 2019). Comparing the APEs, we find that they are higher for 2020 than for 2019, except for a few states, for each of the pollutant models.

In contrast to other studies (see for example²²), we did not a priori divide our data into pre- and post-COVID-19 periods. We used January 1, 2015, to December 31, 2019, as historical data and then used the SARIMA model to predict the counterfactual pollutant levels during the 16-week period from January 1, 2020, to April 23, 2020, under the hypothesis that neither the pandemic nor the state emergency declaration occurred. In other words, we first predicted air pollution levels for the whole study period of 16 weeks. We then looked a posteriori to determine whether the NO₂ or PM_2.5 declines coincided with state-level emergency declarations (see Figures S1-S5 for an example of NO₂ in California).

By identifying the maximum decline in the median pollutant concentrations following state-level emergency declaration, we found that the extreme measures taken during the pandemic led to a change of PM_2.5 of up to 3.4 µg/m³ (in California) and a change of NO₂ of up to 11.6 ppb (in Nevada). These weekly-averaged values represent a substantial fraction of the annual mean NAAQS values of 12 µg/m³ and 53 ppb, respectively. Based on the national regression model, there is significant potential to reduce NO₂ concentrations by reducing mobile and stationary sources of NO₂ emissions, provided the same level of change can be sustained throughout the annual cycle. But these associations were not seen in the PM_2.5 regression model. In Table 1, we summarized the published evidence from similar studies in the US. For example, Berman et al. (2020) examined all the counties in the US for both PM_2.5 and NO₂. They found a 25.5% reduction (4.8 ppb) in NO₂ during the COVID-19 period and an 11.3% statistically significant reduction (0.7 μg/m³) of PM_2.5 in counties from states that instituted early non-essential business closures²². However, the statistical analysis of this study relies on t-tests and does not account for confounding or residual autocorrelation. Overall, among studies summarized in Table 1, there is consistent evidence of a decline of NO₂ for most of the locations^22,24,25,26, whereas the evidence of declines in PM_2.5 is weaker (see for example^26,28). In addition, one relevant pre-print study found that PM_2.5 concentrations during lockdown were 10% (0.54 μg/m³) higher than expected post-COVID, but 11% (0.73 μg/m³) lower than pre-COVID, with a 31% decrease in NO₂ levels in 3 major cities²³. Another relevant pre-print study found a nationwide average increase of 1.36 μg/m³ in PM_2.5 following official lockdown orders⁶⁶.
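For scale, the comparison with the NAAQS made at the start of the preceding paragraph corresponds to the following ratios, computed directly from the figures quoted there (rounded to two decimals):

$$\frac{3.4\;\mu\text{g/m}^{3}}{12\;\mu\text{g/m}^{3}} \approx 0.28, \qquad \frac{11.6\;\text{ppb}}{53\;\text{ppb}} \approx 0.22$$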

With respect to studies outside the US, a recent study investigated the effect of lockdown in urban China using a difference-in-differences approach; it found a decline of 14 µg/m³ in locked-down cities compared to cities that did not implement a lockdown⁶⁹. The cities in that study had baseline PM_2.5 concentrations four times higher than the safe limits set by the World Health Organization, which may have been partly responsible for a larger decline after lockdown compared to what we observed in the United States. Another study used baseline regression to estimate the impact of lockdown on 44 cities in Northern China and found a 5.93% decrease in PM_2.5 and a 24.67% decrease in NO₂ concentrations during lockdown⁴⁰. Others have used paired t-tests and the autoregressive moving average (ARMA) model to quantify the impact of the COVID-19 lockdown in 41 cities in India on pollution levels, and found a 19% decrease in NO₂ compared to the same period in 2019⁴⁵.

Our study results support the effectiveness of state-level actions to reduce ambient levels of PM_2.5 and NO₂, and specifically, that restrictions on stationary and mobile sources of air pollution could decrease NO₂ emissions even further in states where mobile sources constitute a larger proportion of annual emissions. In contrast, PM_2.5 concentration reduction may not be as easily achieved through these sources alone. In states where changes in PM_2.5 and NO₂ exhibited opposite trends (one increased while the other decreased), lowering the emission of NO₂, by decreasing mobile source emissions for example, may not necessarily decrease PM_2.5 concentrations.

Study limitations

The models were fit separately for PM_2.5 and NO₂ and we did not account for correlation between the two pollutants. We relied on state-level concentration averages and the 2014 emissions inventory. While our study would benefit greatly from a more recent emissions inventory (or spatial emissions estimates during the interventions), to our knowledge, such data are not currently publicly available. With respect to fire emissions, we note that although we only have data from 2014, regions with larger areas burned in 2014 have a higher propensity to have larger areas burned in 2020⁷⁰. Using finer spatial resolution in the monitoring data, rather than averaging to the state level, may reveal important sub-state variability in lockdown impacts. Monitor data were obtained from EPA AirNow and have not undergone quality control by the EPA. We did not remove outlier observations; however, we averaged hourly measurements by day and by state, which would have minimized the impact of outliers. Our approach also does not consider the spatial correlations between pollutant concentrations, which may help explain concentration changes in non-local pollutants such as PM_2.5. Wind speed was not included in the SARIMA model; adjusting for wind speed could have improved the predictions further. In addition, data were available for 36 states for NO₂ and 48 states for PM_2.5, which limited the number of observations in the weighted regression model. Finally, even though we accounted for the interactions between the predictors in the weighted least squares regression models, adjusting for other potential predictors could improve the prediction.

References

Crouse, D. L. et al. Ambient PM_2.5, O₃, and NO₂ Exposures and Associations with Mortality over 16 Years of Follow-Up in the Canadian Census Health and Environment Cohort (CanCHEC). Environ. Health Perspect. 123, 1180–1186 (2015).
Pope, C. A., Coleman, N., Pond, Z. A. & Burnett, R. T. Fine particulate air pollution and human mortality: 25+ years of cohort studies. Environ. Res. 183, 108924 (2020).
Brook, R. D. et al. Particulate matter air pollution and cardiovascular disease: an update to the scientific statement from the American Heart Association. Circulation 121, 2331–2378 (2010).
Wu, X., Nethery, R. C., Sabath, B. M., Braun, D. & Dominici, F. Exposure to air pollution and COVID-19 mortality in the United States: a nationwide cross-sectional study. MedRxiv https://doi.org/10.1101/2020.04.05.20054502v2 (2020).
Cohen, A. J. et al. Estimates and 25-year trends of the global burden of disease attributable to ambient air pollution: an analysis of data from the Global Burden of Diseases Study 2015. The Lancet 389, 1907–1918 (2017).
Horne, B. D. et al. Short-term elevation of fine particulate matter air pollution and acute lower respiratory infection. Am. J. Respir. Crit. Care Med. 198, 759–766 (2018).
Rhee, J. et al. Impact of long-term exposures to ambient PM2.5 and ozone on ARDS risk for older adults in the United States. Chest 156, 71–79 (2019).
Bhaskar, A., Chandra, J., Braun, D., Cellini, J. & Dominici, F. Air pollution, SARS-CoV-2 transmission, and COVID-19 outcomes: a state-of-the-science review of a rapidly evolving research area. medRxiv https://doi.org/10.1101/2020.08.16.20175901 (2020).
Pozzer, A. et al. Regional and global contributions of air pollution to risk of death from COVID-19. Cardiovasc. Res. 116, 2247–2253 (2020).
Benmarhnia, T. Linkages between air pollution and the health burden from COVID-19: methodological challenges and opportunities. Am. J. Epidemiol. 189, 1238–1243 (2020).
Di, Q. et al. Air pollution and mortality in the medicare population. N. Engl. J. Med. 376, 2513–2522 (2017).
Shi, L. et al. Low-concentration PM2.5 and mortality: estimating acute and chronic effects in a population-based study. Environ. Health Perspect. 124, 46–52 (2016).
Air quality management in the United States. (National Academies Press, 2004).
Jiang, Z. et al. Unexpected slowdown of US pollutant emission reduction in the past decade. Proc. Natl. Acad. Sci. 115, 5099–5104 (2018).
Zigler, C. M. & Dominici, F. Point: clarifying policy evidence with potential-outcomes thinking-beyond exposure-response estimation in air pollution epidemiology. Am. J. Epidemiol. 180, 1133–1140 (2014).
Dockery, D. W. et al. Effect of air pollution control on mortality and hospital admissions in Ireland. Res. Rep. Health Eff. Inst. 3–109 (2013).
Chen, Y., Ebenstein, A., Greenstone, M. & Li, H. Evidence on the impact of sustained exposure to air pollution on life expectancy from China’s Huai River policy. Proc. Natl. Acad. Sci. 110, 12936–12941 (2013).
Henneman, L. R. F. et al. Air quality accountability: developing long-term daily time series of pollutant changes and uncertainties in Atlanta, Georgia resulting from the 1990 Clean Air Act Amendments. Environ. Int. 123, 522–534 (2019).
Badger, E. & Parlapiano, A. Government Orders Alone Didn’t Close the Economy. They Probably Can’t Reopen It. https://www.nytimes.com/2020/05/07/upshot/pandemic-economy-government-orders.html. (2020).
Dominici, F., Greenstone, M. & Sunstein, C. R. Particulate matter matters. Science 344, 257–259 (2014).
Xiang, J. et al. Impacts of the COVID-19 responses on traffic-related air pollution in a Northwestern US city. Sci. Total Environ. 747, 141325 (2020).
Berman, J. D. & Ebisu, K. Changes in U.S. air pollution during the COVID-19 pandemic. Sci. Total Environ. 739, 139864 (2020).
Bekbulat, B. et al. PM2.5 and ozone air pollution levels have not dropped consistently across the US following societal covid response. Sci. Total Environ. https://doi.org/10.26434/chemrxiv.12275603.v6 (2020).
Goldberg, D. L. et al. Disentangling the impact of the COVID-19 lockdowns on urban NO2 from natural variability. Geophys. Res. Lett. https://doi.org/10.1029/2020GL089269 (2020).
Karaer, A., Balafkan, N., Gazzea, M., Arghandeh, R. & Ozguven, E. E. Analyzing COVID-19 impacts on vehicle travels and daily nitrogen dioxide (NO2) levels among Florida counties. Energies 13, 6044 (2020).
Parker, H. A., Hasheminassab, S., Crounse, J. D., Roehl, C. M. & Wennberg, P. O. Impacts of traffic reductions associated with COVID-19 on Southern California air quality. Geophys. Res. Lett. 47, e2020GL090164 (2020).
Miech, J. A., Herckes, P. & Fraser, M. P. Effect of COVID-19 travel restrictions on Phoenix air quality after accounting for boundary layer variations. ScienceDirect. (2021).
Gillingham, K. T., Knittel, C. R., Li, J., Ovaere, M. & Reguant, M. The short-run and long-run effects of covid-19 on energy and the environment. Joule 4, 1337–1341 (2020).
Chen, L.-W.A., Chien, L.-C., Li, Y. & Lin, G. Nonuniform impacts of COVID-19 lockdown on air quality over the United States. Sci. Total Environ. 745, 141105 (2020).
Sarfraz, M., Shehzad, K. & Farid, A. Gauging the air quality of New York: a non-linear Nexus between COVID-19 and nitrogen dioxide emission. Air Qual. Atmos. Health 13, 1135–1145 (2020).
Hudda, N., Simon, M. C., Patton, A. P. & Durant, J. L. Reductions in traffic-related black carbon and ultrafine particle number concentrations in an urban neighborhood during the COVID-19 pandemic. Sci. Total Environ. 742, 140931 (2020).
Zangari, S., Hill, D. T., Charette, A. T. & Mirowsky, J. E. Air quality changes in New York City during the COVID-19 pandemic. Sci. Total Environ. 742, 140496 (2020).
Liu, Q. et al. Spatiotemporal impacts of COVID-19 on air pollution in California, USA. Sci. Total Environ. 750, 141592 (2021).
Naeger, A. R. & Murphy, K. Impact of COVID-19 containment measures on air pollution in California. Aerosol Air Qual. Res. 20, 2025–2034 (2020).
Fu, F., Purvis-Roberts, K. L. & Williams, B. Impact of the COVID-19 pandemic lockdown on air pollution in 20 major cities around the world. Atmosphere 11, 1189 (2020).
Venter, Z. S., Aunan, K., Chowdhury, S. & Lelieveld, J. COVID-19 lockdowns cause global air pollution declines. Proc. Natl. Acad. Sci. 117, 18984–18990 (2020).
Ching, J. & Kajino, M. Rethinking air quality and climate change after COVID-19. Int. J. Environ. Res. Public. Health 17, 5167 (2020).
Forster, P. M. et al. Current and future global climate impacts resulting from COVID-19. Nat. Clim. Change 10, 913–919 (2020).
Covid-19 Changes Climate Patterns. Public Health Post https://www.publichealthpost.org/research/covid-19-changes-the-climate-patterns/.
Bao, R. & Zhang, A. Does lockdown reduce air pollution? Evidence from 44 cities in northern China. Sci. Total Environ. 731, 139052 (2020).
Aloi, A. et al. Effects of the COVID-19 lockdown on urban mobility: empirical evidence from the City of Santander (Spain). Sustainability 12, 3870 (2020).
Tobías, A. et al. Changes in air quality during the lockdown in Barcelona (Spain) one month into the SARS-CoV-2 epidemic. Sci. Total Environ. 726, 138540 (2020).
Miyazaki, K. et al. Air Quality Response in China Linked to the 2019 Novel Coronavirus (COVID-19) Lockdown. Geophys. Res. Lett. 47, e2020GL089252 (2020).
Toro, A. R. et al. Air pollution and COVID-19 lockdown in a large South American city: Santiago Metropolitan Area, Chile. Urban Clim. 36, 100803 (2021).
Vadrevu, K. P. et al. Spatial and temporal variations of air pollution over 41 cities of India during the COVID-19 lockdown period. Sci. Rep. 10, 16574 (2020).
Viatte, C. et al. Ammonia and PM2.5 air pollution in Paris during the 2020 COVID lockdown. Atmosphere 12, 160 (2021).
Wang, P., Chen, K., Zhu, S., Wang, P. & Zhang, H. Severe air pollution events not avoided by reduced anthropogenic activities during COVID-19 outbreak. Resour. Conserv. Recycl. 158, 104814 (2020).
Wu, C.-L. et al. Impact of the COVID-19 lockdown on roadside traffic-related air pollution in Shanghai, China. Build. Environ. https://doi.org/10.1016/j.buildenv.2021.107718 (2021).
Malpede, M. & Percoco, M. Lockdown measures and air quality: evidence from Italian provinces. Lett. Spat. Resour. Sci. https://doi.org/10.1007/s12076-021-00267-4 (2021).
Benchrif, A., Wheida, A., Tahri, M., Shubbar, R. M. & Biswas, B. Air quality during three COVID-19 lockdown phases: AQI, PM2.5 and NO2 assessment in cities with more than 1 million inhabitants. Sustain. Cities Soc. 74, 103170 (2021).
Hammer, M. S. et al. Effects of COVID-19 lockdowns on fine particulate matter concentrations. Sci. Adv. 7, eabg7670 (2021).
Outdoor Air Quality Data, https://www.epa.gov/outdoor-air-quality-data/download-daily-data.
AirNow, https://www.airnow.gov/.
Abatzoglou, J. T. Development of gridded surface meteorological data for ecological applications and modelling. Int. J. Climatol. 33, 121–131 (2013).
2014 National Emissions Inventory Report. https://gispub.epa.gov/neireport/2014/.
Census Regions and Divisions of the United States. https://www2.census.gov/geo/pdfs/maps-data/maps/reference/us_regdiv.pdf.
U.S. Census Bureau (2010). Population Density Data. Retrieved from: https://www.census.gov/data/tables.html.
Raifman, J. et al. COVID-19 US State Policy Database. (2020) doi:10.3886/E119446V1.
Hyndman, R. J. & Athanasopoulos, G. Forecasting: principles and practice. (OTexts, 2014).
Box, G. E. P., Jenkins, G. M., Reinsel, G. C. & Ljung, G. M. Time series analysis: forecasting and control. (John Wiley & Sons, Inc, 2016).
Bergmeir, C., Hyndman, R. J. & Benítez, J. M. Bagging exponential smoothing methods using STL decomposition and Box-Cox transformation. Int. J. Forecast. 32, 303–312 (2016).
Hyndman, R. J. & Khandakar, Y. Automatic Time Series Forecasting: The forecast Package for R. J. Stat. Softw. 27, (2008).
Konishi, S. & Kitagawa, G. Information Criteria and Statistical Modeling. in 245–247.
Hyndman, R. J. & Koehler, A. B. Another look at measures of forecast accuracy. Int. J. Forecast. 22, 679–688 (2006).
Shilling, F. & Waetjen, D. Special Report (Update): Impact of COVID19 Mitigation on Numbers and Costs of California Traffic Crashes. 11 https://roadecology.ucdavis.edu/files/content/projects/COVID_CHIPs_Impacts.pdf.
Chen, K. L., Henneman, L. R. F. & Nethery, R. C. Differential impacts of COVID-19 lockdowns on PM2.5 across the United States. medRxiv https://doi.org/10.1101/2021.03.10.21253284 (2021).
Brodersen, K. H., Gallusser, F., Koehler, J., Remy, N. & Scott, S. L. Inferring causal impact using Bayesian structural time-series models. Ann. Appl. Stat. 9, 247–274 (2015).
Abadie, A., Diamond, A. & Hainmueller, J. Synthetic control methods for comparative case studies: estimating the effect of California’s tobacco control program. J. Am. Stat. Assoc. 105, 493–505 (2010).
He, G., Pan, Y. & Tanaka, T. The short-term impacts of COVID-19 lockdown on urban air pollution in China. Nat. Sustain. https://doi.org/10.1038/s41893-020-0581-y (2020).
Wildfires and Acres | National Interagency Fire Center. https://www.nifc.gov/fire-information/statistics/wildfires.

Funding

U.S. Environmental Protection Agency (83587201–0), National Institutes of Health (R01ES026217), 2020 Starr Friedman Award, Harvard University, Climate Change Solutions Fund.

Author information

These authors contributed equally: Tanujit Dey and Pooja Tyagi.

Authors and Affiliations

Center for Surgery and Public Health, Department of Surgery, Brigham and Women’s Hospital, Harvard Medical School, Boston, USA
Tanujit Dey
Department of Biostatistics, Harvard T.H. Chan School of Public Health, 677 Huntington Ave, Boston, MA, 02115, USA
Pooja Tyagi, M. Benjamin Sabath, Leila Kamareddine, Danielle Braun & Francesca Dominici
Faculty of Arts and Sciences, Research Computing, Harvard University, 38 Oxford Street, Cambridge, MA, 02138, USA
M. Benjamin Sabath
Department of Civil, Environmental, and Infrastructure Engineering, George Mason University, 4400 University Drive, Fairfax, VA, 22030, USA
Lucas Henneman
Department of Data Science, Dana-Farber Cancer Institute, 450 Brookline Ave, Boston, MA, 02215, USA
Danielle Braun

Contributions

T.D. and P.T. led the statistical analyses and the drafting of the paper, including code and results for the supplemental material. D.B. directed its implementation, including quality assurance and control, and helped with the writing of the manuscript. L.K. reviewed the literature and edited the manuscript. L.H. conducted literature review and helped with the discussion. B.S. performed data acquisition from the Environmental Protection Agency and AirNow websites. F.D. directed the study implementation and the study’s analytic strategy, edited the manuscript, and prepared the discussion sections.

Corresponding author

Correspondence to Francesca Dominici.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Dey, T., Tyagi, P., Sabath, M.B. et al. Counterfactual time series analysis of short-term change in air pollution following the COVID-19 state of emergency in the United States. Sci Rep 11, 23517 (2021). https://doi.org/10.1038/s41598-021-02776-0

Received: 13 November 2020
Accepted: 19 November 2021
Published: 07 December 2021
DOI: https://doi.org/10.1038/s41598-021-02776-0
