Next Article in Journal
The Moderating Role of Emotion Regulation in the Recall of Negative Autobiographical Memories
Previous Article in Journal
Health by Words—A Content Analysis of Political Manifestos in the Portuguese Elections of 1975
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Spatio-Temporal Analysis of Influenza-Like Illness and Prediction of Incidence in High-Risk Regions in the United States from 2011 to 2020

1
Department of Epidemiology and Biostatistics, College of Public Health, Zhengzhou University, Zhengzhou 450001, China
2
Zhengzhou University Library, Zhengzhou University, Zhengzhou 450001, China
*
Author to whom correspondence should be addressed.
Int. J. Environ. Res. Public Health 2021, 18(13), 7120; https://doi.org/10.3390/ijerph18137120
Submission received: 8 May 2021 / Revised: 27 June 2021 / Accepted: 29 June 2021 / Published: 2 July 2021

Abstract

:
About 8% of the Americans contract influenza during an average season according to the Centers for Disease Control and Prevention in the United States. It is necessary to strengthen the early warning for influenza and the prediction of public health. In this study, Spatial autocorrelation analysis and spatial scanning analysis were used to identify the spatiotemporal patterns of influenza-like illness (ILI) prevalence in the United States, during the 2011–2020 transmission seasons. A seasonal autoregressive integrated moving average (SARIMA) model was constructed to predict the influenza incidence of high-risk states. We found the highest incidence of ILI was mainly concentrated in the states of Louisiana, District of Columbia and Virginia. Mississippi was a high-risk state with a higher influenza incidence, and exhibited a high-high cluster with neighboring states. A SARIMA (1, 0, 0) (1, 1, 0)52 model was suitable for forecasting the ILI incidence of Mississippi. The relative errors between actual values and predicted values indicated that the predicted values matched the actual values well. Influenza is still an important health problem in the United States. The spread of ILI varies by season and geographical region. The peak season of influenza was the winter and spring, and the states with higher influenza rates are concentrated in the southeast. Increased surveillance in high-risk states could help control the spread of the influenza.

1. Introduction

Influenza is caused by the influenza virus which mainly spreads through airborne droplets and direct contact. It has the characteristics of strong infectivity, rapid transmission and antigen variation. The activity of seasonal influenza begins to increase in October, most often peaks between December and February and can remain elevated until May. Influenza virus infections are very common and their incidence can only be estimated [1]. Previous estimates attributed to the World Health Organization indicated that 250,000–500,000 influenza-associated deaths occur annually, corresponding to estimates of 3.8–7.7 deaths per 100,000 individuals calculated using 2005 United Nations Department of Economic and Social Affairs World Population Prospects [2]. In particular, the 2017–2018 influenza season in the United States was notable for its high severity, with about 45 million illnesses and 810,000 influenza-associated hospitalizations throughout the United States [2].
As influenza may be characterized by fever, cough, sore throat, runny or stuffy nose, body aches, headache, chills or fatigue and so on, it is hard to diagnose as influenza, based on symptoms alone. A number of influenza tests are available to detect influenza viruses in respiratory specimens. The most common are called “rapid influenza diagnostic tests (RIDTs)” [3]. However, not all the people were tested for influenza, the number of reported cases of influenza may significantly underestimate the actual prevalence of influenza. A prospective study in two metropolitan areas of Taiwan showed that influenza-like symptoms were significant predictors for influenza infection. The combination of fever plus cough had the best sensitivity (86%), but the combination of fever plus cough and sneeze had the best specificity (77%) [4]. Therefore, influenza-like illness (ILI) can be used instead of influenza to estimate the trend of influenza [5].
Influenza activity varied widely across the country, making the use of national burden estimates difficult for state or county public health messaging, planning, and responses [2]. A study of the temporal and spatial distribution of weekly influenza in Pennsylvania suggested that there was spatial heterogeneity and spatiotemporal aggregation of influenza distribution [6]. Martin [7] used spatiotemporal statistical prediction models to estimate daily ILI-related emergency department visits. The Geographic Information System (GIS) is a useful epidemiological method for identifying high-risk areas in many infectious diseases [8]. Global and local Moran’s I spatial autocorrelation analyses are common methods to detect whether there is spatial autocorrelation and where the specific areas are located, separately. Spatiotemporal scanning statistics is a more comprehensive method that can analyze space and time distribution and set parameters more flexibly [9]. Identifying high-risk and endemic areas and the spatial−temporal distribution of infectious diseases is important for the development of prevention plans and health policies [10].
The prevalence of influenza displays a seasonal pattern [11]. For diseases that show recurrent seasonal patterns or occur in cyclic patterns, time-series models are the most widely used statistical models for forecasting [12]. The autoregressive integrated moving average (ARIMA) is a widely used predictive analysis model for nonstationary time-series [13]. Compared with ARIMA model, the seasonal autoregressive integrated moving average (SARIMA) model considers seasonal characteristics and can more accurately identify the fluctuation of the diseases over time. The SARIMA model has been used to predict influenza in many studies. Yuzhou [14] used the SARIMA model to control the effects of seasonality in the forecast of influenza epidemics, and the results showed that the SARIMA model had better predictive performance. Xin [15] constructed several SARIMA models to predict the incidence of influenza in different provinces in China, and concluded that time-series analysis was good tool for the prediction of disease incidence.
This study used spatial autocorrelation analysis and spatiotemporal scanning analysis to identify the spatiotemporal patterns of ILI epidemics in the United States, during the 2011–2020 transmission seasons. A time-series model was then constructed, based on the influenza data of high-risk states, to predict the future incidence of influenza, so as to provide theoretical guidance and scientific basis for the prevention and treatment of influenza [16].

2. Materials and Methods

2.1. Data Resources

Information on outpatient visits to health care providers for ILI is collected weekly through the United States Outpatient Influenza-like Illness Surveillance Network. For this system, the confirmed influenza case was “A patient who tests positive for influenza virus infection by an approved laboratory test”, and ILI is defined as “fever (temperature of 100 °F (37.8 °C) or greater) and a cough and/or a sore throat without a known cause other than influenza” (https://www.cdc.gov/flu/weekly/overview.htm, accessed on 24 November 2020).
We collected the ILI data from 1st week 2011 to 29th week 2020 from the Centers for Disease Control and Prevention (CDC). The data included the number of ILI cases in 49 states for different age groups. For hotspot states, time-series models were constructed by collecting data from 1st week 2011 to 52nd week 2018, and data from 1st week 2019 to 29th week 2020 were taken as test data to assess forecast performance [17].

2.2. Spatiotemporal Cluster Analysis

Moran’s I is an important index for analyzing the spatial correlation of diseases [18]. Moran’s I ranges from −1 to 1, where 0 indicates a random distribution of influenza. A value close to 1 indicates that the unit cluster has a similar value. A value close to −1 indicates that the unit with high values and low values are adjacent in space [19]. Based on its value and significance, Moran’s I can detect four types of cluster, including the high-high (HH), high-low (HL), low-low (LL) and low-high (LH) clustering patterns, respectively. The number of permutations was 999, and the significance level was 0.05 [20].
Spatiotemporal cluster analysis is a measurement of temporal and spatial correlation on the foundation of spatial autocorrelation with the further consideration of the time factor [21]. It can relate the spatial characteristics to the temporal characteristics of influenza [16]. During the study period, the cluster was detected by retrospective spatiotemporal permutation scanning statistics [22]. A retrospective study is an analysis of a fixed geographic area and research period. The satellite scanning software scans multiple start and end dates, and evaluates real-time clusters (continuing to the study period and date) and historical clusters (which did not exist before the end date of the study period). Spatiotemporal scan statistics are defined by a specific window with a circular geographic base and height corresponding to time. The window size was constantly adjusted to detect possible spatiotemporal clusters [22]. In order to scan for small to large clusters, the largest radius was set to 50% of the total population at risk, the largest height was set to 50% of the total study period [23]. The logarithmic likelihood ratio (LLR) was used to compare observed and expected numbers to identify specific clusters. After detecting the most likely spatiotemporal clusters, these clusters were tested by the Monte Carlo method [24].
Monte Carlo simulation generates random copies of the data set under appropriate null hypotheses to determine the statistical significance of these results. The p values for these tests are calculated by comparing the maximum likelihood levels from the real data set with the maximum likelihood levels from the random data set, where p = rank/(1 + number of simulations) [11]. The number of copies should be at least 999 times to ensure sufficient accuracy. Therefore, we use 999 Monte Carlo replications to estimate the importance level of these clusters. If the points conforming to the evaluated cluster maintained their aggregated pattern when compared with 999 randomized simulations of the entire dataset, then it was considered important [25].

2.3. Time-Series Analysis

Time-series analysis has the advantage of predicting incidence. It is characterized by the number of patients in the past and responds by predicting the number of patients in the future. The SARIMA model is based on the sequential lag relationship existing in time-series data and more suitable for forecasting when the data has obvious seasonal characteristics [13]. The SARIMA model can be expressed as: SARIMA (p, d, q) (P, D, Q)s. Letters p, d, q are the order of autoregression, the order of difference and the order of moving average; Letters P, D, Q are the order of seasonal autoregression, the order of difference and the order of moving average, and s is the specific value of cycle, the cycle of American influenza is 52 weeks (s = 52) [13].
The process of establishing the SARIMA model was divided into three steps: First, a weekly time-series plot of incidence (per 100,000 population) was drawn to check for stationarity and seasonality. The model was constructed according to the autocorrelation function (ACF) and partial autocorrelation function (PACF) of the model residuals. Secondly, ACF and PACF for estimating residuals were tested by Ljung-Box Q test, and the minimum of the Bayesian information criterion (BIC) was taken as the optimal SARIMA model. Finally, the model was applied to forecast the weekly ILI incidence for 30th week 2020 to 52nd week 2021.

2.4. Statistical Analysis

The data was organized by Microsoft Excel 2013. The SARIMA model was constructed by R 3.6.0 and SPSS 27.0. The value of Moran’s I and local indicators of spatial association were calculated by GeoDa 1.14.0. The time scan statistic was measured with SaTScanTM9.5. All the maps were drawn by ArcGIS 10.0.

3. Results

3.1. Epidemiological Analysis

Included in our study were a total of 9,065,910 ILI cases from 1st week 2011 to 29th week 2020 in the United States. The ILI annual infection rate fluctuated from 5.92 to 15.84 per 100,000 population. ILI occurred throughout the year, most often peaked between December and February and lasted until May.
In terms of age, the number of ILI cases in the age group 5–24 years old was the most, and these groups accounted for about 35 percent, while the number of patients in the age group over 65 years old was the least, accounting for about 7 percent (Figure 1). The difference between different age groups had a statistical significance (p < 0.001).
This study collected the population of 49 states and visualized them on the map (Figure 2), and found no obvious association between population density and influenza incidence.

3.2. Spatiotemporal Analysis

Overall, the highest cumulative incidence of ILI (per 100,000 population) during the study period was seen in the states of Louisiana, District of Columbia and Virginia, which reported 12,200, 9563 and 9554 cases, respectively. The lowest cumulative incidence of ILI incidence was reported from the states of Ohio, Washington and Iowa (Figure 3).

3.2.1. Global Spatial Autocorrelation

The global spatial autocorrelation analysis for ILI suggested a clustering distribution at the state level in the years of 2012 to 2017, the global Moran’s I reached the significance level of 0.05. In contrast, the global Moran’s I for 2011, 2018 and 2019 display no significant spatial autocorrelation, though Moran’s I greater than 0 (Table 1).

3.2.2. Local Spatial Autocorrelation

Local spatial autocorrelation analysis reveals only the relative states, rather than absolute correlations. Only those states whose local Moran’s I have reached the significance level of 0.05 will be present on the map. From 2011 to 2019, the local spatial autocorrelation showed three HH clusters in total with two HL clusters, four LH and three LL clusters. HH clusters were observed in the states of Louisiana (5 years), Mississippi (4 years), and District of Columbia (1 year). Louisiana and Mississippi had HH clusters for long periods. HL clusters were observed in the states of Illinois (4 years), and Oregon (2 years). LH clusters were observed in the states of Tennessee (4 years), Maryland (6 years), Arkansas (5 years), and Texas (3 years). LL clusters appeared in the northeastern part of the United States only in 2018 and 2019 (Figure 4).

3.2.3. Spatiotemporal Cluster Analysis

The spatiotemporal cluster analysis detected 23 clusters of ILI in the study period. The clusters were particularly obvious in spring and winter. For example, the risk ratio (RR) was highest in 2015, with three levels of clustering. Level 1, with Louisiana at the center of high incidence area and two surrounding states, the risk of ILI in this area was 11.66 times more likely to develop the disease than other areas (LLR = 69,009, p < 0.001). Level 2, with Virginia at the center of a high incidence area and three surrounding states, the risk of ILI in this area was 9.79 times more likely to develop the disease than other areas (LLR = 73,277, p < 0.001). Level 3, with New Mexico at the center of high incidence area and three surrounding states, the risk of ILI in this area was 3.38 times more likely to develop the disease than other areas (LLR = 26,518, p < 0.001). At the same time, the states with a high cluster in the local spatial autocorrelation analysis were all located in the high cluster area, the results were consistent. From the cluster time, the high incidence time mainly occurs between January and March (Table 2).

3.3. Time-Series Analysis

Based on the result of spatiotemporal analysis, the HH cluster was mainly in Mississippi and Louisiana. In particular, Mississippi has been exhibiting an HH cluster in recent years. So we predict the incidence of ILI in Mississippi by time-series analysis.
Using raw training data from 1st week 2011 to 52nd week 2018, the trend difference (d = 0) and seasonal difference (D = 1) were calculated (Figure 5). The augmented Dickey−Fuller Test indicated the sequence was stationary (t = −3.98, p = 0.01). The ACF and PACF plots were used to estimate the parameter ranges of p, P and q, Q [26]. After checking ACF and PACF plots, SARIMA (1, 0, 0) (1, 1, 0)52 was the best fitted model with lowest AIC and BIC values, and the Ljung−Box Q Test of this model is valid ( χ 2 = 21.822, p = 0.149), indicating it was a white noise sequence. All the parameter estimates were significant (Table 3).
The model SARIMA (1, 0, 0) (1, 1, 0)52 forecasting effect was tested by comparing the predicted values with the observed values from 1st week 2019 to 29th week 2020. As Figure 6 shows, the black and blue lines represent the observed values and predicted values, respectively, and the dark gray and light gray represent 80% and 95% confidence intervals, respectively. The predicted trend of ILI incidence was basically consistent with the actual trend, and both the root mean squared error (RMSE) and mean absolute percent error (MAPE) were small, indicating that the model prediction results were reliable. Then, forecasting the ILI incidence from 30th week 2020 to 52nd week 2021 by SARIMA. The forecast results showed that there was a high ILI incidence in winter and spring, and low ILI incidence in summer and autumn. The incidence of ILI will reach its peak in the 6th week 2021 (Table 4).

4. Discussion

Influenza is a contagious respiratory illness caused by influenza viruses. It can cause mild to severe illness. Serious outcomes of influenza infection can result in hospitalization or death. Due to the low detection rates, it is easy to underestimate the severity of influenza. The weekly ILI tests conducted by the CDC can effectively remind us of influenza trends. The main purpose of this study was to explore the epidemiological characteristics of ILI incidence, identify the states and possible clusters with high ILI incidence in the United States through spatiotemporal analysis, and then construct a SARIMA model to realize the short-term prediction of ILI incidence.
In the descriptive analysis, we recorded age characteristics, seasonal peaks and regional differences. Those individuals most at risk for severe symptoms and complications from this virus are the very young, vulnerable older adults, pregnant women, immunocompromised individuals of all ages, and those with chronic comorbid conditions [27]. Many studies indicated that influenza viruses caused severe morbidity and mortality in the elderly [28]. However, in this study, people aged 65 and over accounted for the lowest proportion of ILI cases. One of the reasons is that the cardinal number of population of 65 years and over is small. During the study period, a gradual increase in ILI incidence was observed, particularly during the 2018 and 2019 influenza seasons, with a sharp increase in the incidence of ILI. There are many reasons to explain the influenza outbreaks. Both extreme weather and insufficient vaccines are important reasons that affect the incidence of influenza [29].
Spatiotemporal analysis was used to identify high-risk areas for multiple diseases [20]. Yue [30] and Freitas [31] used spatiotemporal analysis to identify the spatial clustering characteristics of dengue fever cases. Liu [23] used spatiotemporal scanning analysis to explore the high-risk areas of hand, foot and mouth disease. In this study, the high incidence of influenza was mainly concentrated in the states of Louisiana, Virginia and Mississippi. Spatiotemporal analysis revealed the HH clusters and high-risk states were mainly located in Mississippi, and the time clusters were mainly concentrated in January to March. This finding was confirmed by other studies [32,33]. Time-series analysis has the advantage of predicting the incidence. It is characterized by the number of patients in the past and responds by predicting the number of patients in the future [13]. The prediction showed that the 95% confidence interval of the predicted ILI incidence almost contained the observed value. The RMSE and MAPE were small, which supported that the SARIMA model was effective in the prediction of ILI. Then, we used this model to forecast the ILI incidence from 30th week 2020 to 52nd week 2021. The results demonstrated that ILI incidence will increase in 45th week 2020 and peak in 6th week 2021, and the distribution is similar to the previous years.
Influenza viruses spread through human contact. Therefore, geography and population density are potential factors of influenza transmission [34]. According to Garrett’s research, the high population density will accelerate the spread of influenza [35]. In this study, the Northeast and Southwest were the most densely populated areas with the lower ILI incidence in the United States. Mississippi is a mostly rural state with a low population density and the highest incidence, different from the results of Garrett’s research [35]. The reason for the results of this study might be the lower economic level of Mississippi [35,36]. During epidemics, the poorest part of the population usually suffers the most. In addition, Mississippi is also the state with the highest proportion of black Americans. Many studies have shown that black individuals have a higher proportion of influenza cases [37,38,39].
Transmission of influenza varies across seasons and geographical areas in the United States. The obvious temporal clusters during the winter and spring, which was in accordance with the seasonality of the respiratory disease [40]. Most parts of the United States are temperate or subtropical climate. The continental climate zone in the central plain is characterized by the cold winters. Winters in temperate regions are characterized by an average temperature between 0 °C and 20 °C, with the minimum temperature dropping to as low as −40 °C in some regions. Generally speaking, influenza peaks in temperate regions in winter. However, the seasonal pattern in subtropical regions seem to be more complicated [29]. Influenza transmission is influenced by variations meteorological variables such as temperature, absolute humidity and precipitation. In the annual influenza epidemics of the United States, the transmission of influenza increases during periods with low precipitation and absolute humidity [41]. Absolute humidity is the total water content in the air. The survival rate of the influenza virus increases at lower absolute humidity levels. Relative humidity increases with the high precipitation. High relative humidity will accelerate the accumulation of respiratory droplets, which reduces the spread of influenza virus. In contrast, low relative humidity is favorable to the spread of influenza virus. The suitable temperature range activating influenza viral transmission could partially explain the common winter epidemics in the central regions [42].

5. Limitations

There is need to highlight some limitations that may be associated with our study outcomes. First, since all influenza activity reporting by public health partners and health-care providers was voluntary, it was difficult to maintain the quality and consistency of the source of data. Second, the influenza incidence of 2020 may be underestimated due to the masks. Therefore, the 2020 data was used for model testing instead of model building. Third, the influence factors of influenza activity were not studied in-depth in this study. In the next step, the risk factors need being explored.

6. Conclusions

In this study, we found the high-risk clusters were concentrated in the southeast, and the incidence of influenza may reach its peak in the 6th week 2021. In order to limit the spread of the outbreaks, surveillance activities and health education should be selectively carried out in higher incidence areas in the epidemic season.

Author Contributions

Conceptualization, X.S.; methodology, J.B.; writing—original draft preparation, Z.S.; writing—review and editing, X.J., Y.Y. and H.Z.; All authors have read and agreed to the published version of the manuscript.

Funding

This research did not receive any funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are openly available in the Centers for Disease Control and Prevention (CDC; https://www.cdc.gov/flu/weekly/, accessed on 24 November 2020).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Tokars, J.I.; Olsen, S.J.; Reed, C. Seasonal Incidence of Symptomatic Influenza in the United States. Clin. Infect. Dis. 2018, 66, 1511–1518. [Google Scholar] [CrossRef]
  2. Iuliano, A.D.; Roguski, K.M.; Chang, H.H.; Muscatello, D.J.; Palekar, R.; Tempia, S.; Cohen, C.; Gran, J.M.; Schanzer, D.; Cowling, B.J.; et al. Estimates of global seasonal influenza-associated respiratory mortality: A modelling study. Lancet 2018, 391, 1285–1300. [Google Scholar] [CrossRef]
  3. CDC. Diagnosing Flu. Available online: https://www.cdc.gov/flu/symptoms/testing.htm (accessed on 11 November 2020).
  4. Humiston, S.G.; Pham, T.N. Influenza-Like Illness Diagnosis and Management in the Acute Care Setting. Pediatric Emerg. Care 2016, 32, 875–882. [Google Scholar] [CrossRef]
  5. Pei, S.; Kandula, S.; Yang, W.; Shaman, J. Forecasting the spatial transmission of influenza in the United States. Proc. Natl. Acad. Sci. USA 2018, 115, 2752–2757. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  6. Stark, J.H.; Sharma, R.; Ostroff, S.; Cummings, D.A.; Ermentrout, B.; Stebbins, S.; Burke, D.S.; Wisniewski, S.R. Local spatial and temporal processes of influenza in Pennsylvania, USA: 2003–2009. PLoS ONE 2012, 7, e34245. [Google Scholar] [CrossRef] [Green Version]
  7. Martin, L.J.; Dong, H.; Liu, Q.; Talbot, J.; Qiu, W.; Yasui, Y. Predicting influenza-like illness-related emergency department visits by modelling spatio-temporal syndromic surveillance data. Epidemiol. Infect. 2019, 147, e312. [Google Scholar] [CrossRef] [Green Version]
  8. Salimi, M.; Jesri, N.; Javanbakht, M.; Farahani, L.Z.; Shirzadi, M.R.; Saghafipour, A. Spatio-temporal distribution analysis of zoonotic cutaneous leishmaniasis in Qom Province, Iran. J. Parasit. Dis. 2018, 42, 570–576. [Google Scholar] [CrossRef] [PubMed]
  9. Liu, M.Y.; Li, Q.H.; Zhang, Y.J.; Ma, Y.; Liu, Y.; Feng, W.; Hou, C.B.; Amsalu, E.; Li, X.; Wang, W.; et al. Spatial and temporal clustering analysis of tuberculosis in the mainland of China at the prefecture level, 2005–2015. Infect. Dis. Poverty 2018, 7, 106. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  10. Alimohamadi, Y.; Zahraei, S.M.; Karami, M.; Yaseri, M.; Lotfizad, M.; Holakouie-Naieni, K. Spatio-temporal analysis of Pertussis using geographic information system among Iranian population during 2012–2018. Med. J. Islamic Repub. Iran. 2020, 34, 22. [Google Scholar] [CrossRef]
  11. Onozuka, D.; Hagihara, A. Spatial and temporal dynamics of influenza outbreaks. Epidemiology 2008, 19, 824–828. [Google Scholar] [CrossRef]
  12. Petukhova, T.; Ojkic, D.; McEwen, B.; Deardon, R.; Poljak, Z. Assessment of autoregressive integrated moving average (ARIMA), generalized linear autoregressive moving average (GLARMA), and random forest (RF) time series regression models for predicting influenza A virus frequency in swine in Ontario, Canada. PLoS ONE 2018, 13, e0198313. [Google Scholar] [CrossRef] [PubMed]
  13. Rao, X.; Chen, Z.; Dong, H.; Zhu, C.; Yan, Y. Epidemiology of influenza in hospitalized children with respiratory tract infection in Suzhou area from 2016 to 2019. J. Med. Virol. 2020. [Google Scholar] [CrossRef] [PubMed]
  14. Zhang, Y.Z.; Bambrick, H.; Mengersen, K.; Tong, S.L.; Hu, W.B. Using Google Trends and ambient temperature to predict seasonal influenza outbreaks. Environ. Int. 2018, 117, 284–291. [Google Scholar] [CrossRef] [PubMed]
  15. Song, X.; Xiao, J.; Deng, J.; Kang, Q.; Zhang, Y.; Xu, J. Time series analysis of influenza incidence in Chinese provinces from 2004 to 2011. Medicine 2016, 95, e3929. [Google Scholar] [CrossRef] [PubMed]
  16. He, D.; Lui, R.; Wang, L.; Tse, C.K.; Yang, L.; Stone, L. Global Spatio-temporal Patterns of Influenza in the Post-pandemic Era. Sci. Rep. 2015, 5, 11013. [Google Scholar] [CrossRef] [Green Version]
  17. Lu, J.; Meyer, S. Forecasting Flu Activity in the United States: Benchmarking an Endemic-Epidemic Beta Model. Int. J. Environ. Res. Public Health 2020, 17, 1381. [Google Scholar] [CrossRef] [Green Version]
  18. Wang, Y.; Yang, Y.; Shi, X.; Mao, S.; Shi, N.; Hui, X. The spatial distribution pattern of human immunodeficiency virus/acquired immune deficiency syndrome in China. Geospat. Health 2016, 11, 414. [Google Scholar] [CrossRef] [Green Version]
  19. Zhu, B.; Liu, J.L.; Fu, Y.; Zhang, B.; Mao, Y. Spatio-Temporal Epidemiology of Viral Hepatitis in China (2003-2015): Implications for Prevention and Control Policies. Int. J. Environ. Res. Public Health 2018, 15, 661. [Google Scholar] [CrossRef] [Green Version]
  20. Mao, Y.; He, R.; Zhu, B.; Liu, J.; Zhang, N. Notifiable Respiratory Infectious Diseases in China: A Spatial-Temporal Epidemiology Analysis. Int. J. Environ. Res. Public Health 2020, 17, 2301. [Google Scholar] [CrossRef] [Green Version]
  21. Dong, W.; Yang, K.; Xu, Q.L.; Yang, Y.L. A Predictive Risk Model for A(H7N9) Human Infections Based on Spatial-Temporal Autocorrelation and Risk Factors: China, 2013–2014. Int. J. Environ. Res. Public Health 2015, 12, 15204–15221. [Google Scholar] [CrossRef]
  22. Gu, H.; Fan, W.; Liu, K.; Qin, S.; Li, X.; Jiang, J.; Chen, E.; Zhou, Y.; Jiang, Q. Spatio-temporal variations of typhoid and paratyphoid fevers in Zhejiang Province, China from 2005 to 2015. Sci. Rep. 2017, 7, 5780. [Google Scholar] [CrossRef]
  23. Khudyakov, Y.E.; Liu, Y.; Wang, X.; Liu, Y.; Sun, D.; Ding, S.; Zhang, B.; Du, Z.; Xue, F. Detecting Spatial-Temporal Clusters of HFMD from 2007 to 2011 in Shandong Province, China. PLoS ONE 2013, 8, e63447. [Google Scholar] [CrossRef] [Green Version]
  24. Kulldorff, M.; Athas, W.F.; Feurer, E.J.; Miller, B.A.; Key, C.R. Evaluating cluster alarms: A space-time scan statistic and brain cancer in Los Alamos, New Mexico. Am. J. Public Health 1998, 88, 1377–1380. [Google Scholar] [CrossRef] [Green Version]
  25. Fuentes-Vallejo, M. Space and space-time distributions of dengue in a hyper-endemic urban space: The case of Girardot, Colombia. BMC Infect. Dis. 2017, 17, 512. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  26. Cong, J.; Ren, M.; Xie, S.; Wang, P. Predicting Seasonal Influenza Based on SARIMA Model, in Mainland China from 2005 to 2018. Int. J. Environ. Res. Public Health 2019, 16, 4760. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  27. Keilman, L.J. Seasonal Influenza (Flu). Nurs. Clin. N. Am. 2019, 54, 227–243. [Google Scholar] [CrossRef] [PubMed]
  28. Talbot, H.K. Influenza in Older Adults. Infect. Dis. Clin. N. Am. 2017, 31, 757–766. [Google Scholar] [CrossRef] [PubMed]
  29. Zhang, Y.; Ye, C.; Yu, J.; Zhu, W.; Wang, Y.; Li, Z.; Xu, Z.; Cheng, J.; Wang, N.; Hao, L.; et al. The complex associations of climate variability with seasonal influenza A and B virus transmission in subtropical Shanghai, China. Sci. Total Environ. 2020, 701, 134607. [Google Scholar] [CrossRef]
  30. Yue, Y.; Liu, X.; Xu, M.; Ren, D.; Liu, Q. Epidemiological dynamics of dengue fever in mainland China, 2014–2018. Int. J. Infect. Dis. 2019, 86, 82–93. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  31. Freitas, L.P.; Cruz, O.G.; Lowe, R.; Sa Carvalho, M. Space-time dynamics of a triple epidemic: Dengue, chikungunya and Zika clusters in the city of Rio de Janeiro. Proc. Biol. Sci. 2019, 286, 20191867. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  32. Zhang, Y.; Wang, X.; Li, Y.; Ma, J. Spatiotemporal Analysis of Influenza in China, 2005–2018. Sci. Rep. 2019, 9, 19650. [Google Scholar] [CrossRef] [Green Version]
  33. Lu, F.S.; Hattab, M.W.; Clemente, C.L.; Biggerstaff, M.; Santillana, M. Improved state-level influenza nowcasting in the United States leveraging Internet-based data and network approaches. Nat. Commun. 2019, 10, 147. [Google Scholar] [CrossRef] [PubMed]
  34. Chandra, S.; Kassens-Noor, E.; Kuljanin, G.; Vertalka, J. A geographic analysis of population density thresholds in the influenza pandemic of 1918–1919. Int. J. Health Geogr. 2013, 12, 9. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. Acuna-Soto, R.; Viboud, C.; Chowell, G. Influenza and pneumonia mortality in 66 large cities in the United States in years surrounding the 1918 pandemic. PLoS ONE 2011, 6, e23467. [Google Scholar] [CrossRef] [Green Version]
  36. Grantz, K.H.; Rane, M.S.; Salje, H.; Glass, G.E.; Schachterle, S.E.; Cummings, D.A.T. Disparities in influenza mortality and transmission related to sociodemographic factors within Chicago in the pandemic of 1918. Proc. Natl. Acad. Sci. USA 2016, 113, 13839–13844. [Google Scholar] [CrossRef] [Green Version]
  37. Okland, H.; Mamelund, S.E. Race and 1918 Influenza Pandemic in the United States: A Review of the Literature. Int. J. Environ. Res. Public Health 2019, 16, 2487. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  38. Lippert, J.F.; Buscemi, J.; Saiyed, N.; Silva, A.; Benjamins, M.R. Influenza and Pneumonia Mortality Across the 30 Biggest U.S. Cities: Assessment of Overall Trends and Racial Inequities. J. Racial Ethn. Health Disparities 2021. [Google Scholar] [CrossRef] [PubMed]
  39. Hutchins, S.S.; Fiscella, K.; Levine, R.S.; Ompad, D.C.; McDonald, M. Protection of racial/ethnic minority populations during an influenza pandemic. Am. J. Public Health 2009, 99 (Suppl. S2), S261–S270. [Google Scholar] [CrossRef]
  40. Chow, E.J.; Doyle, J.D.; Uyeki, T.M. Influenza virus-related critical illness: Prevention, diagnosis, treatment. Crit. Care 2019, 23, 214. [Google Scholar] [CrossRef] [Green Version]
  41. Metz, J.A.; Finn, A. Influenza and humidity—Why a bit more damp may be good for you! J. Infect. 2015, 71, S54–S58. [Google Scholar] [CrossRef]
  42. Dave, K.; Lee, P.C. Global Geographical and Temporal Patterns of Seasonal Influenza and Associated Climatic Factors. Epidemiol. Rev. 2019, 41, 51–68. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Epidemiological characteristics of influenza and ILI in the United States from 1st week 2011 to 29th week 2020. (a) Weekly ILI cases of different age, (b) age distribution of ILI.
Figure 1. Epidemiological characteristics of influenza and ILI in the United States from 1st week 2011 to 29th week 2020. (a) Weekly ILI cases of different age, (b) age distribution of ILI.
Ijerph 18 07120 g001
Figure 2. Map of population density by state in the United States. The map was created by ArcGIS software.
Figure 2. Map of population density by state in the United States. The map was created by ArcGIS software.
Ijerph 18 07120 g002
Figure 3. Maps of ILI incidence for each state in the United States from 2011 to 2019 by ArcGIS software.
Figure 3. Maps of ILI incidence for each state in the United States from 2011 to 2019 by ArcGIS software.
Ijerph 18 07120 g003
Figure 4. Maps of local spatial autocorrelation cluster of ILI for each state in the United States from 2011 to 2019 by GeoDa software.
Figure 4. Maps of local spatial autocorrelation cluster of ILI for each state in the United States from 2011 to 2019 by GeoDa software.
Ijerph 18 07120 g004
Figure 5. The time diagram, ACF and PACF graphs for estimating the parameter: (a) the time diagram of ILI incidence after one-order seasonal difference data, (b) the ACF graph of the raw data (d = 0, D = 0), (c) the PACF graph of the raw data (d = 0, D = 0), (d) the time diagram of ILI incidence a, (e) the ACF graph of one-order seasonal difference data (d = 0 and D = 1), (f) the PACF graph of one-order seasonal difference data (d = 0 and D = 1).
Figure 5. The time diagram, ACF and PACF graphs for estimating the parameter: (a) the time diagram of ILI incidence after one-order seasonal difference data, (b) the ACF graph of the raw data (d = 0, D = 0), (c) the PACF graph of the raw data (d = 0, D = 0), (d) the time diagram of ILI incidence a, (e) the ACF graph of one-order seasonal difference data (d = 0 and D = 1), (f) the PACF graph of one-order seasonal difference data (d = 0 and D = 1).
Ijerph 18 07120 g005
Figure 6. Comparison of actual and predicted incidence of ILI in the United States.
Figure 6. Comparison of actual and predicted incidence of ILI in the United States.
Ijerph 18 07120 g006
Table 1. Global spatial autocorrelation analysis.
Table 1. Global spatial autocorrelation analysis.
YearMoran’s IE(I)MeanSZ-Valuep-Value
20110.099−0.021−0.0230.0901.3580.093
20120.185−0.021−0.0190.0952.1510.028
20130.181−0.021−0.0190.0942.1210.031
20140.200−0.021−0.0210.0922.4010.022
20150.166−0.021−0.2220.0912.0730.037
20160.177−0.021−0.0240.0932.1500.029
20170.146−0.021−0.0250.0871.9810.039
20180.074−0.021−0.0230.0921.0530.145
20190.074−0.021−0.0210.0901.0530.139
Year: “year” represents 1st to 52nd week of each year.
Table 2. Spatiotemporal scan of ILI in the United States from 2011 to 2019.
Table 2. Spatiotemporal scan of ILI in the United States from 2011 to 2019.
YearLevelCenterNCluster PeriodCoordinates/Radius(km)ObservedCasesExpectedCasesRRLLRp-Value
20111Kentucky152011-01-01 to 2011-02-28(37.5 N, 85.3 W)/738.2121,82924,4856.22108,601<0.001
2Colorado132011-01-01 to 2011-02-28(39.0 N, 105.5 W)/1005.262,52715,9843.914.32<0.001
20121Mississippi32012-10-01 to 2012-12-31(32.8 N, 89.7 W)/289.237,83146988.6546,953<0.001
2Virginia12012-09-01 to 2012-12-31(37.5 N, 78.8 W)/0.029,13042347.2631,944<0.001
3Nebraska172012-01-01 to 2012-03-31(41.5 N, 99.8 W)/1116.0265,30132,6932.1513,777<0.001
20131Virginia32013-01-01 to 2013-03-31(37.5 N, 78.8 W)/222.146,638475010.6166,247<0.001
2Texas152013-01-01 to 2013-2-28(31.5 N, 99.4 W)/1327.5101,77327,3334.3264,734<0.001
20141Mississippi32014-10-01 to 2014-12-31(32.8 N, 89.7 W)/289.245,12556868.5255,411<0.001
2Virginia32014-01-01 to 2014-04-30(37.5 N, 78.8 W)/222.143,12765247.0646,035<0.001
3New Mexico122014-01-01 to 2014-02-28(34.4 N, 106.1 W)/1249.554,63918,3613.1824,494<0.001
20151Louisiana22015-01-01 to 2015-04-30(31.1 N, 92.0 W)/289.245,837425411.6669,009<0.001
2Virginia32015-01-01 to 2015-04-30(37.5 N, 78.8 W)/222.154,66661349.7973,277<0.001
3New Mexico122015-01-01 to 2015-02-28(34.4 N, 106.1 W)/1249.554,31117,2643.3826,518<0.001
20161Virginia32016-01-01 to 2016-05-31(37.5 N, 78.8 W)/222.163,28779748.8178,624<0.001
2Arizona32016-01-01 to 2016-04-30(34.3 N, 111.7 W)/559.132,31971554.7324,144<0.001
3Mississippi102016-01-01 to 2016-04-30(32.8 N, 89.7 W)/820.6104,31434,6083.4750,171<0.001
20171Florida132017-01-01 to 2016-03-31(28.6 N, 82.5 W)/1247.2192,62551,3394.63127,775<0.001
2Oregon12017-11-01 to 2017-12-31(43.9 N, 120.6 W)/0.0681917144.014329<0.001
3Colorado142017-01-01 to 2017-02-28(39.0 N, 105.5 W)/1024.354,13925,1852.2313,031<0.001
20181Florida132018-01-01 to 2018-02-28(28.6 N, 82.5 W)/1247.2259,85547,1376.89253,645<0.001
2Wyoming132018-01-01 to 2018-02-28(43.0 N, 107.6 W)/1052.774,81219,5124.0446,665<0.001
20191Colorado72019-01-01 to 2019-03-31(39.0 N, 105.5 W)/747.682,74619,1644.5258,874<0.001
2Florida132019-01-01 to 2019-03-31(28.6 N, 85.5 W)/1247.2326,62594,5254.16193,772<0.001
RR: risk ratio; LLR: logarithmic likelihood ratio.
Table 3. Comparison of candidate SARIMA models.
Table 3. Comparison of candidate SARIMA models.
ModelEstimatetpLjung-Box Q TestAICBICRMSEMAPE
Statisticsp
SARIMA (1, 0, 0) (1, 1, 0)52---21.8220.1492235.5302247.2204.67314.290
AR10.88636.768<0.001------
SAR1−0.60714.350<0.001------
SARIMA (1, 0, 1) (1, 1, 0)52---20.9620.1382235.1102250.7004.65514.368
AR10.86528.837<0.001------
MA10.0971.54100.065------
SAR1−0.612−14.495<0.001------
SARIMA (2, 0, 0) (1, 1, 0)52---20.7340.1462201.1002216.70014.3900.970
AR10.95718.254<0.001------
AR2−0.080−1.5110.067------
SAR1−0.612−14.495<0.001------
SARIMA (2, 0, 1) (1, 1, 0)52---18.5520.1832233.5302253.0124.63614.602
AR10.1310.6950.245------
AR20.6533.744<0.001------
MA10.8355.088<0.001------
SAR1−0.60514.157<0.001------
SARIMA (2, 0, 2) (1, 1, 0)52---18.4050.1432233.4802256.874.59914.906
AR1−0.080−2.1610.018------
AR20.82526.101<0.001------
MA11.06416.272<0.001------
MA20.0640.9940.163------
SAR1−0.611−14.45<0.001------
AIC: Akaike information criterion; BIC: Bayesian information; RMSE: root mean squared error; MAPE: mean absolute percent error.
Table 4. Predictive value of ILI incidence (per 100,000).
Table 4. Predictive value of ILI incidence (per 100,000).
Year/WeekIncidence95% CIYear/WeekIncidence95% CI
2020/305.504−4.248–15.2562021/1611.536−8.684–31.755
2020/315.163−7.801–18.1282021/1710.344−9.876–30.563
2020/324.681−10.288–19.6512021/188.225−11.995–28.444
2020/336.943−9.399–23.2842021/197.892−12.327–28.112
2020/348.422−8.900–25.7432021/207.633−12.587–27.853
2020/359.549−8.488–27.5872021/216.467−13.753–26.687
2020/3611.084−7.484–29.6522021/227.606−12.613–27.826
2020/379.433−9.532–28.3982021/235.670−14.550–25.890
2020/389.993−9.271–29.2582021/245.411−14.809–25.63
2020/3911.696−7.795–31.1872021/255.784−14.436–26.004
2020/4011.710−7.953–31.3732021/265.468−14.751–25.688
2020/4111.726−8.068–31.5202021/274.498−15.721–24.718
2020/4212.617−7.276–32.5112021/284.990−15.230–25.21
2020/4315.611−4.359–35.5812021/294.672−15.548–24.892
2020/4415.406−4.623–35.4342021/304.823−15.851–25.496
2020/4520.7540.681–40.8272021/315.010−16.006–26.025
2020/4623.0212.913–43.1282021/324.910−16.364–26.184
2020/4726.5656.431–46.6982021/336.056−15.414–27.526
2020/4830.27010.116–50.4232021/348.693−12.926–30.313
2020/4924.7894.619–44.9582021/359.995−11.739–31.728
2020/5026.0305.849–46.2112021/3611.233−10.587–33.054
2020/5131.14410.954–51.3342021/379.925−11.962–31.812
2020/5234.81914.622–55.0162021/3810.378−11.560–32.316
2021/0129.3019.099–49.5032021/3911.596−10.380–33.573
2021/0223.1572.951–43.3632021/4011.996−10.010–34.003
2021/0325.9955.785–46.2042021/4111.853−10.176–33.883
2021/0432.19711.985–52.4092021/4212.471−9.577–34.518
2021/0540.47920.266–60.6932021/4317.086−4.975–39.147
2021/0654.81134.596–75.0262021/4415.498−6.574–37.569
2021/0754.06433.848–74.282021/4523.6411.562–45.720
2021/0843.28123.064–63.4982021/4626.9434.858–49.028
2021/0937.15516.938–57.3732021/4733.26211.172–55.352
2021/1031.50611.288–51.7242021/4836.76714.674–58.86
2021/1124.7054.487–44.9242021/4929.5577.461–51.654
2021/1220.8390.620–41.0582021/5030.7048.606–52.802
2021/1317.175−3.044–37.3942021/5136.92814.828–59.027
2021/1414.670−5.549–34.8892021/5240.73918.638–62.84
2021/1511.900−8.319–32.119
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Song, Z.; Jia, X.; Bao, J.; Yang, Y.; Zhu, H.; Shi, X. Spatio-Temporal Analysis of Influenza-Like Illness and Prediction of Incidence in High-Risk Regions in the United States from 2011 to 2020. Int. J. Environ. Res. Public Health 2021, 18, 7120. https://doi.org/10.3390/ijerph18137120

AMA Style

Song Z, Jia X, Bao J, Yang Y, Zhu H, Shi X. Spatio-Temporal Analysis of Influenza-Like Illness and Prediction of Incidence in High-Risk Regions in the United States from 2011 to 2020. International Journal of Environmental Research and Public Health. 2021; 18(13):7120. https://doi.org/10.3390/ijerph18137120

Chicago/Turabian Style

Song, Zhijuan, Xiaocan Jia, Junzhe Bao, Yongli Yang, Huili Zhu, and Xuezhong Shi. 2021. "Spatio-Temporal Analysis of Influenza-Like Illness and Prediction of Incidence in High-Risk Regions in the United States from 2011 to 2020" International Journal of Environmental Research and Public Health 18, no. 13: 7120. https://doi.org/10.3390/ijerph18137120

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop