ABSTRACT
Forecasting spatially correlated time series data is challenging because of the linear and non-linear dependencies in the temporal and spatial dimensions. Air quality forecasting is one canonical example of such tasks. Existing work, e.g., auto-regressive integrated moving average (ARIMA) and artificial neural network (ANN), either fails to model the non-linear temporal dependency or cannot effectively consider spatial relationships between multiple spatial time series data. In this paper, we present an approach for forecasting short-term PM2.5 concentrations using a deep learning model, the geo-context based diffusion convolutional recurrent neural network, GC-DCRNN. The model describes the spatial relationship by constructing a graph based on the similarity of the built environment between the locations of air quality sensors. The similarity is computed using the surrounding "important" geographic features regarding their impacts to air quality for each location (e.g., the area size of parks within a 1000-meter buffer, the number of factories within a 500-meter buffer). Also, the model captures the temporal dependency leveraging the sequence to sequence encoder-decoder architecture. We evaluate our model on two real-world air quality datasets and observe consistent improvement of 5%-10% over baseline approaches.
- Martín Abadi, Paul Barham, Jianmin Chen, Zhifeng Chen, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Geoffrey Irving, and Michael Isard. 2016. Tensorflow: a system for large-scale machine learning. In OSDI, Vol. 16. 265--283. Google ScholarDigital Library
- Saeed Aghabozorgi, Ali Seyed Shirkhorshidi, and Teh Ying Wah. 2015. Time-series clustering-A decade review. Information Systems 53 (2015), 16--38. Google ScholarDigital Library
- Asha B. Chelani and Sukumar Devotta. 2006. Air quality forecasting using a hybrid autoregressive and nonlinear model. Atmospheric Environment 40, 10 (2006), 1774--1780.Google ScholarCross Ref
- Junyoung Chung, Çaglar Gülçehre, KyungHyun Cho, and Yoshua Bengio. 2014. Empirical evaluation of gated recurrent neural networks on sequence modeling. CoRR abs/1412.3555 (2014).Google Scholar
- W. Geoffrey Cobourn. 2007. Accuracy and reliability of an automated air quality forecast system for ozone in seven Kentucky metropolitan areas. Atmospheric Environment 41, 28 (2007), 5863--5875.Google ScholarCross Ref
- W. Geoffrey Cobourn and Milton C. Hubbard. 1999. An enhanced ozone forecasting model using air mass trajectory analysis. Atmospheric Environment 33, 28 (1999), 4663--4674.Google ScholarCross Ref
- Luis A. Díaz-Robles, Juan C. Ortega, Joshua S. Fu, Gregory D. Reed, Judith C. Chow, John G. Watson, and Juan A. Moncada-Herrera. 2008. A hybrid ARIMA and artificial neural networks model to forecast particulate matter in urban areas: The case of Temuco, Chile. Atmospheric Environment 42, 35 (2008), 8331--8340.Google ScholarCross Ref
- James Douglas Hamilton. 1994. Time series analysis. Vol. 2. Princeton university press Princeton.Google Scholar
- K.-I. Hoi, Ka-Veng. Yuen, and Kai-Meng Mok. 2008. Kalman filter based prediction system for wintertime PM10 concentrations in Macau. Global NEST Journal 10, 2 (2008), 140--150.Google Scholar
- Héctor Jorquera, Ricardo Pérez, Aldo Cipriano, Andrés Espejo, M. Victoria Letelier, and Gonzalo Acuña. 1998. Forecasting ozone daily maximum levels at Santiago, Chile. Atmospheric Environment 32, 20 (1998), 3415--3424.Google ScholarCross Ref
- Marilena Kampa and Elias Castanas. 2008. Human health effects of air pollution. Environmental pollution 151, 2 (2008), 362--367.Google Scholar
- Eamonn Keogh, Kaushik Chakrabarti, Michael Pazzani, and Sharad Mehrotra. 2001. Dimensionality reduction for fast similarity search in large time series databases. Knowledge and information Systems 3, 3 (2001), 263--286.Google Scholar
- Anikender Kumar and P. Goyal. 2011. Forecasting of daily air quality index in Delhi. Science of the Total Environment 409, 24 (2011), 5517--5523.Google ScholarCross Ref
- Ujjwal Kumar and V.K. Jain. 2010. ARIMA forecasting of ambient air pollutants (O3, NO, NO2 and CO). Stochastic Environmental Research and Risk Assessment 24, 5 (2010), 751--760.Google ScholarCross Ref
- Nino Künzli, Michael Jerrett, Wendy J. Mack, Bernardo Beckerman, Laurie LaBree, Frank Gilliland, Duncan Thomas, John Peters, and Howard N Hodis. 2005. Ambient air pollution and atherosclerosis in Los Angeles. Environmental health perspectives 113, 2 (2005), 201.Google Scholar
- Muhammad Hisyam Lee, Nur Haizum Abd Rahman, Mohd Talib Latif, Maria Elena Nor, and Nur Arina Bazilah Kamisan. 2012. Seasonal ARIMA for forecasting air pollution index: A case study. American Journal of Applied Sciences 9, 4 (2012), 570--578.Google ScholarCross Ref
- Yaguang Li and Cyrus Shahabi. 2018. A brief overview of machine learning methods for short-term traffic forecasting and future directions. SIGSPATIAL Special 10, 1 (2018), 3--9. Google ScholarDigital Library
- Yaguang Li, Rose Yu, Cyrus Shahabi, and Yan Liu. 2018. Diffusion Convolutional Recurrent Neural Network: Data-Driven Traffic Forecasting. In International Conference on Learning Representations.Google Scholar
- Yijun Lin, Yao-Yi Chiang, Fan Pan, Dimitrios Stripelis, José Luis Ambite, Sandrah P Eckel, and Rima Habre. 2017. Mining public datasets for modeling intra-city PM2. 5 concentrations at a fine spatial resolution. In Proceedings of the 25th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems. 25. Google ScholarDigital Library
- Ian G. McKendry. 2002. Evaluation of artificial neural networks for fine particulate pollution (PM10 and PM2.5) forecasting. Journal of the Air & Waste Management Association 52, 9 (2002), 1096--1101.Google ScholarCross Ref
- Clare S Murray, Gina Poletti, Tatiana Kebadze, Julie Morris, Ashley Woodcock, S. L. Johnston, and Adnan Custovic. 2006. Study of modifiable risk factors for asthma exacerbations: virus infection and allergen exposure increase the risk of asthma hospital admissions in children. Thorax 61, 5 (2006), 376--382.Google ScholarCross Ref
- World Health Organization and UNAIDS. 2006. Air quality guidelines: global update 2005. World Health Organization.Google Scholar
- E. Patterson and D. J. Eatough. 2000. Indoor/outdoor relationships for ambient PM2.5 and associated pollutants: epidemiological implications in Lindon, Utah. J. Air Waste Manag. Assoc. 50, 1 (2000), 103--110.Google ScholarCross Ref
- Patricio Perez and Giovanni Salini. 2008. PM2.5 forecasting in a large city: comparison of three methods. Atmospheric Environment 42, 35 (2008), 8219--8224.Google ScholarCross Ref
- Patricio Pérez, Alex Trier, and Jorge Reyes. 2000. Prediction of PM2.5 concentrations several hours in advance using neural networks in Santiago, Chile. Atmospheric Environment 34, 8 (2000), 1189--1196.Google ScholarCross Ref
- Victor R Prybutok, Junsub Yi, and David Mitchell. 2000. Comparison of neural network models with ARIMA and regression models for prediction of Houston's daily maximum ozone concentrations. European Journal of Operational Research 122, 1 (2000), 31--40.Google ScholarCross Ref
- David Y.H. Pui, Sheng-Chieh Chen, and Zhili Zuo. 2014. PM2. 5 in China: Measurements, sources, visibility and health effects, and mitigation. Particuology 13 (2014), 1--26.Google ScholarCross Ref
- Ilya Sutskever, Oriol Vinyals, and Quoc V Le. 2014. Sequence to Sequence Learning with Neural Networks. In Advances in Neural Information Processing Systems 27, Z Ghahramani, M Welling, C Cortes, N D Lawrence, and K Q Weinberger (Eds.). Curran Associates, Inc., 3104--3112. Google ScholarDigital Library
- Amos PK Tai, Loretta J Mickley, and Daniel J Jacob. 2010. Correlations between fine particulate matter (PM2.5) and meteorological variables in the United States: Implications for the sensitivity of PM2.5 to climate change. Atmospheric Environment 44, 32 (2010), 3976--3984.Google ScholarCross Ref
- Rose Yu, Yaguang Li, Cyrus Shahabi, Ugur Demiryurek, and Yan Liu. 2017. Deep Learning: A Generic Approach for Extreme Condition Traffic Forecasting. In SIAM International Conference on Data Mining. 777--785.Google Scholar
- Yang Zhang, Marc Bocquet, Vivien Mallet, Christian Seigneur, and Alexander Baklanov. 2012. Real-time air quality forecasting, part I: History, techniques, and current status. Atmospheric Environment 60 (2012), 632--655.Google ScholarCross Ref
- Yang Zhang, Marc Bocquet, Vivien Mallet, Christian Seigneur, and Alexander Baklanov. 2012. Real-time air quality forecasting, part II: State of the science, current research needs, and future prospects. Atmospheric Environment 60 (2012), 656--676.Google ScholarCross Ref
- Mei Zheng, Lynn G. Salmon, James J. Schauer, Limin Zeng, C. S. Kiang, Yuanhang Zhang, and Glen R. Cass. 2005. Seasonal trends in PM2. 5 source contributions in Beijing, China. Atmospheric Environment 39, 22 (2005), 3967--3976.Google ScholarCross Ref
- Yu Zheng, Xiuwen Yi, Ming Li, Ruiyuan Li, Zhangqing Shan, Eric Chang, and Tianrui Li. 2015. Forecasting fine-grained air quality based on big data. In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2267--2276. Google ScholarDigital Library
Index Terms
- Exploiting spatiotemporal patterns for accurate air quality forecasting using deep learning
Recommendations
Investigation of nearby monitoring station for hourly PM2.5 forecasting using parallel multi-input 1D-CNN-biLSTM
AbstractAir quality forecasting is a hot research topic that has been widely explored by the whole society. To better understand environmental quality, numerous methods have been proposed for investigating air pollutant data. Previous studies have used ...
Graphical abstractDisplay Omitted
Highlights- Investigate the nearby station with the target station for PM 2.5 forecasting.
- Propose a parallel multi-input 1D-CNN-biLSTM for PM 2.5 forecasting.
- Forecast PM 2.5 with emphasis on seasonality.
Multi-output Spatio-temporal air pollution forecasting using neural network approach
AbstractMulti-step ahead pollution forecasting has an essential role in mitigating health risks. Multi-Input and Multi-Output (MIMO) multi-step forecasting is valuable, especially for long-term pollution forecasting. Multi-output pollution ...
Highlights- Climate variables and spatial attributes can improve PM2.5 forecasting results.
Probabilistic air quality forecasting using deep learning spatial–temporal neural network
AbstractRegional air quality monitoring, a critical component of sustainable development is realized through various air quality observation stations established across a region. Accurate forecasting of air quality data collected from these observation ...
Comments