skip to main content
10.1145/3274895.3274907acmconferencesArticle/Chapter ViewAbstractPublication PagesgisConference Proceedingsconference-collections
research-article

Exploiting spatiotemporal patterns for accurate air quality forecasting using deep learning

Published:06 November 2018Publication History

ABSTRACT

Forecasting spatially correlated time series data is challenging because of the linear and non-linear dependencies in the temporal and spatial dimensions. Air quality forecasting is one canonical example of such tasks. Existing work, e.g., auto-regressive integrated moving average (ARIMA) and artificial neural network (ANN), either fails to model the non-linear temporal dependency or cannot effectively consider spatial relationships between multiple spatial time series data. In this paper, we present an approach for forecasting short-term PM2.5 concentrations using a deep learning model, the geo-context based diffusion convolutional recurrent neural network, GC-DCRNN. The model describes the spatial relationship by constructing a graph based on the similarity of the built environment between the locations of air quality sensors. The similarity is computed using the surrounding "important" geographic features regarding their impacts to air quality for each location (e.g., the area size of parks within a 1000-meter buffer, the number of factories within a 500-meter buffer). Also, the model captures the temporal dependency leveraging the sequence to sequence encoder-decoder architecture. We evaluate our model on two real-world air quality datasets and observe consistent improvement of 5%-10% over baseline approaches.

References

  1. Martín Abadi, Paul Barham, Jianmin Chen, Zhifeng Chen, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Geoffrey Irving, and Michael Isard. 2016. Tensorflow: a system for large-scale machine learning. In OSDI, Vol. 16. 265--283. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Saeed Aghabozorgi, Ali Seyed Shirkhorshidi, and Teh Ying Wah. 2015. Time-series clustering-A decade review. Information Systems 53 (2015), 16--38. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Asha B. Chelani and Sukumar Devotta. 2006. Air quality forecasting using a hybrid autoregressive and nonlinear model. Atmospheric Environment 40, 10 (2006), 1774--1780.Google ScholarGoogle ScholarCross RefCross Ref
  4. Junyoung Chung, Çaglar Gülçehre, KyungHyun Cho, and Yoshua Bengio. 2014. Empirical evaluation of gated recurrent neural networks on sequence modeling. CoRR abs/1412.3555 (2014).Google ScholarGoogle Scholar
  5. W. Geoffrey Cobourn. 2007. Accuracy and reliability of an automated air quality forecast system for ozone in seven Kentucky metropolitan areas. Atmospheric Environment 41, 28 (2007), 5863--5875.Google ScholarGoogle ScholarCross RefCross Ref
  6. W. Geoffrey Cobourn and Milton C. Hubbard. 1999. An enhanced ozone forecasting model using air mass trajectory analysis. Atmospheric Environment 33, 28 (1999), 4663--4674.Google ScholarGoogle ScholarCross RefCross Ref
  7. Luis A. Díaz-Robles, Juan C. Ortega, Joshua S. Fu, Gregory D. Reed, Judith C. Chow, John G. Watson, and Juan A. Moncada-Herrera. 2008. A hybrid ARIMA and artificial neural networks model to forecast particulate matter in urban areas: The case of Temuco, Chile. Atmospheric Environment 42, 35 (2008), 8331--8340.Google ScholarGoogle ScholarCross RefCross Ref
  8. James Douglas Hamilton. 1994. Time series analysis. Vol. 2. Princeton university press Princeton.Google ScholarGoogle Scholar
  9. K.-I. Hoi, Ka-Veng. Yuen, and Kai-Meng Mok. 2008. Kalman filter based prediction system for wintertime PM10 concentrations in Macau. Global NEST Journal 10, 2 (2008), 140--150.Google ScholarGoogle Scholar
  10. Héctor Jorquera, Ricardo Pérez, Aldo Cipriano, Andrés Espejo, M. Victoria Letelier, and Gonzalo Acuña. 1998. Forecasting ozone daily maximum levels at Santiago, Chile. Atmospheric Environment 32, 20 (1998), 3415--3424.Google ScholarGoogle ScholarCross RefCross Ref
  11. Marilena Kampa and Elias Castanas. 2008. Human health effects of air pollution. Environmental pollution 151, 2 (2008), 362--367.Google ScholarGoogle Scholar
  12. Eamonn Keogh, Kaushik Chakrabarti, Michael Pazzani, and Sharad Mehrotra. 2001. Dimensionality reduction for fast similarity search in large time series databases. Knowledge and information Systems 3, 3 (2001), 263--286.Google ScholarGoogle Scholar
  13. Anikender Kumar and P. Goyal. 2011. Forecasting of daily air quality index in Delhi. Science of the Total Environment 409, 24 (2011), 5517--5523.Google ScholarGoogle ScholarCross RefCross Ref
  14. Ujjwal Kumar and V.K. Jain. 2010. ARIMA forecasting of ambient air pollutants (O3, NO, NO2 and CO). Stochastic Environmental Research and Risk Assessment 24, 5 (2010), 751--760.Google ScholarGoogle ScholarCross RefCross Ref
  15. Nino Künzli, Michael Jerrett, Wendy J. Mack, Bernardo Beckerman, Laurie LaBree, Frank Gilliland, Duncan Thomas, John Peters, and Howard N Hodis. 2005. Ambient air pollution and atherosclerosis in Los Angeles. Environmental health perspectives 113, 2 (2005), 201.Google ScholarGoogle Scholar
  16. Muhammad Hisyam Lee, Nur Haizum Abd Rahman, Mohd Talib Latif, Maria Elena Nor, and Nur Arina Bazilah Kamisan. 2012. Seasonal ARIMA for forecasting air pollution index: A case study. American Journal of Applied Sciences 9, 4 (2012), 570--578.Google ScholarGoogle ScholarCross RefCross Ref
  17. Yaguang Li and Cyrus Shahabi. 2018. A brief overview of machine learning methods for short-term traffic forecasting and future directions. SIGSPATIAL Special 10, 1 (2018), 3--9. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Yaguang Li, Rose Yu, Cyrus Shahabi, and Yan Liu. 2018. Diffusion Convolutional Recurrent Neural Network: Data-Driven Traffic Forecasting. In International Conference on Learning Representations.Google ScholarGoogle Scholar
  19. Yijun Lin, Yao-Yi Chiang, Fan Pan, Dimitrios Stripelis, José Luis Ambite, Sandrah P Eckel, and Rima Habre. 2017. Mining public datasets for modeling intra-city PM2. 5 concentrations at a fine spatial resolution. In Proceedings of the 25th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems. 25. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Ian G. McKendry. 2002. Evaluation of artificial neural networks for fine particulate pollution (PM10 and PM2.5) forecasting. Journal of the Air & Waste Management Association 52, 9 (2002), 1096--1101.Google ScholarGoogle ScholarCross RefCross Ref
  21. Clare S Murray, Gina Poletti, Tatiana Kebadze, Julie Morris, Ashley Woodcock, S. L. Johnston, and Adnan Custovic. 2006. Study of modifiable risk factors for asthma exacerbations: virus infection and allergen exposure increase the risk of asthma hospital admissions in children. Thorax 61, 5 (2006), 376--382.Google ScholarGoogle ScholarCross RefCross Ref
  22. World Health Organization and UNAIDS. 2006. Air quality guidelines: global update 2005. World Health Organization.Google ScholarGoogle Scholar
  23. E. Patterson and D. J. Eatough. 2000. Indoor/outdoor relationships for ambient PM2.5 and associated pollutants: epidemiological implications in Lindon, Utah. J. Air Waste Manag. Assoc. 50, 1 (2000), 103--110.Google ScholarGoogle ScholarCross RefCross Ref
  24. Patricio Perez and Giovanni Salini. 2008. PM2.5 forecasting in a large city: comparison of three methods. Atmospheric Environment 42, 35 (2008), 8219--8224.Google ScholarGoogle ScholarCross RefCross Ref
  25. Patricio Pérez, Alex Trier, and Jorge Reyes. 2000. Prediction of PM2.5 concentrations several hours in advance using neural networks in Santiago, Chile. Atmospheric Environment 34, 8 (2000), 1189--1196.Google ScholarGoogle ScholarCross RefCross Ref
  26. Victor R Prybutok, Junsub Yi, and David Mitchell. 2000. Comparison of neural network models with ARIMA and regression models for prediction of Houston's daily maximum ozone concentrations. European Journal of Operational Research 122, 1 (2000), 31--40.Google ScholarGoogle ScholarCross RefCross Ref
  27. David Y.H. Pui, Sheng-Chieh Chen, and Zhili Zuo. 2014. PM2. 5 in China: Measurements, sources, visibility and health effects, and mitigation. Particuology 13 (2014), 1--26.Google ScholarGoogle ScholarCross RefCross Ref
  28. Ilya Sutskever, Oriol Vinyals, and Quoc V Le. 2014. Sequence to Sequence Learning with Neural Networks. In Advances in Neural Information Processing Systems 27, Z Ghahramani, M Welling, C Cortes, N D Lawrence, and K Q Weinberger (Eds.). Curran Associates, Inc., 3104--3112. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Amos PK Tai, Loretta J Mickley, and Daniel J Jacob. 2010. Correlations between fine particulate matter (PM2.5) and meteorological variables in the United States: Implications for the sensitivity of PM2.5 to climate change. Atmospheric Environment 44, 32 (2010), 3976--3984.Google ScholarGoogle ScholarCross RefCross Ref
  30. Rose Yu, Yaguang Li, Cyrus Shahabi, Ugur Demiryurek, and Yan Liu. 2017. Deep Learning: A Generic Approach for Extreme Condition Traffic Forecasting. In SIAM International Conference on Data Mining. 777--785.Google ScholarGoogle Scholar
  31. Yang Zhang, Marc Bocquet, Vivien Mallet, Christian Seigneur, and Alexander Baklanov. 2012. Real-time air quality forecasting, part I: History, techniques, and current status. Atmospheric Environment 60 (2012), 632--655.Google ScholarGoogle ScholarCross RefCross Ref
  32. Yang Zhang, Marc Bocquet, Vivien Mallet, Christian Seigneur, and Alexander Baklanov. 2012. Real-time air quality forecasting, part II: State of the science, current research needs, and future prospects. Atmospheric Environment 60 (2012), 656--676.Google ScholarGoogle ScholarCross RefCross Ref
  33. Mei Zheng, Lynn G. Salmon, James J. Schauer, Limin Zeng, C. S. Kiang, Yuanhang Zhang, and Glen R. Cass. 2005. Seasonal trends in PM2. 5 source contributions in Beijing, China. Atmospheric Environment 39, 22 (2005), 3967--3976.Google ScholarGoogle ScholarCross RefCross Ref
  34. Yu Zheng, Xiuwen Yi, Ming Li, Ruiyuan Li, Zhangqing Shan, Eric Chang, and Tianrui Li. 2015. Forecasting fine-grained air quality based on big data. In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2267--2276. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Exploiting spatiotemporal patterns for accurate air quality forecasting using deep learning

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      SIGSPATIAL '18: Proceedings of the 26th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems
      November 2018
      655 pages
      ISBN:9781450358897
      DOI:10.1145/3274895

      Copyright © 2018 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 6 November 2018

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      SIGSPATIAL '18 Paper Acceptance Rate30of150submissions,20%Overall Acceptance Rate220of1,116submissions,20%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader