research-article

Exploiting spatiotemporal patterns for accurate air quality forecasting using deep learning

Authors:
Yijun Lin

University of Southern California

University of Southern California
View Profile

,
Nikhit Mago

University of Southern California

University of Southern California
View Profile

,
Yu Gao

University of Southern California

University of Southern California
View Profile

,
Yaguang Li

University of Southern California

University of Southern California
View Profile

,
Yao-Yi Chiang

University of Southern California

University of Southern California
View Profile

,
Cyrus Shahabi

University of Southern California

University of Southern California
View Profile

,
José Luis Ambite

University of Southern California

University of Southern California
View Profile

SIGSPATIAL '18: Proceedings of the 26th ACM SIGSPATIAL International Conference on Advances in Geographic Information SystemsNovember 2018Pages 359–368https://doi.org/10.1145/3274895.3274907

Published:06 November 2018Publication History

SIGSPATIAL '18: Proceedings of the 26th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems

Pages 359–368

ABSTRACT

Forecasting spatially correlated time series data is challenging because of the linear and non-linear dependencies in the temporal and spatial dimensions. Air quality forecasting is one canonical example of such tasks. Existing work, e.g., auto-regressive integrated moving average (ARIMA) and artificial neural network (ANN), either fails to model the non-linear temporal dependency or cannot effectively consider spatial relationships between multiple spatial time series data. In this paper, we present an approach for forecasting short-term PM_2.5 concentrations using a deep learning model, the geo-context based diffusion convolutional recurrent neural network, GC-DCRNN. The model describes the spatial relationship by constructing a graph based on the similarity of the built environment between the locations of air quality sensors. The similarity is computed using the surrounding "important" geographic features regarding their impacts to air quality for each location (e.g., the area size of parks within a 1000-meter buffer, the number of factories within a 500-meter buffer). Also, the model captures the temporal dependency leveraging the sequence to sequence encoder-decoder architecture. We evaluate our model on two real-world air quality datasets and observe consistent improvement of 5%-10% over baseline approaches.

References

Martín Abadi, Paul Barham, Jianmin Chen, Zhifeng Chen, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Geoffrey Irving, and Michael Isard. 2016. Tensorflow: a system for large-scale machine learning. In OSDI, Vol. 16. 265--283. Google ScholarDigital Library
Saeed Aghabozorgi, Ali Seyed Shirkhorshidi, and Teh Ying Wah. 2015. Time-series clustering-A decade review. Information Systems 53 (2015), 16--38. Google ScholarDigital Library
Asha B. Chelani and Sukumar Devotta. 2006. Air quality forecasting using a hybrid autoregressive and nonlinear model. Atmospheric Environment 40, 10 (2006), 1774--1780.Google ScholarCross Ref
Junyoung Chung, Çaglar Gülçehre, KyungHyun Cho, and Yoshua Bengio. 2014. Empirical evaluation of gated recurrent neural networks on sequence modeling. CoRR abs/1412.3555 (2014).Google Scholar
W. Geoffrey Cobourn. 2007. Accuracy and reliability of an automated air quality forecast system for ozone in seven Kentucky metropolitan areas. Atmospheric Environment 41, 28 (2007), 5863--5875.Google ScholarCross Ref
W. Geoffrey Cobourn and Milton C. Hubbard. 1999. An enhanced ozone forecasting model using air mass trajectory analysis. Atmospheric Environment 33, 28 (1999), 4663--4674.Google ScholarCross Ref
Luis A. Díaz-Robles, Juan C. Ortega, Joshua S. Fu, Gregory D. Reed, Judith C. Chow, John G. Watson, and Juan A. Moncada-Herrera. 2008. A hybrid ARIMA and artificial neural networks model to forecast particulate matter in urban areas: The case of Temuco, Chile. Atmospheric Environment 42, 35 (2008), 8331--8340.Google ScholarCross Ref
James Douglas Hamilton. 1994. Time series analysis. Vol. 2. Princeton university press Princeton.Google Scholar
K.-I. Hoi, Ka-Veng. Yuen, and Kai-Meng Mok. 2008. Kalman filter based prediction system for wintertime PM10 concentrations in Macau. Global NEST Journal 10, 2 (2008), 140--150.Google Scholar
Héctor Jorquera, Ricardo Pérez, Aldo Cipriano, Andrés Espejo, M. Victoria Letelier, and Gonzalo Acuña. 1998. Forecasting ozone daily maximum levels at Santiago, Chile. Atmospheric Environment 32, 20 (1998), 3415--3424.Google ScholarCross Ref
Marilena Kampa and Elias Castanas. 2008. Human health effects of air pollution. Environmental pollution 151, 2 (2008), 362--367.Google Scholar
Eamonn Keogh, Kaushik Chakrabarti, Michael Pazzani, and Sharad Mehrotra. 2001. Dimensionality reduction for fast similarity search in large time series databases. Knowledge and information Systems 3, 3 (2001), 263--286.Google Scholar
Anikender Kumar and P. Goyal. 2011. Forecasting of daily air quality index in Delhi. Science of the Total Environment 409, 24 (2011), 5517--5523.Google ScholarCross Ref
Ujjwal Kumar and V.K. Jain. 2010. ARIMA forecasting of ambient air pollutants (O3, NO, NO2 and CO). Stochastic Environmental Research and Risk Assessment 24, 5 (2010), 751--760.Google ScholarCross Ref
Nino Künzli, Michael Jerrett, Wendy J. Mack, Bernardo Beckerman, Laurie LaBree, Frank Gilliland, Duncan Thomas, John Peters, and Howard N Hodis. 2005. Ambient air pollution and atherosclerosis in Los Angeles. Environmental health perspectives 113, 2 (2005), 201.Google Scholar
Muhammad Hisyam Lee, Nur Haizum Abd Rahman, Mohd Talib Latif, Maria Elena Nor, and Nur Arina Bazilah Kamisan. 2012. Seasonal ARIMA for forecasting air pollution index: A case study. American Journal of Applied Sciences 9, 4 (2012), 570--578.Google ScholarCross Ref
Yaguang Li and Cyrus Shahabi. 2018. A brief overview of machine learning methods for short-term traffic forecasting and future directions. SIGSPATIAL Special 10, 1 (2018), 3--9. Google ScholarDigital Library
Yaguang Li, Rose Yu, Cyrus Shahabi, and Yan Liu. 2018. Diffusion Convolutional Recurrent Neural Network: Data-Driven Traffic Forecasting. In International Conference on Learning Representations.Google Scholar
Yijun Lin, Yao-Yi Chiang, Fan Pan, Dimitrios Stripelis, José Luis Ambite, Sandrah P Eckel, and Rima Habre. 2017. Mining public datasets for modeling intra-city PM2. 5 concentrations at a fine spatial resolution. In Proceedings of the 25th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems. 25. Google ScholarDigital Library
Ian G. McKendry. 2002. Evaluation of artificial neural networks for fine particulate pollution (PM10 and PM2.5) forecasting. Journal of the Air & Waste Management Association 52, 9 (2002), 1096--1101.Google ScholarCross Ref
Clare S Murray, Gina Poletti, Tatiana Kebadze, Julie Morris, Ashley Woodcock, S. L. Johnston, and Adnan Custovic. 2006. Study of modifiable risk factors for asthma exacerbations: virus infection and allergen exposure increase the risk of asthma hospital admissions in children. Thorax 61, 5 (2006), 376--382.Google ScholarCross Ref
World Health Organization and UNAIDS. 2006. Air quality guidelines: global update 2005. World Health Organization.Google Scholar
E. Patterson and D. J. Eatough. 2000. Indoor/outdoor relationships for ambient PM2.5 and associated pollutants: epidemiological implications in Lindon, Utah. J. Air Waste Manag. Assoc. 50, 1 (2000), 103--110.Google ScholarCross Ref
Patricio Perez and Giovanni Salini. 2008. PM2.5 forecasting in a large city: comparison of three methods. Atmospheric Environment 42, 35 (2008), 8219--8224.Google ScholarCross Ref
Patricio Pérez, Alex Trier, and Jorge Reyes. 2000. Prediction of PM2.5 concentrations several hours in advance using neural networks in Santiago, Chile. Atmospheric Environment 34, 8 (2000), 1189--1196.Google ScholarCross Ref
Victor R Prybutok, Junsub Yi, and David Mitchell. 2000. Comparison of neural network models with ARIMA and regression models for prediction of Houston's daily maximum ozone concentrations. European Journal of Operational Research 122, 1 (2000), 31--40.Google ScholarCross Ref
David Y.H. Pui, Sheng-Chieh Chen, and Zhili Zuo. 2014. PM2. 5 in China: Measurements, sources, visibility and health effects, and mitigation. Particuology 13 (2014), 1--26.Google ScholarCross Ref
Ilya Sutskever, Oriol Vinyals, and Quoc V Le. 2014. Sequence to Sequence Learning with Neural Networks. In Advances in Neural Information Processing Systems 27, Z Ghahramani, M Welling, C Cortes, N D Lawrence, and K Q Weinberger (Eds.). Curran Associates, Inc., 3104--3112. Google ScholarDigital Library
Amos PK Tai, Loretta J Mickley, and Daniel J Jacob. 2010. Correlations between fine particulate matter (PM2.5) and meteorological variables in the United States: Implications for the sensitivity of PM2.5 to climate change. Atmospheric Environment 44, 32 (2010), 3976--3984.Google ScholarCross Ref
Rose Yu, Yaguang Li, Cyrus Shahabi, Ugur Demiryurek, and Yan Liu. 2017. Deep Learning: A Generic Approach for Extreme Condition Traffic Forecasting. In SIAM International Conference on Data Mining. 777--785.Google Scholar
Yang Zhang, Marc Bocquet, Vivien Mallet, Christian Seigneur, and Alexander Baklanov. 2012. Real-time air quality forecasting, part I: History, techniques, and current status. Atmospheric Environment 60 (2012), 632--655.Google ScholarCross Ref
Yang Zhang, Marc Bocquet, Vivien Mallet, Christian Seigneur, and Alexander Baklanov. 2012. Real-time air quality forecasting, part II: State of the science, current research needs, and future prospects. Atmospheric Environment 60 (2012), 656--676.Google ScholarCross Ref
Mei Zheng, Lynn G. Salmon, James J. Schauer, Limin Zeng, C. S. Kiang, Yuanhang Zhang, and Glen R. Cass. 2005. Seasonal trends in PM2. 5 source contributions in Beijing, China. Atmospheric Environment 39, 22 (2005), 3967--3976.Google ScholarCross Ref
Yu Zheng, Xiuwen Yi, Ming Li, Ruiyuan Li, Zhangqing Shan, Eric Chang, and Tianrui Li. 2015. Forecasting fine-grained air quality based on big data. In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2267--2276. Google ScholarDigital Library

Index Terms

Exploiting spatiotemporal patterns for accurate air quality forecasting using deep learning
1. Information systems
  1. Information systems applications
    1. Spatial-temporal systems

Recommendations

Investigation of nearby monitoring station for hourly PM_2.5 forecasting using parallel multi-input 1D-CNN-biLSTM
Abstract
Air quality forecasting is a hot research topic that has been widely explored by the whole society. To better understand environmental quality, numerous methods have been proposed for investigating air pollutant data. Previous studies have used ...
Graphical abstract

Display Omitted
Highlights
- Investigate the nearby station with the target station for PM 2.5 forecasting.
- Propose a parallel multi-input 1D-CNN-biLSTM for PM 2.5 forecasting.
- Forecast PM 2.5 with emphasis on seasonality.
Read More
Multi-output Spatio-temporal air pollution forecasting using neural network approach
Abstract
Multi-step ahead pollution forecasting has an essential role in mitigating health risks. Multi-Input and Multi-Output (MIMO) multi-step forecasting is valuable, especially for long-term pollution forecasting. Multi-output pollution ...
Highlights
- Climate variables and spatial attributes can improve PM2.5 forecasting results.
Read More
Probabilistic air quality forecasting using deep learning spatial–temporal neural network
Abstract
Regional air quality monitoring, a critical component of sustainable development is realized through various air quality observation stations established across a region. Accurate forecasting of air quality data collected from these observation ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
SIGSPATIAL '18: Proceedings of the 26th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems
November 2018
655 pages
ISBN:9781450358897
DOI:10.1145/3274895
General Chairs:
Farnoush Banaei-Kashani
University of Colorado, Denver
,
Erik Hoel
Esri
,
Program Chairs:
Ralf Hartmut Güting
FernUniversität in Hagen, Germany
,
Roberto Tamassia
Brown University
,
Li Xiong
Emory University
Copyright © 2018 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 6 November 2018
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
PM2.5
air quality forecasting
deep learning
spatiotemporal time series analysis
Qualifiers
- research-article
Conference

Acceptance Rates
SIGSPATIAL '18 Paper Acceptance Rate30of150submissions,20%Overall Acceptance Rate220of1,116submissions,20%
More
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 72
  Total Citations
  View Citations
- 1,139
  Total Downloads
- Downloads (Last 12 months)190
- Downloads (Last 6 weeks)26
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Exploiting spatiotemporal patterns for accurate air quality forecasting using deep learning

SIGSPATIAL '18: Proceedings of the 26th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems

ABSTRACT

References

Cited By

Index Terms

Recommendations

Investigation of nearby monitoring station for hourly PM_2.5 forecasting using parallel multi-input 1D-CNN-biLSTM

Multi-output Spatio-temporal air pollution forecasting using neural network approach

Probabilistic air quality forecasting using deep learning spatial–temporal neural network