Smart control of window and air cleaner for mitigating indoor PM2.5 with reduced energy consumption based on deep reinforcement learning
Introduction
Numerous epidemiologic studies have shown that exposure to PM2.5 (particulate matter with an aerodynamic diameter of less than 2.5 μm) is strongly associated with adverse health effects such as respiratory infections [1], lung cancer [2], chronic obstructive pulmonary disease (COPD) [3] and cardiovascular disease [4], and thus leads to large numbers of premature deaths [3]. Since people spend most of their daily lives indoors [5], there is a great need to reduce indoor exposure to PM2.5 and the associated disease burden and premature deaths.
In China, people often ventilate their apartments naturally by opening the windows [6]. However, with frequent severe ambient PM2.5 pollution, air cleaners have become increasingly popular for reducing indoor PM2.5 pollution. During the past decade, the fraction of Chinese residences with home air cleaners has increased tenfold [6,7]. It is therefore worthwhile to investigate the joint control of windows and air cleaners for effective reduction of PM2.5 pollution in naturally ventilated apartments.
In naturally ventilated apartments, opening the windows can reduce the concentration of indoor PM2.5 generated by indoor activities such as cooking [[8], [9], [10]], smoking [11,12] and printing [13]. On the other hand, closing the windows reduces the ventilation rate and thus the entry of outdoor PM2.5 into the indoor environment [14]. When air cleaners equipped with high-efficiency particulate air (HEPA) filters are used, the indoor PM2.5 concentration can be significantly reduced as long as the clean air delivery rate (CADR) is sufficiently large [15]. However, the use of air cleaners results in higher energy consumption and the need for regular replacement of filtration media [6,15]. Thus, it is crucial to develop an approach that controls a window and an air cleaner simultaneously, in a bid to effectively mitigate indoor PM2.5 pollution while reducing the energy consumed by the air cleaner.
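The competing effects described above can be captured by a standard well-mixed mass-balance model for indoor PM2.5. The sketch below is illustrative only: all parameter values (room volume, penetration factor, air exchange rates, deposition rate, CADR) are hypothetical placeholders, not values from this study.

```python
# Well-mixed mass balance for indoor PM2.5 (illustrative sketch;
# all parameter values below are hypothetical, not from the study).
V = 30.0          # room volume, m^3
P = 0.8           # penetration factor of outdoor PM2.5 (window closed)
LAM_CLOSED = 0.2  # air exchange rate with window closed, 1/h
LAM_OPEN = 3.0    # air exchange rate with window open, 1/h
K_DEP = 0.2       # particle deposition rate, 1/h
CADR = 300.0      # clean air delivery rate of the air cleaner, m^3/h

def dCdt(c_in, c_out, window_open, cleaner_on, S=0.0):
    """Rate of change of indoor PM2.5 (ug/m^3 per hour).

    Gains: outdoor infiltration (p * lam * c_out) and indoor source S.
    Losses: ventilation, deposition, and cleaner filtration.
    """
    lam = LAM_OPEN if window_open else LAM_CLOSED
    p = 1.0 if window_open else P
    removal = (lam + K_DEP + (CADR / V if cleaner_on else 0.0)) * c_in
    return p * lam * c_out + S / V - removal

# Forward-Euler integration over one hour with a 1-minute step:
# high outdoor PM2.5, window closed, cleaner on.
C_in, C_out, dt = 50.0, 150.0, 1.0 / 60.0
for _ in range(60):
    C_in += dCdt(C_in, C_out, window_open=False, cleaner_on=True) * dt
```

With the cleaner on and the window closed, the concentration decays toward a low steady state set by infiltration divided by total removal, which is the balance the controller must exploit.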
Two commonly used control approaches, closed-loop control and model predictive control, can be considered for reducing indoor PM2.5 pollution. A typical closed-loop method is on/off control of the air cleaner based on a setpoint for the indoor PM2.5 concentration. However, this method does not account for the window status or the relative contributions of indoor emission and outdoor infiltration, so the energy consumption of the air cleaner is not necessarily minimized. Model predictive control requires prior knowledge of the specific environment; that is, all the parameters of the environment must be accurately measured in advance for prediction and optimization. However, key parameters such as the air exchange rate, CADR, and indoor source emission rates are difficult to measure in real time. Therefore, these conventional control methods are impractical when both health and energy are to be considered. Note that control of the window and the air cleaner is a sequential decision-making process, a class of problem that reinforcement learning solves effectively [16,17]. Hence, reinforcement learning could potentially be employed to control a window and an air cleaner simultaneously in order to reduce indoor PM2.5 pollution with less energy consumption.
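The setpoint-based closed-loop baseline can be stated in a few lines. This is a generic sketch with assumed setpoint and hysteresis values, not the exact benchmark used in the study; its limitation is visible in the signature: it sees only the indoor concentration, never the window status or outdoor level.

```python
def setpoint_controller(c_in, setpoint=35.0, hysteresis=5.0, cleaner_on=False):
    """On/off benchmark: run the cleaner when indoor PM2.5 exceeds the
    setpoint (values assumed for illustration). Ignores window status,
    indoor sources, and outdoor infiltration entirely."""
    if c_in > setpoint + hysteresis:
        return True   # too polluted: switch cleaner on
    if c_in < setpoint - hysteresis:
        return False  # clean enough: switch cleaner off
    return cleaner_on  # inside the deadband: hold the current state
```

Because the decision depends only on `c_in`, the cleaner may run even when opening the window would dilute an indoor source for free, which is exactly the energy waste the learned controller aims to avoid.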
Reinforcement learning has been applied in the control of smart buildings in many investigations [[18], [19], [20], [21], [22], [23], [24], [25], [26]]. For instance, Chen et al. [18] proposed a Q-learning control strategy for windows and an HVAC system to save energy and reduce thermal discomfort in virtual environments located in Miami and Los Angeles. Nagy et al. [20] developed a deep reinforcement learning algorithm for controlling space heating that could reduce the cost by 5–10% when compared with a rule-based method. Heo et al. [22] trained a controller for the mechanical ventilation system of a subway station using the deep Q-network algorithm. Testing in a simulator showed that the control strategy maintained the indoor PM10 at an acceptable level and reduced energy use by 14.37% when compared with a rule-based method. These studies have provided great insight into the application of reinforcement learning in building systems. However, most of these investigations tested reinforcement learning algorithms for smart building control in computer simulations rather than experimentally. Furthermore, for naturally ventilated apartments with air cleaners, there are few existing studies that have utilized reinforcement learning for control of a window and air cleaner.
Therefore, this investigation attempted to develop an approach that trains a controller using the deep Q-network (DQN) algorithm, a reinforcement learning method, to control the operation of a window and an air cleaner in order to reduce indoor PM2.5 concentration with lower energy consumption. The proposed approach first trained the smart controller using the DQN in a simulated virtual environment. The virtual environment was constructed with the use of a particle dynamics model with typical building parameters. To test the trained DQN controller experimentally, this study constructed two small laboratory chambers, each with a window, an air cleaner, and a control system. The controller was then integrated into the control system to smartly control the window actuator and the air cleaner. Both the indoor PM2.5 concentrations and the operation times of the air cleaner were compared between the trained DQN controller and different benchmark controllers with various outdoor PM2.5 levels under different chamber conditions in order to assess the controller performance. Note that the inputs for the trained controller were the indoor and outdoor PM2.5 concentrations. These real-time inputs can be easily monitored by sensors available on the market. Therefore, the proposed control approach can easily be applied in practical scenarios.
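The paper trains a deep Q-network, i.e., a neural-network approximation of the action-value function. For brevity, the sketch below substitutes a tabular Q-learning stand-in with the same interface: the state is the pair of indoor and outdoor PM2.5 concentrations, and the four actions are the window/cleaner combinations. The concentration bins, learning rate, and discount factor are assumptions for illustration, not the paper's settings.

```python
import numpy as np

rng = np.random.default_rng(0)

# Action space: (window_open, cleaner_on) -> 4 discrete actions.
ACTIONS = [(w, c) for w in (0, 1) for c in (0, 1)]

# Discretize indoor/outdoor PM2.5 (ug/m^3) into 4 bands each -> 16 states.
# Bin edges are illustrative, not from the paper.
BINS = np.array([35.0, 75.0, 150.0])

def state_index(c_in, c_out):
    """Map (indoor, outdoor) concentrations to a discrete state index."""
    return int(np.digitize(c_in, BINS)) * 4 + int(np.digitize(c_out, BINS))

Q = np.zeros((16, len(ACTIONS)))  # tabular stand-in for the DQN

def choose_action(s, eps=0.1):
    """Epsilon-greedy policy over the Q estimates."""
    if rng.random() < eps:
        return int(rng.integers(len(ACTIONS)))
    return int(np.argmax(Q[s]))

def td_update(s, a, r, s_next, lr=0.1, gamma=0.99):
    """One temporal-difference update; the DQN minimizes the same
    target (r + gamma * max_a' Q(s', a')) via gradient descent."""
    Q[s, a] += lr * (r + gamma * Q[s_next].max() - Q[s, a])
```

In the DQN proper, `Q` is a neural network trained on mini-batches from a replay buffer, but the action selection and the bootstrapped target are identical to this tabular form.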
Section snippets
Control objectives and inputs
This investigation centered on naturally ventilated apartments equipped with air cleaners. The aim was to mitigate the health risks attributable to indoor PM2.5 exposure while reducing the energy consumption of the air cleaner. The actuators for indoor PM2.5 control were the window that provides natural ventilation and the air cleaner that provides indoor PM2.5 filtration. The objective of the window and air cleaner controller was to achieve a balance between PM2.5-related health risks and the
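The balance between health risk and energy use described above is typically encoded in the reward signal. The following is a minimal sketch of such a reward, assuming a penalty above a guideline-style concentration limit and a fixed per-step energy penalty; the weights `alpha` and `beta` and the 35 μg/m³ limit are hypothetical, not the paper's formulation.

```python
def reward(c_in, cleaner_on, alpha=1.0, beta=20.0, c_limit=35.0):
    """Illustrative reward balancing PM2.5 exposure against cleaner energy.

    alpha, beta, and c_limit are assumed values for this sketch.
    The agent maximizes reward, so both penalties enter negatively.
    """
    health_penalty = alpha * max(0.0, c_in - c_limit)   # exposure above limit
    energy_penalty = beta * (1.0 if cleaner_on else 0.0)  # cleaner running
    return -(health_penalty + energy_penalty)
```

With this shaping, running the cleaner is only worthwhile when the avoided exposure penalty exceeds the energy penalty, which is the trade-off the controller must learn.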
Experimental setup
In this investigation, a series of experiments was conducted in two laboratory chambers to test the performance of the trained DQN controller. Fig. 5 shows the schematic of the experimental setup, which mainly consisted of two identical small testing chambers that simulated the indoor environments. The size of each chamber was 0.4 m × 0.5 m × 0.4 m. The two identical chambers were used to compare the performance of the trained DQN control algorithm and benchmark algorithms. The chambers were
Sensitivity analysis of building parameters utilized in DQN training
The sensitivity analysis in this section aimed to test whether the trained DQN controller could still outperform the benchmark controllers when the building parameters of the virtual environment utilized for training were changed. The building parameters of the virtual environment that trained the DQN controller were adjusted within a ±10% range based on the typical parameters utilized for training in Section 2.2.2 (denoted as "Typical" in Table 6). Two new virtual environments for training,
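Constructing the perturbed training environments amounts to scaling each building parameter by ±10%. A minimal sketch, with hypothetical parameter names and typical values standing in for those in Table 6:

```python
# Hypothetical "Typical" building parameters (units: 1/h and m^3/h);
# the actual values are those of Section 2.2.2 / Table 6.
typical = {"air_exchange_rate": 0.5, "cadr": 300.0}

# Two perturbed virtual environments for the sensitivity analysis.
env_low = {k: round(0.9 * v, 6) for k, v in typical.items()}   # -10%
env_high = {k: round(1.1 * v, 6) for k, v in typical.items()}  # +10%
```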
Conclusions
This study developed a smart controller that can automatically control a window and an air cleaner in a naturally ventilated apartment to reduce both the health risks attributed to indoor PM2.5 and the energy consumption of the air cleaner. The controller was developed with the use of the DQN algorithm, a deep reinforcement learning method. Offline training of the controller was conducted in a virtual apartment constructed on the basis of a particle dynamics model with typical building
CRediT authorship contribution statement
Yuting An: Writing – original draft, Software, Project administration, Methodology, Investigation, Data curation, Conceptualization. Zhuolun Niu: Resources. Chun Chen: Writing – review & editing, Supervision, Funding acquisition, Conceptualization.
Declaration of competing interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Acknowledgement
This work was partially supported by the General Research Fund of Research Grants Council of Hong Kong SAR, China (Grant No. 14204520).
References (46)
- et al., The association between lung cancer incidence and ambient air pollution in China: a spatiotemporal analysis, Environ. Res. (2016)
- et al., Operating behavior and corresponding performance of portable air cleaners in residential buildings, China, Build. Environ. (2019)
- et al., Reducing human exposure to PM2.5 generated while cooking typical Chinese cuisine, Build. Environ. (2020)
- et al., Indoor environmental quality in social housing: a literature review, Build. Environ. (2018)
- et al., Emission rates of ultrafine and fine particles generated from human smoking of Chinese cigarettes, Atmos. Environ. (2018)
- et al., Evolution of pressure drop across electrospun nanofiber filters clogged by solid particles and its influence on indoor particulate air pollution control, J. Hazard Mater. (2021)
- et al., Optimal control of HVAC and window systems for natural ventilation through reinforcement learning, Energy Build. (2018)
- et al., Whole building energy model for HVAC optimal control: a practical framework based on deep reinforcement learning, Energy Build. (2019)
- et al., A reinforcement learning approach for control of window behavior to reduce indoor PM2.5 concentrations in naturally ventilated buildings, Build. Environ. (2021)
- et al., Reinforcement learning for energy conservation and comfort in buildings, Build. Environ. (2007)
- Fusing TensorFlow with building energy simulation for intelligent energy management in smart cities, Sustain. Cities Soc.
- Impacts of implementing Healthy Building guidelines for daily PM2.5 limit on premature deaths and economic losses in urban China: a population-based modeling study, Environ. Int.
- Indoor air quality and occupants' ventilation habits in China: seasonal measurement and long-term monitoring, Build. Environ.
- Deposition, resuspension, and penetration of particles within a residence, Atmos. Environ.
- Review of relationship between indoor and outdoor particles: I/O ratio, infiltration factor and penetration factor, Atmos. Environ.
- A methodology for predicting particle penetration factor through cracks of windows and doors for actual engineering application, Build. Environ.
- Modifications of exposure to ambient particulate matter: tackling bias in using ambient concentration as surrogate with particle infiltration factor and ambient exposure factor, Environ. Pollut.
- Exploring the feasibility of predicting contaminant transport using a stand-alone Markov chain solver based on measured airflow in enclosed environments, Build. Environ.
- A Markov chain model for predicting transient particle transport in enclosed environments, Build. Environ.
- Exposure and health impact evaluation based on simultaneous measurement of indoor and ambient PM2.5 in Haidian, Beijing, Environ. Pollut.
- A novel model for regional indoor PM2.5 quantification with both external and internal contributions included, Environ. Int.
- Window opening behaviour modelled from measurements in Danish dwellings, Build. Environ.
- Occupants' window opening behaviour: a literature review of factors influencing occupant behaviour and models, Build. Environ.