1. Introduction
Energy efficiency is crucial for lowering energy consumption and reducing greenhouse gas emissions [1]. Switching to heat pumps that utilize renewable sources such as geothermal and ambient heat for heating and cooling can lessen reliance on fossil fuels and reduce exposure to potential carbon pricing [2]. Furthermore, heat pumps can also provide cooling to a building, making them a versatile and more sustainable solution [3].
Heat pumps are sophisticated machines that require specialized knowledge to design and operate effectively. Their performance is sensitive to the temperature level of the heating and cooling system, particularly the return temperature entering the condenser [4]. Competent designers are essential to achieve success with heat pump systems [5]. Among the challenges a heat pump system designer faces are refrigerant selection [6], capacity control, and effective management of the interaction with energy reservoirs and peak load units [7]. Additionally, it is critical to select machines with properties tailored to the process energy and temperature demands [8]. There are various examples of unsuccessful heat pump projects, almost always leading to high energy consumption, either due to a heat pump with a low coefficient of performance (COP) [9] or high use of peak load or backup heating [10]. Understanding how a specific heat pump responds to different system temperatures is crucial to maximizing its potential within a system [11].
Accurate modeling and simulation of the components of an energy plant are essential for various purposes such as training, strategic planning, maintenance, techno-economic decisions, and monitoring of plant operation [12]. Furthermore, as the demand for environmentally friendly and decentralized energy production grows, improved and user-friendly simulation and monitoring tools become vital. Model predictive control (MPC) has proven to be a promising technology for building control [13]. An MPC optimizes system control actions using a simulation model. One of the most important but also most time-consuming parts of MPC design is determining a suitable model that captures the dynamics of the controlled system [14].
In the academic literature, researchers commonly distinguish between physics-based (white-box) and data-driven (black-box) models, with grey-box models serving as an intermediate approach [15]. Physics-based models are built on the underlying physics of the processes and incorporate physical relationships that describe these processes. Zanetti et al. presented an experimental investigation and a numerical analysis of a heat pump with ambient air and the ground as heat sources [16]. Each of the components within the heat pump was modelled as an independent block, and the blocks were then combined to simulate the operation of the heat pump. The models were based on a physics-based approach that allowed a direct calculation of heat transfer coefficients and pressure drops in the heat exchangers. The model predicted the heat pump performance with deviations below 7% for the tests in winter mode and below 10% in summer mode. Xu et al. developed thermodynamic models for an air source CO2 heat pump system for space heating and domestic hot water production [17]. The models predicted COP and heat production for two experimental datasets with mean absolute percentage errors (MAPE) of 5.5–6.1% and 2.9–3.1%, respectively.
However, physics-based models can become rather complex, leading to higher modeling costs, potential errors, and increased simulation times [18]. The availability of data and the computational capacity needed to solve the equations may also be a challenge, especially in commercial installations. In addition, if a physics-based model is utilized to monitor a specific process, the uncertainty of the physical relationships tends to increase over time, as the equations representing the processes may become unsuitable due to the degradation of the energy plant [19].
The grey-box modeling approach combines elements of both white-box and black-box models. The approach often requires both long calculation times, due to the parameter optimization process, and expert knowledge during model development [15]. A grey-box model may represent a good trade-off between physics, expert knowledge, and data-driven modeling. Madani et al. developed a ground source heat pump system model [20]. The heat pump model combined semi-empirical compressor, evaporator, and condenser sub-models with a simplified expansion valve model. Validation against experimental data showed deviations of less than 15% for COP and heating capacity, and less than 10% for compressor power. Another grey-box model was presented by Cheung et al. [4]. The model was developed using experimental data obtained from an 8 kW R410A dual-unit ductless heat pump system. The final estimates for compressor frequency, refrigerant flow, and outdoor unit power were predicted within an 8% margin of error.
All the referenced white- and grey-box models depend on system knowledge and component-level sub-modeling. In industrial and real-life systems, component-level data and datasheets may be difficult or even impossible to obtain. The data-driven approach does not take the physics of the process into account, but rather regards it as a black box [21]. The approach generates a high-dimensional mapping model by learning the mapping relationship between the input and the output variables, allowing future predictions to be obtained from new inputs. Data-driven methods have a natural advantage in solving highly nonlinear problems that are influenced by many uncontrollable factors [22], such as building energy systems [23].
In existing energy plants, a large amount of operational data is constantly collected by the plant’s monitoring system. Typically, the data is solely stored in a database. From these data, artificial neural network (ANN) models can be developed for simulating plant operations [19]. ANNs can be trained periodically with the latest data, thereby capturing plant degradation. Additionally, ANN models account for user-specific influences such as operational hours, person load, etc., which are challenging to describe accurately in a physics-based model [18]. By comparing predictions from a previous ANN model with those of a subsequent model, the degradation of the plant can be evaluated. ANNs are easy to use, fast in response, and suitable for “off-line” and “on-line” applications [19]. The authors responsible for the CO2 heat pump model referenced earlier used datasets generated from the physics-based model to train ANNs, increasing the speed and precision when utilizing the model within an optimization framework [17].
Opalic et al. developed ANN models for an industrial cooling system [24]. Their dataset was based on compressor polynomials for the various compressors in the system, and they used operating temperature, pressure, and compressor frequency to predict compressor power and refrigerant mass flow rate. The model made predictions with MAPE in the range of 5–12% in an operational case-study cooling system. Puttige et al. created models for ground source heat pump systems using a hybrid analytical-ANN approach [25]. Both the hybrid analytical-ANN borehole heat exchanger model and the ANN heat pump model made predictions with mean absolute errors (MAE) of less than 5%. These studies highlight the potential of ANNs as a fast and accurate method to predict the performance of different cooling and heating systems under various operating conditions.
The drawbacks of using ANNs are closely related to their advantages, namely the fact that they are highly dependent on the data they are trained on [26]. If a model is used with inputs outside the bounds of the training data, the results may be erroneous. Thus, it is crucial to keep close track of which input parameters the models have been defined for. Another challenge with ANNs based only on operational data is that the model has no capability of determining whether the system operates at maximum efficiency. In other words, an ANN may have been trained with data from an inefficient system without capturing its true potential [27]. An advanced physics-based model would allow determining the heat pump’s COP over a wider range of operational conditions, which could then be compared to ideal thermodynamic relationships, such as the Carnot [28] or the Lorenz efficiency [4].
J. Clauß and L. Georges presented an overview of simulation-based studies based on heat pump modeling [29]. They observed that the following simplifications were often employed in heat pump modeling:
- (1) the heat pump is assumed to have perfect modulation between 0% and 100% or is simply on/off;
- (2) minimum duration and pause times in the heat pump cycle are not taken into account;
- (3) there are no temperature limits considered for the condenser and evaporator temperatures, although heat pumps have a maximum supply temperature, as well as a minimum temperature at the evaporator side; and
- (4) the heat pump model only takes into account steady-state operation at full load measured under standard rating conditions according to EN 14511—Air conditioners, liquid chilling packages and heat pumps for space heating and cooling and process chillers, with electrically driven compressors [30]. Factors such as cycling losses or the change in the COP at part load are not considered.
In this study, we present two distinct ANN models for heat pumps, both utilizing only operational data. The selected heat pump of the study is an advanced machine, with no data available from internal sensors and limited information on its individual components. The main novelty of this work is presenting models derived from a sophisticated, real-life heat pump system, offering researchers valuable insights into the challenges and complexities associated with data from such systems. We aim to investigate the ways in which Artificial Intelligence (AI) methodologies can be employed to gain insights into the functionality of an operational heat pump system, with emphasis on developing tools for industrial implementation.
Teng et al. advocate for a practical approach to developing AI-based energy-saving systems, emphasizing that researchers should prioritize realistic industrial implementations over abstract theoretical problems [31]. In addition, the authors suggest that researchers should engage in “low-level” research tasks, such as data cleaning and preprocessing, to ensure that the data used to train AI models is of the highest quality and accuracy [31].
This article highlights the importance of this so-called “low-level” research perspective and discusses challenges associated with processing and developing large and complex datasets. Though the academic literature on AI in energy systems is vast, practical implementations are still limited [32]. In a real-life system, the precision of any data-driven model depends on the available measuring equipment and the quality of the measured data. Sensors may have been specified without consideration for using the measurements within a data-driven framework. This is the case for the heat pump system analyzed in this work. Additionally, the internal sensors of the heat pump are not available to the user, and thus only external sensors are available for modeling purposes.
In this article, we address the intricacies of data capture and processing, data resolution, and potential errors and pitfalls that engineers looking to explore ANNs for energy plants and heat pump systems may encounter. The drawbacks of the technique become apparent as the modeling results are presented. In the discussion section, the applicability of the models and suggestions for improving them are presented.
With this article, we aim to share the knowledge acquired through working with real-life systems, their datasets, and the challenges they pose, contributing to the research community and enhancing the understanding of heat pump systems and their practical applications.
2. Materials and Methods
2.1. The Heat Pump Study Case
The selected case study is the Triangulum Central Energy Plant (TriCEP) in Stavanger, Norway. The design principles, targets and energy results, including examples of suboptimal heat pump operations, are thoroughly described in a previous work [33]. Developed as part of the European Smart Cities and Communities Lighthouse Project Triangulum, the plant has been in operation since 2017. It consists of sewage heat pumps, solar collectors, a biogas boiler, and greywater recycling. The heat pumps produce 90% of the thermal energy from the plant, approximately 1.8 GWh/year, with an input energy of about 0.5 GWh/year.
The TriCEP was established to supply an existing building cluster with heating, cooling, and domestic hot water (DHW).
Figure 1 presents the architecture of the TriCEP, excluding the distribution loops and the DHW system. The two brine-to-water heat pumps produce simultaneous heating and cooling and are connected to a sewage thermal energy reservoir based on stainless steel heat exchangers. The heat exchangers are mounted directly in a main municipal sewage pipe.
The heat pumps have two operational modes: heating priority and cooling priority. The mode controls whether the heat pumps prioritize meeting the outlet temperature at the condenser (heat) or evaporator (cooling). On the evaporator side, the heat pumps are connected in parallel; on the condenser side, they are connected in series when prioritizing heat production, and parallel when prioritizing cooling. Thus, in heat mode, the first heat pump’s outlet temperature equals the second one’s inlet temperature. As a result, the second machine generally operates at a higher temperature level than the first one. This heat pump had significantly higher operational hours and was therefore selected for modeling.
Each machine was specified with the following design parameters:
- Heat production capacity of 250 kW at the condenser with the following COP and temperature criteria:
- ○ Condenser outlet/inlet temperature: +46/+40 °C.
- ○ Evaporator inlet/outlet temperature: +5/+1 °C.
- ○ Heating COP > 4.0.
- Cooling production capacity of 220 kW at the evaporator with the following COP and temperature criteria:
- ○ Condenser outlet/inlet temperature: +40/+33 °C.
- ○ Evaporator inlet/outlet temperature: +15.5/+10.0 °C.
- ○ Cooling COP > 4.5.
- The heat pumps must be able to produce at least +60 °C when the outlet temperature of the evaporator is +1 °C. The minimum allowed inlet temperature to the evaporator is −1.5 °C.
- The refrigerant is R-1234ze.
Both heat pumps have four semi-hermetic, reciprocating compressors, where two compressors per heat pump have inverter control. The heat pumps can operate seamlessly between 10% and 100% capacity.
In heating mode, the setpoint temperature at the condenser outlet is correlated to the ambient temperature. The correlation curve is set manually from the control system.
In cooling mode, only a single temperature setpoint at the evaporator is set in the control system manually. The heat pump system, including free cooling from the reservoir, is the only cooling source in the plant.
2.2. The Perspective and the Challenges of the Plant Operators
The municipal operators monitor and control the plant using a building automation system (BAS), Citect SCADA [34], and an energy management system (EMS), Gurosoft [35]. The municipality of Stavanger is responsible for the heating, ventilation, and air conditioning (HVAC) systems of more than 200 municipal buildings, and the BAS is vital in the day-to-day operation.
For the TriCEP, the BAS has process flowsheet pictures with live updates of all plant sensors. In addition, historical measurements of the past 13 months are stored and available.
Figure 2 shows the heat pump and nearby sensors. Each heat pump is equipped with a thermal energy meter on the condenser side and an electric energy meter, providing a continuous measurement of the heat pump COP. In addition, temperature sensors are available on inlets and outlets on both the evaporator and condenser of the machines.
From the perspective of the plant operator, the heat pumps can be seen as black boxes with flows entering the evaporator and the condenser with specific temperatures and the same flows exiting with decreased and increased temperatures, respectively. The machines were custom built, and no standardized simulation tool providing information about energy production and consumption at various operational points is available. Before the machines were set in operation, little to no information about operational conditions outside the design points was known. Operators and designers would greatly benefit from a tool to help identify and evaluate the most efficient operational conditions for a given demand.
Figure 3 shows a simplified sketch of the internal piping and instrumentation diagram (P&ID) of the heat pumps and a picture of the control panel. The data from the internal sensors are not accessible to operators in the BAS. These measurements would provide valuable insights beyond the limited information offered by the current black box configuration shown in
Figure 2, particularly regarding compressor control strategy and part-load conditions. However, based on the authors’ experience and investigations into other heat pump plants in the municipal BAS, the availability of internal heat pump sensors is generally limited.
2.3. The Heat Pump Process and Benchmarking Parameters
To aid in defining the model setup and selecting appropriate input and output parameters, it is important to establish clear definitions of key terms related to heat pumps. Domain knowledge can play an important role in this process, as Singh et al. have noted [36]. Additionally, even basic system knowledge can help simplify and specify model development, as suggested by Mahbub et al. [37].
The Coefficient of Performance (COP) is a crucial metric for an electrically driven heat pump, as it relates the compressor power Ẇ in kW to the useable thermal power Q̇ generated [28]. COP is defined as
COP = Q̇/Ẇ (1)
The useable heat generated by a heat pump depends on the specific target of the operation. For heat production, the useable thermal power Q̇ is represented by the condenser heat dissipation Q̇C, while in a refrigeration process, Q̇ represents the evaporator heat extraction Q̇E. In a combined heating and cooling process, Q̇ is the sum of the heating produced at the condenser and the cooling produced at the evaporator utilized in the system. The evaporator heat extraction Q̇E, the condenser heat dissipation Q̇C, and the compressor power Ẇ are correlated by the following expression [38]:
Q̇C = Q̇E + Ẇ (2)
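As a minimal illustration, the COP definition and the first-law balance between evaporator, condenser, and compressor can be written in a few lines of code. The sketch below is ours (the function names are not from the plant), and it uses the 250 kW design heating capacity with the design criterion COP = 4.0 as an example point:

```python
def cop(q_useful_kw: float, w_comp_kw: float) -> float:
    """COP, Equation (1): useable thermal power divided by compressor power."""
    return q_useful_kw / w_comp_kw

def condenser_heat_kw(q_evap_kw: float, w_comp_kw: float) -> float:
    """First-law balance, Equation (2): condenser dissipation = evaporator extraction + work."""
    return q_evap_kw + w_comp_kw

# Design point of the TriCEP machines: 250 kW of heat at a heating COP of 4.0
w = 250.0 / 4.0        # implied compressor power, 62.5 kW
q_e = 250.0 - w        # evaporator extraction from the balance, 187.5 kW
assert cop(250.0, w) == 4.0
assert condenser_heat_kw(q_e, w) == 250.0
```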
Within the models presented in this study, heat dissipation is defined as energy production (EP) and work as energy consumption (EC).
The operational COP is influenced by the temperature levels of the system [28]; in particular, the temperature lift from the evaporation temperature, TE, to the condensation temperature, TC, has a significant impact on the COP. As the temperature difference increases, so does the pressure difference that must be overcome by the compressor, resulting in a reduction in heat pump efficiency and a lower COP. The refrigerant and the properties of the thermodynamic cycle can also have a major impact on heat pump performance. These factors can influence the allowable temperature range of operation, the part-load characteristics, and the overall efficiency of the heat pump [39].
COP is a dynamic relationship that is continuously influenced by factors such as flow rates, temperature levels, and compressor part-load conditions. In the operational phase, the operational COP can be compared to the design COP to evaluate the performance of the system. A negative deviation between the values can indicate issues that require further investigation. However, this strategy relies on an understanding of the expected COP at various operational points.
Another important metric for evaluating heat pump performance is the Seasonal Coefficient of Performance (SCOP), which represents the ratio between the annual heat energy output and the annual electric energy input. The SCOP is influenced by a variety of factors, including the COP, local climatic conditions, and the heat pump’s integration with the building energy systems [40]. The definition of the SCOP incorporates the utilization time (τ) of the heat pump [38]:
SCOP = EPτ/ECτ (3)
where EPτ and ECτ are the heat pump energy production and consumption in kWh over the utilization time τ. The utilization time is most commonly a year, and the SCOP is used to benchmark systems from year to year.
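Using the annual figures quoted earlier for the TriCEP heat pumps (approximately 1.8 GWh/year produced for about 0.5 GWh/year consumed), a sketch of the SCOP calculation could look as follows (illustrative code, not part of the plant's tooling):

```python
def scop(ep_kwh: float, ec_kwh: float) -> float:
    """Seasonal COP, Equation (3): energy production over consumption for one utilization period."""
    return ep_kwh / ec_kwh

annual_scop = scop(1.8e6, 0.5e6)   # kWh produced / kWh consumed over one year
print(round(annual_scop, 2))       # 3.6
```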
The net heat transfer Q̇ from a mass flow due to a temperature change inside a control volume is given as [41]
Q̇ = ṁ·cp·(Tout − Tin) (4)
where ṁ, in kg/s, is the instantaneous mass flow rate of the fluid entering and exiting the control volume, cp, in kJ/(kg K), is the specific heat capacity of the fluid, and Tin and Tout, in K, are the inlet and outlet temperatures, respectively. Thus, there is a linear relationship between the outlet temperature and the heat production for a given inlet temperature and flow.
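Equation (4) can be sketched directly in code; the flow rate below is a hypothetical value chosen only to land near the 250 kW design heating capacity at the design condenser temperatures:

```python
def net_heat_kw(m_dot_kg_s: float, cp_kj_kg_k: float, t_in: float, t_out: float) -> float:
    """Equation (4): net heat transfer in kW for a flow through a control volume."""
    return m_dot_kg_s * cp_kj_kg_k * (t_out - t_in)

# Water-like fluid (cp = 4.19 kJ/(kg K)) heated across the condenser from +40 to +46 degC
q_cond = net_heat_kw(10.0, 4.19, 40.0, 46.0)
print(round(q_cond, 1))   # 251.4 kW, close to the 250 kW design capacity
```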
2.4. Method—Artificial Neural Networks
An artificial neural network (ANN) is a machine learning (ML) technique that emulates the biological neural networks found in the human brain [42]. Comprised of interconnected artificial neurons, ANNs can receive inputs, process data, and generate outputs helpful in predicting the behavior of the modeled system. ANNs consist of interlinked neurons arranged in layers and offer versatility, power, and scalability, making them well-suited for large and complex ML tasks [42]. After an ANN has been trained, it is easy to use, and it consists only of simple equations that can be programmed in any computer language for automatic operation. Since ANNs do not require an iterative solution to predict outputs, they are quick to respond and can also be used for on-line applications.
During a supervised training process, an ANN acquires knowledge from a dataset [43]. The synaptic weights between the interconnected neurons are the adaptive parameters of the ANN that are adjusted during training to store the knowledge contained in the data. The ANN is a statistical tool used for data modeling, and it can create a nonlinear functional relationship between a set of input parameters and a set of output parameters during the training process [43].
The diagram in Figure 4a shows that the inputs to the artificial neuron are sourced from the neurons in the preceding layer [42]. The standard structure of an ANN is made up of two layers, the input layer and the output layer. However, the model’s performance can be enhanced by incorporating additional hidden layers between the input and output layers. Figure 4b demonstrates a multilayer perceptron with two hidden layers [44]. This model has four inputs, eight neurons in the first hidden layer, six in the second hidden layer, and two outputs. The number of neurons required in the input and output layers is determined by the number of inputs and outputs required to model the system.
In a fully connected network, every unit in a layer is connected to every unit in the next layer [42]. When inputs are introduced to the network, they are multiplied by weights that connect them to the layer of hidden neurons on the right. A bias term is added to the sum of the inputs [43]. The resulting weighted inputs are then summed and transformed as they pass through the hidden neurons. The outputs of each hidden layer are used as inputs for the next layer, and this process is repeated in a forward direction, layer by layer, until the network outputs corresponding to the applied inputs are calculated in the output neurons. The functional relation represented by the network structure with n inputs, k hidden neurons, and m outputs shown in Figure 4 can be expressed as follows:
ym = g(Σj=1..k vjm f(Σi=1..n wij xi + w0j) + v0m) (5)
where wij is a weight connecting input i to the hidden unit j, and w0j is the bias for the hidden unit j, corresponding to the fixed input x0 equal to 1. Similarly, vjm denotes a weight connecting hidden unit j to the output unit m, and v0m is the bias for the output unit m, corresponding to the fixed input equal to 1. f(.) and g(.) represent the transfer functions of the hidden and output neurons, respectively [43].
During the supervised training process, the goal is to minimize an error function by adjusting the weights, wij and vjm, which represent the connections between the input and hidden layers and between the hidden and output layers, respectively. The error function is defined as the difference between the predicted outputs, ym, and the target outputs. Errors are first calculated in the output layer and then propagated backwards through the network. Based on the calculated errors, the weights are updated, and this process is repeated until the error is reduced to an acceptable level. This algorithm is referred to as backpropagation. After training, the weights are fixed, and the network is ready to predict outputs based on new data that was not used during training [43].
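The forward pass described above can be sketched in a few lines of NumPy. The 4-8-6-2 layout follows Figure 4b, while the weights are random placeholders standing in for trained values:

```python
import numpy as np

rng = np.random.default_rng(0)

def layer(x, w, b, transfer):
    """One fully connected layer: weighted sum of inputs plus bias, then transfer function."""
    return transfer(x @ w + b)

# Layer sizes from Figure 4b: 4 inputs, hidden layers of 8 and 6 neurons, 2 outputs
w1, b1 = rng.normal(size=(4, 8)), np.zeros(8)
w2, b2 = rng.normal(size=(8, 6)), np.zeros(6)
w3, b3 = rng.normal(size=(6, 2)), np.zeros(2)

x = np.array([[0.2, 0.5, -0.1, 0.8]])      # one sample with four input values
h1 = layer(x, w1, b1, np.tanh)             # first hidden layer, transfer function f(.)
h2 = layer(h1, w2, b2, np.tanh)            # second hidden layer
y = layer(h2, w3, b3, lambda z: z)         # linear output layer, transfer function g(.)
print(y.shape)                             # (1, 2): two network outputs
```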
Measurements from building automation and energy management systems, electricity and thermal energy meters, weather and climate stations, utility bills, national reports, and surveys are the most common data sources for training data-driven models in building energy systems [45]. Measurements represent the most reliable data sources, provided that the quality of the measurements is validated.
Hyperparameter Optimization with Bayesian Method
In ML, hyperparameters are parameters that are not learned from the data during training but are instead set before training the model [42]. Hyperparameters are often specified by the user and control the behavior of the ML algorithm, the training process, and the structure of the model. Examples of hyperparameters for an ANN model include the learning rate, the number of hidden layers, the number of neurons in each layer, and the activation functions [42].
To achieve the best performance, the user sets the hyperparameters either through prior knowledge or by searching over a range of values to determine the optimal set of hyperparameters that results in the best model performance on a validation or test set. The options of hyperparameters for an ANN with a maximum of 3 layers are presented in Table 1. The optimizer is the type of optimization algorithm employed during training, and the learning rate determines the magnitude of the change in the weights in each iteration. The number of possible hyperparameter combinations is of the order of 10⁹, even without considering the various possibilities for the learning rate [42].
Hyperparameter tuning is an important aspect of the machine learning workflow, and it can be conducted manually or through automated methods such as grid search, random search, or Bayesian optimization [42]. However, a naive brute-force approach such as a grid search, where all possible combinations of hyperparameters are tested, is not practical for such a vast domain. A random grid search, where combinations are selected randomly, is quicker, but it is not always reliable. This work uses a Bayesian optimization algorithm to perform a more guided search for the optimal hyperparameters [46].
Bayesian optimization is a technique used for globally optimizing noisy black-box functions by following a sequential design strategy. The process involves building a probabilistic model of the function to be optimized and refining the model as evaluations of the objective function are gathered through a Bayesian inference-based process. Bayesian optimization is a highly efficient way of solving complex and costly optimization problems [47].
During the hyperparameter optimization, the input arguments for the optimization function are the selection of hyperparameters, and the output is the accuracy of the trained network. However, evaluating the function requires complete training, making it a computationally expensive task. To speed up the optimization process, a surrogate model (such as a Gaussian process model) is employed to represent the objective function. The associated acquisition function helps determine the next sample point to evaluate the objective function. This approach is more efficient, as finding the maximum of the acquisition function requires fewer steps and is less time-consuming than the original optimization problem, reducing the number of evaluations required to determine a set of hyperparameters close to the global optimum [46].
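The loop of surrogate fitting and acquisition maximization can be sketched in a few dozen lines. The example below is a toy one-dimensional version: the objective stands in for "validation accuracy as a function of one hyperparameter", the surrogate is a basic Gaussian process, and the acquisition function is expected improvement. All names and constants are illustrative, not the implementation used in this study:

```python
import numpy as np
from math import erf

def kernel(a, b, length_scale=0.25):
    """Squared-exponential covariance between sample locations a and b."""
    return np.exp(-0.5 * (a[:, None] - b[None, :]) ** 2 / length_scale**2)

def gp_posterior(x_obs, y_obs, x_new, noise=1e-6):
    """Gaussian-process surrogate: posterior mean and std at candidate points."""
    k_obs = kernel(x_obs, x_obs) + noise * np.eye(len(x_obs))
    k_cross = kernel(x_obs, x_new)
    k_inv = np.linalg.inv(k_obs)
    mean = k_cross.T @ k_inv @ y_obs
    var = 1.0 - np.sum(k_cross * (k_inv @ k_cross), axis=0)
    return mean, np.sqrt(np.maximum(var, 1e-12))

def expected_improvement(mean, std, best):
    """Acquisition function: expected improvement over the best observation so far."""
    z = (mean - best) / std
    cdf = 0.5 * (1.0 + np.array([erf(v / np.sqrt(2.0)) for v in z]))
    pdf = np.exp(-0.5 * z**2) / np.sqrt(2.0 * np.pi)
    return (mean - best) * cdf + std * pdf

def validation_score(x):
    """Toy stand-in for the expensive 'train a network, measure accuracy' step."""
    return -(x - 0.6) ** 2

candidates = np.linspace(0.0, 1.0, 101)
x_obs = np.array([0.1, 0.9])                 # two initial evaluations
y_obs = validation_score(x_obs)
for _ in range(10):                          # sequential design loop
    mean, std = gp_posterior(x_obs, y_obs, candidates)
    x_next = candidates[np.argmax(expected_improvement(mean, std, y_obs.max()))]
    x_obs = np.append(x_obs, x_next)
    y_obs = np.append(y_obs, validation_score(x_next))

best_x = x_obs[np.argmax(y_obs)]
print(best_x)                                # close to the optimum at 0.6
```

In practice, off-the-shelf Bayesian optimization tooling would be used instead of a hand-rolled surrogate; the sketch is only meant to make the "model, acquire, evaluate, refine" cycle concrete.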
2.5. Model Configurations
In the study, ANN models for a specific heat pump are developed using measured data from the plant’s control system available to the operators. The dataset was obtained manually from the TriCEP BAS. Two distinct model structures are presented, each designed for a specific purpose, which will be elaborated upon in the following sections.
2.5.1. Model A—Setpoint-Based Model
Model A (Figure 5) takes the flow rate through the condenser (V̇C), the inlet temperatures on both sides of the heat pump (TC,in and TE,in at the condenser and evaporator, respectively), and the temperature setpoint (TSP) for the operation as inputs, and predicts the outlet temperatures (TC,out and TE,out at the condenser and evaporator, respectively), the energy production (EP), and the energy consumption (EC). The model’s primary objective is to provide clear and precise predictions of the ways in which the heat pump responds to setpoint changes at various operational conditions. In addition, since the temperature setpoint is the only controllable parameter that the operator can directly adjust to influence the heat pump’s operation, it is considered a potential input in a future MPC.
2.5.2. Model B—Compressor Map Model
In Model B (Figure 6), the temperature setpoint is removed from the inputs, and the outlet temperatures are moved from the outputs to the inputs. Consequently, the model obtains more comprehensive information regarding the heat pump’s outputs but loses its capacity to forecast the heat pump’s response to inlet conditions and a specified setpoint. According to Equation (4), the energy production can be determined from the inputs. As the machine is custom built without an available performance model, the target of this structure is to develop a mapping of the amount of energy the compressors consume at various temperature conditions. Thus, Model B can be considered a tool to develop a compressor map for this specific heat pump, allowing evaluation of the machine at part-load conditions. As discussed in the Introduction section, the compressor map is limited to the dataset on which the model was trained. Additionally, Shin and Cho demonstrated that a similar approach could be used as a real-time performance monitoring tool [48].
2.5.3. Thermal Energy Meter and Flow Rate
Within the present study, we employ two distinct models to predict thermal energy production based on measurements from thermal energy meters that comprise a flow meter, two temperature sensors, and a calculating device. The calculating device is calibrated to accurately measure the accumulated energy production over a specified time according to Equation (4). While the heat pump system under investigation is designed to maintain a nearly constant flow rate across the evaporator and condenser at all operating points, minor variations may occur depending on whether one or two heat pumps are in operation. Still, the flow rate is close to redundant as a model input parameter, since it does not convey unique information to the models. However, the flow rate is included as an input parameter in both model configurations to establish a generalized model structure. This structure allows for training models with similar structures on data from other heat pump systems, including those with variable flow rates across the evaporator and condenser. It should be noted, however, that the models presented in this study are specific to the machine under investigation and the defined dataset.
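To summarize the two configurations, the input/output signatures can be written down explicitly. The snippet below is only a schematic restatement of Figures 5 and 6; the variable names are our own shorthand, not identifiers from the plant's control system:

```python
# Model A: setpoint-based model, predicting the heat pump's response to a setpoint
MODEL_A = {
    "inputs": ["flow_cond", "T_cond_in", "T_evap_in", "T_setpoint"],
    "outputs": ["T_cond_out", "T_evap_out", "EP", "EC"],
}

# Model B: compressor-map model, with the outlet temperatures moved to the input side
# (EP is in principle recoverable from the inputs via Equation (4))
MODEL_B = {
    "inputs": ["flow_cond", "T_cond_in", "T_evap_in", "T_cond_out", "T_evap_out"],
    "outputs": ["EP", "EC"],
}

# The setpoint is the only operator-controllable input, and it appears only in Model A
assert "T_setpoint" in MODEL_A["inputs"]
assert "T_setpoint" not in MODEL_B["inputs"]
```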
2.6. Data Janitoring
In line with Teng et al.’s [
31] suggestion that researchers should engage in “low-level” research or “data janitoring” tasks, observations and challenges from the data handling process are discussed. The discussion is intended to offer recommendations to researchers and industry professionals on establishing their own data-driven models for energy systems.
2.6.1. Data Capture
Data obtained from the TriCEP plant BAS process view are utilized in this study. In the view, the user can access figures containing historical data for each sensor or measuring unit. The time scales and the number of collected measurements are chosen by the user [
33]. The data were manually captured by copying and pasting from the BAS to Excel.
The data collected cover the period from 1 January 2022 at 01:00 to 31 December 2022 at 23:55. This period was selected to ensure that one full year of operation of the system was included in the analysis. It is noteworthy that the BAS only retains recorded data for the preceding thirteen months, rendering any uncollected data irretrievable.
Data were captured at a sampling resolution of five minutes. This interval was chosen because it was deemed to accurately describe the process while averaging out some measurements that lagged, as discussed in
Section 2.6.2. Collecting data manually is a time-consuming process, and the higher the sampling resolution, the more time is needed. Balancing the number of data rows against the model’s accuracy is crucial. Moreover, reducing the amount of data shortens the training process.
2.6.2. Sampling Resolution
Several sampling resolutions were evaluated while developing the dataset. The resolution should be high enough to capture all system actions if the model is to be used in an optimization-based control scheme. After considering sampling resolutions ranging from 1 s to 60 min, a 5 min interval was ultimately chosen. This interval was deemed practical as it balances the need to capture system actions against the time-consuming nature of manual data collection.
Figure 7 and
Figure 8 illustrate the same 30-minute time interval from 30 June 2021 at 1-second and 5-minute sampling resolutions, respectively. At a 1-second resolution, the 30-minute interval yields 1800 values per feature, while at a 5-minute resolution, only 6 values per feature are collected.
During the 30-minute interval, the heat pump was in operation for about 12 min and shut off for 18 min. To capture the operational pattern, the sampling resolution should be less than 12 min to prevent the complete energy production and consumption from being averaged out over one observation. The chosen five-minute resolution captures the start-and-stop, averaging the energy production and consumption over four of the six observations within the interval.
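The effect of downsampling can be reproduced with a synthetic example. The 12-minutes-on/18-minutes-off pattern below mirrors the interval described above, while the 150 kW load level and timestamps are illustrative stand-ins for the measured data:

```python
import numpy as np
import pandas as pd

# Hypothetical 1-second log of a 30-minute window: compressor runs for
# 12 minutes at an assumed 150 kW, then the machine is shut off.
idx = pd.date_range("2021-06-30 12:00", periods=1800, freq="s")
power_kw = np.where(np.arange(1800) < 720, 150.0, 0.0)
log = pd.DataFrame({"compressor_kw": power_kw}, index=idx)

# Downsampling to the 5-minute resolution used for the dataset: the
# start/stop pattern survives, but the bin spanning the switch-off
# holds a partial-load mean rather than either 0 or 150 kW.
five_min = log.resample("5min").mean()
```

With this alignment, two of the six 5-minute observations are at full load, one is a partial-load average, and three are zero; in the real data, where starts and stops rarely align with bin edges, more observations end up as averages.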
With a one-second resolution, the compressor electricity consumption (grey curve between the purple lines) is registered a few minutes before heat production (green curve). Similarly, the compressor stops a few minutes before heat production is no longer registered. This delay may be attributed to thermal inertia and to the available energy meters being designed primarily to capture accumulated energy production, with less emphasis on exact instantaneous measurements.
Through an investigation of comparable operational patterns, it was determined that there is no consistent time lag between the compressor’s start and the registration of heat production. Consequently, the heat production curve cannot be shifted uniformly by a constant time interval. The temperature sensors at the condenser are calibrated against the flow meter and follow the energy production curve. In contrast, the evaporator temperatures respond instantaneously to the compressor, exhibiting a decline precisely when the compressor initiates operation, followed by an immediate increase upon cessation of compressor operation. This observation is illustrated between the purple lines in
Figure 7. Further exploration of external sensors on the condenser side, compared to the sensor linked to the thermal energy meter, may provide valuable insights.
Another observation is that the compressor power registration appears as a smooth curve in the one-second resolution, while the other measurements exhibit jaggedness to varying degrees. The potential cause of this difference is variable sampling resolution for different sensors. A one-second resolution may be too high for some sensors, which could result in this discrepancy.
By using the five-minute sampling resolution, the delay between energy production and consumption is smoothed out over the interval while retaining information on the heat pump’s starting and stopping times within the observation period. However, one instance of delay between production and consumption remains in the example data: the final recorded production of 10 kWh/5 min occurs with 0 kWh/5 min energy consumption. This scenario was considered when processing the dataset.
2.6.3. Data Processing
The complete initial dataset consisted of 105,108 rows with eight features per row. The data were inspected manually, and the following rows were removed from the dataset:
All rows with one or more missing features, a total of 4684 rows.
As described in the previous section, a delay between the registration of energy consumption and production still occurred at the five-minute sampling resolution. Therefore, all rows with COP > 6.0 were removed from the training data, a total of 3764 rows. These rows represented less than 1% of the unprocessed dataset’s summarized energy production and consumption.
A delay of 20 to 70 min between an actual switch between heating and cooling mode and its registration occurred at random, even though the switch itself took place as the rule-based control strategy intended. Rows representing this delay, a total of 3576 rows, were removed because the TSP input is incorrect during the delay.
Days with planned heat pump maintenance and the machines shut off were removed from the data, totaling 3603 rows.
Rows with errors in the temperature sensors at the evaporator, totaling 1728 rows.
Eight days were excluded from the data to be used as a visual test set, see
Section 2.8.
All removed data were stored. The final cleaned dataset consisted of 89,304 rows, or 310 days of data, accounting for 85% of the year.
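The row filters listed above can be sketched as a single cleaning function. The column names (`cop`) and boolean flag columns (`mode_delay`, `maintenance`, `evap_temp_error`) are hypothetical stand-ins for the manual checks performed in the study:

```python
import pandas as pd

def clean_dataset(df):
    """Sketch of the row filters described above, under assumed column names.

    Assumes the mode-switch delays, maintenance days, and evaporator sensor
    errors have already been marked as boolean columns; in the study these
    were identified by manual inspection.
    """
    out = df.dropna()                      # rows with missing features
    out = out[out["cop"] <= 6.0]           # delayed production vs. consumption
    out = out[~out["mode_delay"]]          # heating/cooling switch delay
    out = out[~out["maintenance"]]         # planned shutdown days
    out = out[~out["evap_temp_error"]]     # faulty evaporator sensors
    return out
```

Keeping the removed rows in a separate store, as done in the study, makes it possible to revisit the filter thresholds (e.g., the COP > 6.0 cutoff) later without re-collecting data.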
Figure 9 represents the eight input values used to develop the heat pump ANNs. Each feature was sorted into 50 equally sized bins. The vertical axis represents the number of observations per bin.
The inlet temperature to the condenser ranges from 46 to 54 °C, with most observations clustered around 50 °C. Conversely, the inlet temperature of the evaporator displays a wider range of variation, ranging from 0 to 17 °C, with most observations falling between +5 and +7.5 °C. Notably, in heat mode, the evaporator’s inlet temperature is highly dependent on the temperature of the heat reservoir, i.e., the sewage temperature. The flow across the condenser is close to constant, centered at 11.0 L/s. The temperature setpoint indicates that the heat pump primarily operated in heat mode throughout most of the year, set to produce temperatures between 50 °C and 58 °C at the condenser outlet. Cooling priority, with setpoints at either 0 or 10 °C at the evaporator outlet, was only observed 3.9% of the time.
Furthermore, the outlet temperatures were shifted relative to their respective inlet temperatures, with an increase observed at the condenser and a decrease at the evaporator. Notably, the energy production and consumption data displayed considerable variability, ranging from the heat pump being shut off to operating at maximum capacity. However, some data points were observed more frequently than others.
2.7. Visual Test Data
Three distinct multi-day periods were selected from the dataset to represent different operational conditions of the heat pump. The input values for the selected days were then used to evaluate the performance of the models. These days were not part of the training data.
The first set of test days (TD1) spans from 27 January 2022 00:00 to 29 January 2022 23:55, representing winter operation conditions. During this period, the heat pump operated with all four compressors at 100% capacity for some hours, while most hours involved three compressors running at 100% and the fourth operating between 0% and 100%. During the final hours of the third day, one of the compressors was out of operation due to a failure.
The second set of test days (TD2), covering 28 June 2022 00:00 to 30 June 2022 23:55, represented summer operation conditions. Variations in heating and cooling priorities characterized this period. Notably, the cooling demand was so low in some periods that the heat pump operated in a continuous “start–stop” cycle.
The third set of test days (TD3) spanned from 16 November 2022 00:00 to 17 November 2022 23:55, representing fall operation conditions. During this period, the heat pump produced heat continuously but with frequent changes in capacity, a typical operation mode during fall and spring. The data recorded during this period also captured unexpected compressor stops.
2.8. Model Development and Training
After the optimal configurations were determined, the ANNs were trained using backpropagation, which iteratively adjusts the network weights. The ANN was implemented using Python 3.8 [
49] and the deep learning API Keras [
50], which runs on top of the ML platform TensorFlow [
51]. The dataset was normalized to scale the data between 0 and 1.
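The normalization step can be sketched as a per-feature min–max scaling; the function below is an illustrative implementation (the study may equally have used a library scaler), which also returns the parameters needed to scale new inputs and invert predictions:

```python
import numpy as np

def min_max_scale(x):
    """Scale each column of a 2-D array to [0, 1].

    Returns the scaled array together with the per-column minimum and
    range, so that new observations can be transformed consistently and
    model outputs mapped back to physical units.
    """
    lo, hi = x.min(axis=0), x.max(axis=0)
    span = np.where(hi > lo, hi - lo, 1.0)  # guard against constant columns
    return (x - lo) / span, lo, span
```

Storing `lo` and `span` alongside the trained model is essential: predictions for unseen data are only valid if the same scaling parameters are reused.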
During the training process, the ANN was fed inputs and corresponding desired outputs, and the weights and biases were updated using the difference between the predicted and expected outputs. The training continued until the model’s output was sufficiently close to the expected output. To avoid overfitting during the training process, cross-validation was performed on 80% of the data, while the remaining 20% was used to test the performance of the model. The training procedure was executed in batch mode, and Bayesian optimization was used to define other parameters, such as the learning rate and backpropagation optimizer. The set of optimized hyperparameters for the constructed models is presented in
Table 2.
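A minimal sketch of this training setup in Keras is shown below. The layer width, activation, learning rate, batch size, and epoch count are placeholder assumptions, since in the study these hyperparameters were selected by Bayesian optimization and are reported in Table 2:

```python
from tensorflow import keras

def build_model(n_inputs, n_outputs, units=32, lr=1e-3):
    """Illustrative feed-forward ANN; not the optimized configuration."""
    model = keras.Sequential([
        keras.Input(shape=(n_inputs,)),
        keras.layers.Dense(units, activation="relu"),
        keras.layers.Dense(n_outputs, activation="linear"),
    ])
    model.compile(optimizer=keras.optimizers.Adam(learning_rate=lr),
                  loss="mse")
    return model

# Batch training with a held-out validation split, mirroring the
# 80/20 procedure described in the text (data assumed pre-scaled):
# model = build_model(n_inputs=8, n_outputs=4)
# model.fit(x_train, y_train, validation_split=0.2,
#           batch_size=64, epochs=100)
```

In Keras, `fit` performs the backpropagation weight updates described above; early stopping on the validation loss is a common additional guard against overfitting.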
5. Conclusions
In this study, two alternative heat pump model structures based on Artificial Neural Networks (ANN) are developed and presented. The strengths and limitations of the ANN are highlighted, and the importance of having accurate and sufficient data for developing and using the models is emphasized.
Model A predicts energy production and consumption with a MAPE below 10% and outlet temperatures with a MAPE between 2 and 5%, except during cooling mode. While there are some errors in Model A’s instantaneous predictions of energy production and consumption, summing the predictions to hourly values yields a satisfactory prediction of the operation, with a MAPE of 3% or less. Model A rapidly identifies compressor failures and unexpected stops and can be developed further into a system-specific fault detection tool. Model B is an excellent predictor of energy consumption and COP, providing industry practitioners with a tool to continuously improve design evaluations and building owners with a means to monitor performance. Except during cooling mode, the MAPE is never above 5%.
The approach presented in this article is straightforward and provides the user with a fast method to develop system-specific models of heat pump energy consumption and COP when no knowledge of the internal heat pump configuration is available. This information is often available only from supplier software that is typically kept in-house [
55] or otherwise demands advanced physics-based modeling. Though a physics-based model allows for more general evaluations, the ANN offers a quick and precise method to capture the dynamics of the available data. The most crucial factor for success is a sufficiently clean and synchronized dataset representing a selection of relevant operational conditions.
Emphasizing the importance of “data janitoring”, we also demonstrate that delays between measured energy consumption and production, as well as limitations of the energy meters and temperature sensors, can be sources of uncertainty in the modeling. The main takeaway for mechanical engineers responsible for thermal energy plant design is the necessity of accurate and synchronized data. To improve the models further, we plan to evaluate higher sampling resolutions and to investigate the use of external temperature sensors at the condenser.
Implementing appropriate tools can enable operators of thermal energy plants to identify faults and errors in the system earlier than they do presently and to enhance the system’s operation through mathematical optimization [
33]. The main target of this article was to present a method to develop a quick and precise heat pump model using readily available data. While the TriCEP study case has been deemed successful in meeting design targets for energy results, several operational conditions with potential for improvement have been identified. A smart controller based on the developed heat pump models could be employed to evaluate the following tasks [
33]:
Setting heating and cooling priority.
Reducing the number of start–stops of the compressor, especially when the machine is set to cooling priority and the demand is limited or non-existent.
Determining when to prioritize the heat pump or the solar collectors for heating DHW.
Continuously evaluating and updating the temperature setpoints of the heat pumps to account for influences beyond the ambient temperature, typically energy cost and weather forecasts.
Detecting degradation in the heat pump or the surrounding system.