Keywords

1 Introduction

River systems play a crucial role in the sustainable development of a community. However, industry and living activities are creating more and more pollutants to freshwater sources. It is estimated that 40 billion dollars are lost each year in China due to freshwater pollution events [1]. Water quality monitoring has become one of the routine efforts for environmental protection all over the world. However, monitoring water quality remains a very complex process due to the large number of factors to consider such as monitoring locations, selection of water quality parameters, monitoring frequency and identification of monitoring objectives [2]. The costs of building and operating an automatic monitoring station are also very high (about 500,000–600,000 dollars per station for construction and 14,000 dollars per year for operating and maintaining). Planning and optimizing water quality monitoring networks have been addressed since 1940s. Dozens of papers have been published on this subject [3,4,5,6].

In this paper, we proposed a modified MOPSO algorithm to optimize the design of water quality monitoring network based on three optimization objectives: maximum pollution detection probability, minimum pollution detection time and maximum centrality of a monitoring network. Two discrete functions were developed to calculate particle’s velocity and update particle’s position respectively. Experimental results show that our algorithm can get steady Pareto frontiers and obtain better optimal deployment solutions than genetic algorithm (GA).

2 Methodology

2.1 Hypothetical River Network

To compare our study results with the achievements given by the literature (Ouyang et al. [6]; Telci et al. [7, 11]), we use the same hypothetical river network as Fig. 1 shows, there are 6 inlet locations (1, 3, 5, 8, 10, 11), 5 intermediate locations (2, 4, 6, 7, 9) and 1 outlet location (12) in the river network. We assume that a pollution event can occur at any location randomly with the same amount of pollutant spilling and there is only one pollution event at each time. We simulate the water flows for 24 h from 00:00 to 23:59 with a steady water flow of 10 ft3/s for each inlet location. The pollution event occurs at 10:00 and lasts for 1 h during the simulation. We also assume that the pollutant concentration is 10 mg/L. The remaining characteristics of the river network is shown in Table 1, which is the same as Telci used [7].

Fig. 1.
figure 1

Hypothetical river network

Table 1. Hydraulic characteristics of the river network

2.2 Hydraulic Simulations

The Storm Water Management Model (SWMM) is a dynamic rainfall-runoff simulation model used for single event or long-term (continuous) simulation of runoff quantity and quality from primarily urban areas [8]. It is widely used for dynamically simulating storm water runoff and drainage systems in urban areas. Here we use SWMM to simulate the hydraulic model, pollution events and pollutants transport along the river system.

We set the pollution detection threshold to 0.01 mg/L and run hydraulic simulations in SWMM. Table 2 shows the simulation results of pollution detection time for each potential monitoring location. The “_” in Table 2 represents an infinite value, which means the pollution event cannot be successfully detected at a monitoring location.

Table 2. Pollution detection time at each potential locations

2.3 Optimization Objectives

(a) Minimum Pollution Detection Time and Maximum Pollution Detection Probability

Telci et al. [7] proposed a real-time optimal monitoring network design in river networks. They designed two optimization objectives of minimum pollution detection time and maximum pollution detection. A genetic algorithm (GA) was used to optimize the water quality monitoring network according to these two optimization objectives. Here we also use the same two objectives in our algorithm. The detailed definition of these two optimization objectives can be found in [7].

(b) Maximum Centrality of Monitoring Network

We argue that in practical environment, different potential monitoring locations may have different monitoring priorities, which should be considered when we design a monitoring network. In river network simulations, graph theory and network analysis are usually used to model river systems. The centrality is one of the most important indicators in graph theory. It can identify the importance of vertexes in a graph or network. Here we use the closeness centrality as an evaluation criterion of location priority. The closeness centrality for each potential monitoring location is described as Formula 1.

$$ {\text{C}}\left( {\text{i}} \right) = \left[ {\frac{{\mathop \sum \nolimits_{j = 1}^{m} d\left( {i,j} \right)}}{m - 1}} \right]^{ - 1} $$
(1)

where m is the total number of potential monitoring locations, d(i, j) is the length from location i to location j. The total closeness centrality for a potential deployment solution S k is shown in Formula 2.

$$ {\text{C}}\left( {S_{K} } \right) = \mathop \sum \limits_{i = 1}^{n} C\left( {S_{ki} } \right) $$
(2)

where n is the number of monitoring devices deployed in a river network, S ki is a monitoring location in deployment solution S k .

The third optimization objective is to maximize the total closeness centrality for all deployment solutions, which is shown in Formula 3.

$$ C(S) = { \hbox{max} }\{ C(S_{1} ),C(S_{2} ), \ldots ,C(S_{T} )\} $$
(3)

where T is the total number of potential deployment solutions.

2.4 MOPSO Algorithm

Assume that we will deploy n monitoring devices in a river system out of m potential monitoring locations (n ≤ m). It is easy to know that the total number of potential deployment combinations T is:

$$ T = C_{m}^{n} $$
(4)

We can find from Formula 4 that when we increase the value of m and/or n, the number of deployment solutions will be also increased exponentially, which is too large to calculate using enumeration search methods within a reasonable time. In addition, these optimization objectives normally conflict with each other, which means we aim to find some good trade-off solutions among multi objectives [9, 10].

Multi-objective Particle Swarm Optimization (MOPSO) is one of the popular evolution algorithms used in recent year. Coello et al. [10] compared MOPSO against three state-of-the art multi-objective evolutionary algorithms of Nondominated Sorting Genetic Algorithm II (NSGA-II), Pareto Archived Evolution Strategy (PAES) and Microgenetic Algorithm for Multi-Objective Optimization (MicroGA) using 5 different test functions. Experimental results show that MOPSO has a highly competitive performance and can be considered a viable alternative to solve multi-objective optimization problems with low computational time. Here we use MOPSO to design an optimal water quality monitoring network. The velocity and position of particles during the computing iteration are updated by the following equations:

$$ V_{i} \left( {t + 1} \right) = w\;V_{i} \left( t \right) + c_{1} r_{1} \left( {pbest\left( {i,t} \right) - P_{i} \left( t \right)} \right) + c_{2} r_{2} \left( {gbest\left( t \right) - P\left( t \right)} \right) $$
(5)
$$ P_{i} \left( {t + 1} \right) = P_{i} \left( t \right) + V_{i} \left( {t + 1} \right) $$
(6)

where V denotes the particle’s velocity, w is an inertia weight constant, r1 and r2 are uniformly distributed random variables within range [0, 1], pbest(i, t) is the best position that the particle i has had, gbest(t) is the best position in all current particles, and c 1 and c2 are positive constant coefficients for acceleration.

The classical MOPSO is a powerful algorithm to get global optimal results for continuous definition domains. However, it cannot be applied to discrete problems directly. In this paper, we redesign the initialization procedure and the velocity and position calculation function to optimize this discrete issue.

(a) Particle Design and Swarm Initialization

Assume we select n locations to deploy water quality monitoring devices out of m potential monitoring locations in a river network (n ≤ m). Each potential monitoring location is named from 1 to m respectively resulting in a location set S = {1, 2, 3, …, m}. Each particle in a swarm denotes a deployment solution with n monitoring locations. Therefore, each particle has n positions and each position represents a monitoring location in set S. As a result, particle P can be defined as a vector with n elements shown in Formula 7.

$$ \begin{array}{*{20}c} {P = \, [p_{1} p_{2} \; \ldots \; \, p_{i} \; \ldots \;p_{n} ]} \\ {subject\;to\;n\; \le \;m\;\& \;1\; \le \;p_{i} \; \le \;m} \\ \end{array} $$
(7)

Assume we create k initial particles, we use a random integer function to initialize n positions and velocities for each particle in MOPSO. The swarm initialization procedure is as Fig. 2 shows. The position value for all the particles is constrained between 1 and m. All the velocities are initialized to 0.

Fig. 2.
figure 2

Pseudocode of MOPSO initialization

(b) Velocity and Position Updating

Equations 5 and 6 show that the original velocity and position in MOPSO are both real values. However, we use integers to denote monitoring locations in a particle. Figure 3 shows a procedure to update particle’s velocity and position. We let MaxVel be a maximum velocity during calculation. A round function is used to calculate a new integer value of velocity for each particle based on current gbest and pbest particles. Because the new velocity may be out of the boundary of [−MaxVel, +MaxVel], we use max and min functions to restrict the velocity scope. The detailed velocity updating process is shown in Fig. 4.

Fig. 3.
figure 3

Pseudocode of velocity and position updating

Fig. 4.
figure 4

Velocity updating process

3 Simulations and Analysis

We run the MOPSO algorithm several times based on data in Table 2 and get several Pareto frontiers as Fig. 5 shows. The simulation results show that these Pareto frontiers are almost the same except for few (one or two) Pareto frontier particles. It means that the simulation results of MOPSO algorithm are quite steady and we can use it to design an optimal water quality monitoring network. Table 3 shows the optimal deployment solutions based on the Pareto frontier particles.

Fig. 5.
figure 5

Hypothetical river network

Table 3. Optimal deployment solutions

We can find from Table 3 that if we deploy 3 monitoring devices at locations 6, 9 and 12, we can detect all the potential pollution events and the pollution detection time is only 45.8 min. The centrality of this monitoring network is 0.0414. If we deploy monitoring devices at locations 6, 7 and 12, potential pollution events can still be all detected. However, the pollution detection time is increased to 54.8 min. If the monitoring devices are deployed at locations 4, 6 and 9. The pollution detection time is significantly decreased to 34.9 min while the detection probability is slightly decreased to 91.7%, which is also the second highest pollution detection probability solution in our deployment solutions and we can get a better centrality of 0.05. However, the second highest pollution detection probability in Telci’s paper is only 83%.

4 Conclusions and Future Work

We presented a novel method based on a modified MOPSO algorithm to design an optimal water quality monitoring network with three optimization objectives of minimum pollution detection time, maximum pollution detection probability, maximum closeness centrality. Results show that the modified MOPSO can get a better optimal deployment solutions than GA.

In the future, this novel approach will be applied to a real case of water quality monitoring network. Further research is planned to explore the feasibility of redesigning the velocity and position calculation procedure to avoid same positions in a particle, which can further improve the computing performance.