Seeding the initial population of multi-objective evolutionary algorithms: A computational study
Introduction
In many real-world applications, trade-offs between conflicting objectives play a crucial role. As an example, consider engineering a bridge, where one objective might be the cost of construction and another the durability of the bridge. For such problems, we need specialized optimizers that determine the Pareto front of mutually non-dominated solutions. There are several established multi-objective evolutionary algorithms (MOEAs) and many comparisons on various test functions. However, most of them start with random initial solutions.
If prior knowledge exists or can be generated at a low computational cost, good initial estimates may generate better solutions with faster convergence. These good initial estimates are often referred to as seeds, and the method of using good initial estimates is referred to as seeding. These botanical terms are used to express the possibility that good solutions for the environment can develop from these starting points. In practice, a good initial seeding can make problem solving approaches competitive that would otherwise be inferior.
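The mechanism itself is simple: part of the random initial population is replaced by precomputed seed solutions. The following is a minimal sketch of this idea; the function names, the seed fraction of 25%, and the toy two-variable encoding are our own illustrative assumptions, not the setup used in the experiments.

```python
import random

def seeded_population(seeds, random_individual, pop_size, seed_fraction=0.25):
    """Build an initial population in which a fraction of the random
    individuals is replaced by precomputed seed solutions (hypothetical
    helper for illustration)."""
    n_seeds = min(len(seeds), int(seed_fraction * pop_size))
    population = [random_individual() for _ in range(pop_size - n_seeds)]
    population += random.sample(seeds, n_seeds)
    random.shuffle(population)
    return population

# Example: individuals with two variables in [0, 1]^2, two hand-crafted seeds
pop = seeded_population(
    seeds=[[0.0, 0.0], [1.0, 1.0]],
    random_individual=lambda: [random.random(), random.random()],
    pop_size=8,
)
```

Varying `seed_fraction` corresponds to the 25-75% seeding percentages studied in the single-objective literature discussed above.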
For single-objective evolutionary algorithms, methods such as seeding have been studied for about two decades; see, e.g., [17], [20], [23], [26], [30], [41] for studies and examples (see [27] for a recent categorization). For example, the effects of seeding for the Traveling Salesman Problem (TSP) and the job-shop scheduling problem (JSSP) were investigated in [32]. The algorithms were seeded with known good solutions in the initial population, and it was found that the results were significantly improved on the TSP but not on the JSSP. To investigate the influence of seeding on the optimization, a varying percentage of seeding was used, ranging from 25 to 75%. Interestingly, it was also pointed out that seeding 100% of the population is not necessarily successful on either problem [28]. This is one of the very few reports showing that seeding can be beneficial to an optimization process in some cases, but not necessarily in all. In [21] a seeding technique for dynamic environments was investigated. There, the population was seeded whenever a change in the objective landscape occurred, aiming at a faster convergence to the new global optimum. Again, some of the investigated seeding approaches were more successful than others.
One of the very few studies that can be found on seeding techniques for MOEAs is the one performed by Hernandez-Diaz et al. [22]. There, seeds were created using gradient-based information. These were then fed into the algorithm called Non-Dominated Sorting Genetic Algorithm II (NSGA-II, [10]) and the quality was assessed on the benchmark family ZDT ([44], named after the authors Zitzler, Deb, and Thiele). The results indicate that the proposed approach can produce a significant reduction in the computational cost of the approach.
In general, seeding is not well documented for multi-objective problems, even for real-world problems. If seeding is done, then typically the approach is outlined and used with the comment that it worked in “preliminary experiments”—the reader is left in the dark on the design process behind the used seeding approach. This is quite striking as one expects that humans can construct a few solutions by hand, even if they do not represent the ranges of the objectives well. The least that one should be able to do is to reuse existing designs, and to modify these iteratively towards extremes. Nevertheless, even this manual seeding is rarely reported.
In this paper, we investigate the effects of two structurally different seeding techniques for five algorithms on 48 multi-objective optimization (MOO) problems.
For seeding, we use the weighted-sum method, where the trade-off preferences are specified by non-negative weights for each objective. Solutions to these weighted sums of objectives can be found with an arbitrary classical single-objective evolutionary algorithm. In our experiments we use the Covariance Matrix Adaptation Evolution Strategy (CMA-ES, [18]). Details of the two studied weighting schemes are presented in Section 2.1.
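The weighted-sum scalarization can be sketched as follows. This is a toy illustration, not the experimental setup: the bi-objective problem is made up, and plain random search stands in for CMA-ES as the single-objective optimizer.

```python
import random

def weighted_sum(objectives, weights):
    """Scalarize a vector-valued objective with non-negative weights."""
    def scalar(x):
        return sum(w * f(x) for w, f in zip(weights, objectives))
    return scalar

# Toy bi-objective problem on [0, 1]: f1(x) = x^2, f2(x) = (x - 1)^2
f1 = lambda x: x * x
f2 = lambda x: (x - 1) ** 2

def minimize(fn, evals=2000):
    """Stand-in for CMA-ES: plain random search on [0, 1]."""
    return min((random.random() for _ in range(evals)), key=fn)

# One seed per weight vector; varying the weights traces the trade-off curve
seeds = [minimize(weighted_sum([f1, f2], [w, 1 - w])) for w in (0.0, 0.5, 1.0)]
```

Each weight vector yields one seed; the extreme weight vectors (1, 0) and (0, 1) recover the single-objective optima at the ends of the Pareto front.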
There are different ways to measure the quality of the solutions. A measure that has recently become very popular is the hypervolume indicator, which measures the volume of the objective space dominated by the set of solutions relative to a reference point [43]. Its disadvantages are its high computational complexity [4], [3] and the arbitrary choice of the reference point. We instead consider the mathematically well-founded approximation constant. In fact, it is known that the worst-case approximation obtained by optimal hypervolume distributions is asymptotically equivalent to the best worst-case additive approximation constant achievable by all sets of the same size [6]. For a rigorous definition, see Section 2. This notion of multi-objective approximation was introduced by several authors [19], [15], [31], [35], [36] in the 1980s, and its theoretical properties have been extensively studied [9], [12], [33], [34], [37].
We use the jMetal framework [13] and its implementation of NSGA-II [10], the Strength Pareto Evolutionary Algorithm (SPEA2, [45]), the S-Metric Selection Evolutionary Multi-Objective Algorithm (SMS-EMOA, [14]), and the Indicator Based Evolutionary Algorithm (IBEA, [42]). In addition to these more classical MOEAs, we also study Approximation Guided Evolution (AGE, [7]), which aims at directly minimizing the approximation constant and has been shown to perform very well for larger dimensions [38], [39], [40]. For each of these algorithms we compare their regular behavior after a certain number of iterations with their performance when initialized with a certain seeding.
We compare the aforementioned algorithms on four common families of benchmark functions. These are DTLZ ([11], named after the authors Deb, Thiele, Laumanns and Zitzler), LZ09 ([29], named after the authors Li and Zhang), WFG ([24], named after the authors’ research group Walking Fish Group) and ZDT [44]. While the last three families only contain two- and three-dimensional problems, DTLZ can be scaled to an arbitrary number of dimensions.
Preliminaries
We consider minimization problems with d objective functions, where d ≥ 2 holds. Each objective function fi, 1 ≤ i ≤ d, maps from the considered search space S into the real values. In order to simplify the presentation we only work with the dominance relation on the objective space and mention that this relation transfers to the corresponding elements of S.
For two points x = (x1, …, xd) and y = (y1, …, yd), with x, y ∈ ℝ^d, we define the following dominance relation: x ⪯ y (x weakly dominates y) if and only if xi ≤ yi for all 1 ≤ i ≤ d.
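This relation is straightforward to state in code; the following is a minimal sketch for the minimization setting above, with function names of our own choosing:

```python
def weakly_dominates(x, y):
    """x weakly dominates y (minimization) iff x_i <= y_i for all i."""
    return all(xi <= yi for xi, yi in zip(x, y))

def dominates(x, y):
    """Strict dominance: x weakly dominates y and is strictly better in
    at least one objective."""
    return weakly_dominates(x, y) and any(xi < yi for xi, yi in zip(x, y))

print(dominates((1.0, 2.0), (2.0, 2.0)))  # True
print(dominates((1.0, 3.0), (2.0, 2.0)))  # False: mutually non-dominated
```

The Pareto front consists exactly of those objective vectors not strictly dominated by any other attainable vector.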
Experimental setup
We use the jMetal framework [13], and our code for the seeding as well as all used seeds are available online.2 As test problems we used the benchmark families DTLZ [11], ZDT [44], LZ09 [29], and WFG [24]. We used the functions DTLZ 1-4, each with 30 function variables and with d ∈ {2, 4, 6, 8} objective values/dimensions.
In order to investigate the benefits of seeding even in the long run, we limit the calculations of the algorithms to a
Experimental results
Our results are summarized in Table 1, Table 2. They compare the approximation constant achieved with CornersAndCentre seeding (Table 1) and LinearCombinations seeding (Table 2) with the same number of iterations without seeding. As the seeding itself requires a number of fitness function evaluations (10⁴ for CornersAndCentre and 10⁵ for LinearCombinations), we allocate the seeded algorithms fewer fitness function evaluations. This makes it harder for the seeded algorithms to outperform its
Conclusions
Seeding can result in a significant reduction of the computational cost and the number of fitness function evaluations needed. We observe that there is an advantage on many common real-valued fitness functions even if computing an initial seeding reduces the number of fitness function evaluations available for the MOEA. For some functions we observe a dramatic improvement in quality and needed runtime (e.g., DTLZ4 and the LZ09 family).
For practitioners, our results show that it can be
Acknowledgements
The research leading to these results has received funding from the Australian Research Council (ARC) under grant agreement DP140103400 and from the European Union Seventh Framework Programme (FP7/2007-2013) under grant agreement no. 618091 (SAGE).
References
- et al., Approximating the volume of unions and intersections of high-dimensional geometric objects, Comput. Geom.: Theory Appl. (2010)
- et al., Approximating the least hypervolume contributor: NP-hard in general, but fast in practice, Theor. Comput. Sci. (2012)
- et al., Approximation quality of the hypervolume indicator, Artif. Intell. (2013)
- et al., Linkage learning through probabilistic expression, Comput. Methods Appl. Mech. Eng. (2000)
- et al., An empirical investigation of meta-heuristic and heuristic algorithms for a 2D packing problem, Eur. J. Oper. Res. (2001)
- et al., A hybrid genetic algorithm for the design of water distribution networks, Eng. Appl. Artif. Intell. (2005)
- A hybrid genetic algorithm for the open shop scheduling problem, Eur. J. Oper. Res. (2000)
- et al., Efficiently computing succinct trade-off curves, Theor. Comput. Sci. (2005)
- et al., Efficient optimization of many objectives by approximation-guided evolution, Eur. J. Oper. Res. (2015)
- et al., A hybrid genetic algorithm for the fitting of models to electrochemical impedance data, J. Electroanal. Chem. (2002)
- Simulated Binary Crossover for Continuous Search Space, Technical report
- Faster hypervolume-based search using Monte Carlo sampling
- Parameterized average-case complexity of the hypervolume indicator
- Approximation-guided evolutionary multi-objective optimization
- Bicriterion single machine scheduling with resource dependent processing times, SIAM J. Optim.
- How good is the Chord algorithm?
- A fast and elitist multiobjective genetic algorithm: NSGA-II, IEEE Trans. Evolut. Comput.
- Scalable test problems for evolutionary multiobjective optimization
- Small approximate Pareto sets for biobjective shortest paths and other problems, SIAM J. Comput.
- The jMetal framework for multi-objective optimization: design and architecture
- An EMO algorithm using the hypervolume measure as selection criterion
- Methods of numerical solution of multicriterion problem