Elsevier

Expert Systems with Applications

Volume 70, 15 March 2017, Pages 67-82
Expert Systems with Applications

Analyzing the effects of binarization techniques when solving the set covering problem through swarm optimization

https://doi.org/10.1016/j.eswa.2016.10.054Get rights and content

Highlights

  • We study the impact of binarization methods when solving the set covering problem.

  • We apply a metaheuristic inspired by the behavior of cats for solving the problem.

  • We consider forty binarization methods and a freely available dataset.

  • We conclude that it is crucial to select an adequate binarization method.

Abstract

The Set Covering Problem (SCP) is one of the classical Karp’s 21 NP-complete problems. Although this is a traditional optimization problem, we find many papers assuming metaheuristics for solving the SCP in the current literature. However, while the SCP is a discrete problem, most metaheuristics are defined for solving continuous optimization problems, specially Swarm Intelligence Algorithms (SIAs). Hence, such algorithms should be adapted for working on the discrete scope, but most authors did not perform any study to select a concrete binarization approach. This situation might lead to the conclusion that selecting a concrete binarization technique does not influence the behavior of the algorithm, but rather the general approach of the metaheuristic. This circumstance led us to write this paper focusing on the inherent difficulty in binarization of metaheuristics designed for continuous optimization, when solving a discrete optimization problem, concretely the SCP. To this end, we consider a recent SIA inspired by the behavior of cats and adapted to the discrete scope, which is called Binary Cat Swarm Optimization (BCSO). We replace the binarization technique assumed in the original BCSO by forty different approaches from the current literature. The results obtained while solving a standard SCP benchmark are analyzed through a widely accepted statistical method, concluding that it is crucial to select an adequate binarization approach to ensure that the solving algorithm reaches its full potential. Thus, the task of adapting a metaheuristic to the discrete scope is not a simple matter and should be carefully studied. To this end and as a result of this study, we give some recommendations to perform this task.

Introduction

The Set Covering Problem (SCP) is one of the classical 21 problems shown to be NP-complete by Karp (1972) and whose optimization version is NP-hard (Garey & Johnson, 1979). Although the SCP is a traditional optimization problem, it is widely considered in the current literature for designing expert systems, which emulate the decision-making ability of human experts in a given field (Reggia, Nau, & Wang, 1983). For example, we find works considering the SCP for facility location (Farahani, Asgari, Heidari, Hosseininia, & Goh, 2012), ship scheduling (de Mare, Spliet, & Huisman, 2014), production planning (Adulyasak, Cordeau, & Jans, 2015), crew scheduling (Chen, Shen, 2013, Juette, Thonemann, 2012 Chen, Shen, 2013, Juette, Thonemann, 2012), vehicle routing (Bai, Xue, Chen, Roberts, 2015, Cacchiani, Hemmelmayr, Tricoire, 2014, Vidal, Crainic, Gendreau, Prins, 2013), musical composition (Simeone, Nouno, Mezzadri, & Lari, 2014), information retrieval (Zhang, Wei, & Chen, 2014), and territory design (Elizondo-Amaya, Rios-Mercado, & Diaz, 2014), among many others.

Chvatal (1979) defined the SCP as follows. Given a set M of m objects and a collection S of n sets of these objects, each set with a non-negative cost associated. The goal is to find a minimum cost family of subsets CS, such that each element iM belongs to at least one subset of the family C.

Some authors solved the SCP by applying exact techniques, such as branch-and-bound and branch-and-cut algorithms. However, such methods are not recommended for solving this type of complex problems, because computational times rise exponentially with the problem dimension.

Instead, approximate techniques should be considered, such as metaheuristics. This type of techniques is successfully considered in the literature for solving NP-hard problems from different fields, including the SCP (Dasgupta & Michalewicz, 2013). However, while the SCP is defined as a discrete optimization problem, many metaheuristics are designed for solving continuous optimization problems, specially Swarm Intelligence Algorithms (SIAs). Thus, such metaheuristics should be adapted for working on the discrete search space.

There are many SIAs adapted for solving general discrete optimization problems, such as Binary Gravitational Search Algorithm (BGSA) (Rashedi, Nezamabadi-Pour, & Saryazdi, 2010), Binary Magnetic Optimization Algorithm (BMOA) (Mirjalili & Hashim, 2012), and Binary Cat Swarm Optimization (BCSO) (Sharafi, Khanesar, & Teshnehlab, 2013). Usually, the algorithms are adapted by following the two-step binarization method introduced by Kennedy and Eberhart (1997) in their approach of Binary Particle Swarm Optimization (BPSO) for transforming real numbers into binary ones. In this case, the authors explained how to get a new binary solution according to the particle velocity, which is a real number. The method followed by the authors is as follows. Firstly, we map the real value to a number in the interval [0, 1] through a transfer function. Secondly, we transform the number in the interval [0, 1] into a binary value through a discretization function. In this line, there are eight major transfer functions and five major discretization functions in the current literature, denoted as S1,S2,,S4,V1,V2,,V4 and D1,D2,,D8, respectively (Crawford, Soto, Peña, Riquelme-Leiva, Torres-Rojas, Johnson, et al., 2015c, Mirjalili, Lewis, 2013).

Most authors did not do any study to select a concrete binarization approach when adapting a metaheuristic. This situation might lead to the conclusion that selecting a concrete binarization technique does not influence the behavior of the algorithm, but rather the general approach of the metaheuristic. To the best of our knowledge, this is the first work performing this study in the literature. We do the following three main tasks throughout this study:

  • We select a recent SIA from the current literature, which was initially designed for continuous optimization and later adapted to the discrete scope. Specifically, the BCSO algorithm inspired by the behavior of cats, whose original continuous approach was proposed by Chu, Tsai, and Pan (2006).

  • The authors of BCSO considered a transfer and discretization function, without performing any formal study to this task. We change the original formulation of BCSO by combining the eight transfer functions and the five discretization functions introduced before, i.e., we get forty BCSO approaches.

  • We apply the forty BCSO approaches for solving two freely available SCP sets. We study the results obtained through an accepted statistical methodology to analyze if selecting a binarization technique influences the behavior of the metaheuristic.

The rest of this paper is structured as follows. We list the acronyms considered in Table 1. In Section 2, we discuss the related work, including the major motivations for performing this work. In Section 3, we give a formal SCP definition, including a problem example. In Section 4, we explain the BCSO metaheuristic. In Section 5, we describe the transfer and discretization functions in this study. In Section 6, we discuss the experimental method followed and the results obtained. In Section 7, we give some implementation details. Finally, conclusions are left for Section 8.

Section snippets

Related work

We find many works solving the SCP. Starting with exact algorithms, (Beasley, Jornsten, 1992, Fisher, Kedia, 1990), and (Balas & Carrera, 1996) considered branch-and-bound and branch-and-cut techniques. Bar-Yejuda and Even (1981); Beasley (1987), and El-Darzi and Mitra (1990) considered linear programming. Caprara, Fischetti, and Toth (2000) compared several exact algorithms for optimizing the SCP, reaching that the best exact algorithm is CPLEX. This type of exact techniques is not suitable

Set covering problem

In this section, we provide a formal statement of the SCP and a small location problem as an example of this formulation.

Binary cat swarm optimization

Domestic cats show great abilities for hunting and being alert to possible dangers (Adamec, 1976, Adler, 1995). Based on this idea, Chu et al. (2006) proposed an SIA inspired by the behavior of real cats. This metaheuristic was later adapted to the discrete scope by Sharafi et al. (2013). The authors identified two main modes of behavior for simulating cats:

  • Seeking mode: it simulates the situation in which a cat is looking around, calculating, and evaluating the next movement.

  • Tracing mode: it

Transfer and discretization functions

In this section, we discuss the transfer and discretization functions considered for addressing the steps 3 and 4 of the tracing mode described in Section 4.2.

Experimentation

In this section, we discuss both the experimental method followed and the experimental results obtained.

Implementation details

Both the problem definition introduced in Section 3 and the solving methodology discussed in Section 4 were coded in Java assuming NetBeans IDE 7.1 and executed on a 2.53  GHz Intel Core i3 M380 processor with 3 GB RAM under Windows 7. Regarding the statistical tools, the Wilcoxon–Mann–Whitney’s test was taken from Fonseca, Knowles, Thiele, and Zitzler and both Shapiro–Wilk’s and Kolmogorov–Smirnov–Lilliefor’s tests were taken from the IBM SPSS software.

Final remarks

The SCP is a traditional optimization problem widely considered for designing expert systems. We find many papers assuming metaheuristics for solving the SCP in the current literature. However, many metaheuristics are defined for solving continuous optimization problems, specially SIAs, while the SCP is a discrete problem. Hence, such algorithms should be adapted for working on the discrete scope. However, most authors did not perform any study to select a concrete binarization approach. This

Acknowledgements

The authors thank the anonymous referees for comments and suggestions which have led to an improved version of this paper. Broderick Crawford is supported by grant CONICYT/ FONDECYT/ REGULAR/ 1140897. Ricardo Soto is supported by grant CONICYT/ FONDECYT/ REGULAR/ 1160455. This research was partially funded by CORFO Program Ingeniería 2030 PUCV - Consortium of Chilean Engineering Faculties.

References (77)

  • R.Z. Farahani et al.

    Covering problems in facility location: A review

    Computers & Industrial Engineering

    (2012)
  • T.A. Feo et al.

    A probabilistic heuristic for a computationally difficult set covering problem

    Operations Research Letters

    (1989)
  • S. Mirjalili et al.

    A study of different transfer functions for binary version of particle swarm optimization

    Swarm and evolutionary computation

    (2011)
  • S. Niroomand et al.

    Exact mathematical formulations and metaheuristic algorithms for production cost minimization: A case study of the cable industry

    International Transactions in Operational Research

    (2015)
  • R.V. Rao et al.

    Teaching–learning-based optimization: A novel method for constrained mechanical design optimization problems

    Computer-Aided Design

    (2011)
  • E. Rashedi et al.

    Bgsa: Binary gravitational search algorithm

    Natural Computing

    (2010)
  • Z. Ren et al.

    New ideas for applying ant colony optimization to the set covering problem

    Computers & Industrial Engineering

    (2010)
  • S.S. Shapiro et al.

    An analysis of variance test for normality (complete samples)

    Biometrika

    (1965)
  • A. Singh

    An artificial bee colony algorithm for the leaf-constrained minimum spanning tree problem

    Applied Soft Computing

    (2009)
  • R. Soto et al.

    Pre-processing, repairing and transfer functions can help binary electromagnetism-like algorithms

    Artificial intelligence perspectives and applications

    (2015)
  • T. Vidal et al.

    Heuristics for multi-attribute vehicle routing problems: A survey and synthesis

    European Journal of Operational Research

    (2013)
  • X.-S. Yang

    Firefly algorithm, stochastic test functions and design optimisation

    International Journal of Bio-Inspired Computation

    (2010)
  • K. Zyma et al.

    Teacher training enhances the teaching-learning-based optimization metaheuristic when used to solve multiple-choice multidimensional knapsack problems

    International Journal of Metaheuristics

    (2015)
  • H. Adler

    Some factors of observation learning in cats

    The Journal of Genetic Psychology

    (1995)
  • U. Aickelin

    An indirect genetic algorithm for set covering problems

    Computers & Operations Research

    (2004)
  • E. Balas et al.

    A dynamic subgradient-based branch-and-bound procedure for set covering

    Operations Research

    (1996)
  • R. Bar-Yejuda et al.

    A linear-time approximation algorithm for the weighted vertex cover problem

    Journal of Algorithms

    (1981)
  • J. Beasley

    A lagrangian heuristic for set covering problems

    Naval Research Logistics

    (1990)
  • Beasley, J. E. (2016). Or-library....
  • c.I. Birbil et al.

    An electromagnetism-like mechanism for global optimization

    Journal of Global Optimization

    (2003)
  • A. Caprara et al.

    A heuristic method for the set covering problem

    Operations Research

    (1999)
  • A. Caprara et al.

    Algorithms for the set covering problem

    Annals of Operations Research

    (2000)
  • S. Ceria et al.

    A lagrangian-based heuristic for large-scale set covering problems

    Mathematical Programming

    (1998)
  • S. Chen et al.

    An improved column generation algorithm for crew scheduling problems

    Journal of Information and Computational Science

    (2013)
  • S.-C. Chu et al.

    Cat swarm optimization

    PRICAI 2006: Trends in artificial intelligence

    (2006)
  • V. Chvatal

    A greedy heuristic for the set-covering problem

    Mathematics of Operations Research

    (1979)
  • F. Colombo et al.

    A variable neighborhood search algorithm for the multimode set covering problem

    Journal of Global Optimization

    (2015)
  • B. Crawford et al.

    Integrating lookahead and post processing procedures with aco for solving set partitioning and covering problems

    Artificial intelligence and soft computing–ICAISC 2006

    (2006)
  • Cited by (56)

    • Orientational variable-length strip covering problem: A branch-and-price-based algorithm

      2021, European Journal of Operational Research
      Citation Excerpt :

      Weerasena and Wiecek (2019) propose a polynomial time algorithm to search for the solutions in the weak Pareto set of the problem. Lanzagutierrez et al. (2017) investigate the effect of using a metaheuristic to solve the SCP, focusing on the binarization strategies of swarm optimization algorithms. The existing methods for the traditional SCP are not applicable to the OVSC problem because the complexity of the latter.

    View all citing articles on Scopus
    View full text