Ant colony and particle swarm optimization for financial classification problems

doi:10.1016/j.eswa.2009.02.055

Expert Systems with Applications

Volume 36, Issue 7, September 2009, Pages 10604-10611

https://doi.org/10.1016/j.eswa.2009.02.055 Get rights and content

Abstract

Financial decisions are often based on classification models which are used to assign a set of observations into predefined groups. Such models ought to be as accurate as possible. One important step towards the development of accurate financial classification models involves the selection of the appropriate independent variables (features) which are relevant for the problem at hand. This is known as the feature selection problem in the machine learning/data mining field. In financial decisions, feature selection is often based on the subjective judgment of the experts. Nevertheless, automated feature selection algorithms could be of great help to the decision-makers providing the means to explore efficiently the solution space. This study uses two nature-inspired methods, namely ant colony optimization and particle swarm optimization, for this problem. The modelling context is developed and the performance of the methods is tested in two financial classification tasks, involving credit risk assessment and audit qualifications.

Introduction

Modern finance is a broad field often involved with hard decision-making problems related to risk management. In several cases, financial decision-making problems require the assignment of the available options into predefined groups/classes. Credit risk analysis, bankruptcy prediction, and country risk assessment, among other are some typical examples (Doumpos, Zopounidis, & Pardalos, 2000). In this context the development of reliable classification models is clearly of major importance to researchers and practitioners.

The development of financial classification models is a complicated process, involving careful data collection and pre-processing, model development, validation and implementation. Focusing on model development, several methods have been used, including statistical methods, artificial intelligence techniques and operations research methodologies. In all cases, the quality of the data is a fundamental point. This is mainly related to the adequacy of the sample data in terms of the number of observation and the relevance of the decision attributes (i.e., independent variables) used in the analysis.

The latter is related to the feature selection problem. Feature selection refers to the identification of the appropriate attributes (features) that should be introduced in the analysis in order to maximize the expected performance of the resulting model. This has significant implications for issues such as (Kira & Rendell, 1992): (1) noise reduction through the elimination of noisy features, (2) reduction of the time and cost required implement an appropriate model, (3) simplification of the resulting models, and (4) facilitation of the easy use and updating of the models.

The basic feature selection problem is an optimization problem, with a performance measure for each subset of features, which represents expected classification performance of the resulting model. The problem is to search through the space of feature subsets in order to identify the optimal or near-optimal one with respect to the performance measure. Unfortunately, finding the optimum feature subset has been proved to be NP-hard (Kira & Rendell, 1992). Many algorithms are, thus, proposed to find the suboptimal solutions in comparably smaller amount of time (Jain & Zongker, 1997). Branch and bound approaches (Narendra & Fukunaga, 1977), sequential forward/backward search (Aha and Bankert, 1996, Cantu-Paz et al., 2004) and filters approaches (Cantu-Paz, 2004) deterministically search for the suboptimal solutions. One of the most important of the filter approaches is the Kira and Rendell’s Relief algorithm (Kira & Rendell, 1992). Stochastic algorithms, including simulated annealing (Siedlecki & Sklansky, 1988), scatter search (Lopez, Torres, Batista, Perez, & Moreno-Vega, 2006), ant colony optimization (Al-Ani, 2005a, Al-Ani, 2005b, Parpinelli et al., 2002, Shelokar et al., 2004) and genetic algorithms (Cantu-Paz et al., 2004) are of great interest recently because they often yield high accuracy and are much faster.

In this paper, two algorithms for the solution of the feature selection problem based on ant colony and particle swarm optimization are presented. These algorithms are combined with three nearest neighbour based classifiers, the 1-nearest neighbour, the k-nearest neighbour and the weighted k-nearest neighbour classifier. The algorithms are applied to two data sets involving financial decision-making problems. The first involves credit risk assessment and the second is related to qualified audit reports. A comparison of the proposed algorithms with two other metaheuristics, namely Tabu search metaheuristic (Glover, 1989, Glover, 1990) and a genetic algorithm (Goldberg, 1989, Reeves, 1995, Reeves, 2003) illustrates the performance of the proposed algorithms.

The rest of the paper is organized as follows: the next section provides a detailed analysis of the proposed algorithms. Section 3 describes the applications context using the aforementioned data financial data sets and the experimental settings, whereas Section 4 presents the obtained computational results. The last section concludes the paper and discusses some future research directions.

Section snippets

Nearest neighbour classifiers

Initially, the classic 1-nearest neighbour (1-nn) (Duda & Hart, 1973) method is used. The nearest neighbour classifier was selected as it is a method very easy to implement it and it does not need any optimization procedure as for example it is necessary in support vector machines and in neural networks. Assume a training sample of M_train vectors y_j = (y_j1, …, y_jd), j = 1, …, M_train, where d is the number of selected features and y_jl is the description of observation j on feature l. In the 1–nn

Data

The two metaheuristic algorithms are applied to two financial classification problems. The first is related to credit risk assessment. The data, taken from Doumpos and Pasiouras (2005) involve 1330 firm-year observations for UK non-financial firms, over the period 1999–2001. The sample observations are classified into five risk groups according to their level of likelihood of default, measured on the basis of their QuiScore, a credit rating assigned by Qui Credit Assessment Ltd. In particular,

Results

Table 2 presents the classification results for the optimal solution of each of the proposed algorithms, the ACO based metaheuristic and PSO based metaheuristic, for both financial classification problems. The results of the algorithms used for the comparisons are also shown. When the Tabu metaheuristic was used a number of tests were performed in order to choose the best k and, finally, a value of k equal to 5 was chosen. The statistical significance of the differences between the methods is

Conclusions and future work

An important issue in building a good classifier is the selection of a set of appropriate input feature variables. The ant colony optimization and the particle swarm optimization algorithms have been proposed in this study for solving this feature subset selection problem. Three different classifiers were used for the classification problem, based on the nearest neighbour classification rule. The performance of the proposed algorithm was tested using financial data involving credit risk

References (28)

P.S. Shelokar et al.
An ant colony classifier system: Application to some process engineering problems
Computers and Chemical Engineering
(2004)
W.S. Treacy et al.
Credit risk rating systems at large US banks
Journal of Banking and Finance
(2000)
D.W. Aha et al.
A comparative evaluation of sequential feature selection algorithms
A. Al Ani
Feature subset selection using ant colony optimization
International Journal of Computational Intelligence
(2005)
A. Al-Ani
Ant colony optimization for feature subset selection
Transactions on Engineering, Computing and Technology
(2005)
Cantu-Paz, E. (2004). Feature subset selection, class separability, and genetic algorithms. In Genetic and evolutionary...
Cantu-Paz, E., Newsam, S., & Kamath, C. (2004). Feature selection in scientific application. In Proceedings of the 2004...
M. Dorigo et al.
Ant system: Optimization by a colony of cooperating agents
IEEE Transactions on Systems, Man, and Cybernetics – Part B
(1996)
M. Dorigo et al.
Ant colony optimization
(2004)
M. Doumpos et al.
Explaining qualifications in audit reports using a support vector machine methodology
Intelligent Systems in Accounting, Finance and Management
(2005)

M. Doumpos et al.

Developing and testing models for replicating credit ratings: A multicriteria approach

Computational Economics

(2005)

M. Doumpos et al.

Multicriteria sorting methodology: Application to financial decision problems

Parallel Algorithms and Applications

(2000)

R.O. Duda et al.

Pattern classification and scene analysis

(1973)

F. Glover

Tabu search I

ORSA Journal on Computing

(1989)

Cited by (81)

Comprehensive learning Harris hawks-equilibrium optimization with terminal replacement mechanism for constrained optimization problems
2022, Expert Systems with Applications
Citation Excerpt :
Metaheuristic is one of the most popular optimization techniques inspired from nature, with the characteristics of simplicity, flexibility, derivation-free, black-box computing, and parallel computing, which can provide good performance in different kinds of optimization problems (Li, Liu, Zhao & Zeng, 2021; Houssein, Mahdy, Blondin et al., 2021). Therefore, metaheuristic algorithms are successful in solving real-world optimization problems (Osaba et al., 2021), such as civil engineering (Li, Jiang & Yang, 2012; Li & Hu, 2014), finance (Marinakis et al., 2009), medicine (Elaziz et al., 2020), industry (Houssein et al., 2021), reliability-based design optimization (Meng, Li, Wang et al., 2021), and so on. Metaheuristic algorithms can be classified as four categories based on the inspiration from nature (Faramarzi et al., 2020): swarm-based algorithms, evolutionary algorithm, physics or chemistry-based algorithms, and social or human-based algorithms.
Harris hawks optimization (HHO) is a novel metaheuristic algorithm which has strong convergence for unconstrained optimization problems. However, HHO may encounter premature or local stagnation for constrained optimization problems. In this paper, a hybrid HHO algorithm named comprehensive learning harris hawks-equilibrium optimization (CLHHEO) is presented for solving constrained optimization problems, with the help of three operators: comprehensive learning, equilibrium optimizer, and terminal replacement mechanism. In the proposed algorithm, comprehensive learning strategy is incorporated with HHO to make search agents share their knowledge to enhance the convergence capacity. The operator of equilibrium optimizer is utilized to improve the exploration capacity of HHO. Besides, the terminal replacement mechanism is incorporated in the proposed algorithm to avoid local stagnation. The proposed CLHHEO is tested on 15 unconstrained and 10 real-world constrained optimization problems, and compared with 10 state-of-the-art metaheuristic algorithms, including PSO, CLPSO, BBBC, GWO, DA, WOA, SSA, HHO, SOA and AOA. From the experimental results, it is observed that CLHHEO outperforms HHO and other comparing metaheuristic algorithms in terms of solution quality. The results also demonstrate that the ensemble strategies of CLHHEO can enhance the performance of HHO for constrained optimization problems.
Performability evaluation, validation and optimization for the steam generation system of a coal-fired thermal power plant
2022, MethodsX
Citation Excerpt :
Kumar et al. [14] analyzed the availability of a system in a thermal power plant with the help of the Markov approach and suggested the maintenance schedule for various subsystems of the system concerned. Marinakis et al. [15] proposed the Ant Colony (ACO and the PSO algorithms to solve the financial classification model. They tested the proposed methods through two different financial classification problems.
The present paper talks over performability evaluation for a steam generation system of a Coal Fired Thermal Power Plant (CFTPP) using the concept of the Markov method. A steam generation system provides a suitable amount of steam for the sound functioning of the plant. The system comprises five subsystems, i.e., High-Pressure Heater, Economizer, Boiler Drum, Water Tubes, and Super Heater. First, the transition diagram of the concerned system is designed based on the state probabilities of various subsystems. The differential equations are derived based on the mnemonic rule. After that, the performability model is developed by using the normalizing condition. The performability levels for various subsystems are obtained by placing the appropriate value of failure and repair rates in the developed model. The performability of each subsystem is evaluated based on performability matrices. It is observed that the economizer subsystem is most critical in which the availability increased from 0.7640 to 0.8827, i.e. (11.87 %). In contrast, boiler drum is the least crucial subsystem with availability enhanced from 0.8627 to 0.8657 (i.e., 0.3 %). The results show that the economizer subsystem must be given top priority, and the boiler drum be given the least priority from the maintenance outlook. The performability levels obtained through the Markov method are compared with those obtained through the Artificial Neural Network to validate. Moreover, machine learning (artificial neural network) and optimization technique (particle swarm optimization) is also employed to check the adequacy of the results and optimized process parameters.
- •
  The aim of the present study is evaluate the performance of steam generation system of a coal fired thermal power plant.
- •
  The probabilistic approach (i.e. Makov Method) is used to formulate the transition diagram of the steam generation system. Then, the first-order differential equations are obtained using the mnemonic rule and further solved recursively.
- •
  The results show that the economizer system must be given top priority, and the boiler drum subsystem must be given the least priority from the maintenance outlook.
Improving K-means clustering with enhanced Firefly Algorithms
2019, Applied Soft Computing Journal
In this research, we propose two variants of the Firefly Algorithm (FA), namely inward intensified exploration FA (IIEFA) and compound intensified exploration FA (CIEFA), for undertaking the obstinate problems of initialization sensitivity and local optima traps of the K-means clustering model. To enhance the capability of both exploitation and exploration, matrix-based search parameters and dispersing mechanisms are incorporated into the two proposed FA models. We first replace the attractiveness coefficient with a randomized control matrix in the IIEFA model to release the FA from the constraints of biological law, as the exploitation capability in the neighbourhood is elevated from a one-dimensional to multi-dimensional search mechanism with enhanced diversity in search scopes, scales, and directions. Besides that, we employ a dispersing mechanism in the second CIEFA model to dispatch fireflies with high similarities to new positions out of the close neighbourhood to perform global exploration. This dispersing mechanism ensures sufficient variance between fireflies in comparison to increase search efficiency. The ALL-IDB2 database, a skin lesion data set, and a total of 15 UCI data sets are employed to evaluate efficiency of the proposed FA models on clustering tasks. The minimum Redundancy Maximum Relevance (mRMR)-based feature selection method is also adopted to reduce feature dimensionality. The empirical results indicate that the proposed FA models demonstrate statistically significant superiority in both distance and performance measures for clustering tasks in comparison with conventional K-means clustering, five classical search methods, and five advanced FA variants.
The continuous-discrete PSO algorithm for shape formation problem of multiple agents in two and three dimensional space
2018, Applied Soft Computing Journal
Citation Excerpt :
Thirdly, the code of PSO algorithm is relatively simple and the efficiency of searching for the suboptimal or global optimum is relatively high according to one previously reported result [20]. Because of the aforementioned advantages, PSO algorithm has been successfully and widely applied to a broad range of optimization problems, such as electric power system [21–24], electromagnetic [25], locating and tracking [26–29], intelligent control [30], neural network [31,32], fault detection [33,34], feature selection [35–37], path planning [38] and others [39,40], etc. In this paper, the shape formation problem, which can be applied to the helicopter and ship formation and the large-scale performance in the future can be roughly classified by two main cases.
Shape formation problem of agents in the two or three dimensional space is one of the most important and challenging topics in the fields of evolutionary computation and multi-agents system, etc. Firstly, the basic concepts and objective functions of shape formation problem are introduced to deeply understand the considered shape formation problem. Three theorems of shape formation problem with three agents are addressed by the Lagrangian multiplier method, however, the Lagrangian multiplier method difficultly solves optimal shape formation problem where the number of agents is strictly larger than 3 and the number of constraints is larger than 2. In order to tackle the continuous and discrete optimization problem, the continuous-discrete particle swarm optimization (CDPSO) algorithm is developed to search for the rotated angle of the desired shape and the matching pair between points in the initial shape and points in the desired shape. Additionally, the parameters in CDPSO algorithm are set by three theorems on convergence analysis of the random PSO algorithm. To demonstrate the effectiveness and the feasibility of the CDPSO algorithm on the shape formation problem, numerical results not only discuss the optimal virtual helicopters formation between two typical shapes in the three dimensional space, but also provide one searching and rescuing strategy of MH370 plane to minimize the whole moving distance of all virtual rescuing ships. Moreover, the shape conversion problem including multiple agents is also solved by the CDPSO algorithm when the number of agents is equal to 100, 200, 500 and 1000. Additionally, the optimization results and the computational time are compared among the Lagrange multiplier method, CDPSO, CDDE, CDGA, CDPSOI and CDPSOE algorithms.
An efficient hybrid clustering method based on improved cuckoo optimization and modified particle swarm optimization algorithms
2018, Applied Soft Computing Journal
Partitional data clustering with K-means algorithm is the dividing of objects into smaller and disjoint groups that has the most similarity with objects in a group and most dissimilarity from the objects of other groups. Several techniques have been proposed to avoid the major limitations of K-Means such as sensitive to initialization and easily convergence to local optima. An alternative to solve the drawback of the sensitive to centroids’ initialization in K-Means is the K-Harmonic Means (KHM) clustering algorithm. However, KHM is sensitive to the noise and easily runs into local optima. Over the past decade, many algorithms are developed for solving this problems based on evolutionary method. However, each algorithm has its own advantages, limitations and shortcomings. In this paper, we combined K-Harmonic Means (KHM) clustering algorithm with an improved Cuckoo Search (ICS) and particle swarm optimization (PSO). ICS is intended to global optimum solution using Lévy flight method through changing radius in a dynamic and shrewd manner. Therefore, it is faster than standard cuckoo search. ICS is effected with PSO to avoid falling into local optima. The proposed algorithm, called ICMPKHM, solves the local optima problem of KHM with significant improvement on efficacy and stability. Experiments with benchmark datasets show that the proposed algorithm is quite insensitive to the centroids’ initialization. Comparative studies with other algorithms reveal that the proposed algorithm produce high quality and stable clustering results.
Statistically aided Binary Multi-Objective Grey Wolf Optimizer: a new feature selection approach for classification
2023, Journal of Supercomputing

View all citing articles on Scopus

View full text

Ant colony and particle swarm optimization for financial classification problems

Abstract

Introduction

Section snippets

Nearest neighbour classifiers

Data

Results

Conclusions and future work

Computers and Chemical Engineering

Journal of Banking and Finance

A comparative evaluation of sequential feature selection algorithms

Feature subset selection using ant colony optimization

International Journal of Computational Intelligence

Ant colony optimization for feature subset selection

Transactions on Engineering, Computing and Technology

Ant system: Optimization by a colony of cooperating agents

IEEE Transactions on Systems, Man, and Cybernetics – Part B

Ant colony optimization

Explaining qualifications in audit reports using a support vector machine methodology

Intelligent Systems in Accounting, Finance and Management