Genetic algorithms approach to feature discretization in artificial neural networks for the prediction of stock price index

doi:10.1016/S0957-4174(00)00027-0

Expert Systems with Applications

Volume 19, Issue 2, August 2000, Pages 125-132

Abstract

This paper proposes a genetic algorithm (GA) approach to feature discretization and the determination of connection weights for artificial neural networks (ANNs) to predict the stock price index. Previous research has proposed many hybrid models of ANN and GA for network training, feature subset selection, and topology optimization. In most of these studies, however, GA is used only to improve the learning algorithm itself. In this study, GA is employed not only to improve the learning algorithm but also to reduce the complexity of the feature space. GA simultaneously optimizes the connection weights between layers and the thresholds for feature discretization. The genetically evolved weights mitigate the well-known limitations of the gradient descent algorithm. In addition, globally searched feature discretization reduces the dimensionality of the feature space and eliminates irrelevant factors. Experimental results show that the GA approach to feature discretization outperforms the other two conventional models.

Introduction

Predicting the stock price index has long attracted research interest. Many of these studies use data mining techniques, including artificial neural networks (ANNs). However, most studies showed that ANN had some limitations in learning the patterns because stock market data is extremely noisy and has complex dimensionality. ANN has preeminent learning ability, but its performance on noisy data is often inconsistent and unpredictable. In addition, the amount of data is sometimes so large that pattern learning may not work well. In particular, continuous data in large volumes makes it difficult to extract explicit concepts from the raw data, because continuous features determine a huge data space (Liu & Setiono, 1996). Many researchers in the data mining community are interested in dimensionality reduction. Reducing and transforming irrelevant or redundant features may shorten the running time and yield more generalized results (Dash & Liu, 1997).

This paper proposes a new hybrid model of ANN and genetic algorithms (GAs) for feature discretization to mitigate the above limitations. Feature discretization transforms continuous values into discrete ones according to certain thresholds, and it is closely related to dimensionality reduction (Liu & Motoda, 1998a). Properly discretized data can simplify the learning process and may improve the generalizability of the learned results. This study uses GA to search for optimal or near-optimal thresholds for feature discretization. In addition, it simultaneously searches for the connection weights between layers in the ANN. The genetically evolved connection weights mitigate the well-known limitations of the gradient descent algorithm.
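
As a rough illustration of the idea described above, the following Python sketch maps a continuous indicator value to a discrete category using a list of thresholds, and splits a flat GA chromosome into thresholds and connection weights. The function names, the chromosome layout, and the example threshold values are assumptions made for illustration only; they are not taken from the paper's implementation.

```python
from bisect import bisect_right


def discretize(value, thresholds):
    """Return the index of the interval that `value` falls into.

    With k thresholds there are k + 1 categories, numbered 0..k.
    A value equal to a threshold is assigned to the upper interval.
    """
    return bisect_right(sorted(thresholds), value)


def decode(chromosome, n_features, n_thresholds):
    """Split a flat chromosome into per-feature thresholds and weights.

    Hypothetical layout: the first n_features * n_thresholds genes are
    thresholds, and all remaining genes are connection weights.
    """
    cut = n_features * n_thresholds
    thresholds = [
        chromosome[i * n_thresholds:(i + 1) * n_thresholds]
        for i in range(n_features)
    ]
    return thresholds, chromosome[cut:]


# Two features with two thresholds each, followed by three weights.
chromosome = [25.0, 75.0, 30.0, 70.0, 0.1, -0.2, 0.3]
thresholds, weights = decode(chromosome, n_features=2, n_thresholds=2)

# Discretize one raw observation feature by feature.
raw = [80.0, 50.0]
codes = [discretize(v, t) for v, t in zip(raw, thresholds)]
print(codes)  # [2, 1]
```

Because the thresholds and the weights sit in one chromosome under this layout, a single GA run could evaluate both together, for example by scoring each candidate on its prediction accuracy over the training data.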

The rest of the paper is organized as follows. Section 2 reviews prior research. Section 3 proposes feature discretization using GA and describes the benefits of the proposed approach. Section 4 describes the research design and experiments. In Section 5, the empirical results are summarized and discussed. In Section 6, conclusions and the limitations of this study are presented.

Section snippets

Prior research on stock market prediction using ANN

Many studies on stock market prediction using artificial intelligence (AI) techniques were performed during the past decade. These studies used various types of ANN to accurately predict the stock index and the direction of its change.

In one of the earliest studies, Kimoto, Asakawa, Yoda and Takeoka (1990) used several learning algorithms and prediction methods to develop a prediction system for the Tokyo stock exchange prices index (TOPIX). They used the modular neural network to learn the

GA approach to feature discretization for ANN

Many fund managers and investors in the stock market generally accept and use certain criteria for technical indicators as the signal of future market trends. Even if a feature represents a continuous measure, the experts usually interpret the values in qualitative terms such as low, medium, and high (Slowinski & Zopounidis, 1995). For ‘Stochastic %K’, one of the most popular technical indicators, the value of 75 is basically accepted by stock market analysts as a strong signal if the value

Research data and experiments

The research data used in this study consists of technical indicators and the direction of change in the daily Korea stock price index (KOSPI). The total number of samples is 2928 trading days, from January 1989 to December 1998. Table 2 gives selected features and their formulas (Achelis, 1995; Chang et al., 1996; Choi, 1995; Edwards & Magee, 1997; Gifford, 1995).

The direction of daily change in the stock price index is categorized as “0” or “1”. “0” means that the next day's index is lower than the

Experimental results

Three models are compared according to the methods of determining the connection weights and the methods of feature transformation. Table 5 describes the average prediction accuracy of each model.

In Table 5, GAFD has higher prediction accuracy than BPLT and GALT by 10–11% for the holdout data. It is worth noting that there is only a slight difference in prediction accuracy between the training data and the holdout data for GAFD. There is, however, a wide difference between

Concluding remarks

As mentioned earlier, previous studies tried to optimize the controlling parameters of ANN using global search algorithms. Some of them focused only on the optimization of the connection weights of ANN. Others had an interest in the optimization of the learning algorithm itself, but most studies had little interest in dimensionality reduction and the elimination of irrelevant patterns. This paper has proposed a new hybrid GA and ANN to mitigate the above limitations. In this paper, GA not

Acknowledgements

The authors would like to thank the Korea Science and Engineering Foundation for supporting this work under Grant No. 98-0102-08-01-3.

References (46)

J. Basak et al. Unsupervised feature selection using a neuro-fuzzy approach. Pattern Recognition Letters (1998)
M. Dash et al. Feature selection methods for classifications. Intelligent Data Analysis—An International Journal (1997)
J.N.D. Gupta et al. Comparing backpropagation with a genetic algorithm for neural network training. Omega (1999)
J.P. Ignizio et al. Simultaneous design and training of ontogenic neural network classifiers. Computers and Operations Research (1996)
H. Liu et al. Dimensionality reduction via discretization. Knowledge-Based Systems (1996)
J. Martens et al. An initial comparison of a fuzzy neural classifier and a decision tree based classifier. Expert Systems with Applications (1998)
R.S. Sexton et al. Global optimization for artificial neural networks: a tabu search application. European Journal of Operational Research (1998)
R.S. Sexton et al. Toward global optimization of neural networks: a comparison of the genetic algorithm and backpropagation. Decision Support Systems (1998)
R.S. Sexton et al. Optimization of neural networks: a comparative analysis of the genetic algorithm and simulated annealing. European Journal of Operational Research (1999)
R. Susmaga. Analyzing discretizations of continuous attributes given a monotonic discrimination function. Intelligent Data Analysis—An International Journal (1997)
R. Tsaih et al. Forecasting S&P 500 stock index futures with a hybrid AI system. Decision Support Systems (1998)
S.B. Achelis. Technical analysis from A to Z (1995)
H. Adeli et al. Machine learning: neural networks, genetic algorithms, and fuzzy systems (1995)
Ahmadi, H. (1990). Testability of the arbitrage pricing theory by neural networks. Proceedings of the International...
R.J. Bauer. Genetic algorithms and investment strategies (1994)
J.P. Bigus. Data mining with neural networks (1996)
P. Buhlmann. Extreme events from the return-volume process: a discretization approach for complexity reduction. Applied Financial Economics (1998)
J. Chang et al. Technical indicators and analysis methods (1996)
J. Choi. Technical indicators (1995)
Choi, J.H., Lee, M.K., & Rhee, M.W. (1995). Trading S&P 500 stock index futures using a neural network. Proceedings of...
D.R. Cooper et al. Business research methods (1995)
L. Davis. Genetic algorithms and financial applications
R. Dorsey et al. The use of parsimonious neural networks for forecasting financial time series. Journal of Computational Intelligence in Finance (1998)

¹: Tel.: +82-2-958-3613; fax: +82-2-958-3604.
