Next Article in Journal
Optimal Eco-Driving Cycles for Conventional Vehicles Using a Genetic Algorithm
Previous Article in Journal
Evolution of Interior and Exterior Bearing Structures of the Deep-Soft-Rock Roadway: From Theory to Field Test in the Pingdingshan Mining Area
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Using Artificial Neural Networks to Find Buy Signals for WTI Crude Oil Call Options

Faculty of Management, AGH University of Science and Technology, 30-059 Cracow, Poland
*
Author to whom correspondence should be addressed.
Energies 2020, 13(17), 4359; https://doi.org/10.3390/en13174359
Submission received: 20 July 2020 / Revised: 19 August 2020 / Accepted: 20 August 2020 / Published: 24 August 2020
(This article belongs to the Section C: Energy Economics and Policy)

Abstract

:
Oil price changes significantly influence proper functioning of the entire world economy, which entails the risk of losses. One of the possible ways to reduce this risk is to use some dedicated risk management tools, such as options contracts. In this paper we investigate the possibility of using multilayer perceptron neural networks to provide signals of long positions to take in the European call options. The experiments conducted on the West Texas Intermediate (WTI) oil prices (2630 observations coming from 16 June 2009 until 14 February 2020) allowed the selection of the network parameters, such as the activation function or the network error measure, giving the highest return on options contracts. Despite the fact that about 2/3 call options produced losses, the buying signals provided by the network for the test set allowed it to reach a positive return value. This indicates that neural networks can be a useful tool supporting the process of managing the risk of changes in oil prices using option contracts.

1. Introduction

Changes in oil prices have been a key component of the economic life for many years, and they are now the subject of numerous scientific studies. As history shows, oil price shocks have often been accompanied by financial crises which resulted in the collapse of companies in various industries or affected their liquidity [1]. Moreover, despite continuous technological progress, oil still remains the most crucial energy source in the world, which makes it even more important for the global economy. According to the BP 2019 report [2], the oil’s share in global primary energy increased to 33.2% in 2018 and despite a noticeable increase in the importance of renewables, it will remain the main source of energy for the next few decades.
Previous studies have shown that there are strong correlations between oil prices and a variety of macroeconomic variables [3,4,5,6]. A popular direction of research is also the search for correlations between the volatility of the crude oil market and the stock markets in various economic regions. Aroui et al. [7] used the vector autoregressive-generalized autoregressive conditional heteroscedasticity (VAR-GARCH) model to investigate the volatility spillover effect between crude oil and equity markets for European companies in 1998–2009. The authors have shown that there is evidence of significant cross-market volatility transmission. What is more, spillover effects are more apparent from oil to the stock market, and come entirely from transmission of shocks. This research was also continued by Cuando and de Garcia [8], who focused their analyses on data from 1973–2011 and the impact of the oil shocks on the stock returns in 12 oil importing European economies, using VAR and vector error correction model (VECM) models. Bagirov and Mateus [9], in turn, analyzed the impact of oil price changes on the financial results of 137 listed and 531 unlisted oil and gas companies from Western Europe. The study gathered data from 2005–2015 and showed that during the financial crisis (2008–2009) oil price changes had a negative impact exclusively on listed companies, whereas in 2014 (the geopolitical crisis) their negative impact was noticed for both listed and unlisted companies. In the remaining periods, the positive impact of oil price changes dominated the financial results achieved by the companies analyzed in the research.
Apart from Europe, the literature also contains studies on the impact of oil prices on stock markets in other economic regions. Kilian [10] divides the sources of oil price shocks into supply-related, i.e., due to the changes in crude oil production, and demand-related, i.e., related to the global business cycle or uncertainty about future oil supplies. The author investigates the impact of the shocks on US macroeconomic aggregates. Kilian and Park [11], using monthly data for the US economy over the period 1973 January, to 2006 December, i.a. show a negative response to stock returns to an oil-market specific demand shock. The discourse on the sources of the sudden fluctuations in oil prices is supplemented by Kang et al. [12] who split oil supply shocks into those occurring in and outside the US and analyze their impact on the US stock market. Lambertides et al. [13], in turn, have shown that price shocks in the oil market caused by a change in oil supply have a negative impact on the quotations of US listed companies, while demand shocks usually trigger the opposite reaction. Broadstock and Filis [14] also take into account the Chinese stock market in addition to US stock indices and analyze its correlation with oil shocks from various sources. The authors have shown that in 1995–2013 the Chinese stock market was more resilient to oil price fluctuations than the American one. Research on the reaction of Chinese companies’ stock markets to oil shocks was also conducted by Li et al. [15], Li and Wei [16] and Wei et al. [17].
The researched relations between oil prices and macroeconomic indicators, as well as stock markets of the individual economic regions show how important oil remains for the world economy and the manner in which it functions. The analyses also allow us to identify the factors that determine oil prices. The correct identification of these factors, in turn, enables the construction and subsequent testing of oil price prediction models. Due to its great importance for the proper functioning of the global energy system, as well as the entire world economy, oil price prediction has now become one of the key directions of scientific research [18]. The effectiveness of the models used to predict oil prices is particularly important, among others, for companies that actively protect themselves against the negative consequences of oil price fluctuations. It is therefore not surprising that there have been numerous attempts to develop new oil price prediction models and to verify the existing ones. This task is undertaken by the world’s largest oil companies (e.g., BP) as well as countries that are leading oil producers (OPEC). Oil price forecasts are also developed by investment banks (Reffeisen Bank), international organizations, such as the International Energy Administration (IEA) or the Energy Information Administration (EIA), and financial organizations like the World Bank, IHS Global Insight and International Monetary Fund [19]. One of the fundamental problems with the methods used by these institutions and organizations is that they are usually based on the same market parameters. Moreover, individual institutions forecast future oil prices by referring them to the existing predictions made by other institutions, which is of limited cognitive value. Following that, mistakes made in one method are automatically multiplied in related ones.
The lack of effectiveness in price prediction prompted oil market observers to try to use artificial intelligence (AI) techniques in order to solve this problem. Hence, one of the more-often used methods creating oil price prediction models is artificial neural networks (ANNs) [19]. This technology uses large amounts of processed information, which allows it to detect data dependency without knowing its specificity, related for example to the economic environment [20]. The final effect of using networks is the detection and modelling of input and output data dependency.
Studies show that models using artificial neural networks (ANNs) to forecast oil prices provide an alternative to the classical econometric models, such as autoregressive integrated moving average (ARIMA) or (generalized) autoregressive conditional heteroscedasticity (ARCH, GARCH) models [21,22,23,24,25,26,27,28,29]. These econometric models point out such features as changes in variance over time or the seasonal nature of variability and improve prediction accuracy mainly based on the assumption of stationary and linearity of the analyzed time series [30]. Nonetheless, the complexity of the relations between crude oil and other markets, as well as the macroeconomic variables causes oil prices to be characterized by non-linearity and non-stationarity [31], hence there are attempts to use artificial intelligence techniques, including the above-mentioned artificial neural networks. For example, Azadeh et al. [32] used a flexible algorithm based on ANNs and fuzzy regression (FR) to forecast oil prices in the long term. This algorithm was used for annual oil prices from 1985–2007 and five indicators were used as inputs, i.e., oil supply, crude oil distillation capacity, oil consumption in non-OECD countries, refinery capacity in US and surplus capacity. Chiroma et al. [33] also tested the possibility of using neural networks to predict monthly oil prices. They propose a genetic algorithm (GA) to optimize the neural network weights, bias and topology to build a model for the prediction of the WTI crude oil price to improve WTI prediction accuracy, and to simplify the neural network model structure. Wang and Wang [34], in turn, combined multilayer perceptron (MLP) with an Elman recurrent neural network (ERNN) and stochastic time effective function to develop a prediction model for four oil price indices, one of which concerned WTI oil. WTI oil price forecasting was also discussed by Mostafa and El-Masra [35] who used gene expression programming (GEP) and ANNs, and made reference to the 1986–2012 quotations. Huang and Wang [36] demonstrated the advantage of the wavelet neural network (WNN) models with a random effective function over the back propagation neural network (BPNN) and support vector machine (SVM) models for forecasting WTI and Brent oil prices. These and other hybrid models [37,38,39,40,41,42] have shown that ANNs in combination with the relevant computational algorithms provide an important alternative to the econometric models in predicting the directions of oil price changes. On the other hand, Dbouk and Jamali [43] consider a bagged neural network (BANN), a neural network trained using the genetic algorithm (GAANN) and a neural network with fuzzy logic (FANN) in predicting daily crude oil prices. They have shown that the FANN’s predictive accuracy was broadly in line with linear models used in the research. However, the BANN and GAANN have performed significantly worse than the linear models.
Artificial neural networks have also been widely applied in modelling the values of option premiums. It is the price that traders pay for a put or call option contracts, i.e., the right to trade its underlying market at a specified price for a set period. The literature mainly deals with the valuation of stock index options, such as S&P500 [44,45], TAIFEX [46,47,48], Nikkei 225 [49] or WIG20 [50]. Similarly to the models used for oil price forecasting, the ANN was used here in combination with other models, such as the Black-Scholes model [51], the GARCH model [52], the exponential Lévy model [53] and many others [54,55]. Research shows that the integration of ANNs into existing option pricing models provides an effective alternative to the classical parametric models, even in the absence of certain economic assumptions related to the characteristics of the underlying market.
Bearing in mind the consequences of rapid changes in oil prices both in the economy and in oil-price-dependent companies, we focus on examining the possibility of using ANNs in the process of hedging against the risk of oil price changes. This kind of risk is particularly important for production companies (refineries) which do not have enough own oil reserves. Therefore, oil price changes determine their production costs in the world market.
Since one of the most popular approaches used by large manufacturing companies in managing the risk of oil price changes is the use of option contracts, we decided to attempt to increase the effectiveness of their use. For this purpose, we employ artificial neural networks such as multilayer perceptron (MLP) with different activation functions and with a different number of neurons in the hidden layer. These tools classify, based on selected market parameters, long positions in WTI crude oil call option into two categories: profitable or unprofitable. In the literature, we can find studies in which neural networks were used to forecast the direction of price changes. Dbouk and Jamali [43] examined in this context the economic significance of GAANN, FANN, and BANN, which first were used as forecast models, using a trading exercise (short-only, and long-only). However, they used for this purpose crude oil futures, which, unlike options contracts, do not guarantee the buyer that the losses will be limited, in the case of crude oil price changes not as expected. For long options, that are the subject of our research, the maximum loss is limited and equal to the value of the option premium. This fact makes it possible to use long options as tools for full protection against unfavorable changes in oil prices without generating additional risk.
Following that, the paper attempts to combine two aspects that are important from the perspective of oil market participants: a classification based on oil prices and hedging against the risk of an oil price increase. To the best of the authors’ knowledge, there are no studies in the literature that would deal with these two issues simultaneously.
The remainder of the article is organized as follows: Section 2 presents the proposed method, Section 3 provides the data and preliminary analysis, Section 4 outlines the most important and relevant results. Section 5 concludes and explores possibilities for future research.

2. Proposed Methods

This section provides the key information on European call options and the structure of artificial neural networks. Both of these tools are analyzed in the empirical part of the paper. We also present indicators which enable to assess neural networks and their use to read the European call option purchase signals.

2.1. Long Call Options

Among derivatives, high potential for their use in hedging against asset price changes is attributed to options. An option is defined as a contract between two parties, a buyer (holder) and a seller (writer), in which the buyer (long position) has a right, but not the obligation, to buy or sell a certain amount of the underlying asset (stock, bond, interest rate, commodity or other assets) at the denoted strike price within a specific time period ([56], pp. 7–8). Sellers (short position) willing to accept considerable amounts of risk can write options, collecting the premium and taking advantage when options expire worthless. The premium collected by a seller is seen as a liability until the option either is offset (by buying it back) or expires ([57], p. 2).
Depending on the right that holder can get, there are two basic types of options:
  • Call options—give the holder the right to buy the underlying asset at the stated strike price within a specific period of time; the seller of a call option is obliged to deliver a long position in the underlying futures contract from the strike price in the case that holder chooses to exercise the option;
  • Put options—give the holder the right to sell the underlying asset at the stated strike price within a specific period of time; the seller of a put option is obliged to deliver a short position in the underlying futures contract from the strike price should the holder choose to exercise the option.
Options are traded both on a stock exchange and in the over-the-counter (OTC) market. There are also two main types of options with respect to the right of exercise the option before the day of expiration:
  • European option—a version of an options contract that the holder may only exercise their rights on the day of expiration;
  • American option—a version of an options contract that allows holders to exercise the option rights at any time before and including the day of expiration.
Further on in the study, the analysis concerned European call options. These kinds of options can be used to hedge against the risk of an oil price increase. The method of determining payoff for buyer European call option (long call) is presented in the following formula:
l o n g   c a l l = ( m a x { F K ; 0 } c ( K ) ) L
where,
  • F—the future price of the underlying asset;
  • K—strike price of the option;
  • c(K)—option premium (for the strike price K);
  • L—number of call options.
It is therefore evident that an increase in the price of the underlying asset relative to the strike price determined (K) will contribute to the exercise of the option. However, the option itself will be exercised profitably when the difference between the price of the underlying asset on the day of expiration (the future price; F) and the strike price (K) is higher than the option premium c(K).

2.2. Artificial Neural Networks

Artificial neural networks (ANN) are computational representations of the human brain structure. Neural networks comprise interconnected neurons, usually minimally organized in three layers, i.e., an input layer, at least one hidden layer and an output layer. The input layer receives data that is processed by the network into one or more output signals. All neural connections have weights assigned to them where the weight represents the power of interaction between one neuron and another. Neuron output value multiplied by the weight of the connection between a given pair of neurons makes the value of the signal carried from one neuron to another. The aggregated value of the input signals for a given neuron is then processed by the activation function, which generates the neuron output value. The weights of the connections between neurons, which determine the value of the signals reaching a given neuron, are modified in the learning process of the network. Network learning (i.e., appropriate selection of weights of connections between neurons) is one of the key problems of effective operation of neural networks. Therefore, the algorithm used to train the network may determine the effectiveness of its operation.
Another important element concerning effective network performance is an appropriate choice of the structure of the network (the number of hidden layers and the number of neurons in each layer) and the occurrence or absence of feedback. In feedforward networks signals flow only in one direction, i.e., from the input layers, through the hidden ones to the output neurons. On the other hand, feedback neural networks have additional connections between neurons present in the later layers with neurons of the earlier layers (e.g., neurons from the output layer send feedback to neurons from the input layer or neurons of the hidden layer). For the purpose of this study it was decided to use a network with a one-way information flow (feedforward neural networks) thus feedback neural networks will not be discussed.
In this paper, multilayer perceptron has been used as a decision-support tool with regard to hedging against an increase in the price of WTI oil by buying a long call option. These are networks with unidirectional information flow (a class of feedforward ANN). The networks used in the study consist of exactly three layers, i.e., the input layer, one hidden layer and the output layer. The Broyden-Fletcher-Goldfarb-Shanno algorithm [58] was used as a training algorithm. The proposed method of determining the time of hedging using an ANN will be referred to as SANN (due to the use of the Statistica software for creating neural networks).
Using artificial neural networks results in obtaining information which enables one to decide whether or not to take a long position in the call option. Therefore, the proposed network should be considered as a tool to support the process of making decisions on hedging against an increase in the price of WTI oil. The task of the network is therefore to solve the classification problem by searching for call option buy signals in the analyzed time series. In the further part of the study, the signals sought are referred to in short as ‘buying signals’.

2.3. Network Performance Assessment Indicators

In order to be able to assess the performance of the individual networks, the following subsection proposes a number of indicators. Except for the ability to compare the results of various networks, indicators also aim at determining to what extent the outcomes of a given network differ from the maximum possible result. In order to calculate the value of the indicators, it is assumed that on a given day it is possible to purchase (or not) exactly one option contract.
C is a set of profits (losses) from a long call option. The result of the neural network operation on the set C is the value of the EP (expected profit) indicator, obtained from the following formula:
E P = c C o c · v c
where c is an element of the set C, vc denotes the value of the profit from the purchase of the hedging instrument at moment c, where the value oc is described by the formula:
o c = { 1 ,   i f   f o r   o b s e r v a t i o n   c ,   a   c a l l   o p t i o n   w a s   b o u g h t   0 ,   i f   f o r   o b s e r v a t i o n   c ,   a   c a l l   o p t i o n   w a s   n o t   b o u g h t
In order to verify the quality of the performance of the network, three indicators have been identified as the reference points for the results obtained by the network:
(a)
maximum profit (MP)—means the sum of profits from the long call options for all days for which the final result from taking this position was greater than zero (the option was exercised with profit exceeding the value of the option premium); the value of MP index is calculated based on the following formula:
M P = c C z c · v c
where zc denotes a binary variable described by the following formula:
z c = { 1 ,   i f   v c > 0 0 ,   i f   v c < 0
(b)
maximum loss (ML)—means the sum of all losses from the long call options in a given period; the following formula is used to calculate the value of the ML indicator:
M L = c C s c · v c
where sc denotes a binary variable described by the following formula:
s c = { 1 ,   i f   v c < 0 0 ,   i f   v c > 0
(c)
average return (AR)—means the total value of profit from the long call options on each successive day of quotation in a given period; the AR indicator is described by the following formula:
A R = c C v c = M P + M L
There is always the following relation between the above-mentioned indicators:
M P A R M L
The indicators will be used to assess the quality of the proposed method to support decision making in terms of buying a call option (EP). It was assumed that the EP parameter should be a positive value and greater than the AR, the following condition must be met:
E P > m a x ( A R , 0 )
The maximum value that can be reached by the EP indicator (which is also the upper limit of profit, assuming that exactly one option can be bought on one day) is equal to the value of the MP indicator. The situation where EP = MP would be possible if exactly all network indications, including buying or not buying a call option were correct.
In order to verify the quality of the result obtained from the network as well as the maximum value (MP), an additional indicator has been designated:
% M P = E P M P · 100 %
The %MP indicator should be understood as the percentage of the best possible result (MP) achieved by this method (EP). In the further study, the %MP indicator was used to compare the results achieved by each neural network.

3. Data and Preliminary Analysis

The empirical part of this study concerns prices of WTI crude oil as quoted on the NYMEX. The New York Mercantile Exchange is a commodity futures exchange owned and operated by CME Group of Chicago. The considerations include WTI crude oil futures prices for front-month delivery between 16th June 2009 and 14th February 2020 [59]. We focus on using long European call options to hedge against the risk of rising WTI oil prices. The study includes only ATM (at-the-money) options, i.e., those with strike price identical to the WTI future price (in USD per barrel) on a given day. For each option, the result that would have been achieved by the call option buyer between the date of opening the position and the date of expiration of the option was determined (which is directly related to the style of the European options under consideration). This result was calculated as the difference between the WTI future price on the option expiry date (the last day of the option period) and the option strike price, as well as lessen by the cost of the hedging instrument (the unit value of the option premium expressed in USD per one barrel of oil). The number of days remaining to option expiry in each of the given months ranged from 31 days to 1 day.
Additionally, the following values were calculated for the set of observations presented:
  • standard deviation for n recent WTI futures prices, where n { 1 ,   2 ,   3 ,   ,   9 ,   10 ,   12 ,   14 ,   ,   28 ,   30 ,   35 ,   40 ,   45 ,   50 ,   55 ,   60 } ;
  • arithmetic mean of n recent WTI futures prices (moving average), where n { 2 ,   3 ,   ,   9 ,   10 ,   12 ,   14 ,   ,   28 ,   30 ,   35 ,   40 ,   45 ,   50 ,   55 ,   60 } .
The main aim of the method used for the purpose of this paper was to determine the moment of buy call option. Following that, it was assumed that this moment can only occur at the session close on a particular day. This is the consequence of using WTI futures prices, which were the closing prices, to determine standard deviations and moving averages.
WTI futures prices and the number of days remaining until expiry, as well as standard deviations and moving averages based on WTI futures prices were used as input data for artificial neural networks. The choice of the above-mentioned variables was motivated by the following assumptions:
(1)
Standard deviations provide a certain representation of the dynamics of price changes; the option can only be exercised profitably if WTI price is big enough. In addition, past price changes affect the volatility of the option, which is one of the key parameters determining the option premium;
(2)
Moving average is widely used in the technical analysis as it reflects market trends. Additionally, some researchers have shown that moving averages are indicators that predict the directional movement of oil prices with high accuracy [43];
(3)
The number of days remaining until expiration of the option affects the option premium (cost of hedging) and thus indirectly the final result achieved by the call option buyer. This parameter is also related to the internal value of the option and the chance the option will end in the money.
(4)
The WTI futures price is a reference point to the level of current prices.
For the purpose of network operation, the data was divided into two sets—a training set and a test set. Figure 1 presents WTI futures prices quotations distinguished between a training set and a test set.
The set of observations was divided into a training set and a test set at a ratio of 75% to 25% respectively. A continuous set of observations, starting on 17th September 2009 and ending on 14th July 2017, was chosen as the training set. The observations constituting the test set cover the period from 17th July 2017 to 14th February 2020.
Table 1 presents quantitative characteristics of training data and test data. Column All contains the number of observations concerning the training and the test sets, as well as the number of all observations (Sum). Additionally, the Table 1 shows a division of observations of each set into cases that allowed obtaining a positive return from taking a long position in the call option ([>0]) and cases that generated a loss from taking a long position ([<0]). For each of the sets, the percentage share of the number of options generating profit in the number of options generating profit and coming from the whole set was also presented (% [>0]). Corresponding calculations were carried out for options that generated losses (% [<0]).
In the test sample, profit-generating observations had a higher percentage than it would appear from the percentage division of all observations into the training and the test sets. The proportion of loss-generating observations is also distorted in relation to the division ratio of the number of observations into the training and the test sets, but this deviation is insignificant. It is worth noting that almost twice as often (65% to 35%) the purchase of ATM options generates a loss rather than a profit. Generally speaking, an ATM call option usually has a delta, interpreted as the probability of exercising the option [56], at approximately 0.5. It should, however, be borne in mind that the option that will be exercised does not necessarily imply a profit to the buyer (cf. Equation (1)).
On the basis of the presented quantitative characteristics, it is possible to conclude that the probability of generating profit from taking a long position in the call option was higher for the test set. Table 2 presents the value characteristics for each set. The first column of the table (All) should be understood as the amount of return on taking a long position in a call option on each subsequent day. The second and third columns are the sum of returns gathered from the observations generated only profit and loss.
The amount of the percentage return for each set is equal to the ratio of the division of observations into the training and the test sets (%All). On the basis of the percentage value, the shares of profits and losses from the purchase of options for the particular set in the total profit and loss figures, it can be concluded that the training set offers greater opportunities to achieve higher earnings as well as higher levels of loss. This issue is further detailed in Table 3 which contains both quantitative and value characteristics for each set. In addition, these characteristics were divided into value ranges representing the amount of return for the call option holder.
For each of the sets, the percentages of the above-mentioned characteristics were also presented in relation to the value of the number of all observations and the sum of returns from the purchase of an option, with a distinction between ranges of returns greater ([>0]) and smaller ([<0]) than zero. The final part presents quantitative and qualitative summaries for each of the sets and all observations together.
The data presented in Table 3 shows a much higher share of observations with high levels of loss and profit for the training set than for the test set. This means that there were many more observations with price increase in the training set (hence high profits in individual items) and the options were more expensive hedging instruments (which resulted in higher individual losses). In the case of losses, the percentage share of observations generating a loss of more than USD 2 is almost twice as high for the training set as for the test set. The situation is similar in the case of the amount of profit, i.e., more than twice as many observations of the training set generated profit above USD 3 as in the case of the test set. In the case of the test set, the share of observations generating small profits and small losses (up to USD 1) was significantly higher. The relations described are illustrated in Figure 2 and Figure 3.
As can be seen in Figure 3, an extremely strong imbalance occurs between the training and the test sets for profit over USD 3. For both, the training and the test sets, the value of the AR indicator (average return) is less than zero (Table 4). Therefore, based on formula (2), the EP value for the particular network should be greater than zero, and not greater than the AR value.
Figure 4 presents the values of the AR, MP and ML indicators on the timeline with a distinction between the training set and the test set.
The abovementioned parameters behave similarly for both sets. Greater deviations of the curves determined by the MP and ML indicators from the straight line ‘0′ occurring in the training set provide a visual confirmation of the conclusion that it is possible to achieve both greater loss and greater profit for the training set than for the test set. The next part of the study presents the results of SANN performance for the data sets described.

4. Results

The Statistica software (TIBCO Software Inc, Palo Alto, California, United States) was used to build the networks that were used to search for call option buy signals. The networks analyzed were classification networks and they were created using the automatic network selection tool built into the Statistica software. The networks created were MLP type networks with the number of neurons in the hidden layer between 5 and 30. The following indicators were used to assess the errors made by the network: sum of squares and joint entropy. The following functions were used as activation functions: linear, logistic, exponential, sinus and hyperbolic tangent (for the hidden and output layers) and softmax (for the output layer when joint entropy was used as the error evaluation function). For each type of activation function for the hidden layer neurons, 50,000 networks were trained. The best results (EP) for networks using a specific activation function (for the training set) are presented in Table 5. Additionally, the average results of the best 5 and 10 solutions for the particular activation function used in the hidden layer are attached. This information has been added due to the fact that the results presented are best achieved by the networks for the training set, not the test one. Therefore, AVG5 and AVG10 results may be better than the BEST results but only for the test set.
By far the worst results were obtained for the activation functions, linear and sinus (taking into account the training set). The best results, on the other hand, were obtained using the following functions: logistic and hyperbolic tangents. The results were also collated in Figure 5 using the %MP parameter which shows the percentage share of the network application result in the maximum profit that could have been obtained in the training set or on the test set (cf. Equation (11)).
The most distinctive result for the test set was obtained with the use of hyperbolic tangent as the activation function for the hidden layer neurons. Also noteworthy are the results obtained for the logistic activation function for which the average values calculated for the test set are higher than those for the BEST solution.
Based on the presented results, it was decided to carry out additional tests for three hidden layer activation functions, allowing to obtain the best solutions. Two output layer activation functions (logistic, softmax) were selected, which enabled to obtain the best results for the selected hidden layer activation functions (detailed results for the best 10 solutions for each particular activation function can be found in Appendix A (Table A1) and are marked as T1 set). For each combination of selected activation functions in the hidden layer and the output layer, 100,000 networks were trained. The obtained results (marked as T2 in Table A1) are shown in Table 6.
For each of the analyzed activation functions of the hidden layer in the T2 set, an improvement of the best result for the test set was obtained as compared with the T1 set. The values of the %MP parameter obtained for the T2 set are presented in Figure 6.
Repeated calculations confirmed that the worst results for the training set were obtained using the exponential activation function in the hidden layer. At the same time, the exponential activation function (using the logistic activation function for the output layer) enabled to obtain the best (of the 6 cases considered in the T2 set) BEST result. Also noteworthy are the results of hyperbolic tangents for the softmax function, where an upward trend is visible for the consecutive values considered. This may indicate a low value of the BEST result for the test set as compared with the results of the next networks (with similar parameters).
Table 7 presents a summary containing the results (for the training set) obtained by the 10 best networks (from T1 and T2 sets). It is noteworthy that the average of the 5 best results for the logistic activation function is presented with an error assessment based on the entropy coefficient.
For the five networks that yielded the best results (Table 7, numbers 1–5), the graphs were made to show the change in time of the values of the MP, AR and EP indicators. Figure 7 presents the values of MP, AR and EP for the training set, while Figure 8 presents the values for the test set. The value in brackets by the EP coefficient refers to the network number in Table 7.
The difference on the last day included in the training set between the highest and the lowest values of the %MP indicator for the five networks under consideration is 5.4%. It is worth noting that this value (Max (%MP)-Min (%MP)) was not less than 20% for nearly 2/3 of the number of training set observations. This difference is particularly visible in the central part of the chart, where the curves clearly diverge to ‘get closer’ towards the end of the training set. For all the networks, there is a clear tendency in which the value of the EP indicator ‘follows’ the MP indicator.
For the test set, the results obtained by the networks (EP) are significantly worse than for the training set. Figure 8 also shows that the relation between the MP and EP values for the test set is much weaker (less favorable from the perspective of a call option buyer) than for the training set. It is also worth noting that for the data set corresponding to the first half of 2018, the value of the AR indicator exceeds the values of the EP indicators. Furthermore, only a weak reaction of the EP indicators to the changes in the AR indicator can be observed.
For the test set, the difference between the highest and the lowest values of %MP is almost twice as high as for the training set and amounts to 12.1%. This value also means that the best solution out of the five considered networks (%MP (5) = 24.2%) is twice as good as the worst solution (%MP (2) = 12.1%).
Table 8 presents detailed quantitative and qualitative information on the five networks that allow to obtain the best result in the training set.
Only one of the discussed networks generated fewer than 500 buy signals for the training set, i.e., network number 4. This network was also distinguished by a very low value of incorrectly generated buy signals for the training set, which also resulted in a small loss due to buying options based on incorrect signals. The remaining networks generated a similar number of buy signals for the training set, in the range between 635 and 651. For three networks, the return on buying a hedge based on correctly generated buy signals for the training set exceeded the value of USD 1300. In the case of the test set, the results obtained by the networks based on correctly generated buy signals can be divided into two classes:
(1)
Networks allowing revenue of approx. USD 120
(2)
Networks generating revenue of approx. USD 150
The network that achieved a significantly higher %MP rate as compared with the other ones is the network (5). It is worth noting that this network is not distinguished by either the number of correctly or erroneously generated buy signals for the test set or the profit or loss achieved by these signals.
When analyzing the data included in Table 8, two strategies of generating buy signals can be distinguished. The first can be described as ‘many bad, many (more) good’ (MBMG). It consists in frequent generation of buy signals, a large portion of which may turn out to be incorrect; however, a much larger proportion of the signals (allowing to obtain a positive value of the EP indicator) is generated correctly. The second strategy (the use of which can only be observed in the case of the network (4) for the training set) can be characterized as: ‘few good, fewer bad’ (FGFB). It consists in limiting the number of generated buy signals (even at the expense of a smaller number of profit-generating signals), while strongly limiting the number of loss-generating signals.
The undoubted advantage of the MBMG strategy is that it is more likely (based on the number of long positions taken in call options) to hedge against unexpected but significant increases in WTI oil prices. On the other hand, the cost of frozen capital for the FGFB strategy is much lower. Also, the costs associated with the purchase of options are lower (the total of option premiums paid).

5. Conclusions and Future Research Directions

This paper thoroughly examines the ability of artificial neural networks as a tool to support the decision-making process concerning hedging WTI oil prices with the use of option contracts. We analyze the selection of the activation function both in the hidden layer and in the output layer. Due to the limitations of Statistica, there was only one hidden layer in the given networks. The parameter of the number of neurons in the hidden layer was limited by the range from 5 to 30. Using the Statistica automatic network selection tool, this size was modified (see Appendix A, Table A1, column “SANN’s Name”) to obtain the highest value of the quality of the network matching. Research outcomes revealed that the networks with the best results used the Logistic activation function in the hidden layer and the Softmax activation function in the output layer for the number of neurons between 5 and 25.
As the research has shown, the situations where an option was not exercised or could have been exercised at a loss (the option premium was greater than the difference between the oil price on the expiry date of the option and the option strike price) were much more common in the analyzed period. As a result, the value of the AR (average return) indicator for both the training and the test sets was less than zero. The given networks (SANNs) made it possible to generate buy signals for which the sum of results obtained from long positions in call options was positive not only in the training set but also in the test set. What is more, the values of the %MP indicator for the best networks in the training set were between 12–24% in the test set, which should be considered a promising result from the perspective of further research. For future work, the following potential research directions were distinguished:
  • Possible use of neural networks to search for buy or sell signals of other types of options (such as put options and American options);
  • Possible use of a network to improve the performance of option strategies that require the acquiring and issuing (buying and selling) options that differ in terms of parameters such as strike price and exercise dates;
  • Possible use of the proposed model to support the process of making decisions whether or not to purchase a hedge in other raw material markets;
  • An attempt to improve the results obtained by networks for the test set.

Author Contributions

Conceptualization, R.P., B.Ł.; model design and model calculations R.P., B.Ł.; writing original draft preparation, R.P., B.Ł.; data visualization, R.P., B.Ł.; discussion R.P., B.Ł.; writing review and editing, R.P., B.Ł. All authors have read and agreed to the published version of the manuscript.

Funding

This study was conducted under a research project funded by a statutory grant of the AGH University of Science and Technology in Krakow for maintaining research potential.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Table A1. Best SANN’s results.
Table A1. Best SANN’s results.
Num.Training SetTest SetSANN’s NameTraining Algorithm Error FunctionHidden LayerOutput LayerTest Number
11147.647.8MLP 53-16-2BFGS 76EntropyLogisticSoftmaxT1
2948.844.1MLP 53-9-2BFGS 52SOSLogisticLogisticT1
3929.660.1MLP 53-7-2BFGS 77SOSLogisticLogisticT1
4838.777.4MLP 53-8-2BFGS 55EntropyLogisticSoftmaxT1
5804.433.8MLP 53-7-2BFGS 53EntropyLogisticSoftmaxT1
6801.359.0MLP 53-12-2BFGS 53EntropyLogisticSoftmaxT1
7800.145.8MLP 53-6-2BFGS 70SOSLogisticLogisticT1
8778.146.2MLP 53-9-2BFGS 53SOSLogisticLogisticT1
9768.559.6MLP 53-5-2BFGS 67SOSLogisticLogisticT1
10759.772.1MLP 53-6-2BFGS 101SOSLogisticLogisticT1
1131.214.6MLP 53-13-2BFGS 28EntropyLinearSoftmaxT1
1230.72.1MLP 53-28-2BFGS 25EntropyLinearSoftmaxT1
1329.815.2MLP 53-9-2BFGS 25SOSLinearExponentialT1
1429.40.1MLP 53-19-2BFGS 26EntropyLinearSoftmaxT1
1529.2−2.2MLP 53-6-2BFGS 30EntropyLinearSoftmaxT1
1628.40.5MLP 53-17-2BFGS 25EntropyLinearSoftmaxT1
1728.40.5MLP 53-6-2BFGS 29EntropyLinearSoftmaxT1
1828.3−0.9MLP 53-19-2BFGS 20EntropyLinearSoftmaxT1
1928.20.1MLP 53-21-2BFGS 18EntropyLinearSoftmaxT1
2028.00.1MLP 53-5-2BFGS 24EntropyLinearSoftmaxT1
21215.050.1MLP 53-5-2BFGS 30SOSSinusHyperbolic tangentT1
22209.224.2MLP 53-6-2BFGS 62EntropySinusSoftmaxT1
23201.343.7MLP 53-5-2BFGS 24SOSSinusHyperbolic tangentT1
24188.646.0MLP 53-6-2BFGS 42SOSSinusLogisticT1
25174.236.5MLP 53-8-2BFGS 28SOSSinusHyperbolic tangentT1
26173.938.8MLP 53-7-2BFGS 27EntropySinusSoftmaxT1
27172.244.4MLP 53-28-2BFGS 22EntropySinusSoftmaxT1
28168.844.1MLP 53-8-2BFGS 27SOSSinusExponentialT1
29165.439.8MLP 53-7-2BFGS 49SOSSinusExponentialT1
30160.812.5MLP 53-20-2BFGS 38EntropySinusSoftmaxT1
311087.495.6MLP 53-17-2BFGS 88EntropyHyperbolic tangentSoftmaxT1
321008.752.9MLP 53-12-2BFGS 83EntropyHyperbolic tangentSoftmaxT1
33928.549.9MLP 53-26-2BFGS 69EntropyHyperbolic tangentSoftmaxT1
34874.356.1MLP 53-22-2BFGS 60EntropyHyperbolic tangentSoftmaxT1
35853.742.3MLP 53-15-2BFGS 57EntropyHyperbolic tangentSoftmaxT1
36844.564.7MLP 53-10-2BFGS 68EntropyHyperbolic tangentSoftmaxT1
37824.653.1MLP 53-23-2BFGS 51SOSHyperbolic tangentLogisticT1
38813.386.6MLP 53-6-2BFGS 128EntropyHyperbolic tangentSoftmaxT1
39779.940.9MLP 53-22-2BFGS 53EntropyHyperbolic tangentSoftmaxT1
40777.892.5MLP 53-14-2BFGS 56EntropyHyperbolic tangentSoftmaxT1
41747.373.3MLP 53-6-2BFGS 77EntropyExponentialSoftmaxT1
42710.847.5MLP 53-25-2BFGS 121EntropyExponentialSoftmaxT1
43709.022.2MLP 53-7-2BFGS 65EntropyExponentialSoftmaxT1
44646.858.6MLP 53-6-2BFGS 59EntropyExponentialSoftmaxT1
45646.530.8MLP 53-25-2BFGS 105EntropyExponentialSoftmaxT1
46622.266.9MLP 53-26-2BFGS 94EntropyExponentialSoftmaxT1
47599.249.3MLP 53-19-2BFGS 77EntropyExponentialSoftmaxT1
48568.720.4MLP 53-19-2BFGS 84EntropyExponentialSoftmaxT1
49558.645.2MLP 53-26-2BFGS 89EntropyExponentialSoftmaxT1
50555.643.8MLP 53-21-2BFGS 61EntropyExponentialSoftmaxT1
11085.069.6MLP 53-11-2BFGS 131SOSLogisticLogisticT2
21016.967.7MLP 53-13-2BFGS 103SOSLogisticLogisticT2
3951.145.2MLP 53-11-2BFGS 91SOSLogisticLogisticT2
4946.461.9MLP 53-10-2BFGS 96SOSLogisticLogisticT2
5943.161.1MLP 53-9-2BFGS 108SOSLogisticLogisticT2
6939.864.0MLP 53-10-2BFGS 78SOSLogisticLogisticT2
7921.759.5MLP 53-13-2BFGS 72SOSLogisticLogisticT2
8911.170.4MLP 53-18-2BFGS 61SOSLogisticLogisticT2
9904.669.1MLP 53-7-2BFGS 146SOSLogisticLogisticT2
10895.570.2MLP 53-8-2BFGS 160SOSLogisticLogisticT2
111175.267.8MLP 53-14-2BFGS 90EntropyLogisticSoftmaxT2
121146.564.9MLP 53-21-2BFGS 67EntropyLogisticSoftmaxT2
131001.050.7MLP 53-11-2BFGS 66EntropyLogisticSoftmaxT2
14981.632.6MLP 53-14-2BFGS 89EntropyLogisticSoftmaxT2
15977.868.2MLP 53-9-2BFGS 67EntropyLogisticSoftmaxT2
16921.569.3MLP 53-13-2BFGS 94EntropyLogisticSoftmaxT2
17898.667.4MLP 53-12-2BFGS 60EntropyLogisticSoftmaxT2
18875.642.2MLP 53-26-2BFGS 52EntropyLogisticSoftmaxT2
19861.919.0MLP 53-13-2BFGS 88EntropyLogisticSoftmaxT2
20857.668.8MLP 53-7-2BFGS 63EntropyLogisticSoftmaxT2
211135.763.0MLP 53-24-2BFGS 106SOSHyperbolic tangentLogisticT2
221032.964.5MLP 53-11-2BFGS 61SOSHyperbolic tangentLogisticT2
231026.226.5MLP 53-28-2BFGS 59SOSHyperbolic tangentLogisticT2
24827.247.1MLP 53-8-2BFGS 75SOSHyperbolic tangentLogisticT2
25825.946.0MLP 53-16-2BFGS 57SOSHyperbolic tangentLogisticT2
26812.045.8MLP 53-8-2BFGS 42SOSHyperbolic tangentLogisticT2
27797.137.7MLP 53-5-2BFGS 42SOSHyperbolic tangentLogisticT2
28782.830.8MLP 53-8-2BFGS 66SOSHyperbolic tangentLogisticT2
29766.029.1MLP 53-11-2BFGS 42SOSHyperbolic tangentLogisticT2
30761.466.2MLP 53-7-2BFGS 47SOSHyperbolic tangentLogisticT2
311045.739.8MLP 53-13-2BFGS 63EntropyHyperbolic tangentSoftmaxT2
321011.849.0MLP 53-26-2BFGS 102EntropyHyperbolic tangentSoftmaxT2
331000.889.8MLP 53-13-2BFGS 64EntropyHyperbolic tangentSoftmaxT2
34978.157.0MLP 53-24-2BFGS 117EntropyHyperbolic tangentSoftmaxT2
35949.571.5MLP 53-17-2BFGS 105EntropyHyperbolic tangentSoftmaxT2
36941.175.0MLP 53-20-2BFGS 60EntropyHyperbolic tangentSoftmaxT2
37938.366.2MLP 53-27-2BFGS 55EntropyHyperbolic tangentSoftmaxT2
38930.043.4MLP 53-14-2BFGS 83EntropyHyperbolic tangentSoftmaxT2
39918.949.0MLP 53-18-2BFGS 82EntropyHyperbolic tangentSoftmaxT2
40913.086.3MLP 53-14-2BFGS 110EntropyHyperbolic tangentSoftmaxT2
41719.770.5MLP 53-6-2BFGS 186SOSExponentialLogisticT2
42656.843.3MLP 53-6-2BFGS 81SOSExponentialLogisticT2
43647.552.9MLP 53-6-2BFGS 83SOSExponentialLogisticT2
44627.2106.5MLP 53-6-2BFGS 191SOSExponentialLogisticT2
45623.550.3MLP 53-8-2BFGS 78SOSExponentialLogisticT2
46613.846.1MLP 53-6-2BFGS 123SOSExponentialLogisticT2
47607.469.9MLP 53-8-2BFGS 59SOSExponentialLogisticT2
48606.659.6MLP 53-5-2BFGS 90SOSExponentialLogisticT2
49595.859.7MLP 53-9-2BFGS 54SOSExponentialLogisticT2
50595.031.4MLP 53-8-2BFGS 73SOSExponentialLogisticT2
51847.045.7MLP 53-11-2BFGS 89EntropyExponentialSoftmaxT2
52821.948.3MLP 53-19-2BFGS 119EntropyExponentialSoftmaxT2
53820.131.2MLP 53-9-2BFGS 90EntropyExponentialSoftmaxT2
54792.153.0MLP 53-12-2BFGS 95EntropyExponentialSoftmaxT2
55787.230.3MLP 53-9-2BFGS 106EntropyExponentialSoftmaxT2
56773.445.6MLP 53-19-2BFGS 104EntropyExponentialSoftmaxT2
57759.229.0MLP 53-14-2BFGS 101EntropyExponentialSoftmaxT2
58752.149.5MLP 53-9-2BFGS 77EntropyExponentialSoftmaxT2
59723.732.6MLP 53-7-2BFGS 71EntropyExponentialSoftmaxT2
60723.541.4MLP 53-20-2BFGS 80EntropyExponentialSoftmaxT2

References

  1. Tsai, C. How do U.S. stock returns respond differently to oil price shocks pre-crisis, within the financial crisis, and post-crisis? Energy Econ. 2015, 50, 47–62. [Google Scholar] [CrossRef]
  2. BP Statistical Review of World Energy. Available online: https://www.bp.com/en/global/corporate/energy-economics/statistical-review-of-world-energy.html (accessed on 20 November 2019).
  3. Hamilton, J.D. Oil and the macroeconomy since World War II. J. Pol. Econ. 1983, 92, 228–248. [Google Scholar] [CrossRef]
  4. Hamilton, J.D. What is an oil shock? J. Econ. 2003, 113, 363–398. [Google Scholar] [CrossRef]
  5. Kilian, L. Exogenous oil supply shocks: How big are they and how much do they matter for the US economy? Rev. Econ. Stat. 2008, 90, 216–240. [Google Scholar] [CrossRef]
  6. Lee, C.C.; Lee, C.C.; Ning, S.L. Dynamic relationship of oil price shocks and country risks. Energy Econ. 2017, 66, 571–581. [Google Scholar] [CrossRef]
  7. Arouri, M.; Jouini, J.; Nguyen, K.D. On the impacts of oil price fluctuations on European equity markets: Volatility spillover and hedging effectiveness. Energy Econ. 2012, 34, 611–617. [Google Scholar] [CrossRef]
  8. Cuando, J.; de Garcia, F.P. Oil price shocks and stock market returns: Evidence for some European countries. Energy Econ. 2014, 42, 365–377. [Google Scholar] [CrossRef]
  9. Bagirov, M.; Mateus, C. Oil prices, stock markets and firm performance: Evidence from Europe. Int. Rev. Econ. Financ. 2019, 61, 270–288. [Google Scholar] [CrossRef]
  10. Kilian, L. Not all oil shocks are alike: Disentangling demand and supply shocks in the crude oil market. Am. Econ. Rev. 2009, 99, 1053–1069. [Google Scholar] [CrossRef] [Green Version]
  11. Kilian, L.; Park, C. The impact of oil price shocks on the US stock market. Int. Econ. Rev. 2009, 50, 1267–1287. [Google Scholar] [CrossRef]
  12. Kang, W.; Ratti, R.A.; Vespignani, J. The impact of oil price shocks on the U.S. stock market: A note on the roles of US and non-U.S. oil production. Econ. Lett. 2016, 145, 176–181. [Google Scholar] [CrossRef] [Green Version]
  13. Lambertides, N.; Savva, C.S.; Tsouknidis, D.A. The effects of oil price shocks on U.S. stock order flow imbalances and stock returns. J. Int. Money Financ. 2017, 74, 137–146. [Google Scholar] [CrossRef]
  14. Broadstock, D.C.; Filis, G. Oil price shocks and stock market returns: New evidence from the United States and China. J. Int. Financ. Mark. Inst. Money 2014, 33, 417–433. [Google Scholar] [CrossRef] [Green Version]
  15. Li, P.; Zhang, Z.; Yang, T.; Zeng, Q. The relationship among China’s fuel oil spot, futures and stock markets. Financ. Res. Lett. 2018, 24, 151–162. [Google Scholar]
  16. Li, X.; Wei, Y. The dependence and risk spillover between crude oil market and China stock market: New evidence from a variational mode decomposition-based copula method. Energy Econ. 2018, 74, 565–581. [Google Scholar] [CrossRef]
  17. Wei, Y.; Qin, S.; Li, X.; Zhu, S.; Wei, G. Oil price fluctuation, stock market and macroeconomic fundamentals: Evidence from China before and after the financial crisis. Financ. Res. Lett. 2019, 30, 23–29. [Google Scholar] [CrossRef]
  18. Yu, L.; Wang, Z.; Tang, L. A decomposition-ensemble model with data-characteristic-driven reconstruction for crude oil price forecasting. Appl. Energy 2015, 156, 251–267. [Google Scholar] [CrossRef]
  19. An, J.; Mikhaylov, A.; Moiseev, N. Oil price predictors: Machine learning approach. Int. J. Energy Econ. Policy 2019, 9, 1–6. [Google Scholar] [CrossRef] [Green Version]
  20. Aleksendrić, D.; Carlone, P. Soft Computing in the Design and Manufacturing of Composite Materials. Applications to Brake Friction and Thermoset Matrix Composites; Woodhead Publishing: London, UK, 2015; pp. 39–60. [Google Scholar]
  21. Sadorsky, P. Modeling and forecasting petroleum futures volatility. Energy Econ. 2006, 28, 467–488. [Google Scholar] [CrossRef]
  22. Cheong, C.W. Modeling and forecasting crude oil markets using ARCH-type models. Energy Policy 2009, 37, 2346–2355. [Google Scholar] [CrossRef]
  23. Kang, S.H.; Kang, S.; Yoon, S. Forecasting volatility of crude oil markets. Energy Econ. 2009, 31, 119–125. [Google Scholar] [CrossRef]
  24. Mohammadi, H.; Su, L. International evidence on crude oil price dynamics: Applications of ARIMA-GARCH models. Energy Econ. 2010, 32, 1001–1008. [Google Scholar] [CrossRef]
  25. Wei, Y.; Wang, Y.; Huang, D. Forecasting crude oil market volatility: Further evidence using GARCH-class models. Energy Econ. 2010, 32, 1477–1484. [Google Scholar] [CrossRef]
  26. Kang, S.H.; Yoon, S. Modeling and forecasting the volatility of petroleum futures prices. Energy Econ. 2012, 36, 354–362. [Google Scholar] [CrossRef]
  27. Zhao, C.L.; Wang, B. Forecasting Crude Oil Price with an Autoregressive Integrated Moving Average (ARIMA) Model. In Fuzzy Information & Engineering and Operations Research & Management; Cao, B., Nasseri, H., Eds.; Springer: Berlin/Heidelberg, Germany, 2014; Volume 211, pp. 275–286. [Google Scholar]
  28. Aamir, M.; Shabri, A. Modelling and Forecasting Monthly Crude Oil Price of Pakistan: A Comparative Study of ARIMA, GARCH and ARIMA Kalman Mode. In Proceedings of the Advances in Industrial and Applied Mathematics, Johor Bahru, Malaysia, 24–26 November 2015; Salleh, S., Aris, N., Bahar, A., Zainuddin, Z.M., Maan, N., Lee, M.H., Ahmad, T., Yusof, Y.M., Eds.; AIP Publishing LLC: Melville, NY, USA, 2016; Volume 1750. [Google Scholar]
  29. Nademi, A.; Nademi, Y. Forecasting crude oil prices by a semiparametric Markov switching model: OPEC, WTI, and Brent cases. Energy Econ. 2018, 74, 757–766. [Google Scholar] [CrossRef]
  30. Lin, H.; Su, Q. Crude Oil Prices Forecasting: An Approach of Using CEEMDAN-Based Multi-Layer Gated Recurrent Unit Networks. Energies 2020, 13, 1543. [Google Scholar] [CrossRef] [Green Version]
  31. Li, J.; Zhu, S.; Wu, O. Monthly crude oil spot price forecasting using variational mode decomposition. Energy Econ. 2019, 83, 240–253. [Google Scholar] [CrossRef]
  32. Azadeh, A.; Moghaddam, M.; Khakzad, M.; Ebrahimipour, V. A flexible neural network-fuzzy mathematical programming algorithm for improvement of oil price estimation and forecasting. Comput. Ind. Eng. 2012, 62, 421–430. [Google Scholar] [CrossRef]
  33. Chiroma, H.; Abdulkareem, S.; Herawan, T. Evolutionary neural network model for West Texas Intermediate crude oil price prediction. Appl. Energy 2015, 142, 266–273. [Google Scholar] [CrossRef]
  34. Wang, J.; Wang, J. Forecasting energy market indices with recurrent neural networks: Case study of crude oil price fluctuations. Energy 2016, 102, 365–374. [Google Scholar] [CrossRef]
  35. Mohamed, M.M.; El-Masry, A.A. Oil price forecasting using gene expression programming and artificial neural networks. Econ. Model. 2016, 54, 40–53. [Google Scholar]
  36. Huang, L.; Wang, J. Global crude oil price prediction and synchronization based accuracy evaluation using random wavelet neural network. Energy 2018, 151, 875–888. [Google Scholar] [CrossRef]
  37. Safari, A.; Davallou, M. Oil price forecasting using a hybrid model. Energy 2018, 148, 49–58. [Google Scholar] [CrossRef]
  38. Li, T.; Hu, Z.; Jia, Y.; Wu, J.; Zhou, Y. Forecasting crude oil prices using ensemble empirical mode decomposition and sparse Bayesian learning. Energies 2018, 11, 1882. [Google Scholar] [CrossRef] [Green Version]
  39. Wang, M.; Zhao, L.; Ruijin, D.; Wang, C.; Lin, C.; Lixin, T.; Stanley, H.E. A novel hybrid method of forecasting crude oil prices using complex network science and artificial intelligence algorithms. Appl. Energy 2018, 220, 480–495. [Google Scholar] [CrossRef]
  40. Ding, Y. A novel decompose-ensemble methodology with AIC-ANN approach for crude oil forecasting. Energy 2018, 154, 328–336. [Google Scholar] [CrossRef]
  41. Li, T.; Zhou, Y.; Li, X.; Wu, J.; He, T. Forecasting daily crude oil prices using improved CEEMDAN and ridge regression-based predictors. Energies 2019, 12, 3603. [Google Scholar] [CrossRef] [Green Version]
  42. Wu, J.; Miu, F.; Li, T. Daily Crude Oil Price Forecasting Based on Improved CEEMDAN, SCA, and RVFL: A Case Study in WTI Oil Market. Energies 2020, 13, 1852. [Google Scholar] [CrossRef] [Green Version]
  43. Dbouk, W.; Jamali, I. Predicting daily oil prices: Linear and non-linear models. Res. Int. Bus. Financ. 2018, 46, 149–165. [Google Scholar] [CrossRef]
  44. Hutchinson, J.M.; Lo, A.W.; Poggio, T. A nonparametric approach to pricing and hedging derivative securities via learning networks. J. Financ. 1994, 49, 851–889. [Google Scholar] [CrossRef]
  45. Andreou, P.C.; Charalambous, C.; Martzoukos, S.H. Pricing and trading European options by combining artificial neural networks and parametric models with implied parameters. Eur. J. Oper. Res. 2008, 185, 1415–1433. [Google Scholar] [CrossRef] [Green Version]
  46. Lin, C.T.; Yeh, H.Y. The valuation of Taiwan stock index option prices—Comparison of performances between Black–Scholes and neural network model. J. Stat. Manag. Syst. 2005, 8, 355–367. [Google Scholar] [CrossRef]
  47. Tseng, C.; Cheng, S.; Wang, Y.; Peng, J. Artificial neural network model of the hybrid EGARCH volatility of the Taiwan stock index option prices. Phys. A Stat. Mech. Its Appl. 2008, 387, 3192–3200. [Google Scholar] [CrossRef]
  48. Lin, C.T.; Yeh, H.Y. Empirical of Taiwan stock index option price forecasting model—Applied artificial neural network. Appli. Econ. 2009, 41, 1965–1972. [Google Scholar] [CrossRef]
  49. Yao, J.; Li, Y.; Tan, C.L. Option price forecasting using neural networks. Omega 2000, 2, 455–466. [Google Scholar] [CrossRef]
  50. Marjak, H. Ocena efektywności wybranych nieparametrycznych modeli wyceny opcji. Folia Pomeranae Univ. Technol. Stetin. Oeconomica 2013, 71, 81–92. [Google Scholar]
  51. Lajbcygier, P.R.; Connor, J.T. Improved option pricing using artificial neural networks and bootstrap methods. Int. J. Neural Syst. 1997, 8, 457–471. [Google Scholar] [CrossRef] [Green Version]
  52. Wang, Y. Nonlinear neural network forecasting model for stock index option price: Hybrid gjr–garch approach. Expert Syst. Appl. 2009, 36, 564–570. [Google Scholar] [CrossRef]
  53. Huh, J. Pricing options with exponential Lévy neural network. Expert Syst. Appl. 2019, 127, 128–140. [Google Scholar] [CrossRef] [Green Version]
  54. Luo, R.; Zhang, W.; Xu, X.; Wang, J. A neural stochastic volatility model. In Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, New Orleans, LA, USA, 2–7 February 2018; Volume 6401. [Google Scholar]
  55. Yang, Y.; Zheng, Y.; Hospedales, T.M. Gated neural networks for option pricing: Rationality by design. In Proceedings of the Thirty-first AAAI conference on artificial intelligence, San Francisco, CA, USA, 4–9 February 2017; pp. 52–58. [Google Scholar]
  56. Hull, J. Option, Futures and Other Derivatives, 8th ed.; Pearson: Boston, MA, USA, 2012; pp. 7–8. [Google Scholar]
  57. Garner, C.; Brittain, P. Commodity Options: Trading and Hedging Volatility in the World’s Most Lucrative Market; Pearson Education: Cranbury, NJ, USA, 2009; p. 2. [Google Scholar]
  58. Peng, C.h.; Magoulas, G.D. Nonmonotone BFGS-trained recurrent neural networks for temporal sequence processing. Appl. Math. Comput. 2011, 217, 5421–5441. [Google Scholar] [CrossRef]
  59. U.S. Energy Information Administration. NYMEX Futures Prices. Available online: https://www.eia.gov/dnav/pet/pet_pri_fut_s1_d.htm (accessed on 20 February 2020).
Figure 1. WTI crude oil futures prices for front-month delivery (in USD per barrel).
Figure 1. WTI crude oil futures prices for front-month delivery (in USD per barrel).
Energies 13 04359 g001
Figure 2. Percentage share of the number and the value of losses within the particular range, for training set and test set.
Figure 2. Percentage share of the number and the value of losses within the particular range, for training set and test set.
Energies 13 04359 g002
Figure 3. Percentage share of the number and the value of profit within the particular range, for training set and test set.
Figure 3. Percentage share of the number and the value of profit within the particular range, for training set and test set.
Energies 13 04359 g003
Figure 4. Development of the indicators (MP, ML, AR) over time for the training set and the test set.
Figure 4. Development of the indicators (MP, ML, AR) over time for the training set and the test set.
Energies 13 04359 g004
Figure 5. Value of the %MP indicator obtained in the SANN taking into account the activation function used in the hidden layer.
Figure 5. Value of the %MP indicator obtained in the SANN taking into account the activation function used in the hidden layer.
Energies 13 04359 g005
Figure 6. Value of the %MP indicator obtained in the SANN taking into account the activation function used in the hidden layer (logistic, hyperbolic tangent, exponential) and in the output layer (logistic, softmax).
Figure 6. Value of the %MP indicator obtained in the SANN taking into account the activation function used in the hidden layer (logistic, hyperbolic tangent, exponential) and in the output layer (logistic, softmax).
Energies 13 04359 g006
Figure 7. Development of the MP and AR indicators in the training set and of the EP indicator for the top five SANNs from the training set (in USD/b).
Figure 7. Development of the MP and AR indicators in the training set and of the EP indicator for the top five SANNs from the training set (in USD/b).
Energies 13 04359 g007
Figure 8. Development of the MP and AR indicators and the EP indicator in the test set for the top five SANNs in the training set (in USD/b).
Figure 8. Development of the MP and AR indicators and the EP indicator in the test set for the top five SANNs in the training set (in USD/b).
Energies 13 04359 g008
Table 1. Quantitative characteristics of training set and the test set.
Table 1. Quantitative characteristics of training set and the test set.
All[>0][<0]% All% [>0]% [<0]
Training set1966672129475%72%76%
Test set66425640825%28%24%
Sum26309281702 35%65%
Table 2. Value characteristics of training set and test set.
Table 2. Value characteristics of training set and test set.
All[>0][<0]% All% [>0]% [<0]
Training set−277.181646.86−1924.0575%81%80%
Test set−91.2395.49−486.6925%19%20%
Sum−368.392042.36−2410.74
Table 3. Characteristics of training set and test set.
Table 3. Characteristics of training set and test set.
Interval (In USD)Training SetTest SetTraining SetTest Set
fortoCountValueCountValue%Count%Value%Count%Value
−∞−356−204.69−34.14.3%10.6%2.2%7.0%
−3−2244−585.942−94.618.9%30.5%10.3%19.4%
−2−1610−904.9182−259.647.1%47.0%44.6%53.3%
−10384−228.6175−98.429.7%11.9%42.9%20.2%
Sum [>0]1294−1924.0408−486.7
01211102.011048.431.4%6.2%43.0%12.2%
12142210.46594.021.1%12.8%25.4%23.8%
23122306.048117.318.2%18.6%18.8%29.7%
31971028.433135.829.3%62.4%12.9%34.3%
Sum [<0]6721646.9256395.5
Sum all1966−277.2664−91.2
Table 4. Comparison of the values of indicators for the training set and the test set.
Table 4. Comparison of the values of indicators for the training set and the test set.
ARMPML
Training set−277.181646.86−1924.05
Test set−91.20395.49−486.69
Table 5. Results (in USD/b) obtained in the SANN including the activation function used in the hidden layer.
Table 5. Results (in USD/b) obtained in the SANN including the activation function used in the hidden layer.
Activation FunctionBESTAVG5AVG10
Training SetTest SetTraining SetTest SetTraining SetTest Set
Linear31.214.630.16.029.23.0
Sinus215.050.1197.740.1182.938.0
Exponential747.373.3692.146.5636.545.8
Hyperbolic tangent 1087.495.6950.559.4879.363.5
Logistic1147.647.8933.852.6857.754.6
Table 6. Results (in USD/b) obtained in the SANN with a division into activation functions used in the hidden layer and in the output layer.
Table 6. Results (in USD/b) obtained in the SANN with a division into activation functions used in the hidden layer and in the output layer.
Activation FunctionBESTAVG5AVG10
Hidden LayerOutput LayerTraining SetTest SetTraining SetTest SetTraining SetTest Set
LogisticLogistic1085.069.6988.561.1951.563.9
LogisticSoftmax1175.267.81056.456.8969.755.1
Hyperbolic tangentLogistic1135.763.0969.649.4876.745.7
Hyperbolic tangentSoftmax1045.739.8997.261.4962.762.7
ExponentialLogistic719.770.5654.964.7629.359.0
ExponentialSoftmax847.045.7813.741.7780.040.7
Table 7. Top 10 SANN results (in USD/b).
Table 7. Top 10 SANN results (in USD/b).
Num.Activation FunctionResultTraining SetTest Set% MP Training Set% MP Test Set
Hidden LayerOutput Layer
1LogisticSoftmaxBEST1175.267.871.4%17.1%
2LogisticSoftmaxBEST1147.647.869.7%12.1%
3LogisticSoftmaxBEST1146.564.969.6%16.4%
4Hyperbolic tangentLogisticBEST1135.763.069.0%15.9%
5Hyperbolic tangentSoftmaxBEST1087.495.666.0%24.2%
6LogisticLogisticBEST1085.069.665.9%17.6%
7LogisticSoftmaxAVG51056.456.864.1%14.4%
8Hyperbolic tangentSoftmaxBEST1045.739.863.5%10.1%
9Hyperbolic tangentLogisticBEST1032.964.562.7%16.3%
10Hyperbolic tangentLogisticBEST1026.226.562.3%6.7%
AVG1093.959.666.4%15.1%
Table 8. Results obtained by the top 5 SANNs in the training set.
Table 8. Results obtained by the top 5 SANNs in the training set.
(1)(2)(3)(4)(5)
CountTraining setCorrect523507511461514
Incorrect11214413828137
Total635651649489651
Test setCorrect69108628986
Incorrect5288547150
Total121196116160136
ValueTraining setCorrect1319.41330.01323.31183.31245.9
Incorrect−144.2−182.3−176.8−47.6−158.5
Total1175.21147.61146.51135.71087.4
Test setCorrect119.6156.6123.0150.6148.4
Incorrect−51.8−108.8−58.0−87.6−52.8
Total67.847.864.963.095.6

Share and Cite

MDPI and ACS Style

Puka, R.; Łamasz, B. Using Artificial Neural Networks to Find Buy Signals for WTI Crude Oil Call Options. Energies 2020, 13, 4359. https://doi.org/10.3390/en13174359

AMA Style

Puka R, Łamasz B. Using Artificial Neural Networks to Find Buy Signals for WTI Crude Oil Call Options. Energies. 2020; 13(17):4359. https://doi.org/10.3390/en13174359

Chicago/Turabian Style

Puka, Radosław, and Bartosz Łamasz. 2020. "Using Artificial Neural Networks to Find Buy Signals for WTI Crude Oil Call Options" Energies 13, no. 17: 4359. https://doi.org/10.3390/en13174359

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop