A Semi-Parametric Non-linear Neural Network Filter: Theory and Empirical Evidence

Michaelides, Panayotis G.; Tsionas, Efthymios G.; Vouldis, Angelos T.; Konstantakis, Konstantinos N.; Patrinos, Panagiotis

doi:10.1007/s10614-016-9628-6

A Semi-Parametric Non-linear Neural Network Filter: Theory and Empirical Evidence

Published: 02 November 2016

Volume 51, pages 637–675, (2018)
Cite this article

Computational Economics Aims and scope Submit manuscript

Panayotis G. Michaelides¹,
Efthymios G. Tsionas^2,6,
Angelos T. Vouldis³,
Konstantinos N. Konstantakis⁴ &
…
Panagiotis Patrinos⁵

334 Accesses
7 Citations
Explore all metrics

Abstract

In this work, we decompose a time series into trend and cycle by introducing a novel de-trending approach based on a family of semi-parametric artificial neural networks. Based on this powerful approach, we propose a relevant filter and show that the proposed trend specification is a global approximation to any arbitrary trend. Furthermore, we prove formally a famous claim by Kydland and Prescott (1981, 1997) that over long time periods, the average value of the cycles is zero. A simple procedure for the econometric estimation of the model is developed as a seven-step algorithm, which relies on standard techniques, where all relevant measures may be computed routinely. Next, using relevant DGPs, we compare and show by means of Monte Carlo simulations that our approach is superior to Hodrick–Prescott (HP) and Baxter and King (BK) regarding the generated distortionary effects and the ability to operate in various frequencies, including changes in volatility, amplitudes and phase. In fact, while keeping the structure of the model relatively simple, our approach is perfectly capable of addressing the case of stochastic trend, in the sense that the generated distortionary effects in the near unit root case are minimal and, by all means, considerably fewer than those generated by HP and BK. Application to EU15 business cycles clustering is presented and the empirical results are consistent with the rigorous theoretical framework developed in this work.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Hide-and-Seek with time-series filters: a model-based Monte Carlo study

Article 22 July 2019

Vadim Kufenko

Trend/Cycle Decomposition

A contribution to the analysis of historical economic fluctuations (1870–2010): filtering, spurious cycles, and unobserved component modeling

Article 16 September 2015

José Luis Cendejas, Félix-Fernando Muñoz & Nadia Fernández-de-Pinedo

Notes

Other popular approaches include the Kalman filter. For an enlightening survey see Kim and Nelson (1999) and for a rigorous analysis of the theory regarding models with non-stationary time series see Chang et al. (2009). Also, several non-linear models have been estimated on real output growth (e.g. Terasvirta 1994). This strand of the literature assumes that output growth is measured accurately, which is quite unlikely to happen since the data contain measurement errors (e.g. Zellner 1992). Hence, sampling all the states conditional on the parameters is relevant (Giordani et al. 2007) but it is not true of threshold models (Pitt et al. 2010). Pitt et al. (2012) used the particle filter to integrate out the states. Also, Malik and Pitt (2011), using particle filtering theory, approximated the likelihood of the unobserved components.
There is also a seasonal component, which is removed when seasonally adjusting the dataset (Hodrick and Prescot 1981, 1997).
For the standard approaches, the trend of a time series is usually regarded as the component, which comprises its non-cyclical elements together with the cyclical elements of lowest frequency (Kozicki 1999). In particular, according to Pollock (2000) popular filters such as the HP, allow powerful low-frequency components to pass through into the de-trended series when they ought to be impeded by the filter and this deficiency is liable to induce spurious cycles in de-trended data series. This is one of the drawbacks of the HP filter. Of course, there exist other model-based approaches that constitute important alternatives (e.g. Harvey and Todd 1983; Hillmer and Tiao 1982; Koopman et al. 1995) which, however, impose features that are often regarded as being undesirable (Pollock 2000).
In this section, for reasons of notation, when we consider fixed (instead of free) parameters, then the respective parameter is denoted by an upper bar.
However, in general, the empirical results are robust, regardless of the activation function used because of the typical properties they posses (Haykin 1999).
For an extensive survey on methods regarding the selection of the number of nodes in neural networks or for the appropriate model selection using information criteria see, among others, Sheela and Deepa (2013) and Konishi and Kitagawa (1996), respectively.
This selection of the cyclical component was made so that the peak of the spectrum in our cycle could be either at zero frequency or at business cycle frequencies.
Despite the difference in the number of iterations between the two procedures, in an econometric perspective, the average correlation coefficient in both procedures is robust, and the only difference lies in the range intervals of their estimates. To this end, without loss of generality, 10,000 iterations are considered to be an asymptotic estimate. Nevertheless, our analysis is based on the average estimates.
The smoothing parameters used for the HP and BK are again set equal to 1600 and 6–32 respectively, as the relevant literature suggests (e.g. Baum et al. 2006), contrarily to the NNF which is data driven.
We would like to thank an anonymous referee for this suggestion.
Cluster analysis has often been applied to European data (see, e.g. Jacquemin and Sapir 1995; Artis and Zhang 1997, 1998a, b).
See, among others, Bock (1974), Bozdogan (1993), Engelman and Hartigan (1969).
In order to avoid biased inferences due to differences in the size of each economy, the cyclical components were standardized by their standard deviation.
We should note of course that the filters of BK and HP do not presuppose the use of stationary time series, i.e. Sowell MLE estimations. Despite the fact that in the case of NNF this is also true, we examined the stationarity characteristics of the cyclical components extracted for reasons of robustness. We would like to thank an anonymous referee for this comment.

References

Adya, M., & Collopy, F. (1998). How effective are neural networks at forecasting and prediction? A review and evaluation. Journal of Forecasting, 17, 481–495.
Article Google Scholar
Agresti, A., & Mojon, B. (2001). Some stylised facts on the euro area business cycle. ECB Working Paper 95.
Aminian, F., Suarez, E. D., Aminian, M., & Waltz, D. T. (2006). Forecasting economic data with neural networks. Computational Economics, 28(1), 71–88.
Article Google Scholar
Anderson, H. M., & Ramsey, J. B. (2002). U.S. and Canadian industrial production indices as coupled oscillators. Journal of Economic Dynamics and Control, 26(1), 33–67.
Article Google Scholar
Andreano, M. S., & Savio, G. (2002). Further evidence on business cycle asymmetries in G7 countries. Applied Economics, 34(7), 895–904.
Article Google Scholar
Andreasen, M. M. (2011). Non-linear DSGE models and the optimized central difference particle filter. Journal of Economic Dynamics and Control, 35(10), 1671–1695.
Article Google Scholar
Andreasen, M. M. (2013). Non linear DSGE models and the central difference Kalman filter. Journal of Applied Econometrics, 28(6), 929–955.
Google Scholar
Artis, M., & Zhang, W. (1997). International business cycle and the ERM: Is there a European business cycle? International Journal of Finance and Economics, 2, 1–16.
Article Google Scholar
Artis, M., & Zhang, W. (1998a). Core and periphery in EMU: A cluster analysis, EUI working paper RSC No. 98/37.
Artis, M., & Zhang, W. (1998b). Membership of EMU: A fuzzy clustering analysis of alternative criteria, EUI working paper RSC no. 98/52.
Balke, N. S., & Fomby, T. B. (1994). Large shocks, small shocks, and economic fluctuations: outliers in macroeconomic time series. Journal of Applied Econometrics, 9(2), 181–200.
Article Google Scholar
Baxter, M., & King, R. G. (1999). Measuring business cycles: Approximate band-pass filters for economic time series. Review of Economic and Statistics, 81(4), 575–593.
Article Google Scholar
Baum F.C. (2006). Introduction to modern eonometrics using Stata. Texas: Stata Press.
Bayoumi, T., & Eichengreen, B. (1993). Shocking aspects of European monetary integration. In F. Torres & F. Giavazzi (Eds.), Adjustment and growth in the European Monetary Union. Cambridge: Cambridge University Press.
Google Scholar
Bayoumi, T., & Eichengreen, B. (1997a). Ever closer to heaven? An optimum-currency-area index for European countries. European Economic Review, 41, 761–770.
Article Google Scholar
Bayoumi, T., & Eichengreen, B. (1997b). Optimum currency areas and exchange rate volatility; theory and evidence compared. In B. Cohen (Ed.), International trade and finance: New frontiers for research: Essays in honour of Peter Kenen. Cambridge: Cambridge University Press.
Google Scholar
Beaudry, P., & Koop, G. (1993). Do recessions permanently change output? Journal of Monetary Economics, 31(2), 149–163.
Article Google Scholar
Bezdec, J. C. (1981). Pattern recognition with fuzzy objective function algorithms. New York: Plenum Press.
Book Google Scholar
Bidarkota, P. V. (1999). Sectoral investigation of asymmetries in the conditional mean dynamics of the real US GDP. Studies in onlinear Dynamics and Econometrics, 3, 191–200.
Google Scholar
Binner, J. M., Gazely, A. M., & Chen, S. (2002). Financial innovation and Divisia money in Taiwan: A neural network approach. The European Journal of Finance, 8, 238–247.
Article Google Scholar
Binner, J. M., Gazely, A. M., Chen, S., & Chie, B. (2004). Financial innovation and Divisia money in Taiwan: Comparative evidence from neural network and vector error-correction forecasting models. Contemporary Economic Policy, 22(2), 213–224.
Article Google Scholar
Binner, J. M., Bissoondeeal, R. K., Elger, T., Gazely, A. M., & Mullineux, A. W. (2005). A comparison of linear forecasting models and neural networks: An application to Euro inflation and Euro Divisia. Applied Economics, 37(6), 665–680.
Article Google Scholar
Bishop, C. M. (1995). Neural networks for pattern recognition. New York: Oxford University Press.
Brockett, P., Cooper, W., Golden, L., & Pitaktong, U. (1994). A neural network method for obtaining an early warning of insurer insolvency. The Journal of Risk and Insurance, 61(3), 402–424.
Article Google Scholar
Bock, H. H. (1974). AutomatischeKlassifikation (Clusteranalyse). Goettingen: Vandenhoek&Ruprecht.
Google Scholar
Bozdogan, H. (1993). Choosing the number of component clusters in the mixture model using a new informational complexity criterion of the inverse Fisher information matrix. In O. Opitz, B. Lausen, & R. Klar (Eds.), Information and classification (pp. 40–54). Berlin: Springer.
Chapter Google Scholar
Brazili, A., & Siltzia, B. (2003). Risk related non-linearities is exchange rates: Evidence from a panel of Central and Eastern European Countries. Open Economies Review, 14, 135–155.
Article Google Scholar
Brunner, A. D. (1997). On the dynamic properties of asymmetric models of real GNP. The Review of Economics and Statistics, 79(2), 321–352.
Article Google Scholar
Burns, A. F., & Mitchell, W. C. (1946). Measuring business cycles. New York: National Bureau of Economic Research.
Google Scholar
Camacho, M., Perez-Quiros, G., & Saiz, L. (2006). Are European business cycle close enough to be just one?CEPR discussion papers, No. 4824.
Campbell, J. Y. & Perron, P. (1991). Pitfalls and opportunities: What macroeconomists should know about unit roots. In O. Blanchard, & S. Fischer (Eds.), NBER Macroeconomics Annual 1991, Vol. 6.
Canova, F. (1998). Detrending and business cycle facts. Journal of Monetary Economics, 41, 475–512.
Article Google Scholar
Canzoneri, M., Valles, J., & Vinals, J. (1996). Do exchange rates move to address international macroeconomic imbalances? CERP discussion papers, No. 1948. London Venter of Economic Policy Research.
Chan, N. H., & Genovese, R. C. (2001). A comparison of linear and nonlinear statistical techniques in performance attribution. IEEE Transactions on Neural Networks, 12(4), 922–928.
Article Google Scholar
Chang, Y., Miller, I. J., & Park, J. Y. (2009). Extracting a common stochastic trend: Theory with some applications. Journal of Econometrics, 150, 231–247.
Article Google Scholar
Clements, M. P., & Krolzig, H.-M. (2004). Can regime-switching models reproduce the business cycle features of US aggregate consumption, investment and output? International Journal of Finance & Economics, 9(1), 1–14.
Article Google Scholar
Concaria, L. A., & Soares, M. J. (2009). Business cycle synchronization across the Euro area: A wavelet analysis, NIPE working papers.
Creal, D., Koopman, S. J., & Zivot, E. (2010). Extracting a robust US business cycle using a time-varying multivariate model-based bandpass filter. Journal of Applied Econometrics, 25(4), 695–719.
Article Google Scholar
Crowley P., & Christi, C. (2003). European Union Studies Association (EUSA), Biennial Conference, (8th), March 27–29.
Dickerson, A., Gibson, H., & Tsakalotos, E. (1998). Business cycle correspondence in the European Union. Economica, 25, 51–77.
Google Scholar
Diebold, F. X., & Rudebusch, G. D. (1989). Long memory and persistence in aggregate output. Journal of Monetary Economics, 24(2), 189–209.
Article Google Scholar
Engelman, L., & Hartigan, J. A. (1969). Percentage points of a test for clusters. Journal of American Statistical Association, 64, 1647.
Article Google Scholar
Falk, B. (1986). Further evidence on the asymmetric behavior of economic time series over the business cycle. Journal of Political Economy, 94(5), 1096–1109.
Article Google Scholar
Faraway, J., & Chatfield, C. (1998). Time series forecasting with neural networks: A comparative study using the airline data. Applied Statistics, 47(2), 231–250.
Google Scholar
Gencay, R. (1999). Linear, non-linear and essential foreign exchange rate prediction with simple technical trading rules. Journal of International Economics, 47(1), 91–107.
Article Google Scholar
Giordani, P., Kohn, R., & van Dijk, D. (2007). A unified approach to nonlinearity, structural change, and outliers. Journal of Econometrics, 137, 112–133.
Article Google Scholar
Guarin A., Liu, X., & Ng, W. L. (2013). Recovering default risk from CDS spreads with a nonlinear filter. Journal of Economic Dynamics and Control (forthcoming October 2013).
Guay, A., & Saint-Amant, P. (2005). Do the Hodrick-Prescott and Baxter-King filters provide a good approximation of business cycles? Annalesd’Economieet de Statistique. ENSAE, issue, 77, 133–155.
Google Scholar
Hamilton, J. D. (1990). Analysis of time series subject to changes in regime. Journal of Econometrics, 45(1–2), 39–70.
Article Google Scholar
Hanke, M. (1999). Neural networks versus Black-Scholes: An empirical comparison of the pricing accuracy of two fundamentally different option pricing methods. Journal of Computational Intelligence in Finance, 5, 26–34.
Google Scholar
Hartigan, J. A., & Wong, M. A. (1978). Algorithm AS 136: A K-means clustering algorithm. Applied Statistics, 28, 100–108.
Article Google Scholar
Harvey, A. C., & Todd, P. H. (1983). Forecasting economic time series with structural and BoxJenkins models: A case study. Journal of Business and Economic Forecasting, 1, 299–307.
Google Scholar
Harvey, A. C., & Jaeger, A. (1993). Detrending: Stylized facts and the business cycle. Journal of Applied Econometrics, 8, 231–247.
Article Google Scholar
Haykin, S. (1999). Neural Networks. New Jersey: Prentice-Hall.
Google Scholar
Hess, G. D., & Iwata, S. (1997). Asymmetric persistence in GDP? A deeper look at depth. Journal of Monetary Economics, 40(3), 535–554.
Article Google Scholar
Hillmer, S. C., & Tiao, G. C. (1982). An ARIMA-model-based approach to seasonal adjustment. Journal of the American Statistical Association, 77, 63–70.
Article Google Scholar
Hodrick, R. J., & Prescott, E. C. (1981). Postwar U.S. business cycles: An empirical investigation. Carnegie Mellon University, discussion paper no. 451.
Hodrick, R. J., & Prescott, E. C. (1997). Postwar U.S. business cycles: An empirical investigation. Journal of Money, Credit, and Banking, 29(1), 1–16.
Article Google Scholar
Hornik, K. (1991). Approximation capabilities of multilayer feedforward networks. Neural Networks, 4(2), 251–257.
Article Google Scholar
Hornik, K., Stinchcombe, M., & White, H. (1989). Multilayer feedforward networks are universal approximators. Neural Networks, 2, 359–366.
Article Google Scholar
Hornik, K., Stinchcombe, M., & White, H. (1990). Universal approximation of an unknown mapping and its derivatives using multilayer feedforward networks. Neural Networks, 3, 551–560.
Article Google Scholar
Hutchinson, J. M., Lo, A. W., & Poggio, T. (1994). A nonparametric approach to pricing and hedging derivative securities via learning networks. Journal of Finance, 49, 851–889.
Article Google Scholar
Jacquemin, A., & Sapir, A. (1995). Is a European hard core credible? (p. 1242). CEPR discussion paper: A statistical analysis.
Google Scholar
Kauermann, G., Teuber, T., & Flaschel, P. (2012). Exploring US business cycles with bivariate loops using penalized spline regression. Computational Economics, 39(4), 409–427.
Article Google Scholar
Kiani, K. M. (2005). Detecting business cycle asymmetries using artificial neural networks and time series models. Computational Economics, 26(1), 65–85.
Article Google Scholar
Kiani, K. M. (2007). Asymmetric business cycle fluctuations and contagion effects in G7 countries. International Journal of Business and Economics, 6(3), 237–253.
Google Scholar
Kiani, K. M. (2009a). Asymmetries in macroeconomic time series in eleven Asian economies. International Journal of Business and Economics, 8(1), 37–54.
Google Scholar
Kiani, K. M. (2009b). Neural networks to detect nonlinearities in time series: Analysis of business cycle in France and the United Kingdom. Applied Econometrics and International Development, 9(1), 67–76.
Google Scholar
Kiani, M. K. (2011). Fluctuations in economic and activity and stabilization policies in the CIS. Computational Economics, 37(2), 193–220.
Article Google Scholar
Kiani, K. M., & Bidarkota, P. V. (2004). On business cycle asymmetries in G7 countries. Oxford Bulletin of Economics and Statistics, 66(3), 333–351.
Article Google Scholar
Kiani, K., & Kastens, T. L. (2006). Using macro-financial variables to forecast recessions: An analysis of Canada, 1957–2002. Applied Econometrics and International Development 2005, 6(3), 97–106.
Kiani, K. M., Bidarkota, P. V., & Kastens, T. L. (2005). Forecast performance of neural networks and business cycle asymmetries. Applied Financial Economics Letters, 1(4), 205–210.
Article Google Scholar
Koopman, S. J., Harvey, A. C., Doornick, J. A., & Shephard, N. (1995). STAMP 5.0: Structural time series analyser modeller and predictor: The manual. London: Chapman & Hall.
Konishi, S., & Kitagawa, G. (1996). Generalized information criteria for model selection. Biometrika, 83(4), 875–890.
Article Google Scholar
Kozicki, S. (1999). Multivariate detrending under common trend restrictions: Implications for business cycles research. Journal of Economic Dynamics and Control, 23, 997–1028.
Article Google Scholar
Kuan, C. M., & White, H. (1994). Artificial neural networks: An econometric perspective. Econometric Reviews, 13, 1–91.
Article Google Scholar
Kim, C.-J., & Nelson, C. R. (1999). Friedman’s plucking model of business fluctuations: Tests and estimates of permanent and transitory components. Journal of Money, Credit and Banking, 31(3), 317–334.
Article Google Scholar
Lee, T., White, H., & Granger, C. W. J. (1993). Testing for neglected nonlinearity in time series models. Journal of Econometrics, 56, 269–290.
Article Google Scholar
Ljung, G., & Box, G. E. P. (1978). On a measure of lack of fit in time series models. Biometrika, 65, 297–303.
Article Google Scholar
Lucas, R. E, Jr. (1977). Understanding business cycles. In K. Karl Brunner & A. Meltzer (Eds.), Stabilization of the domestic and international economy. Amsterdam: North Holland.
MacQueen, J. B. (1967). Some methods for classification and analysis of multivariate observations. In Proceedings of 5th Berkeley symposium on mathematical statistics and probability (Vol. 1, pp. 281–297). Berkeley: University of California Press.
Malik, S., & Pitt, M. K. (2011). Particle filters for continuous likelihood evaluation and maximazation. Journal of Econometrics, 165, 190–209.
Article Google Scholar
Massmann, M., & Mitchell, J. (2004). Reconsidering the evidence: Are Eurozone business cycles converging? Journal of Business Cycle Measurement and Analysis, 1, 275–307.
Google Scholar
Mise, E., Kim, T.-H., & Newbold, P. (2005). On suboptimality of the Hodrick–Prescott filter at time series endpoints. Journal of Macroeconomics, 27(1), 53–67.
Michaelides, P. G., Vouldis, A. T., & Tsionas, E. G. (2010). Globally flexible functional forms: The neural distance function. European Journal Of Operational Research, 206, 456–469.
Article Google Scholar
Neftci, S. N. (1984). Are economic time series asymmetric over the business cycle? Journal of Political Economy, 92(2), 307–328.
Article Google Scholar
Nelson, C. R., & Plosser, C. I. (1982). Trends and random walks in macroeconomics time series: Some evidence and implications. Journal of Monetary Economics, 10, 139–167.
Article Google Scholar
Oh, K., Zivot, E., & Creal, D. (2008). The relationship between the Beveridge-Nelson decomposition and other permanent transitory decompositions that are popular in economics. Journal of Econometrics, 146, 207–219.
Article Google Scholar
Owners, R., & Sarte, P. D. (2005). How well do diffusion indexes capture business cycles? A spectral analysis. Federal Reserve Bank of Richmond Economic Quarterly, 91(Fall (4)), 23–42.
Ozbek, L., & Ozlale, U. (2005). Employing the extended Kalman filter in measuring the output gap. Journal of Economic Dynamics and Control, 29(9), 1611–1622.
Article Google Scholar
Papageorgiou, T., Michaelides, P. G., & Milios, J. G. (2010). Business cycles synchronization and clustering in Europe (1960–2009). Journal of Economics and Business, 62(5), 419–470.
Pedersen, T. M. (2001). The Hodrick-Prescott filter, the Slutsky effect, and the distortionary effects of filters. Journal of Economic Dynamics and Control, 25, 1081–1101.
Article Google Scholar
Pesaran, M. H., & Potter, S. M. (1997). A floor and ceiling model of US output. Journal of Economic Dynamics and Control, 21(4–5), 661–695.
Article Google Scholar
Phillips, P. C. B., & Magdalinos, T. (2008). Limit theory for explosively cointegrated systems. Econometric Theory, 24, 865–887.
Google Scholar
Pitt, M., Giordani, P., & Kohn, R. (2010). Bayesian inference for time series state space models. In J. Geweke, G. Koop, & H. van Dijk (Eds.), Handbook of Bayesian econometric. Oxford: Oxford University Pres.
Google Scholar
Pitt, M., dos Santos Silva, R., Giordani, P., & Kohn, R. (2012). On some properties of Markov chain Monte Carlo simulation methods based on the particle filter. Journal of Econometrics, 171, 134–151.
Article Google Scholar
Pollock, D. S. G. (2000). Trend estimation and de-trending via rational square-wave filters. Journal of Econometrics, 99, 317–334.
Article Google Scholar
Psaradakis, Z., & Sola, M. (2003). On detreding and cyclical assymetry. Journal of Applied Econometrics, 18, 271–289.
Article Google Scholar
Qi, M., & Maddala, C. S. (1999). Economic factors and the stock market: A new perspective. Journal of Forecasting, 18, 151–166.
Article Google Scholar
Qi, M. (2001). Predicting U.S. recessions with leading indicators via neural network models. International Journal of Forecasting, 17, 383–401.
Article Google Scholar
Ramsey, J. B., & Rothman, P. (1996). Time irreversibility and business cycle asymmetry. Journal of Money, Credit and Banking, 28(1), 1–21.
Article Google Scholar
Rudin, W. (1976). Principles of mathematical analysis. New York: McGraw-Hill International Edition.
Google Scholar
Santin, D., Delgado, F., & Valino, A. (2004). The measurement of technical efficiency: A neural network approach. Applied Economics, 36, 627–635.
Article Google Scholar
Scheinkman, J. A., & LeBaron, B. (1989). Nonlinear dynamics and stock returns. The Journal of Business, 62(3), 311–337.
Article Google Scholar
Schumpeter, J. A. (1939). Business cycles: A theoretical, historical and statistical analysis of the capitalist process. New York: McGraw Hill.
Google Scholar
Serrano-Cinca, C. (1997). Feedforward neural networks in the classification of financial information. The European Journal of Finance, 3, 183–202.
Article Google Scholar
Sheela G. K., & Deepa S. N. (2013). Review on methods to fix number of hidden neurons in neural networks. Mathematical Problems in Engineering, Vol. 2013, Article ID 425740. doi:10.1155/2013/425740.
Stock, J. H., & Watson, M. W. (1999). Forecasting inflation. Journal of Monetary Economics, 44(2), 293–335.
Article Google Scholar
Swanson, N. R., & White, H. (1995). A model-selection approach to assessing the information in the term structure using linear models and artificial neural networks. Journal of Business & Economic Statistics, 13(3), 265–275.
Google Scholar
Swanson, N. R., & White, H. (1997a). A model selection approach to real-time macroeconomic forecasting using linear models and artificial neural networks. The Review of Economics and Statistics, 79(4), 540–550.
Article Google Scholar
Swanson, N. R., & White, H. (1997b). Forecasting economic time series using flexible versus fixed specification and linear versus nonlinear econometric models. International Journal of Forecasting, 13(4), 439–461.
Article Google Scholar
Tanizaki, H., & Mariano, R. S. (1994). Prediction, filtering and smoothing in non-linear and non-normal cases using Monte Carlo integration. Journal of Applied Econometrics, 9, 163–179.
Article Google Scholar
Taylor, C. (1995). EMU 2000? Prospects for European Monetary Union. London: Chatham House Papers, The Royal Institute of International Affairs.
Google Scholar
Terasvirta, T. (1994). Specification, estimation, and evaluation of smooth transition autoregressive models. Journal of the American Statistical Association, 89, 208–218.
Google Scholar
Tsay, R. S. (1988). Outliers, level shifts, and variance changes in time series. Journal of Forecasting, 7, 1–20.
Article Google Scholar
Vishwakarma, K. P. (1994). Reckognizing business cycles turning points by means of neural networks. Journal of Computational Economics, 7, 175–185.
Article Google Scholar
Watanabe, T. (1999). A non-linear filtering approach to stochastic volatility models with application to daily stock returns. Journal of Applied Econometrics, 14, 101–121.
Article Google Scholar
Whittaker, E. T. (1923). On a new method of graduations. Proceedings of the Edinburgh Mathematical Society, 41, 63–75.
Google Scholar
Wingrove, C. R., & Davis, E. R. (2012). Classical linear-control analysis applied to business-cycle dynamics and stability. Journal of Computational Economics, 39(1), 77–98.
Article Google Scholar
Woitek, U. (1998). Note on the Baxter–King filter. Working Papers 9813, Business School, Economics, University of Glasgow.
Zellner, A. (1992). Commentary. In Belagia, M. & Garfinkel, M. (Eds.), The business cycle: Theories and evidence: Proceedings of the sixteenth annual economic policy conference of the Reserve Bank of St Louis.
Zhang, G. P., & Berardi, V. L. (2001). Time series forecasting with neural network ensembles: An application for exchange rate prediction. Journal of the Operational Research Society, 52, 652–664.
Article Google Scholar

Download references

Acknowledgments

We are indebted to three anonymous Referees and the Editor-in-Chief, Hans Amman, for their diligent reading of the manuscript and for the constructive feedback. The first author (P.G.M.) would also like to thank Alexandros Eskenazis for a fruitful discussion on a previous version of this paper.

Author information

Authors and Affiliations

Laboratory of Theoretical and Applied Economics, School of Applied Mathematical and Physical Sciences, National Technical University of Athens, Heroon Polytechneiou 9, 157.80, Zografou Campus, Athens, Greece
Panayotis G. Michaelides
Lancaster University, Lancaster, UK
Efthymios G. Tsionas
European Central Bank & Bank of Greece, Athens, Greece
Angelos T. Vouldis
National Technical University of Athens, Athens, Greece
Konstantinos N. Konstantakis
Department of Electrical Engineering (ESAT-STADIUS), Optimization in Engineering Center (OPTEC), Katholieke Universiteit Leuven, Kasteelpark Arenberg 10, 3001, Leuven (Heverlee), Belgium
Panagiotis Patrinos
Athens University of Economics and Business, Athens, Greece
Efthymios G. Tsionas

Authors

Panayotis G. Michaelides
View author publications
You can also search for this author in PubMed Google Scholar
Efthymios G. Tsionas
View author publications
You can also search for this author in PubMed Google Scholar
Angelos T. Vouldis
View author publications
You can also search for this author in PubMed Google Scholar
Konstantinos N. Konstantakis
View author publications
You can also search for this author in PubMed Google Scholar
Panagiotis Patrinos
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Panayotis G. Michaelides.

Additional information

This paper should not be reported as representing the views of the European Central Bank (ECB). The views expressed are those of the author and do not necessarily reflect those of the ECB.

Appendix

Definition 1

(Trend set) Consider $g_{t_j } ,t\in T\subseteq {\mathbb {R}}^{+},j\in J\subseteq {\mathbb {R}}$ representing the trend of a time series $x_{t_j } \forall j\in J$, such that $g_{t_j } \in {\mathbb {R}}$, $\forall j\in J$. Without loss of generality, let $\mathop \bigcup \nolimits _{t_j } g_{t_j } =\{g_{t_j } :g_{t_j } is~the~trend~of~x_{t_j } \forall j\in J\}\subseteq {\mathbb {R}}$ be the trend set which is assumed to be closed and bounded.

Definition 2

(Time series as random variable) A time series model for the observed data $x_{t_j } ,j\in J$ is a specification of the joint distributions, or only the means and covariances, of a sequence of random variables $\left\{ {X_t } \right\} _{t\in T} $ of which {$x_{t_j } \}_{t\in T} $ is postulated to be a realization.

Definition 3

(Time series set) Consider $x_{t_j } ,j\in J$ an arbitrary macroeconomic time series, such that $x_{t_j } \in {\mathbb {R}}\forall t\in T\subseteq {\mathbb {R}}^{+}$. Without loss of generality, let $\bigcup _{j\in J} x_{t_j } \subseteq {\mathbb {R}}$ be the time series set.

Theorem 2

Proof

From Lemma 1 the trend set is compact. From Lemma 2 any function of the form $:\hbox {F}\left( \hbox {t} \right) \equiv \hbox {d}+\hbox {ct}+\mathop \sum \nolimits _{\mathrm{i=1}}^{\mathrm{N}} \hbox {a}_{\mathrm{i}} \upvarphi \left( {\hbox {b}_{\mathrm{i}} \hbox {t}} \right) ,\upvarphi _{\mathrm{i}} ,\hbox {b}_{\mathrm{i}} ,\hbox {d}\in {\mathbb {R}}, \hbox {c}\in {\mathbb {R}}-\left\{ 0 \right\} \}$ is non-constant, bounded and continuous. Then, from Theorem 1, the family: $\mathcal{F}=\{\hbox {F}\left( \hbox {t} \right) \in \hbox {C}\left( {\mathop \bigcup \nolimits _{j\in J} g_{t_i } } \right) :\hbox {F}\left( \hbox {t} \right) \equiv \hbox {d}+\hbox {ct}+\mathop \sum \nolimits _{\mathrm{i=1}}^{\mathrm{N}} \hbox {a}_{\mathrm{i}} {\upvarphi }\left( {\hbox {b}_{\mathrm{i}} \hbox {t}} \right) ,\upvarphi _{\mathrm{i}} ,\hbox {b}_{\mathrm{i}} ,\hbox {d},\hbox {c}\in {\mathbb {R}}\}$ is dense in $\hbox {C}\left( {\mathop \bigcup \nolimits _{j\in J} g_{t_j } } \right) $. $\square $

Theorem 3 (Linear time trend as degenerate form of NNF)

Proof

Let $x_{t_i } ,i\in I $ be a time series and let $\bar{m} \in \left\{ {1,..M} \right\} $. Then, $\exists \overline{\beta _{\bar{m}_i}} \in {\mathbb {R}}^{\bar{m}}$, $\overline{a_{\bar{m}_i}}\in {\mathbb {R}}^{\bar{m}}+ , \overline{a_{0 \bar{m}_i }}\in {\mathbb {R}}$ and $\overline{\delta _{\bar{m}_i }} \in {\mathbb {R}}$ such that the trend of the macroeconomic time series be given by the following expression:

$$\begin{aligned} g_t =\overline{a_{0 \bar{m}_i } }+\overline{\delta _{\bar{m}_i}}_ t+\mathop \sum \nolimits _{\mathrm{k=1}}^{\bar{m}} \overline{\alpha _k}\varphi \left( {\overline{\beta _k }t} \right) , \quad \forall i\in I \end{aligned}$$

Now, since I is a compact subset of ${\mathbb {R}}$ then it is closed and bounded and there exists $i_0 \in I$ such that $\overline{\beta _{\bar{m}_{i_0}}}=\hbox {max}\{\overline{\beta _{\bar{m}_i}}\in {\mathbb {R}}^{\bar{m}}$}. For, this $\overline{\beta _{\bar{m}_{i_0}}}$ we have that $\overline{\delta _{\bar{m}_{i_0}}} =\hbox {max}\left\{ {\overline{\delta _{\bar{m}_i}} ,\bar{m}_i \in \left\{ {1,\ldots M} \right\} } \right\} $, while the trend of this macroeconomic time series is given by the expression:

$$\begin{aligned} g_t =\overline{a_0} +\overline{\delta _{\bar{m} _{i_0 } } } t+\mathop \sum \nolimits _{\mathrm{k=1}}^{\bar{m}_{i_0}} \overline{\alpha _k } \varphi \left( {\overline{\beta _k }}t \right) , \quad i_0 \in I \end{aligned}$$

But, since: $\overline{\beta _{\bar{m}_{i_0}} }=\hbox {max}\left\{ {\overline{\beta _{\bar{m} _i}},\overline{\beta _{\bar{m}_i}}\in {\mathbb {R}}^{\bar{m}}} \right\} $, then: $\mathop \sum \nolimits _{\mathrm{k=1}}^{\bar{m}_{i_0}} \overline{\alpha _k } \varphi \left( {\overline{\beta _k }}t \right) =\mathop \sum \nolimits _{\mathrm{k=1}}^{\bar{m}_{i_0 } } \overline{\alpha _k }$ . In view of $\varphi $ being increasing monotonic and $\varphi :{\mathbb {R}}\rightarrow \left[ {0,1} \right] $, we have that: $g_t =\overline{a_0 }+ \overline{\delta _{\bar{m}_{i_0}}} t+\mathop \sum \nolimits _{\mathrm{k=1}}^{\bar{m}_{i_0}}\overline{\alpha _k }$.

Hence, the trend approximation is equal to the linear trend, i.e. $g_t =\gamma +\delta t$, where: $\gamma =\overline{a_0} +\mathop \sum \nolimits _{\mathrm{k=1}}^{\bar{m}_{i_0}} \overline{\alpha _k }$. $\square $

Theorem 4 (Mean value of the cycle is zero)

Proof

Let ${x_{{t_j}}}$, $j \in J$ be a time series whose cyclical component is given by the expression:

$$\begin{aligned} {c_{{t_j}}}={x_{{t_j}}}-{g_{{t_j}}}, \forall j \in J \hbox { and } t \in T. \end{aligned}$$

Let $\{{p_{{t_j}}}\}_{j \in J}$ be the respective probability measure assigned on each cyclical times series ${c_{{t_j}}} \forall t \in T$.

Now, provided that:$\sum \nolimits _{{\mathrm{t}\in {\mathrm{T}}}}({\hbox {x}_{{\hbox {t}_{\mathrm{j}}}}}-{\hbox {g}_{{\hbox {t}_{\mathrm{j}}}}}),{\hbox {p}_{{\hbox {t}_\mathrm{j}}}} $ converges absolutely, i.e. $\sum \nolimits _{t\in T}|({x_{{t_j}}}-{g_{{t_j}}}), {p_{{t_j}}}|< \infty $, the expected value of our cyclical component is given by the following expression:

$$\begin{aligned} E\left( {c_{t_j } } \right) =E\left( {(x_{t_j } -g_{t_j ,} )} \right) =\mathop \sum \nolimits _{t\in T} (x_{t_j } -g_{t_j ,} )p_{t_j } \forall j\in J, \quad t\in T \hbox { and } p_{t_j } \in \left[ {0,1} \right] \nonumber \\ \end{aligned}$$

(29)

But, since $\bigcup _{t_{i}}g_{t_{i}}$ is a dense subset on $\bigcup _{t_{j}}x_{t_{j}}$, then by the definition of density we have that $\forall \varepsilon >0 $ and $\forall x_{t_{j} }, j \in J$ there exists $g_{t_{i}}$, $i \in I$ such that $\left| {x_{t_j } -g_{t_i }} \right| <\varepsilon \forall j\in J~and~ \forall i\in I$. Thus, $\forall \left\{ {\hbox {p}_{\mathrm{t}_{\mathrm{j}}}} \right\} _{\mathrm{j}\in \hbox {J}} $ in a relevant probability space, we have that: $|(x_{t_j }-g_{t_j })p_{t_j }|< \varepsilon p _{tj}< \varepsilon \, \forall j \in J $ and $\forall p_{t_j } \in \left[ {0,1} \right] .$

But, without loss of generality, for $\varepsilon _t =1-\frac{1}{2^{t}}$, $\forall t\in T$ we have that: $\sum \nolimits _{t \in T}({1}-\frac{1}{2^{t}})< \infty $.

Hence: $\mathop \sum \nolimits _{t\in T} \left| {(x_{t_j } -g_{t_j ,} )p_{t_j } } \right| <\infty $ (A.14) and, therefore, Eq. (29) defines the expected value of the cyclical component of the time series.

Thus:

$$\begin{aligned} \left( {c_{t_j } } \right) =E\left( {(x_{t_j } -g_{t_j ,} )} \right) =\mathop \sum \nolimits _{t\in T} (x_{t_j } -g_{t_j ,} )p_{t_j } \forall j\in J, t\in T \end{aligned}$$

(30)

But: $\mathop \sum \nolimits _{t\in T} (x_{t_j } -g_{t_j ,} )p_{t_j } =(x_{1_j } -g_{1_j ,} )p_{1_j } +\cdots +(x_{t_j } -g_{t_j ,} )p_{t_j } +\ldots $ and $(x_{1_j } -g_{1_j ,} )p_{1_j } +\cdots +(x_{t_j } -g_{t_j ,} )p_{t_j } <\varepsilon _1 p_{1_j } +\cdots +\varepsilon _T p_{T_j } +\ldots $, $\forall j\in J$, $t\in T$ because of the density of $\bigcup _{t_i } g_{t_i } $ on $\bigcup _{t_j } x_{t_j } $ which implies: $\left| {x_{t_j } -g_{t_i } } \right| <\varepsilon \forall j\in J~and~\forall i\in I$.

However:$\left| {x_{t_j } -g_{t_i } } \right|<\varepsilon _t \Leftrightarrow -\varepsilon _t<x_{t_j } -g_{t_i } <\varepsilon _t $, $\forall j\in J$,$\forall t\in T$ and $\forall i\in I$

$$\begin{aligned} -\mathop \sum \nolimits _{t\in T} \varepsilon _t p_{t_j } \le \mathop \sum \nolimits _{t\in T} \left( {x_{t_j } -g_{t_j } } \right) p_{t_j } \le \mathop \sum \nolimits _{t\in T} \varepsilon _t p_{t_j } \forall j\in J~and~ \forall p_{t_j } \in \left[ {0,1} \right] \end{aligned}$$

Now, without loss of generality, for $\varepsilon _t =1-\frac{1}{2^{t}}$,$\forall t\in T$ we have that:

$$\begin{aligned}&-\mathop \sum \nolimits _{t\in T} \left( {1-\frac{1}{2^{t}}} \right) p_{t_j } \le \mathop \sum \nolimits _{t\in T} \left( {x_{t_j } -g_{t_j } } \right) p_{t_j }\nonumber \\&\quad \le \mathop \sum \nolimits _{t\in T} \left( {1-\frac{1}{2^{t}}} \right) p_{t_j } \forall j\in J~and~ \forall p_{t_j } \in \left[ {0,1} \right] \end{aligned}$$

(31)

But, given that $1-\frac{1}{2^{t}}>0$ and $p_{tj}\in [0,1] $, we have that:

$$\begin{aligned}&\mathop \sum \nolimits _{t\in T} \left( {1-\frac{1}{2^{t}}} \right) p_{t_j } \le \mathop \sum \nolimits _{t\in T} \left( {1-\frac{1}{2^{t}}} \right) \mathop \sum \nolimits _{t\in T} p_{t_j }\nonumber \\&\quad = \mathop \sum \nolimits _{t\in T} \left( {1-\frac{1}{2^{t}}} \right) \rightarrow 0 \hbox { since }: \mathop \sum \nolimits _{t\in T} p_{t_j } =1 \end{aligned}$$

(32)

Similarly: $-\mathop \sum \nolimits _{t\in T} \left( {1-\frac{1}{2^{t}}} \right) p_{t_j } \rightarrow 0$

Hence, given that: $-\mathop \sum \nolimits _{t\in T} \varepsilon _t p_{t_j } \le \mathop \sum \nolimits _{t\in T} \left( {x_{t_j } -g_{t_j } } \right) p_{t_j } \le \mathop \sum \nolimits _{t\in T} \varepsilon _t p_{t_j } \forall j\in J$, $\forall p_{t_j } \in \left[ {0,1} \right] $ and that: $-\mathop \sum \nolimits _{t\in T} \left( {1-\frac{1}{2^{t}}} \right) p_{t_j } \rightarrow 0$ and $\mathop \sum \nolimits _{t\in T} \left( {1-\frac{1}{2^{t}}} \right) p_{t_k } \rightarrow 0$ we get:

$$\begin{aligned} E\left( {c_{t_j } } \right) =0, \forall j\in J,t\in T \hbox { and } \forall p_{t_j } \in \left[ {0,1} \right] \end{aligned}$$

(33)

$\square $

Theorem 5 (Mean value of the NNF cycle is zero)

Proof

Based on Theorem 2, $\mathcal{F}=\{F\left( t \right) \in C\left( X \right) :F\left( t \right) \equiv d+ct+\mathop \sum \nolimits _{i=1}^N a_i \varphi \, \left( {\beta _i t} \right) ,\alpha _i ,\beta _i ,d,c\in {\mathbb {R}}\}$ is dense in $\bigcup _{t_i } g_{t_i } $. Hence, in view of Theorem 4, we have that: $\left( {c_{t_j } } \right) =0\forall j\in J$. $\square $

Rights and permissions

Reprints and permissions

About this article

Cite this article

Michaelides, P.G., Tsionas, E.G., Vouldis, A.T. et al. A Semi-Parametric Non-linear Neural Network Filter: Theory and Empirical Evidence. Comput Econ 51, 637–675 (2018). https://doi.org/10.1007/s10614-016-9628-6

Download citation

Accepted: 13 October 2016
Published: 02 November 2016
Issue Date: March 2018
DOI: https://doi.org/10.1007/s10614-016-9628-6

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A Semi-Parametric Non-linear Neural Network Filter: Theory and Empirical Evidence

Abstract

Access this article

Similar content being viewed by others

Hide-and-Seek with time-series filters: a model-based Monte Carlo study

Trend/Cycle Decomposition

A contribution to the analysis of historical economic fluctuations (1870–2010): filtering, spurious cycles, and unobserved component modeling

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Appendix

Definition 1

Definition 2

Definition 3

Proof

Proof

Proof

Proof

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A Semi-Parametric Non-linear Neural Network Filter: Theory and Empirical Evidence

Abstract

Access this article

Similar content being viewed by others

Hide-and-Seek with time-series filters: a model-based Monte Carlo study

Trend/Cycle Decomposition

A contribution to the analysis of historical economic fluctuations (1870–2010): filtering, spurious cycles, and unobserved component modeling

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Appendix

Appendix

Definition 1

Definition 2

Definition 3

Proof

Proof

Proof

Proof

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation