Abstract
Sales forecasting is a critical task for computer retailers endeavoring to maintain favorable sales performance and manage inventories. In this study, a clustering-based forecasting model by combining clustering and machine-learning methods is proposed for computer retailing sales forecasting. The proposed method first used the clustering technique to divide training data into groups, clustering data with similar features or patterns into a group. Subsequently, machine-learning techniques are used to train the forecasting model of each group. After the cluster with data patterns most similar to the test data was determined, the trained forecasting model of the cluster was adopted for sales forecasting. Since the sales data of computer retailers show similar data patterns or features at different time periods, the accuracy of the forecast can be enhanced by using the proposed clustering-based forecasting model. Three clustering techniques including self-organizing map (SOM), growing hierarchical self-organizing map (GHSOM), and K-means and two machine-learning techniques including support vector regression (SVR) and extreme learning machine (ELM) are used in this study. A total of six clustering-based forecasting models were proposed. Real-life sales data for the personal computers, notebook computers, and liquid crystal displays are used as the empirical examples. The experimental results showed that the model combining the GHSOM and ELM provided superior forecasting performance for all three products compared with the other five forecasting models, as well as the single SVR and single ELM models. It can be effectively used as a clustering-based sales forecasting model for computer retailing.
Similar content being viewed by others
References
Philip D, Alex A, Panagiotis P, Haralambos S (2006) Time series sales forecasting for short shelf-life food products base on artificial neural networks and evolutionary computing. J Food Eng 75:196–204
Luis A, Richard W (2007) Improved supply chain management based on hybrid demand forecasts. Appl Soft Comput 7:136–144
Thomassey S (2010) Sales forecasts in clothing industry: the key success factor of the supply chain management. Int J Prod Econ 128:470–483
Lu CJ, Lee TS, Lian CM (2012) Sales forecasting for computer wholesalers: a comparison of multivariate adaptive regression splines and artificial neural networks. Decis Support Syst 54:584–596
Lu CJ (2014) Sales forecasting of computer products based on variable selection scheme and support vector regression. Neurocomputing 128:491–499
Lu C-J, Shao YE (2012) Forecasting computer products sales by integrating ensemble empirical mode decomposition and extreme learning machine. Math Prob Eng. 2012:831201. doi:10.1155/2012/831201
Choi TM, Hui CL, Liu N, Ng SF, Yu Y (2014) Fast fashion sales forecasting with limited data and time. Decis Support Syst 59:84–92
Xia M, Wong WK (2014) A seasonal discrete grey forecasting model for fashion retailing. Knowl Based Syst 57:119–126
Thomassey S, Happiette M (2007) A neural clustering and classification system for sales forecasting of new apparel items. Appl Soft Comput 7:1177–1187
Chang PC, Liu CH, Wang YW (2006) A hybrid model by clustering and evolving fuzzy rules for sales decision supports in printed circuit board industry. Decis Support Syst 42:1254–1269
Hadavandi E, Shavandi H, Ghanbari A (2011) An improved sales forecasting approach by the integration of genetic fuzzy systems and data clustering: case study of printed circuit board. Expert Syst Appl 38:9392–9399
Sa-ngasoongsong A, Bukkapatnam STS, Kim J, Iyer PS, Suresh RP (2012) Multi-step sales forecasting in automotive industry based on structural relationship identification. Int J Prod Econ 140:875–887
Dai W, Wu J-Y, Lu C-J (2014) Applying different independent component analysis algorithms and support vector regression for IT chain store sales forecasting. Sci World J. 2014:438132. doi:10.1155/2014/438132
Tay FEH, Cao LJ (2001) Improved financial time series forecasting by combining support vector machines with self-organizing feature map. Intell Data Anal 5(4):339–354
Cao LJ (2003) Support vector machines experts for time series forecasting. Neurocomputing 51:321–339
Thomassey S, Fiordaliso A (2006) A hybrid sales forecasting system based on clustering and decision trees. Decis Support Syst 42(1):408–421
Lai RK, Fan CY, Huang WH, Chang PC (2009) Evolving and clustering fuzzy decision tree for financial time series data forecasting. Expert Syst Appl 36:3761–3773
Huang CL, Tsai CY (2009) A hybrid SOFM-SVR with a filter-based feature selection for stock market forecasting. Expert Syst Appl 36:1529–1539
Badge J, Srivastava N (2010) Selection and forecasting of stock market patterns using K-mean clustering. Int J Stat Syst 5:23–27
Kumar M, Patel NR (2010) Using clustering to improve sales forecasts in retail merchandising. Ann Oper Res 174:33–46
Chang PC, Lai CY (2005) A hybrid system combining self-organizing maps with case-based reasoning in wholesaler’s new-release book forecasting. Expert Syst Appl 29:183–192
Chang PC, Liu CH, Fan CF (2009) Data clustering and fuzzy neural network for sales forecasting: a case study in printed circuit board industry. Knowl Based Syst 22:344–355
Lu CJ, Wang YW (2010) Combining independent component analysis and growing hierarchical self-organizing maps with support vector regression in product demand forecasting. Int J Prod Econ 128:603–613
Murlidha V, Menezes B, Sathe M, Murlidhar G (2012) A clustering based forecast engine for retail sales. J Digit Inform Manag 10:219–230
Jain AK (2010) Data clustering: 50 years beyond K-means. Pattern Recogn Lett 31:651–666
Vapnik VN (1999) An overview of statistical learning theory. IEEE Trans Neural Netw 10:988–999
Vapnik VN (2000) The nature of statistical learning theory. Springer, Berlin
Huang GB, Zhu QY, Siew CK (2006) Extreme learning machine: theory and applications. Neurocomputing 70:489–501
Kao LJ, Chiu CC, Lu CJ, Chang CH (2013) A hybrid approach by integrating wavelet-based feature extraction with MARS and SVR for stock index forecasting. Decis Support Syst 54:1228–1244
Lu CJ (2013) Hybridizing nonlinear independent component analysis and support vector regression with particle swarm optimization for stock index forecasting. Neural Comput Appl 23:2417–2427
Bao Y, Xiong T, Hu Z (2014) Multi-step-ahead time series prediction using multiple-output support vector regression. Neurocomputing 129:482–493
Xiong T, Bao Y, Hu Z (2014) Multiple-output support vector regression with a firefly algorithm for interval-valued stock price index forecasting. Knowl Based Syst 55:87–100
Hong WC (2012) Application of seasonal SVR with chaotic immune algorithm in traffic flow forecasting. Neural Comput Appl 21:583–593
Ju FY, Hong WC (2013) Application of seasonal SVR with chaotic gravitational search algorithm in electricity forecasting. Appl Math Model 37:9643–9651
Wu CL, Chau KW (2013) Prediction of rainfall time series using modular soft computing methods. Eng Appl Artif Intell 26:997–1007
Sun ZL, Choi TM, Au KF, Yu Y (2008) Sales forecasting using extreme learning machine with applications in fashion retailing. Decis Support Syst 46:411–419
Wong WK, Guo ZX (2010) A hybrid intelligent model for medium-term sales forecasting in fashion retail supply chains using extreme learning machine and harmony search algorithm. Int J Prod Econ 128:614–624
Xia M, Zhang Y, Weng L, Ye X (2012) Fashion retailing forecasting based on extreme learning machine with adaptive metrics of inputs. Knowl Based Syst 36:253–259
Kohonen T (1989) Self-organization and associative memory, 3rd edn. Springer, Berlin
Palomo EJ, North J, Elizondo D, Luque RM, Watson T (2012) Application of growing hierarchical SOM for visualisation of network forensics traffic data. Neural Netw 32:275–284
Dittenbach M, Rauber A, Merkl D (2002) Uncovering hierarchical structure in data using growing hierarchical self-organizing map. Neurocomputing 48:199–216
Chattopadhyay M, Dan PK, Mazumdar S (2014) Comparison of visualization of optimal clustering using self-organizing map and growing hierarchical self-organizing map in cellular manufacturing system. Appl Soft Comput 22:528–543
Johnson RA, Wichern DW (1992) Applied multivariate statistical analysis. Prentice hall, New Jersey
Cherkassky V, Ma Y (2004) Practical selection of SVM parameters and noise estimation for SVM regression. Neural Netw 17:113–126
Lin CJ, Hsu CW, Chang CC (2003) A practical guide to support vector classification, Technical Report, Department of Computer Science and Information Engineering, National Taiwan University, Taipei
Diebold FX, Mariano RS (1995) Comparing predictive accuracy. J Bus Econ Stat 13:253–263
Pollock AC, Macaulay A, Thomson ME, Onkal D (2005) Performance evaluation of judgemental directional exchange rate predictions. Int J Forecast 21:473–489
Choi TM, Yu Y, Au KF (2011) A hybrid SARIMA wavelet transform method for sales forecasting. Decis Support Syst 51(1):130–140
Acknowledgments
This work is partially supported by the Ministry of Science and Technology of the Republic of China, Grant no. MOST 103-2221-E-231-003-MY2. The authors also gratefully acknowledge the helpful comments and suggestions of the reviewers, which have improved the presentation.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Chen, IF., Lu, CJ. Sales forecasting by combining clustering and machine-learning techniques for computer retailing. Neural Comput & Applic 28, 2633–2647 (2017). https://doi.org/10.1007/s00521-016-2215-x
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-016-2215-x