Skip to main content
Log in

Sales forecasting by combining clustering and machine-learning techniques for computer retailing

  • Original Article
  • Published:
Neural Computing and Applications Aims and scope Submit manuscript

Abstract

Sales forecasting is a critical task for computer retailers endeavoring to maintain favorable sales performance and manage inventories. In this study, a clustering-based forecasting model by combining clustering and machine-learning methods is proposed for computer retailing sales forecasting. The proposed method first used the clustering technique to divide training data into groups, clustering data with similar features or patterns into a group. Subsequently, machine-learning techniques are used to train the forecasting model of each group. After the cluster with data patterns most similar to the test data was determined, the trained forecasting model of the cluster was adopted for sales forecasting. Since the sales data of computer retailers show similar data patterns or features at different time periods, the accuracy of the forecast can be enhanced by using the proposed clustering-based forecasting model. Three clustering techniques including self-organizing map (SOM), growing hierarchical self-organizing map (GHSOM), and K-means and two machine-learning techniques including support vector regression (SVR) and extreme learning machine (ELM) are used in this study. A total of six clustering-based forecasting models were proposed. Real-life sales data for the personal computers, notebook computers, and liquid crystal displays are used as the empirical examples. The experimental results showed that the model combining the GHSOM and ELM provided superior forecasting performance for all three products compared with the other five forecasting models, as well as the single SVR and single ELM models. It can be effectively used as a clustering-based sales forecasting model for computer retailing.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

References

  1. Philip D, Alex A, Panagiotis P, Haralambos S (2006) Time series sales forecasting for short shelf-life food products base on artificial neural networks and evolutionary computing. J Food Eng 75:196–204

    Article  Google Scholar 

  2. Luis A, Richard W (2007) Improved supply chain management based on hybrid demand forecasts. Appl Soft Comput 7:136–144

    Article  Google Scholar 

  3. Thomassey S (2010) Sales forecasts in clothing industry: the key success factor of the supply chain management. Int J Prod Econ 128:470–483

    Article  Google Scholar 

  4. Lu CJ, Lee TS, Lian CM (2012) Sales forecasting for computer wholesalers: a comparison of multivariate adaptive regression splines and artificial neural networks. Decis Support Syst 54:584–596

    Article  Google Scholar 

  5. Lu CJ (2014) Sales forecasting of computer products based on variable selection scheme and support vector regression. Neurocomputing 128:491–499

    Article  Google Scholar 

  6. Lu C-J, Shao YE (2012) Forecasting computer products sales by integrating ensemble empirical mode decomposition and extreme learning machine. Math Prob Eng. 2012:831201. doi:10.1155/2012/831201

    Google Scholar 

  7. Choi TM, Hui CL, Liu N, Ng SF, Yu Y (2014) Fast fashion sales forecasting with limited data and time. Decis Support Syst 59:84–92

    Article  Google Scholar 

  8. Xia M, Wong WK (2014) A seasonal discrete grey forecasting model for fashion retailing. Knowl Based Syst 57:119–126

    Article  Google Scholar 

  9. Thomassey S, Happiette M (2007) A neural clustering and classification system for sales forecasting of new apparel items. Appl Soft Comput 7:1177–1187

    Article  Google Scholar 

  10. Chang PC, Liu CH, Wang YW (2006) A hybrid model by clustering and evolving fuzzy rules for sales decision supports in printed circuit board industry. Decis Support Syst 42:1254–1269

    Article  Google Scholar 

  11. Hadavandi E, Shavandi H, Ghanbari A (2011) An improved sales forecasting approach by the integration of genetic fuzzy systems and data clustering: case study of printed circuit board. Expert Syst Appl 38:9392–9399

    Article  Google Scholar 

  12. Sa-ngasoongsong A, Bukkapatnam STS, Kim J, Iyer PS, Suresh RP (2012) Multi-step sales forecasting in automotive industry based on structural relationship identification. Int J Prod Econ 140:875–887

    Article  Google Scholar 

  13. Dai W, Wu J-Y, Lu C-J (2014) Applying different independent component analysis algorithms and support vector regression for IT chain store sales forecasting. Sci World J. 2014:438132. doi:10.1155/2014/438132

    Google Scholar 

  14. Tay FEH, Cao LJ (2001) Improved financial time series forecasting by combining support vector machines with self-organizing feature map. Intell Data Anal 5(4):339–354

    MATH  Google Scholar 

  15. Cao LJ (2003) Support vector machines experts for time series forecasting. Neurocomputing 51:321–339

    Article  Google Scholar 

  16. Thomassey S, Fiordaliso A (2006) A hybrid sales forecasting system based on clustering and decision trees. Decis Support Syst 42(1):408–421

    Article  Google Scholar 

  17. Lai RK, Fan CY, Huang WH, Chang PC (2009) Evolving and clustering fuzzy decision tree for financial time series data forecasting. Expert Syst Appl 36:3761–3773

    Article  Google Scholar 

  18. Huang CL, Tsai CY (2009) A hybrid SOFM-SVR with a filter-based feature selection for stock market forecasting. Expert Syst Appl 36:1529–1539

    Article  Google Scholar 

  19. Badge J, Srivastava N (2010) Selection and forecasting of stock market patterns using K-mean clustering. Int J Stat Syst 5:23–27

    Google Scholar 

  20. Kumar M, Patel NR (2010) Using clustering to improve sales forecasts in retail merchandising. Ann Oper Res 174:33–46

    Article  MathSciNet  MATH  Google Scholar 

  21. Chang PC, Lai CY (2005) A hybrid system combining self-organizing maps with case-based reasoning in wholesaler’s new-release book forecasting. Expert Syst Appl 29:183–192

    Article  Google Scholar 

  22. Chang PC, Liu CH, Fan CF (2009) Data clustering and fuzzy neural network for sales forecasting: a case study in printed circuit board industry. Knowl Based Syst 22:344–355

    Article  Google Scholar 

  23. Lu CJ, Wang YW (2010) Combining independent component analysis and growing hierarchical self-organizing maps with support vector regression in product demand forecasting. Int J Prod Econ 128:603–613

    Article  Google Scholar 

  24. Murlidha V, Menezes B, Sathe M, Murlidhar G (2012) A clustering based forecast engine for retail sales. J Digit Inform Manag 10:219–230

    Google Scholar 

  25. Jain AK (2010) Data clustering: 50 years beyond K-means. Pattern Recogn Lett 31:651–666

    Article  Google Scholar 

  26. Vapnik VN (1999) An overview of statistical learning theory. IEEE Trans Neural Netw 10:988–999

    Article  Google Scholar 

  27. Vapnik VN (2000) The nature of statistical learning theory. Springer, Berlin

    Book  MATH  Google Scholar 

  28. Huang GB, Zhu QY, Siew CK (2006) Extreme learning machine: theory and applications. Neurocomputing 70:489–501

    Article  Google Scholar 

  29. Kao LJ, Chiu CC, Lu CJ, Chang CH (2013) A hybrid approach by integrating wavelet-based feature extraction with MARS and SVR for stock index forecasting. Decis Support Syst 54:1228–1244

    Article  Google Scholar 

  30. Lu CJ (2013) Hybridizing nonlinear independent component analysis and support vector regression with particle swarm optimization for stock index forecasting. Neural Comput Appl 23:2417–2427

    Article  Google Scholar 

  31. Bao Y, Xiong T, Hu Z (2014) Multi-step-ahead time series prediction using multiple-output support vector regression. Neurocomputing 129:482–493

    Article  Google Scholar 

  32. Xiong T, Bao Y, Hu Z (2014) Multiple-output support vector regression with a firefly algorithm for interval-valued stock price index forecasting. Knowl Based Syst 55:87–100

    Article  Google Scholar 

  33. Hong WC (2012) Application of seasonal SVR with chaotic immune algorithm in traffic flow forecasting. Neural Comput Appl 21:583–593

    Article  Google Scholar 

  34. Ju FY, Hong WC (2013) Application of seasonal SVR with chaotic gravitational search algorithm in electricity forecasting. Appl Math Model 37:9643–9651

    Article  MathSciNet  Google Scholar 

  35. Wu CL, Chau KW (2013) Prediction of rainfall time series using modular soft computing methods. Eng Appl Artif Intell 26:997–1007

    Article  Google Scholar 

  36. Sun ZL, Choi TM, Au KF, Yu Y (2008) Sales forecasting using extreme learning machine with applications in fashion retailing. Decis Support Syst 46:411–419

    Article  Google Scholar 

  37. Wong WK, Guo ZX (2010) A hybrid intelligent model for medium-term sales forecasting in fashion retail supply chains using extreme learning machine and harmony search algorithm. Int J Prod Econ 128:614–624

    Article  Google Scholar 

  38. Xia M, Zhang Y, Weng L, Ye X (2012) Fashion retailing forecasting based on extreme learning machine with adaptive metrics of inputs. Knowl Based Syst 36:253–259

    Article  Google Scholar 

  39. Kohonen T (1989) Self-organization and associative memory, 3rd edn. Springer, Berlin

    Book  MATH  Google Scholar 

  40. Palomo EJ, North J, Elizondo D, Luque RM, Watson T (2012) Application of growing hierarchical SOM for visualisation of network forensics traffic data. Neural Netw 32:275–284

    Article  Google Scholar 

  41. Dittenbach M, Rauber A, Merkl D (2002) Uncovering hierarchical structure in data using growing hierarchical self-organizing map. Neurocomputing 48:199–216

    Article  MATH  Google Scholar 

  42. Chattopadhyay M, Dan PK, Mazumdar S (2014) Comparison of visualization of optimal clustering using self-organizing map and growing hierarchical self-organizing map in cellular manufacturing system. Appl Soft Comput 22:528–543

    Article  Google Scholar 

  43. Johnson RA, Wichern DW (1992) Applied multivariate statistical analysis. Prentice hall, New Jersey

    MATH  Google Scholar 

  44. Cherkassky V, Ma Y (2004) Practical selection of SVM parameters and noise estimation for SVM regression. Neural Netw 17:113–126

    Article  MATH  Google Scholar 

  45. Lin CJ, Hsu CW, Chang CC (2003) A practical guide to support vector classification, Technical Report, Department of Computer Science and Information Engineering, National Taiwan University, Taipei

  46. Diebold FX, Mariano RS (1995) Comparing predictive accuracy. J Bus Econ Stat 13:253–263

    Google Scholar 

  47. Pollock AC, Macaulay A, Thomson ME, Onkal D (2005) Performance evaluation of judgemental directional exchange rate predictions. Int J Forecast 21:473–489

    Article  Google Scholar 

  48. Choi TM, Yu Y, Au KF (2011) A hybrid SARIMA wavelet transform method for sales forecasting. Decis Support Syst 51(1):130–140

    Article  Google Scholar 

Download references

Acknowledgments

This work is partially supported by the Ministry of Science and Technology of the Republic of China, Grant no. MOST 103-2221-E-231-003-MY2. The authors also gratefully acknowledge the helpful comments and suggestions of the reviewers, which have improved the presentation.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Chi-Jie Lu.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Chen, IF., Lu, CJ. Sales forecasting by combining clustering and machine-learning techniques for computer retailing. Neural Comput & Applic 28, 2633–2647 (2017). https://doi.org/10.1007/s00521-016-2215-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00521-016-2215-x

Keywords

Navigation