Skip to main content

Finding Similar Time Series in Sales Transaction Data

  • Conference paper
  • First Online:
Current Approaches in Applied Artificial Intelligence (IEA/AIE 2015)

Abstract

This paper studies the problem of finding similar time series of product sales in transactional data. We argue that finding such similar time series can lead to discovery of interesting and actionable business information such as previously unknown complementary products or substitutes, and hidden supply chain information. However, finding all possible pairs of n time series exhaustively results in O(n 2) time complexity. To address this issue, we propose using k-means clustering method to create small clusters of similar time series, and those clusters with very small intra-cluster variability are used to find similar time series. Finally, we demonstrate the utility of our approach to derive interesting results from real-life data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Agrawal, R., Srikant, R.: Fast algorithms for mining association rules. In: Proceedings of the International Conference on Very Large Data Bases, pp. 487–499 (1994)

    Google Scholar 

  2. Charlet, L., Kumar, A.: Market Basket Analysis for a Supermarket based on Frequent Itemset Mining. International Journal of Computer Science Issues. 9(5), 257–264 (2012)

    Google Scholar 

  3. Chen, X., Petrounias, I.: Discovering Temporal Association Rules: Algorithms, Language and System. In: Proceedings of the 16th International Conference on Data Engineering, pp. 306–306 (2000)

    Google Scholar 

  4. Hahsler, M., Chelluboina, S.: Visualizing association rules in hierarchical groups. Presented at the 42nd Symposium on the Interface: Statistical, Machine Learning, and Visualization Algorithms (Interface 2011). The Interface Foundation of North America, June 2011 (unpublished)

    Google Scholar 

  5. Kim, H.K., Kim, J.K., Chen, Q.Y.: A Product Network Analysis for Extending the Market Basket Analysis. Expert Systems with Applications 39(8), 7403–7410 (2012)

    Article  MathSciNet  Google Scholar 

  6. MacQueen, J.B.: Some Methods for classification and Analysis of Multivariate Observations. In: Proceedings of 5th Berkeley Symposium on Mathematical Statistics and Probability 1, pp. 281–297. University of California Press (1967)

    Google Scholar 

  7. Mafruz, Z.A., David, T., Kate, S.: Redundant association rules reduction techniques. International Journal Business Intelligent Data Mining 2(1), 29–63 (2007)

    Article  Google Scholar 

  8. Manchanda, P., Ansari, A.: Gupta, S: The ``shopping basket”: A model for multicategory purchase incidence decisions. Marketing Science 18(2), 95–114 (1999)

    Article  Google Scholar 

  9. Minaei-Bidgoli, B., Barmaki, R., Nasiri, M.: Mining numerical association rules via multi-objective genetic algorithms. Information Sciences 233, 15–24 (2013)

    Google Scholar 

  10. Palshikar, G., Kale, M., Apte, M.: Association Rules Mining Using Heavy Itemsets. Data & Knowledge Engineering 61(1), 93–113 (2007)

    Article  Google Scholar 

  11. Tong, Q., Yan, B., Zhou, Y.: Mining Quantitative Association Rules on Overlapped Intervals. In: Li, X., Wang, S., Dong, Z.Y. (eds.) ADMA 2005. LNCS (LNAI), vol. 3584, pp. 43–50. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  12. Qin, L.X., Shi, Z.Z.: Efficiently Mining Association Rules from Time Series. International Journal of Information Technology. 12(4), 30–38 (2006)

    Google Scholar 

  13. Salleb-Aouissi, A., Vrain, C., Nortet, C., Kong, X., Rathod, V., Cassard, D.: QuantMiner for Mining Quantitative Association Rules. Journal of Machine Learning Research 14, 3153–3157 (2013)

    MATH  Google Scholar 

  14. Srikant, R., Agrawal, R.: Mining Quantitative Association Rules in Large Relational Tables. In: Proceedings of the 1996 ACM SIGMOD International Conference on Management of Data, pp. 1–12 (1996)

    Google Scholar 

  15. Lau, P.S.: Time Series Clustering for Market Basket Analysis. Analytics Project Report. SIM University (2013)

    Google Scholar 

  16. Tan, S.C, Lau, P.S.: Time Series Clustering: A Superior Alternative for Market Basket Analysis. In: Herawan, T., Deris, M.M., Abawajy, J. (eds.) Proceedings of the First International Conference on Advanced Data and Information Engineering. LNEE(LNCS), vol. 285, pp. 241–248. Springer, Singapore (2013)

    Google Scholar 

  17. Vindevogel, B., Poel, D., Wets, G.: Why promotion strategies based on market basket analysis do not work. Expert Systems with Applications 28(3), 583–590 (2005)

    Article  Google Scholar 

  18. Webb, G.: Discovering associations with numeric variables. In: Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2001), pp. 383–388. ACM, New York (2001)

    Google Scholar 

  19. Webb, G.: Discovering Significant Patterns. Machine Learning 68(1), 1–33 (2007)

    Google Scholar 

  20. Xu, Y., Li, Y., Shaw, G.: Reliable representations for association rules. Data & Knowledge Engineering 70(6), 555–575 (2011)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Swee Chuan Tan .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Tan, S.C., Lau, P.S., Yu, X. (2015). Finding Similar Time Series in Sales Transaction Data. In: Ali, M., Kwon, Y., Lee, CH., Kim, J., Kim, Y. (eds) Current Approaches in Applied Artificial Intelligence. IEA/AIE 2015. Lecture Notes in Computer Science(), vol 9101. Springer, Cham. https://doi.org/10.1007/978-3-319-19066-2_62

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-19066-2_62

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-19065-5

  • Online ISBN: 978-3-319-19066-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics