skip to main content
10.1145/3170521.3170533acmotherconferencesArticle/Chapter ViewAbstractPublication Pagesworkshops-icdcnConference Proceedingsconference-collections
research-article

Pattern mining based compression of IoT data

Published:04 January 2018Publication History

ABSTRACT

The increasing proliferation of the Internet of Things (IoT) devices and systems result in large amounts of highly heterogeneous data to be collected. Although at least some of the collected sensor data is often consumed by the real-time decision making and control of the IoT system, that is not the only use of such data. Invariably, the collected data is stored, perhaps in some filtered or downselected fashion, so that it can be used for a variety of lower-frequency operations. It is expected that in a smart city environment with numerous IoT deployments, the volume of such data can become enormous. Therefore, mechanisms for lossy data compression that provide a trade-off between compression ratio and data usefulness for offline statistical analysis becomes necessary. In this paper, we discuss several simple pattern mining based compression strategies for multi-attribute IoT data streams. For each method, we evaluate the compressibility of the method vs. the level of similarity between original and compressed time series in the context of the home energy management system.

References

  1. Eleanor Ainy et al. 2015. Approximated Summarization of Data Provenance. In ACM CIKM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Muhammad Naveed Aman et al. 2017. Secure Data Provenance for the Internet of Things. In ACM IoTPTS. 11--14. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Khaled Bachour et al. 2015. Provenance for the People: An HCI Perspective on the W3C PROV Standard Through an Online Game. In ACM CHI. 2437--2446. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Sean Barker et al. 2012. Smart*: An open data set and tools for enabling research in sustainable homes. (2012).Google ScholarGoogle Scholar
  5. Sabine Bauer et al. 2013. Data provenance in the Internet of things. In Conference Seminar SS.Google ScholarGoogle Scholar
  6. Kaushik Chakrabarti et al. 2002. Locally Adaptive Dimensionality Reduction for Indexing Large Time Series Databases. ACM Trans. Database Syst. 27, 2 (2002), 188--228. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Christos Faloutsos et al. 1994. Fast Subsequence Matching in Time-series Databases. In ACM SIGMOD. 419--429. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Sorabh Gandhi et al. 2010. Space-efficient online approximation of time series data: Streams, amnesia, and out-of-order. In IEEE ICDE. 924--935.Google ScholarGoogle Scholar
  9. Jayavardhana Gubbi et al. 2013. Internet of Things (IoT): A vision, architectural elements, and future directions. Future generation computer systems 29, 7 (2013), 1645--1660. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Luc Moreau et al. 2008. The Open Provenance Model: An Overview. 323--326.Google ScholarGoogle Scholar
  11. Themistoklis Palpanas et al. 2004. Online Amnesic Approximation of Streaming Time Series. In IEEE ICDE. 339--349. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Ivan Popivanov et al. 2002. Similarity Search Over Time-Series Data Using Wavelets. In IEEE ICDE. 212--221. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Karen Rose et al. 2015. The internet of things: An overview. The Internet Society (2015), 1--50.Google ScholarGoogle Scholar
  14. Joan Serra et al. 2014. An empirical evaluation of similarity measures for time series classification. Knowledge-Based Systems 67 (2014), 305--314. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Salmin Sultana et al. 2015. A distributed system for the management of fine-grained provenance. Journal of Database Management 26, 2 (2015), 32--47. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Nikolaj Tatti et al. 2012. The long and the short of it: summarising event sequences with serial episodes. In ACM KDD. 462--470. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Rob van der Meulen. 2017. Gartner Says 8.4 Billion Connected âĂŸThingsâĂŹ Will Be in Use in 2017, Up 31 Percent From 2016. (2017).Google ScholarGoogle Scholar
  18. Yulai Xie et al. 2011. Compressing Provenance Graphs.. In TaPP.Google ScholarGoogle Scholar
  19. Yulai Xie et al. 2013. Evaluation of a hybrid approach for efficient provenance storage. ACM Transactions on Storage 9, 4 (2013), 14. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Byoung-Kee Yi et al. 2000. Fast Time Sequence Indexing for Arbitrary Lp Norms. In VLDB. 385--394. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Pattern mining based compression of IoT data

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      Workshops ICDCN '18: Proceedings of the Workshop Program of the 19th International Conference on Distributed Computing and Networking
      January 2018
      151 pages
      ISBN:9781450363976
      DOI:10.1145/3170521
      • Conference Chair:
      • Doina Bein

      Copyright © 2018 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 4 January 2018

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader