Skip to main content

Multi-dimensional Banded Pattern Mining

  • Conference paper
  • First Online:
Knowledge Management and Acquisition for Intelligent Systems (PKAW 2018)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11016))

Included in the following conference series:

  • 1524 Accesses

Abstract

Techniques for identifying “banded patterns” in n-Dimensional (n-D) zero-one data, so called Banded Pattern Mining (BPM), are considered. Previous work directed at BPM has been in the context of 2-D data sets; the algorithms typically operated by considering permutations which meant that extension to n-D could not be easily realised. In the work presented in this paper banding is directed at the n-D context. Instead of considering large numbers of permutations the novel approach advocated in this paper is to determine banding scores associated with individual indexes in individual dimensions which can then be used to rearrange the indexes to achieve a “best” banding. Two variations of this approach are considered, an approximate approach (which provides for efficiency gains) and an exact approach.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Abdullahi, F.B., Coenen, F., Martin, R.: A novel approach for identifying banded patterns in zero-one data using column and row banding scores. In: Perner, P. (ed.) MLDM 2014. LNCS (LNAI), vol. 8556, pp. 58–72. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-08979-9_5

    Chapter  Google Scholar 

  2. Abdullahi, F.B., Coenen, F., Martin, R.: A scalable algorithm for banded pattern mining in multi-dimensional zero-one data. In: Bellatreche, L., Mohania, M.K. (eds.) DaWaK 2014. LNCS, vol. 8646, pp. 345–356. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10160-6_31

    Chapter  Google Scholar 

  3. Abdullahi, F.B., Coenen, F., Martin, R.: Finding banded patterns in big data using sampling. In: 2015 IEEE International Conference on Big Data (Big Data), pp. 2233–2242. IEEE (2015)

    Google Scholar 

  4. Abdullahi, F.B., Coenen, F., Martin, R.: Finding banded patterns in data: the banded pattern mining algorithm. In: Madria, S., Hara, T. (eds.) DaWaK 2015. LNCS, vol. 9263, pp. 95–107. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-22729-0_8

    Chapter  Google Scholar 

  5. Abdullahi, F.B., Coenen, F., Martin, R.: Banded pattern mining algorithms in multi-dimensional zero-one data. In: Hameurlain, A., Küng, J., Wagner, R., Bellatreche, L., Mohania, M. (eds.) Transactions on Large-Scale Data- and Knowledge-Centered Systems XXVI. LNCS, vol. 9670, pp. 1–31. Springer, Heidelberg (2016). https://doi.org/10.1007/978-3-662-49784-5_1

    Chapter  Google Scholar 

  6. Alizadeh, F., Karp, R.M., Newberg, L.A., Weisser, D.K.: Physical mapping of chromosomes: a combinatorial problem in molecular biology. Algorithmica 13(1–2), 52–76 (1995)

    Article  MathSciNet  Google Scholar 

  7. Atkins, J.E., Boman, E.G., Hendrickson, B.: Spectral algorithm for seriation and the consecutive ones problem. J. Comput. SIAM 28, 297–310 (1998)

    Article  MathSciNet  Google Scholar 

  8. Atkinson, K.E.: An Introduction to Numerical Analysis. John Wiley & Sons, New York (2008)

    Google Scholar 

  9. Aykanat, C., Pinar, A., Catalyurek, U.: Permuting sparse rectangular matrices into block-diagonal form. SIAM J. Sci. Comput. 25, 1860–1879 (2004)

    Article  MathSciNet  Google Scholar 

  10. Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval. Addison-Wesley, Reading (1999)

    Google Scholar 

  11. Bertin, J.: Graphics and Graphic Information Processing. Walter de Gruyter, New York (1981)

    Book  Google Scholar 

  12. Bertin, J.: Graphics and graphic information processing. In: Readings in Information Visualization, pp. 62–65. Morgan Kaufmann Publishers Inc., (1999)

    Google Scholar 

  13. Blake, C.I., Merz, C.J.: UCI repository of machine learning databases (1998). www.ics.uci.edu/~mlearn/MLRepository.htm

  14. Cheng, K.-Y.: Minimizing the bandwidth of sparse symmetric matrices. Computing 11(2), 103–110 (1973)

    Article  MathSciNet  Google Scholar 

  15. Cheng, K.-Y.: Note on minimizing the bandwidth of sparse, symmetric matrices. Computing 11(1), 27–30 (1973)

    Article  MathSciNet  Google Scholar 

  16. Cheng, Y., Church, G.M.: Biclustering of expression data. In: ISMB, vol. 8, pp. 93–103 (2000)

    Google Scholar 

  17. Deutsch, S.B., Martin, J.J.: An ordering algorithm for analysis of data arrays. Oper. Res. 19(6), 1350–1362 (1971)

    Article  Google Scholar 

  18. Fortelius, M., Kai Puolamaki, M.F., Mannila, H.: Seriation in paleontological data using Markov chain monte carlo methods. PLoS Comput. Biol. 2, 2 (2006)

    Article  Google Scholar 

  19. Garriga, G.C., Junttila, E., Mannila, H.: Banded structures in binary matrices. In: Proceedings Knowledge Discovery in Data Mining (KDD 2008), pp. 292–300 (2008)

    Google Scholar 

  20. Garriga, G.C., Junttila, E., Mannila, H.: Banded structure in binary matrices. Knowl. Inf. Syst. 28(1), 197–226 (2011)

    Article  Google Scholar 

  21. Junttila, E.: Pattern in Permuted Binary Matrices. Ph.D thesis (2011)

    Google Scholar 

  22. Koebe, M., Knöchel, J.: On the block alignment problem. Elektronische Informationsverarbeitung und Kybernetik 26(7), 377–387 (1990)

    MATH  Google Scholar 

  23. Liiv, I.: Seriation and matrix reordering methods: an historical overview. Stat. Anal. Data Min. 3(2), 70–91 (2010)

    MathSciNet  Google Scholar 

  24. Makinen, E., Siirtola, H.: The barycenter heuristic and the reorderable matrix. Informatica 29, 357–363 (2005)

    MathSciNet  MATH  Google Scholar 

  25. Mäkinen, E., Siirtola, H.: Reordering the reorderable matrix as an algorithmic problem. In: Anderson, M., Cheng, P., Haarslev, V. (eds.) Diagrams 2000. LNCS (LNAI), vol. 1889, pp. 453–468. Springer, Heidelberg (2000). https://doi.org/10.1007/3-540-44590-0_37

    Chapter  Google Scholar 

  26. Mannila, H., Terzi, E.: Nestedness and segmented nestedness. In: Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 480–489. ACM (2007)

    Google Scholar 

  27. Myllykangas, S., Himberg, J., Bohling, T., Nagy, B., Hollman, J., Knuutila, S.: DNA copy number amplification profiling of human neoplasms. Oncogene 25, 7324–7332 (2006)

    Article  Google Scholar 

  28. Sugiyama, K., Tagawa, S., Toda, M.: Methods for visual understanding of hierarchical system structures. IEEE Trans. Syst. Man Cybern. 11, 109–125 (1981)

    Article  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Frans Coenen .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Abdullahi, F.B., Coenen, F. (2018). Multi-dimensional Banded Pattern Mining. In: Yoshida, K., Lee, M. (eds) Knowledge Management and Acquisition for Intelligent Systems. PKAW 2018. Lecture Notes in Computer Science(), vol 11016. Springer, Cham. https://doi.org/10.1007/978-3-319-97289-3_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-97289-3_12

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-97288-6

  • Online ISBN: 978-3-319-97289-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics