Abstract
Techniques for identifying “banded patterns” in n-Dimensional (n-D) zero-one data, so called Banded Pattern Mining (BPM), are considered. Previous work directed at BPM has been in the context of 2-D data sets; the algorithms typically operated by considering permutations which meant that extension to n-D could not be easily realised. In the work presented in this paper banding is directed at the n-D context. Instead of considering large numbers of permutations the novel approach advocated in this paper is to determine banding scores associated with individual indexes in individual dimensions which can then be used to rearrange the indexes to achieve a “best” banding. Two variations of this approach are considered, an approximate approach (which provides for efficiency gains) and an exact approach.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Abdullahi, F.B., Coenen, F., Martin, R.: A novel approach for identifying banded patterns in zero-one data using column and row banding scores. In: Perner, P. (ed.) MLDM 2014. LNCS (LNAI), vol. 8556, pp. 58–72. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-08979-9_5
Abdullahi, F.B., Coenen, F., Martin, R.: A scalable algorithm for banded pattern mining in multi-dimensional zero-one data. In: Bellatreche, L., Mohania, M.K. (eds.) DaWaK 2014. LNCS, vol. 8646, pp. 345–356. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10160-6_31
Abdullahi, F.B., Coenen, F., Martin, R.: Finding banded patterns in big data using sampling. In: 2015 IEEE International Conference on Big Data (Big Data), pp. 2233–2242. IEEE (2015)
Abdullahi, F.B., Coenen, F., Martin, R.: Finding banded patterns in data: the banded pattern mining algorithm. In: Madria, S., Hara, T. (eds.) DaWaK 2015. LNCS, vol. 9263, pp. 95–107. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-22729-0_8
Abdullahi, F.B., Coenen, F., Martin, R.: Banded pattern mining algorithms in multi-dimensional zero-one data. In: Hameurlain, A., Küng, J., Wagner, R., Bellatreche, L., Mohania, M. (eds.) Transactions on Large-Scale Data- and Knowledge-Centered Systems XXVI. LNCS, vol. 9670, pp. 1–31. Springer, Heidelberg (2016). https://doi.org/10.1007/978-3-662-49784-5_1
Alizadeh, F., Karp, R.M., Newberg, L.A., Weisser, D.K.: Physical mapping of chromosomes: a combinatorial problem in molecular biology. Algorithmica 13(1–2), 52–76 (1995)
Atkins, J.E., Boman, E.G., Hendrickson, B.: Spectral algorithm for seriation and the consecutive ones problem. J. Comput. SIAM 28, 297–310 (1998)
Atkinson, K.E.: An Introduction to Numerical Analysis. John Wiley & Sons, New York (2008)
Aykanat, C., Pinar, A., Catalyurek, U.: Permuting sparse rectangular matrices into block-diagonal form. SIAM J. Sci. Comput. 25, 1860–1879 (2004)
Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval. Addison-Wesley, Reading (1999)
Bertin, J.: Graphics and Graphic Information Processing. Walter de Gruyter, New York (1981)
Bertin, J.: Graphics and graphic information processing. In: Readings in Information Visualization, pp. 62–65. Morgan Kaufmann Publishers Inc., (1999)
Blake, C.I., Merz, C.J.: UCI repository of machine learning databases (1998). www.ics.uci.edu/~mlearn/MLRepository.htm
Cheng, K.-Y.: Minimizing the bandwidth of sparse symmetric matrices. Computing 11(2), 103–110 (1973)
Cheng, K.-Y.: Note on minimizing the bandwidth of sparse, symmetric matrices. Computing 11(1), 27–30 (1973)
Cheng, Y., Church, G.M.: Biclustering of expression data. In: ISMB, vol. 8, pp. 93–103 (2000)
Deutsch, S.B., Martin, J.J.: An ordering algorithm for analysis of data arrays. Oper. Res. 19(6), 1350–1362 (1971)
Fortelius, M., Kai Puolamaki, M.F., Mannila, H.: Seriation in paleontological data using Markov chain monte carlo methods. PLoS Comput. Biol. 2, 2 (2006)
Garriga, G.C., Junttila, E., Mannila, H.: Banded structures in binary matrices. In: Proceedings Knowledge Discovery in Data Mining (KDD 2008), pp. 292–300 (2008)
Garriga, G.C., Junttila, E., Mannila, H.: Banded structure in binary matrices. Knowl. Inf. Syst. 28(1), 197–226 (2011)
Junttila, E.: Pattern in Permuted Binary Matrices. Ph.D thesis (2011)
Koebe, M., Knöchel, J.: On the block alignment problem. Elektronische Informationsverarbeitung und Kybernetik 26(7), 377–387 (1990)
Liiv, I.: Seriation and matrix reordering methods: an historical overview. Stat. Anal. Data Min. 3(2), 70–91 (2010)
Makinen, E., Siirtola, H.: The barycenter heuristic and the reorderable matrix. Informatica 29, 357–363 (2005)
Mäkinen, E., Siirtola, H.: Reordering the reorderable matrix as an algorithmic problem. In: Anderson, M., Cheng, P., Haarslev, V. (eds.) Diagrams 2000. LNCS (LNAI), vol. 1889, pp. 453–468. Springer, Heidelberg (2000). https://doi.org/10.1007/3-540-44590-0_37
Mannila, H., Terzi, E.: Nestedness and segmented nestedness. In: Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 480–489. ACM (2007)
Myllykangas, S., Himberg, J., Bohling, T., Nagy, B., Hollman, J., Knuutila, S.: DNA copy number amplification profiling of human neoplasms. Oncogene 25, 7324–7332 (2006)
Sugiyama, K., Tagawa, S., Toda, M.: Methods for visual understanding of hierarchical system structures. IEEE Trans. Syst. Man Cybern. 11, 109–125 (1981)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Abdullahi, F.B., Coenen, F. (2018). Multi-dimensional Banded Pattern Mining. In: Yoshida, K., Lee, M. (eds) Knowledge Management and Acquisition for Intelligent Systems. PKAW 2018. Lecture Notes in Computer Science(), vol 11016. Springer, Cham. https://doi.org/10.1007/978-3-319-97289-3_12
Download citation
DOI: https://doi.org/10.1007/978-3-319-97289-3_12
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-97288-6
Online ISBN: 978-3-319-97289-3
eBook Packages: Computer ScienceComputer Science (R0)