Abstract
Horizontal and vertical fragmentation have been intensively studied for relational and object databases and recently for XML data. However, little work has been done on XML warehouses. In this paper, we address the problem of vertical fragmentation of XML Warehouses. We use Association Rules to partition and cluster frequent path sets into fragments. In addition, at the schema level, we address and solve the problem of reconstructing the original non-fragmented schema to ensure the fragmentation reversibility. At the data level, we propose a data organization within fragments to optimize joint operations. Finally, we present implementation details and show the benefits of our approach over the non-fragmented schema.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Agrawal, R., Imielinski, T., Swami, A.N.: Mining association rules between sets of items in large databases. In: SIGMOD Conference, pp. 207–216 (1993)
de Aguiar Ciferri, C.D., Ciferri, R.R., Forlani, D.T., Traina, A.J.M., da Fonseca de Souza, F.: Horizontal fragmentation as a technique to improve the performance of drill-down and roll-up queries. In: Proceedings of the 2007 ACM Symposium on Applied Computing (SAC), Seoul, Korea, March 11-15, pp. 494–499. ACM, New York (2007)
Almeida, R., Vieira, J., Vieira, M., Madeira, H., Bernardino, J.: Efficient data distribution for DWS. In: Song, I.-Y., Eder, J., Nguyen, T.M. (eds.) DaWaK 2008. LNCS, vol. 5182, pp. 75–86. Springer, Heidelberg (2008)
Amossen, R.R.: Vertical partitioning of relational oltp databases using integer programming. In: Workshops Proceedings of the 26th International Conference on Data Engineering, ICDE 2010, March 1-6, pp. 93–98. IEEE, Los Alamitos (2010)
Bonifati, A., Cuzzocrea, A.: Efficient fragmentation of large XML documents. In: Wagner, R., Revell, N., Pernul, G. (eds.) DEXA 2007. LNCS, vol. 4653, pp. 539–550. Springer, Heidelberg (2007)
Boukraa, D., Messaoud, R.B., Boussaid, O.: Open and novel issues in XML database applications: future directions and advanced technologies. In: Modeling XML Warehouses for Complex Data: The New Issues, pp. 108–135. IGI Global, Information Science Reference, USA/UK (2009)
Bremer, J.-M., Gertz, M.: On distributing xml repositories. In: International Workshop on Web and Databases, San Diego, California, June 12-13, pp. 73–78 (2003)
Buneman, P., Choi, B., Fan, W., Hutchison, R., Mann, R., Viglas, S.: Vectorizing and querying large xml repositories. In: Proceedings of the 21st International Conference on Data Engineering, ICDE 2005, Tokyo, Japan, April 5-8, pp. 261–272. IEEE Computer Society, Los Alamitos (2005)
Golfarelli, M., Maio, D., Rizzi, S.: Applying vertical fragmentation techniques in logical design of multidimensional databases. In: Kambayashi, Y., Mohania, M., Tjoa, A.M. (eds.) DaWaK 2000. LNCS, vol. 1874, pp. 11–23. Springer, Heidelberg (2000)
Gorla, N., Yan, B.P.W.: Vertical fragmentation in databases using data-mining technique. International Journal of Data Warehousing and Mining 4(3), 35–53 (2008)
Grankov, M.V., Hung, N.T.: New objective function for vertical partitioning in database system. In: Proceedings of the SYRCODIS 2008 Colloquium on Databases and Information Systems, Saint-Petersburg, Russia, May 29-30. CEUR Workshop Proceedings, CEUR-WS.org, vol. 355 (2008)
Hartmann, S., Ma, H., Schewe, K.-D.: Cost-based vertical fragmentation for XML. In: Chang, K.C.C., Wang, W., Chen, L., Ellis, C.A., Hsu, C.-H., Tsoi, A.C., Wang, H. (eds.) APWeb/WAIM 2007. LNCS, vol. 4537, pp. 12–24. Springer, Heidelberg (2007)
Lin, B., Hong, Y., Lee, Z.H.: Data Warehouse Performance. In: Encyclopedia of Data Warehousing and Mining, 2nd edn., pp. 580–585. IGI Publishing, Hershey (2009)
Ma, H., Kirchberg, M.: Cost-based fragmentation for distributed complex value databases. In: Parent, C., Schewe, K.-D., Storey, V.C., Thalheim, B. (eds.) ER 2007. LNCS, vol. 4801, pp. 72–86. Springer, Heidelberg (2007)
Ma, H., Schewe, K.D.: Fragmentation of xml documents. In: XVIII Simpósio Brasileiro de Bancos de Dados, 6-8 de Outubro, Manaus, Amazonas, Brasil, Anais/Proceedings, pp. 200–214. UFAM (2003)
Ma, H., Schewe, K.D., Kirchberg, M.: A heuristic approach to fragmentation incorporating query information. In: Databases and Information Systems IV - Selected Papers from the Seventh International Baltic Conference, DB & IS 2006, Frontiers in Artificial Intelligence and Applications, Vilnius, Lithuania, July 3-6, vol. 155, pp. 103–116. IOS Press, Amsterdam (2006)
Mahboubi, H., Darmont, J.: Data mining-based fragmentation of xml data warehouses. In: Song, I.Y., Abelló, A. (eds.) Proceedings of the ACM 11th International Workshop on Data Warehousing and OLAP, Napa Valley, California, USA, pp. 9–16. ACM, New York (2008)
Ozsu, M.T., Valduriez, P.: Principles of Distributed Database Systems, 3rd edn. Springer, Heidelberg (2011)
Ravat, F., Teste, O., Tournier, R., Zurfluh, G.: Finding an application-appropriate model for xml data warehouses. Inf. Syst. 35(6), 662–687 (2010)
Rusu, L.I., Rahayu, J.W., Taniar, D.: Partitioning methods for multi-version xml data warehouses. Distributed and Parallel Databases 25(1-2), 47–69 (2009)
Tan, K.L.: Distributed database design. In: Encyclopedia of Database Systems, pp. 890–894. Springer, US (2009)
Wu, M.C., Buchmann, A.P.: Research issues in data warehousing. In: BTW, pp. 61–82 (1997)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Boukraâ, D., Boussaïd, O., Bentayeb, F. (2011). Vertical Fragmentation of XML Data Warehouses Using Frequent Path Sets. In: Cuzzocrea, A., Dayal, U. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2011. Lecture Notes in Computer Science, vol 6862. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23544-3_15
Download citation
DOI: https://doi.org/10.1007/978-3-642-23544-3_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23543-6
Online ISBN: 978-3-642-23544-3
eBook Packages: Computer ScienceComputer Science (R0)