Skip to main content

Vertical Fragmentation of XML Data Warehouses Using Frequent Path Sets

  • Conference paper
Book cover Data Warehousing and Knowledge Discovery (DaWaK 2011)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6862))

Included in the following conference series:

Abstract

Horizontal and vertical fragmentation have been intensively studied for relational and object databases and recently for XML data. However, little work has been done on XML warehouses. In this paper, we address the problem of vertical fragmentation of XML Warehouses. We use Association Rules to partition and cluster frequent path sets into fragments. In addition, at the schema level, we address and solve the problem of reconstructing the original non-fragmented schema to ensure the fragmentation reversibility. At the data level, we propose a data organization within fragments to optimize joint operations. Finally, we present implementation details and show the benefits of our approach over the non-fragmented schema.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Agrawal, R., Imielinski, T., Swami, A.N.: Mining association rules between sets of items in large databases. In: SIGMOD Conference, pp. 207–216 (1993)

    Google Scholar 

  2. de Aguiar Ciferri, C.D., Ciferri, R.R., Forlani, D.T., Traina, A.J.M., da Fonseca de Souza, F.: Horizontal fragmentation as a technique to improve the performance of drill-down and roll-up queries. In: Proceedings of the 2007 ACM Symposium on Applied Computing (SAC), Seoul, Korea, March 11-15, pp. 494–499. ACM, New York (2007)

    Chapter  Google Scholar 

  3. Almeida, R., Vieira, J., Vieira, M., Madeira, H., Bernardino, J.: Efficient data distribution for DWS. In: Song, I.-Y., Eder, J., Nguyen, T.M. (eds.) DaWaK 2008. LNCS, vol. 5182, pp. 75–86. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  4. Amossen, R.R.: Vertical partitioning of relational oltp databases using integer programming. In: Workshops Proceedings of the 26th International Conference on Data Engineering, ICDE 2010, March 1-6, pp. 93–98. IEEE, Los Alamitos (2010)

    Chapter  Google Scholar 

  5. Bonifati, A., Cuzzocrea, A.: Efficient fragmentation of large XML documents. In: Wagner, R., Revell, N., Pernul, G. (eds.) DEXA 2007. LNCS, vol. 4653, pp. 539–550. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  6. Boukraa, D., Messaoud, R.B., Boussaid, O.: Open and novel issues in XML database applications: future directions and advanced technologies. In: Modeling XML Warehouses for Complex Data: The New Issues, pp. 108–135. IGI Global, Information Science Reference, USA/UK (2009)

    Google Scholar 

  7. Bremer, J.-M., Gertz, M.: On distributing xml repositories. In: International Workshop on Web and Databases, San Diego, California, June 12-13, pp. 73–78 (2003)

    Google Scholar 

  8. Buneman, P., Choi, B., Fan, W., Hutchison, R., Mann, R., Viglas, S.: Vectorizing and querying large xml repositories. In: Proceedings of the 21st International Conference on Data Engineering, ICDE 2005, Tokyo, Japan, April 5-8, pp. 261–272. IEEE Computer Society, Los Alamitos (2005)

    Google Scholar 

  9. Golfarelli, M., Maio, D., Rizzi, S.: Applying vertical fragmentation techniques in logical design of multidimensional databases. In: Kambayashi, Y., Mohania, M., Tjoa, A.M. (eds.) DaWaK 2000. LNCS, vol. 1874, pp. 11–23. Springer, Heidelberg (2000)

    Chapter  Google Scholar 

  10. Gorla, N., Yan, B.P.W.: Vertical fragmentation in databases using data-mining technique. International Journal of Data Warehousing and Mining 4(3), 35–53 (2008)

    Article  Google Scholar 

  11. Grankov, M.V., Hung, N.T.: New objective function for vertical partitioning in database system. In: Proceedings of the SYRCODIS 2008 Colloquium on Databases and Information Systems, Saint-Petersburg, Russia, May 29-30. CEUR Workshop Proceedings, CEUR-WS.org, vol. 355 (2008)

    Google Scholar 

  12. Hartmann, S., Ma, H., Schewe, K.-D.: Cost-based vertical fragmentation for XML. In: Chang, K.C.C., Wang, W., Chen, L., Ellis, C.A., Hsu, C.-H., Tsoi, A.C., Wang, H. (eds.) APWeb/WAIM 2007. LNCS, vol. 4537, pp. 12–24. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  13. Lin, B., Hong, Y., Lee, Z.H.: Data Warehouse Performance. In: Encyclopedia of Data Warehousing and Mining, 2nd edn., pp. 580–585. IGI Publishing, Hershey (2009)

    Chapter  Google Scholar 

  14. Ma, H., Kirchberg, M.: Cost-based fragmentation for distributed complex value databases. In: Parent, C., Schewe, K.-D., Storey, V.C., Thalheim, B. (eds.) ER 2007. LNCS, vol. 4801, pp. 72–86. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  15. Ma, H., Schewe, K.D.: Fragmentation of xml documents. In: XVIII Simpósio Brasileiro de Bancos de Dados, 6-8 de Outubro, Manaus, Amazonas, Brasil, Anais/Proceedings, pp. 200–214. UFAM (2003)

    Google Scholar 

  16. Ma, H., Schewe, K.D., Kirchberg, M.: A heuristic approach to fragmentation incorporating query information. In: Databases and Information Systems IV - Selected Papers from the Seventh International Baltic Conference, DB & IS 2006, Frontiers in Artificial Intelligence and Applications, Vilnius, Lithuania, July 3-6, vol. 155, pp. 103–116. IOS Press, Amsterdam (2006)

    Google Scholar 

  17. Mahboubi, H., Darmont, J.: Data mining-based fragmentation of xml data warehouses. In: Song, I.Y., Abelló, A. (eds.) Proceedings of the ACM 11th International Workshop on Data Warehousing and OLAP, Napa Valley, California, USA, pp. 9–16. ACM, New York (2008)

    Google Scholar 

  18. Ozsu, M.T., Valduriez, P.: Principles of Distributed Database Systems, 3rd edn. Springer, Heidelberg (2011)

    Google Scholar 

  19. Ravat, F., Teste, O., Tournier, R., Zurfluh, G.: Finding an application-appropriate model for xml data warehouses. Inf. Syst. 35(6), 662–687 (2010)

    Article  Google Scholar 

  20. Rusu, L.I., Rahayu, J.W., Taniar, D.: Partitioning methods for multi-version xml data warehouses. Distributed and Parallel Databases 25(1-2), 47–69 (2009)

    Article  Google Scholar 

  21. Tan, K.L.: Distributed database design. In: Encyclopedia of Database Systems, pp. 890–894. Springer, US (2009)

    Google Scholar 

  22. Wu, M.C., Buchmann, A.P.: Research issues in data warehousing. In: BTW, pp. 61–82 (1997)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Boukraâ, D., Boussaïd, O., Bentayeb, F. (2011). Vertical Fragmentation of XML Data Warehouses Using Frequent Path Sets. In: Cuzzocrea, A., Dayal, U. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2011. Lecture Notes in Computer Science, vol 6862. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23544-3_15

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-23544-3_15

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-23543-6

  • Online ISBN: 978-3-642-23544-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics