skip to main content
10.1145/1141277.1141389acmconferencesArticle/Chapter ViewAbstractPublication PagessacConference Proceedingsconference-collections
Article

MTree: an XML XPath graph index

Published:23 April 2006Publication History

ABSTRACT

This paper introduces the MTree index algorithm, a special purpose XML XPath index designed to meet the needs of the hierarchical XPath query language. With the increasing importance of XML, XPath, and XQuery, several methods have been proposed for creating XML structure indexes and many variants using relational technology have been proposed. This work proposes a new XML structure index, called MTree, which is designed to be optimal for traversing all XPath axes. The primary feature of MTree lies in its ability to provide the next subtree root node in document order, for all axes, to each context node in O(1). MTree is a special purpose XPath index structure that matches the special purpose query requirements for XPath. This approach is in contrast to other approaches that map the problem domain into general purpose index structures such as B-Tree that must reconstruct the XML tree from those structures for every query. MTree supports modification operations such as insert and delete. MTree has been implemented both in memory and on disk, and performance results using XMark benchmark data are presented showing up to two orders of magnitude improvement over other well-known implementations.

References

  1. Anders Berglund, Scott Boag, Don Chamberlin, Mary F. Fernández, Michael Kay, Jonathan Robie, Jérôme Siméon. XML Path Language (XPath) 2.0 W3C Working Draft 29 October 2004, http://www.w3.org/TR/xpath20/Google ScholarGoogle Scholar
  2. Grust, T. Accelerating XPath Location Steps. ACM SIGMOD 2002, June 4--6, Madison, Wisconsin, USA Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Roberto Bayardo, Vanja Josifovski, Daniel Gruhl, Jussi Myllymaki. An Evaluation of Binary XML Encoding Optimizations for Fast Stream Based XML Processing. WWW2004, May 17--22, 2004, New York, New York, www.almaden.ibm.com/cs/people/bayardo/ps/www04.pdf Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Peter Buneman, Martin Grohe, Ghristoph Kock. Path Queries on Compressed XML. Proceedings of the 29th VLDB Conference, Berlin, Germany, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Raghav Kaushik, Philip Bohannon, Jeffery F. Naughton, Henry F. Korth. Covering Indexes for Branching Path Queries. ACM SIGMOD 2002, June 4--6, Madison, WI. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Rajasekar Krishnamurthy, Raghav Kaushik, and Jeffery F. Naughton. XML-to-SQL Query Translation Literature: The State of the Art and Open Problems, XML Symposium (XSym) 2003, pages 1--18 http://www.cs.wisc.edu/~sekar/publications.htmlGoogle ScholarGoogle ScholarCross RefCross Ref
  7. Mary Fernandez and Jerome Simeon. Growing XQuery. ECOOP 2003 - Object-Oriented Programming, 17th European Conference, Darmstadt, Germany, July 21--25, 2003, Proceedings, pages 405--430Google ScholarGoogle Scholar
  8. Xeres2 Java Parser http://xml.apache.org/xerces2-j/Google ScholarGoogle Scholar
  9. Sax Document Tracer Example http://xml.apache.org/xerces2-j/samples-sax.htmlGoogle ScholarGoogle Scholar
  10. Michael Stoner. Portable Performance Measurement Macros for Intel Architecture. http://www.intel.com/cd/ids/developer/asmona/eng/microprocessors/ia32/pentium4/optimization/19949.htm?page=1Google ScholarGoogle Scholar
  11. XMark---An XML Benchmark Project. http://monetdb.cwi.nl/xml/index.htGoogle ScholarGoogle Scholar
  12. Georg Gottlob, Christoph Koch, Reinhard Pickler. Efficient Algorithms for Processing XPath Queries. Proceedings of the 28th VLDB Conference, Hong Kong, China, 2002 Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Xalan-Java version 2.6.0. http://xml.apache.org/xalan-jGoogle ScholarGoogle Scholar
  14. Q. Li and B. Moon. Indexing and querying XML data for regular path expressions, In Proc. of VLDB 2001, Roma, Italy, September, 2001. http://citeseer.ist.psu.edu/li01indexing.html Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Saxon-B 8.1.1. http://saxon.sourceforge.net/Google ScholarGoogle Scholar
  16. S. Al-Khalifa, H. V. Jagadish, J. M. Patel, Y. Wu, N. Koudas, and D. Srivastava. Structural joins: a primitive for efficient XML query pattern matching, pages 141--152, ICDE, 2002Google ScholarGoogle ScholarCross RefCross Ref
  17. Dennis Shasha, Jason T. L. Wang, Rosalba Giugno. Algorithmics and applications of tree and graph searching. Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, Madison, Wisconsin, June 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Eclipse IDE. http://www.eclipse.org/Google ScholarGoogle Scholar
  19. Haixun Wang, Sanghyun Park, Wei Fan, Philip S. Yu. ViST: A Dynamic Index Method for Querying XML Data by Tree Structures. ACM SIGMOD 2003, June 9--12, San Diego, CA Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Jason McHugh, Jennifer Widom. Query Optimization for XML. Proceedings of the 25th VLDB Conference, Edinburgh, Scotland, 1999 Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Nicolas Bruno, Nick Koudas, Divesh Srivastava. Holistic Twig Joins: Optimal XML Pattern Matching. ACM SIGMOD 2002, June 4--6, Madison, Wisconsin, USA Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Raghav Kaushik, Rajasekar Krishnamurthy, Jeffery F. Naughton, Raghu Ramakrishnaan. On the Integration of Structure Indexes and Inverted Lists. ACM SIGMOD 2004 June 13--18, Paris France Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Amelie Marian and Jerome Simeon. Projecting XML Documents. Proceedings of the 29th VLDB Conference, Berlin, Germany, 2003 Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Zhiyuan Chen, Chen Li, Jian Pei, Yufei Tao, Haixun Wang Wei Wang, Jiong Yang, Jun Yang, Donghui Zhang. Recent Progress on Selected Topics in Database Research: A Report by Nine Young Chinese Researchers Working in the United States. Indexing XML by Tree Structures. Journal of Computer Science and Technology archive, Volume 18, Issue 5, September 2003, Pages: 538--552 Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Schwentick, Thomas. Xpath Query Containment. SIGMOD Record, Vol 33, No.1, March 2004 Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Hanyu Li, Mong Li Lee, Wynne Hsu, Chao Chen. An evaluation of XML indexes for structural join. ACM SIGMOD Record, Volume 33 Issue 3, September 2004 Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. MTree: an XML XPath graph index

              Recommendations

              Comments

              Login options

              Check if you have access through your login credentials or your institution to get full access on this article.

              Sign in
              • Published in

                cover image ACM Conferences
                SAC '06: Proceedings of the 2006 ACM symposium on Applied computing
                April 2006
                1967 pages
                ISBN:1595931082
                DOI:10.1145/1141277

                Copyright © 2006 ACM

                Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

                Publisher

                Association for Computing Machinery

                New York, NY, United States

                Publication History

                • Published: 23 April 2006

                Permissions

                Request permissions about this article.

                Request Permissions

                Check for updates

                Qualifiers

                • Article

                Acceptance Rates

                Overall Acceptance Rate1,650of6,669submissions,25%

              PDF Format

              View or Download as a PDF file.

              PDF

              eReader

              View online with eReader.

              eReader