Abstract
Whenever data formats evolve, or data must be integrated from heterogeneous systems, we have to deal with different data formats. When such data are represented in XML for data exchange, applications cannot share them freely without data transformation. A common solution is to convert the entire XML data from their source format to the applications’ target formats using the transformation rules specified in XSLT stylesheets. In many cases, however, only a small part of the XML data, namely the part described by a user’s query (application), needs to be transformed. In this paper, we present an approach that optimizes the execution time of an XSLT stylesheet for answering a given XPath query. The approach modifies the XSLT stylesheet so that it (a) captures only those parts of the XML data that are relevant to the query and (b) processes only those XSLT instructions that are relevant to the query. We prove the correctness of our optimization approach, analyze its complexity, and present experimental results. The experimental results show that our approach achieves the best execution times, especially when many cost-intensive XSLT instructions can be excluded from the XSLT stylesheet.
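The idea of keeping only query-relevant instructions can be illustrated with a minimal sketch. It is a toy model under strong assumptions, not the algorithm of this paper: each template rule is reduced to the source element it matches, the single result element it emits, and the source elements it passes on to other templates, and the query is a plain child-axis path over result element names. The names `Template`, `prune`, `TEMPLATES`, and the sample element names are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Template:
    """Toy stand-in for one template rule (hypothetical model)."""
    match: str                                    # source element the rule matches
    emits: str                                    # the one result element it creates
    applies: list = field(default_factory=list)   # source elements handed to other rules


def prune(templates, root, query_steps):
    """Return the match names of the rules that can contribute to the
    result path `query_steps`, starting at the rule that matches `root`."""
    by_match = {t.match: t for t in templates}
    keep = set()

    def mark_subtree(match, seen):
        # Everything generated below the query target belongs to the answer.
        if match in seen or match not in by_match:
            return
        seen.add(match)
        keep.add(match)
        for child in by_match[match].applies:
            mark_subtree(child, seen)

    def visit(match, steps):
        rule = by_match.get(match)
        if rule is None or not steps or rule.emits != steps[0]:
            return False          # this rule cannot produce the next query step
        if len(steps) == 1:
            mark_subtree(match, set())
            return True
        hit = False
        for child in rule.applies:
            if visit(child, steps[1:]):
                hit = True
        if hit:
            keep.add(match)       # rule lies on a path to the query target
        return hit

    visit(root, query_steps)
    return keep


# Assumed sample stylesheet: library -> catalog, book -> item, and so on.
TEMPLATES = [
    Template("library", "catalog", ["book", "staff"]),
    Template("book", "item", ["name", "price"]),
    Template("name", "title"),
    Template("price", "cost"),
    Template("staff", "employees", ["person"]),
    Template("person", "employee"),
]

# Query on the transformed data: /catalog/item/title
kept = prune(TEMPLATES, "library", ["catalog", "item", "title"])
print(sorted(kept))  # ['book', 'library', 'name']
```

In this toy model the rules for `price`, `staff`, and `person` are dropped because nothing they emit lies on the queried path. Real XSLT stylesheets additionally involve predicates, further axes, modes, variables, and built-in templates, none of which this sketch covers.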
Cite this article
Groppe, S., Groppe, J., Böttcher, S. et al. Optimizing the execution of XSLT stylesheets for querying transformed XML data. Knowl Inf Syst 18, 331–391 (2009). https://doi.org/10.1007/s10115-008-0144-4
DOI: https://doi.org/10.1007/s10115-008-0144-4