Abstract
Keyword search has attracted a great deal of attention for retrieving XML data because it is a user-friendly mechanism. In this paper, we study the problem of effective keyword search over XML documents. The paper SLCA proposed that keyword search returns the set of smallest trees, where a tree is designated as smallest if it contains no sub-tree that also contains all keywords. The paper SLCA also provided detail description of the Indexed Lookup Eager algorithm (IL) to calculate SLCA. We analyzed and experimental studied the IL algorithm of SLCA deeply, find that there are 3 bugs which should not be disregarded. This paper investigates the problems to correct the existent 3 bugs of the algorithm IL, and proposes an optimize method called XIO-SLCA to improve keyword search quality. We have conducted an extensive experimental study and the experimental results show that our proposed approach XIO-SLCA achieves both higher recall and precise when compared with the existing proposal SLCA.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Bao, Z., Ling, T.W., Chen, B., Lu, J.: Effective xml keyword search with relevance oriented ranking. In: Proceedings of the 2009 IEEE International Conference on Data Engineering, pp. 517–528 (2009)
Chen, Y., Wang, W., Liu, Z., Lin, X.: Keyword search on structured and semi-structured data. In: Proceedings of the 2009 ACM SIGMOD International Conference on Management of Data, pp. 1005–1010 (2009)
Liu, Z., Chen, Y.: Identifying meaningful return information for xml keyword search. In: Proceedings of the 2007 ACM SIGMOD International Conference on Management of Data, pp. 329–340 (2007)
Yu, X., Papakonstantinou, Y.: Efficient keyword search for smallest lcas in xml databases. In: Proceedings of the 2005 ACM SIGMOD International Conference on Management of Data, pp. 527–538. ACM (2005)
Shao, F., Guo, L., Botev, C., Bhaskar, A., et al.: Efficient keyword search over virtual xml views. The VLDB Journal 18(2), 543–570 (2009)
Li, X., Li, Z., Wang, P., Chen, Q.: Xiof:finding xio for effective keyword search in xml documents. In: Proceedings of 2nd International Workshop on Intelligent Systems and Applications, pp. 99–104 (2010)
Li, Y., Yu, C., Jagadish, H.V.: Schema-free xquery. In: Proceedings of the Thirtieth International Conference on Very Large Data Bases, vol. 30, pp. 72–83 (2004)
Guo, L., Shao, F., Botev, C., Shanmugasundaram, J.: Xrank: ranked keyword search over xml documents. In: Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, pp. 16–27 (2003)
Li, G., Feng, J., Wang, J., Zhou, L.: Effective keyword search for valuable lcas over xml documents. In: Proceedings of the Sixteenth ACM Conference on Conference on Information and Knowledge Management, pp. 31–40 (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Li, X., Li, Z., Wang, P., Chen, Q., Zhang, L., Li, N. (2012). XIO-SLCA: Optimize SLCA for Effective Keyword Search in XML Documents. In: Wang, L., Jiang, J., Lu, J., Hong, L., Liu, B. (eds) Web-Age Information Management. WAIM 2011. Lecture Notes in Computer Science, vol 7142. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28635-3_13
Download citation
DOI: https://doi.org/10.1007/978-3-642-28635-3_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-28634-6
Online ISBN: 978-3-642-28635-3
eBook Packages: Computer ScienceComputer Science (R0)