Skip to main content

XIO-SLCA: Optimize SLCA for Effective Keyword Search in XML Documents

  • Conference paper
  • 903 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7142))

Abstract

Keyword search has attracted a great deal of attention for retrieving XML data because it is a user-friendly mechanism. In this paper, we study the problem of effective keyword search over XML documents. The paper SLCA proposed that keyword search returns the set of smallest trees, where a tree is designated as smallest if it contains no sub-tree that also contains all keywords. The paper SLCA also provided detail description of the Indexed Lookup Eager algorithm (IL) to calculate SLCA. We analyzed and experimental studied the IL algorithm of SLCA deeply, find that there are 3 bugs which should not be disregarded. This paper investigates the problems to correct the existent 3 bugs of the algorithm IL, and proposes an optimize method called XIO-SLCA to improve keyword search quality. We have conducted an extensive experimental study and the experimental results show that our proposed approach XIO-SLCA achieves both higher recall and precise when compared with the existing proposal SLCA.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   54.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   69.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bao, Z., Ling, T.W., Chen, B., Lu, J.: Effective xml keyword search with relevance oriented ranking. In: Proceedings of the 2009 IEEE International Conference on Data Engineering, pp. 517–528 (2009)

    Google Scholar 

  2. Chen, Y., Wang, W., Liu, Z., Lin, X.: Keyword search on structured and semi-structured data. In: Proceedings of the 2009 ACM SIGMOD International Conference on Management of Data, pp. 1005–1010 (2009)

    Google Scholar 

  3. Liu, Z., Chen, Y.: Identifying meaningful return information for xml keyword search. In: Proceedings of the 2007 ACM SIGMOD International Conference on Management of Data, pp. 329–340 (2007)

    Google Scholar 

  4. Yu, X., Papakonstantinou, Y.: Efficient keyword search for smallest lcas in xml databases. In: Proceedings of the 2005 ACM SIGMOD International Conference on Management of Data, pp. 527–538. ACM (2005)

    Google Scholar 

  5. Shao, F., Guo, L., Botev, C., Bhaskar, A., et al.: Efficient keyword search over virtual xml views. The VLDB Journal 18(2), 543–570 (2009)

    Article  Google Scholar 

  6. Li, X., Li, Z., Wang, P., Chen, Q.: Xiof:finding xio for effective keyword search in xml documents. In: Proceedings of 2nd International Workshop on Intelligent Systems and Applications, pp. 99–104 (2010)

    Google Scholar 

  7. Li, Y., Yu, C., Jagadish, H.V.: Schema-free xquery. In: Proceedings of the Thirtieth International Conference on Very Large Data Bases, vol. 30, pp. 72–83 (2004)

    Google Scholar 

  8. Guo, L., Shao, F., Botev, C., Shanmugasundaram, J.: Xrank: ranked keyword search over xml documents. In: Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, pp. 16–27 (2003)

    Google Scholar 

  9. Li, G., Feng, J., Wang, J., Zhou, L.: Effective keyword search for valuable lcas over xml documents. In: Proceedings of the Sixteenth ACM Conference on Conference on Information and Knowledge Management, pp. 31–40 (2007)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Li, X., Li, Z., Wang, P., Chen, Q., Zhang, L., Li, N. (2012). XIO-SLCA: Optimize SLCA for Effective Keyword Search in XML Documents. In: Wang, L., Jiang, J., Lu, J., Hong, L., Liu, B. (eds) Web-Age Information Management. WAIM 2011. Lecture Notes in Computer Science, vol 7142. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28635-3_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-28635-3_13

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-28634-6

  • Online ISBN: 978-3-642-28635-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics