SINAI at CLEF Ad-Hoc Robust Track 2007: Applying Google Search Engine for Robust Cross-Lingual Retrieval

  • Conference paper
Advances in Multilingual and Multimodal Information Retrieval (CLEF 2007)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 5152)

Abstract

We report our web-based query generation experiments for the English and French collections in the Robust task of the CLEF Ad-Hoc track. We continued with the approach adopted the previous year, but with a modified model. Last year we used Google to expand the original query. This year we create a new expanded query in addition to the original one, so we retrieve two lists of relevant documents, one for each query (the original and the expanded one). To integrate the two lists, we apply a merging solution based on logistic regression. The overall results are discouraging, but the failure analysis shows that very difficult queries are improved by using both queries instead of the original query alone. The remaining problem is deciding when a query is very difficult.
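The merging step can be pictured with a minimal sketch. It is not the authors' implementation: it assumes a common form of logistic-regression merging in which each document's probability of relevance is estimated from the logarithm of its rank and its retrieval score within its own list, and the two lists are then re-sorted by that probability. The function names, the coefficient values, and the rule of keeping the higher estimate for a document that appears in both lists are all hypothetical choices made for illustration.

```python
import math
from typing import Dict, List, Tuple

# (alpha, beta_rank, beta_score): hypothetical coefficients. In practice they
# would be fitted on training queries with relevance judgments, typically one
# set per result list.
Coeffs = Tuple[float, float, float]
RankedList = List[Tuple[str, float]]  # (doc_id, retrieval score), best first


def relevance_probability(rank: int, score: float, coeffs: Coeffs) -> float:
    """Logistic estimate of relevance from ln(rank) and the raw score."""
    alpha, beta_rank, beta_score = coeffs
    z = alpha + beta_rank * math.log(rank) + beta_score * score
    return 1.0 / (1.0 + math.exp(-z))


def merge_lists(runs: Dict[str, RankedList],
                coeffs: Dict[str, Coeffs]) -> List[Tuple[str, float]]:
    """Merge several ranked lists into one, ordered by estimated probability.

    Assumption: a document found in more than one list keeps its highest
    estimate. Other combination rules (sum, average) are equally plausible.
    """
    best: Dict[str, float] = {}
    for name, ranked in runs.items():
        for rank, (doc_id, score) in enumerate(ranked, start=1):
            p = relevance_probability(rank, score, coeffs[name])
            best[doc_id] = max(p, best.get(doc_id, 0.0))
    # Sort by descending probability; ties are broken by document id.
    return sorted(best.items(), key=lambda item: (-item[1], item[0]))


if __name__ == "__main__":
    runs = {
        "original": [("d1", 2.0), ("d2", 1.0)],
        "expanded": [("d3", 3.0), ("d1", 0.5)],
    }
    coeffs = {"original": (0.0, -1.0, 1.0), "expanded": (0.0, -1.0, 1.0)}
    for doc_id, prob in merge_lists(runs, coeffs):
        print(f"{doc_id}\t{prob:.3f}")
```

Because the estimates are probabilities rather than raw scores, the lists for the original and the expanded query become comparable even if their score ranges differ, which is the motivation for this kind of merging.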

Editor information

Carol Peters, Valentin Jijkoun, Thomas Mandl, Henning Müller, Douglas W. Oard, Anselmo Peñas, Vivien Petras, Diana Santos

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Martínez-Santiago, F., Montejo-Ráez, A., García-Cumbreras, M.A. (2008). SINAI at CLEF Ad-Hoc Robust Track 2007: Applying Google Search Engine for Robust Cross-Lingual Retrieval. In: Peters, C., et al. Advances in Multilingual and Multimodal Information Retrieval. CLEF 2007. Lecture Notes in Computer Science, vol 5152. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85760-0_18

  • DOI: https://doi.org/10.1007/978-3-540-85760-0_18

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-85759-4

  • Online ISBN: 978-3-540-85760-0

  • eBook Packages: Computer Science, Computer Science (R0)
