ABSTRACT
Query reformulation modifies the original query with the aim of better matching the vocabulary of the relevant documents, and consequently improving ranking effectiveness. Previous techniques typically generate words and phrases related to the original query, but do not consider how these words and phrases would fit together in new queries. In this paper, we focus on an implementation of an approach that models reformulation as a distribution of queries, where each query is a variation of the original query. This approach considers a query as a basic unit and can capture important dependencies between words and phrases in the query. The implementation discussed here is based on passage analysis of the target corpus. Experiments on the TREC collection show that the proposed model for query reformulation significantly outperforms state-of-the-art methods.
- M. Bendersky, D. A. Smith, and W. B. Croft. Two-stage query segmentation for information retrieval. In SIGIR'09, pages 810--811, Boston, MA, 2009. Google ScholarDigital Library
- S. Bergsma and Q. I. Wang. Learning noun phrase query segmentation. In EMNLP-CoNLL07, pages 819--826, Prague, 2007.Google Scholar
- G. Cao, J. Y. Nie, J. Gao, and S. Robertson. Selecting good expansion terms for pseudo-relevance feedback. In SIGIR'08, pages 243--250, Singapore, 2008. Google ScholarDigital Library
- V. Dang and W. B. Croft. Query reformulation using anchor text. In WSDM'10, New York, NY, 2010. Google ScholarDigital Library
- J. Guo, G. Xu, H. Li, and X. Cheng. A unified and discriminative model for query refinement. In SIGIR'08, pages 379--386, Singapore, 2008. Google ScholarDigital Library
- R. Jones, B. Rey, O. Madani, and W. Greiner. Generating query substitutions. In WWW'06, pages 387--396, Ediburgh, Scotland, 2006. Google ScholarDigital Library
- V. Lavrenko and W. B. Croft. Relevance based language models. In SIGIR'01, pages 120--127, New Orleans, LA, 2001. Google ScholarDigital Library
- X. Liu and W. B. Croft. Passage retrieval based on language models. In CIKM'02, pages 375--382, McLean, VA, 2002. Google ScholarDigital Library
- D. Metzler and W. B. Croft. A markov random field model for term dependencies. In SIGIR'05, pages 472--479, Salvador,Brazil, 2005. Google ScholarDigital Library
- D. Metzler and W. B. Croft. Latent concept expansion using markov random fields. In SIGIR'07, pages 311--318, Amsterdam, the Netherlands, 2007. Google ScholarDigital Library
- G. Mishne and M. de Rijke. Boosting web retrieval through query operations. In ECIR'05, pages 502--516, Spain, 2005. Google ScholarDigital Library
- F. Peng, N. Ahmed, X. Li, and Y. Lu. Context sensitive stemming for web search. In SIGIR'07, pages 639--646, Amsterdam, the Netherlands, 2007. Google ScholarDigital Library
- J. M. Ponte and W. B. Croft. A language modeling approach to information retrieval. In SIGIR'98, pages 275--281, Melbourne, Australia, 1998. Google ScholarDigital Library
- B. Tan and F. Peng. Unsupervised query segmentation using generative language models and Wikipedia. In WWW'08, pages 347--356, Beijing, China, 2008. Google ScholarDigital Library
- X. Wang and C. Zhai. Mining term association patterns from search logs for effective query reformulation. In CIKM'08, pages 479--488, Napa Valley, CA, 2008. Google ScholarDigital Library
- J. Xu and W. B. Croft. Improving the effectiveness of information retrieval with local context analysis. ACM Transactions on Information Systems, 18(1):79--112, 2000. Google ScholarDigital Library
- X. Xue and W. B. Croft. Representing queries as distributions. In SIGIR'10 Workshop on Query Representation and Understanding, pages 9--12, Geneva, Switzerland, 2010.Google Scholar
- C. Zhai and J. Lafferty. A study of smoothing methods for language models applied to ad hoc information retrieval. In SIGIR'01, pages 334--342, New Orleans, LA, 2001. Google ScholarDigital Library
Index Terms
- Modeling reformulation using passage analysis
Recommendations
Query reformulation using anchor text
WSDM '10: Proceedings of the third ACM international conference on Web search and data miningQuery reformulation techniques based on query logs have been studied as a method of capturing user intent and improving retrieval effectiveness. The evaluation of these techniques has primarily, however, focused on proprietary query logs and selected ...
Modeling reformulation using query distributions
Query reformulation modifies the original query with the aim of better matching the vocabulary of the relevant documents, and consequently improving ranking effectiveness. Previous models typically generate words and phrases related to the original ...
Information Retrieval with Verbose Queries
SIGIR '15: Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information RetrievalRecently, the focus of many novel search applications shifted from short keyword queries to verbose natural language queries. Examples include question answering systems and dialogue systems, voice search on mobile devices and entity search engines like ...
Comments