skip to main content
10.1145/3539618.3591966acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
short-paper

ConQueR: Contextualized Query Reduction using Search Logs

Published:18 July 2023Publication History

ABSTRACT

Query reformulation is a key mechanism to alleviate the linguistic chasm of query in ad-hoc retrieval. Among various solutions, query reduction effectively removes extraneous terms and specifies concise user intent from long queries. However, it is challenging to capture hidden and diverse user intent. This paper proposes Contextualized Query Reduction (ConQueR) using a pre-trained language model (PLM). Specifically, it reduces verbose queries with two different views: core term extraction and sub-query selection. One extracts core terms from an original query at the term level, and the other determines whether a sub-query is a suitable reduction for the original query at the sequence level. Since they operate at different levels of granularity and complement each other, they are finally aggregated in an ensemble manner. We evaluate the reduction quality of ConQueR on real-world search logs collected from a commercial web search engine. It achieves up to 8.45% gains in exact match scores over the best competing model.

References

  1. Yuki Amemiya, Tomohiro Manabe, Sumio Fujita, and Tetsuya Sakai. 2021. How Do Users Revise Zero-Hit Product Search Queries?. In ECIR. 185--192.Google ScholarGoogle Scholar
  2. Peter G. Anick. 2003. Using terminological feedback for web search refinement: a log-based study. In SIGIR. 88--95.Google ScholarGoogle Scholar
  3. Hiteshwar Kumar Azad and Akshay Deepak. 2019. Query expansion techniques for information retrieval: A survey. Inf. Process. Manag., Vol. 56, 5 (2019), 1698--1735.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Michael Bendersky and W. Bruce Croft. 2008. Discovering Key Concepts in Verbose Queries. In SIGIR. 491--498.Google ScholarGoogle Scholar
  5. Michael Bendersky, Donald Metzler, and W. Bruce Croft. 2011. Parameterized Concept Weighting in Verbose Queries. In SIGIR. 605--614.Google ScholarGoogle Scholar
  6. Kaibo Cao, Chunyang Chen, Sebastian Baltes, Christoph Treude, and Xiang Chen. 2021. Automated Query Reformulation for Efficient Search based on Query Logs From Stack Overflow. In ICSE. 1273--1285.Google ScholarGoogle Scholar
  7. Claudio Carpineto and Giovanni Romano. 2012. A Survey of Automatic Query Expansion in Information Retrieval. ACM Comput. Surv., Vol. 44, 1 (2012), 1:1--1:50.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Messaoud Chaa, Omar Nouali, and Patrice Bellot. 2016. Verbose Query Reduction by Learning to Rank for Social Book Search Track. In CLEF (CEUR Workshop Proceedings). 1072--1078.Google ScholarGoogle Scholar
  9. Junyoung Chung, Caglar Gülcehre, KyungHyun Cho, and Yoshua Bengio. 2014. Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling. CoRR (2014).Google ScholarGoogle Scholar
  10. Kevin Clark, Minh-Thang Luong, Quoc V. Le, and Christopher D. Manning. 2020. ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators. In ICLR.Google ScholarGoogle Scholar
  11. Charles L. A. Clarke, Nick Craswell, Ian Soboroff, and Gordon V. Cormack. 2010. Overview of the TREC 2010 Web Track. In TREC, Vol. 500--294.Google ScholarGoogle Scholar
  12. Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In NAACL-HLT. 4171--4186.Google ScholarGoogle Scholar
  13. Manish Gupta and Michael Bendersky. 2015. Information Retrieval with Verbose Queries. In SIGIR. 1121--1124.Google ScholarGoogle Scholar
  14. Kishaloy Halder, Heng-Tze Cheng, Ellie Ka In Chio, Georgios Roumpos, Tao Wu, and Ritesh Agarwal. 2020. Modeling Information Need of Users in Search Sessions. CoRR (2020).Google ScholarGoogle Scholar
  15. Rosie Jones and Daniel C. Fain. 2003. Query word deletion prediction. In SIGIR. 435--436.Google ScholarGoogle Scholar
  16. Jürgen Koenemann and Nicholas J. Belkin. 1996. A Case for Interaction: A Study of Interactive Information Retrieval Behavior and Effectiveness. In CHI. 205--212.Google ScholarGoogle Scholar
  17. Bevan Koopman, Liam Cripwell, and Guido Zuccon. 2017. Generating Clinical Queries from Patient Narratives: A Comparison between Machines and Humans. In SIGIR. 853--856.Google ScholarGoogle Scholar
  18. Giridhar Kumaran and Vitor R. Carvalho. 2009. Reducing long queries using query quality predictors. In SIGIR. 564--571.Google ScholarGoogle Scholar
  19. Mike Lewis, Yinhan Liu, Naman Goyal, Marjan Ghazvininejad, Abdelrahman Mohamed, Omer Levy, Veselin Stoyanov, and Luke Zettlemoyer. 2020. BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension. In ACL. 7871--7880.Google ScholarGoogle Scholar
  20. K. Tamsin Maxwell and W. Bruce Croft. 2013. Compact query term selection using topically related text. In SIGIR. 583--592.Google ScholarGoogle Scholar
  21. Shahrzad Naseri, Jeff Dalton, Andrew Yates, and James Allan. 2021. CEQE: Contextualized Embeddings for Query Expansion. In ECIR, Vol. 12656. 467--482.Google ScholarGoogle Scholar
  22. Rodrigo Nogueira and Kyunghyun Cho. 2019. Passage Re-ranking with BERT. CoRR (2019).Google ScholarGoogle Scholar
  23. Jessie Ooi, Xiuqin Ma, Hongwu Qin, and Siau Chuin Liew. 2015. A survey of query expansion, query suggestion and query refinement techniques. In ICSECS. IEEE, 112--117.Google ScholarGoogle Scholar
  24. Greg Pass, Abdur Chowdhury, and Cayley Torgeson. 2006. A picture of search. In Infoscale. 1.Google ScholarGoogle Scholar
  25. Alec Radford, Jeff Wu, Rewon Child, David Luan, Dario Amodei, and Ilya Sutskever. 2019. Language Models are Unsupervised Multitask Learners. (2019).Google ScholarGoogle Scholar
  26. Eldar Sadikov, Jayant Madhavan, Lu Wang, and Alon Y. Halevy. 2010. Clustering query refinements by user intent. In WWW. 841--850.Google ScholarGoogle Scholar
  27. Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, and Illia Polosukhin. 2017. Attention is All you Need. In NeurIPS. 5998--6008.Google ScholarGoogle Scholar
  28. Bienvenido Vé lez, Ron Weiss, Mark A. Sheldon, and David K. Gifford. 1997. Fast and Effective Query Refinement. In SIGIR. 6--15.Google ScholarGoogle Scholar
  29. Wenjie Wang, Fuli Feng, Xiangnan He, Liqiang Nie, and Tat-Seng Chua. 2021. Denoising Implicit Feedback for Recommendation. In WSDM. 373--381.Google ScholarGoogle Scholar
  30. Xiaobing Xue, Samuel J. Huston, and W. Bruce Croft. 2010. Improving verbose queries using subset distribution. In CIKM. 1059--1068.Google ScholarGoogle Scholar
  31. Bishan Yang, Nish Parikh, Gyanit Singh, and Neel Sundaresan. 2014. A Study of Query Term Deletion Using Large-Scale E-commerce Search Logs. In ECIR. 235--246.Google ScholarGoogle Scholar
  32. Peilin Yang and Hui Fang. 2017. Can Short Queries Be Even Shorter?. In ICTIR. 43--50.Google ScholarGoogle Scholar
  33. Zhi Zheng, Kai Hui, Ben He, Xianpei Han, Le Sun, and Andrew Yates. 2020. BERT-QE: Contextualized Query Expansion for Document Re-ranking. In Findings of ACL. 4718--4728.Google ScholarGoogle Scholar
  34. Ingrid Zukerman, Bhavani Raskutti, and Yingying Wen. 2003. Query Expansion and Query Reduction in Document Retrieval. In ICTAI. 552--559.Google ScholarGoogle Scholar

Index Terms

  1. ConQueR: Contextualized Query Reduction using Search Logs

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      SIGIR '23: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval
      July 2023
      3567 pages
      ISBN:9781450394086
      DOI:10.1145/3539618

      Copyright © 2023 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 18 July 2023

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • short-paper

      Acceptance Rates

      Overall Acceptance Rate792of3,983submissions,20%
    • Article Metrics

      • Downloads (Last 12 months)89
      • Downloads (Last 6 weeks)8

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader