Improving Ranking and Robustness of Search Systems by Exploiting the Popularity of Documents

Bah, Ashraf; Carterette, Ben

doi:10.1007/978-3-319-28940-3_14

Conference paper
First Online: 22 January 2016

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 9460)

Abstract

In building Information Retrieval systems, much research is geared towards optimizing a specific aspect of the system. Consequently, many systems improve the effectiveness of search results by striving to outperform a baseline system. Other systems, however, focus on improving robustness by minimizing the risk of obtaining, for any topic, a result worse than that of the baseline system. Both tasks were organized by the TREC Web tracks in 2013 and 2014 and undertaken by the track participants. Our work proposes two re-ranking approaches, based on exploiting the popularity of documents with respect to a general topic, that improve effectiveness while also improving the robustness of the baseline systems. We used each of the runs submitted to the TREC Web tracks 2013–14 as a baseline, and empirically show that our algorithms improve both the effectiveness and the robustness of the systems in an overwhelming number of cases, even though the systems used to produce the runs employ a variety of retrieval models.

Author information

Authors and Affiliations

Department of Computer Sciences, University of Delaware, Newark, DE, USA
Ashraf Bah & Ben Carterette

Corresponding author

Correspondence to Ashraf Bah.

Editor information

Editors and Affiliations

Science and Engineering Faculty, Queensland University of Technology, Brisbane, Australia
Guido Zuccon
Brisbane, Queensland, Australia
Shlomo Geva
University of Tsukuba, Ibaraki, Japan
Hideo Joho
RMIT University, Melbourne, Australia
Falk Scholer
School of Computer Engineering, Nanyang Technological University, Singapore, Singapore
Aixin Sun
Tianjin University, China
Peng Zhang

About this paper

Cite this paper

Bah, A., Carterette, B. (2015). Improving Ranking and Robustness of Search Systems by Exploiting the Popularity of Documents. In: Zuccon, G., Geva, S., Joho, H., Scholer, F., Sun, A., Zhang, P. (eds) Information Retrieval Technology. AIRS 2015. Lecture Notes in Computer Science, vol 9460. Springer, Cham. https://doi.org/10.1007/978-3-319-28940-3_14

DOI: https://doi.org/10.1007/978-3-319-28940-3_14
Published: 22 January 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-28939-7
Online ISBN: 978-3-319-28940-3
eBook Packages: Computer Science, Computer Science (R0)
