Abstract
This paper proposes to understand the retrieval process of relevant documents against a query as a two-stage process: at first an identification of the reason why a document is relevant to a query that we called the Effective Relevance Link, and second the valuation of this link, known as the Relevance Status Value (RSV). We present a formal definition of this semantic link between d and q. In addition, we clarify how an existing IR model, like Vector Space model, could be used for realizing and integrating this formal notion to build new effective IR methods. Our proposal is validated against three corpuses and using three types of indexing terms. The experimental results showed that the effective link between d and q is very important and should be more taken into consideration when setting up an Information Retrieval (IR) Model or System. Finally, our work shows that taking into account this effective link in a more explicit and direct way into existing IR models does improve their retrieval performance.
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Abdulahhad, K., Chevallet, J.-P., Berrut, C.: Solving concept mismatch through bayesian framework by extending umls meta-thesaurus. In: la huitième édition de la COnférence en Recherche d’Information et Applications (CORIA 2011), Avignon, France, March 16–18 (2011)
Amati, G., Van Rijsbergen, C.J.: Probabilistic models of information retrieval based on measuring the divergence from randomness. ACM Trans. Inf. Syst. 20(4), 357–389 (2002)
Aronson, A.R.: Metamap: Mapping text to the UMLS metathesaurus (2006)
Buckley, C., Salton, G., Allan, J., Singhal, A.: Automatic Query Expansion Using SMART: TREC 3. In: TREC (1994)
Chiaramella, Y., Chevallet, J.P.: About retrieval models and logic. Comput. J. 35, 233–242 (1992)
Chiaramella, Y., Mulhem, P., Fourel, F.: A model for multimedia information retrieval. Technical report (1996)
Clinchant, S., Gaussier, E.: Information-based models for ad hoc ir. In: Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2010, pp. 234–241. ACM, New York (2010)
Crestani, F.: Exploiting the similarity of non-matching terms at retrievaltime. Inf. Retr. 2(1), 27–47 (2000)
Dominich, S.: Mathematical Foundations of Information Retrieval, 1st edn. Mathematical Modelling: Theory and Applications. Springer (March 2001)
Fang, H., Tao, T., Zhai, C.: A formal study of information retrieval heuristics. In: Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2004, pp. 49–56. ACM, New York (2004)
Losada, D.E., Barreiro, A.: A logical model for information retrieval based on propositional logic and belief revision. The Computer Journal 44, 410–424 (2001)
Nie, J.: An outline of a general model for information retrieval systems. In: Proceedings of the 11th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 1988, pp. 495–506. ACM, New York (1988)
Ponte, J.M., Bruce Croft, W.: A language modeling approach to information retrieval. In: Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 1998, pp. 275–281. ACM, New York (1998)
Robertson, S.E.: The probability ranking principle in IR. In: Readings in Information Retrieval, pp. 281–286. Morgan Kaufmann Publishers Inc., San Francisco (1997)
Robertson, S.E., Walker, S.: Some simple effective approximations to the 2-poisson model for probabilistic weighted retrieval. In: Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 1994, pp. 232–241. Springer-Verlag New York, Inc., New York (1994)
Rocchio, J.: Relevance Feedback in Information Retrieval, pp. 313–323 (1971)
Rose, D.E., Stevens, C.: V-twin: A lightweight engine for interactive use. In: TREC (1996)
Salton, G., Wong, A., Yang, C.S.: A vector space model for automatic indexing. Communications of the ACM (18), 613–620 (1975)
Salton, G., McGill, M.J.: Introduction to Modern Information Retrieval. McGraw-Hill, Inc., New York (1986)
Singhal, A., Buckley, C., Mitra, M.: Pivoted document length normalization. In: Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 1996, pp. 21–29. ACM, New York (1996)
van Rijsbergen, C.J.: A non-classical logic for information retrieval. Comput. J. 29(6), 481–485 (1986)
Wilkinson, R., Zobel, J., Sacks-Davis, R.: Similarity measures for short queries. In: TREC (1995)
Zhai, C., Lafferty, J.: A study of smoothing methods for language models applied to ad hoc information retrieval. In: Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2001, pp. 334–342. ACM, New York (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Abdulahhad, K., Chevallet, JP., Berrut, C. (2012). The Effective Relevance Link between a Document and a Query. In: Liddle, S.W., Schewe, KD., Tjoa, A.M., Zhou, X. (eds) Database and Expert Systems Applications. DEXA 2012. Lecture Notes in Computer Science, vol 7446. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32600-4_16
Download citation
DOI: https://doi.org/10.1007/978-3-642-32600-4_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32599-1
Online ISBN: 978-3-642-32600-4
eBook Packages: Computer ScienceComputer Science (R0)