A Recursive Approach to Entity Ranking and List Completion Using Entity Determining Terms, Qualifiers and Prominent n-Grams

Ramanathan, Madhu; Rajagopal, Srikant; Karthik, Venkatesh; Murugeshan, Meenakshi Sundaram; Mukherjee, Saswati

doi:10.1007/978-3-642-14556-8_29

Madhu Ramanathan¹⁹,
Srikant Rajagopal¹⁹,
Venkatesh Karthik¹⁹,
Meenakshi Sundaram Murugeshan¹⁹ &
…
Saswati Mukherjee¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6203))

Included in the following conference series:

International Workshop of the Initiative for the Evaluation of XML Retrieval

560 Accesses

Abstract

This paper presents our approach for INEX 2009 Entity Ranking track which consists of two subtasks viz. Entity Ranking and List Completion. Retrieving the correct entities according to the user query is a three-step process viz. extracting the required information from the query and the provided categories, extracting the relevant documents which may be either prospective entities or intermediate pointers to prospective entities by making use of the structure available in the Wikipedia Corpus and finally ranking the resultant set of documents. We have extracted the Entity Determining Terms (EDTs), Qualifiers and prominent n-grams from the query, strategically exploited the relation between the extracted terms and the structure and connectedness of the corpus to retrieve links which are highly probable of being entities and then used a recursive mechanism for retrieving relevant documents through the Lucene Search. Our ranking mechanism combines various approaches that make use of category information, links, titles and WordNet information, initial description and the text of the document.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Voorhees, E.M.: Overview of the TREC 2001 Question Answering Track. In: Proceedings of the 10th Text Retrieval Conference (2001)
Google Scholar
Craswell, N., de Vries, A.P., Soboroff, I.: Overview of the TREC 2005 Enterprise Track. In: Proceedings of the 14th Text Retrieval Conference (2005)
Google Scholar
de Vries, A.P.: Overview of the INEX 2007 Entity Ranking Track. In: Fuhr, N., Kamps, J., Lalmas, M., Trotman, A. (eds.) INEX 2007. LNCS, vol. 4862, pp. 245–251. Springer, Heidelberg (2008)
Chapter Google Scholar
Chen, J., Diekema, A., Taffett, M.D., McCracken, N., Ozgencil, N.E., Yilmazel, O., Liddy, E.D.: Question Answering: CNLP at the TREC 10 Question Answering Track. In: Proceedings of the 10th Text Retrieval Conference (2001)
Google Scholar
Yang, H., Chua, T.-S.: Web-based list question answering. In: Proceedings of the 20th International Conference on Computational Linguistics (2004)
Google Scholar
Kazama, J., Torisawa, K.: Exploiting Wikipedia as External Knowledge for Named Entity Recognition. In: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, EMNLP-CoNLL (2007)
Google Scholar
Hermjakob, U., Hovy, E.H., Lin, C.-Y.: Knowledge-Based Question Answering. In: Proceedings of the 6th World Multiconference on Systems, Cybernatics and Informatics (SCI 2002), Orlando, FL, U.S.A., July 14-18 (2002)
Google Scholar
Murugeshan, M.S., Mukherjee, S.: An n-Gram and Initial Description Based Approach for Entity Ranking Track. In: Fuhr, N., Kamps, J., Lalmas, M., Trotman, A. (eds.) INEX 2007. LNCS, vol. 4862, pp. 293–305. Springer, Heidelberg (2008)
Chapter Google Scholar
Thom, J.A., Pehcevski, J., Vercoustre, A.-M.: Use of Wikipedia Categories in Entity Ranking. In: Proceedings of the 12th Australasian Document Computing Symposium (ADCS 2007), Melbourne, Australia, December 10 (2007)
Google Scholar
Cilibrasi, R., Vitanyi, P.: The Google Similarity Distance. IEEE Trans. Knowledge and Data Engineering 19(3), 370–383 (2007)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science and Engineering, College of Engineering, Guindy, Anna University, Chennai, 60025, India
Madhu Ramanathan, Srikant Rajagopal, Venkatesh Karthik, Meenakshi Sundaram Murugeshan & Saswati Mukherjee

Authors

Madhu Ramanathan
View author publications
You can also search for this author in PubMed Google Scholar
Srikant Rajagopal
View author publications
You can also search for this author in PubMed Google Scholar
Venkatesh Karthik
View author publications
You can also search for this author in PubMed Google Scholar
Meenakshi Sundaram Murugeshan
View author publications
You can also search for this author in PubMed Google Scholar
Saswati Mukherjee
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculty of Science and Technology, Queensland University of Technology, GPO Box 2434, 4001, Brisbane, Qld, Australia
Shlomo Geva
Archives and Information Studies/Humanities, University of Amsterdam, Turfdraagsterpad 9, 1012 XT, Amsterdam, The Netherlands
Jaap Kamps
Department of Computer Science, University of Otago, P.O. Box 56,, 9054, Dunedin, New Zealand
Andrew Trotman

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ramanathan, M., Rajagopal, S., Karthik, V., Murugeshan, M.S., Mukherjee, S. (2010). A Recursive Approach to Entity Ranking and List Completion Using Entity Determining Terms, Qualifiers and Prominent n-Grams. In: Geva, S., Kamps, J., Trotman, A. (eds) Focused Retrieval and Evaluation. INEX 2009. Lecture Notes in Computer Science, vol 6203. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14556-8_29

Download citation

DOI: https://doi.org/10.1007/978-3-642-14556-8_29
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-14555-1
Online ISBN: 978-3-642-14556-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics