research-article

Answering Topical Information Needs Using Neural Entity-Oriented Information Retrieval and Extraction

Author:
Shubham Chatterjee

University of New Hampshire, USA

University of New Hampshire, USA
View Profile

Authors Info & Claims

ACM SIGIR Forum Volume 56 Issue 2December 2022Article No.: 20pp 1–2https://doi.org/10.1145/3582900.3582926

Published:31 January 2023Publication History

ACM SIGIR Forum

Abstract

In the modern world, search engines are an integral part of human lives. The field of Information Retrieval (IR) is concerned with finding material (usually documents) of an unstructured nature (usually text) that satisfies an information need (query) from within large collections (usually stored on computers). The search engine then displays a ranked list of results relevant to our query. Traditional document retrieval algorithms match a query to a document using the overlap of words in both. However, the last decade has seen the focus shifting to leveraging the rich semantic information available in the form of entities.

Entities are uniquely identifiable objects or things such as places, events, diseases, etc. that exist in the real or fictional world. Entity-oriented search systems leverage the semantic information associated with entities (e.g., names, types, etc.) to better match documents to queries. Web search engines would provide better search results if they understand the meaning of a query.

This dissertation advances the state-of-the-art in IR by developing novel algorithms that understand text (query, document, question, sentence, etc.) at the semantic level. To this end, this dissertation aims to understand the fine-grained meaning of entities from the context in which the entities have been mentioned, for example, "oysters" in the context of food versus ecosystems. Further, this dissertation aims to automatically learn (vector) representations of entities that incorporate this fine-grained knowledge and knowledge about the query. This dissertation refines the automatic understanding of text passages using deep learning, a modern artificial intelligence paradigm.

This dissertation utilizes the semantic information extracted from entities to retrieve materials (text and entities) relevant to a query. The interplay between text and entities in the text is studied by addressing three related prediction problems: (1) Identify entities that are relevant for the query, (2) Understand an entity's meaning in the context of the query, and (3) Identify text passages that elaborate the connection between the query and an entity.

The research presented in this dissertation may be integrated into a larger system designed for answering complex topical queries such as dark chocolate health benefits which require the search engine to automatically understand the connections between the query and the relevant material, thus transforming the search engine into an answering engine.

Awarded by: University of New Hampshire, Durham, USA on 1 September 2022.

Supervised by: Laura Dietz.

Available at: https://scholars.unh.edu/dissertation/2714/.

References

Shubham Chatterjee and Laura Dietz. Why Does This Entity Matter? Support Passage Retrieval for Entity Retrieval. In Proceedings of the 2019 ACM SIGIR International Conference on Theory of Information Retrieval, ICTIR '19, page 221--224, New York, NY, USA, 2019. Association for Computing Machinery. ISBN 9781450368810. Google ScholarDigital Library
Shubham Chatterjee and Laura Dietz. Entity Retrieval Using Fine-Grained Entity Aspects. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '21, page 1662--1666, New York, NY, USA, 2021. Association for Computing Machinery. ISBN 9781450380379. Google ScholarDigital Library
Shubham Chatterjee and Laura Dietz. BERT-ER: Query-Specific BERT Entity Representations for Entity Ranking. In Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '22, page 1466--1477, New York, NY, USA, 2022a. Association for Computing Machinery. ISBN 9781450387323. Google ScholarDigital Library
Shubham Chatterjee and Laura Dietz. Predicting Guiding Entities for Entity Aspect Linking. In Proceedings of the 31st ACM International Conference on Information and Knowledge Management, CIKM '22, New York, NY, USA, 2022b. Association for Computing Machinery. ISBN 978145039236. Google ScholarDigital Library
Laura Dietz, Shubham Chatterjee, Connor Lennox, Sumanta Kashyapi, Pooja Oza, and Ben Gamari. Wikimarks: Harvesting Relevance Benchmarks from Wikipedia. In Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '22, page 3003--3012, New York, NY, USA, 2022. Association for Computing Machinery. ISBN 9781450387323. Google ScholarDigital Library

Recommendations

Answering Topical Information Needs Using Neural Entity-Oriented Information Retrieval and Extraction
Read More
An Entity-Oriented Approach for Answering Topical Information Needs
Advances in Information Retrieval
Abstract
In this dissertation, we adopt an entity-oriented approach to identify relevant materials for answering a topical keyword query such as “Cholera”. To this end, we study the interplay between text and entities by addressing three related prediction ...
Read More
The impact of named entity normalization on information retrieval for question answering
ECIR'08: Proceedings of the IR research, 30th European conference on Advances in information retrieval

In the named entity normalization task, a system identifies a canonical unambiguous referent for names like Bush or Alabama. Resolving synonymy and ambiguity of such names can benefit end-to-end information access tasks. We evaluate two entity ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in

ACM SIGIR Forum Volume 56, Issue 2
December 2022
159 pages
ISSN:0163-5840
DOI:10.1145/3582900
Issue’s Table of Contents

Copyright © 2023 Copyright is held by the owner/author(s)
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 31 January 2023
Check for updates
Qualifiers
- research-article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 17
  Total Downloads
- Downloads (Last 12 months)13
- Downloads (Last 6 weeks)1
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Answering Topical Information Needs Using Neural Entity-Oriented Information Retrieval and Extraction

ACM SIGIR Forum

Abstract

References

Cited By

Recommendations

Answering Topical Information Needs Using Neural Entity-Oriented Information Retrieval and Extraction

An Entity-Oriented Approach for Answering Topical Information Needs

The impact of named entity normalization on information retrieval for question answering

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Check for updates

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Answering Topical Information Needs Using Neural Entity-Oriented Information Retrieval and Extraction

ACM SIGIR Forum

Abstract

References

Cited By

Recommendations

Answering Topical Information Needs Using Neural Entity-Oriented Information Retrieval and Extraction

An Entity-Oriented Approach for Answering Topical Information Needs

The impact of named entity normalization on information retrieval for question answering

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Check for updates

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media