ABSTRACT
An increasing number of systems provide the ability to semantically annotate documents. OpenCalais [4], Evri API [2], Zemanta [6], and Alchemy API [1] are web-hosted systems that return annotated documents, i. e. documents with annotations that are overlayed on the document structure. Many of the annotations can be linked to standard ontologies, such as DBpedia and YAGO. These annotations give insight as to the meaning of documents in a variety of ways, identifying entities and relationships inside them, classifying them according to topic or theme, and giving the attitude or sentiment of a document or document fragment. In order for users (or applications) to make use of these annotations with a means to access and manipulate documents that contain them, we provide a query language for doing this and demonstrate its utility on a demo system built on top of diverse semantic annotators and external ontologies. We explain how integrating semantic annotations and utilizing external knowledge helps in increasing the quality of query answers over annotated documents by both filtering out irrelevant answers and obtaining extra answers that are not explicitly available in the annotated documents.
- Alchemyapi www.alchemyapi.com/api/entity/.Google Scholar
- Evriapi. www.evri.com/.Google Scholar
- Jenaapi. jena.sourceforge.net.Google Scholar
- Opencalais. www.opencalais.com/.Google Scholar
- Reuters-21578. www.daviddlewis.com/resources/testcollections/reuters21578/.Google Scholar
- Zemanta api. http://developer.zemanta.com/.Google Scholar
- A. Kiryakov, B. Popov, I. Terziev, D. Manov, and D. Ognyanoff. Semantic annotation, indexing, and retrieval. J. Web Semantics, 2(1):49--79, 2004. Google ScholarDigital Library
- M. A. Olson, K. Bostic, and M. Seltzer. Berkeley db. In USENIX Technical Conference, pages 43--43, 1999. Google ScholarDigital Library
- H. Pérez-Urbina, I. Horrocks, and B. Motik. Practical aspects of query rewriting for OWL2. In OWLED, 09.Google Scholar
- J. Pound, I. F. Ilyas, and G. E. Weddell. Quick: Expressive and flexible search over knowledge bases and text collections. PVLDB, 3(2):1573--1576, 2010. Google ScholarDigital Library
- M. Zhou, T. Cheng, and K. C.-C. Chang. DoCQS: a prototype system for supporting data-oriented content query. In SIGMOD, pages 1211--1214, 2010. Google ScholarDigital Library
Recommendations
Comparison of Methods to Annotate Named Entity Corpora
The authors compared two methods for annotating a corpus for the named entity (NE) recognition task using non-expert annotators: (i) revising the results of an existing NE recognizer and (ii) manually annotating the NEs completely. The annotation time, ...
A graph-based approach for ontology population with named entities
CIKM '12: Proceedings of the 21st ACM international conference on Information and knowledge managementAutomatically populating ontology with named entities extracted from the unstructured text has become a key issue for Semantic Web and knowledge management techniques. This issue naturally consists of two subtasks: (1) for the entity mention whose ...
Named entity recognition and disambiguation using linked data and graph-based centrality scoring
SWIM '12: Proceedings of the 4th International Workshop on Semantic Web Information ManagementNamed Entity Recognition (NER) is a subtask of information extraction and aims to identify atomic entities in text that fall into predefined categories such as person, location, organization, etc. Recent efforts in NER try to extract entities and link ...
Comments