Abstract
Due to the inherent difficulties associated with manual ontology building, knowledge acquisition and reuse are often seen as methods that can make this tedious process easier. In this paper we present an NLP-based method to aid ontology design in a specific setting, namely that of semantic annotation of text. The method uses the World Wide Web in its analysis of domain-specific documents, eliminating the need for linguistic knowledge and resources, and suggests ways to specify domain ontologies in a “linguistics-friendly” format in order to improve subsequent ontology-based natural language processing tasks such as semantic annotation. We evaluate the method on a corpus in a real-world setting in the medical domain and compare the costs and benefits of the NLP-based ontology engineering approach against a similar reuse-oriented experiment.
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bontas, E.P., Schlangen, D., Schrader, T. (2005). Creating Ontologies for Content Representation—The OntoSeed Suite. In: Meersman, R., Tari, Z. (eds) On the Move to Meaningful Internet Systems 2005: CoopIS, DOA, and ODBASE. OTM 2005. Lecture Notes in Computer Science, vol 3761. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11575801_23
DOI: https://doi.org/10.1007/11575801_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29738-3
Online ISBN: 978-3-540-32120-0
eBook Packages: Computer Science (R0)