Abstract
We present a knowledge-rich software agent, ContextExplicator, which mediates between the Web and the user’s information or knowledge needs. It provides a method for incremental knowledge-level management (i.e., knowledge discovery, acquisition and representation) for heterogeneous information in the Web.
In ContextExplicator, the incremental knowledge management works through iterative negotiations with the human user:
-
1
Automatic Word-Sense Disambiguation and Induction. General knowledge (e.g., from a lexicon) and previously discovered knowledge support the sense-disambiguation & sense-induction of a word in the given documents, resulting in an improved and refined organization of previously discovered knowledge,
-
2
Interactive Specialization of Query Criteria. At a given moment, the user can reduce certain semantic ambiguities of previously discovered knowledge by selecting one of the context-words which are suggested by ContextExplicator to discriminate between sets of retrieved documents. The selected context-word is also used to direct the discovery of new knowledge in the given documents, and
-
3
Visualization of the Discovered Knowledge. The discovered knowledge is represented in a conceptual lattice. Each lattice-node represents a single word-sense or a conjunction of senses of multiple words. To each node the respectively identified documents are associated. Each web-document is multi-classified into relevant word-sense clusters (lattice nodes), according to the occurrences of specific word-senses in the respective web-document. As a conceptual lattice allows the user to navigate the word-sense clusters and the classified web-documents with multi-level abstractions (i.e., super-/sub-lattice nodes), it provides a flexible scheme for managing knowledge and web-documents in a scalable way.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Barwise, J., Seligman, J.: The rights and wrongs of natural regularity. Ridgeview (1994)
Billhardt, H., Borrajo, D., Maojo, V.: A context vector model for information retrieval. JASIST 53(3), 236–249 (2002)
Bollmann-Sdorra, P., Raghavan, V.V.: On the necessity of term dependence in a query space for weighted retrieval. JASIS 49(13), 1161–1168 (1998)
Christopher Stokoe, M.P.O., Tait, J.: Word sense disambiguation in information retrieval revisited. In: The Proceedings of the 26th SIGIR Conference, pp. 159–166. ACM Press, New York (2003)
Deerwester, S., Dumais, S.T., Furnas, G.W., Landauer, T.K., Harshman, R.: Indexing by latent semantic analysis. JASIS 41(6), 391–407 (1990)
Fan, W., Gordon, M.D., Pathak, P.: Discovery of context-specific ranking functions for effective information retrieval using genetic programming. IEEE Transactions on Knowledge and Data Engineering 16(4), 523–527 (2004)
Joyce Yue Chai, A.W.B.: The use of word sense disambiguation in an information extraction system. In: AAAI/IAAI 1998, pp. 850–855. AAAI / MIT (1998)
Mihalcea, R., Moldovan, D.: Semantic indexing using wordnet senses. In: ACL Workshop, ACL (2000)
Rungsawang, A.: Dsir: the first trec-7 attempt. In: Proceedings of the Seventh Text REtrieval Conference (TREC 1998), National Institute of Standards and Technology (NIST), pp. 366–372 (1998)
Sanderson, M.: Retrieving with good sense. Information Retrieval 2(1), 49–69 (2000)
Voorhees, E.M.: Using wordnet to disambiguate word senses for text retrieval. In: The Proceedings of the 20th SIGIR Conference, pp. 171–180. ACM Press, New York (1993)
Wordnet (2004), http://www.cogsci.princeton.edu/~wn/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yoo, S.Y., Hoffmann, A. (2005). Knowledge-Level Management of Web Information. In: Zhang, Y., Tanaka, K., Yu, J.X., Wang, S., Li, M. (eds) Web Technologies Research and Development - APWeb 2005. APWeb 2005. Lecture Notes in Computer Science, vol 3399. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-31849-1_59
Download citation
DOI: https://doi.org/10.1007/978-3-540-31849-1_59
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-25207-8
Online ISBN: 978-3-540-31849-1
eBook Packages: Computer ScienceComputer Science (R0)