SnS: A Novel Word Sense Induction Method

Kozłowski, Marek; Rybiński, Henryk

doi:10.1007/978-3-319-08729-0_25

SnS: A Novel Word Sense Induction Method

Marek Kozłowski¹⁰ &
Henryk Rybiński¹⁰

Conference paper

1047 Accesses
8 Citations

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8537))

Abstract

The paper is devoted to the word sense induction problem. We propose a knowledge-poor method, called SenseSearcher (SnS), which induces senses of words from text corpora, based on closed frequent sets. The algorithm discovers a hierarchy of senses, rather than a flat list of concepts, so the results are easier to comprehend. We have evaluated the SnS quality by performing experiments for web search result clustering task with the datasets from SemEval-2013 Task 11.

This work was supported by the National Centre for Research and Development (NCBiR) under Grant No. SP/I/1/77065/10 devoted to the Strategic scientific research and experimental development program: ”Interdisciplinary System for Interactive Scientific and Scientific-Technical Information”.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Navigli, R., Vannella, D.: SemEval-2013 Task 11: Word Sense Induction and Disambiguation within an End-User Applications. In: Proc. 7th Int’l SemEval Workshop, The 2nd Joint Conf. on Lexical and Comp. Semantics (2013)
Google Scholar
Miller, G., Chadorow, M., Landes, S., Leacock, C., Thomas, R.: Using a semantic concordance for sense identification. In: Proceedings of the ARPA Human Language Technology Workshop, pp. 240–243 (1994)
Google Scholar
Mihalcea, R., Moldovan, D.: Automatic Generation of a Coarse Grained WordNet. In: Proc. of NAACL Workshop on WordNet and Other Lexical Resources (2001)
Google Scholar
Navigli, R.: Word sense disambiguation: A survey. ACM Computing Surveys 41(2) (2009)
Article Google Scholar
Schutze, H.: Automatic word sense discrimination. Computational Linguistics - Special Issue on Word Sense Disambiguation 24(1) (1998)
Google Scholar
Pedersen, T., Bruce, R.: Knowledge lean word sense disambiguation. In: Proceedings of the 15th National Conference on Artificial Intelligence (1998)
Google Scholar
Landauer, T., Dumais, S.: A solution to Platos problem: The Latent Semantic Analysis theory of the acquisition, induction, and representation of knowledge. Psychology Review (1997)
Google Scholar
Pantel, P., Lin, D.: Discovering word senses from text. In: Proceedings of the 8th International Conference on Knowledge Discovery and Data Mining (2002)
Google Scholar
Brody, S., Lapata, M.: Bayesian word sense induction. In: Proceedings of EACL 2009 (2009)
Google Scholar
Veronis, J.: Hyperlex: lexical cartography for information retrieval. Computer Speech and Language (2004)
Google Scholar
Agirre, E., Soroa, A.: Ubc-as: A graph based unsupervised system for induction and classification. In: Proc. 4th Int’l Workshop on Semantic Evaluations (2007)
Google Scholar
Dorow, B., Widdows, D.: Discovering corpus-specific word senses. In: Proceedings of the 10th Conference of the European Chapter of the ACL (2003)
Google Scholar
Maedche, A., Staab, S.: Discovering conceptual relations from text. In: Proceedings of the 14th European Conference on Artificial Intelligence (2000)
Google Scholar
Rybiński, H., Kryszkiewicz, M., Protaziuk, G., Kontkiewicz, A., Marcinkowska, K., Delteil, A.: Discovering word meanings based on frequent termsets. In: Raś, Z.W., Tsumoto, S., Zighed, D.A. (eds.) MCD 2007. LNCS (LNAI), vol. 4944, pp. 82–92. Springer, Heidelberg (2008)
Chapter Google Scholar
Nykiel, T., Rybinski, H.: Word Sense Discovery for Web Information Retrieval. In: Proc. 4th Int’l Workshop on Mining Complex Data (2008)
Google Scholar
Di Marco, A., Navigli, R.: Clustering and Diversifying Web Search Results with Graph-Based Word Sense Induction. Computational Linguistics (2013)
Google Scholar
Protaziuk, G., Kryszkiewicz, M., Rybiński, H., Delteil, A.: Discovering compound and proper nouns. In: Kryszkiewicz, M., Peters, J.F., Rybiński, H., Skowron, A. (eds.) RSEISP 2007. LNCS (LNAI), vol. 4585, pp. 505–515. Springer, Heidelberg (2007)
Chapter Google Scholar
Zaki, M., Hsiao, C.: CHARM: An efficient algorithm for closed itemset mining. In: Proceedings 2002 SIAM Int. Conf. Data Mining (2002)
Google Scholar
Zaki, M.: Closed itemset mining and non-redundant association rule mining. In: Encyclopedia of Database Systems (2009)
Google Scholar

Download references

Author information

Authors and Affiliations

Warsaw University of Technology, Warsaw, Poland
Marek Kozłowski & Henryk Rybiński

Authors

Marek Kozłowski
View author publications
You can also search for this author in PubMed Google Scholar
Henryk Rybiński
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institute of Computer Science, Warsaw University of Technology, Nowowiejska 15/19, 00-665, Warsaw, Poland
Marzena Kryszkiewicz & Zbigniew W. Raś &
Department of Computer Science and Artificial Intelligence, University of Granada, Calle del Periodista Daniel Saucedo Aranda s/n, 18071, Granada, Spain
Chris Cornelis
DISCo, Università di Milano – Bicocca, Viale Sarca 336 – U14, 20126, Milano, Italy
Davide Ciucci
Dpt. de Matemáticas, University of Càdiz, Spain
Jesús Medina-Moreno
School of Computing and Information Systems, University of Tasmania, Japan
Hiroshi Motoda

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kozłowski, M., Rybiński, H. (2014). SnS: A Novel Word Sense Induction Method. In: Kryszkiewicz, M., Cornelis, C., Ciucci, D., Medina-Moreno, J., Motoda, H., Raś, Z.W. (eds) Rough Sets and Intelligent Systems Paradigms. Lecture Notes in Computer Science(), vol 8537. Springer, Cham. https://doi.org/10.1007/978-3-319-08729-0_25

Download citation

DOI: https://doi.org/10.1007/978-3-319-08729-0_25
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-08728-3
Online ISBN: 978-3-319-08729-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics