skip to main content
10.5555/1699571.1699596dlproceedingsArticle/Chapter ViewAbstractPublication PagesemnlpConference Proceedingsconference-collections
research-article
Free Access

Improving verb clustering with automatically acquired selectional preferences

Published:06 August 2009Publication History

ABSTRACT

In previous research in automatic verb classification, syntactic features have proved the most useful features, although manual classifications rely heavily on semantic features. We show, in contrast with previous work, that considerable additional improvement can be obtained by using semantic features in automatic classification: verb selectional preferences acquired from corpus data using a fully unsupervised method. We report these promising results using a new framework for verb clustering which incorporates a recent subcategorization acquisition system, rich syntactic-semantic feature sets, and a variation of spectral clustering which performs particularly well in high dimensional feature space.

References

  1. Shane Bergsma, Dekang Lin, and Randy Goebel. Discriminative learning of selectional preference from unlabeled text. In Proc. of EMNLP, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Chris Brew and Sabine Schulte im Walde. Spectral clustering for german verbs. In Proc. of EMNLP, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Ted Briscoe, John Carroll, and Rebecca Watson. The second release of the rasp system. In Proc. of the COLING/ACL on Interactive presentation sessions, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Carsten Brockmann and Mirella Lapata. Evaluating and combining approaches to selectional preference acquisition. In Proc. of EACL, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Jinxiu Chen, Dong-Hong Ji, Chew Lim Tan, and Zheng-Yu Niu. Unsupervised relation disambiguation using spectral clustering. In Proc. of COLING/ACL, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Hoa Trang Dang. Investigations into the Role of Lexical Semantics in Word Sense Disambiguation. PhD thesis, CIS, University of Pennsylvania, 2004.Google ScholarGoogle Scholar
  7. Katrin Erk. A simple, similarity-based model for selectional preferences. In Proc. of ACL, 2007.Google ScholarGoogle Scholar
  8. David Graff. North american news text corpus. Linguistic Data Consortium, 1995.Google ScholarGoogle Scholar
  9. Eric Joanis. Automatic Verb Classification Using a General Feature Space. Master's thesis, University of Toronto, 2002.Google ScholarGoogle Scholar
  10. Eric Joanis, Suzanne Stevenson, and David James. A general feature space for automatic verb classification. Natural Language Engineering, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Karin Kipper-Schuler. VerbNet: A broad-coverage, comprehensive verb lexicon. 2005.Google ScholarGoogle Scholar
  12. Anna Korhonen, Yuval Krymolowski, and Ted Briscoe. A large subcategorization lexicon for natural language processing applications. In Proc. of the 5th LREC, 2006.Google ScholarGoogle Scholar
  13. Anna Korhonen, Yuval Krymolowski, and Nigel Collier. The Choice of Features for Classification of Verbs in Biomedical Texts. In Proc. of COLING, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Claudia Kunze and Lothar Lemnitzer. GermaNet-representation, visualization, application. In Proc. of LREC, 2002.Google ScholarGoogle Scholar
  15. Lillian. Lee. On the effectiveness of the skew divergence for statistical language analysis. In Artificial Intelligence and Statistics, 2001.Google ScholarGoogle Scholar
  16. Geoffrey Leech. 100 million words of english: the british national corpus. Language Research, 1992.Google ScholarGoogle Scholar
  17. Beth. Levin. English verb classes and alternations: A preliminary investigation. Chicago, IL, 1993.Google ScholarGoogle Scholar
  18. Jianguo Li and Chris Brew. Which Are the Best Features for Automatic Verb Classification. In Proc. of ACL, 2008.Google ScholarGoogle Scholar
  19. Diana McCarthy. Lexical Acquisition at the Syntax-Semantics Interface: Diathesis Alternations, Sub-categorization Frames and Selectional Preferences. PhD thesis, University of Sussex, UK, 2001.Google ScholarGoogle Scholar
  20. Marina. Meila. The multicut lemma. Technical report, University of Washington, 2001.Google ScholarGoogle Scholar
  21. Marina Meila and Jianbo Shi. A random walks view of spectral segmentation. AISTATS, 2001.Google ScholarGoogle Scholar
  22. George A. Miller. WordNet: a lexical database for English. Communications of the ACM, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Pedro J. Moreno, Purdy P. Ho, and Nuno Vasconcelos. A Kullback-Leibler divergence based kernel for SVM classification in multimedia applications. In Proc. of NIPS, 2004.Google ScholarGoogle Scholar
  24. Andrew Y. Ng, Michael Jordan, and Yair Weiss. On spectral clustering: Analysis and an algorithm. In Proc. of NIPS, 2002.Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Diarmuid Ó Séaghdha and Ann Copestake. Semantic classification with distributional kernels. In Proc. of COLING, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Judita Preiss, Ted Briscoe, and Anna Korhonen. A system for large-scale acquisition of verbal, nominal and adjectival subcategorization frames from corpora. In Proc. of ACL, 2007.Google ScholarGoogle Scholar
  27. Jan Puzicha, Thomas Hofmann, and Joachim M. Buhmann. A theory of proximity based clustering: Structure detection by optimization. Pattern Recognition, 2000.Google ScholarGoogle Scholar
  28. Sabine Schulte im Walde. Experiments on the automatic induction of german semantic verb classes. Computational Linguistics, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Sabine Schulte im Walde, Christian Hying, Christian Scheible, and Helmut Schmid. Combining EM Training and the MDL Principle for an Automatic Verb Classification incorporating Selectional Preferences. In Proc. of ACL, pages 496--504, 2008.Google ScholarGoogle Scholar
  30. Lei Shi and Rada Mihalcea. Putting pieces together: Combining FrameNet, VerbNet and WordNet for robust semantic parsing. In Proc. of CICLING, 2005.Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Suzanne Stevenson and Eric Joanis. Semi-supervised verb class discovery using noisy features. In Proc. of HLT-NAACL 2003, pages 71--78, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Lin Sun, Anna Korhonen, and Yuval Krymolowski. Verb class discovery from rich syntactic data. Lecture Notes in Computer Science, 4919:16, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Robert Swier and Suzanne Stevenson. Unsupervised semantic role labelling. In Proc. of EMNLP, 2004.Google ScholarGoogle Scholar
  34. Deepak Verma and Marina Meila. Comparison of spectral clustering methods. Advances in Neural Information Processing Systems (NIPS 15), 2003.Google ScholarGoogle Scholar
  35. Andreas Vlachos, Anna Korhonen, and Zoubin Ghahramani. Unsupervised and constrained dirichlet process mixture models for verb clustering. In Proc. of the Workshop on Geometrical Models of Natural Language Semantics, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Ulrike von Luxburg. A tutorial on spectral clustering. Statistics and Computing, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Beñat Zapirain, Eneko Agirre, and Lluís Màrquez. Robustness and generalization of role sets: PropBank vs. VerbNet. In Proc. of ACL, 2008.Google ScholarGoogle Scholar

Index Terms

  1. Improving verb clustering with automatically acquired selectional preferences

                  Recommendations

                  Comments

                  Login options

                  Check if you have access through your login credentials or your institution to get full access on this article.

                  Sign in
                  • Published in

                    cover image DL Hosted proceedings
                    EMNLP '09: Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
                    August 2009
                    616 pages
                    ISBN:9781932432626

                    Publisher

                    Association for Computational Linguistics

                    United States

                    Publication History

                    • Published: 6 August 2009

                    Qualifiers

                    • research-article

                    Acceptance Rates

                    Overall Acceptance Rate73of234submissions,31%

                  PDF Format

                  View or Download as a PDF file.

                  PDF

                  eReader

                  View online with eReader.

                  eReader