Published May 31, 2016 | Version v1
Conference paper Open

Optimizing Spectral Learning for Parsing

  • 1. University of Edinburgh

Description

We describe a search algorithm for optimizing the number of latent states when estimating latent-variable PCFGs (L-PCFGs) with spectral methods. Our results show that, contrary to the common belief that the number of latent states for each nonterminal in an L-PCFG can be decided in isolation with spectral methods, parsing results improve significantly when the number of latent states for each nonterminal is globally optimized, taking into account interactions between the different nonterminals. In addition, we contribute an empirical analysis of spectral algorithms on eight morphologically rich languages: Basque, French, German, Hebrew, Hungarian, Korean, Polish and Swedish. Our results show that our estimation consistently performs better than or close to coarse-to-fine expectation-maximization techniques for these languages.

Files

acl16spmrl.pdf (247.1 kB)
md5:fdfc13018ec2f5604afae15074293f8c

Additional details

Funding

SUMMA – Scalable Understanding of Multilingual Media (grant no. 688139)
European Commission