Abstract
Machine Translation (MT) systems based on Data-Oriented Parsing (DOP: Bod, 1998) and LFG-DOP (Bod & Kaplan, 1998) may be viewed as instances of Example-Based MT (EBMT). In both approaches, new translations are processed with respect to previously seen translations residing in the system’s database. We describe the DOT models of translation (Poutsma, 1998, Poutsma, 2000) based on DOP. We demonstrate that DOT1 is not guaranteed to produce the correct translation, despite provably deriving the most probable translation. The DOT2 translation model solves most of the problems of DOT1, but suffers from limited compositionality when confronted with certain data. Notwithstanding the success of DOT2, any system based purely on trees will ultimately be found wanting as a general solution to the wide diversity of translation problems, as certain linguistic phenomena require a description at levels deeper than surface syntax. We then show how LFG-DOP can be extended to serve as a novel hybrid model for MT, LFG-DOT (Way, 2001), which promises to improve upon DOT and other EBMT systems.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Abeillé, A., Y. Schabes and A.K. Joshi. 1990. Using Lexicalised Tags for Machine Translation. In COLING-90, Papers Presented to the 13th International Conference on Computational Linguistics, Helsinki, Finland, 3:1–6.
Bod, R. 1998. Beyond Grammar: An Experience-Based Theory of Language, CSLI, Stanford, CA.
Bod, R. and R. Kaplan. 1998. A Probabilistic Corpus-Driven Model for Lexical-Functional Analysis. In 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, Montreal, Quebec, Canada, 1:145–151.
Bonnema, R., P. Buying and R. Scha. 2000. Parse Tree Probability in Data-Oriented Parsing. In CICLING: Proceedings of the International Conference on Intelligent Text Processing and Computational Linguistics, Mexico City, pp.219–232.
Cahill, A., M. McCarthy, J. van Genabith and A. Way. 2002. Automatic Annotation of the Penn-Treebank with LFG F-Structure Information. In Proceedings of the LREC Workshop on Linguistic Knowledge Acquisition and Representation: Bootstrapping Annotated Language Data, Las Palmas de Gran Canaria, Spain, pp.8–15.
Prank, A., J. van Genabith and A. Way. 2001. Treebank vs. X-BAR based Automatic F-Structure Annotation. In M. Butt and T. Holloway-King (eds.) LFG-2001: Proceedings of 6th International Conference on Lexical Functional Grammar, CSLI Publications, Stanford, CA., pp.127–146.
Kaplan, R. and J. Bresnan. 1982. Lexical Functional Grammar: A Formal System for Grammatical for Grammatical Representation. In J. Bresnan (ed.) The Mental Representation of Grammatical Relations, MIT Press, Cambridge, MA., pp.173–281.
Kaplan, R., K.. Netter, J. Wedekind and A. Zaenen. 1989. Translation by Structural Correspondences. In Proceedings of the Fourth Conference of the European Association for Computational Linguistics, Manchester, England, pp.272–281.
Mima, H., H. Iida and O. Furuse. 1998. Simultaneous Interpretation Utilizing Example-Based Incremental Transfer. In 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, Montreal, Quebec, Canada, pp.855–861.
Poutsma, A. 1998. Data-Oriented Translation. In Computational Linguistics in the Netherlands: Ninth CLIN Meeting, Leuven, Belgium, [no page numbers].
Poutsma, A. 2000. Data-Oriented Translation: Using the Data-Oriented Parsing framework for Machine Translation, MSc Thesis, University of Amsterdam, Amsterdam, The Netherlands.
Sadler, L., I. Crookston, D. Arnold and A. Way. 1990. LFG and Translation. In Third Conference on Theoretical and Methodological Issues in Machine Translation, Austin, TX., pp.121–130.
Sadler, L., I. Crookston and A. Way. 1989. Co-description, projection, and ‘difficult’ translation. Working Papers in Language Processing 8, Department of Language and Linguistics, University of Essex, Colchester, UK.
Sato, S. and M. Nagao. 1990. Towards Memory-based Translation. In COLING-90, Papers Presented to the 13th International Conference on Computational Linguistics, Helsinki, Finland, 3:247–252.
Schäler, R. 2001. Beyong Translation Memories. In M. Carl and A. Way (eds.) Proceedings of the Workshop on Example-Based Machine Translation, MT Summit VIII, Santiago de Compostela, Spain, pp.49–55.
Sima’an, K. 1999. Learning Efficient Disambiguation, PhD Thesis, University of Utrecht, Utrecht, The Netherlands.
Somers, H. 1999. Example-based Machine Translation. Machine Translation 14(2):113–157 (cf. Chapter 1, this volume).
Somers, H., I. McLean and D. Jones 1994. Experiments in Multilingual Example-Based Generation. In CSNLP-94: 3rd Conference on the Cognitive Science of Natural Language Processing, Dublin, Ireland, [no page numbers].
Somers, H., J. Tsujii and D. Jones. 1990. Machine Translation without a source text. In COLING-90, Papers Presented to the 13th International Conference on Computational Linguistics, Helsinki, Finland, 3:271–276.
Watanabe, H. 1992. A Similarity-Driven Transfer System. In Proceedings of the fifteenth [sic] International Conference on Computational Linguistics, COLING-92, Nantes, France, 2:770–776
Watanabe, H. 1994. A Method for Distinguishing Exceptional and General Examples in Example-based Transfer Systems. In COLING-94: The 15th International Conference on Computational Linguistics, Kyoto, Japan, pp.39–44.
Watanabe, H. and K. Takeda. 1998. A Pattern-based Machine Translation System Extended by Example-based Processing. In 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, Montreal, Quebec, Canada, 2:1369–1373.
Way, A. 2001. LFG-DOT: A Hybrid Architecture for Robust MT, PhD Thesis, University of Essex, Colchester, UK.
Way, A. 2003. Machine Translation using LFG-DOP. In R. Bod, R. Scha and K. Sima’an (eds.) Data-Oriented Parsing, CSLI, Stanford, CA.
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer Science+Business Media Dordrecht
About this chapter
Cite this chapter
Way, A. (2003). Translating with Examples: The LFG-DOT Models of Translation. In: Carl, M., Way, A. (eds) Recent Advances in Example-Based Machine Translation. Text, Speech and Language Technology, vol 21. Springer, Dordrecht. https://doi.org/10.1007/978-94-010-0181-6_16
Download citation
DOI: https://doi.org/10.1007/978-94-010-0181-6_16
Publisher Name: Springer, Dordrecht
Print ISBN: 978-1-4020-1401-7
Online ISBN: 978-94-010-0181-6
eBook Packages: Springer Book Archive