Skip to main content

Translating with Examples: The LFG-DOT Models of Translation

  • Chapter
Recent Advances in Example-Based Machine Translation

Part of the book series: Text, Speech and Language Technology ((TLTB,volume 21))

Abstract

Machine Translation (MT) systems based on Data-Oriented Parsing (DOP: Bod, 1998) and LFG-DOP (Bod & Kaplan, 1998) may be viewed as instances of Example-Based MT (EBMT). In both approaches, new translations are processed with respect to previously seen translations residing in the system’s database. We describe the DOT models of translation (Poutsma, 1998, Poutsma, 2000) based on DOP. We demonstrate that DOT1 is not guaranteed to produce the correct translation, despite provably deriving the most probable translation. The DOT2 translation model solves most of the problems of DOT1, but suffers from limited compositionality when confronted with certain data. Notwithstanding the success of DOT2, any system based purely on trees will ultimately be found wanting as a general solution to the wide diversity of translation problems, as certain linguistic phenomena require a description at levels deeper than surface syntax. We then show how LFG-DOP can be extended to serve as a novel hybrid model for MT, LFG-DOT (Way, 2001), which promises to improve upon DOT and other EBMT systems.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Abeillé, A., Y. Schabes and A.K. Joshi. 1990. Using Lexicalised Tags for Machine Translation. In COLING-90, Papers Presented to the 13th International Conference on Computational Linguistics, Helsinki, Finland, 3:1–6.

    Google Scholar 

  • Bod, R. 1998. Beyond Grammar: An Experience-Based Theory of Language, CSLI, Stanford, CA.

    Google Scholar 

  • Bod, R. and R. Kaplan. 1998. A Probabilistic Corpus-Driven Model for Lexical-Functional Analysis. In 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, Montreal, Quebec, Canada, 1:145–151.

    Google Scholar 

  • Bonnema, R., P. Buying and R. Scha. 2000. Parse Tree Probability in Data-Oriented Parsing. In CICLING: Proceedings of the International Conference on Intelligent Text Processing and Computational Linguistics, Mexico City, pp.219–232.

    Google Scholar 

  • Cahill, A., M. McCarthy, J. van Genabith and A. Way. 2002. Automatic Annotation of the Penn-Treebank with LFG F-Structure Information. In Proceedings of the LREC Workshop on Linguistic Knowledge Acquisition and Representation: Bootstrapping Annotated Language Data, Las Palmas de Gran Canaria, Spain, pp.8–15.

    Google Scholar 

  • Prank, A., J. van Genabith and A. Way. 2001. Treebank vs. X-BAR based Automatic F-Structure Annotation. In M. Butt and T. Holloway-King (eds.) LFG-2001: Proceedings of 6th International Conference on Lexical Functional Grammar, CSLI Publications, Stanford, CA., pp.127–146.

    Google Scholar 

  • Kaplan, R. and J. Bresnan. 1982. Lexical Functional Grammar: A Formal System for Grammatical for Grammatical Representation. In J. Bresnan (ed.) The Mental Representation of Grammatical Relations, MIT Press, Cambridge, MA., pp.173–281.

    Google Scholar 

  • Kaplan, R., K.. Netter, J. Wedekind and A. Zaenen. 1989. Translation by Structural Correspondences. In Proceedings of the Fourth Conference of the European Association for Computational Linguistics, Manchester, England, pp.272–281.

    Chapter  Google Scholar 

  • Mima, H., H. Iida and O. Furuse. 1998. Simultaneous Interpretation Utilizing Example-Based Incremental Transfer. In 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, Montreal, Quebec, Canada, pp.855–861.

    Google Scholar 

  • Poutsma, A. 1998. Data-Oriented Translation. In Computational Linguistics in the Netherlands: Ninth CLIN Meeting, Leuven, Belgium, [no page numbers].

    Google Scholar 

  • Poutsma, A. 2000. Data-Oriented Translation: Using the Data-Oriented Parsing framework for Machine Translation, MSc Thesis, University of Amsterdam, Amsterdam, The Netherlands.

    Google Scholar 

  • Sadler, L., I. Crookston, D. Arnold and A. Way. 1990. LFG and Translation. In Third Conference on Theoretical and Methodological Issues in Machine Translation, Austin, TX., pp.121–130.

    Google Scholar 

  • Sadler, L., I. Crookston and A. Way. 1989. Co-description, projection, and ‘difficult’ translation. Working Papers in Language Processing 8, Department of Language and Linguistics, University of Essex, Colchester, UK.

    Google Scholar 

  • Sato, S. and M. Nagao. 1990. Towards Memory-based Translation. In COLING-90, Papers Presented to the 13th International Conference on Computational Linguistics, Helsinki, Finland, 3:247–252.

    Google Scholar 

  • Schäler, R. 2001. Beyong Translation Memories. In M. Carl and A. Way (eds.) Proceedings of the Workshop on Example-Based Machine Translation, MT Summit VIII, Santiago de Compostela, Spain, pp.49–55.

    Google Scholar 

  • Sima’an, K. 1999. Learning Efficient Disambiguation, PhD Thesis, University of Utrecht, Utrecht, The Netherlands.

    Google Scholar 

  • Somers, H. 1999. Example-based Machine Translation. Machine Translation 14(2):113–157 (cf. Chapter 1, this volume).

    Article  MathSciNet  Google Scholar 

  • Somers, H., I. McLean and D. Jones 1994. Experiments in Multilingual Example-Based Generation. In CSNLP-94: 3rd Conference on the Cognitive Science of Natural Language Processing, Dublin, Ireland, [no page numbers].

    Google Scholar 

  • Somers, H., J. Tsujii and D. Jones. 1990. Machine Translation without a source text. In COLING-90, Papers Presented to the 13th International Conference on Computational Linguistics, Helsinki, Finland, 3:271–276.

    Google Scholar 

  • Watanabe, H. 1992. A Similarity-Driven Transfer System. In Proceedings of the fifteenth [sic] International Conference on Computational Linguistics, COLING-92, Nantes, France, 2:770–776

    Google Scholar 

  • Watanabe, H. 1994. A Method for Distinguishing Exceptional and General Examples in Example-based Transfer Systems. In COLING-94: The 15th International Conference on Computational Linguistics, Kyoto, Japan, pp.39–44.

    Google Scholar 

  • Watanabe, H. and K. Takeda. 1998. A Pattern-based Machine Translation System Extended by Example-based Processing. In 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, Montreal, Quebec, Canada, 2:1369–1373.

    Google Scholar 

  • Way, A. 2001. LFG-DOT: A Hybrid Architecture for Robust MT, PhD Thesis, University of Essex, Colchester, UK.

    Google Scholar 

  • Way, A. 2003. Machine Translation using LFG-DOP. In R. Bod, R. Scha and K. Sima’an (eds.) Data-Oriented Parsing, CSLI, Stanford, CA.

    Google Scholar 

Download references

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer Science+Business Media Dordrecht

About this chapter

Cite this chapter

Way, A. (2003). Translating with Examples: The LFG-DOT Models of Translation. In: Carl, M., Way, A. (eds) Recent Advances in Example-Based Machine Translation. Text, Speech and Language Technology, vol 21. Springer, Dordrecht. https://doi.org/10.1007/978-94-010-0181-6_16

Download citation

  • DOI: https://doi.org/10.1007/978-94-010-0181-6_16

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-1-4020-1401-7

  • Online ISBN: 978-94-010-0181-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics