Skip to main content

Advertisement

Log in

Syntactic mismatches in machine translation

  • Original Paper
  • Published:
Machine Translation

Abstract

This paper addresses one of the central problems arising at the transfer stage in machine translation: syntactic mismatches, that is, mismatches between a source-language sentence structure and its equivalent target-language sentence structure. The level at which we assume the transfer to be carried out is the Deep-Syntactic Structure (DSyntS) as proposed in the Meaning-Text Theory (MTT). DSyntS is abstract enough to avoid all types of divergences that result either from restricted lexical co-occurrence or from surface-syntactic discrepancies between languages. As for the remaining types of syntactic divergences, all of them occur not only interlinguistically, but also intralinguistically; this means that establishing correspondences between semantically equivalent expressions of the source and target languages that diverge with respect to their syntactic structure is nothing else than paraphrasing. This allows us to adapt the powerful intralinguistic paraphrasing mechanism developed in MTT for purposes of interlinguistic transfer.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

Explore related subjects

Discover the latest articles and news from researchers in related subjects, suggested using machine learning.

References

  • Abraham W (2003) Canonic and non-canonic deliberations about epistemic modality: Its emergence out of where? In Koster J, van Riemsdijk H (eds) Germania et alia: A linguistic webschrift for Hans den Besten. Published online at www.let.rug.nl/k͂oster/DenBesten/Abraham.pdf [Last accessed 27 July 2006]

  • Apresjan JD, Boguslavskij IM, Gecelevič E, Iomdin LL, Lazurskij AV, Percov NV, Sannikov VZ, Cinman LL [Апресян ЮД, Богуславский ИМ, Гецелевич Е, Иомдин, ЛЛ, Лазурский AB, Перцов HB, Cанников BЗ, Цинман ЛЛ] (1992) Лингвистический процессор для сложньıx систем [Linguistic processor for complex information systems]. Nauka, Moskva

  • Apresjan JD, Boguslavskij IM, Iomdin LL, Lazurskij AV, Percov NV, Sannikov VZ, Cinman LL [Апресян ЮД, Богуславский ИМ Иомдин, ЛЛ, Лазурский AB, Перцов HB, Cанников BЗ, Цинман ЛЛ] (1989) Лингвистическое обеспечение системъı ЭТАП-2 [Linguistic software of the ÈTAP-2 system]. Nauka, Moskva

  • Apresjan JD, Boguslavsky IM, Iomdin LL, Tsinman LL (in press) Lexical functions in actual NLP-applications. To appear in Wanner L (ed) Selected lexical and grammatical topics in the Meaning-Text Theory: In honour of Igor Mel’čuk, John Benjamins, Amsterdam pp 199–228

  • Barnett J, Mani I, Rich E, Aone C, Knight K, Martinez JC (1991) Capturing language-specific semantic distinctions in interlingua-based MT. In: Machine translation summit III proceedings, Washington, DC, pp 25–32

  • Barwise J, Perry J (1983) Situations and attitudes. The MIT Press, Cambridge, MA

  • Bohnet B (2005) Textgenerierung als Transduktion linguistischer Strukturen [Text generation as the transduction of linguistic structures]. Dissertation, Universität Stuttgart, Germany

  • Bohnet B, Wanner L (2001) On using a parallel graph rewriting grammar formalism in generation. In: Proceedings ACL 2001 workshop, eighth European workshop on natural language generation (EWNLG), Toulouse, France, pp 47–56

  • Bresnan J (ed) (1982) The mental representation of grammatical relations. MIT Press, Cambridge, MA

  • Cicekli I, Güvenir HA (2003) Learning translation templates from bilingual translation examples. In: Carl M, Way A (eds) Recent advances in example-based machine translation. Kluwer Academic Publishers, Boston, pp 247–278

  • čmejrek M, Cuřín J, Havelka J (2003) Treebanks in machine translation. In: Proceedings of the second workshop on treebanks and linguistic theories, Växjö, Sweden, 209–212

  • Dave S, Parikh J, Bhattacharyya P (2001) Interlingua-based English–Hindi machine translation. Mach Translat 16:251–304

    Google Scholar 

  • Doherty M (1999) Sprachspezifische Aspekte der Informationsverteilung [Language-specific aspects of the distribution of information]. Akademie Verlag, Berlin

  • Dorr BJ (1993) Machine translation: A view from the lexicon. The MIT Press, Cambridge, MA

  • Dorr BJ (1994) Machine translation divergences: A formal description and proposed solution. Comput Ling 20:579–634

    Google Scholar 

  • Dorna M, Emele MC (1996) Efficient implementation of a semantic-based transfer approach. In: Proceedings of the 12th European conference on artificial intelligence, Budapest, 567–571

  • Dymetman M, Tendeau F (2000) Context-free grammar rewriting and the transfer of packed linguistic representations. In: Proceedings of the 18th international conference on computational linguistics, COLING 2000 in Europe, Saarbrücken, Germany, pp 1016–1020

  • Emele MC, Dorna M (1998) Ambiguity preserving machine translation using packed representations. In: COLING-ACL ’98: 36th annual meeting of the Association for Computational Linguistics and 17th international conference on computational linguistics, Montreal, Que, 365–371

  • Gawron JM (1999) Abduction and mismatch in machine translation. Technical report, SRI International, Stanford, CA

  • Gildea D (2003) Loosely tree-based alignment for machine translation. In: ACL-03: 41st annual meeting of the Association for Computational Linguistics, Sapporo, Japan, pp 80–87

  • Grupo DiCE (n.d.) DICE: Diccionario de colocaciones del español [DICE: Dictionary of Spanish collocations]. Website http://www.dicesp.com [Last accessed 7 July 2006]

  • Gupta D, Chatterjee N (2001) Study of divergence for example based English-Hindi machine translation. In: STRANS–2001, Symposium on translation support systems, Kanpur, India

  • Han C, Lavoie B, Palmer M, Rambow O, Kittredge R, Korelsky T, Kim N, Kim M (2000) Handling structural divergences and recovering dropped arguments in a Korean/English machine translation system. In: White JS (ed) Envisioning machine translation in the information future: 4th conference of the Association for Machine Translation in the Americas, AMTA 2000, Cuernavaca, Mexico, October 2000, Springer Verlag, Berlin, pp 40–53

  • Iordanskaja L, Mel’čuk, IA (1997) Le corps humain en russe et en français: Vers un dictionaire explicatif et combinatoire bilingue. [The human body in French and Russian: Towards an explanatory and combinatorial dictionary]. Cah Lexicol 70:249–281

  • Jackendoff R (1990) Semantic structures. The MIT Press, Cambridge, MA

    Google Scholar 

  • Kamp H, Reyle U (1993) From discourse to logic. Reidel, Dordrecht

    Google Scholar 

  • Kaplan RM, Netter K, Wedekind J, Zaenen A (1996) Translation by structural correspondences. In: Dalrymple M, Kaplan RM, Maxwell JT, Zaenen A (eds) Formal issues in lexical-functional grammar, CSLI, Stanford, CA, pp 311–329

  • Mel’čuk IA [Meльчук ИA] (1974) Oпьıт теории лингвистическиx моделей “Смьıсл ⇔ Текст” [Outline of the “Meaning⇔Text” theory of linguistic models]. Nauka, Moskva

  • Mel’čuk IA (1988a) Dependency syntax: theory and practice. State University of New York Press, Albany, NY

  • Mel’čuk IA (1988b) Paraphrase et lexique dans la théorie linguistique Sens-Texte. [Paraphrase and lexicon in the Meaning-Text linguistic theory]. Cah Lexicol 52.1:5–50, 52.2:5–53

    Google Scholar 

  • Mel’čuk IA (1992) Paraphrase et lexique:la théorie Sens-Texte et le dictionnaire explicatif et combinatoire [Paraphrase and the lexicon:the Meaning-Text Theory and the explanatory and combinatorial dictionary]. In: Mel’čuk et al (1992), pp 9–59

  • Mel’čuk IA (1993) The future of the lexicon in linguistic description and the explanatory combinatorial dictionary. In: Lee I-H (ed) Linguistics in the morning calm 3. Linguistic Society of Korea, Seoul, pp 181–270

  • Mel’čuk IA (1995a) Phrasemes in language and phraseology in linguistics. In: Everaert M, van der Linden EJ, Schenk A, Schreuder R (eds) Idioms: structural and psychological perspectives, Lawrence Erlbaum Associates, Hillsdale, NJ, pp 167–232

  • Mel’čuk IA (1995b) Syntactic or lexical zero. In: Mel’čuk IA [Meльчук ИA] Pуccкий язьıк в мoдeли Cмьıсл ⇔ Teкcт [Russian language in the Meaning ⇔ Text model]. Wiener Slawistischer Almanach, Wien, pp 169–205

  • Mel’čuk I (1996) Lexical functions: A tool for the description of lexical relations in a lexicon. In: L (ed) Lexical functions in lexicography and natural language processing. John Benjamins, Amsterdam, pp 37–102

  • Mel’čuk IA (2001) Communicative organization in natural language (The semantic-communicative structure of sentences). John Benjamins, Amsterdam

  • Mel’čuk IA (2004a) Actants in semantics and syntax I: Actants in semantics. Ling 42:1–66

    Google Scholar 

  • Mel’čuk IA (2004b) Actants in semantics and syntax II: Actants in syntax. Ling 42:247–291

    Google Scholar 

  • Mel’čuk I, Arbatchewsky-Jumarie N, Dagenais L, Elnitsky L, Iordanskaja L, Lefebvre M-N, Mantha S (1988) Dictionnaire explicatif et combinatoire du français contemporain. Recherches lexico-sémantiques II [Explanatory combinatorial dictionary of modern French. Lexical-semantic studies II]. Montréal, Les Presses de l’Université de Montréal

  • Mel’čuk I, Arbatchewsky-Jumarie N, Elnitsky L, Iordanskaja L, Lessard A (1984) Dictionnaire explicatif et combinatoire du français contemporain. Recherches lexico-sémantiques I [Explanatory combinatorial dictionary of modern French. Lexical-semantic studies I]. Les Presses de l’Université de Montréal, Montréal

  • Mel’čuk I, Arbatchewsky-Jumarie N, Iordanskaja L, Mantha S (1992) Dictionnaire explicatif et combinatoire du français contemporain. Recherches lexico-sémantiques III [Explanatory and combinatorial dictionary of modern French. Lexico-semantic research III]. Les Presses de l’Université de Montréal, Montréal

  • Mel’čuk I, Arbatchewsky-Jumarie N, Iordanskaja L, Mantha S, Polguère A (1999) Dictionnaire explicatif et combinatoire du français contemporain. Recherches lexico-sémantiques IV [Explanatory combinatorial dictionary of modern French. Lexical-semantic studies IV]. Les Presses de l’Université de Montréal, Montréal

  • Mel’čuk IA, Polguère A (1987) A formal lexicon in the Meaning-Text theory (or how to do lexica with words). Comput Ling 13:276–289

    Google Scholar 

  • Mel’čuk IA, Polguère A (forthcoming) Lexique actif du français: Dictionnaire de dérivations sémantiques et de collocations [Active French lexicon: Dictionary of semantic derivations and collocations]. Duculot, Louvain-la-Neuve

  • Mel’čuk I, Wanner L (2001) Towards a lexicographic approach to lexical transfer in machine translation (illustrated by the German–Russian language pair). Mach Translat 16:21–87

    Google Scholar 

  • Mel’čuk IA, Zholkovsy AK (1984) Explanatory combinatorial dictionary of modern Russian. Wiener Slawistischer Almanach, Wien

  • Milićević J (2003) Modélisation sémantique, lexicale et syntaxique de la paraphrase [Semantic, lexical and syntactic models of paraphrase]. Thèse de doctorat, Département de linguistique, Université de Montréal, Montréal

  • Paslawska A, von Stechow A (2003) Perfect readings in Russian. In: Rathert M, Alexiadou A, von Stechow A (eds) Perfect explorations. Mouton de Gruyter, Berlin, pp 307–363

  • Reyle U (1993) Dealing with ambiguities by underspecification: Construction, representation and deduction. Jnl Semant 10:123–179

    Article  Google Scholar 

  • Sacker U (1983) Aspektueller und resultativer Verbalausdruck [Aspectual and resultative verbal expression]. Gunter Narr Verlag, Tübingen

  • Sanromán Vilas B, Lareo Martín I, Alonso Ramos M (1999) Transferencia léxica y reglas de paráfrasis: verbos denominales de SP cognado [Lexical transfer and rules of paraphrase: denominal verbs with a cognate object]. Rev Soc Españ Proc Leng Nat 25:183–189

  • Specht V (2002) Entwurf und Implementierung einer Paraphrasierungsgrammatik für maschinelle Übersetzung [Design and implementation of a paraphrase grammar for machine translation]. Institut für Informatik, Universität Stuttgart, Germany

  • Steiner E (2001) Translations English – German: Investigating the relative importance of systemic contrasts and of the text-type “translation”. In: Proceedings of the symposium “Information structure in a cross-linguistic perspective”, SPRIKreports 7, Oslo

  • Wanner L (1996) Lexical choice in text generation and machine translation. Mach Translat 11:3–35

    Article  Google Scholar 

  • Weinstein CJ, Lee YS, Seneff S, Tummala DR, Carlson B, Lynch JT, Hwang JT, Kukolich LC (1997) Automated English/Korean translation for enhanced coalition communication. Lincoln Lab Jnl 10:35–60

  • Žolkovskij AK, Mel’čuk IA [Жoлкoвский AК, Meльчук ИA] (1967) O ceмантичecкoм cинтeзe [On semantic synthesis]. Probl Kybernet 19:177–238

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Igor Mel’čuk.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Mel’čuk, I., Wanner, L. Syntactic mismatches in machine translation. Machine Translation 20, 81–138 (2006). https://doi.org/10.1007/s10590-006-9013-7

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10590-006-9013-7

Keywords