Skip to main content
Log in

Morphological mismatches in machine translation

  • Published:
Machine Translation

Abstract

This paper addresses one of the least studied, although very important, problems of machine translation—the problem of morphological mismatches between languages and their handling during transfer. The level at which we assume transfer to be carried out is the Deep-Syntactic Structure (DSyntS) as proposed in the Meaning-Text Theory. DSyntS is abstract enough to avoid all types of surface morphological divergences. For the remaining ‘genuine’ divergences between grammatical significations, we propose a morphological transfer model. To illustrate this model, we apply it to the transfer of grammemes of definiteness and aspect for the language pair Russian–German and German–Russian, respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

Abbreviations

BI:

Bilingual index

DSyntA:

Deep-syntactic actant

DSynt-CommS:

Deep-syntactic communicative structure

DSyntS:

Deep-syntactic structure

d:

A particular derivateme

ECD:

Explanatory combinatorial dictionary

g:

A particular grammeme

GS:

Grammatical signification

L:

A particular lexeme

\({{\mathcal L}}\) :

A particular language

LF:

Lexical function

LU:

Lexical unit

MT:

Machine translation

MTT:

Meaning-Text Theory

SAE:

Standard Average European (language)

SSyntS:

Surface-syntactic structure

X S /X T :

Source/target X (with X being an LU, \({{\mathcal L}}\) , ...)–Lexical units are in CAPITALS; in the text, grammatical significations are in small capitals and ‘single quotes’; (quasi-)grammemes, when encoded as subscripts to LUs, are in small capitals without quotes. Linguistic examples are in ordinary italics; meanings, including glosses, are in ‘single quotes’.

References

  • Anscombre J-C (1986) Article zéro, termes de masse et représentation d’événements en français contemporain. In: David J, Kleiber G(eds) Déterminants: syntaxe et sémantique. Université de Metz [= Recherches linguistiques, XI], Metz, pp 5–34

    Google Scholar 

  • Apresjan JD (1988) Glagoly momental′nogo dejstvija i performativy v russkom jazyke. In: Karaulov JN(eds) Rusistika segodnja. Jazyk: sistema i ee funkcionirovanie. Nauka, Moskva, pp 57–78

    Google Scholar 

  • Apresjan JD, Boguslavskij IM, Iomdin LL, Lazurskij AV, Pertsov NV, Sannikov VZ, Cinman LL (1989) Lingvističeskoje obespečenie sistemy ÈTAP-2. Nauka, Moscow

    Google Scholar 

  • Apresjan JD, Boguslavsky IM, Iomdin LL, Tsinman LL (2007) Lexical functions in actual NLP-applications. In: Wanner L(eds) Selected lexical and grammatical topics in the Meaning-Text theory. In honour of Igor Mel’čuk. Benjamins, Amsterdam, pp 203–233

    Google Scholar 

  • Birkenmaier W (1979) Artikelfunktionen in einer artikellosen Sprache. Wilhelm Fink Verlag, München

    Google Scholar 

  • Bisle-Müller H (1991) Artikelwörter im Deutschen. Semantische und pragmatische Aspekte ihrer Verwendung. Tübingen, Niemeyer

    Google Scholar 

  • Dressler W, Merlini L (1994) Morphopragmatics. Diminutives and Intensifiers in Italian, German, and Other Languages. de Gruyter, Berlin, New York

    Google Scholar 

  • Fellbaum Ch (ed) (1998) WordNet: an electronic lexical database. The MIT Press, Cambridge, MA

    MATH  Google Scholar 

  • Gawrońska-Werngren B (1990) Translation great problem—on the problem of inserting articles when translating from Russian into Swedish. COLING-90, Papers Presented to the 13th International Conference on Computational Linguistics, Helsinki, Finland, pp 133–138

  • Gawrońska B (1992) Aspect—a problem for MT. Proceedings of the Fifteenth International Conference on Computational Linguistics, COLING-92, Nantes, France, pp 652–657

  • Gladrow W (1979) Die Determination des Substantivs im Russischen und im Deutschen. VEB Verlag Enzylopädie, Leipzig

    Google Scholar 

  • Glovinskaja MJa (1982) Semantičeskie tipy vidovyx protivopostavlenij russkogo glagola. Nauka, Moskva

    Google Scholar 

  • Glovinskaja MJa (2001) Mnogoznačnost’ i sinonimija v vido-vremennoj sisteme russkogo glagola. Azbukovnik/Russkie slovari, Moskva

    Google Scholar 

  • Han C, Lavoie B, Palmer M, Rambow O, Kittredge R, Korelsky T, Kim N, Kim M (2000) Handling structural divergences and recovering dropped arguments in a Korean/English machine translation system. Envisioning Machine Translation in the Information Future, 4th Conference of the Association for Machine Translation in the Americas, AMTA 2000. LNAI 1934, Springer, Berlin, pp 40–53

  • Hansen B (1995) Die deutschen Artikel und ihre Wiedergabe im Türkischen. Arbeiten zur Mehrsprachigkeit Bd. 53. Germanistisches Seminar, Hamburg

    Google Scholar 

  • Heim I (2001) Artikel und Definitheit. In: Stechow A, Wunderlich D(eds) Semantik. (HSK 6). de Gruyter, Berlin, New York, pp 487–534

    Google Scholar 

  • Himmelmann NP (2001) Articles. In: Haspelmath M, König E, Oesterreicher W, Raible W(eds) Sprachtypologie und sprachliche Universalien (HSK 20.1). de Gruyter, Berlin, pp 831–842

    Google Scholar 

  • Kabakčiev K (1984) The article and the aorist/imperfect distinction in Bulgarian: an analysis based on cross-language ‘aspect’ parallelisms. Linguistics 22: 643–672

    Article  Google Scholar 

  • Kobozeva IM (1980) Nekotorye pravila vybora vida glagola pri sinteze prostogo predloženija, vyražajuščego zadannyj smysl. Aktual’nye voprosy strukturnoj i prikladnoj lingvistiki. Izdatel’stvo Gos. Moskovskogo Universiteta, Moskva, pp 91–103

    Google Scholar 

  • Kovtun O (2003) Zur unbestimmten Determination im Deutschen im Vergleich zum Ukrainischen und Russischen. In: Hoffmann L(eds) Funktionale Syntax: Prinzipien und Prozeduren. de Gruyter, Berlin, pp 335–347

    Google Scholar 

  • Leiss E (2000) Artikel und Aspekt: die grammatischen Muster von Definitheit. de Gruyter, Berlin

    Google Scholar 

  • Marcu D, Wang W, Echihabi A, Knight K (2006) SPMT: statistical machine translation with syntactified target language phrases. Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, Australia, pp 44–52

  • Mehlig HR (1978) Überlegungen zur Funktion und Determinierung der Aspekte im Russischen. Slavistische Linguistik 1978 (= Slavistische Beiträge 133). Sagner, München

  • Mehlig HR (1988) Verbalaspect und Determination. Slavistische Linguistik 1987 (= Slavistische Beiträge 230). Sagner, München

    Google Scholar 

  • Mehlig HR (1989) Thema-Rhema Gliederung und Verbalaspekt. Zum Aspektgebrauch in russischen Fragesätzen. Slavistische Linguistik 1988 (= Slavistische Beiträge 242). Sagner, München

    Google Scholar 

  • Mel’čuk IA (1996) Lexical functions: a tool for the description of lexical relations in a lexicon. In: Wanner L(eds) Lexical functions in lexicography and natural language processing. Benjamins Academic Publishers, Amsterdam, Philadelphia, pp 37–102

    Google Scholar 

  • Mel’čuk IA (2006) Aspects of the theory of morphology. Mouton de Gruyter, Berlin, New York

    Google Scholar 

  • Mel’čuk I, Wanner L (2001) Towards a lexicographic approach to lexical transfer in machine translation (illustrated by the German–Russian language pair). Mach Transl 16(1): 21–87

    Article  MATH  Google Scholar 

  • Mel’čuk I, Wanner L (2006) Syntactic mismatches in machine translation. Mach Transl 20(2): 81–138

    Google Scholar 

  • Nagao M (1984) A framework of a mechanical translation between Japanese and English by analogy principle. In: Elithorn A, Banerji R(eds) Artificial and human intelligence. Elsevier Science Publishers, Amsterdam, pp 173–180

    Google Scholar 

  • Ney H, Matusov E, Mauser A, Vilar D, Zens R (2007) Statistical MT from TMI-1988 to TMI-2007: what has happened? Invited talk at TMI-2007. Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation. Skövde, Sweden

  • Nikolaeva TM (1959) Opredelenie vida russkogo glagola s pomošč’ju konteksta. Mašinnyi perevod i prikladnaja lingvistika. 2: 38–44

    Google Scholar 

  • Olsen M, Traum D, Van Ess-Dykema C, Weinberg A (2001) Implicit cues for explicit generation: using telicity as a cue for tense structure in a Chinese to English MT system. MT summit VIII: Machine translation in the information age. Santiago de Compostela, Spain, pp 259–264

    Google Scholar 

  • Padučeva EV (1996) Semantičeskie issledovanija. Semantika vremeni i vida. Semantika narrativa. Škola “Jazyki russkoj kul’tury”, Moscow

    Google Scholar 

  • Padučeva EV (1998) Semantičeskie istočniki momental’nosti russkogo glagola v tipologičeskom rakurse. In: Čertkova MJu(eds) Tipologija vida: problemy, poiski, rešenija. Škola “Jazyki russkoj kul’tury”, Moscow, pp 332–342

    Google Scholar 

  • Rigau G, Magnini B, Aggire E, Vossen P, Carroll JA (2002) MEANING: a roadmap to knowledge technologies. Proceedings of the Workshop A Roadmap for Computational Linguistics, International Conference On Computational Linguistics, COLING-02, Taipei, Taiwan, pp 1–7

  • Sacker U (1983) Aspektueller und resultativer Verbalausdruck. Gunter Narr Verlag, Tübingen

    Google Scholar 

  • Sanromán Vilas B, Lareo Martín I, Alonso Ramos M (1999) Transferencia léxica y reglas de paráfrasis: verbos denominales de SP cognado. Revista de la Sociedad Española para el Procesamiento del Lenguaje Natural 27: 183–189

    Google Scholar 

  • Scheljakin MA, Schlegel H (1970) Der Gebrauch des russischen Verbalaspekts I. Potsdamer Pädagogische Hochschule, Potsdam

    Google Scholar 

  • Somers H (1999) Example-based machine translation. Mach Transl 14(2): 113–157

    Article  MathSciNet  Google Scholar 

  • Vater H (1997) Determination im Deutschen und Polnischen. In: Koniuszaniec G, Prokop I(eds) Festschrift für Andrzej Z. Bzděga zum 70. Geburtstag (= Studia Germanica Posnaniensia XXII). Uniwersytet Poznańskij, Poznań, pp 215–229

    Google Scholar 

  • Vater H (2002) The word class ‘article’. In: Cruse A(eds) Lexikologie (HSK 15.1). de Gruyter, Berlin, pp 621–628

    Google Scholar 

  • Vossen P (ed.) (1999) EuroWordNet. General document. Technical report. Amsterdam, University of Amsterdam

  • Vossen P, Aggire E, Calzolari N, Fellbaum C, Hsieh S, Huang C, Isahara H, Kanzaki K, Marchetti A, Monachini M, Neri F, Raffaelli R, Rigau G, Tescon M (2008) KYOTO: a system for mining, structuring, and distributing knowledge across languages and cultures. Proceedings of LREC 2008. Marrakech, Morocco

  • Wierzbicka A (1967) On the semantics of the verbal aspect in Polish. In: To honor Roman Jakobson. The Hague, Mouton, pp 2231–2249

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Igor Mel’čuk.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Mel’čuk, I., Wanner, L. Morphological mismatches in machine translation. Machine Translation 22, 101–152 (2008). https://doi.org/10.1007/s10590-009-9051-z

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10590-009-9051-z

Keywords

Navigation