Abstract
This contribution outlines an international research effort to create a typology of syntactic idioms on the borderline between the dictionary and the grammar. Recent studies on the adequate description of such units, especially for modern Russian, have produced two types of linguistic resources: a microsyntactic dictionary of Russian and a microsyntactically annotated corpus of Russian texts. Our goal now is to discover to what extent these findings can be generalized cross-linguistically in order to create analogous multilingual resources. The initial work consists in constructing a typology of relevant phenomena. The empirical base is provided by closely related languages that are mutually intelligible to various degrees. We start by creating an inventory for this typology for four representative Slavic languages: Russian (East Slavic), Bulgarian (South Slavic), and Polish and Czech (West Slavic). Our preliminary results show that the aim is attainable and can be relevant to theoretical, comparative and applied linguistics as well as to NLP tasks.
Notes
1. This term has been actively used since the 1970s in language learning studies, e.g. (Hakuta 1974).
References
Apresjan, J.D., Boguslavsky, I.M., Iomdin, L.L., Sannikov, V.Z.: Theoretical problems of Russian syntax. Interaction of the grammar and the lexicon. [Teoretičeskie problemy russkogo sintaksisa]. In: Apresjan, J.D. (ed.). Jazyki slavjanskix kultur Publishers, Moscow, 408 p. (2010). ISBN 978-5-9551-0386-0. (in Russian)
Apresjan, J.D., Boguslavsky, I.M., Iomdin, L.L., Tsinman, L.L.: Lexical functions in NLP: possible uses. Computational Linguistics for the New Millennium: Divergence or Synergy? Festschrift in Honour of Peter Hellwig on the occasion of his 60th Birthday, Peter Lang, pp. 55–72 (2002)
Apresjan, J.D., Iomdin, L.L.: The construction of the NEGDE SPAT’ type: syntax, semantics, lexicography. [Konstrukcija tipa NEGDE SPAT’: sintaksis, semantika, leksikografija]. Semiotika i informatika, pp. 34–92. Vsesojuznyj institut nauchnoj i texnicheskoj informacii, AN SSSR, Moscow (1989). (in Russian)
Avgustinova, T.: Russian infinitival existential constructions from an HPSG perspective. In: Kosta, P. et al. (eds.) Investigations into Formal Slavic Linguistics. Contributions of the Fourth European Conference on Formal Description of Slavic Languages, pp. 461–482. Peter Lang Europäischer Verlag der Wissenschaft (2003)
Boguslavsky, I., Dyachenko, P., Barrios Rodríguez, M.A.: CALLEX-ESP: a software system for learning Spanish lexicon and collocations. In: Current Developments in Technology-Assisted Education. Badajoz (Spain): FORMATEX, vol. 1, pp. 22–26 (2006)
Čermák, F.: Grammatical Idioms. Philologica Pragensia, XVII, vol. 2, pp. 75–90 (2007)
Croft, W., Nordquist, D., Looney, K., Regan, M.: Linguistic typology meets universal dependencies. In: Proceedings of the 15th International Workshop on Treebanks and Linguistic Theories (TLT15), pp. 63–75 (2017)
Hakuta, K.: Prefabricated patterns and the emergence of structure in second language acquisition. Lang. Learn. 24(2), 287–297 (1974)
Iomdin, L.L.: Polysemous syntactic idioms: between the vocabulary and the syntax [Mnogoznačnye sintaksičeskie frazemy: meždu leksikoj i sintaksisom]. In: Computational Linguistics and Intellectual Technologies. Proceedings of the International Conference “Dialog-2006”, Moscow, RGGU Publishers, pp. 202–206 (2006). (in Russian)
Iomdin, L.: Between the syntactic idiom and syntactic construction. Nontrivial cases of microsyntactic ambiguity. [Meždu sintaksičeskoj frazemoj i sintaksičeskoj konstruktsiej. Netrivial’nye slučai mikrosintaksičeskoj neodnoznačnosti]. In: SLAVIA, časopis pro slovanskou filologii, ročník 68, sešit 2–3, pp. 230–243 (2017). (in Russian)
Jackendoff, R.: Twisting the night away. Language 73, 534–559 (1997)
Langacker, R.W.: Indeterminacy in semantics and grammar. Paper presented to the Estudios de Lingüística cognitiva, 1998 (1998)
MAS: The Small Academic Dictionary of Russian in 4 volumes, A. P. Evgenyeva, ed. [Slovar’ russkogo jazyka v 4-x tomax, MAS, Malyj Akademičeskij Slovar]. Russkij Jazyk Publisher, Moscow (1999). http://feb-web.ru/feb/mas/mas-abc/default.asp
Mel’čuk, I.: Lexical functions: a tool for the description of lexical relations in a lexicon. In: Lexical Functions in Lexicography and Natural Language Processing, vol. 31, pp. 37–102 (1996)
Mel’čuk, I.: Collocations and lexical functions. In: Phraseology. Theory, Analysis, and Applications, pp. 23–53 (1998)
Rogozhnikova, R.P.: An Explanatory Dictionary of Collocations Equivalent to Words [Tolkovyj slovar sočetanij, ekvivalentnyx slovu]. Moscow, Astrel, 414 p. (2003). (in Russian)
Sag, I.A., Baldwin, T., Bond, F., Copestake, A., Flickinger, D.: Multiword expressions: a pain in the neck for NLP. Paper presented to the International Conference on Intelligent Text Processing and Computational Linguistics (2002)
Sinclair, J.: Corpus, concordance, collocation. Oxford University Press (1991)
Wanner, L.: Lexical functions in lexicography and natural language processing. John Benjamins Publishing (1996)
Warren, B.: A model of idiomaticity. Nordic J. Engl. Stud. 4, 35–54 (2005)
Wray, A.: Formulaic language in computer-supported communication: theory meets reality. Lang. Awareness 11, 114–131 (2002)
Acknowledgements
The authors are grateful to the Russian National Foundation (grant No. 16-18-10422-P) and the German Science Foundation (DFG, grant within the Collaborative Research Centre SFB 1102) for their partial support of this research.
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Avgustinova, T., Iomdin, L. (2019). Towards a Typology of Microsyntactic Constructions. In: Corpas Pastor, G., Mitkov, R. (eds) Computational and Corpus-Based Phraseology. EUROPHRAS 2019. Lecture Notes in Computer Science, vol 11755. Springer, Cham. https://doi.org/10.1007/978-3-030-30135-4_2
DOI: https://doi.org/10.1007/978-3-030-30135-4_2
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-30134-7
Online ISBN: 978-3-030-30135-4
eBook Packages: Computer Science, Computer Science (R0)