Setting up a parallel Russian-Tuvan subcorpus within the electronic corpus of Tuvan language: preliminary outcomes

Authors

DOI:

https://doi.org/10.25178/nit.2020.1.6

Keywords:

Tuvan language; Russian language; translated literature; corpus linguistics; electronic corpus; electronic corpus of Tuvan language; parallel text

Abstract

The article presents preliminary outcomes of setting up a subcorpus of parallel texts in Russia and Tuvan, which is to become part of the electronic corpus of Tuvan language. The work on the subcorpus is being carried out at the Research and Education Center for Turkic Studies at the Tuvan State University.

The subcorpus includes data from a certain number of works of literature translated from Russian into Tuvan. They were written by foreign authors or those from the ethnic republics of the Soviet Union, and then translated into Tuvan. For sources, the corpus relies on the collections of the National Library of Tuva and Tuvan State University, which together have over 700 works of literature that fall within this group.

The Russian-Tuvan parallel text subcorpus includes, among other works, M.Yu. Lermontov’s “A Hero of Our Times”, M.A. Sholokhov’s “Virgin Soil Upturned”, V.P. Katayev’s “I am the Son of the Nation of Labour”, A.P. Chekhov’s “Man in a Case”, A.A. Fadeev’s “The Young Guard”, Maxim Gorky’s “The Rendez Vous”, B.A. Vasilyev’s “The Dawns Here Are Quiet”, Ye. Ilyina’s “Fourth Height”. As of 2019, the total volume of the corpus is 7631813 word samples.

The article provides examples of alignment of Russian and Tuvan parallel texts, with a focus on how words referring to ethnospecific realities are translated into Tuvan. A comparative analysis of Russian-Tuvan parallel texts allows researchers to assess the adequacy of translation, the difficulty of translating vocabulary belonging to the ethnically and culturally specific group of “reality words”, as well as the lexical, semantic and stylistical transformation as a result of translation.

References

Аndreeva, E. G. (2006) Аnaliz perevodcheskikh sootvetstvii na materiale parallel'nogo korpusa tekstov [Analysis of translation correspondences: the case of a parallel corpus of texts]. In: Komp'yuternaya lingvistika i intellektual'nye tekhnologii [Computational linguistics and intellectual technologies]. Proceedings of the international conference "Dialog 2006"] / ed. by N. I. Laufer, A. S. Narignani and V. P. Selegei. Moscow, RGGU. 644 p. Pp. 26–30. (In Russ.).

Bavuu-Surun, M. V. (2006) Voprosy sozdaniya ehlektronnykh resursov tuvinskogo yazyka: nekotorye itogi, neotlozhnye zadachi i perspektivy [Issues of creating electronic resources of the Tuvan language: some results, urgent tasks and prospects]. The New Research of Tuva, no. 4. URL: https://nit.tuva.asia/nit/article/view/610 (access date: 09.11.2019). (In Russ.).

Bavuu-Surun, M. V., Shirobokova, N. N. and Samdan A. M. (2018) The Nomenclature of Medicinal Plants of Tuva in Dictionaries. The New Research of Tuva, no. 4 [on-line] Available at: https://nit.tuva.asia/nit/article/view/805 (accessed: 10.12.2019). (In Russ.). DOI: https://doi.org/10.25178/nit.2018.4.2

Bajyr-ool, А. V. and Ozonova, А. А. (2016) Perevody russkoi literatury na altajskii i tuvinskii yazyki (konets XIX — nachalo XXI v.) [Translations of Russian literature into Altai and Tuvan languages (late 19th — early 21st centuries)]. In: Kniga: Sibir' – Evraziya [Book: Siberia — Eurasia]. Proceedings of the 1st International Congress of SPSL SB RAS / ed. by V. Lizunova. In 4 vol. Novosibirsk, Gosudarstvennaya publichnaya nauchno-tekhnicheskaya biblioteka Sibirskogo otdeleniya Rossiiskoi akademii nauk. Vol. 2. 398 p. Pp. 239–246. (In Russ.).

Bajyr-ool, A. V. and Tyuntesheva E. V. (2016) Leksicheskie sredstva, ispol'zovannye v altaiskom i tuvinskom perevodakh romana M. A. Sholokhova «Podnyataya tselina» [Lexical means used in Altai and Tuvan translations of the novel “Virgin Soil Upturned” by M. A. Sholokhov]. In: Yazyki i fol'klor korennyh narodov Sibiri [Languages and folklore of the indigenous peoples of Siberia], no. 1 (30). Novosibirsk, Federal'noe gosudarstvennoe biudzhetnoe uchrezhdenie nauki Institut filologii Sibirskogo otdeleniia Rossiiskoi akademii nauk. 128 p. Pp. 76–83. (In Russ.).

Dobrovol'skii, D. O. (2009) Korpus parallel'nykh tekstov v issledovanii kul'turno-spetsifichnoi leksiki [A corpus of parallel texts in the study of culturally specific vocabulary]. In: Natsional'nyi korpus russkogo yazyka: 2006–2008. Novye rezul'taty i perspektivy [National Corpus of the Russian Language: 2006-2008. Results and Prospects] / ed. by. V. А. Plungyan. St. Petersburg, Nestor-Istoriya. 502 p. Pp. 383-401. (In Russ.).

Dobrovol'skii, D. O., Kretov, А. А. and Sharov, S. А. (2005) Korpus parallel'nykh tekstov: arkhitektura i vozmozhnosti ispol'zovaniya [A Corpus of Parallel Texts: Architecture and Usability]. In: Natsional'nyi korpus russkogo yazyka. Rezul'taty i perspektivy [National corps of the Russian language. Results and Prospects] / ed. by V. A. Plungyan. Moscow, Indrik. 344 p. Рр. 263–296. (In Russ.).

Dongak, U. А. (2018) Osobennosti pervykh perevodnykh proizvedenii tuvinskoi literatury [Features of the first translated works in Tuvan literature]. Philological Sciences. Issues of Theory and Practice, no. 7 (85), part 1, Pp. 29–33. (In Russ.). DOI: 10.30853/filnauki.2018-7-1.6

Kalzan, А. K. and Kuular, D. S. (1956) O khudozhestvennykh perevodakh s russkogo na tuvinskii yazyk [On the literary translations from Russian into Tuvan]. In: Uchenye zapiski. Vol. IV. Kyzyl, Tuvan book publishing house. Pp. 118–142. (In Russ.).

Mayzina, A. N. (2018) O dvukh perevodakh «Skazki o voennoi taine, Mal'chishe-Kibal'chishe i ego tverdom slove» A. Gaidara na altaiskii iazyk [On two translations of «A tale of a war secret, Malchish-Kibalchish and his word of honour» by A. Gaidar into Altai language]. In: Materialy Vserossiiskoi nauchno- prakticheskoi konferentsii, posviashchennoi tvorchestvu izvestnogo altaiskogo pisatelia i perevodchika Pasleia Samyka [Materials of the All-Russian Scientific and Practical Conference dedicated to the work of the famous Altai writer and translator Pasley Samyk] / ed. by. M. S. Dedina. Gorno-Altaysk, Budget Scientific Institution of the Republic of Altai «Research Institute named after S. S. Surazakova». 376 p. Pp. 198–210. (In Russ.).

Ozonova, A. A. and Tyuntesheva, E. V. (2017) Leksicheskie i grammaticheskie osobennosti altaiskikh perevodov povesti A. P. Chekhova «Palata № 6» [Lexical and grammatical features of Altai translations of A. P. Chekhov's «Ward No. 6»]. Vestnik Novosibirskogo Gosudarstvennogo Universiteta. Series: History and Philology, vol. 16, no. 9, pp. 121–136. (In Russ.).

Ozonova, A. A., Tyntesheva, E. V. and Fedina, N. N. (2019) Iazykovye osobennosti perevoda teksta «Evangeliia ot Matfeia» 1910 g. na altaiskii iazyk [Language features of the translation of the text of the Gospel of St. Matthew into the Altai language (1910)]. North-Eastern Journal of Humanities, no. 1 (26), pp. 85–92. (In Russ.). DOI: 10.25693/SVGV.2019.01.26.011

Ondar, V. S. (2019) Osobennosti perevoda nazvanii pishchi s russkogo yazyka na tuvinskii yazyk (na materiale originala i perevoda romana M. А. Sholokhova «Podnyataya tselina») [Translation of food-related vocabulary from Russian into Tuvan (based on the original and translation of the novel «Virgin Soil Upturned» by M. A. Sholokhov). Filologicheskie nauki: Voprosy teorii i praktiki, no. 7 (85), vol. 12, issue 9, pp. 379–383. (In Russ.).

Salchak, А. Ya. and Bajyr-ool, А. V. (2013) Elektronnyi korpus tuvinskogo yazyka: sostoyanie, problemy [The electronic corpus of the Tuvan language: its current state and problems]. Mir nauki, kul'tury, obrazovaniia, no. 6 (43), pp. 408–409. (In Russ.).

Salchak, А. Ya., Ondar, V. S. and Kuzhuget, Sh. Yu. (2018) Sozdanie ehlektronnykh korpusov tuvinskogo yazyka [Setting up electronic corpora of the Tuvan language]. In: Evdokiya Innokent'evna Korkina: biografika i interpretatsiya nauchnogo i tvorcheskogo naslediya: sbornik nauchnykh statei [Evdokia Innokentyevna Korkina: biography and interpretation of research and creative heritage: a collection of articles] / ed. by N. I. Danilova, N. I. Popova, and E. M. Samsonova. Yаkutsk, Institut gumanitarnykh issledovanii i problem malochislennykh narodov Severa Sibirskogo otdeleniya RАN. 282 p. Pp. 130–133. (In Russ.).

Hertek, A. B. and Oorzhak, B. Ch. (2012) O morfologicheskoi razmetke elektronnogo korpusa tekstov tuvinskogo yazyka [On Tuvan language texts electronic corpus morphological marking]. Philological Sciences. Issues of Theory and Practice, no. 7 (18), part II, pp. 214–218. (In Russ.).

Chertykova, M. D. and Fet, V. Ya. (2018) Prikliucheniia Alisy v strane chudes» L. Kerrolla na khakasskom iazyke: leksiko-semanticheskie osobennosti perevoda [Alice's Adventures in Wonderland in Khakas: Lexico-semantic features of the translation]. Rodnoy Yazyk, no. 1 (8), pp. 85–108. (In Russ.).

Sharaeva T. I. and Aiyzhy E. V. (2019) Tea in the traditional culture of the Kalmyks and Tuvans. The New Research of Tuva, no. 4 [online] Available at: https://nit.tuva.asia/nit/article/view/890 (access date: 10.12.2019). (In Russ.) DOI: 10.25178/nit.2019.4.12

Tyers, F., Bayyr-ool, A., Salchak, A. and Washington, J. (2016) А finite-state morphological analyser for Tuvan. In: Proceedings of the Tenth conference on International Language Resources and Evaluation (LREC'16). Editors: N. Calzolari (Conference Chair) et al. Portorož, European Language Resources Association. 4693 p. Pp. 2562–2567.

Tyers, F., Bayyr-ool, A., Salchak, A. and Washington, J. (2016) Development of a finite-state model for morphological processing of Тuvan. Rodnoy Yazyk, no. 1 (4), pp. 156–187.

Published

04.03.2020

How to Cite

Салчак А. Я., Ондар В. С. Создание русско-тувинского параллельного подкорпуса электронного корпуса тувинского языка: первые итоги [Электронный ресурс] // Новые исследования Тувы. 2020, № 1. URL: https://nit.tuva.asia/nit/article/view/912 (дата обращения: дд.мм.гг.). DOI: 10.25178/nit.2020.1.6

Salchak A. Ya. and Ondar V. S. Setting up a parallel Russian-Tuvan subcorpus within the electronic corpus of Tuvan language: preliminary outcomes. The New Research of Tuva. 2020, no. 1 [online] Available at: https://nit.tuva.asia/nit/article/view/912 (access date ...). DOI: 10.25178/nit.2020.1.6

Issue

Section

Philology

Author Biographies

Aelita Ya. Salchak, Tuvan State University

Candidate of Philology, Associate Professor, Senior Research Fellow, Research and Education Center for Turkic Studies, Tuvan State University.

Postal address: Off. 129, 5 Studencheskii Kvartal, 667000 Kyzyl, Russian Federation.

Tel.: +7 (394-22) 2-19-69.

E-mail: aelita_74@mail.ru

Valentina S. Ondar, Tuvan State University

Сandidate of Philology, Senior Lecturer, Senior Research Fellow, Research and Education Center for Turkic Studies, Tuvan State University.

Postal address: Room 129, 5 Studencheskii Kvartal, 667000 Kyzyl, Russian Federation.

Tel.: +7 (394-22) 2-19-69.

E-mail: barys-hoov@mail.ru