Setting up a parallel Russian-Tuvan subcorpus within the electronic corpus of Tuvan language: preliminary outcomes
DOI: https://doi.org/10.25178/nit.2020.1.6
Keywords: Tuvan language; Russian language; translated literature; corpus linguistics; electronic corpus; electronic corpus of Tuvan language; parallel text
Abstract
The article presents preliminary outcomes of setting up a subcorpus of parallel texts in Russian and Tuvan, which is to become part of the electronic corpus of the Tuvan language. The work on the subcorpus is being carried out at the Research and Education Center for Turkic Studies at Tuvan State University.
The subcorpus includes data from a certain number of works of literature translated from Russian into Tuvan. They were written by foreign authors or those from the ethnic republics of the Soviet Union, and then translated into Tuvan. For sources, the corpus relies on the collections of the National Library of Tuva and Tuvan State University, which together have over 700 works of literature that fall within this group.
The Russian-Tuvan parallel text subcorpus includes, among other works, M.Yu. Lermontov’s “A Hero of Our Time”, M.A. Sholokhov’s “Virgin Soil Upturned”, V.P. Katayev’s “I am the Son of the Nation of Labour”, A.P. Chekhov’s “Man in a Case”, A.A. Fadeev’s “The Young Guard”, Maxim Gorky’s “The Rendez Vous”, B.A. Vasilyev’s “The Dawns Here Are Quiet”, and Ye. Ilyina’s “Fourth Height”. As of 2019, the total volume of the corpus is 7,631,813 word samples.
The article provides examples of alignment of Russian and Tuvan parallel texts, with a focus on how words referring to ethnospecific realities are translated into Tuvan. A comparative analysis of Russian-Tuvan parallel texts allows researchers to assess the adequacy of translation, the difficulty of translating vocabulary belonging to the ethnically and culturally specific group of “reality words”, and the lexical, semantic and stylistic transformations that result from translation.
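As an illustration only, sentence-level alignment and a lookup of a culturally specific word might be sketched as follows. This is a minimal sketch and not the authors' tooling: the names `AlignedPair`, `align_one_to_one` and `find_pairs` are hypothetical, the Russian sentences are invented examples, and the Tuvan side is left as placeholders. It assumes the two texts have already been split into the same number of sentences, which real translations often violate.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AlignedPair:
    """One Russian sentence paired with its Tuvan translation."""
    source_ru: str
    target_tyv: str


def align_one_to_one(ru_sentences, tyv_sentences):
    """Pair sentences by position (assumes a strict 1:1 correspondence)."""
    if len(ru_sentences) != len(tyv_sentences):
        raise ValueError("1:1 alignment needs equal sentence counts")
    return [AlignedPair(ru, tyv) for ru, tyv in zip(ru_sentences, tyv_sentences)]


def find_pairs(pairs, term):
    """Return pairs whose Russian side contains `term` (case-insensitive substring)."""
    needle = term.casefold()
    return [pair for pair in pairs if needle in pair.source_ru.casefold()]


# Invented Russian sentences; the Tuvan side is a placeholder, not real data.
ru = ["Он выпил квасу.", "Она вошла в дом."]
tyv = ["<Tuvan translation of sentence 1>", "<Tuvan translation of sentence 2>"]

pairs = align_one_to_one(ru, tyv)
hits = find_pairs(pairs, "квас")
for pair in hits:
    print(pair.source_ru, "|", pair.target_tyv)
```

A substring match is used so that inflected forms such as "квасу" are still found; a real corpus query would more likely rely on lemmatised or morphologically annotated text.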
How to Cite
Salchak A. Ya. and Ondar V. S. Setting up a parallel Russian-Tuvan subcorpus within the electronic corpus of Tuvan language: preliminary outcomes. The New Research of Tuva. 2020, no. 1 [online] Available at: https://nit.tuva.asia/nit/article/view/912 (access date ...). DOI: 10.25178/nit.2020.1.6
The author(s), as license holder(s), grant rights to their work to the journal (the licensee) under a simple non-exclusive open license in accordance with Art. 1286.1 «Open license for a research work, work of literature or fine arts», Civil Code of the Russian Federation.
New Research of Tuva publishes articles under the Creative Commons Attribution-NonCommercial license (CC BY-NC).
Since it is an open license, author(s) reserve the right to upload the article to their institutional repository, submit it to another journal (if it allows republications), or republish it on their own website (in full, or in part).
However, several conditions apply here:
a) The republished version must always contain the name(s) and affiliation(s) of the author(s), the original title and the hyperlink to the original version on the New Research of Tuva website;
b) It must be in open access, free of charge, and no category of readers may be given any advantage over the general readership;
c) Should the contribution be submitted elsewhere by its author(s) without substantial modification (30% or more of the original text unchanged), the body of the article should contain a disclaimer that the original version was published in New Research of Tuva (with a link to the respective page).
CC BY-NC is an irrevocable license that applies worldwide and lasts for the duration of the work’s copyright.