Predicative potential of lexical parameters: text complexity assessment in Russian language textbooks for 5–7 grades

This study addresses the urgent issue of assessing the influence of lexical parameters on text complexity. The research has been conducted on the material of a specialized linguistic corpus, which includes texts of 15 modern Russian language textbooks for 5–7 grades, with a total size of 811911 words. The study is aimed at identifying the scale and dynamics of changes in vocabulary of Russian textbooks for 5–7 grades. The research algorithm included the following stages: (a) identifying the size and content of vocabulary in modern Russian textbooks for 5–7 grades, (b) assessing the share of linguistic terms in their vocabulary, and (c) identifying complexity predictors, i.e. parameters demonstrating a statisti-cally significant correlation with readability. The analytical part of the study was preceded by a meta-description of the corpus, its tokenization, lemmatization, segmentation into fragments of approximately 1000 words. Text parameters were calculated using the text profiler RuL-ingva, and the correlation strength was assessed with STATISTIKA. To ensure the research results reliability, co-dependencies of lexical parameters and text readability were analyzed at two levels: at the textbook level (with average indicators for 15 textbooks for 5–7 grades) and at the level of 1000-word fragments. We revealed a slightly lower readability index, which was expected to be 1.0–1.5 levels higher. The latter may be a characteristic of Russian language textbook as a genre and indicate eclecticism of academic texts, including fragments of research discourse (rules and theory), fiction (exercises), and instructional discourse (texts of tasks). The research demonstrated that the share of linguistic terms does not exceed 2 % in the textbook vocabulary, but their share in the texts rises to 13 %. The statistical analysis indicates that the indices of ‘lexical density’, cohesion (global and local overlaps of nouns and arguments), ‘descriptiveness’ (ratio between adjectives and nouns), ‘narrativity’ (ratio between verbs and nouns), and the share of nouns in the genitive case are text complexity pre-dictors. The prospects for the research include studying verbs and pronouns as complexity predictors in Russian language textbooks. © 2025, RUDN University. All rights reserved.

Authors
Andreeva M.I. , Zamaletdinov R.R. , Borisova A.S.
Publisher
Федеральное государственное автономное образовательное учреждение высшего образования Российский университет дружбы народов (РУДН)
Number of issue
4
Language
Russian
Pages
518-539
Status
Published
Volume
22
Year
2024
Organizations
  • 1 Department of Foreign Languages, Kazan State Medical University, 49 Butlerov St, Kazan, 420012, Russian Federation
  • 2 Department of General Linguistics and Turkology, Kazan (Volga Region) Federal University, 18 Kremlevskaya St, Kazan, 420008, Russian Federation
  • 3 Department of Foreign Languages, Faculty of Philology, RUDN University, 6 Miklukho-Maklaya St, Moscow, 117198, Russian Federation
Keywords
cohesion; corpus of language; educational text; educational text readability; lexical density; linguistic terminology; word frequency

Other records

Privalova I.V., Petrova A.A., Gishkaeva L.N.
Russian Language Studies. Федеральное государственное автономное образовательное учреждение высшего образования Российский университет дружбы народов (РУДН). Vol. 22. 2024. P. 633-661
Kupriyanov R.V., Shoeva G.N., Aleksandrova O.I.
Russian Language Studies. Федеральное государственное автономное образовательное учреждение высшего образования Российский университет дружбы народов (РУДН). Vol. 22. 2024. P. 615-632