Approaches and tools for Russian text linguistic profiling

Approaches and tools for assessing linguistic and cognitive complexity of educational texts are in demand both in science and teaching. Predicting difficulties of perception and understanding and ranking texts by classes, i.e. the number of years of learning or levels of language proficiency (A1–C2), are of particular importance for education. The study is aimed at demonstrating modern methodologies, algorithms, and tools for analyzing Russian texts in text profiler and automatic analyzer RuLingva and at presenting articles from the thematic issue on comprehensive analysis of Russian language textbooks for Russian and Belarusian schools. The research demonstrates that the modern paradigm of discourse complexology is based on the methods of stylistic statistics, which identifies functional characteristics of language units and verifies them using big data. The services on RuLingva are designed for teachers and researchers; they automatically analyze educational texts and predict their target audience based on readability, lexical diversity, abstractness, frequency, and terminological density. In “Russian as a Foreign Language” mode, RuLingva downloads lists of words from the text according to each level of language proficiency and estimates their pro-portion. This provides material for preand post-text work. RuLingva algorithm is based on the typology of educational texts and is to be supplied with tools for assessing a person’s verbal intelligence and reading literacy. The nearest prospect of RuLingva lies in widening the range of complexity predictors and installing automatic subject area discriminator. Both directions are planned to be implemented using neural networks, classification models, “typological passports” of educational texts with different complexity, and thematic orientation. © 2024, RUDN University. All rights reserved.

Authors
Solnyshkina M.I. , Solovyev V.D. , Ebzeeva Y.N.
Publisher
Федеральное государственное автономное образовательное учреждение высшего образования Российский университет дружбы народов (РУДН)
Number of issue
4
Language
Russian
Pages
501-517
Status
Published
Volume
22
Year
2024
Organizations
  • 1 Kazan (Volga Region) Federal University, Kazan, Russian Federation
  • 2 RUDN University, Moscow, Russian Federation
Keywords
complexity predictors; educational text; text complexity; text profiler RuLingva; typological pass-port of the text
Share

Other records

Leshutina I.A., Davydova M.A., Strelchuk E.N.
Russian Language Studies. Федеральное государственное автономное образовательное учреждение высшего образования Российский университет дружбы народов (РУДН). Vol. 22. 2024. P. 681-697