Data-driven methods for anaphora resolution of Russian texts

The paper considers two data-driven methods for anaphora resolution in Russian texts. Both methods rely on machine learning over annotated corpora and use no information beyond linguistic features. The first method uses a Support Vector Machine (SVM) as the learning and classification algorithm; the second uses a decision tree inducer. We evaluate the performance of the methods on several feature sets and corpora. The feature sets include morphological, syntactic and semantic features. We also evaluate how semantic features, namely semantic roles, affect the performance of anaphora resolution in Russian. We used our own manually annotated corpus as well as a corpus provided by the organizing committee of the forum for the evaluation of linguistic text analysis systems, held as part of Dialogue 2014. In the experiments, the SVM achieved higher precision than the decision tree in almost all cases. Semantic features improved the performance of the methods for anaphora resolution in Russian texts. We also calculated the optimal distance between the anaphor and the hypothesized antecedent and used it in our methods.
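The abstract does not spell out how anaphor and antecedent pairs are built, so the following is only a minimal sketch of a common mention-pair setup, not the authors' implementation. The `Mention` class, the three features, the exclusion of pronouns as candidates, and the `max_distance` value are all hypothetical; the actual feature sets and the optimal distance are given in the paper itself. Vectors of this kind could then be passed to an SVM or a decision tree classifier.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Mention:
    """Hypothetical mention record with a few morphological features."""
    text: str
    position: int          # token index in the document
    gender: str            # "m", "f" or "n"
    number: str            # "sg" or "pl"
    is_pronoun: bool = False


def pair_features(anaphor: Mention, candidate: Mention) -> List[int]:
    """Illustrative feature vector: distance, gender agreement, number agreement."""
    return [
        anaphor.position - candidate.position,
        int(anaphor.gender == candidate.gender),
        int(anaphor.number == candidate.number),
    ]


def candidate_pairs(
    mentions: List[Mention], max_distance: int
) -> List[Tuple[Mention, Mention, List[int]]]:
    """Pair each pronoun with preceding non-pronoun mentions within max_distance tokens."""
    pairs = []
    for anaphor in mentions:
        if not anaphor.is_pronoun:
            continue
        for cand in mentions:
            dist = anaphor.position - cand.position
            # Simplification: skip pronouns, following mentions, and distant mentions.
            if cand.is_pronoun or dist <= 0 or dist > max_distance:
                continue
            pairs.append((anaphor, cand, pair_features(anaphor, cand)))
    return pairs


# Toy input; the distance limit of 10 tokens is an arbitrary assumption.
mentions = [
    Mention("мальчик", 0, "m", "sg"),
    Mention("книгу", 2, "f", "sg"),
    Mention("он", 5, "m", "sg", is_pronoun=True),
]
pairs = candidate_pairs(mentions, max_distance=10)
for anaphor, cand, feats in pairs:
    print(anaphor.text, cand.text, feats)
```

Limiting candidates by distance keeps the number of negative pairs manageable, which is one reason a tuned distance threshold matters in setups like this.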

Authors

Kamenskaya M.A.¹, Khramoin I.V.², Smirnov I.V.²

Journal

Komp'juternaja Lingvistika i Intellektual'nye Tehnologii

Publisher

Rossiiskii Gosudarstvennyi Gumanitarnyi Universitet

Language

English

Pages

241-250

Status

Published

Year

2014

Organizations

¹ Peoples' Friendship University of Russia, Moscow, Russian Federation
² Institute for Systems Analysis, RAS, Moscow, Russian Federation

Keywords

Anaphora resolution; Decision trees; Machine learning; Semantic roles; Support vector machine

Date of creation

19.10.2018

Date of change

19.10.2018

Short link

https://repository.rudn.ru/en/records/article/record/4949/

PREDICATIVITY: EXPLORING THE MEANING OF THE CONCEPT

Article

Gasparov B., Krylova O.

Russian Linguistics. Kluwer Academic Publishers. Vol. 38. 2014. P. 277-286

SOUTH AFRICAN TICK BITE FEVER IN A GROUP OF RUSSIAN TOURISTS

Article

Kozhevnikova G.M., Tokmalaev A.K., Voznesensky S.L., Karan L.S.

Terapevticheskii Arkhiv. Vol. 86. 2014. P. 82-83
