Development of methods for extracting information from pharmacy line using conditional random fields

Molodchenkov, A.I.; Nikolaev, A.A.; Mitrokhina, E.A.

The paper addresses the problem of extracting information from short pharmacological text lines in Russian. Pharmacy lines serve as the example: from each line, the full drug name, manufacturer, dosage form, dosage, number of units per package, and several other parameters must be extracted. A conditional random field (CRF) model was used for the extraction. A method for preliminary standardization of the lines, which brings string tokens to a single form, was also developed. More than seven thousand pharmacy lines were annotated for the experiments, and two CRF models were trained, one with and one without preliminary standardization. The model with standardization reached an accuracy of 0.95 on the validation set and 0.89 on the test set; the model without standardization reached 0.95 on the validation set and 0.87 on the test set. Copyright © 2021 for this paper by its authors.
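The abstract does not spell out the standardization rules or the CRF features, so the following is only a minimal sketch of what such a pipeline might look like. The unit alias table, the tokenization regex, and the function names `standardize` and `token_features` are all assumptions made for illustration, not the authors' method. The sketch brings tokens to a single form (lowercasing, decimal commas, unit spellings) and builds one feature dictionary per token, the kind of input that CRF toolkits commonly accept.

```python
import re

# Hypothetical alias table: the paper does not list its normalization rules.
UNIT_ALIASES = {"mg": "mg", "мг": "mg", "ml": "ml", "мл": "ml"}

# Numbers (with optional decimal part), runs of letters, or single punctuation marks.
TOKEN_RE = re.compile(r"\d+(?:\.\d+)?|[^\W\d_]+|[^\w\s]")


def standardize(line):
    """Bring a pharmacy line to a single token form (illustrative rules only)."""
    line = line.lower()
    # Treat a comma between digits as a decimal separator: "0,5" -> "0.5".
    line = re.sub(r"(?<=\d),(?=\d)", ".", line)
    # Tokenizing also splits digits glued to letters: "500mg" -> "500", "mg".
    tokens = TOKEN_RE.findall(line)
    # Map different spellings of the same unit to one canonical form.
    return [UNIT_ALIASES.get(tok, tok) for tok in tokens]


def token_features(tokens, i):
    """Build a feature dict for token i, using its neighbours as context."""
    tok = tokens[i]
    return {
        "token": tok,
        "is_number": tok.replace(".", "", 1).isdigit(),
        "is_unit": tok in UNIT_ALIASES.values(),
        "prev": tokens[i - 1] if i > 0 else "<BOS>",
        "next": tokens[i + 1] if i < len(tokens) - 1 else "<EOS>",
    }


def line_to_features(line):
    """Standardize a line and return one feature dict per token."""
    tokens = standardize(line)
    return [token_features(tokens, i) for i in range(len(tokens))]
```

Under these assumptions, a call such as `line_to_features("Paracetamol tab. 500MG N20")` yields a sequence of feature dictionaries that could be paired with per-token labels (drug name, dosage, package size, and so on) to train a CRF. Training a second model on features built without the `standardize` step would mirror the with/without comparison reported in the abstract.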

Authors

Molodchenkov A.I. ¹ ² ³, Nikolaev A.A. ¹, Mitrokhina E.A. ²

Conference proceedings

CEUR Workshop Proceedings

Publisher

CEUR-WS

Language

English

Pages

340-348

State

Published

Volume

3036

Year

2021

Organizations

¹ Federal Research Center “Informatics and Control” of the Russian Academy of Sciences, Moscow, Russian Federation
² Moscow Institute of Physics and Technology, Dolgoprudny, Russian Federation
³ Peoples’ Friendship University of Russia, Moscow, Russian Federation

Keywords

Conditional random fields; Named entity recognition
