Exploratory patent search

The paper presents an effective method for retrieving topically similar documents and proposes an exploratory patent search built on it. The method reduces the complexity and time of patent examination by providing computer assistance for patent search and analysis. Phrases extracted by the parser, as well as single lexemes, serve as document descriptors. This approach prevents exponential growth of the feature space and allows effective indexing even for large text collections. Experimental results show that the proposed method significantly outperforms the basic keyword-based approach. The authors also draw conclusions about the prospects of applying the method to other problems, such as source retrieval for plagiarism detection and full-text clustering. © 2018 Federal Research Center "Computer Science and Control" of Russian Academy of Sciences. All rights reserved.

Authors
Sochenkov I. 1, 2, Zubarev D. 1, 3, Tikhomirov I. 1
Publisher
Federal Research Center "Computer Science and Control" of Russian Academy of Sciences
Issue number
1
Language
German
Pages
89-94
Status
Published
Volume
12
Year
2018
Organizations
  • 1 Institute for Systems Analysis, Federal Research Center Computer Science and Control, Russian Academy of Sciences, 44-2 Vavilov Str., Moscow, 119333, Russian Federation
  • 2 Skolkovo Institute of Science and Technology, 3 Nobelya Str., Moscow, 121205, Russian Federation
  • 3 Peoples' Friendship University of Russia, RUDN University, 6 Miklukho-Maklaya Str., Moscow, 117198, Russian Federation
Keywords
Exploratory search; Patent search; Search and analytical engines; Topic modeling; Topically similar document retrieval
Date created
19.10.2018
Date modified
19.10.2018
Permanent link
https://repository.rudn.ru/ru/records/article/record/7184/