Exploratory patent search

The paper presents an effective method for retrieving topically similar documents and proposes an exploratory patent search built on it. By providing computer assistance for patent search and analysis, the method reduces the complexity and duration of patent examination. Both phrases extracted by a parser and single lexemes serve as document descriptors. This approach prevents exponential growth of the feature space and supports efficient indexing even for large text collections. Experiments show that the proposed method significantly outperforms the baseline keyword-based approach. The paper also discusses the prospects of applying the method to other problems, such as source retrieval for plagiarism detection and full-text clustering. © 2018 Federal Research Center "Computer Science and Control" of Russian Academy of Sciences. All rights reserved.
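The idea of describing a document by both single lexemes and phrases can be illustrated with a minimal sketch. It is not the authors' implementation: adjacent word pairs stand in for parser-extracted phrases, plain TF-IDF with cosine similarity stands in for the paper's ranking, and all names (descriptors, build_index, most_similar) and the toy documents are hypothetical.

```python
import math
import re
from collections import Counter


def descriptors(text):
    """Return single lexemes plus two-word phrases for a text.

    Assumption: adjacent word pairs approximate the phrases that a
    syntactic parser would extract in the method described above.
    """
    tokens = re.findall(r"[a-z]+", text.lower())
    phrases = [f"{a} {b}" for a, b in zip(tokens, tokens[1:])]
    return tokens + phrases


def build_index(docs):
    """Build TF-IDF vectors (dicts) for a mapping of doc_id -> text."""
    tf = {doc_id: Counter(descriptors(text)) for doc_id, text in docs.items()}
    df = Counter(d for counts in tf.values() for d in counts)
    n = len(docs)
    # Smoothed inverse document frequency, always positive.
    idf = {d: math.log((1 + n) / (1 + c)) + 1.0 for d, c in df.items()}
    vectors = {
        doc_id: {d: c * idf[d] for d, c in counts.items()}
        for doc_id, counts in tf.items()
    }
    return vectors, idf


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(d, 0.0) for d, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def most_similar(query_text, vectors, idf, top_k=3):
    """Rank indexed documents by similarity to a query document."""
    # Descriptors unseen in the collection are ignored.
    counts = Counter(d for d in descriptors(query_text) if d in idf)
    query = {d: c * idf[d] for d, c in counts.items()}
    ranked = sorted(
        ((cosine(query, vec), doc_id) for doc_id, vec in vectors.items()),
        reverse=True,
    )
    return [(doc_id, score) for score, doc_id in ranked[:top_k]]


if __name__ == "__main__":
    toy_docs = {
        "p1": "Rotor blade for a wind turbine",
        "p2": "Wind turbine rotor blade with heating element",
        "p3": "Method for brewing coffee",
    }
    vectors, idf = build_index(toy_docs)
    for doc_id, score in most_similar("wind turbine rotor blade", vectors, idf):
        print(doc_id, round(score, 3))
```

In this sketch the phrase descriptors are limited to pairs that actually occur in the text, so the feature space grows with the number of observed phrases rather than with all possible word combinations.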

Authors
Sochenkov I. 1, 2, Zubarev D. 1, 3, Tikhomirov I. 1
Publisher
Federal Research Center "Computer Science and Control" of the Russian Academy of Sciences
Number of issue
1
Language
Russian
Pages
89-94
Status
Published
Volume
12
Year
2018
Organizations
  • 1 Institute for Systems Analysis, Federal Research Center Computer Science and Control, Russian Academy of Sciences, 44-2 Vavilov Str., Moscow, 119333, Russian Federation
  • 2 Skolkovo Institute of Science and Technology, 3 Nobelya Str., Moscow, 121205, Russian Federation
  • 3 Peoples' Friendship University of Russia, RUDN University, 6 Miklukho-Maklaya Str., Moscow, 117198, Russian Federation
Keywords
Exploratory search; Patent search; Search and analytical engines; Topic modeling; Topically similar document retrieval