Using sentence similarity measure for plagiarism source retrieval: Notebook for PAN at CLEF 2014

This paper describes the method implemented in the software submitted to the PAN 2014 competition for the source retrieval task. To generate queries, we use the most important noun phrases and words of sentences selected from a given suspicious document. To download documents that are likely to be sources of plagiarism, we employ a sentence similarity measure.
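
The abstract gives only the outline of the pipeline, so the following is a minimal sketch under stated assumptions, not the authors' implementation. It assumes that term importance can be approximated by term frequency in the suspicious document (a stand-in for the noun phrase extraction the paper mentions), that sentence similarity can be approximated by Jaccard overlap of word sets, and that a document is downloaded when a search snippet sentence exceeds a threshold. The names `build_query`, `sentence_similarity`, `should_download`, the stopword list, and the threshold value are all hypothetical.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stopword list used only for illustration.
STOPWORDS = {"the", "a", "an", "of", "to", "in", "and", "is", "for",
             "that", "we", "on", "are", "by"}


def tokenize(text):
    """Lowercase the text and return its non-stopword alphanumeric tokens."""
    return [w for w in re.findall(r"[a-z0-9]+", text.lower())
            if w not in STOPWORDS]


def build_query(sentence, doc_counts, max_terms=5):
    """Build a query from the sentence's words that are most frequent in the
    whole document. Frequency is an assumed proxy for 'importance'."""
    words = list(dict.fromkeys(tokenize(sentence)))  # unique, order preserved
    ranked = sorted(words, key=lambda w: -doc_counts[w])  # stable sort
    return " ".join(ranked[:max_terms])


def sentence_similarity(a, b):
    """Jaccard overlap of the two sentences' word sets (assumed measure)."""
    sa, sb = set(tokenize(a)), set(tokenize(b))
    if not sa or not sb:
        return 0.0
    return len(sa & sb) / len(sa | sb)


def should_download(suspicious_sentence, snippet_sentences, threshold=0.5):
    """Download a candidate if any snippet sentence is similar enough."""
    return any(sentence_similarity(suspicious_sentence, s) >= threshold
               for s in snippet_sentences)


if __name__ == "__main__":
    doc = "Plagiarism detection. Plagiarism sources are retrieved by queries."
    counts = Counter(tokenize(doc))
    print(build_query("Plagiarism sources are retrieved by queries.", counts, 2))
    print(should_download(
        "Plagiarism detection uses sentence similarity",
        ["Sentence similarity is used in plagiarism detection"],
    ))
```

In this sketch the query builder and the download filter are independent, so either assumed component could be replaced by a real noun phrase extractor or a different similarity measure without changing the other.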

Authors
Zubarev D. 1, Sochenkov I. 2
Conference proceedings
Publisher
CEUR-WS
Language
English
Pages
1027-1034
Status
Published
Volume
1180
Year
2014
Organizations
  • 1 Institute for Systems Analysis, Russian Academy of Sciences, Moscow, Russian Federation
  • 2 Peoples' Friendship University of Russia, Moscow, Russian Federation
Keywords
Noun phrase; Sentence similarity; Intellectual property
Date of creation
19.10.2018
Date of change
19.10.2018
Short link
https://repository.rudn.ru/en/records/article/record/4976/