RUS  ENG
Full version
JOURNALS // Informatika i Ee Primeneniya [Informatics and its Applications] // Archive

Inform. Primen., 2017 Volume 11, Issue 3, Pages 73–79 (Mi ia487)

Methods for intrinsic plagiarism detection

K. F. Safinab, M. P. Kuznetsovc, M. V. Kuznetsovaba

a Antiplagiat JSC, 33 Varshavskoe Shosse, Moscow 117105, Russian Federation
b Moscow Institute of Physics and Technology, 9 Institutskiy Per., Dolgoprudny, Moscow Region 141700, Russian Federation
c “Forecsys” LLC, 42 Vavilov Str., Moscow 119333, Russian Federation

Abstract: There are two ways to find plagiarism in documents: “external” and “intrinsic” plagiarism detection. External plagiarism detection is the task with a known set of possible references. Intrinsic plagiarism detection aims at discovering plagiarism by analyzing only the document by itself. The paper investigates the methods of intrinsic plagiarism detection. The authors developed a plagiarism detection method based on constructing statistics from the features of the document parts and detecting outliers. The proposed algorithm was tested on the PAN-2011 collection for intrinsic plagiarism detection.

Keywords: natural language processing; intrinsic plagiarism detection; outliers detection.

Received: 30.01.2017

DOI: 10.14357/19922264170308



Bibliographic databases:


© Steklov Math. Inst. of RAS, 2026