Improving Plagiarism Detection Using Genetic Algorithm

Detecting instances of plagiarism in student home-work, with program code in particular, is a subject of active research for over 30 years. One of the early proposed methods was extraction and comparison of source-code metrics. Even though this approach has low algorithmic complexity, it is rarely u...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:2019 42nd International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO) s. 571 - 576
Hlavní autoři: Pajic, Enil, Ljubovic, Vedran
Médium: Konferenční příspěvek
Jazyk:angličtina
Vydáno: Croatian Society MIPRO 01.05.2019
Témata:
ISSN:2623-8764
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:Detecting instances of plagiarism in student home-work, with program code in particular, is a subject of active research for over 30 years. One of the early proposed methods was extraction and comparison of source-code metrics. Even though this approach has low algorithmic complexity, it is rarely used in recent papers with some authors claiming that better results are obtained using other methods. In this paper, plagiarism detection is treated as an information retrieval problem, specifically query-by-example (QbE). A feature vector is constructed from source metrics and compared using common similarity measures. Further, evolutionary computation methods are used to optimize the similarity measure. It is shown that, by several metrics used, detection results are on par with state-of-the-art methods with significantly lower execution time.
ISSN:2623-8764
DOI:10.23919/MIPRO.2019.8756744