Source code similarity detection by using data mining methods

Programming courses at university and high school level, and competitions in informatics (programming), often require fast assessment of received solutions of the programming tasks. This problem is usually solved by use of automated systems that check the produced output for some test cases for ever...

Full description

Saved in:
Bibliographic Details
Published in:2013 35th International Conference on Information Technology Interfaces (ITI) pp. 257 - 262
Main Authors: Stankov, Emil, Jovanov, Mile, Bogdanova, Ana Madevska
Format: Conference Proceeding
Language:English
Published: SRCE University Computing Centre, University of Zagreb 01.06.2013
Subjects:
ISBN:9789537138301, 9537138305
ISSN:1334-2762
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Abstract Programming courses at university and high school level, and competitions in informatics (programming), often require fast assessment of received solutions of the programming tasks. This problem is usually solved by use of automated systems that check the produced output for some test cases for every solution. In our paper we present a novel approach of representation of the programming codes as vectors, and use of these vectors in data mining analysis that could produce better assessment of the solutions. We present the results of cluster analysis that go up to 88% of correctly clustered items on average.
AbstractList Programming courses at university and high school level, and competitions in informatics (programming), often require fast assessment of received solutions of the programming tasks. This problem is usually solved by use of automated systems that check the produced output for some test cases for every solution. In our paper we present a novel approach of representation of the programming codes as vectors, and use of these vectors in data mining analysis that could produce better assessment of the solutions. We present the results of cluster analysis that go up to 88% of correctly clustered items on average.
Author Jovanov, Mile
Stankov, Emil
Bogdanova, Ana Madevska
Author_xml – sequence: 1
  givenname: Emil
  surname: Stankov
  fullname: Stankov, Emil
  email: emil.stankov@finki.ukim.mk
  organization: Fac. of Comput. Sci. & Eng., Univ. Ss. Cyril & Methodius, Skopje, Macedonia
– sequence: 2
  givenname: Mile
  surname: Jovanov
  fullname: Jovanov, Mile
  email: mile.jovanov@finki.ukim.mk
  organization: Fac. of Comput. Sci. & Eng., Univ. Ss. Cyril & Methodius, Skopje, Macedonia
– sequence: 3
  givenname: Ana Madevska
  surname: Bogdanova
  fullname: Bogdanova, Ana Madevska
  email: ana.madevska.bogdanova@finki.ukim.mk
  organization: Fac. of Comput. Sci. & Eng., Univ. Ss. Cyril & Methodius, Skopje, Macedonia
BookMark eNo1jjtPwzAURo0oEm3JyMTiP5Dgx40fAwOqKCBVYqB75ccNGDUJit0h_x4QMH3nLEffiiyGcUBCrjlrBFhzm0pqBOOyYa1WZ2RlW6m5NJLLc1JZbf6d8QVZcimhFlqJS1Ll_MEY41oLZsSS3L2OpykgDWNEmlOfjm5KZaYRC4aSxoH6mZ5yGt5odMXRPg0_3GN5H2O-IhedO2as_nZN9tuH_eap3r08Pm_ud3WyrNRBwPcRGyBo2wkhdDCt8S4ohAAKbUQL0ccIXeu8wQ46b8FYhzqC5xblmtz8ZhMiHj6n1LtpPigFlkmQXzT0TWo
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.2498/iti.2013.0576
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE/IET Electronic Library (IEL) (UW System Shared)
  url: https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Medicine
EISBN 9537138313
9789537138318
9537138321
9789537138325
EndPage 262
ExternalDocumentID 6649034
Genre orig-research
GroupedDBID 6IE
6IF
6IK
6IL
6IN
AAJGR
AAWTH
ADFMO
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
IEGSK
IERZE
OCL
RIE
RIL
ID FETCH-LOGICAL-i90t-c248309c4c79f2227c858bac6e4c46e9de94dbdd4f5ab8ef4fb9489ae7d4b19e3
IEDL.DBID RIE
ISBN 9789537138301
9537138305
ISSN 1334-2762
IngestDate Wed Aug 27 05:02:10 EDT 2025
IsPeerReviewed false
IsScholarly true
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i90t-c248309c4c79f2227c858bac6e4c46e9de94dbdd4f5ab8ef4fb9489ae7d4b19e3
PageCount 6
ParticipantIDs ieee_primary_6649034
PublicationCentury 2000
PublicationDate 2013-June
PublicationDateYYYYMMDD 2013-06-01
PublicationDate_xml – month: 06
  year: 2013
  text: 2013-June
PublicationDecade 2010
PublicationTitle 2013 35th International Conference on Information Technology Interfaces (ITI)
PublicationTitleAbbrev ITI
PublicationYear 2013
Publisher SRCE University Computing Centre, University of Zagreb
Publisher_xml – name: SRCE University Computing Centre, University of Zagreb
SSID ssj0001772082
ssib011891893
ssib024632892
ssib025354916
Score 1.8581161
Snippet Programming courses at university and high school level, and competitions in informatics (programming), often require fast assessment of received solutions of...
SourceID ieee
SourceType Publisher
StartPage 257
SubjectTerms Algorithm design and analysis
clustering analysis
code similarity
Data mining
Educational institutions
evaluation of source code
Informatics
Programming code
Programming profession
Vectors
Title Source code similarity detection by using data mining methods
URI https://ieeexplore.ieee.org/document/6649034
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV07T8MwED61CCEmHi3iLQ-MuE3iixMPTIiKAapKdOhWJbaDMrRFbYrEv-fshAASC1se09mJ7_vu9QHcJLGrLZeKS-LIHE0kOTlhw0WR04mMWgTa7_RTMh6ns5madOC27YWx1vriMztwlz6Xb1Z660JlQylRBQK70E2SpO7V-vp2CCer8McclQilIC7Ruu4oFsSEGijk4y8EKwOvJUUsDXlEhwKRNhULYm0p_QT1cJ72PqzncxJZSYdlVbqaMDEgrCN_qbJ4pzQ6-J85h9D_7u5jk9ZvHUHHLo9h77lJsvfg7sUH9JnrdmebclES-SWszoytfN3WkuUfzNXLvzJXX8oWXmOC1VrUmz5MRw_T-0feqCzwUgUV1xGSeUqjTlThGmN1Gqd5pqVFjdIqYxWa3Bgs4ixPbYFFrjBVmU0M5qGy4gR2lqulPQWGKOh9qDStu0sTq6yIkSCEjlzuVBZn0HOLMH-r52jMG_vP_358AftRLT3Bg_ASdqr11l7Brn6vys362m_-JyREpMk
linkProvider IEEE
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV07T8MwELZKQcDEo0W88cCI2yS-OPHAhKiKaKtKdOhWNbaDMjRFbYrEv-fshAASC1se0_l19_m-u4-Q2yi03HIhmUCMzEAHgqET1oynCZ7IoLin3EwPotEonk7luEHu6loYY4wjn5mOfXS5fL1UG3tV1hUCpMdhi2yHAIFfVmt9rR6MlKX_o5NKAIIjmqiddxByxEJVMORuYDCw9JyaFOI0YAEeCwjbZMgRt8W4Dcr2PPW7X3boRLgSd7Mis6ww3sFoR_zSZXFuqXfwP4MOSfu7vo-Oa891RBomPya7wyrN3iL3L-5Kn9p6d7rOFhnCX4zWqTaFY27lNPmgljH_Si3DlC6cygQt1ajXbTLpPU4e-qzSWWCZ9AqmAkDzpAIVydSWxqo4jJO5EgYUCCO1kaATrSEN50lsUkgTCbGcm0hD4kvDT0gzX-bmlFAAjv99qXDcbaJYztMQJypWgc2eivSMtOwgzN7KThqzyv7zvz_fkL3-ZDiYDZ5GzxdkPyiFKJjnX5JmsdqYK7Kj3otsvbp2C-ET8EmoEA
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2013+35th+International+Conference+on+Information+Technology+Interfaces+%28ITI%29&rft.atitle=Source+code+similarity+detection+by+using+data+mining+methods&rft.au=Stankov%2C+Emil&rft.au=Jovanov%2C+Mile&rft.au=Bogdanova%2C+Ana+Madevska&rft.date=2013-06-01&rft.pub=SRCE+University+Computing+Centre%2C+University+of+Zagreb&rft.isbn=9789537138301&rft.issn=1334-2762&rft.spage=257&rft.epage=262&rft_id=info:doi/10.2498%2Fiti.2013.0576&rft.externalDocID=6649034
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1334-2762&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1334-2762&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1334-2762&client=summon