Source code similarity detection by using data mining methods

Programming courses at university and high school level, and competitions in informatics (programming), often require fast assessment of received solutions of the programming tasks. This problem is usually solved by use of automated systems that check the produced output for some test cases for ever...

Celý popis

Uložené v:
Podrobná bibliografia
Vydané v:2013 35th International Conference on Information Technology Interfaces (ITI) s. 257 - 262
Hlavní autori: Stankov, Emil, Jovanov, Mile, Bogdanova, Ana Madevska
Médium: Konferenčný príspevok..
Jazyk:English
Vydavateľské údaje: SRCE University Computing Centre, University of Zagreb 01.06.2013
Predmet:
ISBN:9789537138301, 9537138305
ISSN:1334-2762
On-line prístup:Získať plný text
Tagy: Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
Abstract Programming courses at university and high school level, and competitions in informatics (programming), often require fast assessment of received solutions of the programming tasks. This problem is usually solved by use of automated systems that check the produced output for some test cases for every solution. In our paper we present a novel approach of representation of the programming codes as vectors, and use of these vectors in data mining analysis that could produce better assessment of the solutions. We present the results of cluster analysis that go up to 88% of correctly clustered items on average.
AbstractList Programming courses at university and high school level, and competitions in informatics (programming), often require fast assessment of received solutions of the programming tasks. This problem is usually solved by use of automated systems that check the produced output for some test cases for every solution. In our paper we present a novel approach of representation of the programming codes as vectors, and use of these vectors in data mining analysis that could produce better assessment of the solutions. We present the results of cluster analysis that go up to 88% of correctly clustered items on average.
Author Jovanov, Mile
Stankov, Emil
Bogdanova, Ana Madevska
Author_xml – sequence: 1
  givenname: Emil
  surname: Stankov
  fullname: Stankov, Emil
  email: emil.stankov@finki.ukim.mk
  organization: Fac. of Comput. Sci. & Eng., Univ. Ss. Cyril & Methodius, Skopje, Macedonia
– sequence: 2
  givenname: Mile
  surname: Jovanov
  fullname: Jovanov, Mile
  email: mile.jovanov@finki.ukim.mk
  organization: Fac. of Comput. Sci. & Eng., Univ. Ss. Cyril & Methodius, Skopje, Macedonia
– sequence: 3
  givenname: Ana Madevska
  surname: Bogdanova
  fullname: Bogdanova, Ana Madevska
  email: ana.madevska.bogdanova@finki.ukim.mk
  organization: Fac. of Comput. Sci. & Eng., Univ. Ss. Cyril & Methodius, Skopje, Macedonia
BookMark eNo1jjtPwzAURo0oEm3JyMTiP5Dgx40fAwOqKCBVYqB75ccNGDUJit0h_x4QMH3nLEffiiyGcUBCrjlrBFhzm0pqBOOyYa1WZ2RlW6m5NJLLc1JZbf6d8QVZcimhFlqJS1Ll_MEY41oLZsSS3L2OpykgDWNEmlOfjm5KZaYRC4aSxoH6mZ5yGt5odMXRPg0_3GN5H2O-IhedO2as_nZN9tuH_eap3r08Pm_ud3WyrNRBwPcRGyBo2wkhdDCt8S4ohAAKbUQL0ccIXeu8wQ46b8FYhzqC5xblmtz8ZhMiHj6n1LtpPigFlkmQXzT0TWo
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.2498/iti.2013.0576
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Medicine
EISBN 9537138313
9789537138318
9537138321
9789537138325
EndPage 262
ExternalDocumentID 6649034
Genre orig-research
GroupedDBID 6IE
6IF
6IK
6IL
6IN
AAJGR
AAWTH
ADFMO
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
IEGSK
IERZE
OCL
RIE
RIL
ID FETCH-LOGICAL-i90t-c248309c4c79f2227c858bac6e4c46e9de94dbdd4f5ab8ef4fb9489ae7d4b19e3
IEDL.DBID RIE
ISBN 9789537138301
9537138305
ISSN 1334-2762
IngestDate Wed Aug 27 05:02:10 EDT 2025
IsPeerReviewed false
IsScholarly true
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i90t-c248309c4c79f2227c858bac6e4c46e9de94dbdd4f5ab8ef4fb9489ae7d4b19e3
PageCount 6
ParticipantIDs ieee_primary_6649034
PublicationCentury 2000
PublicationDate 2013-June
PublicationDateYYYYMMDD 2013-06-01
PublicationDate_xml – month: 06
  year: 2013
  text: 2013-June
PublicationDecade 2010
PublicationTitle 2013 35th International Conference on Information Technology Interfaces (ITI)
PublicationTitleAbbrev ITI
PublicationYear 2013
Publisher SRCE University Computing Centre, University of Zagreb
Publisher_xml – name: SRCE University Computing Centre, University of Zagreb
SSID ssj0001772082
ssib011891893
ssib024632892
ssib025354916
Score 1.8581161
Snippet Programming courses at university and high school level, and competitions in informatics (programming), often require fast assessment of received solutions of...
SourceID ieee
SourceType Publisher
StartPage 257
SubjectTerms Algorithm design and analysis
clustering analysis
code similarity
Data mining
Educational institutions
evaluation of source code
Informatics
Programming code
Programming profession
Vectors
Title Source code similarity detection by using data mining methods
URI https://ieeexplore.ieee.org/document/6649034
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV09T8MwELXaCiEmPlrEtzww4jaJL449MCEqFqpKdOhWxR9BGdqiNkXi33N2QgCJhS3xdnHse8--e4-QW1Ba5dxyZjMhGFhhmUq9FmGcRMalSZE7GcwmsslEzudq2iF3bS-Mcy4Un7mhfwx3-XZtdv6obCQEqIhDl3SzLKt7tb7-HcTJKv6ho5KA4Mgl2tSdpByZUAOFwvkLwsooeEkhSwOW4KaApE2lHFmbxEVQi_O073Gtz4lkRY7KqvQ1YXyIWEf8cmUJSWl8-L9wjsjgu7uPTtu8dUw6bnVC9p-bS_Y-uX8JB_rUd7vTbbkskfwiVqfWVaFua0X1B_X18q_U15fSZfCYoLUX9XZAZuPH2cMTa1wWWKmiipkEMDxlwGSq8I2xRqZS50Y4MCCcsk6B1dZCkeZaugIKrUCq3GUWdKwcPyW91XrlzghVfhKM9Ho2CWitpZZgEB7lGgdxZz0nff8RFm-1jsaiif_i7-FLcpDU1hMsiq9Ir9rs3DXZM-9Vud3chMn_BCccpYc
linkProvider IEEE
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV09T8MwELVKQcDER4v4xgMjaRPn4tgDE6Iqoq0q0aFbFX8EZWiL2hSJf8_ZKQUkFrbE2yW27z377j1CbkEqmcUmDkzKeQCGm0AmToswYqG2CcszK7zZRDoYiPFYDmvkbtMLY631xWe25R79Xb6Z65U7KmtzDjKMYYtsJwAsqrq1vmYPImUZ_VBSYcBjZBOb5M2SGLnQGgz5ExgElqF3k0KeBgHDbQFpm0xi5G0Cl0Elz7N5jyqFTqQrol2UhasKi1uIdvgvXxafljoH_wvokDS_-_vocJO5jkjNzo7Jbn99zd4g9y_-SJ-6fne6LKYF0l9E69TY0lduzaj6oK5i_pW6ClM69S4TtHKjXjbJqPM4eugGa5-FoJBhGWgGGJ7UoFOZu9ZYLRKhMs0taOBWGivBKGMgTzIlbA65kiBkZlMDKpI2PiH12XxmTwmV7ido4RRtGCilhBKgESBlCgdxbz0jDfcRJm-VksZkHf_538M3ZK876vcmvafB8wXZZ5URRRBGl6ReLlb2iuzo97JYLq79RPgE_L-ozg
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2013+35th+International+Conference+on+Information+Technology+Interfaces+%28ITI%29&rft.atitle=Source+code+similarity+detection+by+using+data+mining+methods&rft.au=Stankov%2C+Emil&rft.au=Jovanov%2C+Mile&rft.au=Bogdanova%2C+Ana+Madevska&rft.date=2013-06-01&rft.pub=SRCE+University+Computing+Centre%2C+University+of+Zagreb&rft.isbn=9789537138301&rft.issn=1334-2762&rft.spage=257&rft.epage=262&rft_id=info:doi/10.2498%2Fiti.2013.0576&rft.externalDocID=6649034
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1334-2762&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1334-2762&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1334-2762&client=summon