Source code similarity detection by using data mining methods
| Published in: | 2013 35th International Conference on Information Technology Interfaces (ITI) pp. 257 - 262 |
|---|---|
| Main Authors: | Emil Stankov, Mile Jovanov, Ana Madevska Bogdanova |
| Format: | Conference Proceeding |
| Language: | English |
| Published: | SRCE University Computing Centre, University of Zagreb, 01.06.2013 |
| ISBN: | 9789537138301, 9537138305 |
| ISSN: | 1334-2762 |
| Abstract | Programming courses at the university and high school level, and competitions in informatics (programming), often require fast assessment of submitted solutions to programming tasks. This problem is usually solved with automated systems that check the output each solution produces on a set of test cases. In our paper we present a novel approach that represents program code as vectors and uses these vectors in data mining analysis, which could produce better assessment of the solutions. We present cluster analysis results that reach up to 88% correctly clustered items on average. |
|---|---|
| Author | Jovanov, Mile; Stankov, Emil; Bogdanova, Ana Madevska |
| Author_xml | 1. Stankov, Emil (emil.stankov@finki.ukim.mk), Fac. of Comput. Sci. & Eng., Univ. Ss. Cyril & Methodius, Skopje, Macedonia; 2. Jovanov, Mile (mile.jovanov@finki.ukim.mk), Fac. of Comput. Sci. & Eng., Univ. Ss. Cyril & Methodius, Skopje, Macedonia; 3. Bogdanova, Ana Madevska (ana.madevska.bogdanova@finki.ukim.mk), Fac. of Comput. Sci. & Eng., Univ. Ss. Cyril & Methodius, Skopje, Macedonia |
| ContentType | Conference Proceeding |
| DOI | 10.2498/iti.2013.0576 |
| Discipline | Medicine |
| EISBN | 9537138313 9789537138318 9537138321 9789537138325 |
| EndPage | 262 |
| ExternalDocumentID | 6649034 |
| Genre | orig-research |
| IsPeerReviewed | false |
| IsScholarly | true |
| Language | English |
| PageCount | 6 |
| PublicationCentury | 2000 |
| PublicationDate | 2013-June |
| PublicationDateYYYYMMDD | 2013-06-01 |
| PublicationDate_xml | – month: 06 year: 2013 text: 2013-June |
| PublicationDecade | 2010 |
| PublicationTitle | 2013 35th International Conference on Information Technology Interfaces (ITI) |
| PublicationTitleAbbrev | ITI |
| PublicationYear | 2013 |
| Publisher | SRCE University Computing Centre, University of Zagreb |
| Publisher_xml | – name: SRCE University Computing Centre, University of Zagreb |
| StartPage | 257 |
| SubjectTerms | Algorithm design and analysis; clustering analysis; code similarity; Data mining; Educational institutions; evaluation of source code; Informatics; Programming code; Programming profession; Vectors |
| Title | Source code similarity detection by using data mining methods |
| URI | https://ieeexplore.ieee.org/document/6649034 |

