Deep code search efficiency based on clustering.
Gespeichert in:
| Titel: | Deep code search efficiency based on clustering. |
|---|---|
| Autoren: | Liu, Kun, Liu, Jianxun, Hu, Haize |
| Quelle: | Concurrency & Computation: Practice & Experience; 6/10/2024, Vol. 36 Issue 13, p1-15, 15p |
| Schlagwörter: | K-means clustering, COMPUTER software development, DEEP learning, SQL |
| Abstract: | The deep‐learning based code search model mainly takes accuracy as the only target for judging the performance of the model, ignoring the efficiency of code search. This article proposes a clustering‐based code search model (C‐DCS). C‐DCS uses the K‐Means to divide the code vector base into K clusters and obtains the center vectors of K clusters. While searching, C‐DCS first matches the query vector with the K center vectors to get the best matching center vector. After matching the center vector, C‐DCS matches the query vector with code vectors in the cluster corresponding to the best matching center vector one by one and then gets the best matching code snippet vector. To verify the efficiency of C‐DCS in the code search task, experimental analysis was built on a large dataset. The experimental results showed that C‐DCS saves 92.2% of the search time compared to the baseline model while remaining the accuracy. In the experimental evaluation section, we optimized the K‐Means algorithm to improve the code search efficiency of C‐DCS further, reducing the search time to 93.8% of the baseline model. Hence, C‐DCS reduces the code search time greatly with not affecting the accuracy, improving the efficiency of software development. [ABSTRACT FROM AUTHOR] |
| Copyright of Concurrency & Computation: Practice & Experience is the property of Wiley-Blackwell and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.) | |
| Datenbank: | Complementary Index |
| FullText | Text: Availability: 0 CustomLinks: – Url: https://resolver.ebscohost.com/openurl?sid=EBSCO:edb&genre=article&issn=15320626&ISBN=&volume=36&issue=13&date=20240610&spage=1&pages=1-15&title=Concurrency & Computation: Practice & Experience&atitle=Deep%20code%20search%20efficiency%20based%20on%20clustering.&aulast=Liu%2C%20Kun&id=DOI:10.1002/cpe.8027 Name: Full Text Finder Category: fullText Text: Full Text Finder Icon: https://imageserver.ebscohost.com/branding/images/FTF.gif MouseOverText: Full Text Finder – Url: https://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=EBSCO&SrcAuth=EBSCO&DestApp=WOS&ServiceName=TransferToWoS&DestLinkType=GeneralSearchSummary&Func=Links&author=Liu%20K Name: ISI Category: fullText Text: Nájsť tento článok vo Web of Science Icon: https://imagesrvr.epnet.com/ls/20docs.gif MouseOverText: Nájsť tento článok vo Web of Science |
|---|---|
| Header | DbId: edb DbLabel: Complementary Index An: 177193045 RelevancyScore: 983 AccessLevel: 6 PubType: Academic Journal PubTypeId: academicJournal PreciseRelevancyScore: 983.424377441406 |
| IllustrationInfo | |
| Items | – Name: Title Label: Title Group: Ti Data: Deep code search efficiency based on clustering. – Name: Author Label: Authors Group: Au Data: <searchLink fieldCode="AR" term="%22Liu%2C+Kun%22">Liu, Kun</searchLink><br /><searchLink fieldCode="AR" term="%22Liu%2C+Jianxun%22">Liu, Jianxun</searchLink><br /><searchLink fieldCode="AR" term="%22Hu%2C+Haize%22">Hu, Haize</searchLink> – Name: TitleSource Label: Source Group: Src Data: Concurrency & Computation: Practice & Experience; 6/10/2024, Vol. 36 Issue 13, p1-15, 15p – Name: Subject Label: Subject Terms Group: Su Data: <searchLink fieldCode="DE" term="%22K-means+clustering%22">K-means clustering</searchLink><br /><searchLink fieldCode="DE" term="%22COMPUTER+software+development%22">COMPUTER software development</searchLink><br /><searchLink fieldCode="DE" term="%22DEEP+learning%22">DEEP learning</searchLink><br /><searchLink fieldCode="DE" term="%22SQL%22">SQL</searchLink> – Name: Abstract Label: Abstract Group: Ab Data: The deep‐learning based code search model mainly takes accuracy as the only target for judging the performance of the model, ignoring the efficiency of code search. This article proposes a clustering‐based code search model (C‐DCS). C‐DCS uses the K‐Means to divide the code vector base into K clusters and obtains the center vectors of K clusters. While searching, C‐DCS first matches the query vector with the K center vectors to get the best matching center vector. After matching the center vector, C‐DCS matches the query vector with code vectors in the cluster corresponding to the best matching center vector one by one and then gets the best matching code snippet vector. To verify the efficiency of C‐DCS in the code search task, experimental analysis was built on a large dataset. The experimental results showed that C‐DCS saves 92.2% of the search time compared to the baseline model while remaining the accuracy. In the experimental evaluation section, we optimized the K‐Means algorithm to improve the code search efficiency of C‐DCS further, reducing the search time to 93.8% of the baseline model. Hence, C‐DCS reduces the code search time greatly with not affecting the accuracy, improving the efficiency of software development. [ABSTRACT FROM AUTHOR] – Name: Abstract Label: Group: Ab Data: <i>Copyright of Concurrency & Computation: Practice & Experience is the property of Wiley-Blackwell and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract.</i> (Copyright applies to all Abstracts.) |
| PLink | https://erproxy.cvtisr.sk/sfx/access?url=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=edb&AN=177193045 |
| RecordInfo | BibRecord: BibEntity: Identifiers: – Type: doi Value: 10.1002/cpe.8027 Languages: – Code: eng Text: English PhysicalDescription: Pagination: PageCount: 15 StartPage: 1 Subjects: – SubjectFull: K-means clustering Type: general – SubjectFull: COMPUTER software development Type: general – SubjectFull: DEEP learning Type: general – SubjectFull: SQL Type: general Titles: – TitleFull: Deep code search efficiency based on clustering. Type: main BibRelationships: HasContributorRelationships: – PersonEntity: Name: NameFull: Liu, Kun – PersonEntity: Name: NameFull: Liu, Jianxun – PersonEntity: Name: NameFull: Hu, Haize IsPartOfRelationships: – BibEntity: Dates: – D: 10 M: 06 Text: 6/10/2024 Type: published Y: 2024 Identifiers: – Type: issn-print Value: 15320626 Numbering: – Type: volume Value: 36 – Type: issue Value: 13 Titles: – TitleFull: Concurrency & Computation: Practice & Experience Type: main |
| ResultId | 1 |
Full Text Finder
Nájsť tento článok vo Web of Science