DEOBFUSCATING JAVASCRIPT CODE USING CHARACTER-BASED TOKENIZATION.
Gespeichert in:
| Titel: | DEOBFUSCATING JAVASCRIPT CODE USING CHARACTER-BASED TOKENIZATION. |
|---|---|
| Autoren: | SÎRBU, ALEXANDRU-GABRIEL |
| Quelle: | Studia Universitatis Babeş-Bolyai, Informatica; Jul-Dec2023, Vol. 68 Issue 2, p5-21, 17p |
| Schlagwörter: | MACHINE learning, JAVASCRIPT programming language, RECURRENT neural networks, DEEP learning, SYNTAX (Grammar) |
| Abstract: | The JavaScript code deployed goes through the process of minification, in which variables are renamed using single character names and spaces are removed in order for the files to have a smaller size, thus loading faster. Because of this, the code becomes unintelligible, making it harder to be analyzed manually. Since JavaScript experts can understand it, machine learning approaches to deobfuscate the minified file are possible. Thus, we propose a technique that finds a fitting name for each obfuscated variable, which is both intuitive and meaningful based on the usage of that variable, based on a Sequence-to-Sequence model, which generates the name character by character to cover all the possible variable names. The proposed approach achieves an average exact name generation accuracy of 70.53%, outperforming the state-of-the-art by 12%. [ABSTRACT FROM AUTHOR] |
| Copyright of Studia Universitatis Babeş-Bolyai, Informatica is the property of Babes-Bolyai University, Cluj-Napoca, Romania and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.) | |
| Datenbank: | Complementary Index |
| FullText | Text: Availability: 0 CustomLinks: – Url: https://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=EBSCO&SrcAuth=EBSCO&DestApp=WOS&ServiceName=TransferToWoS&DestLinkType=GeneralSearchSummary&Func=Links&author=S%C3%8ERBU%20ALEXANDRU-GABRIEL Name: ISI Category: fullText Text: Nájsť tento článok vo Web of Science Icon: https://imagesrvr.epnet.com/ls/20docs.gif MouseOverText: Nájsť tento článok vo Web of Science |
|---|---|
| Header | DbId: edb DbLabel: Complementary Index An: 175557230 RelevancyScore: 944 AccessLevel: 6 PubType: Academic Journal PubTypeId: academicJournal PreciseRelevancyScore: 943.772094726563 |
| IllustrationInfo | |
| Items | – Name: Title Label: Title Group: Ti Data: DEOBFUSCATING JAVASCRIPT CODE USING CHARACTER-BASED TOKENIZATION. – Name: Author Label: Authors Group: Au Data: <searchLink fieldCode="AR" term="%22SÎRBU%2C+ALEXANDRU-GABRIEL%22">SÎRBU, ALEXANDRU-GABRIEL</searchLink> – Name: TitleSource Label: Source Group: Src Data: Studia Universitatis Babeş-Bolyai, Informatica; Jul-Dec2023, Vol. 68 Issue 2, p5-21, 17p – Name: Subject Label: Subject Terms Group: Su Data: <searchLink fieldCode="DE" term="%22MACHINE+learning%22">MACHINE learning</searchLink><br /><searchLink fieldCode="DE" term="%22JAVASCRIPT+programming+language%22">JAVASCRIPT programming language</searchLink><br /><searchLink fieldCode="DE" term="%22RECURRENT+neural+networks%22">RECURRENT neural networks</searchLink><br /><searchLink fieldCode="DE" term="%22DEEP+learning%22">DEEP learning</searchLink><br /><searchLink fieldCode="DE" term="%22SYNTAX+%28Grammar%29%22">SYNTAX (Grammar)</searchLink> – Name: Abstract Label: Abstract Group: Ab Data: The JavaScript code deployed goes through the process of minification, in which variables are renamed using single character names and spaces are removed in order for the files to have a smaller size, thus loading faster. Because of this, the code becomes unintelligible, making it harder to be analyzed manually. Since JavaScript experts can understand it, machine learning approaches to deobfuscate the minified file are possible. Thus, we propose a technique that finds a fitting name for each obfuscated variable, which is both intuitive and meaningful based on the usage of that variable, based on a Sequence-to-Sequence model, which generates the name character by character to cover all the possible variable names. The proposed approach achieves an average exact name generation accuracy of 70.53%, outperforming the state-of-the-art by 12%. [ABSTRACT FROM AUTHOR] – Name: Abstract Label: Group: Ab Data: <i>Copyright of Studia Universitatis Babeş-Bolyai, Informatica is the property of Babes-Bolyai University, Cluj-Napoca, Romania and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract.</i> (Copyright applies to all Abstracts.) |
| PLink | https://erproxy.cvtisr.sk/sfx/access?url=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=edb&AN=175557230 |
| RecordInfo | BibRecord: BibEntity: Identifiers: – Type: doi Value: 10.24193/subbi.2023.2.01 Languages: – Code: eng Text: English PhysicalDescription: Pagination: PageCount: 17 StartPage: 5 Subjects: – SubjectFull: MACHINE learning Type: general – SubjectFull: JAVASCRIPT programming language Type: general – SubjectFull: RECURRENT neural networks Type: general – SubjectFull: DEEP learning Type: general – SubjectFull: SYNTAX (Grammar) Type: general Titles: – TitleFull: DEOBFUSCATING JAVASCRIPT CODE USING CHARACTER-BASED TOKENIZATION. Type: main BibRelationships: HasContributorRelationships: – PersonEntity: Name: NameFull: SÎRBU, ALEXANDRU-GABRIEL IsPartOfRelationships: – BibEntity: Dates: – D: 01 M: 07 Text: Jul-Dec2023 Type: published Y: 2023 Identifiers: – Type: issn-print Value: 1224869X Numbering: – Type: volume Value: 68 – Type: issue Value: 2 Titles: – TitleFull: Studia Universitatis Babeş-Bolyai, Informatica Type: main |
| ResultId | 1 |
Nájsť tento článok vo Web of Science