DEOBFUSCATING JAVASCRIPT CODE USING CHARACTER-BASED TOKENIZATION.

Gespeichert in:
Bibliographische Detailangaben
Titel: DEOBFUSCATING JAVASCRIPT CODE USING CHARACTER-BASED TOKENIZATION.
Autoren: SÎRBU, ALEXANDRU-GABRIEL
Quelle: Studia Universitatis Babeş-Bolyai, Informatica; Jul-Dec2023, Vol. 68 Issue 2, p5-21, 17p
Schlagwörter: MACHINE learning, JAVASCRIPT programming language, RECURRENT neural networks, DEEP learning, SYNTAX (Grammar)
Abstract: The JavaScript code deployed goes through the process of minification, in which variables are renamed using single character names and spaces are removed in order for the files to have a smaller size, thus loading faster. Because of this, the code becomes unintelligible, making it harder to be analyzed manually. Since JavaScript experts can understand it, machine learning approaches to deobfuscate the minified file are possible. Thus, we propose a technique that finds a fitting name for each obfuscated variable, which is both intuitive and meaningful based on the usage of that variable, based on a Sequence-to-Sequence model, which generates the name character by character to cover all the possible variable names. The proposed approach achieves an average exact name generation accuracy of 70.53%, outperforming the state-of-the-art by 12%. [ABSTRACT FROM AUTHOR]
Copyright of Studia Universitatis Babeş-Bolyai, Informatica is the property of Babes-Bolyai University, Cluj-Napoca, Romania and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Datenbank: Complementary Index
FullText Text:
  Availability: 0
CustomLinks:
  – Url: https://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=EBSCO&SrcAuth=EBSCO&DestApp=WOS&ServiceName=TransferToWoS&DestLinkType=GeneralSearchSummary&Func=Links&author=S%C3%8ERBU%20ALEXANDRU-GABRIEL
    Name: ISI
    Category: fullText
    Text: Nájsť tento článok vo Web of Science
    Icon: https://imagesrvr.epnet.com/ls/20docs.gif
    MouseOverText: Nájsť tento článok vo Web of Science
Header DbId: edb
DbLabel: Complementary Index
An: 175557230
RelevancyScore: 944
AccessLevel: 6
PubType: Academic Journal
PubTypeId: academicJournal
PreciseRelevancyScore: 943.772094726563
IllustrationInfo
Items – Name: Title
  Label: Title
  Group: Ti
  Data: DEOBFUSCATING JAVASCRIPT CODE USING CHARACTER-BASED TOKENIZATION.
– Name: Author
  Label: Authors
  Group: Au
  Data: <searchLink fieldCode="AR" term="%22SÎRBU%2C+ALEXANDRU-GABRIEL%22">SÎRBU, ALEXANDRU-GABRIEL</searchLink>
– Name: TitleSource
  Label: Source
  Group: Src
  Data: Studia Universitatis Babeş-Bolyai, Informatica; Jul-Dec2023, Vol. 68 Issue 2, p5-21, 17p
– Name: Subject
  Label: Subject Terms
  Group: Su
  Data: <searchLink fieldCode="DE" term="%22MACHINE+learning%22">MACHINE learning</searchLink><br /><searchLink fieldCode="DE" term="%22JAVASCRIPT+programming+language%22">JAVASCRIPT programming language</searchLink><br /><searchLink fieldCode="DE" term="%22RECURRENT+neural+networks%22">RECURRENT neural networks</searchLink><br /><searchLink fieldCode="DE" term="%22DEEP+learning%22">DEEP learning</searchLink><br /><searchLink fieldCode="DE" term="%22SYNTAX+%28Grammar%29%22">SYNTAX (Grammar)</searchLink>
– Name: Abstract
  Label: Abstract
  Group: Ab
  Data: The JavaScript code deployed goes through the process of minification, in which variables are renamed using single character names and spaces are removed in order for the files to have a smaller size, thus loading faster. Because of this, the code becomes unintelligible, making it harder to be analyzed manually. Since JavaScript experts can understand it, machine learning approaches to deobfuscate the minified file are possible. Thus, we propose a technique that finds a fitting name for each obfuscated variable, which is both intuitive and meaningful based on the usage of that variable, based on a Sequence-to-Sequence model, which generates the name character by character to cover all the possible variable names. The proposed approach achieves an average exact name generation accuracy of 70.53%, outperforming the state-of-the-art by 12%. [ABSTRACT FROM AUTHOR]
– Name: Abstract
  Label:
  Group: Ab
  Data: <i>Copyright of Studia Universitatis Babeş-Bolyai, Informatica is the property of Babes-Bolyai University, Cluj-Napoca, Romania and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract.</i> (Copyright applies to all Abstracts.)
PLink https://erproxy.cvtisr.sk/sfx/access?url=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=edb&AN=175557230
RecordInfo BibRecord:
  BibEntity:
    Identifiers:
      – Type: doi
        Value: 10.24193/subbi.2023.2.01
    Languages:
      – Code: eng
        Text: English
    PhysicalDescription:
      Pagination:
        PageCount: 17
        StartPage: 5
    Subjects:
      – SubjectFull: MACHINE learning
        Type: general
      – SubjectFull: JAVASCRIPT programming language
        Type: general
      – SubjectFull: RECURRENT neural networks
        Type: general
      – SubjectFull: DEEP learning
        Type: general
      – SubjectFull: SYNTAX (Grammar)
        Type: general
    Titles:
      – TitleFull: DEOBFUSCATING JAVASCRIPT CODE USING CHARACTER-BASED TOKENIZATION.
        Type: main
  BibRelationships:
    HasContributorRelationships:
      – PersonEntity:
          Name:
            NameFull: SÎRBU, ALEXANDRU-GABRIEL
    IsPartOfRelationships:
      – BibEntity:
          Dates:
            – D: 01
              M: 07
              Text: Jul-Dec2023
              Type: published
              Y: 2023
          Identifiers:
            – Type: issn-print
              Value: 1224869X
          Numbering:
            – Type: volume
              Value: 68
            – Type: issue
              Value: 2
          Titles:
            – TitleFull: Studia Universitatis Babeş-Bolyai, Informatica
              Type: main
ResultId 1